Suni011164.1
Basic Information
- Insect
- Silvanus unidentatus
- Gene Symbol
- lilli_1
- Assembly
- GCA_963930825.1
- Location
- OZ005760.1:2232407-2249145[+]
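The Location value packs the sequence accession, start, end and strand into one string. The following is a minimal sketch of splitting it into fields. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

# Pattern for strings shaped like "ACCESSION:START-END[STRAND]".
LOCATION_RE = re.compile(r"([^:]+):(\d+)-(\d+)\[([+-])\]")


def parse_location(location):
    """Split a location string into accession, start, end, strand and span.

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. Adjust if the database uses another convention.
    """
    match = LOCATION_RE.fullmatch(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "span": end - start + 1,
    }


loc = parse_location("OZ005760.1:2232407-2249145[+]")
print(loc["seqid"], loc["strand"], loc["span"])
```

Under that assumption, the span is the genomic extent of the gene model, which includes any introns and is therefore longer than the coding sequence.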
Transcription Factor Domain
- TF Family
- AF-4
- Domain
- AF-4 domain
- PFAM
- PF05110
- TF Group
- Unclassified Structure
- Description
- This family consists of the nuclear proteins AF4 (proto-oncogene AF4) and FMR2 (Fragile X E syndrome). These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and intellectual disability [1]. The family also contains Lilliputian, a Drosophila homologue of AF4 that contains an AT-hook domain. Lilliputian is a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
- Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  8   9.8e-09   0.0002    20.9   0.1   1         81      6         90      6         153     0.66
2  8   1         2.1e+04   -13.0  21.5  444       472     188       215     167       219     0.50
3  8   1.5e-11   3.1e-07   30.2   18.0  340       501     240       409     230       413     0.76
4  8   1         2.1e+04   -9.7   22.9  145       259     499       611     415       625     0.57
5  8   0.68      1.4e+04   -4.9   10.5  431       476     632       682     596       699     0.49
6  8   1         2.1e+04   -12.4  36.2  84        251     669       839     657       852     0.50
7  8   0.00094   19        4.5    17.8  74        269     932       1124    893       1130    0.52
8  8   0.11      2.2e+03   -2.3   1.8   177       215     1299      1341    1226      1369    0.48
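Most of the eight domain hits reported by hmmscan have large independent E-values, so only some are likely to be meaningful. The sketch below parses the rows and keeps hits under an i-Evalue cutoff. The cutoff of 0.01 and the names `parse_hits` and `I_EVALUE_CUTOFF` are illustrative assumptions, not values taken from this record or from the database's own pipeline.

```python
# Column order as reported in the hmmscan per-domain output of this record.
FIELDS = [
    "idx", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_FIELDS = {"idx", "of", "hmm_from", "hmm_to",
              "ali_from", "ali_to", "env_from", "env_to"}

# The eight domain rows for PF05110, copied from the record.
ROWS = """
1 8 9.8e-09 0.0002 20.9 0.1 1 81 6 90 6 153 0.66
2 8 1 2.1e+04 -13.0 21.5 444 472 188 215 167 219 0.50
3 8 1.5e-11 3.1e-07 30.2 18.0 340 501 240 409 230 413 0.76
4 8 1 2.1e+04 -9.7 22.9 145 259 499 611 415 625 0.57
5 8 0.68 1.4e+04 -4.9 10.5 431 476 632 682 596 699 0.49
6 8 1 2.1e+04 -12.4 36.2 84 251 669 839 657 852 0.50
7 8 0.00094 19 4.5 17.8 74 269 932 1124 893 1130 0.52
8 8 0.11 2.2e+03 -2.3 1.8 177 215 1299 1341 1226 1369 0.48
"""

# Assumed cutoff for illustration only; choose one suited to the analysis.
I_EVALUE_CUTOFF = 0.01


def parse_hits(text):
    """Turn whitespace-separated domain rows into a list of dicts."""
    hits = []
    for line in text.strip().splitlines():
        values = line.split()
        if len(values) != len(FIELDS):
            raise ValueError(f"expected {len(FIELDS)} columns: {line!r}")
        hit = {}
        for name, value in zip(FIELDS, values):
            hit[name] = int(value) if name in INT_FIELDS else float(value)
        hits.append(hit)
    return hits


hits = parse_hits(ROWS)
significant = [h for h in hits if h["i_evalue"] <= I_EVALUE_CUTOFF]
for h in significant:
    print(h["idx"], h["i_evalue"], h["score"], h["ali_from"], h["ali_to"])
```

With this cutoff, only domains 1 and 3 are retained, covering protein positions 6-90 and 240-409 in alignment coordinates.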
Sequence Information
- Coding Sequence
- ATGTGTGATTTATTTTACAGTGTGGAAAGGGACCGCCTTCGTGAGCGGGAACGGCTGGCGAGGGCCCAAATGTCTTCACAAGCGGTCGAACAAGAAAGTCCAGGTGCCTCGGGTGATTTTCTATTTAGTGGTCCGATAAAAGTGAACCCGTCCTCTGCTGATCCCGTCACACAACAAATACAAAGCAAACTCGGCGACTTTCAACGTGTTCGCCATTATTTGGACCAGAAAGATTCCGCTTTAATTGGAGTGGATGGTGGTGCACCGCCTCCAAGTCCAGGTGTACCATCCTCCAGGCACCATTCGATCTTAAGTCTACCACCCAATTCCGGACCATGCACCTCTGCAAGACTCCAGCCGTCACCTGAATCTCGTACTGAGTTTAAGAAGCCTCACCACCATCTCCATCACACGTCACATCAGCGTAGCGGTTACGTGAAACCGGCGGACGGGAAGCCGCCGTATGGAGGCCGGGGCGGCTATCCGGGCCAGCCAGTCAAGCATGGCAGCATTAATAATCATCGACCAAATGGCATATTGCCGGCAAAGGGTCCACCTTTATCCTCGCCCACATCGTCAATGTCCAGCACCACAACGGCATCTTCCTCTTCTACTAGTTCCTCATCGTCTTCATCAGCATCGTCAAATCGTTTACATGGTGGACAAAGAAATCAACGCATTCCATTTGAGAATCAAGGTCCTACGGCCGGCTCCAGGGAATCCTTGCCTTCCGCAAATCCAGAcgtaaataatatatttaagGAAATGAGAGAGGTTCCATCGCCTTTGACTGCAATGGCTGCGACGCCGAGGATAGAACTGGACAATAAATATACAACTTGTACATTTAATCCAGTTCTTGCCAAGTTGACTGAAGCATCACCAGCACCATCCACACCAAAAAAACGCGATCGTCTGCCAGCCCCACGGCCCTCCACtGATCTAAAAGATGATCTCGATTTGTCCGAGGAGAGTGACGATGAACAAAAGCGAGAGGCCTTACCTTCTTCCAAGTTATCCGTTGAAAAGATGTTGTCGCCATTAGGTGGCCTAACACCAGCTAACCAAAAAATGGAACGTCCTCCTGATCCTGCCCATTCGCCCATTGGTAGTTCATCAAGTGAGTCTGGCTCTGATTCCGGTTCGGAAAGTGATTCAACAAGTGACGATTCCGGTGACGAGAATGTCACATCAATGAGTCGAACAGCAGCACCGCCGCCCGTCGTCTCCACGTCAGTATTACCAGCCGAAGCGCAACCATCCTCCCCGAAAATGGAGGAGGAGAGTAAGCAACTCCGTTGGAATTTGGGCTCGTTTGTGGTGTCACCCAAGCCGCAAACGTCGCCGCTCTTGTCGCCGATTAAATCAGCAACGGGCGCCACCAGTCCGGATAGCAGAAAACGCGACAAAGAAGAGTCGGATACGAGCGATTCAACGCGGGATTTAGGGAGAGTCGTTGCGGAGGCGTTCGCCTCCAATGCCGTGCCACTGTTAAGTGATTTCTCCGATTCCAATTCAGAAAAGGAGTCCCAGAAACGGTCGAAAAGACGCAAAAGGACAAACAATCCACCGCCGCGTAGTGATGACGATGATAGTGAAGACAGTGAGGATGATGATGAACGGACAAAAATCACAAAACCAGTGCCGCGTGTTAGTCCGCGTACGAAATCAGTTGACTCACTTAGTGAATCTGATGATTCGGAATTTAGCGTTGTTAATAATAGTGCCAAAAAAtccaaaacaattaaatcAGCTAATAGTGATAAGCCGAAATCGAATCGTGGCAGACCGCGAAagtacaataataataaaacgtCGAATATAGCGGACAAGAAGCAACGCGGTCGGCCACCAATCAAAAGTCAACAAAGTGATACGGACACTGAAGTGCGGAAGAAGCGTGGAAGGCCGCCGAAAACGGCCCGACCATCTTCACCTCTGAGCAGTTCGGACGAGGATGGACCGCCGCCAGCGAAGAAGTTTGAGAAACCGTTAAGA
CGTCGGACCAAGTCGAAGAGGGATTCGAACAGTAGTTCGGATTCGGATGTGAGTCCGCCCAGACGACATGAGTCCGTTGAGATTGAAACGCCAAAGAAACCGCACAAACACAAATCACCTCGTTCTGATGAGAAAAGTAGAAAGAATAAGAAAGAATCAGACGATGAATGGGGTGAGAAGAATAAGAATAAATTAAGACACCATTTTGAGCCTGAAAAACGAACGAAACCAGAATCGCCTCGTAAGAAGGATCCACAAAGACGGAAAGGACGAATACCACAACCAAAACAACAGAAGAGTATTTCGACATTACCGACAACTACGAGTGATTCGAGTGATAGTGATGTTAAACATAAATCGCACACATCTCCCATACGAACTGTAAATAATCGATCATCTAGTGGATCTGACTCTGATAGACGATTAAGTCGATCATCGGAAAGTGAGTGTAGTGATAATGGCAGAAGGAAAAAGTCACCAACGAAAATTGATTCGTCAATTAAAGTGGAAGAAACGAAACCGGTACAAGATAAAAAGAAGAGTGATACATTGAGGAAATTGTTCACATTGAAAAGGGACTCGGAAGGTGGAAAAGGAGGAGGCAAAGGTGGTGGCAAAGGAGGGAAAGGAGGAAAAGGAAAAGGTGGAGTGAGTGTGATAATGGTAGATGAAAACTATGAACGTAGCAGTTCATCTGTGGAAGATGAGACAATGCCAACAATATCATCAAATCCGGCACTATTATCACCATTGTCCACCGTCGAGAAACGTAGCCCTTTGATACCGGCCCGTGTTAATTCACCCTACAATATGTACAATGAATCAATCAAAACAGAAAAGACTGATTTATCAATTAATACTGATCATAATCATGTAACAAATGAGAAAAGCATAATGGTGAGAATTGATTTGAATCGTTTACAAGACATTCCAGCATTACGAAGATATTTGGATGCGAAACGTACATCATGGCGTAATAATAATGCACACGATAATATGATGAAATTCAATGAGAAGAATCGTTTATTATACGACaatgaaatggagaataataggatGACGATAGAAAAAATAAAGGAAAGTGACAACGAAACGAAAAAGTTGAATAAAATTGACCAACCCAACACCGATCGTAAACATAAGAAACGAAAACGTCGTAATAGTAGCAGTTCCGTATCATCACATTCGACTATAAGCAGCATGTCACATAGTAGTAGTAAAAAGAATAAAGagaattttgtaaataataataaaactaaGGAAAACCGTAAAGCGAAACGACGAAAAGAGGACGGTGCGACCACTTCAAGTGACTTAATCATGGCTAACAATAGATCTCATACTGATAATCTCACATTAACCAATGTACCACCGACCAACCACGAACGGGAAGGACTTGGTGGCGGATGTAGTGGTGTTAGTAATAATAATGGTGGATCTGGTAGCCGTTTATCGCCCGAACGGACATTTCCAATACGGACATCTGGTCATAGGGAATACCATTCGTATTTTGAAGCGCCCGAAGAACCTTCCGAATACGAAGAAAGAGATCAAAATCAGTACCTGAACGATGCGAAACGATTGAAACATTTGGCCGACAAAGAAGTGGACCCAATAAAACAATGCATGCTGTACCTGGAAGCTGTGCTATTCTTTTTGTTGACTGGAAATGCCATGGAACATGAGAGTGTCACCGAGAAGGCCGCTTTCACAATGTACAAAGATACacttaatttaattaaATTTATTTCATCAAAGTTTCGCAATCAACAAACTGCATCTTCAGTGCATAATAAACTAGCTGTTCTAAGTTACCGGTGCCAAGCCCTCTTATactataaattgtttaaaatgaGGAAACAAGAAAGTAGAGAGATACAGAAGACTATTGGCGAGTTTTGCAGCAATAAAaatgcTACGATGCCTCAGGAGCAACAGAATCACCAACAAGGCCAGGGAACGCCTTCGCCTTTGTCACCGACACCTTCGCCTGCGGGCTC
TGTTGGTTCGGTCGGAAGCCAATCGTCCGGCTATCTCAGTGGTGAATTACGAGGCAATAATAGTAACAATAACGCTCCTGTACCATCAACACATGCACAAACGCCCGGTGTTTGGGTACCATTACCAGTTTTCAATGCCATTTCCAAACAGAATCTTCAATTTAGTTACTTGCTGAGTTATCAGGAGCTTTGGGATACGGCAGATTCGCTTGTGATGAAAGGAAAACACACCGACTTCTTTATCGAATTGGACCGACAATGCAGACCATTAACAATGCATAGTTCATTAATAGACTTGGTCAGGCATGTTCGTGAAGGGATCAATCGACTTAAAAGTCAAAGTTGA
- Protein Sequence
- MCDLFYSVERDRLRERERLARAQMSSQAVEQESPGASGDFLFSGPIKVNPSSADPVTQQIQSKLGDFQRVRHYLDQKDSALIGVDGGAPPPSPGVPSSRHHSILSLPPNSGPCTSARLQPSPESRTEFKKPHHHLHHTSHQRSGYVKPADGKPPYGGRGGYPGQPVKHGSINNHRPNGILPAKGPPLSSPTSSMSSTTTASSSSTSSSSSSSASSNRLHGGQRNQRIPFENQGPTAGSRESLPSANPDVNNIFKEMREVPSPLTAMAATPRIELDNKYTTCTFNPVLAKLTEASPAPSTPKKRDRLPAPRPSTDLKDDLDLSEESDDEQKREALPSSKLSVEKMLSPLGGLTPANQKMERPPDPAHSPIGSSSSESGSDSGSESDSTSDDSGDENVTSMSRTAAPPPVVSTSVLPAEAQPSSPKMEEESKQLRWNLGSFVVSPKPQTSPLLSPIKSATGATSPDSRKRDKEESDTSDSTRDLGRVVAEAFASNAVPLLSDFSDSNSEKESQKRSKRRKRTNNPPPRSDDDDSEDSEDDDERTKITKPVPRVSPRTKSVDSLSESDDSEFSVVNNSAKKSKTIKSANSDKPKSNRGRPRKYNNNKTSNIADKKQRGRPPIKSQQSDTDTEVRKKRGRPPKTARPSSPLSSSDEDGPPPAKKFEKPLRRRTKSKRDSNSSSDSDVSPPRRHESVEIETPKKPHKHKSPRSDEKSRKNKKESDDEWGEKNKNKLRHHFEPEKRTKPESPRKKDPQRRKGRIPQPKQQKSISTLPTTTSDSSDSDVKHKSHTSPIRTVNNRSSSGSDSDRRLSRSSESECSDNGRRKKSPTKIDSSIKVEETKPVQDKKKSDTLRKLFTLKRDSEGGKGGGKGGGKGGKGGKGKGGVSVIMVDENYERSSSSVEDETMPTISSNPALLSPLSTVEKRSPLIPARVNSPYNMYNESIKTEKTDLSINTDHNHVTNEKSIMVRIDLNRLQDIPALRRYLDAKRTSWRNNNAHDNMMKFNEKNRLLYDNEMENNRMTIEKIKESDNETKKLNKIDQPNTDRKHKKRKRRNSSSSVSSHSTISSMSHSSSKKNKENFVNNNKTKENRKAKRRKEDGATTSSDLIMANNRSHTDNLTLTNVPPTNHEREGLGGGCSGVSNNNGGSGSRLSPERTFPIRTSGHREYHSYFEAPEEPSEYEERDQNQYLNDAKRLKHLADKEVDPIKQCMLYLEAVLFFLLTGNAMEHESVTEKAAFTMYKDTLNLIKFISSKFRNQQTASSVHNKLAVLSYRCQALLYYKLFKMRKQESREIQKTIGEFCSNKNATMPQEQQNHQQGQGTPSPLSPTPSPAGSVGSVGSQSSGYLSGELRGNNSNNNAPVPSTHAQTPGVWVPLPVFNAISKQNLQFSYLLSYQELWDTADSLVMKGKHTDFFIELDRQCRPLTMHSSLIDLVRHVREGINRLKSQS
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01121607;
- 90% Identity
- -
- 80% Identity
- -