Bdim019028.1
Basic Information
- Insect
- Balanococcus diminutus
- Gene Symbol
- lilli
- Assembly
- GCA_959613365.1
- Location
- OY390718.1:55801543-55810709[+]
Transcription Factor Domain
- TF Family
- AF-4
- Domain
- AF-4 domain
- PFAM
- PF05110
- TF Group
- Unclassified Structure
- Description
- This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 11 9.4e-11 1.9e-06 27.6 6.9 4 138 23 153 22 274 0.62 2 11 1.3e-07 0.0027 17.2 0.1 346 381 295 331 284 353 0.86 3 11 0.093 1.9e+03 -2.1 1.5 410 432 504 526 450 606 0.64 4 11 4e-06 0.082 12.3 14.3 455 513 638 694 631 695 0.78 5 11 0.77 1.6e+04 -5.1 11.3 440 488 702 759 696 772 0.50 6 11 1 2e+04 -11.0 19.8 443 484 823 866 802 880 0.49 7 11 1 2e+04 -15.6 26.0 95 226 973 1095 927 1206 0.41 8 11 0.0061 1.2e+02 1.8 8.8 143 191 1400 1448 1317 1469 0.63 9 11 0.65 1.3e+04 -4.9 14.7 135 230 1586 1687 1531 1733 0.38 10 11 0.31 6.4e+03 -3.8 3.9 444 475 1732 1764 1715 1776 0.63 11 11 0.28 5.7e+03 -3.7 0.6 437 474 2034 2071 2010 2080 0.70
Sequence Information
- Coding Sequence
- ATGAAAGGCGGCGCTTATTCATCGATGAAGACGGTCGCGGATTTAAGCTCCTCCAAACCTTGCGTCGAACGAGAACGTTTGAGGGAACGCGAAAGAGAAGCTCGTGCCCAGATGACCTTCGAAGCGGATCAGCGAGCGTCTCAAGATATGATCAATTCGGCGCCGTTGTTTGGCGAAATAGTTCGAGTAAATCCGAAATCAAACGACAAGGAACGGCAGCAAATCGAACGTAAGCTAGGCCGATTCGAAGACGTCAAACATCTGCTCGCCGACCAAGATGTTACGAATCTTTTCGGCGTAGATGGTCAACCTCCGCCAAGTCCTGCGCCAAGTATTTCTTCCggctcgtcgtcgtcgtccatCGTCGGTCacgaattcaaaaaacctAACGCCTGCTCTCAGTCGCATCAAACGAAATCTTCTcatcaccatcaccaccaccatcaTCACTCCAGCGCTCAAAGGAATACGTTCGCCAAACccagcggcggcggcggcggcggcgacgGAAAGTCGGCATACGCGAATCGCAATAGTTTTTACGCGGCTCCGTCGTCCAAGCACGGCAACGCCGCTTCTATTAACCTACCACCAACGTCGTCGGCCAGGGGAGGCGGCGCCCTTTCAACGGCCATCAATTCGTCGTCCATCGGCAGTGTAAACAacagcggcggcggcggcggcggctcaTTGGCCGCAACCAACCATCATTACGGTTCAACGTCCAACCCGACTTCGTCTTCGATTCACGATTATTCTGTTAGAATACAACAATCGGCGGCCAAAAATTTACCGAAAATAAATTGCGAGGCAAGTAGCGTTATCCTGCGAGATTCTGCGCGCGGTATGGGTTGTGCGAACGACAACGCGTCTTCGGTGTCTCTAACCAACGTGGAAAATATTCTGGACGAAATGACGTCGAGCTTGCGAACACCGTTGACCGCGATCGCCGCTACGCCGCGTAAAGAAGTCGAATCCAAGTTCACGTTCAATCCGATTTCCGGAAAGGTGGAATCTTTGCTGCCTTTGCCGAGAACTGCCGTCGATAAAAGTAAGGCTACGGGCGACGCACACTGGGCAGCGTTGAGCGTCGAAAATCCGACTCTAGTAAACCAATGGAACACGCCATACCGTTCCGGCGTCCGGCGTTACCgtTGGTGGCCATCTCGATCGCTCTACGCTCAAACGCTCTATATTTTTGCGCTCAACGCTGATCAGTATGGCCGTAGTAGTGTAAGCGTCGTCGCTTTCGACTCGTGCAACAAATCGAACGCTTTGAATAATGTATTGGCATCTTCGGGCGGCGGACGTAGCGCCTTTATTGCGTCGGCTTTTCGAAACGACGACTCGACCAACGTACTAACGACGCCATTCTCCTCCAACTCTTTGTCCGCTGCGGGCGTCGGTAATGGCAGTTCCAGAAGTACGGGTAGCGGCGGTTACGCTTACTCGCCGTCCGTCGCCGGTATCGCCGCTTCGTCGCAGGCAAACAAAGAGAAATTAGATTCTACCACGTTGCCGTGTTTGAAATCGGATCTAAGCTTGTCCGAGGATACGGACGACGGCGATGGAGCGGAAAACGGAAACGATTGCGACGatgaagacgacgacgacgacgacgacgacgaggaAGAAGATGGATTGCGAGGACGACGCGCCTCACGTTCCAGAAGCAATACCGCCgatcaaaaagaaaacaacaaGTACAAGATGACGAGCGGTGCGGGCGTAGCTTTTGTTCCCAAAAAAGAACCATCATCGGCGgcagcggcggcggcggcgacgacgacgacgacgacgacagcAGCCACTAAAGCGGACGAGCTAATTTACGGTGTAATCGCGACGATTCGATCGTCTAATGGAGCGTTCTCGTGTATTTTCTGTTTCGCTAGTTTACGAAAGTGCGATGCGGCCGCCAGCGAAACTAGAAATACGTCGTCCAGCGACGACAGCGACTCGGATTCCGGTTCGGAGAGCGAAGATTCGAGCAGCGATTCGCCTTCGCCGCCTTCATTGCGTCTGCCGGTGGTCACCGAGCAGGAAGAAAGTAAACATcgttggaatttgaaaaacttcttgCCACCGTCGGCGCACGGAAATTCTACGCCGACGTCCGCTCACAATTCGCAgAACCGTCGTCGTCAAAATTGCGAAAAGTCGAAAATAGGAGACTCGGATAGCTCGGCGGGTAGCGCGGCAAGAAGAAAATCCAGGCTAACTTCCTGCAGTACTTCGGATTCGGATAACGATAGCGCCAGCGCCAAGGTCAACGCGAGATCGCGTAAGCGAATCGTCGACAACGGTACCAAGGCTGGCTCAAACGTCGGAATCGCAGGCGCAACGGGCAGCATGTGCGGCGGTAACGCGCCTGTATCCGGGAATACcggcgccgccgccgccgccgtaTCCGCAAGCGTCGATAAGCGACGAAAGTACAGACTTCGATCGTCGTCTTATTCGCAGTCTTCGGACGAAGATTCGGATCGAGCTTCTTTGCCTTACGTTACCGCAAATTCCAGCCGAATCAACGGCAGCGGAAGCAGTAGCAGCGGCAGCAGTAGTAGTAGCAGCAGCGGCGGCGGCAGTTGTAGCGGTAAAGGATCAGTCGGCAATGCGAATAATAGTTGTTCGACGACCGTAGGTGGCGCTGGGTTGGTGGCCGCCAATTCGTTGAGTTGCGCGAATAATTTTGCAagtggcggcggcggcggcggcggcggcgctgTCAATAGTAGTGGTTGCGCGACTGCCGGCAGTGGAAATGTTACGGTAGTGGGCGTTGTAAGGCCGCACCAAAGGAAACCGCCATCGCCGCAGATTTTCCATCACAATGCCCGAAGTCGTAACTCTGCCGGAATGCGTGACTCGTCTCCTTCGGCCACCAACGACGAACGCTATTCCACGTTCGtgcagcagcagcaacaacaacaacaacagcgtACGCTGCATATGACCGGTGCTGGTAATAACGCGGCGCCGGAACCGCAGTGCGACTTATCTGCCAGCCGTGCGGATAACACcatggttttgaaaaacatgaacAGAGCTGCGATCAACGACCATTACCAACAGACTCGTAATAGTAGCGGAGGCGGTAAAAGTAGCAACAATAACGTTTCTAGCAGTTCCGTAGCGAATACCGGCCgaatcggcggcggcggcgataATATTAAAGTTTCTACGCTGCAGTCTGGTGAGAAATCAGTTTCGCAGACGAACAAAACGAACGCTTCTTCTAAAGTGCGCCGCAAATCGAAATCACCCGCTCTGGCTTCTTCCAACGAATCTCCAGCCAAGAAAAAAAGAGGACGAAAACGAACCATCAAAGTACCAGAGTTGACCGACAGCGACGAAGAAACGCCCAAGGCGAAAGCGGCACCCGAAAAGAAGAGACCAGGTCGACCGCCTCTCAAACGCAACGACACCGACGACTTGGATTGGAATTGCAGCTCTAGGCGTAAAACGACAGATAAATTCCCGAGCTCGACGAACGTGTGGTGCGACCGCGAGAGACGTAGAAGCAGTTTAAGAATGAGCAGTTTTACTACCGTCGATTCGGATAGCGAGACGGAAATTCAAACGGCTACGGTGAAACCGACGGCTCGCGTCATCCCCGCTGTAAAGAGAAAAGATGGCGGCGGCGGAAGCGTACGCATCAATCGTGATTCTAGTTGCGAATCGCTAAGAAACGACGGCGATTTCAACGCCATATCGGACGGTATCAAGAGCAGCGGTGGTGGGGGTGGCGGCGGTGGCGGGagtggcggcggcggcgttGTTAAAGTGCTGGAAAGTCCGCCCAAATTGGACGTCGAATGTATAGCCGTTCAAGATAAAAAGAAAAGCGACACTTTGCGAAAGTTGTTTTCGCGTCGTGAAGAGGGCGGCGGTAAAACAGGCGGTAAGGGCAAAGGCGGTAAAGGAAAGTGCGGAGTTATCGTTATGGAATCCGAGGTCGAACGTAAACTATTGCAACGTTCGTCCGTCGTCAGTCCGTCGTTGGCGTTAGCGTCGCCGcctacgacgacgacgacgtcgtcgtcgtcgtcttcgaATTTATCAAAGATACAAGATCGCGTTCCGAACTTGACGCACGTGGCTATTTCTTCTCCCAATTTGCAATCCAACGACGGCGCGGACGGTAGATTTTCCGAAATTGGTATCGCGAACAGCGGCGGCGGTGGCGGCGGCGGCAACGATAACGCTATCAGCGGCAGCGGCAACAATCATCATCACCAAtaccatcatcatcatcatcatcaccaccaccatAATCATcaccagcagcagcagcagcagcagcagcagcagcacaACAATCACAACCTTAACCTCCACCTTCAGCACTCGAAcaccaacaacaacaacaacggcAGCGCTGTCGGTGTGGGACAGGCCGATGCTGTCGTCGATGCGATTCCTAAACTGACTTACAACGAAGCCGGTAAACCTTCGTTGATGTGTAAAATCGATCTTAGTAAAATTCCGTATATATTGGCGAAAAAACGCTCCGAAGAAGTTCGAATCAAGACCGAATTACCAGACACTAGGCAAACGACCAACATCAACGCGTCGGACGGCGTCGTTGTGAATACAAGTAGCACGAATACGACCGCTATAAATGTGTCTTTGATCGAACCTGTGTGGGACCGTCGTCGCAACGAAAACACAACCGCGTCGTGTCCTGCGGCGGCGTCAGCGACAACGGACACCGCGCCACCTGTTGCCGTCGTCGAAACTAGCGATGTTCGTCGCGGCGACAGTAGCAAACAGCATCATCAAAGATCGCAGTCGTTGCCGCATACGCGTAAAAAGCATTCGTCCAAAAAGACGAAGAATAGCAAGCGCAAGCATTACGCGGACGAATCGCGCGACATAACGGCCGCGTCACCGGATGCGCTGGCCGCGGCGGCGGTGGCAGACGATAATAACGCGTCGATATTCGCTACGGCGTCGACGGTTCATACGCCTTCCGTAGACACGACGGACATGGATTCGTCGTTATCGAGTGATTCGGAAGCGAAAAAGGTGTCCGGCGGCGTTGCAGCCAACTTTGCACGTTACGGCGgcaataaaaatacgaaaaggtcgacggcggcggcggcggcgccgCCGGAACGCGCGTCCCTCACACCCGGCGACTCGGTCGTTCGCAACAATTTGATCTCGCCGTGTTATCCTATGAATTTGAACGACGTTCAACATCAAAACGTAAACGTAAACGCGGTCGTCGATCAAAGTTCCGGAGACGACGAATCGTGGGACAGTTCGGACAGCAGTTGCAGTAATTGTTCGGCGTCGTCTCTACCATGTCACGACGCTGGCGGCGGCGGAGGCGGCGACAAATACTCTAAAATTGGCAAACGTTCTTCGCAGTATCGCAGTTGTagtaaaaagaagaagaaaaaattgcgcaACGAACATCATTCGCAACGATGTTCCACGTCATTGGTAAGCGTTATACAACGTAGCAAAGGTGACTGCGAGAGCGTCGGGAGCGTCGGTGACATCGTAGTCGGCGGCGGCGACGGTACGACGAATTTGATTCATTTGCCGAATACCAGCCACGAAAGGAGTAGCATAGGGGCGGAGGCGCCGACTGCAACGTTCAACGGTTACCATCAAGTGGCCATCAGCGATCTGGTTCGACCGACGGTGCCCACCCATCACGGCATTTATTATTCGTATTTGGAACATAAAGCTTCGGAAGAAACCAATTCCGAATCGGACAACGTTAGTCCGAATTTATATTTGATGGAAGCGAAACGTCTGAAACACGCCGCCGATACGGAACGCGATTCAGTCGCTCAAGGAATGCAATACTTGGAAGCTATTTTATCGTTCGTATTGACCGGCCATGTTTTGGAACGTAAAAATCAAATGGATACCGCCTTCAATATGTACAGCGAAACGTTGAAGCTTATCGTGtatatttcgaagaaatttcgCACTCTATGCACCAATGCGACGCAATCCAGTATACCGAATAAAATAGCGATATTGAGTTTACGATGCGAATccattttgaacttgaaactGTACACGATGCGTGAACACGAAGTGAAAGACGTACACAGCACGGTCGCCGAATATTTCAGCAAGCCTGTGGCGCCGGAAGCGTGCTTGATGTCGAACGCGGCTGGAATTCCGAGTCCGCATTCGCCGACGCCTTCGCCGGCCAGTTCGGTCGGAAGTAATTCTTCTTCGGGTTACAGTACTGGCGATCGCAGATCGCTGGCCGCAATGTCCCAGTATGTCGCCGTTCCCTGTACCGTGCACGTCGCCTTGCAGAAGCAGCATAATTTCTATTCTAATCTAATCGCGTCTTTCGGCTATTGGGAAGAAGCGGATAAACTCGTATTTTCGGGCGGAAATAAAGatTTCTTCATCGAATTGGACAGGTTTTGCGGACCGCTTACGTTGCATAGCTCGTTGAAAGATTTGGTACATTACGTGCGCGTTGGCATTCAGAGGTTAAAAGCCATCATCAGTCACAACCACGGCGTATGA
- Protein Sequence
- MKGGAYSSMKTVADLSSSKPCVERERLREREREARAQMTFEADQRASQDMINSAPLFGEIVRVNPKSNDKERQQIERKLGRFEDVKHLLADQDVTNLFGVDGQPPPSPAPSISSGSSSSSIVGHEFKKPNACSQSHQTKSSHHHHHHHHHSSAQRNTFAKPSGGGGGGDGKSAYANRNSFYAAPSSKHGNAASINLPPTSSARGGGALSTAINSSSIGSVNNSGGGGGGSLAATNHHYGSTSNPTSSSIHDYSVRIQQSAAKNLPKINCEASSVILRDSARGMGCANDNASSVSLTNVENILDEMTSSLRTPLTAIAATPRKEVESKFTFNPISGKVESLLPLPRTAVDKSKATGDAHWAALSVENPTLVNQWNTPYRSGVRRYRWWPSRSLYAQTLYIFALNADQYGRSSVSVVAFDSCNKSNALNNVLASSGGGRSAFIASAFRNDDSTNVLTTPFSSNSLSAAGVGNGSSRSTGSGGYAYSPSVAGIAASSQANKEKLDSTTLPCLKSDLSLSEDTDDGDGAENGNDCDDEDDDDDDDDEEEDGLRGRRASRSRSNTADQKENNKYKMTSGAGVAFVPKKEPSSAAAAAAATTTTTTTAATKADELIYGVIATIRSSNGAFSCIFCFASLRKCDAAASETRNTSSSDDSDSDSGSESEDSSSDSPSPPSLRLPVVTEQEESKHRWNLKNFLPPSAHGNSTPTSAHNSQNRRRQNCEKSKIGDSDSSAGSAARRKSRLTSCSTSDSDNDSASAKVNARSRKRIVDNGTKAGSNVGIAGATGSMCGGNAPVSGNTGAAAAAVSASVDKRRKYRLRSSSYSQSSDEDSDRASLPYVTANSSRINGSGSSSSGSSSSSSSGGGSCSGKGSVGNANNSCSTTVGGAGLVAANSLSCANNFASGGGGGGGGAVNSSGCATAGSGNVTVVGVVRPHQRKPPSPQIFHHNARSRNSAGMRDSSPSATNDERYSTFVQQQQQQQQQRTLHMTGAGNNAAPEPQCDLSASRADNTMVLKNMNRAAINDHYQQTRNSSGGGKSSNNNVSSSSVANTGRIGGGGDNIKVSTLQSGEKSVSQTNKTNASSKVRRKSKSPALASSNESPAKKKRGRKRTIKVPELTDSDEETPKAKAAPEKKRPGRPPLKRNDTDDLDWNCSSRRKTTDKFPSSTNVWCDRERRRSSLRMSSFTTVDSDSETEIQTATVKPTARVIPAVKRKDGGGGSVRINRDSSCESLRNDGDFNAISDGIKSSGGGGGGGGGSGGGGVVKVLESPPKLDVECIAVQDKKKSDTLRKLFSRREEGGGKTGGKGKGGKGKCGVIVMESEVERKLLQRSSVVSPSLALASPPTTTTTSSSSSSNLSKIQDRVPNLTHVAISSPNLQSNDGADGRFSEIGIANSGGGGGGGNDNAISGSGNNHHHQYHHHHHHHHHHNHHQQQQQQQQQQHNNHNLNLHLQHSNTNNNNNGSAVGVGQADAVVDAIPKLTYNEAGKPSLMCKIDLSKIPYILAKKRSEEVRIKTELPDTRQTTNINASDGVVVNTSSTNTTAINVSLIEPVWDRRRNENTTASCPAAASATTDTAPPVAVVETSDVRRGDSSKQHHQRSQSLPHTRKKHSSKKTKNSKRKHYADESRDITAASPDALAAAAVADDNNASIFATASTVHTPSVDTTDMDSSLSSDSEAKKVSGGVAANFARYGGNKNTKRSTAAAAAPPERASLTPGDSVVRNNLISPCYPMNLNDVQHQNVNVNAVVDQSSGDDESWDSSDSSCSNCSASSLPCHDAGGGGGGDKYSKIGKRSSQYRSCSKKKKKKLRNEHHSQRCSTSLVSVIQRSKGDCESVGSVGDIVVGGGDGTTNLIHLPNTSHERSSIGAEAPTATFNGYHQVAISDLVRPTVPTHHGIYYSYLEHKASEETNSESDNVSPNLYLMEAKRLKHAADTERDSVAQGMQYLEAILSFVLTGHVLERKNQMDTAFNMYSETLKLIVYISKKFRTLCTNATQSSIPNKIAILSLRCESILNLKLYTMREHEVKDVHSTVAEYFSKPVAPEACLMSNAAGIPSPHSPTPSPASSVGSNSSSGYSTGDRRSLAAMSQYVAVPCTVHVALQKQHNFYSNLIASFGYWEEADKLVFSGGNKDFFIELDRFCGPLTLHSSLKDLVHYVRVGIQRLKAIISHNHGV
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -