Basic Information

Gene Symbol
lilli
Assembly
GCA_959613365.1
Location
OY390718.1:55801543-55810709[+]
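The location string can be split into its parts programmatically. The following is a minimal sketch, assuming the layout is seqid:start-end[strand] and that coordinates are 1-based and inclusive (this record does not state the coordinate convention). The function name parse_location is hypothetical.

```python
import re

# Assumed layout: <seqid>:<start>-<end>[<strand>], e.g. OY390718.1:55801543-55810709[+]
LOCATION_RE = re.compile(r"([^:\s]+):(\d+)-(\d+)\[([+-])\]")


def parse_location(location):
    """Split a location string into seqid, start, end, strand and span."""
    match = LOCATION_RE.fullmatch(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    if start > end:
        raise ValueError("start must not exceed end")
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Span in bases, ASSUMING 1-based inclusive coordinates.
        "span": end - start + 1,
    }


if __name__ == "__main__":
    print(parse_location("OY390718.1:55801543-55810709[+]"))
```

The span covers the whole genomic interval, including any introns, so it is not expected to match the length of the coding sequence.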

Transcription Factor Domain

TF Family
AF-4
Domain
AF-4 domain
PFAM
PF05110
TF Group
Unclassified Structure
Description
This family consists of the nuclear proteins AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome). These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and intellectual disability [1]. The family also contains Lilliputian, a Drosophila homologue of AF4 that contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
Hmmscan Out
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  11   9.4e-11   1.9e-06   27.6   6.9         4     138        23     153        22     274  0.62
 2  11   1.3e-07    0.0027   17.2   0.1       346     381       295     331       284     353  0.86
 3  11     0.093   1.9e+03   -2.1   1.5       410     432       504     526       450     606  0.64
 4  11     4e-06     0.082   12.3  14.3       455     513       638     694       631     695  0.78
 5  11      0.77   1.6e+04   -5.1  11.3       440     488       702     759       696     772  0.50
 6  11         1     2e+04  -11.0  19.8       443     484       823     866       802     880  0.49
 7  11         1     2e+04  -15.6  26.0        95     226       973    1095       927    1206  0.41
 8  11    0.0061   1.2e+02    1.8   8.8       143     191      1400    1448      1317    1469  0.63
 9  11      0.65   1.3e+04   -4.9  14.7       135     230      1586    1687      1531    1733  0.38
10  11      0.31   6.4e+03   -3.8   3.9       444     475      1732    1764      1715    1776  0.63
11  11      0.28   5.7e+03   -3.7   0.6       437     474      2034    2071      2010    2080  0.70
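The eleven domain rows above differ widely in significance. The following is a minimal sketch of how they could be parsed and filtered by independent E-value. The field names, the function parse_domain_rows and the 0.01 cutoff are assumptions chosen for illustration; they are not the criteria used to build this record.

```python
# Domain rows copied from the table above (13 whitespace-separated columns each).
ROWS = """\
1 11 9.4e-11 1.9e-06 27.6 6.9 4 138 23 153 22 274 0.62
2 11 1.3e-07 0.0027 17.2 0.1 346 381 295 331 284 353 0.86
3 11 0.093 1.9e+03 -2.1 1.5 410 432 504 526 450 606 0.64
4 11 4e-06 0.082 12.3 14.3 455 513 638 694 631 695 0.78
5 11 0.77 1.6e+04 -5.1 11.3 440 488 702 759 696 772 0.50
6 11 1 2e+04 -11.0 19.8 443 484 823 866 802 880 0.49
7 11 1 2e+04 -15.6 26.0 95 226 973 1095 927 1206 0.41
8 11 0.0061 1.2e+02 1.8 8.8 143 191 1400 1448 1317 1469 0.63
9 11 0.65 1.3e+04 -4.9 14.7 135 230 1586 1687 1531 1733 0.38
10 11 0.31 6.4e+03 -3.8 3.9 444 475 1732 1764 1715 1776 0.63
11 11 0.28 5.7e+03 -3.7 0.6 437 474 2034 2071 2010 2080 0.70
"""

# Hypothetical field names, in the column order of the table header.
FIELDS = [
    "idx", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_FIELDS = {"idx", "of", "hmm_from", "hmm_to",
              "ali_from", "ali_to", "env_from", "env_to"}


def parse_domain_rows(text):
    """Turn whitespace-separated domain rows into a list of dicts."""
    hits = []
    for line in text.strip().splitlines():
        parts = line.split()
        if len(parts) != len(FIELDS):
            raise ValueError(f"expected {len(FIELDS)} columns: {line!r}")
        hits.append({
            name: int(value) if name in INT_FIELDS else float(value)
            for name, value in zip(FIELDS, parts)
        })
    return hits


if __name__ == "__main__":
    cutoff = 0.01  # illustrative threshold only, not taken from the record
    for hit in parse_domain_rows(ROWS):
        if hit["i_evalue"] < cutoff:
            print(hit["idx"], hit["ali_from"], hit["ali_to"], hit["i_evalue"])
```

With this illustrative cutoff only domains 1 and 2 are retained; domain 4 has a low conditional E-value but an independent E-value of 0.082, so it falls just outside.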

Sequence Information

Coding Sequence
ATGAAAGGCGGCGCTTATTCATCGATGAAGACGGTCGCGGATTTAAGCTCCTCCAAACCTTGCGTCGAACGAGAACGTTTGAGGGAACGCGAAAGAGAAGCTCGTGCCCAGATGACCTTCGAAGCGGATCAGCGAGCGTCTCAAGATATGATCAATTCGGCGCCGTTGTTTGGCGAAATAGTTCGAGTAAATCCGAAATCAAACGACAAGGAACGGCAGCAAATCGAACGTAAGCTAGGCCGATTCGAAGACGTCAAACATCTGCTCGCCGACCAAGATGTTACGAATCTTTTCGGCGTAGATGGTCAACCTCCGCCAAGTCCTGCGCCAAGTATTTCTTCCggctcgtcgtcgtcgtccatCGTCGGTCacgaattcaaaaaacctAACGCCTGCTCTCAGTCGCATCAAACGAAATCTTCTcatcaccatcaccaccaccatcaTCACTCCAGCGCTCAAAGGAATACGTTCGCCAAACccagcggcggcggcggcggcggcgacgGAAAGTCGGCATACGCGAATCGCAATAGTTTTTACGCGGCTCCGTCGTCCAAGCACGGCAACGCCGCTTCTATTAACCTACCACCAACGTCGTCGGCCAGGGGAGGCGGCGCCCTTTCAACGGCCATCAATTCGTCGTCCATCGGCAGTGTAAACAacagcggcggcggcggcggcggctcaTTGGCCGCAACCAACCATCATTACGGTTCAACGTCCAACCCGACTTCGTCTTCGATTCACGATTATTCTGTTAGAATACAACAATCGGCGGCCAAAAATTTACCGAAAATAAATTGCGAGGCAAGTAGCGTTATCCTGCGAGATTCTGCGCGCGGTATGGGTTGTGCGAACGACAACGCGTCTTCGGTGTCTCTAACCAACGTGGAAAATATTCTGGACGAAATGACGTCGAGCTTGCGAACACCGTTGACCGCGATCGCCGCTACGCCGCGTAAAGAAGTCGAATCCAAGTTCACGTTCAATCCGATTTCCGGAAAGGTGGAATCTTTGCTGCCTTTGCCGAGAACTGCCGTCGATAAAAGTAAGGCTACGGGCGACGCACACTGGGCAGCGTTGAGCGTCGAAAATCCGACTCTAGTAAACCAATGGAACACGCCATACCGTTCCGGCGTCCGGCGTTACCgtTGGTGGCCATCTCGATCGCTCTACGCTCAAACGCTCTATATTTTTGCGCTCAACGCTGATCAGTATGGCCGTAGTAGTGTAAGCGTCGTCGCTTTCGACTCGTGCAACAAATCGAACGCTTTGAATAATGTATTGGCATCTTCGGGCGGCGGACGTAGCGCCTTTATTGCGTCGGCTTTTCGAAACGACGACTCGACCAACGTACTAACGACGCCATTCTCCTCCAACTCTTTGTCCGCTGCGGGCGTCGGTAATGGCAGTTCCAGAAGTACGGGTAGCGGCGGTTACGCTTACTCGCCGTCCGTCGCCGGTATCGCCGCTTCGTCGCAGGCAAACAAAGAGAAATTAGATTCTACCACGTTGCCGTGTTTGAAATCGGATCTAAGCTTGTCCGAGGATACGGACGACGGCGATGGAGCGGAAAACGGAAACGATTGCGACGatgaagacgacgacgacgacgacgacgacgaggaAGAAGATGGATTGCGAGGACGACGCGCCTCACGTTCCAGAAGCAATACCGCCgatcaaaaagaaaacaacaaGTACAAGATGACGAGCGGTGCGGGCGTAGCTTTTGTTCCCAAAAAAGAACCATCATCGGCGgcagcggcggcggcggcgacgacgacgacgacgacgacagcAGCCACTAAAGCGGACGAGCTAATTTACGGTGTAATCGCGACGATTCGATCGTCTAATGGAGCGTTCTCGTGTATTTTCTGTTTCGCTAGTTTACGAAAGTGCGATGCGGCCGCCAGCGAAACTAGAAATACGTCGTCCAGCGACGACAGCGACTCGGATTCCGGTTCGGAGAGCGAAGATTCGAGCAGCGATTC
GCCTTCGCCGCCTTCATTGCGTCTGCCGGTGGTCACCGAGCAGGAAGAAAGTAAACATcgttggaatttgaaaaacttcttgCCACCGTCGGCGCACGGAAATTCTACGCCGACGTCCGCTCACAATTCGCAgAACCGTCGTCGTCAAAATTGCGAAAAGTCGAAAATAGGAGACTCGGATAGCTCGGCGGGTAGCGCGGCAAGAAGAAAATCCAGGCTAACTTCCTGCAGTACTTCGGATTCGGATAACGATAGCGCCAGCGCCAAGGTCAACGCGAGATCGCGTAAGCGAATCGTCGACAACGGTACCAAGGCTGGCTCAAACGTCGGAATCGCAGGCGCAACGGGCAGCATGTGCGGCGGTAACGCGCCTGTATCCGGGAATACcggcgccgccgccgccgccgtaTCCGCAAGCGTCGATAAGCGACGAAAGTACAGACTTCGATCGTCGTCTTATTCGCAGTCTTCGGACGAAGATTCGGATCGAGCTTCTTTGCCTTACGTTACCGCAAATTCCAGCCGAATCAACGGCAGCGGAAGCAGTAGCAGCGGCAGCAGTAGTAGTAGCAGCAGCGGCGGCGGCAGTTGTAGCGGTAAAGGATCAGTCGGCAATGCGAATAATAGTTGTTCGACGACCGTAGGTGGCGCTGGGTTGGTGGCCGCCAATTCGTTGAGTTGCGCGAATAATTTTGCAagtggcggcggcggcggcggcggcggcgctgTCAATAGTAGTGGTTGCGCGACTGCCGGCAGTGGAAATGTTACGGTAGTGGGCGTTGTAAGGCCGCACCAAAGGAAACCGCCATCGCCGCAGATTTTCCATCACAATGCCCGAAGTCGTAACTCTGCCGGAATGCGTGACTCGTCTCCTTCGGCCACCAACGACGAACGCTATTCCACGTTCGtgcagcagcagcaacaacaacaacaacagcgtACGCTGCATATGACCGGTGCTGGTAATAACGCGGCGCCGGAACCGCAGTGCGACTTATCTGCCAGCCGTGCGGATAACACcatggttttgaaaaacatgaacAGAGCTGCGATCAACGACCATTACCAACAGACTCGTAATAGTAGCGGAGGCGGTAAAAGTAGCAACAATAACGTTTCTAGCAGTTCCGTAGCGAATACCGGCCgaatcggcggcggcggcgataATATTAAAGTTTCTACGCTGCAGTCTGGTGAGAAATCAGTTTCGCAGACGAACAAAACGAACGCTTCTTCTAAAGTGCGCCGCAAATCGAAATCACCCGCTCTGGCTTCTTCCAACGAATCTCCAGCCAAGAAAAAAAGAGGACGAAAACGAACCATCAAAGTACCAGAGTTGACCGACAGCGACGAAGAAACGCCCAAGGCGAAAGCGGCACCCGAAAAGAAGAGACCAGGTCGACCGCCTCTCAAACGCAACGACACCGACGACTTGGATTGGAATTGCAGCTCTAGGCGTAAAACGACAGATAAATTCCCGAGCTCGACGAACGTGTGGTGCGACCGCGAGAGACGTAGAAGCAGTTTAAGAATGAGCAGTTTTACTACCGTCGATTCGGATAGCGAGACGGAAATTCAAACGGCTACGGTGAAACCGACGGCTCGCGTCATCCCCGCTGTAAAGAGAAAAGATGGCGGCGGCGGAAGCGTACGCATCAATCGTGATTCTAGTTGCGAATCGCTAAGAAACGACGGCGATTTCAACGCCATATCGGACGGTATCAAGAGCAGCGGTGGTGGGGGTGGCGGCGGTGGCGGGagtggcggcggcggcgttGTTAAAGTGCTGGAAAGTCCGCCCAAATTGGACGTCGAATGTATAGCCGTTCAAGATAAAAAGAAAAGCGACACTTTGCGAAAGTTGTTTTCGCGTCGTGAAGAGGGCGGCGGTAAAACAGGCGGTAAGGGCAAAGGCGGTAAAGGAAAGTGCGGAGTTATCGTTATGGAATCCGAGGTCGAACGTAAACTATTGCAACGTTCGTCCGTCGTCAGTCCGT
CGTTGGCGTTAGCGTCGCCGcctacgacgacgacgacgtcgtcgtcgtcgtcttcgaATTTATCAAAGATACAAGATCGCGTTCCGAACTTGACGCACGTGGCTATTTCTTCTCCCAATTTGCAATCCAACGACGGCGCGGACGGTAGATTTTCCGAAATTGGTATCGCGAACAGCGGCGGCGGTGGCGGCGGCGGCAACGATAACGCTATCAGCGGCAGCGGCAACAATCATCATCACCAAtaccatcatcatcatcatcatcaccaccaccatAATCATcaccagcagcagcagcagcagcagcagcagcagcacaACAATCACAACCTTAACCTCCACCTTCAGCACTCGAAcaccaacaacaacaacaacggcAGCGCTGTCGGTGTGGGACAGGCCGATGCTGTCGTCGATGCGATTCCTAAACTGACTTACAACGAAGCCGGTAAACCTTCGTTGATGTGTAAAATCGATCTTAGTAAAATTCCGTATATATTGGCGAAAAAACGCTCCGAAGAAGTTCGAATCAAGACCGAATTACCAGACACTAGGCAAACGACCAACATCAACGCGTCGGACGGCGTCGTTGTGAATACAAGTAGCACGAATACGACCGCTATAAATGTGTCTTTGATCGAACCTGTGTGGGACCGTCGTCGCAACGAAAACACAACCGCGTCGTGTCCTGCGGCGGCGTCAGCGACAACGGACACCGCGCCACCTGTTGCCGTCGTCGAAACTAGCGATGTTCGTCGCGGCGACAGTAGCAAACAGCATCATCAAAGATCGCAGTCGTTGCCGCATACGCGTAAAAAGCATTCGTCCAAAAAGACGAAGAATAGCAAGCGCAAGCATTACGCGGACGAATCGCGCGACATAACGGCCGCGTCACCGGATGCGCTGGCCGCGGCGGCGGTGGCAGACGATAATAACGCGTCGATATTCGCTACGGCGTCGACGGTTCATACGCCTTCCGTAGACACGACGGACATGGATTCGTCGTTATCGAGTGATTCGGAAGCGAAAAAGGTGTCCGGCGGCGTTGCAGCCAACTTTGCACGTTACGGCGgcaataaaaatacgaaaaggtcgacggcggcggcggcggcgccgCCGGAACGCGCGTCCCTCACACCCGGCGACTCGGTCGTTCGCAACAATTTGATCTCGCCGTGTTATCCTATGAATTTGAACGACGTTCAACATCAAAACGTAAACGTAAACGCGGTCGTCGATCAAAGTTCCGGAGACGACGAATCGTGGGACAGTTCGGACAGCAGTTGCAGTAATTGTTCGGCGTCGTCTCTACCATGTCACGACGCTGGCGGCGGCGGAGGCGGCGACAAATACTCTAAAATTGGCAAACGTTCTTCGCAGTATCGCAGTTGTagtaaaaagaagaagaaaaaattgcgcaACGAACATCATTCGCAACGATGTTCCACGTCATTGGTAAGCGTTATACAACGTAGCAAAGGTGACTGCGAGAGCGTCGGGAGCGTCGGTGACATCGTAGTCGGCGGCGGCGACGGTACGACGAATTTGATTCATTTGCCGAATACCAGCCACGAAAGGAGTAGCATAGGGGCGGAGGCGCCGACTGCAACGTTCAACGGTTACCATCAAGTGGCCATCAGCGATCTGGTTCGACCGACGGTGCCCACCCATCACGGCATTTATTATTCGTATTTGGAACATAAAGCTTCGGAAGAAACCAATTCCGAATCGGACAACGTTAGTCCGAATTTATATTTGATGGAAGCGAAACGTCTGAAACACGCCGCCGATACGGAACGCGATTCAGTCGCTCAAGGAATGCAATACTTGGAAGCTATTTTATCGTTCGTATTGACCGGCCATGTTTTGGAACGTAAAAATCAAATGGATACCGCCTTCAATATGTACAGCGAAACGTTGAAGCTTATCGTGtatatttcgaagaaatttcgCACTCTATGCACCAATGCGACGCAATCCAGTATACCGAATAAAATAGCG
ATATTGAGTTTACGATGCGAATccattttgaacttgaaactGTACACGATGCGTGAACACGAAGTGAAAGACGTACACAGCACGGTCGCCGAATATTTCAGCAAGCCTGTGGCGCCGGAAGCGTGCTTGATGTCGAACGCGGCTGGAATTCCGAGTCCGCATTCGCCGACGCCTTCGCCGGCCAGTTCGGTCGGAAGTAATTCTTCTTCGGGTTACAGTACTGGCGATCGCAGATCGCTGGCCGCAATGTCCCAGTATGTCGCCGTTCCCTGTACCGTGCACGTCGCCTTGCAGAAGCAGCATAATTTCTATTCTAATCTAATCGCGTCTTTCGGCTATTGGGAAGAAGCGGATAAACTCGTATTTTCGGGCGGAAATAAAGatTTCTTCATCGAATTGGACAGGTTTTGCGGACCGCTTACGTTGCATAGCTCGTTGAAAGATTTGGTACATTACGTGCGCGTTGGCATTCAGAGGTTAAAAGCCATCATCAGTCACAACCACGGCGTATGA
Protein Sequence
MKGGAYSSMKTVADLSSSKPCVERERLREREREARAQMTFEADQRASQDMINSAPLFGEIVRVNPKSNDKERQQIERKLGRFEDVKHLLADQDVTNLFGVDGQPPPSPAPSISSGSSSSSIVGHEFKKPNACSQSHQTKSSHHHHHHHHHSSAQRNTFAKPSGGGGGGDGKSAYANRNSFYAAPSSKHGNAASINLPPTSSARGGGALSTAINSSSIGSVNNSGGGGGGSLAATNHHYGSTSNPTSSSIHDYSVRIQQSAAKNLPKINCEASSVILRDSARGMGCANDNASSVSLTNVENILDEMTSSLRTPLTAIAATPRKEVESKFTFNPISGKVESLLPLPRTAVDKSKATGDAHWAALSVENPTLVNQWNTPYRSGVRRYRWWPSRSLYAQTLYIFALNADQYGRSSVSVVAFDSCNKSNALNNVLASSGGGRSAFIASAFRNDDSTNVLTTPFSSNSLSAAGVGNGSSRSTGSGGYAYSPSVAGIAASSQANKEKLDSTTLPCLKSDLSLSEDTDDGDGAENGNDCDDEDDDDDDDDEEEDGLRGRRASRSRSNTADQKENNKYKMTSGAGVAFVPKKEPSSAAAAAAATTTTTTTAATKADELIYGVIATIRSSNGAFSCIFCFASLRKCDAAASETRNTSSSDDSDSDSGSESEDSSSDSPSPPSLRLPVVTEQEESKHRWNLKNFLPPSAHGNSTPTSAHNSQNRRRQNCEKSKIGDSDSSAGSAARRKSRLTSCSTSDSDNDSASAKVNARSRKRIVDNGTKAGSNVGIAGATGSMCGGNAPVSGNTGAAAAAVSASVDKRRKYRLRSSSYSQSSDEDSDRASLPYVTANSSRINGSGSSSSGSSSSSSSGGGSCSGKGSVGNANNSCSTTVGGAGLVAANSLSCANNFASGGGGGGGGAVNSSGCATAGSGNVTVVGVVRPHQRKPPSPQIFHHNARSRNSAGMRDSSPSATNDERYSTFVQQQQQQQQQRTLHMTGAGNNAAPEPQCDLSASRADNTMVLKNMNRAAINDHYQQTRNSSGGGKSSNNNVSSSSVANTGRIGGGGDNIKVSTLQSGEKSVSQTNKTNASSKVRRKSKSPALASSNESPAKKKRGRKRTIKVPELTDSDEETPKAKAAPEKKRPGRPPLKRNDTDDLDWNCSSRRKTTDKFPSSTNVWCDRERRRSSLRMSSFTTVDSDSETEIQTATVKPTARVIPAVKRKDGGGGSVRINRDSSCESLRNDGDFNAISDGIKSSGGGGGGGGGSGGGGVVKVLESPPKLDVECIAVQDKKKSDTLRKLFSRREEGGGKTGGKGKGGKGKCGVIVMESEVERKLLQRSSVVSPSLALASPPTTTTTSSSSSSNLSKIQDRVPNLTHVAISSPNLQSNDGADGRFSEIGIANSGGGGGGGNDNAISGSGNNHHHQYHHHHHHHHHHNHHQQQQQQQQQQHNNHNLNLHLQHSNTNNNNNGSAVGVGQADAVVDAIPKLTYNEAGKPSLMCKIDLSKIPYILAKKRSEEVRIKTELPDTRQTTNINASDGVVVNTSSTNTTAINVSLIEPVWDRRRNENTTASCPAAASATTDTAPPVAVVETSDVRRGDSSKQHHQRSQSLPHTRKKHSSKKTKNSKRKHYADESRDITAASPDALAAAAVADDNNASIFATASTVHTPSVDTTDMDSSLSSDSEAKKVSGGVAANFARYGGNKNTKRSTAAAAAPPERASLTPGDSVVRNNLISPCYPMNLNDVQHQNVNVNAVVDQSSGDDESWDSSDSSCSNCSASSLPCHDAGGGGGGDKYSKIGKRSSQYRSCSKKKKKKLRNEHHSQRCSTSLVSVIQRSKGDCESVGSVGDIVVGGGDGTTNLIHLPNTSHERSSIGAEAPTATFNGYHQVAISDLVRPTVPTHHGIYYSYLEHKASEETNSESDNVSPNLYLMEAKRLKHAADTERDSVAQGMQYLEAILSFVLTGHVLERKNQMDTAFNMYSETLKLIVYISKKFRTLCTNATQSSIPNKIA
ILSLRCESILNLKLYTMREHEVKDVHSTVAEYFSKPVAPEACLMSNAAGIPSPHSPTPSPASSVGSNSSSGYSTGDRRSLAAMSQYVAVPCTVHVALQKQHNFYSNLIASFGYWEEADKLVFSGGNKDFFIELDRFCGPLTLHSSLKDLVHYVRVGIQRLKAIISHNHGV
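The protein sequence can be cross-checked against the coding sequence by translation. The following is a minimal sketch, assuming the standard genetic code applies to this gene. The coding sequence mixes upper and lower case, so the sketch upper-cases its input first. The function name translate is hypothetical, and only short excerpts from the start and end of the coding sequence are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # TNN
    "LLLLPPPPHHQQRRRR"   # CNN
    "IIIMTTTTNNKKSSRR"   # ANN
    "VVVVAAAADDEEGGGG"   # GNN
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 0; stop codons become '*'."""
    # Drop whitespace so a sequence wrapped over several lines can be passed in.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons with ambiguous or unexpected characters are reported as 'X'.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


if __name__ == "__main__":
    # First 30 bases and last 12 bases of the coding sequence above.
    print(translate("ATGAAAGGCGGCGCTTATTCATCGATGAAG"))
    print(translate("CACGGCGTATGA"))
```

The first excerpt translates to the opening residues of the protein sequence and the second to its final residues followed by a stop codon. To check the full record, pass the whole coding sequence and compare the result, minus the trailing stop, with the protein sequence.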

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-