Basic Information

Gene Symbol
lilli
Assembly
GCA_900065295.1
Location
FIZT01041768.1:3254-12809[+]

Transcription Factor Domain

TF Family
AF-4
Domain
AF-4 domain
PFAM
PF05110
TF Group
Unclassified Structure
Description
This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 10 3.5e-12 7.4e-08 32.3 3.5 4 133 21 153 19 189 0.62
2 10 1.2e-06 0.025 14.0 0.1 338 381 194 238 177 264 0.86
3 10 1 2.1e+04 -7.4 12.2 163 246 357 438 283 450 0.39
4 10 2.6e-07 0.0055 16.2 21.2 436 513 439 515 430 516 0.77
5 10 1 2.1e+04 -6.5 10.1 130 220 568 658 519 706 0.44
6 10 1 2.1e+04 -7.9 15.3 117 244 808 935 775 945 0.38
7 10 0.0027 56 3.0 0.6 370 486 939 1052 934 1064 0.86
8 10 1 2.1e+04 -6.3 15.1 114 265 1296 1442 1260 1457 0.38
9 10 1 2.1e+04 -6.3 8.5 444 480 1459 1495 1452 1515 0.68
10 10 0.46 9.6e+03 -4.4 0.5 447 478 1761 1792 1733 1804 0.63

Sequence Information

Coding Sequence
ATGAAAGGCGGTGGTTATTCGATGAAGGCAGTTGCGGATTTGAGTTCGAAATCTAGTGTCGATCGAGAACGTTTGAGAGAACGCGAAAGAGAAGCTCGAGCCCAGATGACTTTCGAAGCAGATCAGCGAGCGTCTCAAGATGTTATAAATTCGGCGCCGCTGTTCGGCGAAATAGTTCGCGTAAATCCAAAATCCAACGACAAAGAAAGACAGCAAATCGAACGTAAACTGGGCCGATTTGAAGACGTGAAACATCTATTGGCCGATCAAGATGTGACGAATCTTTTCGGCGTAGACGGCCAACCTCCGCCCAGTCCAGCTCCAAATATTTCTTCAggttcgtcgtcttcgtcgtcgactGCTGGCcacgaattcaaaaaacccAACGCCTGTCTTCAGTCTCATCCGACGAAATCTTCacatcatcatcaccatcaccatcaccatcatcaTTCCGGCGCTCAAAGgAACGCGTTCGCTAAGCCCAGTGACGGAAAATTAGGGTACGCTAATCGCGGCAGTTCGTACACGACTCCGTCGACGAAACACCTACGAGACTCTTTGCGAGGTATGAGTTTTACGGACGATAATACGTCGGTTTCTCTAACTAACGTTGAAAATATCTTAGACGAAATGACGTCGGGAGTCCGTACGCCGTTGACAGCAATCGCAGCTACGCCACGAAAAGAAGTCGAGTCAAAGTTTACTTTCAATCCGATCTCCGGAaagGTCGAATCGTTACTGCCCATGCCGAGAATGGCTATCGATAAAGGCAACAAATGCAATGCCACGAATAACGTACTGGGATCTACTGGCagtggcggtggcggtggccGTAATTCGTTCATTTCTCCCGTTTTTCGTAACGACGACACGATGCCCTTCTCTTCCACTTCGTTGTCTGTGGCTAGCGTATCAAAAAATGTTGGTGGAGGATACGCTTATTCGACATCTGCGAACGACATGATTTCGTCGCCAGCAAGCAAAGAGAAAATCGATTCGACCGCTCTGCCATGTTTAAAATCGGATCTGAGTTTGTCCGAAGATACAGACGAAGGAGATGGAGTAGAGAACGGAAATGATTGCGACGATGAAGACGATGACGaggaggaagaagaagaagaagacgaggACGACGGCAACGAAGAGGAGGAGGACGacgaaaacatgaaaaatggcTCGCGAGAACAACGTCAATTGCTGCCGCAGCGTTCCAGAGGCAATGCGAccgacgaaaaagaaaataataagTTACATGTGATGCAGTGCAAGGCGATGGGCGGCGCGTCTTTCGCTTCCAAAAAAGAactatcatcatcatcatcttcTGCGGACCCTGCTAAAACAGAAGAGTTACGAAAATGCGAAGCTGCCGTTAGCGAGGGTAGAAATACGTCTTCCAGTGACGACAGTGACTCCGACTCCGGTTCCGGTTCAGAAAGCGAAGATTCGAGCAGTGACTCGCCTTCGCCGCCGTCTCTGCGTTTGCCGGTGGTGGCCGAGCAGGAAGAGAGCAAGCATAggtggaatttgaaaaatttcttaccgCCGTCCGCACACGAAAACTCGACGCCCGCGTCGACGCATAATTCGCAGGTAATGTATTGTTGTGCGCGATGCGAAAATGTCGCTTTTACACGCGAGACTATCACTTGTACGTTTACGTCTAATTCTGCTTCTGTGTTGGATCGCGTGCAGGCGCGTCGTcgtcaaaattgtgaaaaaccaaaagccaACGACTCGGACAGTTCGGTGGGCAGCGCGAGAAGAAATACAAGATTGGCCTCTTGCAGTACATCAGACTCGGACAACGGCAACCGCCACCGCAACGGCAGCGTAAGATCGTCCGCTAGTTCGCGTAAGCGTGCCATCGACAGTAATACGAAGGCTGGCTCGAATAAGGGCCTCGGCTCCGGTAACGTAAACGCCGGCCCAGTGAGTGTCGGCGGCGGTAATGTATCCGCTACCGACGACGCAGTCGGATCCAACGACGTCGATAACCGTCGAAAGTGCCGGCTGCGATCGTCGTCCTACTCGCAATCTTCTGACGATGATTCGGATCGAGCTTCGCCTTACGCCGGCGCGAATACCATGCGAAACAACAGCGGAGGCAGCAATTTGAACGGCATCACTGTGGGTAGCGGAGGATGCAGCAGCGGTATTAGCGGTAATGGTAACAATAGTAGTTGTTCGACGTCTGTCGGCTGCGCAAGTTTAGTGGCCAATTCGTTGAGTTGCGCGAATAACTTTGCCAGCAACGGCGGAGGCGCCAGTGGTAACGGTTGCGCAGCGACAAGTGGAAATGTCACCGTAGTGGGTGTTGTCAGACCGCATCAAAGAAAACCTCCTTCGCcgcaaatttttcaccttggTGCCAGAGGTCGTTCCTCAGCCGGCATGCGCGATTCGTCTCCGACGGCGGTAACCAACGACGAACGCTATTCCGCGTTCGTgcagcagcaacagcagcagcaacaacgtTCGCTACATTTGACTAGCGCAAGCGGAAATGCGCCAGAATCTCAATGCGATTTATCTGCCGGCCAACCCGGCAGTACGCACGCCAGTGTGGTCGCTAGTGGCAGCCGTGTCGCCGGCGGCAACCTCAAAGTTGCCATGCTGCAGTCGAACGACAAGTCAACTttgcaaacaaacaaaaccaGCGTTTCTTCTAAAGTTCGTCGTAAGTCGAAATCGCCGGCGCTAGCTTCCTCCAACGAATCTccggcgaagaaaaaaagaggacGCAAACGTACCATCAAAGTGCCAGAATTATCGGACAGCGACGAAGATACGCCCAAGACGAAAGCGGCGCCGGAAAAGAAGAGACCAGGTCGACCACCTCTCAAACGCAACGATACCGATGACTTGGATTGGAATTACAATTCTAGGCGTAAAATGACGGATAAATTTCCAAGCTCGGCGAACGTTTGGTGTGATCGCGAACGACGACGCAGTAGTTTACGAATGAGCAGTTTTACTACGGTGGATTCGGATAGCGAAATAGAAATACCGTCGGCGACGGTAAAACCAACGGCTCGCGTCATTCCTGctgtgaaaagaaaagatggcggcggcggcggcggcggtggcagCGGCAGCAGTAGCGTACGCAACAATCGTGATTCGAGTTGCGAATCGATGAGAAACGATAGCGGCGATTTCAACGCGATATCGGACAGCATCAAAAATATGGGTGGCAGCTGCGGCATTGTTAAAGCGTTAGAAAGTCCGCCCAAACTGGACGTCGAATGTATAGCTGTTCAAGATAAAAAGAAAAGCGACACTTTGCGAAAGTTGTTTTCGCGTCGTGAAGAAGGCGGTGGCAAAACAGGCGGCAAAGGAAAAGGTGGTAAAGGAAAGTGCGGCGTCATCGTTATGGAATCCGAAGTCGAACGAAAATTGTTACAGCGTTCGTCCGTCGTCAGTCCAGCATCGCCTTCTTCCAATTTGCCGCAGATGCAAAATGGCGTTCTCGATATGACGCCGCACGCGGCTATTTCTTCGCCTAGCTTGCAATCCAACGACGACTTATCCGGTTACAACGACAATATCGGCGGTGCTGGTGctggtggcggcggcggcggcggcggcggcggcggcaacGGCAACGGCAACAACAGCTGTATAGATGGAATGAGTCAAGTTGACGCCGCGGTCGATATGTTTCCCAAGTTGACTTACAACGAAGCTGGTAAACCCTCTTTGATGTGTAAAATCGATCTCAGTAGAATACCTTATATTGTGGCAAAGAAACGCTCCGAAGAAATTCGAATCAAATCCGAATTAGCGGACACCAGGCAAACGACAAACAACGTGCAAGACGGTAGCGGCGGCGCTGTCGCCGTCATTACGCTTGACGCGTCTTCCACGAATACCACTTCCAATGTGCCTTTGATCGAACCGGTATCGGATCGTCGCCGTCACGATAATGTGAAAGACGTCGCTGCGTCGATGATGACCGCTGAAAACAGCGACCTTCATCGCGATCAAAATAGACAACTGCATGAAAAGTCGCAGCCACCGACGTTGACACGTCCGCGTAAAAAACACGCGTCCAAGAAGACGAGAAGCGGCAAACGTAAGCATTACGCGGACGAAGCGCGCGACGTAACGATGGCGGCGGCGTCACCGGCTGCGGCGGCCGAGGACGATGATGGTACGTCGATATTTGCTAACGCGTCTACCGTGCATACGCCTTCGGTAGACACCACGGACATGGATTCGGCGTTGTCGAGCGATTCGGATACGCGAAAGGTAGCACGTTGCGCTaatcaaaatgcaaaaaagtcGCTAGCTGTACACGACTCGTCGCACGCAACCGATTCGTCCATCCCCAGCAACATGATCTCGTCATGTTACGCTGTGACTTTGAACGATGTTCAGCATCATGAGAATGCAGGCGCGGTAGCCGGTCAGAGTTCGGCGGACGACGAATCGTGGGACAGTTCGGAAAGCAGTTGCAGTAATTGTTCGGCATCGTCCACGCCTTGTCACGACTCGGTAGacaaatactcgaaaaatggCAAGCGTTCCTCTCGGTACCGCAGttgcggcggcggtggcgttaaatcgaaaaagaaaaagaagaaaaaaaagaagcttcGAAATGAGCAGTCGCCAAGATGTTCCACATCGTTGGGTGATTGCGCAGGCAGTATTGGTGGCGGTGGCGAcggcggtggcggtggtgTTGGCGACCATCTGCCCAACACGAGCCACGAAAGAAGCATTATGACGGAGACGCCAACCACGACAACGTTCAACGGTTATCATCCAGTTGTCATCGGCGATCTAGTTCGACCCACAGTGCCTACGCATCACGGCATTTACTACTCGTACCTCGAACACAAAGCTTCCGAAGACTTGAATTCCGAATCGGACAACGTTAGTCCGAATTTGTACTTGATGGAAGCGAAACGTCTGAAACATGCCGCCGATACGGAAAGAGATTCGGTCGCGCAAGGAATGCAATACTTGGAGGCTATCTTATCGTTCGTGCTGACCGGCCACGTTTTAGAACGTAAAAATCAAATGGATACCGCATTTAATATGTACAGCGAAACGTTGAAACTTATCGTgtatatttcaaaaaaatttcgcacttTATGCACGAATGCGGCGCAATCCAGTATACCAAATAAAATCGCCATTTTGAGTTTGCGGTGCGAATCTATTCTCAACTTGAAGCTTTATACGATGCGAGAAAACGAAGTGAAAGAAGTACACAACGCAGTTGCCGAATATTTTAACAAGCATGTTGCGGCGGAAGCGTGCTTAATTTCTAACGCGGCTGGAGTACCCAGTCCGCATTCGCCGACGCCTTCGCCCGCCAGTTCGGTCGGAAGTAATTCATCCGGTTATAGTACCGGCGATCGTAGGTCTGTCGCCATGTCTCAGTGTATCGCTGTGCCCTTCAATTTGCACGTGGCGTTGCAAAAGCAGCATAATTTCTATTCCAACTTGATCGCGTCCTTCAGCTATTGGGAAGAAGCGGATAAACTTATATTTTCTGGCGGAAATAaggatTTCTTCGTCGAATTGGACAGATTTTGCGGCCCGCTAACGTTGCATAGTTCGTTGAAAGATTTGGTGCATTACGTGCGCGTTGGTATTCAACGATTGAAAGCAATCATTAGTCATAATCACGGCGTATGA
Protein Sequence
MKGGGYSMKAVADLSSKSSVDRERLREREREARAQMTFEADQRASQDVINSAPLFGEIVRVNPKSNDKERQQIERKLGRFEDVKHLLADQDVTNLFGVDGQPPPSPAPNISSGSSSSSSTAGHEFKKPNACLQSHPTKSSHHHHHHHHHHHSGAQRNAFAKPSDGKLGYANRGSSYTTPSTKHLRDSLRGMSFTDDNTSVSLTNVENILDEMTSGVRTPLTAIAATPRKEVESKFTFNPISGKVESLLPMPRMAIDKGNKCNATNNVLGSTGSGGGGGRNSFISPVFRNDDTMPFSSTSLSVASVSKNVGGGYAYSTSANDMISSPASKEKIDSTALPCLKSDLSLSEDTDEGDGVENGNDCDDEDDDEEEEEEEDEDDGNEEEEDDENMKNGSREQRQLLPQRSRGNATDEKENNKLHVMQCKAMGGASFASKKELSSSSSSADPAKTEELRKCEAAVSEGRNTSSSDDSDSDSGSGSESEDSSSDSPSPPSLRLPVVAEQEESKHRWNLKNFLPPSAHENSTPASTHNSQVMYCCARCENVAFTRETITCTFTSNSASVLDRVQARRRQNCEKPKANDSDSSVGSARRNTRLASCSTSDSDNGNRHRNGSVRSSASSRKRAIDSNTKAGSNKGLGSGNVNAGPVSVGGGNVSATDDAVGSNDVDNRRKCRLRSSSYSQSSDDDSDRASPYAGANTMRNNSGGSNLNGITVGSGGCSSGISGNGNNSSCSTSVGCASLVANSLSCANNFASNGGGASGNGCAATSGNVTVVGVVRPHQRKPPSPQIFHLGARGRSSAGMRDSSPTAVTNDERYSAFVQQQQQQQQRSLHLTSASGNAPESQCDLSAGQPGSTHASVVASGSRVAGGNLKVAMLQSNDKSTLQTNKTSVSSKVRRKSKSPALASSNESPAKKKRGRKRTIKVPELSDSDEDTPKTKAAPEKKRPGRPPLKRNDTDDLDWNYNSRRKMTDKFPSSANVWCDRERRRSSLRMSSFTTVDSDSEIEIPSATVKPTARVIPAVKRKDGGGGGGGGSGSSSVRNNRDSSCESMRNDSGDFNAISDSIKNMGGSCGIVKALESPPKLDVECIAVQDKKKSDTLRKLFSRREEGGGKTGGKGKGGKGKCGVIVMESEVERKLLQRSSVVSPASPSSNLPQMQNGVLDMTPHAAISSPSLQSNDDLSGYNDNIGGAGAGGGGGGGGGGGNGNGNNSCIDGMSQVDAAVDMFPKLTYNEAGKPSLMCKIDLSRIPYIVAKKRSEEIRIKSELADTRQTTNNVQDGSGGAVAVITLDASSTNTTSNVPLIEPVSDRRRHDNVKDVAASMMTAENSDLHRDQNRQLHEKSQPPTLTRPRKKHASKKTRSGKRKHYADEARDVTMAAASPAAAAEDDDGTSIFANASTVHTPSVDTTDMDSALSSDSDTRKVARCANQNAKKSLAVHDSSHATDSSIPSNMISSCYAVTLNDVQHHENAGAVAGQSSADDESWDSSESSCSNCSASSTPCHDSVDKYSKNGKRSSRYRSCGGGGVKSKKKKKKKKKLRNEQSPRCSTSLGDCAGSIGGGGDGGGGGVGDHLPNTSHERSIMTETPTTTTFNGYHPVVIGDLVRPTVPTHHGIYYSYLEHKASEDLNSESDNVSPNLYLMEAKRLKHAADTERDSVAQGMQYLEAILSFVLTGHVLERKNQMDTAFNMYSETLKLIVYISKKFRTLCTNAAQSSIPNKIAILSLRCESILNLKLYTMRENEVKEVHNAVAEYFNKHVAAEACLISNAAGVPSPHSPTPSPASSVGSNSSGYSTGDRRSVAMSQCIAVPFNLHVALQKQHNFYSNLIASFSYWEEADKLIFSGGNKDFFVELDRFCGPLTLHSSLKDLVHYVRVGIQRLKAIISHNHGV

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-