Basic Information

Gene Symbol
lilli
Assembly
GCA_029852145.1
Location
CM056846.1:3159287-3186172[+]
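
The Location field packs contig accession, coordinates, and strand into one string. A minimal sketch of splitting it apart follows; the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for strings shaped like "contig:start-end[strand]".
LOCATION_RE = re.compile(
    r"(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)

def parse_location(text):
    """Split a location string into contig, start, end, strand and span."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    return {
        "contig": match["contig"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: 1-based inclusive coordinates, hence the +1.
        "span": end - start + 1,
    }

location = parse_location("CM056846.1:3159287-3186172[+]")
print(location)
```

Under that assumption the gene spans a genomic interval much longer than the coding sequence, which is consistent with introns being present.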

Transcription Factor Domain

TF Family
AF-4
Domain
AF-4 domain
PFAM
PF05110
TF Group
Unclassified Structure
Description
This family consists of the AF4 (proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains Lilliputian, a Drosophila AF4 protein homologue that contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
1   9      0.22   2.6e+03   -1.7   0.2         4      28        31      55        29      59  0.81
2   9   2.3e-08   0.00028   21.3   0.7        24     130        76     183        68     200  0.57
3   9   4.2e-09     5e-05   23.7   0.7       341     448       283     388       261     400  0.69
4   9         3   3.5e+04  -12.3  30.3       442     501       451     519       432     534  0.38
5   9         3   3.5e+04  -18.5  27.0       177     460       571     648       502     672  0.25
6   9         3   3.5e+04  -14.7  25.1       422     482       652     709       576     725  0.58
7   9      0.12   1.4e+03   -0.9  10.6       419     502       727     822       721     827  0.67
8   9     0.027   3.2e+02    1.2  12.3       113     212      1050    1150       954    1199  0.47
9   9      0.18   2.2e+03   -1.5   1.4       129     220      1236    1328      1204    1364  0.56
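
To read the domain table programmatically, each row can be split on whitespace into its 13 fields. The sketch below does this for the nine rows above and keeps the hits whose independent E-value falls under a cutoff. The `DomainHit` type, the `parse_row` helper, and the `1e-3` cutoff are illustrative assumptions, not part of this record or of HMMER.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    index: int       # "#": domain number
    total: int       # "of": number of domains reported
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float       # mean posterior accuracy of the alignment

# Rows copied from the Hmmscan Out table of this record.
ROWS = """\
1 9 0.22 2.6e+03 -1.7 0.2 4 28 31 55 29 59 0.81
2 9 2.3e-08 0.00028 21.3 0.7 24 130 76 183 68 200 0.57
3 9 4.2e-09 5e-05 23.7 0.7 341 448 283 388 261 400 0.69
4 9 3 3.5e+04 -12.3 30.3 442 501 451 519 432 534 0.38
5 9 3 3.5e+04 -18.5 27.0 177 460 571 648 502 672 0.25
6 9 3 3.5e+04 -14.7 25.1 422 482 652 709 576 725 0.58
7 9 0.12 1.4e+03 -0.9 10.6 419 502 727 822 721 827 0.67
8 9 0.027 3.2e+02 1.2 12.3 113 212 1050 1150 954 1199 0.47
9 9 0.18 2.2e+03 -1.5 1.4 129 220 1236 1328 1204 1364 0.56
"""

def parse_row(line):
    """Convert one whitespace-separated table row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )

hits = [parse_row(line) for line in ROWS.splitlines() if line.strip()]

# Assumed cutoff for illustration only; choose one suited to the analysis.
I_EVALUE_CUTOFF = 1e-3
significant = [hit for hit in hits if hit.i_evalue < I_EVALUE_CUTOFF]

for hit in significant:
    print(hit.index, hit.i_evalue, hit.ali_from, hit.ali_to)
```

With this cutoff only domains 2 and 3 remain, covering protein positions 76-183 and 283-388; the other rows have independent E-values well above 1.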

Sequence Information

Coding Sequence
ATGTGCAAGTGTGACGACACTTGTGTGAGGGGAAATGCGATAAGCCGATTGCCAATGCTCGTGGGTTTGCTAGATATCGATCGAAGCGTGGACCGGGACCGGCTTCGAGAGCGTGAGCGGCAGGCACGCGCGGCGATGTCGGTCCAGGCAGAGCAGGCAGCAGCAGCCGGAGGTCCAGAGAATTGCCACAGCCACCACAATCACGGCCATCACCATCATCACGCTAATTCCCACGTGTCTGCAGCTGCCTCGCTCTTCCGCGCCCCCGTCAagGTAAATCCTGACGCGCACGACCGAACAACTCAGCAGATCCAATCCAAGCTCGGGAACTACTCACTGGTAAAGCATCTGCTGGACGAGCCAAAGAGACTGATAGGAATTGAAGGTGTACCAGCTAGTCCAGCGCCGGGGTCGAGCTCATCTACACGTTTATCTTCGAGCACAAGTTGTAGGAGTTCGCCGTCATcgcaagaatttaaaaaaccaggCGGTAATGGTCCCCGAacatcatcgtcatcatcatcaacaacCGGGTCTACTTCGAGCCATACCTCCCAGCGCGGCGGTTTCATAAAACCCGCGGATGGGAAACCACCTTATGGTGGTCGAGGTGGCTATCCAGGGCAACCAGTAAAACACGGTGGCAGTAGCAACGATCACAGAAGCCACGGGATCCTTCCCGCAAAGGGTCCACCTTCGTCAATCCCTGGTAATGCCAATTCTACCGGAAACAGCGGTGGTCTAACCTCTTCCGGAAATTGTCCTCCCCCCGGTAATTCGGGTAATTTGAGCAGAGTTCACGCAGCTGCTTCCAGACTCCCAAGGTTACCTCTTGACAACgGAGTAAGACATGGGTCGGATCCAGCAGACTTGGAAAACATCCTCAAGGAAATGACGATGCCGCCCATACCACTTACAGCTATTGCACAGACACCAAGAAAAGAACTGGAATCCAAGTTCACATTCAACCCTGTACTAGCTAAGCTGACTGAGATACCGCCTGAACCTATCAAGCCGCCAcaaCGTGAACGACATGGGACCACACGTTTATCAGcTGATCTTGAGCGTGATTTAAGTCTCTCGGAAGACAGTGAAGATGAAGGCGTCAAGACGTCAATTTCAAGAGTATCAAGATCCACTGCAAGTCCAAGTATAAACGCTGAtCATTCAACACCACTGGCACCAGCTATGACACCGGTCCCTGCACCTCTAGCACCGATGTCACCCAGAGCATTGTCGCCATTGGTGACTTTATCACCTCCGCGACCTATAAGCCCGCTCAGGACCACGCCAAAGCACACGGTGTCCGAGCGATTGCTGTCACCGTCATCCAGCTCACCGAGAAAAGGCTCGCCGACAATGCCGACGAGGCCTCCTAGTCCTCCTGGCCAAGCGCCATTGAGTTCTGGCAGCGCCAGCTCTAGTTCGGACTCTGGATCAGACTCGGGTACTGACAGCAGTGATGATTCTGAGGATGAAAACGCACCTCCACCACCAAAAGGTCCTTCCACACCGCCTTCCGTGTCTCCAAAAGTCCCGGTGGAAGAACCTCCTCCTGCAGAGGAGTCCAAGCCTCGTTGGAATCTCAGTAGTTTCTTCAACAAAACTGCCGTACAAGCTGGTGACAAAAATTCAGAGAATAAAACTGCTCagaATGAAGCAACTCGTCGTGACAGTTCACCTGAGGAAGCGTCTCTAGATATGAGAATACGCCGGTCGGATGGAGCTCATCATAATAACGAGTGGCAGCTCGATGAAGCGTTAAAAAGAACTAGAAACGCGACGATGTTGACGGTCCTGAGTGATGGTGATCAAAATTCCGACTCGGagaaaaagaagaaagaaGAATTGAGATCTCAAATTCAAGATAAACCAAAAGCACCAGATGTGCGAAAACGTGGTCGGCCACGTAAAACAGCTAAAGAAACGGTCAAGAGTCCGAGAAGTCATAGAATGTCGTCAGAGGAATCTAAAGCCAGCTCTAAACGCAGTAAACCTCG
TAGTGTAAGCAATCCTAAGAAAAAAACTACAGCAATACCGCTGCCAAGGGTTCCACTTGATAACAGCGACGAGTTGAGTGACTCAAGATCACGTGACAGATCGAGCGACTCGGAGTGCGAGCAGTCGAGAATATCTCCAGCCTCGACAGTGATCCCAGTTGACAAACGAAGATCACGGTTAAGTATCTCTTCGAGTGAAGACGAGAAAACGACGAATAAGCACAGCGCTTCGGAAGACGACGCGCGGTGGAGAAGACTATCGTCAAAGCGCAATAAGCTAACAGATTCTCCAACTAAAAAGCAAGACAAGAAGAAGAGCCCCAACAAAGTTAAACCTCAACGTCCCAGGTCGAGGATAACAAATAACTCGGGCTGTGCTTCTGATTCCGACAGTGAATTAGAGACAACGTTGAGGAACAATCGCGTACAGCCAGTCGCGAGAGTACCACCGAGACCCAGAGCCCCGCTTACAAGAGTTACCTCACCTGACAATTCTGACAGCGACAACAGTCCAACGCCAAAACTACAAGAAGAAGACGCCGGCAATGTGCAGGATAAGAAAAAGAATGATACACTTAGGAAACTATTTTCAAGTGCCAAGGGCGGTGCGAAAGGTGGCGGTAAGGGCGGCAAAGGTGGCAAAGGCGGTGGCAAATGTGGAATTTACGTGGAAGAATACACCGCGAGTACGCCGACGGGCGGTGAAAGTCCATACAAAAGACCGTCGTCACAATCTTCCGCTACAGCAAACTTCCCACCCTTGAAATATGTCAATGGAGTACCCATACTTGTGTGTAAAGTCGATCTTAGCCGACTTTCTCATGTCCCACGGCCGTCGCGAGGTCAAGAATTGAGACAGCGAACGGAATTACCGGACACAAGGCCGGCTTCACGACAATCTGTTAAATCCGAGCGACCGCCTACGCCGGAGGAGGGTGAAATTATCGATACCCCTACACCACCACCGTCCATCGATTTCAGGACACACGTTGATAATCATCACGTCACGACTGATCAAGCTGTGTTATCAAAAAGTAAACGTACTGAGAGTAAGAGTGAATTAGTTGTAACTTTAGCTGGGGCGGATTCGAAAAATCGTGCGATACCTGGTGGCAGTGGATTATCTAGTGGCCCTACGAACTCCAGTGCCGGTGCTAGTGCTAATGTTATTACAGATAATACTGGGGATCGAGTGCCGAAGCGTAAGCGCAACACTAGTTGCAGTTCTGTCTCTAGTTTAAATATGTGTTCAATGGACAGTAAAGTTAAATCCACAAGTGAGCACAAggagaagaaaaaaagaaaacgacATCACACTGACAAAGACTCAAACTCATCCAGATCATCTTCCCGACaacAAAATGACATCCAACCCACAAATCACGAGCGGGAAGAAAGATCTGACGTTAATTTGCTGCCTCCACCGGTGGCACCACCTCAACGCGTCTACTATTCTTACTTCAATCATCAAAATGACGTTTTGGAGGACCAGGATAGGGACCAGAACCAGTACCTGACGGAAGCTAAAAGATTAAAGCACAGTGCAGATGAGGAATGCGAGCTGACGGCCCAGGGTATGCTCTACTTGGAGGCAGTATTGTATTTCCTACTTACAGGCCACGCCATGGAGTCAGATCCAGTAACTGAAAGATCCTCTTTTACCATGTACAGAGACACACTTAGTCTTATAAAATACATCTCATCTAAGTTCAAGAGCCAACAGAACAATTCACCTGAGAGCAGTATCCATAACAAGTTGGCTATCTTGAGTTTATGGTGCCAgtctcttatttatttaaaactctaCAAAATGCGGCAACACGAAGTCAAGGAAAACGCTAAAATTCTTGCAGAGTACACGCAAAAAccTGCACAGCCGACCCTCGTCCATGCTGAAGGCCAAGGAACTCCATCGTTGTCTCCAACACCATCGCCGGCCGGTTCGGTTGGTTCCGTTGGTAGCCAGAGTTCGGGATACAGCAGCGGTGAGCTCGCGAATC
GCGGTAACGTCGTCTCAGGACAGCCATCAGCTTCAACTTTTGTCAGCGTTCCGCTTGCAATTCACTCGATGCTGGTAAAGCAGAATCAACATTTCTCATTATTAACTAATTGTCACGATTTGTGGGATCAGGCGACCCTATTAGTTACGGACAAACATCGAGattttttcattgaattagACGAAAAACTTGGTCCGCTGACATTAAGAAGTTCATTACGGGACTTGGTGCGTTACGTACAAGCTGGAATTAAGAAGTTAAGAGCTCTCTGA
Protein Sequence
MCKCDDTCVRGNAISRLPMLVGLLDIDRSVDRDRLRERERQARAAMSVQAEQAAAAGGPENCHSHHNHGHHHHHANSHVSAAASLFRAPVKVNPDAHDRTTQQIQSKLGNYSLVKHLLDEPKRLIGIEGVPASPAPGSSSSTRLSSSTSCRSSPSSQEFKKPGGNGPRTSSSSSSTTGSTSSHTSQRGGFIKPADGKPPYGGRGGYPGQPVKHGGSSNDHRSHGILPAKGPPSSIPGNANSTGNSGGLTSSGNCPPPGNSGNLSRVHAAASRLPRLPLDNGVRHGSDPADLENILKEMTMPPIPLTAIAQTPRKELESKFTFNPVLAKLTEIPPEPIKPPQRERHGTTRLSADLERDLSLSEDSEDEGVKTSISRVSRSTASPSINADHSTPLAPAMTPVPAPLAPMSPRALSPLVTLSPPRPISPLRTTPKHTVSERLLSPSSSSPRKGSPTMPTRPPSPPGQAPLSSGSASSSSDSGSDSGTDSSDDSEDENAPPPPKGPSTPPSVSPKVPVEEPPPAEESKPRWNLSSFFNKTAVQAGDKNSENKTAQNEATRRDSSPEEASLDMRIRRSDGAHHNNEWQLDEALKRTRNATMLTVLSDGDQNSDSEKKKKEELRSQIQDKPKAPDVRKRGRPRKTAKETVKSPRSHRMSSEESKASSKRSKPRSVSNPKKKTTAIPLPRVPLDNSDELSDSRSRDRSSDSECEQSRISPASTVIPVDKRRSRLSISSSEDEKTTNKHSASEDDARWRRLSSKRNKLTDSPTKKQDKKKSPNKVKPQRPRSRITNNSGCASDSDSELETTLRNNRVQPVARVPPRPRAPLTRVTSPDNSDSDNSPTPKLQEEDAGNVQDKKKNDTLRKLFSSAKGGAKGGGKGGKGGKGGGKCGIYVEEYTASTPTGGESPYKRPSSQSSATANFPPLKYVNGVPILVCKVDLSRLSHVPRPSRGQELRQRTELPDTRPASRQSVKSERPPTPEEGEIIDTPTPPPSIDFRTHVDNHHVTTDQAVLSKSKRTESKSELVVTLAGADSKNRAIPGGSGLSSGPTNSSAGASANVITDNTGDRVPKRKRNTSCSSVSSLNMCSMDSKVKSTSEHKEKKKRKRHHTDKDSNSSRSSSRQQNDIQPTNHEREERSDVNLLPPPVAPPQRVYYSYFNHQNDVLEDQDRDQNQYLTEAKRLKHSADEECELTAQGMLYLEAVLYFLLTGHAMESDPVTERSSFTMYRDTLSLIKYISSKFKSQQNNSPESSIHNKLAILSLWCQSLIYLKLYKMRQHEVKENAKILAEYTQKPAQPTLVHAEGQGTPSLSPTPSPAGSVGSVGSQSSGYSSGELANRGNVVSGQPSASTFVSVPLAIHSMLVKQNQHFSLLTNCHDLWDQATLLVTDKHRDFFIELDEKLGPLTLRSSLRDLVRYVQAGIKKLRAL
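
The protein sequence can be checked against the coding sequence by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly; the `translate` helper is hypothetical, and only the first and last few codons of the coding sequence are used as input.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Lowercase (soft-masked) bases are treated like uppercase ones.
    Codons containing anything other than A, C, G, T become 'X'.
    """
    sequence = cds.upper()
    protein = []
    for offset in range(0, len(sequence) - len(sequence) % 3, 3):
        amino_acid = CODON_TABLE.get(sequence[offset:offset + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First ten codons and last three codons of the coding sequence above.
cds_start = "ATGTGCAAGTGTGACGACACTTGTGTGAGG"
cds_end = "GCTCTCTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the opening and closing residues of the listed protein, with the terminal `TGA` read as a stop codon.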

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01003517;
90% Identity
iTF_01003517;
80% Identity
-