Mmed003533.2
Basic Information
- Insect
- Microplitis mediator
- Gene Symbol
- lilli
- Assembly
- GCA_029852145.1
- Location
- CM056846.1:3159287-3186172[+]
Transcription Factor Domain
- TF Family
- AF-4
- Domain
- AF-4 domain
- PFAM
- PF05110
- TF Group
- Unclassified Structure
- Description
- This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 9 0.22 2.6e+03 -1.7 0.2 4 28 31 55 29 59 0.81 2 9 2.3e-08 0.00028 21.3 0.7 24 130 76 183 68 200 0.57 3 9 4.2e-09 5e-05 23.7 0.7 341 448 283 388 261 400 0.69 4 9 3 3.5e+04 -12.3 30.3 442 501 451 519 432 534 0.38 5 9 3 3.5e+04 -18.5 27.0 177 460 571 648 502 672 0.25 6 9 3 3.5e+04 -14.7 25.1 422 482 652 709 576 725 0.58 7 9 0.12 1.4e+03 -0.9 10.6 419 502 727 822 721 827 0.67 8 9 0.027 3.2e+02 1.2 12.3 113 212 1050 1150 954 1199 0.47 9 9 0.18 2.2e+03 -1.5 1.4 129 220 1236 1328 1204 1364 0.56
Sequence Information
- Coding Sequence
- ATGTGCAAGTGTGACGACACTTGTGTGAGGGGAAATGCGATAAGCCGATTGCCAATGCTCGTGGGTTTGCTAGATATCGATCGAAGCGTGGACCGGGACCGGCTTCGAGAGCGTGAGCGGCAGGCACGCGCGGCGATGTCGGTCCAGGCAGAGCAGGCAGCAGCAGCCGGAGGTCCAGAGAATTGCCACAGCCACCACAATCACGGCCATCACCATCATCACGCTAATTCCCACGTGTCTGCAGCTGCCTCGCTCTTCCGCGCCCCCGTCAagGTAAATCCTGACGCGCACGACCGAACAACTCAGCAGATCCAATCCAAGCTCGGGAACTACTCACTGGTAAAGCATCTGCTGGACGAGCCAAAGAGACTGATAGGAATTGAAGGTGTACCAGCTAGTCCAGCGCCGGGGTCGAGCTCATCTACACGTTTATCTTCGAGCACAAGTTGTAGGAGTTCGCCGTCATcgcaagaatttaaaaaaccaggCGGTAATGGTCCCCGAacatcatcgtcatcatcatcaacaacCGGGTCTACTTCGAGCCATACCTCCCAGCGCGGCGGTTTCATAAAACCCGCGGATGGGAAACCACCTTATGGTGGTCGAGGTGGCTATCCAGGGCAACCAGTAAAACACGGTGGCAGTAGCAACGATCACAGAAGCCACGGGATCCTTCCCGCAAAGGGTCCACCTTCGTCAATCCCTGGTAATGCCAATTCTACCGGAAACAGCGGTGGTCTAACCTCTTCCGGAAATTGTCCTCCCCCCGGTAATTCGGGTAATTTGAGCAGAGTTCACGCAGCTGCTTCCAGACTCCCAAGGTTACCTCTTGACAACgGAGTAAGACATGGGTCGGATCCAGCAGACTTGGAAAACATCCTCAAGGAAATGACGATGCCGCCCATACCACTTACAGCTATTGCACAGACACCAAGAAAAGAACTGGAATCCAAGTTCACATTCAACCCTGTACTAGCTAAGCTGACTGAGATACCGCCTGAACCTATCAAGCCGCCAcaaCGTGAACGACATGGGACCACACGTTTATCAGcTGATCTTGAGCGTGATTTAAGTCTCTCGGAAGACAGTGAAGATGAAGGCGTCAAGACGTCAATTTCAAGAGTATCAAGATCCACTGCAAGTCCAAGTATAAACGCTGAtCATTCAACACCACTGGCACCAGCTATGACACCGGTCCCTGCACCTCTAGCACCGATGTCACCCAGAGCATTGTCGCCATTGGTGACTTTATCACCTCCGCGACCTATAAGCCCGCTCAGGACCACGCCAAAGCACACGGTGTCCGAGCGATTGCTGTCACCGTCATCCAGCTCACCGAGAAAAGGCTCGCCGACAATGCCGACGAGGCCTCCTAGTCCTCCTGGCCAAGCGCCATTGAGTTCTGGCAGCGCCAGCTCTAGTTCGGACTCTGGATCAGACTCGGGTACTGACAGCAGTGATGATTCTGAGGATGAAAACGCACCTCCACCACCAAAAGGTCCTTCCACACCGCCTTCCGTGTCTCCAAAAGTCCCGGTGGAAGAACCTCCTCCTGCAGAGGAGTCCAAGCCTCGTTGGAATCTCAGTAGTTTCTTCAACAAAACTGCCGTACAAGCTGGTGACAAAAATTCAGAGAATAAAACTGCTCagaATGAAGCAACTCGTCGTGACAGTTCACCTGAGGAAGCGTCTCTAGATATGAGAATACGCCGGTCGGATGGAGCTCATCATAATAACGAGTGGCAGCTCGATGAAGCGTTAAAAAGAACTAGAAACGCGACGATGTTGACGGTCCTGAGTGATGGTGATCAAAATTCCGACTCGGagaaaaagaagaaagaaGAATTGAGATCTCAAATTCAAGATAAACCAAAAGCACCAGATGTGCGAAAACGTGGTCGGCCACGTAAAACAGCTAAAGAAACGGTCAAGAGTCCGAGAAGTCATAGAATGTCGTCAGAGGAATCTAAAGCCAGCTCTAAACGCAGTAAACCTCGTAGTGTAAGCAATCCTAAGAAAAAAACTACAGCAATACCGCTGCCAAGGGTTCCACTTGATAACAGCGACGAGTTGAGTGACTCAAGATCACGTGACAGATCGAGCGACTCGGAGTGCGAGCAGTCGAGAATATCTCCAGCCTCGACAGTGATCCCAGTTGACAAACGAAGATCACGGTTAAGTATCTCTTCGAGTGAAGACGAGAAAACGACGAATAAGCACAGCGCTTCGGAAGACGACGCGCGGTGGAGAAGACTATCGTCAAAGCGCAATAAGCTAACAGATTCTCCAACTAAAAAGCAAGACAAGAAGAAGAGCCCCAACAAAGTTAAACCTCAACGTCCCAGGTCGAGGATAACAAATAACTCGGGCTGTGCTTCTGATTCCGACAGTGAATTAGAGACAACGTTGAGGAACAATCGCGTACAGCCAGTCGCGAGAGTACCACCGAGACCCAGAGCCCCGCTTACAAGAGTTACCTCACCTGACAATTCTGACAGCGACAACAGTCCAACGCCAAAACTACAAGAAGAAGACGCCGGCAATGTGCAGGATAAGAAAAAGAATGATACACTTAGGAAACTATTTTCAAGTGCCAAGGGCGGTGCGAAAGGTGGCGGTAAGGGCGGCAAAGGTGGCAAAGGCGGTGGCAAATGTGGAATTTACGTGGAAGAATACACCGCGAGTACGCCGACGGGCGGTGAAAGTCCATACAAAAGACCGTCGTCACAATCTTCCGCTACAGCAAACTTCCCACCCTTGAAATATGTCAATGGAGTACCCATACTTGTGTGTAAAGTCGATCTTAGCCGACTTTCTCATGTCCCACGGCCGTCGCGAGGTCAAGAATTGAGACAGCGAACGGAATTACCGGACACAAGGCCGGCTTCACGACAATCTGTTAAATCCGAGCGACCGCCTACGCCGGAGGAGGGTGAAATTATCGATACCCCTACACCACCACCGTCCATCGATTTCAGGACACACGTTGATAATCATCACGTCACGACTGATCAAGCTGTGTTATCAAAAAGTAAACGTACTGAGAGTAAGAGTGAATTAGTTGTAACTTTAGCTGGGGCGGATTCGAAAAATCGTGCGATACCTGGTGGCAGTGGATTATCTAGTGGCCCTACGAACTCCAGTGCCGGTGCTAGTGCTAATGTTATTACAGATAATACTGGGGATCGAGTGCCGAAGCGTAAGCGCAACACTAGTTGCAGTTCTGTCTCTAGTTTAAATATGTGTTCAATGGACAGTAAAGTTAAATCCACAAGTGAGCACAAggagaagaaaaaaagaaaacgacATCACACTGACAAAGACTCAAACTCATCCAGATCATCTTCCCGACaacAAAATGACATCCAACCCACAAATCACGAGCGGGAAGAAAGATCTGACGTTAATTTGCTGCCTCCACCGGTGGCACCACCTCAACGCGTCTACTATTCTTACTTCAATCATCAAAATGACGTTTTGGAGGACCAGGATAGGGACCAGAACCAGTACCTGACGGAAGCTAAAAGATTAAAGCACAGTGCAGATGAGGAATGCGAGCTGACGGCCCAGGGTATGCTCTACTTGGAGGCAGTATTGTATTTCCTACTTACAGGCCACGCCATGGAGTCAGATCCAGTAACTGAAAGATCCTCTTTTACCATGTACAGAGACACACTTAGTCTTATAAAATACATCTCATCTAAGTTCAAGAGCCAACAGAACAATTCACCTGAGAGCAGTATCCATAACAAGTTGGCTATCTTGAGTTTATGGTGCCAgtctcttatttatttaaaactctaCAAAATGCGGCAACACGAAGTCAAGGAAAACGCTAAAATTCTTGCAGAGTACACGCAAAAAccTGCACAGCCGACCCTCGTCCATGCTGAAGGCCAAGGAACTCCATCGTTGTCTCCAACACCATCGCCGGCCGGTTCGGTTGGTTCCGTTGGTAGCCAGAGTTCGGGATACAGCAGCGGTGAGCTCGCGAATCGCGGTAACGTCGTCTCAGGACAGCCATCAGCTTCAACTTTTGTCAGCGTTCCGCTTGCAATTCACTCGATGCTGGTAAAGCAGAATCAACATTTCTCATTATTAACTAATTGTCACGATTTGTGGGATCAGGCGACCCTATTAGTTACGGACAAACATCGAGattttttcattgaattagACGAAAAACTTGGTCCGCTGACATTAAGAAGTTCATTACGGGACTTGGTGCGTTACGTACAAGCTGGAATTAAGAAGTTAAGAGCTCTCTGA
- Protein Sequence
- MCKCDDTCVRGNAISRLPMLVGLLDIDRSVDRDRLRERERQARAAMSVQAEQAAAAGGPENCHSHHNHGHHHHHANSHVSAAASLFRAPVKVNPDAHDRTTQQIQSKLGNYSLVKHLLDEPKRLIGIEGVPASPAPGSSSSTRLSSSTSCRSSPSSQEFKKPGGNGPRTSSSSSSTTGSTSSHTSQRGGFIKPADGKPPYGGRGGYPGQPVKHGGSSNDHRSHGILPAKGPPSSIPGNANSTGNSGGLTSSGNCPPPGNSGNLSRVHAAASRLPRLPLDNGVRHGSDPADLENILKEMTMPPIPLTAIAQTPRKELESKFTFNPVLAKLTEIPPEPIKPPQRERHGTTRLSADLERDLSLSEDSEDEGVKTSISRVSRSTASPSINADHSTPLAPAMTPVPAPLAPMSPRALSPLVTLSPPRPISPLRTTPKHTVSERLLSPSSSSPRKGSPTMPTRPPSPPGQAPLSSGSASSSSDSGSDSGTDSSDDSEDENAPPPPKGPSTPPSVSPKVPVEEPPPAEESKPRWNLSSFFNKTAVQAGDKNSENKTAQNEATRRDSSPEEASLDMRIRRSDGAHHNNEWQLDEALKRTRNATMLTVLSDGDQNSDSEKKKKEELRSQIQDKPKAPDVRKRGRPRKTAKETVKSPRSHRMSSEESKASSKRSKPRSVSNPKKKTTAIPLPRVPLDNSDELSDSRSRDRSSDSECEQSRISPASTVIPVDKRRSRLSISSSEDEKTTNKHSASEDDARWRRLSSKRNKLTDSPTKKQDKKKSPNKVKPQRPRSRITNNSGCASDSDSELETTLRNNRVQPVARVPPRPRAPLTRVTSPDNSDSDNSPTPKLQEEDAGNVQDKKKNDTLRKLFSSAKGGAKGGGKGGKGGKGGGKCGIYVEEYTASTPTGGESPYKRPSSQSSATANFPPLKYVNGVPILVCKVDLSRLSHVPRPSRGQELRQRTELPDTRPASRQSVKSERPPTPEEGEIIDTPTPPPSIDFRTHVDNHHVTTDQAVLSKSKRTESKSELVVTLAGADSKNRAIPGGSGLSSGPTNSSAGASANVITDNTGDRVPKRKRNTSCSSVSSLNMCSMDSKVKSTSEHKEKKKRKRHHTDKDSNSSRSSSRQQNDIQPTNHEREERSDVNLLPPPVAPPQRVYYSYFNHQNDVLEDQDRDQNQYLTEAKRLKHSADEECELTAQGMLYLEAVLYFLLTGHAMESDPVTERSSFTMYRDTLSLIKYISSKFKSQQNNSPESSIHNKLAILSLWCQSLIYLKLYKMRQHEVKENAKILAEYTQKPAQPTLVHAEGQGTPSLSPTPSPAGSVGSVGSQSSGYSSGELANRGNVVSGQPSASTFVSVPLAIHSMLVKQNQHFSLLTNCHDLWDQATLLVTDKHRDFFIELDEKLGPLTLRSSLRDLVRYVQAGIKKLRAL
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01003517;
- 90% Identity
- iTF_01003517;
- 80% Identity
- -