Basic Information

Gene Symbol
lilli
Assembly
GCA_027563275.1
Location
JANEHQ010000445.1:81177-136908[+]

Transcription Factor Domain

TF Family
AF-4
Domain
AF-4 domain
PFAM
PF05110
TF Group
Unclassified Structure
Description
This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 8 2.6e-12 8.5e-08 32.7 5.0 4 155 32 179 30 230 0.57
2 8 0.0019 62 3.5 4.4 301 361 266 330 257 378 0.76
3 8 0.0021 69 3.4 28.6 401 514 440 558 397 558 0.55
4 8 1 3.3e+04 -8.3 20.1 106 275 566 736 558 742 0.36
5 8 1 3.3e+04 -8.8 15.4 423 500 740 811 697 818 0.53
6 8 1 3.3e+04 -11.3 17.4 424 472 788 836 748 858 0.39
7 8 0.0028 93 2.9 3.8 146 220 967 1032 948 1074 0.57
8 8 1 3.3e+04 -6.6 2.4 204 219 1341 1356 1322 1365 0.40

Sequence Information

Coding Sequence
ATGAACTACCGTCCGAAACCCAATCCTTTCGATGGATTGAGGAACTCTCCGTCTTCGCACAACCGTGGAGGACTCAGCTACATGACTACAATGGAACGGGAAAAACAACGTGAGAAAGAAAGGCAGTCCAGACAGCAACAAGTTTCCGAGAACAATGTCAAAGACAAGAAAGAACTGTTCGGCTCTCCAGTAAAGGTGGACGATGCGAACGACGACGAACAGAGCCAGCTGATCCAGAGTCAGTTGGGTTCGTTCGACGACGCCAAACCGTACTTGGACCTGGTGTTCAGCTGCGGAATCACCGGCATGCCGCCGAGTCCCGCACCTCCTCCGCCGCCGTCATCGTCGCTGTCCGTCGCCGGTATGATGCAACACCACCACTCGTCGGCATCGTCGTATCCCTCCTCGTCGTCCGCAACCGTCTCCGCAGCATCCGCCAAGTCACACCATCACGCCGCCACCGCCGCTGCTGCCGTGATCGGCCAACAACAGTCGTCGTCCAGGAGATCAGTATCGCCCGCGTCCGGCGGTGGATCTGTGGCACCCACTGTCGCGGCGTCCTTTAAGAAACCGACCGTCACGTCGCACGCCATGGCTAGGTCGTCGTCATCNNNNNNNNNNNNNNNNNNNTCCTCTTCCTCGGCGCACTTTCAACCACCCGCCAACCCACCCAGGCCGCTGACGATTGGGCGGCCGGAATGGAGACGCCAACATCCTCTGTTTCGAGTTGGTTTATCGTATATTGGCGGTCGTAAAACGGCACACACAAGGAGCAGTAACTTTTTGAAACCTGGTGAAATGAAGCCACCTTACTCGTCGATGAGCCGATCGACGAGCAGTAGCATGAGAAGTAGCCATTCTTCGCACCACAGGCCTACACACAAGTCCAACCTTGCCATATCAAATGTAAAGAATAGTCATAATTCAAACGATACCAACACGCCCAGCACGATCGAAGACATCCTAAACGAAATGACCGTCGGCCTTCCACTCGTGTCGGACATCGCTGAAACTCCGAGGGTCTCGATCGTCGACTCCAAGTACACCGCAGACGGAAAGGTGGAACCGCAGCAAAGGATGAGTCCCAAAGACGCGTCCGAAAGCCTGAAAAAAATGCTTGGCGAACCAAAACAACTGTTTGCTGATCCTGTGCACAGAAACCAACAGCCGTCGTCGCACGGCGTCTCGCCCGACGCCGCTGCTTCAGTCTCCGCTGCAGCAGCAGCAGCCGCGGCGGCCGTGCCCATGGACGCCAAACCGTCGGGGCCGTACAACCATTCGCTGTTCGCATCGTACATGGTCAAACCGTCGGTCAGAGGGTCTAACATAATGCCGCAACCGCAGCAGACGCAACTGATGCAGGAATCGCAACAGTTGTCGTTATCGCCACCCACTGACAAGATCCGCCGGAGTTCCACGTCGACCAACGTCATATCGTCTGACACAACCAACTACAATTCCACAAAGGCCGACAAAACCAGTAGTGATATGTCTGTGGATGAAGACAGCTCGTCCAGCGAAGAAGGCGAGAAGGACAGTAGTTCGGATAGCAGCTGTAGTGACAGTGAGAGTAACGACGAGAAAAAGGACACCGTCCCACCTGTGGCTGTTTCACCGCCCCCGCAAGAGGAACAGGAAGAAGAGCCCAAGTGGTATCTGAAAAACTTTTTGAAAAAAAGCACGCCCGACACGAACAACACGACCAGGACATCACAACAGGAGGAACCTGTTGGATTGGGCGATCCTAAATCTAGTCCAAGTTTTCCTACGCAGCGATCTGAGGTTGTTGAATTATTATCAGAAATGTCTGACTCAGACTCTAACCACAGTGATAAACAAAAAAGCCAATCTGATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCGAAGACGAAGACGAAAAAAAGAAAAAGCCTCGTGACACTAGTTGTGATAGTGATTCATCCTACAGTAGCCGTTCACATGCATCATCTAAACCAATAACAGACTTACACAAGAAATACTCAGCATTAGCAAAGTGTGAGAAAAAGGATCTCAAGAAATCATTAGTAAAACCTGCTAAATTAGTAGAAAGGAAATCAACTCCAGTGCTGGTAAAAAAAGAGGAACCAGTTGAAGAACCAGTTGCGACAAAGCGAGGTCGCAAGAAGGGTTCGACAAAAACTCCTAAACCTCTTCCGCCATCAAAAATCAAGTCAAAACCTTTGGTTTCATCAGATACAGAGGATGATGAACCACCTAAACCAGTTAAGATAAGTGAAAAACGAAGAGGTCGACCACCAGGTAGTAAAAAGTTAAAAGTGGTATCAGCTAGCAGTGAATCGGAAGAATGGACTGATAGTAGCCGAAAGAGTCGAAAAAAAGATTCCGAAAAGAAGTCTTCACATTCAGTAAAGAAAACACCGTCCTGGGAAAAAAATCAACCTCAAAAACCTGTAGCTCGAGTCAGTGCTGCTTTGCGTCCACCACGTTCAATTTCGAGCAGTGCAGATTCAGACTCTTCTGATGTAGAATGTAGGCCGCCACCCATGTCAACAGCAACTGAAAGTCCACCCAAACTTGATGTAGAGGGAATCAAAGTGCAAGATAAGAAAAAGAATGATACTTTAAGAAAACTGTTTATAACACGACGTGAAGATGGTGGGGCTAAGAGTGGTGGAAAATCCAAAGGTGGAAAAGGTGGCAAAGGTGGAGTTATTATAATTGATAACAATGAAGCAATGCGCAATGACAACGAACGTGTTATATCTCCTGTACCAGTCATTCCACTCATGCCTAAAGAACCAGAAGGCCTAAAGACTTTAAAACTATCTGTGATGTGTAAAATACCATTAAGTAAGTTGCCACCTAATTTAAATTACTTGGTGAAACCGGCAAGATCTGAAGAGTTGAGAACTTGTGCTGATTTGTCAAATACAAGGCAAGATGAAAAAAAACAAAAGCATAGACATCATAAACATCATCACAAAGGCAGTCCATCACAAACAATTTCAGGATCAGAAAAATCTAAATCCAATGAATCTCCAGCGAACACTGCTCCGCCTACTCAGCCTCTTCTCCAACAGCCTTCTTCTGATGGCTATTCCAAAGGCATGATGGATAATGTTAATGTATGGCGTAAGCCAACATCTGTTTGTGCAGTAATACCAGAAGCAAAAGTTAAACCTCTTGAGCATAGTACCATGTCATACTTGCAACGCCCAGCTGCCACCAGTGGTTTGATTCACGGCCCTTACGAAGAATCATCTGAAGATGAAGATGGTGGACCACCTCCTTTCCTTCCTTATAGTACCACCTCTGTTCAAATAATGGACCCTCGTTATAAACGTTCACTGGAACTAGACCAATACTCTCCTAATGCGTCAAAAAGACGAAAATTTCACAATCCAACTGGGTCAGCAACACTAGGAAGATATGCTACAGGAATAGGTGACGGATTAATAAATGATATATTGGTGGATCGTATGCATGAAGTTCCTGTACAACCTCCTCCCAGACAACCATCATGGCGCATGCAATCCATGCACCACCAGTATCCTCCGAGGAAAAGATTCTTCTCATACTTTGTAACTGACTTGTACCCTGAAAACAATATGCACCAAGAGGTCCCGTTGAAGGAGGCACAAGCATTGACCAAATTAGCTGAGTATGAGCCTGATCCCATTACACAAGAAATGAAATATTTGGACGGTATACTGTGTTTTGTACTTAGCGGACATTTGATGGAGAATGATGGGACTAGGGAAAGAGCAGTGCTCAAGATTTATAATGACACTGTTGACTTAATTAAAGTTATTTGGTCAAAAATATACAACTACCGTGCAGACTGTGATCACGAGGAGCTAGACGAAATATTTGAAATGGCTGATAATCCAGAAAGGGATAATAGGCTGTTAATACTTTGGATGAGGTGTCTTAGTTTCTTACGATTGAAATTGTTTAAACTACTTGTATACCAAAATCGACAGAATTTCAAGACTGTTCAACAACATTTCCTGAAGAATGTGGGTTCAAGTCCCATTTCACCATCTCCATCACCAGCCAGTTCTGTGGAGAGTCACTCATCAGGATACTGTAGTAGTAGCATCACGCCCAGTGGCGGAGCTGCACCCGGGGCTGCGACAGGTGTGATCGGTGTTCCAATAGTTGTGCATAATGCTATTCATTCACAGCACATATTATATTGTCATTTGGCAGCGGCACACGAAATGTGGCAACGTGCTGACCTTTTAGTTATGCGCGGGAAACATACACAATTCTTTGTTGAGATGGACCGACATTGTGGACCTCTCACACTGCATAGTACCGGCCATGATCTCACACTGTACGCTCGCATTGCCATTTCTCGCATGAGATGTGAGTTTAACATAAAAAATTACCTTCCATGA
Protein Sequence
MNYRPKPNPFDGLRNSPSSHNRGGLSYMTTMEREKQREKERQSRQQQVSENNVKDKKELFGSPVKVDDANDDEQSQLIQSQLGSFDDAKPYLDLVFSCGITGMPPSPAPPPPPSSSLSVAGMMQHHHSSASSYPSSSSATVSAASAKSHHHAATAAAAVIGQQQSSSRRSVSPASGGGSVAPTVAASFKKPTVTSHAMARSSSSXXXXXXSSSSAHFQPPANPPRPLTIGRPEWRRQHPLFRVGLSYIGGRKTAHTRSSNFLKPGEMKPPYSSMSRSTSSSMRSSHSSHHRPTHKSNLAISNVKNSHNSNDTNTPSTIEDILNEMTVGLPLVSDIAETPRVSIVDSKYTADGKVEPQQRMSPKDASESLKKMLGEPKQLFADPVHRNQQPSSHGVSPDAAASVSAAAAAAAAAVPMDAKPSGPYNHSLFASYMVKPSVRGSNIMPQPQQTQLMQESQQLSLSPPTDKIRRSSTSTNVISSDTTNYNSTKADKTSSDMSVDEDSSSSEEGEKDSSSDSSCSDSESNDEKKDTVPPVAVSPPPQEEQEEEPKWYLKNFLKKSTPDTNNTTRTSQQEEPVGLGDPKSSPSFPTQRSEVVELLSEMSDSDSNHSDKQKSQSDXXXXXXXXXXXXEDEDEKKKKPRDTSCDSDSSYSSRSHASSKPITDLHKKYSALAKCEKKDLKKSLVKPAKLVERKSTPVLVKKEEPVEEPVATKRGRKKGSTKTPKPLPPSKIKSKPLVSSDTEDDEPPKPVKISEKRRGRPPGSKKLKVVSASSESEEWTDSSRKSRKKDSEKKSSHSVKKTPSWEKNQPQKPVARVSAALRPPRSISSSADSDSSDVECRPPPMSTATESPPKLDVEGIKVQDKKKNDTLRKLFITRREDGGAKSGGKSKGGKGGKGGVIIIDNNEAMRNDNERVISPVPVIPLMPKEPEGLKTLKLSVMCKIPLSKLPPNLNYLVKPARSEELRTCADLSNTRQDEKKQKHRHHKHHHKGSPSQTISGSEKSKSNESPANTAPPTQPLLQQPSSDGYSKGMMDNVNVWRKPTSVCAVIPEAKVKPLEHSTMSYLQRPAATSGLIHGPYEESSEDEDGGPPPFLPYSTTSVQIMDPRYKRSLELDQYSPNASKRRKFHNPTGSATLGRYATGIGDGLINDILVDRMHEVPVQPPPRQPSWRMQSMHHQYPPRKRFFSYFVTDLYPENNMHQEVPLKEAQALTKLAEYEPDPITQEMKYLDGILCFVLSGHLMENDGTRERAVLKIYNDTVDLIKVIWSKIYNYRADCDHEELDEIFEMADNPERDNRLLILWMRCLSFLRLKLFKLLVYQNRQNFKTVQQHFLKNVGSSPISPSPSPASSVESHSSGYCSSSITPSGGAAPGAATGVIGVPIVVHNAIHSQHILYCHLAAAHEMWQRADLLVMRGKHTQFFVEMDRHCGPLTLHSTGHDLTLYARIAISRMRCEFNIKNYLP

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01346444;
90% Identity
iTF_00973037;
80% Identity
-