Nrib008376.1
Basic Information
- Insect
- Nasonovia ribisnigri
- Gene Symbol
- lilli
- Assembly
- GCA_027563275.1
- Location
- JANEHQ010000445.1:81177-136908[+]
Transcription Factor Domain
- TF Family
- AF-4
- Domain
- AF-4 domain
- PFAM
- PF05110
- TF Group
- Unclassified Structure
- Description
- This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 8 2.6e-12 8.5e-08 32.7 5.0 4 155 32 179 30 230 0.57 2 8 0.0019 62 3.5 4.4 301 361 266 330 257 378 0.76 3 8 0.0021 69 3.4 28.6 401 514 440 558 397 558 0.55 4 8 1 3.3e+04 -8.3 20.1 106 275 566 736 558 742 0.36 5 8 1 3.3e+04 -8.8 15.4 423 500 740 811 697 818 0.53 6 8 1 3.3e+04 -11.3 17.4 424 472 788 836 748 858 0.39 7 8 0.0028 93 2.9 3.8 146 220 967 1032 948 1074 0.57 8 8 1 3.3e+04 -6.6 2.4 204 219 1341 1356 1322 1365 0.40
Sequence Information
- Coding Sequence
- ATGAACTACCGTCCGAAACCCAATCCTTTCGATGGATTGAGGAACTCTCCGTCTTCGCACAACCGTGGAGGACTCAGCTACATGACTACAATGGAACGGGAAAAACAACGTGAGAAAGAAAGGCAGTCCAGACAGCAACAAGTTTCCGAGAACAATGTCAAAGACAAGAAAGAACTGTTCGGCTCTCCAGTAAAGGTGGACGATGCGAACGACGACGAACAGAGCCAGCTGATCCAGAGTCAGTTGGGTTCGTTCGACGACGCCAAACCGTACTTGGACCTGGTGTTCAGCTGCGGAATCACCGGCATGCCGCCGAGTCCCGCACCTCCTCCGCCGCCGTCATCGTCGCTGTCCGTCGCCGGTATGATGCAACACCACCACTCGTCGGCATCGTCGTATCCCTCCTCGTCGTCCGCAACCGTCTCCGCAGCATCCGCCAAGTCACACCATCACGCCGCCACCGCCGCTGCTGCCGTGATCGGCCAACAACAGTCGTCGTCCAGGAGATCAGTATCGCCCGCGTCCGGCGGTGGATCTGTGGCACCCACTGTCGCGGCGTCCTTTAAGAAACCGACCGTCACGTCGCACGCCATGGCTAGGTCGTCGTCATCNNNNNNNNNNNNNNNNNNNTCCTCTTCCTCGGCGCACTTTCAACCACCCGCCAACCCACCCAGGCCGCTGACGATTGGGCGGCCGGAATGGAGACGCCAACATCCTCTGTTTCGAGTTGGTTTATCGTATATTGGCGGTCGTAAAACGGCACACACAAGGAGCAGTAACTTTTTGAAACCTGGTGAAATGAAGCCACCTTACTCGTCGATGAGCCGATCGACGAGCAGTAGCATGAGAAGTAGCCATTCTTCGCACCACAGGCCTACACACAAGTCCAACCTTGCCATATCAAATGTAAAGAATAGTCATAATTCAAACGATACCAACACGCCCAGCACGATCGAAGACATCCTAAACGAAATGACCGTCGGCCTTCCACTCGTGTCGGACATCGCTGAAACTCCGAGGGTCTCGATCGTCGACTCCAAGTACACCGCAGACGGAAAGGTGGAACCGCAGCAAAGGATGAGTCCCAAAGACGCGTCCGAAAGCCTGAAAAAAATGCTTGGCGAACCAAAACAACTGTTTGCTGATCCTGTGCACAGAAACCAACAGCCGTCGTCGCACGGCGTCTCGCCCGACGCCGCTGCTTCAGTCTCCGCTGCAGCAGCAGCAGCCGCGGCGGCCGTGCCCATGGACGCCAAACCGTCGGGGCCGTACAACCATTCGCTGTTCGCATCGTACATGGTCAAACCGTCGGTCAGAGGGTCTAACATAATGCCGCAACCGCAGCAGACGCAACTGATGCAGGAATCGCAACAGTTGTCGTTATCGCCACCCACTGACAAGATCCGCCGGAGTTCCACGTCGACCAACGTCATATCGTCTGACACAACCAACTACAATTCCACAAAGGCCGACAAAACCAGTAGTGATATGTCTGTGGATGAAGACAGCTCGTCCAGCGAAGAAGGCGAGAAGGACAGTAGTTCGGATAGCAGCTGTAGTGACAGTGAGAGTAACGACGAGAAAAAGGACACCGTCCCACCTGTGGCTGTTTCACCGCCCCCGCAAGAGGAACAGGAAGAAGAGCCCAAGTGGTATCTGAAAAACTTTTTGAAAAAAAGCACGCCCGACACGAACAACACGACCAGGACATCACAACAGGAGGAACCTGTTGGATTGGGCGATCCTAAATCTAGTCCAAGTTTTCCTACGCAGCGATCTGAGGTTGTTGAATTATTATCAGAAATGTCTGACTCAGACTCTAACCACAGTGATAAACAAAAAAGCCAATCTGATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCGAAGACGAAGACGAAAAAAAGAAAAAGCCTCGTGACACTAGTTGTGATAGTGATTCATCCTACAGTAGCCGTTCACATGCATCATCTAAACCAATAACAGACTTACACAAGAAATACTCAGCATTAGCAAAGTGTGAGAAAAAGGATCTCAAGAAATCATTAGTAAAACCTGCTAAATTAGTAGAAAGGAAATCAACTCCAGTGCTGGTAAAAAAAGAGGAACCAGTTGAAGAACCAGTTGCGACAAAGCGAGGTCGCAAGAAGGGTTCGACAAAAACTCCTAAACCTCTTCCGCCATCAAAAATCAAGTCAAAACCTTTGGTTTCATCAGATACAGAGGATGATGAACCACCTAAACCAGTTAAGATAAGTGAAAAACGAAGAGGTCGACCACCAGGTAGTAAAAAGTTAAAAGTGGTATCAGCTAGCAGTGAATCGGAAGAATGGACTGATAGTAGCCGAAAGAGTCGAAAAAAAGATTCCGAAAAGAAGTCTTCACATTCAGTAAAGAAAACACCGTCCTGGGAAAAAAATCAACCTCAAAAACCTGTAGCTCGAGTCAGTGCTGCTTTGCGTCCACCACGTTCAATTTCGAGCAGTGCAGATTCAGACTCTTCTGATGTAGAATGTAGGCCGCCACCCATGTCAACAGCAACTGAAAGTCCACCCAAACTTGATGTAGAGGGAATCAAAGTGCAAGATAAGAAAAAGAATGATACTTTAAGAAAACTGTTTATAACACGACGTGAAGATGGTGGGGCTAAGAGTGGTGGAAAATCCAAAGGTGGAAAAGGTGGCAAAGGTGGAGTTATTATAATTGATAACAATGAAGCAATGCGCAATGACAACGAACGTGTTATATCTCCTGTACCAGTCATTCCACTCATGCCTAAAGAACCAGAAGGCCTAAAGACTTTAAAACTATCTGTGATGTGTAAAATACCATTAAGTAAGTTGCCACCTAATTTAAATTACTTGGTGAAACCGGCAAGATCTGAAGAGTTGAGAACTTGTGCTGATTTGTCAAATACAAGGCAAGATGAAAAAAAACAAAAGCATAGACATCATAAACATCATCACAAAGGCAGTCCATCACAAACAATTTCAGGATCAGAAAAATCTAAATCCAATGAATCTCCAGCGAACACTGCTCCGCCTACTCAGCCTCTTCTCCAACAGCCTTCTTCTGATGGCTATTCCAAAGGCATGATGGATAATGTTAATGTATGGCGTAAGCCAACATCTGTTTGTGCAGTAATACCAGAAGCAAAAGTTAAACCTCTTGAGCATAGTACCATGTCATACTTGCAACGCCCAGCTGCCACCAGTGGTTTGATTCACGGCCCTTACGAAGAATCATCTGAAGATGAAGATGGTGGACCACCTCCTTTCCTTCCTTATAGTACCACCTCTGTTCAAATAATGGACCCTCGTTATAAACGTTCACTGGAACTAGACCAATACTCTCCTAATGCGTCAAAAAGACGAAAATTTCACAATCCAACTGGGTCAGCAACACTAGGAAGATATGCTACAGGAATAGGTGACGGATTAATAAATGATATATTGGTGGATCGTATGCATGAAGTTCCTGTACAACCTCCTCCCAGACAACCATCATGGCGCATGCAATCCATGCACCACCAGTATCCTCCGAGGAAAAGATTCTTCTCATACTTTGTAACTGACTTGTACCCTGAAAACAATATGCACCAAGAGGTCCCGTTGAAGGAGGCACAAGCATTGACCAAATTAGCTGAGTATGAGCCTGATCCCATTACACAAGAAATGAAATATTTGGACGGTATACTGTGTTTTGTACTTAGCGGACATTTGATGGAGAATGATGGGACTAGGGAAAGAGCAGTGCTCAAGATTTATAATGACACTGTTGACTTAATTAAAGTTATTTGGTCAAAAATATACAACTACCGTGCAGACTGTGATCACGAGGAGCTAGACGAAATATTTGAAATGGCTGATAATCCAGAAAGGGATAATAGGCTGTTAATACTTTGGATGAGGTGTCTTAGTTTCTTACGATTGAAATTGTTTAAACTACTTGTATACCAAAATCGACAGAATTTCAAGACTGTTCAACAACATTTCCTGAAGAATGTGGGTTCAAGTCCCATTTCACCATCTCCATCACCAGCCAGTTCTGTGGAGAGTCACTCATCAGGATACTGTAGTAGTAGCATCACGCCCAGTGGCGGAGCTGCACCCGGGGCTGCGACAGGTGTGATCGGTGTTCCAATAGTTGTGCATAATGCTATTCATTCACAGCACATATTATATTGTCATTTGGCAGCGGCACACGAAATGTGGCAACGTGCTGACCTTTTAGTTATGCGCGGGAAACATACACAATTCTTTGTTGAGATGGACCGACATTGTGGACCTCTCACACTGCATAGTACCGGCCATGATCTCACACTGTACGCTCGCATTGCCATTTCTCGCATGAGATGTGAGTTTAACATAAAAAATTACCTTCCATGA
- Protein Sequence
- MNYRPKPNPFDGLRNSPSSHNRGGLSYMTTMEREKQREKERQSRQQQVSENNVKDKKELFGSPVKVDDANDDEQSQLIQSQLGSFDDAKPYLDLVFSCGITGMPPSPAPPPPPSSSLSVAGMMQHHHSSASSYPSSSSATVSAASAKSHHHAATAAAAVIGQQQSSSRRSVSPASGGGSVAPTVAASFKKPTVTSHAMARSSSSXXXXXXSSSSAHFQPPANPPRPLTIGRPEWRRQHPLFRVGLSYIGGRKTAHTRSSNFLKPGEMKPPYSSMSRSTSSSMRSSHSSHHRPTHKSNLAISNVKNSHNSNDTNTPSTIEDILNEMTVGLPLVSDIAETPRVSIVDSKYTADGKVEPQQRMSPKDASESLKKMLGEPKQLFADPVHRNQQPSSHGVSPDAAASVSAAAAAAAAAVPMDAKPSGPYNHSLFASYMVKPSVRGSNIMPQPQQTQLMQESQQLSLSPPTDKIRRSSTSTNVISSDTTNYNSTKADKTSSDMSVDEDSSSSEEGEKDSSSDSSCSDSESNDEKKDTVPPVAVSPPPQEEQEEEPKWYLKNFLKKSTPDTNNTTRTSQQEEPVGLGDPKSSPSFPTQRSEVVELLSEMSDSDSNHSDKQKSQSDXXXXXXXXXXXXEDEDEKKKKPRDTSCDSDSSYSSRSHASSKPITDLHKKYSALAKCEKKDLKKSLVKPAKLVERKSTPVLVKKEEPVEEPVATKRGRKKGSTKTPKPLPPSKIKSKPLVSSDTEDDEPPKPVKISEKRRGRPPGSKKLKVVSASSESEEWTDSSRKSRKKDSEKKSSHSVKKTPSWEKNQPQKPVARVSAALRPPRSISSSADSDSSDVECRPPPMSTATESPPKLDVEGIKVQDKKKNDTLRKLFITRREDGGAKSGGKSKGGKGGKGGVIIIDNNEAMRNDNERVISPVPVIPLMPKEPEGLKTLKLSVMCKIPLSKLPPNLNYLVKPARSEELRTCADLSNTRQDEKKQKHRHHKHHHKGSPSQTISGSEKSKSNESPANTAPPTQPLLQQPSSDGYSKGMMDNVNVWRKPTSVCAVIPEAKVKPLEHSTMSYLQRPAATSGLIHGPYEESSEDEDGGPPPFLPYSTTSVQIMDPRYKRSLELDQYSPNASKRRKFHNPTGSATLGRYATGIGDGLINDILVDRMHEVPVQPPPRQPSWRMQSMHHQYPPRKRFFSYFVTDLYPENNMHQEVPLKEAQALTKLAEYEPDPITQEMKYLDGILCFVLSGHLMENDGTRERAVLKIYNDTVDLIKVIWSKIYNYRADCDHEELDEIFEMADNPERDNRLLILWMRCLSFLRLKLFKLLVYQNRQNFKTVQQHFLKNVGSSPISPSPSPASSVESHSSGYCSSSITPSGGAAPGAATGVIGVPIVVHNAIHSQHILYCHLAAAHEMWQRADLLVMRGKHTQFFVEMDRHCGPLTLHSTGHDLTLYARIAISRMRCEFNIKNYLP
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01346444;
- 90% Identity
- iTF_00973037;
- 80% Identity
- -