Basic Information

Insect
Euura lappo
Gene Symbol
-
Assembly
GCA_018257835.1
Location
JAEUYN010002256.1:205879-212986[+]

Transcription Factor Domain

TF Family
TSC22
Domain
TSC22 domain
PFAM
PF01166
TF Group
Basic Domians group
Description
These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Drosophila protein bunched [1] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 47 0.39 4e+03 -1.2 0.1 10 44 49 82 44 88 0.71
2 47 0.26 2.7e+03 -0.6 0.8 13 35 102 124 97 127 0.85
3 47 0.23 2.4e+03 -0.5 0.1 14 45 138 166 134 181 0.56
4 47 0.017 1.8e+02 3.2 0.9 14 33 221 240 214 246 0.62
5 47 0.079 8.2e+02 1.0 0.8 16 34 265 283 263 292 0.81
6 47 0.0021 22 6.1 1.6 17 44 434 461 421 472 0.82
7 47 0.035 3.6e+02 2.2 0.7 21 44 469 492 463 502 0.75
8 47 0.011 1.1e+02 3.8 2.0 14 51 483 518 478 521 0.90
9 47 0.042 4.3e+02 1.9 2.0 15 51 505 541 499 547 0.87
10 47 1.8 1.9e+04 -3.4 5.0 28 43 553 568 522 573 0.75
11 47 0.0012 12 6.9 3.6 15 44 575 604 571 615 0.90
12 47 0.018 1.9e+02 3.1 4.8 15 42 624 651 606 653 0.66
13 47 0.0086 89 4.1 5.8 13 34 650 671 646 686 0.76
14 47 0.00023 2.4 9.1 4.6 11 53 690 732 687 736 0.90
15 47 1.5 1.6e+04 -3.1 11.9 13 31 769 787 732 812 0.64
16 47 0.016 1.7e+02 3.3 0.3 18 44 795 821 790 824 0.83
17 47 0.024 2.5e+02 2.7 1.5 19 39 817 837 809 854 0.77
18 47 0.16 1.6e+03 0.1 0.4 20 44 853 877 844 882 0.66
19 47 0.028 2.9e+02 2.5 0.6 14 36 868 890 863 890 0.89
20 47 0.00015 1.6 9.7 2.7 12 45 887 918 886 924 0.89
21 47 1.4 1.4e+04 -3.0 11.2 16 44 926 954 911 969 0.77
22 47 3.5e-05 0.36 11.8 3.2 11 37 970 996 968 1012 0.88
23 47 0.0017 17 6.4 2.8 11 43 1019 1051 1013 1056 0.70
24 47 0.0023 24 5.9 5.0 13 41 1056 1084 1047 1087 0.66
25 47 0.00076 7.9 7.5 4.2 13 50 1084 1119 1082 1125 0.85
26 47 0.012 1.2e+02 3.7 1.6 12 42 1125 1155 1119 1156 0.92
27 47 3.9e-05 0.41 11.6 2.6 14 43 1155 1184 1153 1190 0.91
28 47 0.00095 9.9 7.2 2.9 13 44 1189 1220 1187 1222 0.92
29 47 0.00011 1.1 10.2 1.8 11 43 1222 1254 1219 1255 0.94
30 47 6.1e-05 0.63 11.0 3.0 15 43 1254 1282 1252 1291 0.89
31 47 0.0013 13 6.8 1.4 11 43 1292 1324 1290 1327 0.87
32 47 5.6e-05 0.58 11.1 1.9 11 42 1327 1358 1322 1359 0.94
33 47 0.0001 1.1 10.3 1.6 15 43 1359 1387 1357 1390 0.93
34 47 0.00058 6 7.9 3.2 13 49 1385 1419 1384 1425 0.86
35 47 0.0044 45 5.1 1.1 11 43 1425 1457 1423 1465 0.80
36 47 0.0063 65 4.6 1.5 18 42 1467 1491 1457 1496 0.80
37 47 4.8e-05 0.49 11.3 2.4 13 43 1504 1534 1502 1538 0.92
38 47 0.0013 14 6.7 2.4 11 44 1537 1570 1535 1579 0.88
39 47 0.0022 23 6.0 3.8 15 43 1569 1597 1566 1606 0.87
40 47 0.0045 47 5.0 2.9 15 42 1590 1617 1584 1619 0.86
41 47 0.00041 4.3 8.3 1.5 15 43 1618 1646 1615 1652 0.91
42 47 0.00017 1.7 9.6 0.8 11 42 1649 1680 1646 1683 0.91
43 47 0.0018 19 6.3 0.9 15 44 1681 1710 1678 1718 0.88
44 47 0.011 1.1e+02 3.8 0.4 10 42 1718 1750 1713 1754 0.71
45 47 0.0054 56 4.8 0.5 11 42 1754 1785 1752 1789 0.89
46 47 2.3e-05 0.24 12.4 0.6 17 55 1788 1826 1783 1828 0.93
47 47 0.59 6.1e+03 -1.8 0.1 17 37 2037 2057 2034 2072 0.87

Sequence Information

Coding Sequence
ATGGCGGGTAGAACGTGCAGATGCGGTTGCGCGGATCCGGTCGAGATGTCGGTTCCGGATCCTCCGCACGAGGGTTCCTGCGGTTGCGGGTATAACCCCTTCGTCGACGACCGTTCCGACGAGACTGAGATTAAGGATTTGTCCTACGCGGTGCGGAAGCTGAGTCTGATGAAATGCCAGATGAAAAAATGGCGCATGGAGCGACTCCAGTTGGAGAGCGAGGCGCGGGCTTTGAAGCAGGTGCTGCAAGCTCACGGGCTCAACGATGACATCGTCAGGCCTGATCCCCTGTTGACACACTTCAGGGAGGAGAACGAGAGGCTTGAAAATGAGAAAAGCGAGCTCGCGGAGAGGGTGAAATTCCTTGGAGATGAGCTCGCCGAGCGGGAGAGGAGCGAGTATCCAATGTGCGAGGCTGTTGACAGAATGAGGGACAGGATCAGAGCCATGCGGGAGGAGCACCAGGCCGAAAAACGGAGACTCAGAGGTATAATTTCCGACCTTAAAATCCGCCTCCAAGAAGCGGAGTCCGAGTCTTCCTGCGCCGCGATAAACAGGCTGCGAGCGAAGCTCAGGGAGTTGATGAAGGGGGGAAAGGAGGCGGACCACCGAGTGTCCTTCGTGGTCCAGCGCTCGATTGAAACGCTGGTCGAGCTTACGGACAACGTTGACGAGCTCAAGGCCGAGATCGAGAGACTCAGAGCTGAGATAAAGAGGTTGAAAGACCTCCTCAAAGCCTGCGAAGATCGTCCGGTTCCGGAGGTCGGGGAGAAACCCCCCGCCGAAATCGACGTGTCGGAGTTGTTGAACAGGATAAAAGAGCTCGAAGAGCTCATAGCTCAGTTGCGAAAACAGCTGATCGATAAAGATGCGCTGATTAATTCCCTACAGAATGAAACCTTCGAGCTAGGATCTGATAATAAACGATTGTCCACGGATCTTGATCAGTTCAAGGTCAGCTATAACGCTCTCATGCAGGAAGTCAAGGCCATGAAGGAGGAGCTCAAGAAAAGGGACGCGAAGGTTTCCGAGCTTTTGAGAGACCTTCGAGCATCGGCAATCGACATGCTGGGCTTGAACCGTCTGCAGAGTGAGATGGAAGCGATAAAGCCACAGATGTATAATCTTGAGATAGAGCGCGATCAGCTCTTATCGGAGCTCGGTAAAGTCCGCGGTGTCGTTTCGGAACGAAATGACCAGATAATAAAGATCCTCGAAGAGCGAGACAAGCACGTTCGAGAACTGAAAAAAGCCTCCAGTCTGATGCAGGCGACGATAGAGCCCATGAAGGAGAAGGAGAGTGTGTTGAAAGGCGAGATTGACGAGTGCAAAGACCGTATAGCGGAGCTCGAACGTGAGCTTGCAGAGCTCAGGAAGAAACTGGCAGAGCTAGAAGCTCAGAATTCCAAGATACCTGTCCTTGAAGCGAGGATAAAAGAGCTGGAAGACGAGCTAGAAAAGCTCCGAGCTCAGTTGGCAGCTGCGAACGCCAGGATGACTGAGCTGGAGAAAGAAATTGCCGGTTTAAAAGCGGCCAAAGAGGAGCTAGAGAAAGAGCTCGGAGCGGCTAAAAACGAAATAGAAAAGCTCAGGGCGGAGCTCGCGAGTGAGAAAGCCGAGAAAGAAGCTGCGCTTAAAGACCTTTCGAACTGTAGAGCAGAGAACGAGAAGCTAAAAGCGGAGAACGAGAGACTTAAGAATGAGCTAGATGAGGCTAAAGCTGAGACTGACAAGCTCAGGAAAAGTTTAGACGAACTTAAAAAAGAACTAGAAGCTTTGAAAGCTGAGAATGACGATTTGAAGAATCTACTCACGGGGGCTAAAAGTGAGATCGAAAAGCTCAGAGGCGAGCTTGAGAAGCTGAAAGCTGAGAATGACGAGCTTAAAAAAGAACTTGAAAAGCTCAGCGCGGAAGTAGACAAGCTGAAAGCAGATAACAACAACCTCAGATCAGAGCTTGAAAAACTCAAAGACAAAATAAACGAGCTCCAAGCTGAATTGGAAAAGCTTAAGAGCGCTTTGGTGGCGGCTAATGATGAGGTAGAGAAGCTCAAAAATGCTCTGACTGCTGCTAACGCAGAAATTGAGAAACTGAAAAACGATCTAAACCAAATGAAAGCCGATCTTGATAATCTGAAATCTGAAAACGATCGTTTGAAAAATGAGCTAGCAGCTGAAAAGGCGGAGAACTTGAAGCTGAAAAACGAGCTGGAAGGGATGAAAGCGGAGGTAGAGAAGCTGAAAGGAGAGAATGAGAAGCTTAAGGCTGAGAAAGACGCTCTGGAAGCAGAGTTGAACAAATTAAAGGAAGAAAATGGAAAACTGAAGTCGGAGCTCGAAAAGCTCAAGAGCGAGAACGATGCGTTGATATCAGAAAATCAGGATTTGAAGAAAAGCCTGAGTGACGCAGAGGCGAAGATCAAAGCTCTCGAAGCTGAGAACGCGGATCTATTGAAAAGTATATCCGAGCTTAAAGACCGGATAAAAAAGCTCGAGGACGAACTCGAGGCAGAAAAAAGCGCGAAAGAAGCGGCGCTGAAGGAGCTCGCGGCGATAAAGTCAGAGCTCAAAGCTCTGTTGGCTGAAATGGACAAACTAAAGGCGGAGCTTGATAGACTGAAGGGAGAAATCGAAGAGCTAAATAAAAAGCTTGACGACGTGAAAAACGAGCTTGATCAGCTGAAATCAAAGTACAATGATTTGGTGGCAGAGAATGAAAAGCTTAAAGCTGAAGTTGAAAGGCTGAAGGTAGAGAACGACGGCCTGAAAGCTGACCTAGAAAAAATGAAAGCAGAGCTCGAAAAGCTCAAAACGGAGAACGAGAAGCTAAGGGATGAGAATGAGAAGCTGAAAAAAGAGCTGAAAGACGCAGAGGAGAAGATGAAAGCTCTTGAAGATCAGGTCAAAGCTTGCAAAGAGGAGAAAGAAAGGCTACGAAAAGAGATCCAGGAGCTCAAAGACCGGATTAATCAGTTGGATAATGAGCTCGCTGCAGAGAAAGCCGCGAAAGACGCTGCTTTGAAGGAGTTGGCAGCCCTGAAGACAGAGCTAGCTGCGTTGAGAGCGGAGCTTGATAAAGTTAGAGCAGAAAATTCCAAGCTCAAAGAAGATTTAGACAAACTAAAAGCAGAGAATGAAGCACTCAAGAACGAGAATAGCGGATTGAAGGAAGAAGTTGAGAAGCTGAAAGCAGAGAATTCGAAGCTACAAGAAGAGCTCGATGTTCTAAAGGCGCAGAACGCGAAACTCAAAGAATCTTTAGATAAGCTAAATGACGAAATTAGAGCTCTGCAAGCTCAGAATGAGAAACTGAAAGCAGAACTCGATCAGCTCAAAGCTGAGAATGAAAAACTGAAAAATGAACTGGCCTCTGCTAGGGACGAAGCTGCGAAGCTCAAAAATGATTTGGACAGGTTGAAATCCGAGCTCGAAGCCCTTGTGGCAGAGAATAATAGATTGAAAGGCGAAGTTGAGAGGCTTAAAGCGGAGAATGCCAAGTTACAAGAAGACCTTAATGCCCTGAAGTCCGAAAACGCGAAGCTCAAAGAATCCTTGGACAAATTAAACGACGAAATGGACGCGCTCCGAGCTCAGAATGAAAAACTAAAAGCAGAGCTTGATCAGCTCAAAGCTGAGAACGAGAAGCTGAAGAATGAATTGGCCGCTGCTAGGGAAGAAGTTGCGAAGCTCAAAAGCGATTCGGACAGGTTGAAAAATGAGCTCGAAGCTCTTGCAGCAGAGAATAATAGATTGAAGGGAGAAGTTGAGAAGCTCAAAGCGGAGAATGCGAAGCTGCAAGACAATCTCAATGCTCTGAAGAACGAAAATGCGAAGCTTAAAGAAGCCCTTGATAAGCTCAGCGATGAAATAAAAGCTCTGCGCGCTCAGAACGACAAGCTCAAAGGAGAACTAGACCAGCTCAAAAAAGAACTTGACCAGTTGAAAGCTGAGAATGAGAAGTTGAAGAATGATTTGGCGGCTGCTAGAGAAGAAGTAGCAAAGCTCAAAAATGATTTAGAGAAAATGAAATCTGAGCTTGCCGCTCTGGCGGCAGAAAATAATAGATTGAAGGGAGAAGTTGAAAGGCTCAAAGTAGAAAACGCGAAGCTATTAGACGATCTAAACGCTTTGAGGTCGGAAAATGCGAAGCTCAAAGAAACCCTGGACAAATTGAACGACGAAATTAAAGCACTGCGAGCTCAGAATGAGAAGCTAAAAGGAGAACTTGATCAGCTCAGAGCAGAGAATGAAAAGCTGAAAAATGACTTGGCAGCTGCGAGGGTAGAAATAGGGAAGCTCAAAAACGATTTAGACAAGTTGAAATCCGAGCTCGAAGCTCTCGCTGCAGAAAATAATAAAATGAAAGGAGAACTTGAGAGGCTCAAAGCAGAGAACGCACAGCTCAAAGAAGCTCTTGACAAACTAAATTCTGAGATCAAGGCACTCCGAGCAGAGAATGACAAGCTCAAAGGAGATCTTGACAAACTTAAAGGAGAACTTGAAAAGCTCAGAGGAGAATTTGACCAGCTCAAAAAAGAATGTGACGAGCTCAAAAAAGAACTTGACCAACTCAAAGTTGAGAACGAGAAGCTGAAGAATGACTTAGCGGCGGCTAGATCAGAAATAGCGAAGCTCAAAAGCGACCTCGACAAGTTGAAATCTGAGCTCGAAGCCCTTGCAGCAGAAAATAACAAATTAAAGGGTGAACTCGAGAAGCTTAAAGCAGAGAATGCTAAGCTATTAGATGATCTGAATGCCATGAAAGCAGAAAACGCAAAGCTCAAAGAAGCCCTTGACAAACTCAACGAAGAGATGAAAGCTCTTAGAGATGAGAACAATAAGCTCAAGGGAGAACTCGATAAGCTCAAAGGAGAGCTTGACCAGCTCAAAAAAGATCTTGACCAGCTCAAAGCTGAAAATGAGAAGTTGAAGAATGACTTGGCAGCTGCTAAGGCGGAGGTAGAAAAGCTGAAAAATGATTTGGGGAGATTGCAAGCTGAGCTTGAAGCTCTCGCTGCGGAAAATAACAGAATGAAAGGAGAGCTTGACAGGCTTAAAGCGGAGAACGCGAAGCTGCTAGATGACTTGAAAGCGCTTGGGGAAGAAAATGCGAAGCTGAAAGAATCCCTCGATAAGCTCAATGACGAGATGAAAGCTCTGCGAGCTCAGAATGAAAAGCTTAAAGGTGATCTTGACCAGTTGAAAAAGGAACTTGACCAGCTCAAAGCTGACAATGACAAATTGAAGAAAGACTTGGCGGCTGCTAAGGCGGAAGTAGAAAAGCTCAAAAACGATTTGGGGAAATTGAAAGCTGAGCTTGAAGCCCTCGCTGCAGAGACTGACAGAATGAAAGCAGACTTCGAAAAGCTGAAAGCAGAAAATTCAAAGCTTTTAGGAGACCTAAGAGCTCTGAAGGAAGAGAACAGCCGGAGGAAATCTGAGATTACGCCGGAGTCGGTTAAGAAACTACCGAAGGAACGAAAAGCTTACGAGGATAAGCCACCTCTTCTTCGAGCAGCAACGGAAGATGGTCGGAGGGCCCTCAGAGAAGATCGGCCACTTTCGATAACAAAAAGGGACCAAGGCTCGCAGGGCGAGGGCTGCGGGGATTACGAAAACGCTAACGAACAGCTAAGGAGAAACATGAGCATGCAGGACAGGGCCGTGCAGAGGATCCTGAACTTCATGAAGTACCTACTAGGAGAGAGAAGCTCTCCTCCGGAGATGGCCGGGGAACTCACCCACACGATGTCGTCGGTGATGAGGAACAGATTCGCCAAAGACATCATGGAGGTGCTGAAGGAGTCACAGTTCCTTTCGGAAAGCATATTCAATGCTGAAACCGAGGTTCAACGTCTCGCCAGGCTGCTCGATGACATGAACAAAGATACAAAAGACATGGACGGTCTGGGTGACGCCTTCGACGCAGAATCTTGGCTGAAGTCCCTGACGCTGACCGAACTGGCGGAACTCCACGACCGTATTTGCATGGTGACATCCTGCATGGTGCAGCAGGACATAAACCCGGACGACTACGTGGACACCGTAACCGAAACGGACGGCGTGTGCAGGCCCTGCGTCGACGGTGACGACGTGGCCGGTGACTACGAGGCCCTGAGCAGGCGGATAGCCGCTCTGCAGTCAGAGATAAACAGGAAACAGAGCGAGGCCGCCGCCAAGGTCGAGAGCATGAGGAGAGCCATGTTGAGGGAACAGGAGAACCTCATCAGATTGTCGGACGAGATGAACAACCAGAAGCGGAGGAACCTTTCCATGAAGATTTGTCAGGAGGAAGTATGCGACTGCGAAGAGCAGGGTGACCTTGAAAGGTACTCGACGACCCTCGAGACCCAAACTCCCCGTATGATTCCAAGGTACCATTTATCGATTGTTAGAGATGAAGAACCGTGTCCGGGATCAACCGACGAAAGACTGTCGAGATCCAAAGGCCAGCCCCAGCCAGCGTGTTCGATGCCGGTGAAACACGCGGACGTCCCTTGCTGCCCGAGGCCCTGCTGTCCGTCCAAGATGACCTCCAAAAACAGGGACTGA
Protein Sequence
MAGRTCRCGCADPVEMSVPDPPHEGSCGCGYNPFVDDRSDETEIKDLSYAVRKLSLMKCQMKKWRMERLQLESEARALKQVLQAHGLNDDIVRPDPLLTHFREENERLENEKSELAERVKFLGDELAERERSEYPMCEAVDRMRDRIRAMREEHQAEKRRLRGIISDLKIRLQEAESESSCAAINRLRAKLRELMKGGKEADHRVSFVVQRSIETLVELTDNVDELKAEIERLRAEIKRLKDLLKACEDRPVPEVGEKPPAEIDVSELLNRIKELEELIAQLRKQLIDKDALINSLQNETFELGSDNKRLSTDLDQFKVSYNALMQEVKAMKEELKKRDAKVSELLRDLRASAIDMLGLNRLQSEMEAIKPQMYNLEIERDQLLSELGKVRGVVSERNDQIIKILEERDKHVRELKKASSLMQATIEPMKEKESVLKGEIDECKDRIAELERELAELRKKLAELEAQNSKIPVLEARIKELEDELEKLRAQLAAANARMTELEKEIAGLKAAKEELEKELGAAKNEIEKLRAELASEKAEKEAALKDLSNCRAENEKLKAENERLKNELDEAKAETDKLRKSLDELKKELEALKAENDDLKNLLTGAKSEIEKLRGELEKLKAENDELKKELEKLSAEVDKLKADNNNLRSELEKLKDKINELQAELEKLKSALVAANDEVEKLKNALTAANAEIEKLKNDLNQMKADLDNLKSENDRLKNELAAEKAENLKLKNELEGMKAEVEKLKGENEKLKAEKDALEAELNKLKEENGKLKSELEKLKSENDALISENQDLKKSLSDAEAKIKALEAENADLLKSISELKDRIKKLEDELEAEKSAKEAALKELAAIKSELKALLAEMDKLKAELDRLKGEIEELNKKLDDVKNELDQLKSKYNDLVAENEKLKAEVERLKVENDGLKADLEKMKAELEKLKTENEKLRDENEKLKKELKDAEEKMKALEDQVKACKEEKERLRKEIQELKDRINQLDNELAAEKAAKDAALKELAALKTELAALRAELDKVRAENSKLKEDLDKLKAENEALKNENSGLKEEVEKLKAENSKLQEELDVLKAQNAKLKESLDKLNDEIRALQAQNEKLKAELDQLKAENEKLKNELASARDEAAKLKNDLDRLKSELEALVAENNRLKGEVERLKAENAKLQEDLNALKSENAKLKESLDKLNDEMDALRAQNEKLKAELDQLKAENEKLKNELAAAREEVAKLKSDSDRLKNELEALAAENNRLKGEVEKLKAENAKLQDNLNALKNENAKLKEALDKLSDEIKALRAQNDKLKGELDQLKKELDQLKAENEKLKNDLAAAREEVAKLKNDLEKMKSELAALAAENNRLKGEVERLKVENAKLLDDLNALRSENAKLKETLDKLNDEIKALRAQNEKLKGELDQLRAENEKLKNDLAAARVEIGKLKNDLDKLKSELEALAAENNKMKGELERLKAENAQLKEALDKLNSEIKALRAENDKLKGDLDKLKGELEKLRGEFDQLKKECDELKKELDQLKVENEKLKNDLAAARSEIAKLKSDLDKLKSELEALAAENNKLKGELEKLKAENAKLLDDLNAMKAENAKLKEALDKLNEEMKALRDENNKLKGELDKLKGELDQLKKDLDQLKAENEKLKNDLAAAKAEVEKLKNDLGRLQAELEALAAENNRMKGELDRLKAENAKLLDDLKALGEENAKLKESLDKLNDEMKALRAQNEKLKGDLDQLKKELDQLKADNDKLKKDLAAAKAEVEKLKNDLGKLKAELEALAAETDRMKADFEKLKAENSKLLGDLRALKEENSRRKSEITPESVKKLPKERKAYEDKPPLLRAATEDGRRALREDRPLSITKRDQGSQGEGCGDYENANEQLRRNMSMQDRAVQRILNFMKYLLGERSSPPEMAGELTHTMSSVMRNRFAKDIMEVLKESQFLSESIFNAETEVQRLARLLDDMNKDTKDMDGLGDAFDAESWLKSLTLTELAELHDRICMVTSCMVQQDINPDDYVDTVTETDGVCRPCVDGDDVAGDYEALSRRIAALQSEINRKQSEAAAKVESMRRAMLREQENLIRLSDEMNNQKRRNLSMKICQEEVCDCEEQGDLERYSTTLETQTPRMIPRYHLSIVRDEEPCPGSTDERLSRSKGQPQPACSMPVKHADVPCCPRPCCPSKMTSKNRD

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

90% Identity
iTF_00719118;
80% Identity
iTF_00718265;