Basic Information

Gene Symbol
-
Assembly
GCA_018420115.1
Location
JAFFZH010002400.1:8875-15958[-]

Transcription Factor Domain

TF Family
TSC22
Domain
TSC22 domain
PFAM
PF01166
TF Group
Basic Domians group
Description
These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Drosophila protein bunched [1] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 47 0.32 4.4e+03 -1.5 0.1 10 44 49 82 43 88 0.68
2 47 0.17 2.4e+03 -0.6 0.8 13 35 102 124 97 127 0.85
3 47 0.15 2.1e+03 -0.4 0.1 14 45 138 166 134 181 0.56
4 47 0.011 1.6e+02 3.2 0.9 14 33 221 240 214 246 0.62
5 47 0.049 6.7e+02 1.1 0.8 16 34 265 283 262 286 0.81
6 47 0.014 1.9e+02 2.9 0.6 19 43 436 460 428 468 0.64
7 47 0.011 1.5e+02 3.2 0.7 20 44 468 492 461 503 0.83
8 47 0.1 1.4e+03 0.1 6.0 14 51 483 518 481 547 0.76
9 47 0.092 1.3e+03 0.3 1.4 27 43 552 568 543 576 0.75
10 47 0.00066 9.1 7.1 3.8 15 44 575 604 569 611 0.91
11 47 0.0069 94 3.9 6.5 15 39 624 648 605 653 0.49
12 47 0.016 2.2e+02 2.7 7.4 18 43 648 673 645 688 0.59
13 47 0.00016 2.1 9.1 4.6 11 53 690 732 687 736 0.90
14 47 0.1 1.4e+03 0.1 3.6 19 49 733 761 729 769 0.72
15 47 0.076 1e+03 0.5 6.7 15 44 750 779 741 785 0.89
16 47 0.0026 36 5.2 4.3 13 46 769 802 766 813 0.90
17 47 0.022 3e+02 2.2 3.4 17 45 794 822 788 836 0.71
18 47 0.014 1.9e+02 2.9 1.3 19 39 817 837 809 854 0.77
19 47 0.00029 3.9 8.3 1.3 14 44 868 898 863 900 0.91
20 47 0.00031 4.2 8.2 2.2 12 46 887 919 885 926 0.89
21 47 0.031 4.3e+02 1.8 5.5 16 45 926 955 919 969 0.78
22 47 9.6e-05 1.3 9.8 4.2 11 38 970 997 968 1014 0.86
23 47 0.0013 18 6.2 3.3 11 43 1019 1051 1013 1056 0.66
24 47 0.0025 34 5.3 4.5 13 41 1056 1084 1047 1087 0.54
25 47 0.0004 5.4 7.8 3.8 13 49 1084 1118 1081 1124 0.85
26 47 0.00043 5.9 7.7 1.1 11 42 1124 1155 1119 1156 0.94
27 47 3.3e-05 0.45 11.3 3.2 14 44 1155 1185 1153 1190 0.89
28 47 0.013 1.8e+02 3.0 4.8 13 42 1189 1218 1183 1222 0.63
29 47 2.5e-05 0.34 11.7 2.0 11 42 1222 1253 1220 1254 0.94
30 47 2.6e-05 0.36 11.6 3.7 14 49 1253 1283 1251 1291 0.86
31 47 0.00074 10 7.0 1.4 11 43 1292 1324 1290 1328 0.87
32 47 1e-05 0.14 12.9 1.0 11 42 1327 1358 1325 1359 0.94
33 47 2e-05 0.27 12.0 1.7 14 43 1358 1387 1356 1390 0.92
34 47 0.00031 4.3 8.2 2.9 13 49 1385 1419 1384 1426 0.86
35 47 0.0026 36 5.2 1.0 11 43 1425 1457 1423 1465 0.80
36 47 0.0027 37 5.2 1.1 18 41 1467 1490 1454 1496 0.74
37 47 1.8e-05 0.25 12.1 2.0 13 43 1504 1534 1502 1538 0.92
38 47 0.00038 5.2 7.9 0.6 11 43 1537 1569 1535 1572 0.92
39 47 0.0013 17 6.2 2.4 15 43 1569 1597 1567 1606 0.87
40 47 0.0029 40 5.1 2.8 16 42 1591 1617 1585 1619 0.87
41 47 0.00034 4.7 8.0 1.7 15 43 1618 1646 1615 1651 0.91
42 47 0.00017 2.4 9.0 1.6 11 43 1649 1681 1647 1684 0.91
43 47 0.0011 14 6.5 2.4 15 44 1681 1710 1679 1717 0.89
44 47 0.0058 80 4.1 3.5 11 44 1698 1731 1698 1755 0.76
45 47 0.0002 2.7 8.8 0.8 11 43 1754 1786 1752 1789 0.90
46 47 2.4e-05 0.32 11.8 0.9 16 55 1787 1826 1784 1828 0.93
47 47 0.4 5.5e+03 -1.8 0.1 17 37 2023 2043 2020 2058 0.87

Sequence Information

Coding Sequence
ATGGCCGGTAGAACGTGCAGATGCGGCTGCGCGGATCCGGTCGAGATGTCGGTTCCGGATCCTCCGCACGAGGGTTCCTGCGGTTGCGGGTATAACCCTTTCGTCGACGACCGTTCCGACGAGACTGAGATCAAGGATTTGTCCTACGCGGTGCGGAAGCTGAGTCTGATGAAATGCCAGATGAAAAAATGGCGCATGGAGCGACTCCAGCTGGAGAGCGAGGCGCGGTCTTTGAAGCAGGTGCTGCAAGCTCACGGGCTCAACGACGACATCGTCAGGCCTGATCCCCTGTTGACACACTTCAGGGAGGAGAACGAGAGACTTGAAAATGAGAAAAGCGAGCTCGCGGAGAGGGTGAAATTCCTTGGAGATGAGCTCGCCGAGCGGGAGAGGAGCGAGTATCCAATGTGCGAGGCTGTTGACAGAATGAGGGACAGGATCAGAGCCATGAGGGAGGAGCACCAGGCCGAAAAACGGAGACTCAGAGGTATAATTTCCGACCTTAAAATCCGCCTCCAAGAAGCGGAGTCCGAGTCTTCCTGCGCCGCGATAAACAGGCTGCGAGCAAAGCTCAGAGAGTTGATGAAGGGGGGAAAGGAGGCGGACCACCGAGTGTCCGTCGTGGTCCAGCGCTCGATTGAAACGCTGGTCGAGCTTACGGACAACGTTGACGAGCTCAAGGCCGAGATCGAGAGACTCAGAGCTGAGATAAAGAGGCTGAAGGACCTCCTCAAGGCCTGCGAAGATCGTCCGGTTCCGGAGGCCGGGGAGAAGCCCCCCGCCGAAATGGACGTGTCGGAGTTGTTGAACAGGATAAAAGAGCTCGAAGAGCTCATAGCTCAGTTGAGAAAGCAGCTGATCGATAAAGACGCGCTGATTAATTCCCTACAGAATGAAACCTTCGAGCTAGGATCTGATAATAAACGATTGTCCACGGATCTTGATCAGTTCAAGGTCAGCTATAACGCTCTTATGCAGGAAGTCAAGGCCATGAAGGAGGAGCTCAAGAAGAGGGACGCGAAGGTTTCCGAGCTTTTGAGAGACCTTCGAGCATCCGCAATCGACATGCTGGGCTTGAACCGTCTGCAGAGTGAGATGGAAGCGATAAAGCCACAGATGTATAATCTCGAGATAGAGCGCGATCAGCTCTTATCGGAGCTCGGTAAAGTCCGCGGTGTCGTTTCGGAGCGAAATGACCAGATAATAAAGATCCTCGAAGAGCGAGACAAGCACGTTCGAGAACTTAAAAAAGCATCCAGTCTGATGCAGGCGACGATAGAGCCCATGAAGGAGAAGGAGAGTGCGTTGAAAGGCGAGATCAACGGGTACAAAGACCGCATAGCGGAGCTCGAACGTGAGCTTGCAGAGCTCAGGAAGAAACTGGCAGAGCTAGAAGCTCAGAATTCCAAGATACCTGGCCTTGAAGCGAGGATAAAAGAGCTGGAAGACGAGCTAGAAAAGCTCCGAGCTCAGTTGGCAGCTGCGAACGCCAGGATGACTGAGCTGGAGAAAGAGATTGCCGGTTTAAAAGCGGCCAAAGAGGAGCTAGAGAAAGAGCTCGGAGCGGCTAAAAACGAAATAGAAAAGCTCAGGGCGGAGCTTGCGAGTGAGAAAGCCGAGAAAGAAGCTGCGCTTAAAGAGCTTTCGAACTGTAGAGCAGAGAACGAGAAGCTAAAAGCGGAGAACGAGAGACTTAAGAATGAGCTAGATGAGGCTAAAGCTGAGACTGAAAAGCTCAGGAAAAATTTAGACGAACTTAAAAAAGAACTAGAAGCTCTGAAAGCTGAGAATGACGATTTGAAGAATATACTCGCGGGGGCTAAAAGTGAGATCGAAAAACTCAGAGGCGAGCTCGAGAAGCTGAAAGCTGAGAATGACGAGCTTAAAAAAGAACTTGAAAAGCTCCGCGCGGAAGTAGACATGCTCAAAGCAGAGAACAACAACCTCAAATCAGAGCTTGAAAAACTCAAAGACAAAATAAATGAGCTCGAAGCTGAATTGGAAAAGCTTAAGAGCGCTTTGGCGGCGGCCAATGATGAGGTAGAGAAGCTCAAAAATGCTCTGACTGCTGCTAACGCAGAAATTGAGAAACTGAAAAACGATCTAAACCAAATGAAAGCCGATCTTGATAATCTGAAATCTGAAAACGATCGTTTGAAAAATGAGCTAGCAGCTGAAAAGGCGGAAAACTTGAAGCTGAAAAACGAGCTGGAAGGGATGAAAGCGGAACTAGAGAAGCTGAAAGGAGAAAATGAGAAGCTTAAGGCTGAGAAAGACGCTCTGGAAGCAGAGTTGAACAAATTAAAGGAAGAAAATGGAAAACTGAAGTCGGAGCTCGAAAAGCTCAAGAGCGAGAACGATGCGTTGATATCAGAAAATCAGAATTTGAAGAAAAGCCTGAGTGACGCAGAGGCGAAGATCAAAGCTCTAGAAGCTGAGAACGCGGATCTATTGAAAAGTATATCCGAGCTTAAAGACCGGATAAAAAAGCTCGAGGACGAGCTCGAGGCAGAAAAAAGCGCTAAAGAAGCGGCGCTAAAAGAGCTCGCAGCCATAAAGTCAGAGCTCAAAGCTCTGTTGGCTGAAATGGACAAACTAAAGGCGGAGCTTGATAGACTGAAGGGAGAAATCGAAGAGCTCAATAAAAAGCTTGACGATGTGAAAAACGAGCGTGATCAGTTGAAATCAAAGTACAATGATCTGGTGGCAGAGAATGAAAAGCTCAAAGCTGAAGTTGAAAGGCTGAAGGTAGAGAACGACGGCCTGAAAGCTGACCTAGAAAAAATGAAAGCAGAGCTTGAAAAGCTCAAAACGGAGAACGAGAAGCTAAGAGGTGAGAATGAGAAGCTGAAGAAAGAGCTGAAAGACGCAGAGGAGAAGATAAAAGCTCTTGAAGATCAGGTAAAAGCTTGCGAAGAGGAGAAAGAAAGGCTACGAAAAGAGATCCAGGAGCTCAAAGACCGGATTAATCAGCTGGATAATGAGCTCGCTGCAGAGAGAGCCGCGAAAGACGCTGCTTTGAAGGAGTTGGCAGCCCTGAAGACAGAGCTAGCTGCGTTGAGAGCGGAGCTTGATAAAGTTAGAGCAGAAAATTCCAAGCTCAAAGAAGATCTAGACAAACTCAAAGCAGAGAATGAAGCACTTAAGAACGAGAATAGCAGATTGAAGGAAGAAGTTGAGAAGCTGAAAGCAGACAATTCGAAGCTACAAGAAGAGCTCGATGTTCTAAAAGTGCAGAACGCGAAACTCAAAGAATCTTTAGATAAGCTAAATGACGAAATTAGAGCTCTGCAAGCTCAGAATGAGAAACTGAAAGCAGAACTCGATCAGCTCAAAGCTGAGAATGAAAAACTGAAAAATGAACTGGCCGCTGCTAGGGACGAAGCTGCGAAGCTCAAAAATGATTTGGACAAGTTGAAATCCGAGCTCGAAGCCCTTGCGGCAGAGAATAATAGATTGAAAGGCGAAGTTGAGAGGCTTAAAGCGGAGAATGCCAAGTTACAAGAAGACCTTAACGCCCTGAAGTCCGAAAACGCGAAGCTCAAAGAATCCTTGGAAAAATTAAACGACGAAATGAAAGCGCTCCGAGCTCAGAATGAAAAACTAAAAGCAGAGCTTGATCAGCTCAAAGCTGAGAACGAGAAGCTCAAAAATGAATTGGCCGCTGCTAGGGAAGAAGTTGCGAAGCTCAAAAGCGATTTGGACAGGTTGAAAAACGAGCTCGAAGCTCTTGCAGCAGAGAATAATCGACTGAAGGGAGAAGTTGAGAAGCTCAAAGCGGAGAATGCGAAGCTGCAAGACGATCTCAATGCTCTGAAGAATGAAAATGCGAAGCTTAAAGAAACCCTTGATAAGCTCAACGATGAAATAAAAGCTCTGCGCGCTCAGAACGACAAGCTCAAAGGAGAACTAGACCAGCTCAAAAAAGAACTTGACCAGTTGAAAGCCGAGAATGAGAAGTTGAAGAATGATTTGGCGGCTGCTAGAGATGAAGTAGCGAAGCTCAAAAATGATTTAGAGAAATTGAAATCTGAGCTCGCAGCTCTTGCGGCAGAAAATAATAGATTGAAGGGAGAGGTTGAAAAGCTTAAAGCAGAAAACGCGAAGCTATTAGACGATCTAAACGCTTTGAGGTCGGAAAATGCGAAGCTCAAAGAAACCCTGGACAAATTGAACGACGAAATTAAAGCTCTGCGAGCTCAGAATGAGAAGCTAAAAGGAGAACTTGATCAGCTCAGAGCAGAGAATGAAAAGCTGAAAAATGACTTGGCAGCTGCGAGGGTAGAAATAGGGAAGCTCAAAAACGATTTAGACAAGTTGAAATCCGAGCTCGAAGCTCTCGCTGCAGAAAATAATAAAATGAAAGGAGAACTTGAGAGGCTCAAAGCAGAAAACGCACAGCTCAAAGAAGCTCTTGACAAACTAAATTCTGAGATCAAGGCACTCCGAGCAGAGAATGACAAGCTCAAAGGGGATCTTGACAAACTTAAAGGAGAACTTGAAAAGCTCAGAGGAGAATTTGACCAGCTCAAAAAAGAATGTGACGAGCTCAAAAAAGAACTTGACCAACTCAAAGCTGAGAACGAGAAGCTGAAGAATGACTTAGCGGCGGCTAGATCAGAAATAGCGAAGCTCAAAAGCGACCTCGACAAGTTGAAATCTGAGCTCGAAGCCCTCGCAGCAGAAAATAACAAATTAAAGGGCGAACTCGATAGGCTTAAAGCAGAAAATGCTAAGCTATTAGATGATCTGAATGCCATGAAAGCAGAAAACGCAAAGCTCAAAGAAGCCCTTGACAAACTCAACGAAGAGATGAAAGCTCTGAGAGATGAGAACAACAAGCTCAAGGGAGAACTCGATAAGCTCAAAGGAGAGCTTGACCAGCTCAAAAAAGATCTTGACCAGCTCAAAGCTGAAAATGAGAAGTTGAAGAATGACTTGGCAGCTGCTAAGGCGGAGGTAGAAAAGCTAAAAAATGATTTGGGGAGACTGCAAGCTGAGCTTGAAGCTCTCGCTGCGGAAAATAACAGAATGAAAGGAGAGCTTGAGAGGCTTAAAGCGGAGAACGCGAAGCTGCTAGACGACTTGAAAGCACTTAGGGACGAAAACACGAAGCTGAAAGAATCTCTCGATAAGCTCAATGACGAGATGAAAGATCTGCGAGCTCAGAATGAAAAGCTTAAAGGTGATCTTGACCAGTTGAAAAAGGAACTTGACCAGCTCAAAGCTGACAATGACAAGTTGAAGAAAGACTTGGCGGCTGCTAAGGCGGAAGTAGAAAAGCTCAAAAATGATTTGGGGAAATTGAAAGCTGAGCTTGAAGCCCTCGCTGCAGAGAATGACAGAATGAAAGCAGACCTCGAAAAGCTGAAAGCAGAAAATTCGAAGCTTTTAGGAGACCTAAAAGCTCTGAAGGAAGAGAACAGCCGGAGGAAATCTGAGATTACGCCGGAGTCGGTTAAGAAACTACCGAAGGAACGAAAAGATGGTCGGAGGGCATTCGGAGAAGATCGGCCACCTTCGATAACAAAAAGGGACCAAGGCTCGCAGGGCGCGGGCTGTGGGGATTACGAAAACGCTAACGAACAGCTGAGGAGGAACATGAGCATGCAGGACAGGGCCGTGCAGAGGATCCTGAACTTCATGAAGTACCTACTAGGAGAGAGAACCTCTCCTCCGGAGATGGCCGGGGAACTCACCCACACGATGTCGTCGGTGATGAGGAACAGATTCGCCAGAGACATCATGGATGTGCTGAAGGAGTCACAGTTCCTTTCGGAAAGCATATTCAACGCTGAAACCGAGGTTCAACGTCTCGCCAGACTGCTCGACGACATGAACAAAGATACAAGAGACATGGACGGTCTGGGTGACGCCTTCGACGCAGAGTCTTGGCTGAAGTCCCTGACGCTGACCGAACTGGCGGAACTCCACGACCGTATTTGCATGGTGACATCCTGCATGGTGCAGCAGGACATAAACCCGGACGACTACGTGGACACCGTGACCGAAACGGACGGCGTGTGCAGGCCCTGCGTCGACGGTGACGACGTGGCCGGTGACTACGAGGCCCTGAGCAGACGGATAGCCGCTCTGCAGTCGGAGATAAACAGGAAACAGAACGAGGCCGCCGCCAAGGTCGAGAGCATGAGGAGAGCCATGTTGAGGGAACAGGAGAACCTCATCAGATTGTCGGACGAGATGAACAACCAGAAGCGGAGGAACCTTTCCATGAAGATTTGTCAGGAGGAAGTATGCGACTGCGAAGAGGAGGATGACCTTGAAAGGTACTCGACGACCCTCGAGGCTCAAACTCCCCGCATGATTCCAAGCTACCATTTATCGATTGTTAGAGATGAAGAAGTCCCGTGTCCGGGACCAACCGACGAAAGAGTTTCGAGACCGAAAGGCCCCCAGCCAGCGTGTTCGATGCCGGTGAAGCACGCGGACGTCCCTTGCTGCCCGAGGCCCTGCTGTCCGTCCAAGATGACCTCCAAAAACAGGGACTGA
Protein Sequence
MAGRTCRCGCADPVEMSVPDPPHEGSCGCGYNPFVDDRSDETEIKDLSYAVRKLSLMKCQMKKWRMERLQLESEARSLKQVLQAHGLNDDIVRPDPLLTHFREENERLENEKSELAERVKFLGDELAERERSEYPMCEAVDRMRDRIRAMREEHQAEKRRLRGIISDLKIRLQEAESESSCAAINRLRAKLRELMKGGKEADHRVSVVVQRSIETLVELTDNVDELKAEIERLRAEIKRLKDLLKACEDRPVPEAGEKPPAEMDVSELLNRIKELEELIAQLRKQLIDKDALINSLQNETFELGSDNKRLSTDLDQFKVSYNALMQEVKAMKEELKKRDAKVSELLRDLRASAIDMLGLNRLQSEMEAIKPQMYNLEIERDQLLSELGKVRGVVSERNDQIIKILEERDKHVRELKKASSLMQATIEPMKEKESALKGEINGYKDRIAELERELAELRKKLAELEAQNSKIPGLEARIKELEDELEKLRAQLAAANARMTELEKEIAGLKAAKEELEKELGAAKNEIEKLRAELASEKAEKEAALKELSNCRAENEKLKAENERLKNELDEAKAETEKLRKNLDELKKELEALKAENDDLKNILAGAKSEIEKLRGELEKLKAENDELKKELEKLRAEVDMLKAENNNLKSELEKLKDKINELEAELEKLKSALAAANDEVEKLKNALTAANAEIEKLKNDLNQMKADLDNLKSENDRLKNELAAEKAENLKLKNELEGMKAELEKLKGENEKLKAEKDALEAELNKLKEENGKLKSELEKLKSENDALISENQNLKKSLSDAEAKIKALEAENADLLKSISELKDRIKKLEDELEAEKSAKEAALKELAAIKSELKALLAEMDKLKAELDRLKGEIEELNKKLDDVKNERDQLKSKYNDLVAENEKLKAEVERLKVENDGLKADLEKMKAELEKLKTENEKLRGENEKLKKELKDAEEKIKALEDQVKACEEEKERLRKEIQELKDRINQLDNELAAERAAKDAALKELAALKTELAALRAELDKVRAENSKLKEDLDKLKAENEALKNENSRLKEEVEKLKADNSKLQEELDVLKVQNAKLKESLDKLNDEIRALQAQNEKLKAELDQLKAENEKLKNELAAARDEAAKLKNDLDKLKSELEALAAENNRLKGEVERLKAENAKLQEDLNALKSENAKLKESLEKLNDEMKALRAQNEKLKAELDQLKAENEKLKNELAAAREEVAKLKSDLDRLKNELEALAAENNRLKGEVEKLKAENAKLQDDLNALKNENAKLKETLDKLNDEIKALRAQNDKLKGELDQLKKELDQLKAENEKLKNDLAAARDEVAKLKNDLEKLKSELAALAAENNRLKGEVEKLKAENAKLLDDLNALRSENAKLKETLDKLNDEIKALRAQNEKLKGELDQLRAENEKLKNDLAAARVEIGKLKNDLDKLKSELEALAAENNKMKGELERLKAENAQLKEALDKLNSEIKALRAENDKLKGDLDKLKGELEKLRGEFDQLKKECDELKKELDQLKAENEKLKNDLAAARSEIAKLKSDLDKLKSELEALAAENNKLKGELDRLKAENAKLLDDLNAMKAENAKLKEALDKLNEEMKALRDENNKLKGELDKLKGELDQLKKDLDQLKAENEKLKNDLAAAKAEVEKLKNDLGRLQAELEALAAENNRMKGELERLKAENAKLLDDLKALRDENTKLKESLDKLNDEMKDLRAQNEKLKGDLDQLKKELDQLKADNDKLKKDLAAAKAEVEKLKNDLGKLKAELEALAAENDRMKADLEKLKAENSKLLGDLKALKEENSRRKSEITPESVKKLPKERKDGRRAFGEDRPPSITKRDQGSQGAGCGDYENANEQLRRNMSMQDRAVQRILNFMKYLLGERTSPPEMAGELTHTMSSVMRNRFARDIMDVLKESQFLSESIFNAETEVQRLARLLDDMNKDTRDMDGLGDAFDAESWLKSLTLTELAELHDRICMVTSCMVQQDINPDDYVDTVTETDGVCRPCVDGDDVAGDYEALSRRIAALQSEINRKQNEAAAKVESMRRAMLREQENLIRLSDEMNNQKRRNLSMKICQEEVCDCEEEDDLERYSTTLEAQTPRMIPSYHLSIVRDEEVPCPGPTDERVSRPKGPQPACSMPVKHADVPCCPRPCCPSKMTSKNRD

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00718290;
90% Identity
iTF_00719118;
80% Identity
iTF_00719118;