Basic Information

Gene Symbol
-
Assembly
GCA_943736035.1
Location
CALSEY010000015.1:814312-821154[+]

Transcription Factor Domain

TF Family
TSC22
Domain
TSC22 domain
PFAM
PF01166
TF Group
Basic Domians group
Description
These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Drosophila protein bunched [1] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 40 0.0074 1.8e+02 3.8 0.1 14 38 124 148 119 158 0.90
2 40 0.0074 1.8e+02 3.8 0.9 14 34 189 209 182 215 0.63
3 40 0.069 1.6e+03 0.7 0.6 15 34 241 260 238 263 0.78
4 40 0.051 1.2e+03 1.1 0.0 22 41 269 288 266 310 0.87
5 40 0.00083 20 6.8 2.8 13 35 421 443 410 448 0.43
6 40 0.026 6.1e+02 2.0 0.2 22 43 447 468 442 476 0.70
7 40 0.01 2.4e+02 3.3 1.9 15 43 461 489 457 496 0.87
8 40 0.039 9.2e+02 1.5 8.0 15 51 482 518 476 524 0.89
9 40 0.068 1.6e+03 0.7 3.2 12 41 500 536 499 544 0.63
10 40 3.7e-05 0.89 11.1 2.1 11 43 548 580 546 584 0.93
11 40 4.4e-05 1.1 10.9 4.6 10 48 581 618 580 619 0.85
12 40 0.00014 3.4 9.3 2.8 15 44 615 644 614 659 0.89
13 40 0.086 2.1e+03 0.3 0.1 13 22 655 664 651 667 0.85
14 40 0.00037 8.8 7.9 2.2 11 45 667 701 665 707 0.89
15 40 0.00062 15 7.2 5.4 15 42 706 733 703 740 0.88
16 40 0.017 4.2e+02 2.6 1.4 15 34 741 760 736 764 0.86
17 40 2.7e-05 0.64 11.6 4.6 11 45 765 799 759 806 0.89
18 40 0.00042 9.9 7.8 3.5 18 49 807 836 801 844 0.81
19 40 0.0016 38 5.9 2.9 15 46 853 884 849 896 0.89
20 40 0.0054 1.3e+02 4.2 1.0 18 36 898 916 893 926 0.90
21 40 0.024 5.8e+02 2.1 0.1 18 44 933 959 926 969 0.79
22 40 0.0065 1.5e+02 3.9 0.2 13 42 970 999 968 1001 0.91
23 40 0.00055 13 7.4 5.7 13 43 998 1028 997 1033 0.93
24 40 0.036 8.6e+02 1.6 2.5 13 37 1033 1057 1032 1061 0.92
25 40 0.005 1.2e+02 4.3 2.4 11 41 1059 1089 1057 1104 0.78
26 40 0.015 3.6e+02 2.8 2.3 11 42 1108 1139 1101 1145 0.83
27 40 0.00021 4.9 8.7 4.0 14 49 1146 1176 1143 1184 0.86
28 40 0.0037 89 4.7 2.4 19 48 1172 1199 1171 1208 0.72
29 40 0.0071 1.7e+02 3.8 0.7 18 45 1185 1212 1179 1218 0.78
30 40 0.0012 30 6.2 0.3 12 42 1214 1244 1211 1247 0.89
31 40 0.00078 19 6.9 1.4 15 43 1245 1273 1242 1281 0.90
32 40 0.0059 1.4e+02 4.1 0.9 19 47 1270 1296 1269 1312 0.68
33 40 0.001 25 6.5 0.2 11 42 1311 1342 1309 1344 0.91
34 40 0.0008 19 6.9 2.1 15 44 1343 1372 1340 1377 0.90
35 40 0.0013 32 6.1 2.8 15 47 1364 1394 1360 1411 0.81
36 40 0.0035 82 4.8 0.2 12 42 1410 1440 1407 1442 0.87
37 40 4.5e-05 1.1 10.9 0.4 15 43 1441 1469 1438 1475 0.90
38 40 0.00043 10 7.7 0.3 11 42 1472 1503 1470 1506 0.90
39 40 0.00083 20 6.8 1.7 15 44 1490 1519 1489 1542 0.86
40 40 0.34 8e+03 -1.5 0.1 17 37 1823 1843 1821 1849 0.91

Sequence Information

Coding Sequence
ATGGCCAATCGTACAACGTGTCAATGCGGATGCACAGACCCGCCAGAAATGACAGCCGCAGATCCTCCTCACGAGGGTTCGTGCGGTTGCAGCTACAACCCTTTCGCCGATCAAGGGAGAGACGCGGAAATAACAGATTTATCATTTGCACTGCGTAAATTAACCCTGATGAAATGTCAGATGAAAAAATGGAGAATGGAACGTCTGCAGTTGGAGAGCGAAGCGAGAGGTTTGAAACAGGTGCTTCAGGCCCACGGTTTGAACGACGATATCGTGAGACCAGATCCTCTGGTCGCTCATCTTCGAGAGTATAATGCGAGGCTGGAGAACGACAAGGAGGAGCTTGAGGAAAGCGTTAAAAGCCTGTCCGAAACCGTATCAGAATTAAAAGATCTCATAGCCGGACTGAAGATCCAGCTGCAAGAAGCCCAGAACGAGTCATCGTGCGCTGCCCTAAACCGACTTCGAGCAAAACTTAAAGAAATGACGAAAGGCGGTCAGGAAGCTGACCAAAGGGTATCCATGGTGGTCCAACGGTCAATAGAAACTCTGACTGAGTTAACGGAAAACGTTGACGATCTTAAGGCCGAGATCGAGAGACTTAGAGCTGAGATAAAGAGGCTGAAGGACTTGCTGAAGACATGCGAGGATCGAAAGGACGTTGGGGTCGAGACGATCGTCGTGGACGCTAAACCGGTAGAAAAACCGCTTGTGGAAATGGACGTCTCGGAATTGTTGAACAGGATAAAGGAACTTGAGGCGTTGATAGCTCAGCTGAGAAAACAACTGGTTGATAAAGATGCAGCGATTAATGACCTCCAGAATCAATCGTTTGAAGTCAGTTCCGAAAACAAACGTCTGTCCGTAGATTTGGACCAGATGAAGGTCAGCTTCAACGCTGTTATGCAGGAGGTCAAGGCTATGAAGGATGAACTTAAGAAGAGGGACGTCAAGGTATCCGAGCTTCTCAAAGACCTCAAAGCATCCGCGATCGACATTCTGGGATTAAACAGACTGCAGAGTGAAATGGACGCAATCAAACCCCAGATGTACAATCTCGAGGTAGAACGCGACCAGCTATTATCTGAGCTTGGTAAAGTGCGGGGCGTTGTATCGGAGCGGAATGATCAGATCATTAAAATACTCGAGGAAAGGGATAAGCACGTTAAAGCACTGGCCAGAGCCTCGAACGTAATGCAGGCGACGGTTGAACCGATGATAGAGAAAGAAGCGGCTCTCAAAAACGAGGTCCAGGGATTGAAGGACCGGATAGAACAGCTTGAGCGAGAGCTAGCCGAGCTCAGGAAAAAACTCGCTCAATTAGAAGCGGAGAATGCTGAGATACCTGGACTAGTCGATAAAATTAAGGAGCTGGAAAGTGAGCTAGGAAAGCTCAGGTCTCAGTTGGCCGAGGCGAAGTCTAGGATGGATGAACTTGAGAAAGAAGTAGCTCGGCTTAAAGCCGAGAAAGCAGAACTAGAGAAAGAGCTTGAAGAGGTGAGGAAGGAGATTGAAAAGCTGAAGGAAGAGCTCACTGAGGAGAAAGCCGCGAAAGAAGCTGCCCACAAAGATCTAAGGGATTGTAGAGCTGAGAATGAGAAGCTCAAAGCGGCAAATGAGCTGTTAATTAACGAGCTCAACACGGTTAAAGCAGAGGCCGACAAAATGAGAAACGATTTAGAGAAACTGAAAGAAGAAATGAAAGATCTAAAGACTGAGAATGATCAGTTGAAAAATCTATTCGCTGTGGCCAAAAATGAGATCGATAAGCTCAGAGGTGAGCTCGAGAAGCTCAAGGCAGAGAATGAGAAGCTCAAGGCAGAGAATGAGAAGCTCAAGGCAGAGAATGAGACGCTGAAAAATGAAGTGGAGAAATTGAACGGAGATATAGGTAAACTGAAGACAGAAAATGACAGCCTCAAAGCAGAAGTGGACAAACTTATAAACGGGCTCACTAGACTTCAGGAAGAAGTCGAGAAGTTGAAGAATAGTTTAGCAGCAGCTAAAGCTGAAGCTGAGAAGCTCAAACAAGATTTGGAAGACATGAGAAATGAGCGCGATAAATTAAGGGCTGAGAACGATCAGATCAGAAGTCAGTTAGCCAATGCTGAAGCTGAGAACGAGAAGTTGAAAAGTGAGCTCGAAAAGCTGAAGGAAGAAGTAAATAAATTAAAACAAGAAAATAGCGATCTTAAGTCAGAGAGAGACAGGCTGCAAGCAGAGCTCGAAAAACTTAGAAATAAGCTCAGTGAGCTCGAGGCTGAGATTGAAAAGGTGAAGAACGCCTTATCAGCGGCCAAGGCTGAGGTTGAGGAGCTCAAAAATGATCTGGAAAAAATGAAAACAGATCTTGAGAAGCTCAAGTCAGAAAATGATCAGCTGAAAAATGAGTTAGCCGATGCTAAGGCAGAGAACACGAGGCTCAAAAAGGAGCTCGATGAATTGAAGGGAGAAATAGACAAGCTGAGAGAAGAGAATAAGAACCTGAAGGCAGAAAAAGACAGGCTTGAAGCAGAGCTCAACAATATTAGAGGAGAATTAGATAGTCTCAAGGGTGAGAATGAAAGACTCAAGGCTGACCTCGAGAAACTTAAAAGCGAGCACGAAGCCCTGAAATCAGAAAATGAGAAGTTGAAGAAGAGCTTGAGTGACGCAGAGGCGAAGGTGAAAGCACTCGAAGCCTCGAACACCGAACTCGCGAATAAAATTGCAGAGCTAAAGAACCAAATTGATAAACTTGAGAACGAGTTGGGGTCAGAAAAAGCTGCGAAAGAAGCAGCGATTAAGGAATTGGCGGCCATCAAGGCTGAGCTGAAAGCTCTGCTGGAAGAAATGGACAAACTTAAGGCAGACTGCGACAGACTGAAAGGACAAGTCGATGATCTCACCAAACAGCTATCAGATTTGAGAAATGATTTTGATCAGCTCAAGTCTAAGTATGCCGATTTGTCGGCAGAGAGAGAAAAGTTCAAAGCTGAAGTTGATAAGTTGAAGGAAGAAAATGAAAGACTGAAAAATGACCTTGAGAAGTTGAAAGCAGAACTCGACGCGTTGAAGACAGAGAATCGTACGCTCAAGGAAGAGAATGCTAAATGGAAGGAAGAGAACGAGAAGCTGAAGAAAGCTGTGAACGAGGCTGAAGCTAGGATAAAAACACTTGAGGATGAGGTGAGGGCATGCGAAGAGGAAAAGGCAAGGCTGCGAAAAGAGATCGAGGGCCTGAAAGATCGGATTGAAGAAATGAAGAAGGAGCTCACTGCAGAGAAGGCTGCGACCGAGACAGCCTTGAAGGAACTTGGAGCCGTTAAGGCCGAGCTAGCTGCACTGAGAGCAGAGTTGGATAAAGTGAGAGCAGAGAACGCCAAGCTTAAAAGTGAGCTGGATAAAGTGAAATCAGAAAACGACGAGCTCAGAAATGAGAATGACAAAATGAAAGGCGAAATCGAGAGGCTGAAAGCAGAGGCTGCAAAATTACAAAATGATCTGAATAATTTGAAGGCAGAGAACGCGAAGCTTAAAGAAGATCTGGAAAAGCTAAATGCTGAAAACAAGACTCTCAGAGCTGAGAATGACAAATTGAAGGGAGAAATGGATCAGCTCAAGTCAGAGAACGCGAAATTGAAGGATGATCTTTCTACTGCTAGATCAGAAGCTGCAAAGCTCAAAAATGATTTAGATAAACTGAAATCTGACCTGGATGCGTTGCAAGCAGAGAATAACAGAATGAAGGGTGAACTTGATAGGCTAAGGTCAGAGAATGCAAAACTACAGGAAGATCTAAATAGCTTGAAGGCAGAGAATGGAAAGCTCAAGGGAGAACTTGACAAGCTAAATGCTGAAAATAAAGCTCTGAGATCTGAAAATGATAAATTGAAGGGAGAACTTGATCAGCTCAAGTCAGAGAACGCGAAATTGAAGGATGATCTTTCTGCTGCTAGATCAGAAACTGCAAAACTCAAGAATGACTTGGATAAACTGAAATCTGACCTGGATGCAATGCAAGCAGAGAATAACAGGATGAAGGGCGAACTTGACAGGCTAAAATCAGAGAATGCAAAACTACAGGAAGATCTGAATAGTCTGAGGGCAGAGAATGGAAAGCTCAAAGAAGAACTTGACAAGCTAAATGCTGAAAATAAAGCTCTGAGATCTGAAAATGATAAATTGAAGGGAGAACTCGATCAGCTCAAGTCAGAGAACGCGAAATTGAAGGATGATCTTTCTGCTGCTAAATCAGAAGCTGCAAAGCTCAAGAACGATTTGGATAAACTAAAATCTGACCTGGATGCGATGCAAGCAGAGAATAACAGAATGAAGGGTGAACTTGAGAAGCTGAAATCAGAGATTGCAAAACTAGAAGATGATCTAAATACCCTGAAGGCAGATAATGCGAAAATGAGGGATGACCTTGCTGCCGCTAAAGCAGAAGTGTCAAAACTCAAGAATGACTTAGATAAACTAAAGTCTGAGTTGAATGGATTGAAAGCAGAGAATGCAAAACTACACGAAGATCTTGATGCCTTAAAGGAAGAAAATGCCACGCTAAAGTCTGAGGTTGACAAACTGAAAATCGATAACAAAAAGTTGAATAAGGATCTTCAGAATGCAAAGTCAGAGTTAGACAAGTCAAAGTCTGAGGTCGATAATCCGAAGAAATTAAGCAGTGAACTAAAAGGTTTGAAAAAACCTATTTTAGAAACACCGAAGAAGAGATTCGATAGTGAAAAGGCAAAGGACACTGTGAAGAAAGAACCACAGATTAGAGTACCGCGGTCTGTTCCATCCGGGAAATTTTTTAAAAGTGATCAACGACCCTCGGTAATAAAAAAGGACCAAGGTTCACAGGGCGCGGGTTGTGGCGATTACGAAAATGCAAACGAACAGCTGAGGATGAACATGAATATGCAAGAGAGAGCTGTACAACGGATACGAGACTTTACGAAATACATTTTGGGCGAGAGGGATTCCCCACCTGAAATGGCCCAGGAATTAAAGCGTCGAATGTCTTTGCTAACGCGGAAAAATTTACCCGAAGACATAATGCAGCTATTGCAGGAGTCAGAGTTTTTATCGGATTCGATATTCGCCGCTGAAACTGAGGTCCAACGACTGGGCGAATTATTCGAGGACTTGAAGAAGCTTAGAGATCAAGTGGGTTTAGAACTTCGAGTTGGAATTGTATCTAACTATTTTTTTTTTTTTAATCAAGGATATCAGAAATTACCTATTTTCCAGGATACCAAGGATCTAGATGGGTTTGGTGACGCCTTCGACGCCGAGTCCTGGCTGAAAACCCTGACGTTAACGGAATTGGCCGAACTTCACGACAGGATATGCCTCGTTACTTCGTGCATCGTACAGCAGGACATAAACCCTGAGGATTACCTCGACGGTATTGAAACCGAAGGAATTTGTCGTCCTTGTAATTTACCTGAGGAATTCGACGAGGATTCAATTCCCGATTACGAAACGTTGAACCGTAGAATAGGAGCGCTCCAAATGCAGATAAATAAAAAACAGGACGAGGCGGCTGAGAAGATGACTAAAATGCGTAAAACCATGTGGCGAGAACAGGAAAATCTGATTAAATTATCGGAGGAAATGAACGCGCAGAAGCGTAGAAATTTATCAATGAAAATAAAAATCAACGAAAATTTAGACCCGGAAATAGGAGAAAATGAAGAAAATGAAACGGCGGTTATTTGCGATCGAAAATTACGTCTCCCAAAGATCAGCGCGATAAAGAATGATAAATTTGGGGAGAATAAGAGCAATTCAGGACGTTTTATTGGCCCCAATTCAAAGGGCTGTAACATCCCGGTGAATTGGCTCCCTTGCTGCTTTAACCCCGAAGAAAATGAGGCCGATTCAAGATCGTCTTTAACCCCGGTTAGGGTTCAGAGGCAAAAACCACCACCTTGTGCGGCTCCGGTTAAACACGCGGATGTACCTTGCTGTGTAAAGCCATGCTGCCCTTCGGCTATGAATAGGGATACATTGTTCGGTATAAAAAGGTCGAGGGATATATGA
Protein Sequence
MANRTTCQCGCTDPPEMTAADPPHEGSCGCSYNPFADQGRDAEITDLSFALRKLTLMKCQMKKWRMERLQLESEARGLKQVLQAHGLNDDIVRPDPLVAHLREYNARLENDKEELEESVKSLSETVSELKDLIAGLKIQLQEAQNESSCAALNRLRAKLKEMTKGGQEADQRVSMVVQRSIETLTELTENVDDLKAEIERLRAEIKRLKDLLKTCEDRKDVGVETIVVDAKPVEKPLVEMDVSELLNRIKELEALIAQLRKQLVDKDAAINDLQNQSFEVSSENKRLSVDLDQMKVSFNAVMQEVKAMKDELKKRDVKVSELLKDLKASAIDILGLNRLQSEMDAIKPQMYNLEVERDQLLSELGKVRGVVSERNDQIIKILEERDKHVKALARASNVMQATVEPMIEKEAALKNEVQGLKDRIEQLERELAELRKKLAQLEAENAEIPGLVDKIKELESELGKLRSQLAEAKSRMDELEKEVARLKAEKAELEKELEEVRKEIEKLKEELTEEKAAKEAAHKDLRDCRAENEKLKAANELLINELNTVKAEADKMRNDLEKLKEEMKDLKTENDQLKNLFAVAKNEIDKLRGELEKLKAENEKLKAENEKLKAENETLKNEVEKLNGDIGKLKTENDSLKAEVDKLINGLTRLQEEVEKLKNSLAAAKAEAEKLKQDLEDMRNERDKLRAENDQIRSQLANAEAENEKLKSELEKLKEEVNKLKQENSDLKSERDRLQAELEKLRNKLSELEAEIEKVKNALSAAKAEVEELKNDLEKMKTDLEKLKSENDQLKNELADAKAENTRLKKELDELKGEIDKLREENKNLKAEKDRLEAELNNIRGELDSLKGENERLKADLEKLKSEHEALKSENEKLKKSLSDAEAKVKALEASNTELANKIAELKNQIDKLENELGSEKAAKEAAIKELAAIKAELKALLEEMDKLKADCDRLKGQVDDLTKQLSDLRNDFDQLKSKYADLSAEREKFKAEVDKLKEENERLKNDLEKLKAELDALKTENRTLKEENAKWKEENEKLKKAVNEAEARIKTLEDEVRACEEEKARLRKEIEGLKDRIEEMKKELTAEKAATETALKELGAVKAELAALRAELDKVRAENAKLKSELDKVKSENDELRNENDKMKGEIERLKAEAAKLQNDLNNLKAENAKLKEDLEKLNAENKTLRAENDKLKGEMDQLKSENAKLKDDLSTARSEAAKLKNDLDKLKSDLDALQAENNRMKGELDRLRSENAKLQEDLNSLKAENGKLKGELDKLNAENKALRSENDKLKGELDQLKSENAKLKDDLSAARSETAKLKNDLDKLKSDLDAMQAENNRMKGELDRLKSENAKLQEDLNSLRAENGKLKEELDKLNAENKALRSENDKLKGELDQLKSENAKLKDDLSAAKSEAAKLKNDLDKLKSDLDAMQAENNRMKGELEKLKSEIAKLEDDLNTLKADNAKMRDDLAAAKAEVSKLKNDLDKLKSELNGLKAENAKLHEDLDALKEENATLKSEVDKLKIDNKKLNKDLQNAKSELDKSKSEVDNPKKLSSELKGLKKPILETPKKRFDSEKAKDTVKKEPQIRVPRSVPSGKFFKSDQRPSVIKKDQGSQGAGCGDYENANEQLRMNMNMQERAVQRIRDFTKYILGERDSPPEMAQELKRRMSLLTRKNLPEDIMQLLQESEFLSDSIFAAETEVQRLGELFEDLKKLRDQVGLELRVGIVSNYFFFFNQGYQKLPIFQDTKDLDGFGDAFDAESWLKTLTLTELAELHDRICLVTSCIVQQDINPEDYLDGIETEGICRPCNLPEEFDEDSIPDYETLNRRIGALQMQINKKQDEAAEKMTKMRKTMWREQENLIKLSEEMNAQKRRNLSMKIKINENLDPEIGENEENETAVICDRKLRLPKISAIKNDKFGENKSNSGRFIGPNSKGCNIPVNWLPCCFNPEENEADSRSSLTPVRVQRQKPPPCAAPVKHADVPCCVKPCCPSAMNRDTLFGIKRSRDI

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01412840;
90% Identity
iTF_01412840;
80% Identity
iTF_01412840;