Basic Information

Gene Symbol
-
Assembly
GCA_947538915.1
Location
OX384538.1:13230491-13237121[+]

Transcription Factor Domain

TF Family
TSC22
Domain
TSC22 domain
PFAM
PF01166
TF Group
Basic Domians group
Description
These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Drosophila protein bunched [1] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 37 0.49 7.8e+03 -2.1 0.7 17 34 113 130 101 135 0.80
2 37 1 1.6e+04 -3.1 0.1 18 38 160 180 156 181 0.86
3 37 0.0094 1.5e+02 3.4 1.3 14 34 221 241 213 248 0.60
4 37 0.073 1.1e+03 0.6 0.5 16 34 274 292 270 296 0.71
5 37 1.2 2e+04 -3.4 0.0 23 41 302 320 300 327 0.84
6 37 0.00063 10 7.2 2.0 19 45 445 471 440 481 0.65
7 37 0.0058 92 4.1 4.7 14 45 492 521 490 526 0.87
8 37 0.0028 44 5.1 5.0 18 54 517 553 514 556 0.87
9 37 0.23 3.7e+03 -1.0 0.2 27 40 561 574 551 581 0.59
10 37 0.0008 13 6.9 2.1 11 45 580 614 578 623 0.88
11 37 0.00013 2 9.4 1.8 15 44 633 662 630 670 0.88
12 37 0.0062 98 4.0 1.3 13 43 666 696 664 699 0.90
13 37 0.00037 5.9 7.9 4.2 11 46 685 720 680 730 0.87
14 37 0.019 3e+02 2.5 1.5 19 43 728 752 722 761 0.80
15 37 0.001 17 6.5 2.4 15 46 766 797 762 804 0.90
16 37 0.0055 87 4.2 0.8 18 37 811 830 806 846 0.85
17 37 0.054 8.5e+02 1.0 0.0 19 44 847 872 839 882 0.75
18 37 0.03 4.7e+02 1.8 0.0 13 36 883 906 880 909 0.88
19 37 0.0011 18 6.4 4.0 13 43 911 941 909 944 0.93
20 37 0.095 1.5e+03 0.2 2.9 13 37 946 970 944 976 0.89
21 37 0.055 8.7e+02 1.0 1.8 13 29 981 997 972 1012 0.65
22 37 0.0021 33 5.5 0.5 15 43 1011 1039 1005 1042 0.89
23 37 0.0032 51 4.9 4.5 12 41 1029 1058 1020 1072 0.65
24 37 0.0015 24 6.0 0.9 14 42 1080 1108 1075 1112 0.73
25 37 0.0025 39 5.3 0.2 11 42 1112 1143 1107 1145 0.92
26 37 1.9e-05 0.3 12.1 1.6 14 43 1143 1172 1140 1179 0.87
27 37 0.0039 62 4.7 1.4 18 44 1168 1194 1167 1205 0.77
28 37 7.1e-05 1.1 10.2 0.1 11 42 1210 1241 1204 1243 0.92
29 37 2e-05 0.32 12.0 1.8 15 43 1242 1270 1239 1278 0.86
30 37 0.0036 57 4.8 1.2 18 44 1266 1292 1265 1303 0.77
31 37 7.2e-05 1.1 10.2 0.0 11 42 1308 1339 1303 1341 0.93
32 37 2.1e-05 0.32 12.0 1.9 15 49 1340 1369 1337 1377 0.85
33 37 0.0048 76 4.4 0.9 18 44 1364 1390 1363 1401 0.74
34 37 1.8e-05 0.29 12.1 0.0 11 42 1406 1437 1401 1439 0.94
35 37 0.00015 2.4 9.2 0.5 15 44 1438 1467 1436 1476 0.89
36 37 0.0002 3.2 8.8 0.0 14 44 1500 1530 1497 1540 0.91
37 37 0.96 1.5e+04 -3.0 0.1 16 36 1797 1817 1796 1819 0.86

Sequence Information

Coding Sequence
ATGGAGAATCGCACTACGTGTCAATGCGGATGCACGAACCCACCAGAAATGACAGCCGCAGATCCTCCTTACGAGGGTTCATGCGGTTGCAGCTACAACCCCTTCGCCGATCATGAGAGAGACGCGGAAATCACAGAATTATCATTCGCGCTGCGAAAATTGACCCTGATGAAATGCCAGATGAAAAAATGGAGAATGGAGCGTCTGCAGTTGGAGAGCGAGACGAGAGGTTTGAAACAGGTGCTGCAGGCCCACGGTTTGAACGACGACATCGTGAGACCAGATCCGCTGGTCGCTCATCTTCGAGAGTATAATGCGAGGCTGGAGAACGACAAGGAGGAGCTCGAGGAGCGCGTTAAAAGCCTGTCCGAAACCGTATCGGAGTACGAGAGCCAGGAGAGCTCGTCATCCGGAGCTGTGAATAGACTGCGCGACAGAATTCGAGCGATCAAGGAAGCCAACGTCGTCGAGAAACGGAGGCTGAGAGACCTTATAGCTGGACTGAAGATCCGGCTGCAAGAAGCCCAGAACGAGTCGTCCTGCGCTGCCCTAAATCGACTTCGAGCTAAGCTGAGAGAGATGATGAAAGGCGGCCAAGAAGCTGACCAAAGAGTATCCATGGTGGTCCAACGGTCGATAGAAACTCTGACCGAGCTAACGGAAAATGTTGACGACCTCAAGGCCGAGATCGAGAGACTCAGAGCTGAGATAAAGAGGCTGAAGGATCTGCTCAAGAAATGCGAAGATCGGACGGACGTCGGGGTCGAGACGACCGTCGTGGACGTTAAACCGGTGGAAAAACCGCTCGTGGAAATGGACGTCGCGGAATTGTTGAACAGGATCAAGGAGCTCGAGGCGCTGATAGCTCAGCTGAGAAAACAACTGGTTGACAAAGATGCTGGGATTAATGACCTCCAGAATCAATCGTTCGAGGTCAGTTTGGACAACAAACGTCTGTCCGCAGATTTGGACCAGATGAAGGTCAGCTTCAACGCTGTTATGCAGGAGGTCAAGGCTATGAAGGATGAACTCAAGAAGAGGGACGTCAAGGTATCCGAGCTTCTCAAAGAGCTCAAAGCATCCGCGATCGATATTCTGGGATTAAACACACTGCAGAGTGAAATGGACGCAATCAAGCCCCAGATGTACAATCTCGAGGTAGAGCGCGGCCAGCTATTATCCGAGCTTGGGAAAGTGCGGGGCGTTGTATCGGAGCGGAATGATCAGATCATTAAAATACTCGAAGAGAGGGACAAGCACGTTAAAGCATTGGCCAGGGCCTCGAACGTGATGCAGGCGACGGTCGAACCGTTGATGGAGAAAGAAGCGGCTCTCAAAAACGAGGTCCAAGGATTGAAAGACCGGATAGAACAGCTCGAGCGAGAGCTAGCCGAGCTCAGGAAGAAACTGGCTCAATTAGAAGCGCAGAATGCTGAGATACCTGGACTGGTCGACAGGATTAAGGAGCTGGAAAATGAGCTAGAAAAGCTCAGGTCTCAGTTGGCCGAGGCCAAGTCCAGGATGGATGAGCTCGAGAAAGAACTAGCTCGCCTCAAAGCGGAGAAAGAAGAACTGGAGAAAGAGCTCGGGGAGGCGAGGAAGGAGAATGAGAAGCTGAAGGAAGAGCTCAGTGCGGAGAAAGCTGCGAAAGAAGCTGCCCAAAAAGATCTCAGGGATTGTAGAGCTGAGAATGAGAAGCTCAAAGCAGAAAATGAGAAGCTAATTAATGAGCTAAGCGCGGCTAAGGCAGAGGCCGACAAACTGAGAAACGATTCAGAGAAACTGAGGGAAGAAATGGAAGGTCTGAAGGCTGAGAATGATCGGCTGAAAAATCTGCTCACCGCCGCCAAAAATGAGATCGACAAGCTCGGAGGTGAGCTCGAGAAGCTCAAGGCAGAGAATGAGAAACTCAAAAATGAGGTAGAGAAACTGAACGGAGATATAGGTAAACTGAAGACGGAAAATGACAGCCTCAAAGCAGAGCTCGACAAACTTAGAAACGAGCTCGCTGGACTTCGGGACGAAATTGAGAAGCTGAAGAATGCTTTAGCAGCAGCCAAGGCTGAGGCTGAGAAGCTCAAAAATGATCTGGAAAAAATGAAAACAGATGTTGAGAAGCTCAAGGCGGAAAATGATCAGCTGAAAAATGAGCTAGCCGACGCTAAGGCAGAGAACGCGAGGCTCAAAAAGGAGCTCGATGACTCGAAGGGAGAAATAGACAAGCTGAGAGAAGAGAACAAGAACCTCAAGGCAGAAAAAGACAAAATTAGAGGAGAATTAGATGGTCTCAAGGGTGAGAATGAAAGGCTGAAGGCTGACCTCGAGAAACTCAAAAGCGAGTACGAGGCCCTGAAATCGGAAAATGAGAAGTTGAAGAAGAGCTTGAGCGACGCAGAGGCGAAGGCGAAAGCGCTCGAAGTCTCGAACGCTGAACTTGCGAGTAAAATTGCAGAGCTGAAGAACCAAATTGATAAACTTGAGAACGAATTGGCGTCGGAAAAAGCTGCGAAGGAAGCGGCGATCAAGGAATTGGCGGCTATCAAGGCCGAGCTAAAAGCTCTGCTGGCAGAAATGGACAAACTTAAGGCAGACTGCGACAGACTGAAAGGACAAGTCGACGATCTCACTAAACAGCTATCAGATTTGAGAAATGATTTTGATCAGCTCAGGTCCAAGTATGCCGACTTGTCGGCAGAGAGAGACAAGCTCAAAGCTGAGGTTGATAAGTCGAAGGAAGAAAATGACAGACTGAAAAATGACTTGGAGAAGCTCAAGGCAGAGCTCGACGCGTCGAAGACGGAGAATCGTACGCTCAAGGAAGAGAATGCCAAATGGAAGGAGGAGAACGAGAAGCTGAAGAAAGCCGTGAGCGAGGCTGAAACTAGGATGAAAATTCTTGAGGACGAGGTAAAGGCATGCGAAGAGGAAAAGGCAAGGCTGCGAAAAGAGCTCGAAGGTCTGAAAGACCGGATCGAAGAAATGAAGAAGGAGCTCGCTGCAGAGAAAGCTGCGAAAGAGGCAGCCTTGAAGGAACTTGGGGCCCTTAAGACCGAGCTAGCTGCTCTGAGAGCAGAGCTGGACAAAGTGAGGACAGAGAACGCCCAGCTAAAAAGCGAGCTCGATAAACTGAAATCAGAAAACGCCGAGCTCAGAAATGAGAATGAGAAAATGAAAGGCGAAATTGATAAGCTGAAAGGGGAGGTGGGAAAATTACAAAATGATCTGAATACCTCGAGGGCGGAGAGTGCGAAGCTCAAAGAAGATCTGGACAAGCTGAATGCCGAAAACAAGACTCTCAGAGCTGAGAATGACAAATTGAAGGATGATCTTGGTGCCGCCAGATCAGAAGCTGCGAAACTCAAGAACGACTTTGACAAGCTGAAATCTGAACTGGATGCGATGCAAGCAGAGAATAACAGAATGAAGGGAGAACTCGATAAGCTGAAATCAGAGATTGCGAAACTACAAGATGATCTGAATACCGCGAAGGCAGAGAATGGGAAGCTCAAAGAAGACCTTGAGAAACTGAATGCTGAAAACAAGGCTTTGAGATCCGAGAATGAGAAATTTAAAGGAGAACTGGATCAGCTCAAGTCAGAGAACGCGAAACTTAAGGATGACCTCGCTGCTGCTAGATCGGAAGTAGCAGGACTCAAGAACGACTTGGATAAACTGAAATCCGAACTGGGTGCGATGCAAGCGGAGAATAACAGAATGAAGGGAGAACTCGATAAGCTGAAATCAGAGATTGCGAAACTGCAAGATGATCTGAATACCGCGAAGGCAGAGAATGGGAAGCTCAAAGAAGACCTTGAGAAACTGAATGCTGAAAACAAGGCTTTGAGATCCGAGAATGAGAAATTTAAAGGAGAACTGGATCAGCTCAAGTCAGAGAACGCGAAACTTAAGGATGACCTCGCTGCTGCTAGATCGGAAGTAGCAGGACTCAAGAACGACTTGGATAAACTGAAATCCGAACTGGGTGCGATGCAAGCGGAGAATAACAGAATGAAGGGAGAACTCGATAAGCTGAAATCAGAGATTGCGAAACTGCAAGATGATCTGAATACCGCGAAGGCAGAGAATGGGAAGCTCAAAGAAGACCTTGAGAAACTGAATGCTGAAAACAAGGCTTTGAGATCCGAGAATGATAAATTCAAAGGAGAACTGGATCAGCTCAAGTCAGAGAACGCGAAACTTAAGGATGACCTCGCTGCTGCTAGATCGGAAGTAGCAGGACTCAAGAACGACTTGGATAAACTGAAATCCGAACTGGGTGCGATGCAAGCGGAGAATAACAGACTGAAGGGAGAATCCGACAAGCTGAAATCAGAGATGGCGAAACTACAAGACGATCTGAACACCCTGAAGTCAGAGAATGCGAAAATGAAGGATGAGCTCGCCGCCGCTAAGGCAGAAGCGTCGAGACTCGAGAATGACTTAGTCAAGCTGAAATCTGTGCTGGACGCGATGCAAGCAGAGAATAACAGAATGAACGGAGAACTTGACGGGCTGAAGGCGGAGAATGCAAAACTACAGAAAGACCTTGGTGCCTTGAAGAAGGAGAATGCCACGCTGAAGTCTGGGATCGACAAACTGAAAACCGACAACGATAAGTTGAACAAGGATCTTCGGAACGCGAAATCAGAGTCAGACAAGCCGAAAGCTGGGGTCGACAATCTCAAGAAATCGGGGAACGAACCGAAAGAGACACGGAGGAGACTCGATACCGAAAAGGCGAAGGAGAGTGGGAAGAAAGAGCCCCAGATTAAAGCACCGCGGTCTGTTCCATCCGGGAAATTTTTTAAAAGTGATCAACGACCCTCGGTCGTAAAAAAGGACCAAGGTTCACAGGGCGCGGGTTGTGGCGATTACGAAAATGCAAACGAACAGCTGAGGAGAAACATGAATATGCAGGAGAGAGGCGTGCAACGGATACAAGACTTCGCGAAGTACGTTTTGGGCGAGAGAAGTTCCCCGCCAGAAATGGCCCGGGAATTAAGGCACCGAATGTCCTGGGCAGCGCGCAGGAGCTTACCCGAAGACATAACGCAGCTTTTGGAGGAGTCGGAGCTTTTATCGGACTCGATATTCGCCGCCGAAACCGACGTCCAGCGGCTGGTCAAGCTCGTCGAGGAACTGAAGGAGCTCGGAGATCAAGAAACTCAGGACCTGGACGGGCTCGGTGACGCCTTCGACGCCGAGTCCTGGCTCAAGACACTGACGTTGACGGAACTCGCCGAACTCCACGACAGGATATGCCTGGTGACTTCGTGCATGGTTCAGCAGGACGTAAACCCCGAGGATTACGTCGACGGCATCGAAACCGAGGGAATTTGCCGGCCCTGTAGTTTGCCCGATGAATTCGCGGAGGACTCGACGCCCGAGTACGAAGCTTTGAACAGGAGAATAGGGGCGCTTCAAATGCAGATAAACAAGAAAAAGGACGAGGCGGATGACAGGGTGAAGAAAATGCGCGAAACCGTGTGGCGAGAGCAGGAAAATCTCATCAAATTATCGGAGGAAATGAACGCCCAGAAGCGGAGGAATTTATCGATGAAGATTAAAATGAACGAAAATATAGAGGCGGAAGGGGATGAAAAGGGAGGAAAGGAAACGGCGGTCCTCTGCGACCGGAAGTGTCTCCCGAAGTTCGTTATTCGAAACTCTTTTCCCTCCTTTTCCACTTTTCGATTTTTAATTCCGGTATTTATTGGCCGCAGGATCAGCGCGATAAAGGACGATAAATTTGGGGAGAAGAAGGGCAATTCGGGACGTTTTATTGGCGCCAATTCAAACGCCTGTAACATCCCGGTGAATTGGCTCCCTCGCTTTAACCCCGAGGAAAATGAGCCCGATTCAAGATCGTCTTTAACCCCGGTTAGGGTTCAGAGACAAAAACCACCGCCCTGTGCGGCTCCGGTTAGACACGCGGATGTACCTTGCTGCGTCAAGCCTTGCTGCCCCTCGGCTATGAATAGGGATACACTGTTCGGCAAAAAGAGGGATCTGAACACCGCGAAAAATTGA
Protein Sequence
MENRTTCQCGCTNPPEMTAADPPYEGSCGCSYNPFADHERDAEITELSFALRKLTLMKCQMKKWRMERLQLESETRGLKQVLQAHGLNDDIVRPDPLVAHLREYNARLENDKEELEERVKSLSETVSEYESQESSSSGAVNRLRDRIRAIKEANVVEKRRLRDLIAGLKIRLQEAQNESSCAALNRLRAKLREMMKGGQEADQRVSMVVQRSIETLTELTENVDDLKAEIERLRAEIKRLKDLLKKCEDRTDVGVETTVVDVKPVEKPLVEMDVAELLNRIKELEALIAQLRKQLVDKDAGINDLQNQSFEVSLDNKRLSADLDQMKVSFNAVMQEVKAMKDELKKRDVKVSELLKELKASAIDILGLNTLQSEMDAIKPQMYNLEVERGQLLSELGKVRGVVSERNDQIIKILEERDKHVKALARASNVMQATVEPLMEKEAALKNEVQGLKDRIEQLERELAELRKKLAQLEAQNAEIPGLVDRIKELENELEKLRSQLAEAKSRMDELEKELARLKAEKEELEKELGEARKENEKLKEELSAEKAAKEAAQKDLRDCRAENEKLKAENEKLINELSAAKAEADKLRNDSEKLREEMEGLKAENDRLKNLLTAAKNEIDKLGGELEKLKAENEKLKNEVEKLNGDIGKLKTENDSLKAELDKLRNELAGLRDEIEKLKNALAAAKAEAEKLKNDLEKMKTDVEKLKAENDQLKNELADAKAENARLKKELDDSKGEIDKLREENKNLKAEKDKIRGELDGLKGENERLKADLEKLKSEYEALKSENEKLKKSLSDAEAKAKALEVSNAELASKIAELKNQIDKLENELASEKAAKEAAIKELAAIKAELKALLAEMDKLKADCDRLKGQVDDLTKQLSDLRNDFDQLRSKYADLSAERDKLKAEVDKSKEENDRLKNDLEKLKAELDASKTENRTLKEENAKWKEENEKLKKAVSEAETRMKILEDEVKACEEEKARLRKELEGLKDRIEEMKKELAAEKAAKEAALKELGALKTELAALRAELDKVRTENAQLKSELDKLKSENAELRNENEKMKGEIDKLKGEVGKLQNDLNTSRAESAKLKEDLDKLNAENKTLRAENDKLKDDLGAARSEAAKLKNDFDKLKSELDAMQAENNRMKGELDKLKSEIAKLQDDLNTAKAENGKLKEDLEKLNAENKALRSENEKFKGELDQLKSENAKLKDDLAAARSEVAGLKNDLDKLKSELGAMQAENNRMKGELDKLKSEIAKLQDDLNTAKAENGKLKEDLEKLNAENKALRSENEKFKGELDQLKSENAKLKDDLAAARSEVAGLKNDLDKLKSELGAMQAENNRMKGELDKLKSEIAKLQDDLNTAKAENGKLKEDLEKLNAENKALRSENDKFKGELDQLKSENAKLKDDLAAARSEVAGLKNDLDKLKSELGAMQAENNRLKGESDKLKSEMAKLQDDLNTLKSENAKMKDELAAAKAEASRLENDLVKLKSVLDAMQAENNRMNGELDGLKAENAKLQKDLGALKKENATLKSGIDKLKTDNDKLNKDLRNAKSESDKPKAGVDNLKKSGNEPKETRRRLDTEKAKESGKKEPQIKAPRSVPSGKFFKSDQRPSVVKKDQGSQGAGCGDYENANEQLRRNMNMQERGVQRIQDFAKYVLGERSSPPEMARELRHRMSWAARRSLPEDITQLLEESELLSDSIFAAETDVQRLVKLVEELKELGDQETQDLDGLGDAFDAESWLKTLTLTELAELHDRICLVTSCMVQQDVNPEDYVDGIETEGICRPCSLPDEFAEDSTPEYEALNRRIGALQMQINKKKDEADDRVKKMRETVWREQENLIKLSEEMNAQKRRNLSMKIKMNENIEAEGDEKGGKETAVLCDRKCLPKFVIRNSFPSFSTFRFLIPVFIGRRISAIKDDKFGEKKGNSGRFIGANSNACNIPVNWLPRFNPEENEPDSRSSLTPVRVQRQKPPPCAAPVRHADVPCCVKPCCPSAMNRDTLFGKKRDLNTAKN

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01411216;
90% Identity
iTF_01412046;
80% Identity
iTF_01412046;