Basic Information

Gene Symbol
-
Assembly
GCA_949628255.1
Location
OX451207.1:22159994-22167186[+]

Transcription Factor Domain

TF Family
TSC22
Domain
TSC22 domain
PFAM
PF01166
TF Group
Basic Domians group
Description
These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Drosophila protein bunched [1] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 45 1.1 1.9e+04 -3.2 4.5 13 40 101 128 96 147 0.58
2 45 0.047 7.9e+02 1.2 0.3 14 32 137 155 134 177 0.78
3 45 0.031 5.2e+02 1.8 0.5 16 34 222 240 213 248 0.57
4 45 0.41 6.9e+03 -1.8 0.5 19 34 276 291 274 294 0.68
5 45 0.0015 25 6.0 1.7 11 39 443 471 441 483 0.65
6 45 0.015 2.5e+02 2.8 4.7 14 47 491 522 484 527 0.86
7 45 0.11 1.8e+03 0.1 2.2 15 49 513 547 511 555 0.87
8 45 0.42 7.1e+03 -1.9 1.1 31 42 557 568 531 582 0.58
9 45 6.3e-05 1.1 10.4 2.1 11 44 579 612 577 626 0.84
10 45 0.00019 3.2 8.9 2.0 15 43 632 660 628 661 0.92
11 45 0.0056 94 4.2 2.6 13 34 658 679 656 694 0.87
12 45 0.011 1.8e+02 3.2 1.4 13 45 686 718 681 724 0.88
13 45 0.02 3.4e+02 2.4 3.8 17 45 725 751 720 757 0.82
14 45 0.0071 1.2e+02 3.8 2.0 17 44 753 780 749 784 0.87
15 45 3.1e-05 0.52 11.4 2.4 11 45 782 816 780 824 0.89
16 45 0.0037 62 4.7 3.9 20 42 826 848 819 860 0.72
17 45 0.068 1.1e+03 0.7 5.3 15 41 870 896 849 902 0.61
18 45 0.017 2.9e+02 2.6 5.1 15 45 877 907 868 921 0.74
19 45 0.006 1e+02 4.1 0.5 19 37 923 941 921 960 0.85
20 45 0.37 6.2e+03 -1.7 0.0 19 42 958 981 946 995 0.69
21 45 0.0082 1.4e+02 3.6 0.4 15 41 996 1022 975 1030 0.82
22 45 0.0037 63 4.7 3.5 14 43 1030 1059 1022 1060 0.89
23 45 0.00024 4 8.5 4.9 13 43 1050 1080 1048 1085 0.91
24 45 0.047 7.9e+02 1.2 3.0 13 38 1071 1096 1069 1111 0.68
25 45 0.0085 1.4e+02 3.6 1.1 13 30 1106 1123 1099 1139 0.80
26 45 0.0066 1.1e+02 3.9 0.5 15 43 1136 1164 1130 1170 0.89
27 45 0.0045 76 4.5 2.0 11 42 1146 1177 1140 1182 0.85
28 45 3.4e-06 0.057 14.5 1.1 14 43 1184 1213 1181 1220 0.91
29 45 0.014 2.3e+02 2.9 0.8 19 43 1210 1234 1209 1250 0.59
30 45 0.00016 2.6 9.1 0.5 11 42 1251 1282 1249 1284 0.93
31 45 7.4e-05 1.2 10.2 0.3 15 43 1283 1311 1280 1318 0.89
32 45 0.012 2e+02 3.1 0.5 19 41 1308 1330 1306 1348 0.52
33 45 5.6e-06 0.094 13.8 1.0 11 42 1349 1380 1344 1383 0.93
34 45 7.4e-05 1.2 10.2 0.3 15 43 1381 1409 1378 1416 0.89
35 45 0.057 9.6e+02 0.9 0.3 20 36 1421 1437 1408 1446 0.45
36 45 0.00017 2.8 9.0 0.9 11 42 1447 1478 1441 1481 0.92
37 45 7.6e-05 1.3 10.1 0.5 15 43 1479 1507 1476 1516 0.88
38 45 0.017 2.8e+02 2.6 0.7 19 41 1504 1526 1503 1545 0.52
39 45 5.5e-06 0.093 13.8 1.1 11 43 1545 1577 1542 1580 0.92
40 45 7.6e-05 1.3 10.1 0.4 15 43 1577 1605 1574 1612 0.89
41 45 0.048 8e+02 1.2 0.5 24 37 1614 1627 1603 1642 0.42
42 45 8.6e-06 0.15 13.2 1.0 11 42 1643 1674 1638 1677 0.93
43 45 0.00085 14 6.8 1.7 13 44 1673 1704 1671 1716 0.87
44 45 0.17 2.9e+03 -0.6 0.1 15 33 1731 1749 1727 1751 0.86
45 45 0.92 1.5e+04 -2.9 0.1 17 37 1973 1993 1972 1996 0.88

Sequence Information

Coding Sequence
ATGGCTAATCGTACGTGTCAATGCGGATGCACAGACCCGCCGGAAATGTCGGCCGCAGATCCTCCCCATGAGGGTTCGTGCGGTTGTAGTTACAACCCTTTCGCGGATCAAGGAAGAGAGAGTGAAATCACGGAATTATCATTCGCACTGCGTAAATTAACTCTGATGAAATGCCAGATGAAAAAATGGAGAATGGAACGTCTTCAGTTGGAGAGCGAGGCCAGAGGATTAAAGCAAGTTCTTCAGGCCCACGGTTTGAACGACGATATCGTCAGACCAGATCCTCTGCTTGCTCATCTTCGGGAGCAGAATGAGAGACTGGAGAACGAGAAAGCGGAGCTCGAGGAGAGCGTGAAGAGCCTTTCCGAGGCCGTATCGGAGTATGAGAATCAGGAATCCTCGACGTCCGATGCTGTGAAtaaattgcgtgaaaaaattcGGACAATGAAGGAAACTAACGCCGTTGAGAAAAGGAGATTAAGAGACCTCATATCTGGGCTGAAGATTCGGCTCCAAGAAGCTGAGAACGAGTCGTCATGCGCTGCCTTAAATCGACTGCGAGCAAAACTTCGAGAAATGATGAAAGGCGGTCAAGAAGCTGACCAAAGAGTTTCTATGGTGGTTCAGCGGTCGATAGAAACTCTGACCGAGTTAACGGGGAACGTCGACGATCTCAAGGCCGAGATCGAGAGACTTAGAGCCGAGATAAAGAGGCTGAAGGATTTGCTAAAGACATGCGAAGAGCGAAAGGACGTCGGGGTTGAGACGACGGTCGTGGACGTTAAACCGGTCGAGAAACCGCTCGACGAAATGGACGACTCGGAATTGTTGAACAGGATCAAGGAACTCGAGGCGCTGATAGCTCAGCTGAGAAAACAATTAATCGATAAAGATGCTGCGATGAATGATCTCCATAATCAATTGTTCGATGCTGGTGCAGACAACAAACGTTTGTCCACCGATTTGGACCAGATGAAGGTCAGCTACAACGCCGTTATGCAGGAGGTCAAGGCCATGAAGGATGAACTGAAGAAGAGGGACGCCAAGGTATCCGAACTTCTCAAAGACCTGAAAGCATCCGCGATCGATATGCTGGGATTAAACAGACTGCAGAGTGAAATGGACGCAATCAAGCCCCAGATGTACAATCTCGAGGTAGAACGCGGCCAGCTATTATCTGAGCTCGGTAGAGTGCGGGGCGTTGTATCGGAGCGGAATGATCAGATCATTAAAATACTCGAGGATAGGGACAAGCACGTTAAAGCACTGGCCAAGGCATCAAGCTTAATGCAGGCGACGGTTGAACCgatgaaagagaaagaaacggCTTTAAAAAATGAGGTTCAAGGATTTAAAGACCGGATAGCAGAGCTTGAACTCGAGCTATCCGAGCTCAGGAAGAAGCTAGCTCAATTAGAAGAGGAAAATTCTAAGATACCTGGACTAGTACATAAAATTAAGGAGCTCGAAGACGAACTAGAAAAGCTCAGGTCTCAGTTGACCGAGGCAAACTCCAGGGTGGTTGAGCTTGAGAAAGAAGTAGCCCGGTTCAAAGCTGAGAAAGAAGAACTAGAGAGTGAGCTTGGTGAGGCGAGAAAGGAGCTTGAAAAGCTGAAGGAGGAGCTCGCTAAGGAGAAAGCTGCGAAAGACGCTGCTCTAAAAGATCTTGAGGATTGCAGAGCAGAGAACGAGAAGCTCAGAGCGGAAAATGAACGCCTAATAAATGAGCTAAACGCGGCTAAGGCTGAGAGCGATAAGCTCAGAAACGATTTGGGGAAACTGAAagaagaaatggaaaatttaaaGGCTGAAAATGATCAGTTGAAAGATCAGTTCAACGTGGCCAAAAATGAGATCGACAAGCTCAGAGGTGAGCTCGAGAATCTCAAGGCAGAGAATGACAAGCTCAAAAATGACGTAGAGAAACTTAACGGAGATATGGATAAACTGAAGACGGAAAATAACAGCCTCAAAGAAGAGCTCGACAAACTTAGAAACAAGCTCGCTGGACTTGAGGATGAAATCGAGAAACTAAAGAGCGCTTTGGCTGCAGCTGAAGCTGAGGCTGAGAAGCTCAAACAAGATTTAGAAAACACGAAAAACGAGCTCGATAAATTAAGGGCTGAGAGTGATCAGCTCAAAAGTCAGTTAGCCGATGCCAAAGCTGATAACGAGAGGCTTAAAATTGAACTCGAAAAACTGAAGGGAGAAACAGAGAAactaaaacaagaaaacagcGAACTTAAGGCAGAGAGAGACAAGCTGCAAGCAGACCTCGAAAAACTTGGAAAAAGGCTCGATGAGCTTGAATCTGAAAATGAGAAACTGAAGAACGCTTTAACAGCGGCTAAAGCTGAGGTTGAAAGGCTCAAAAATGATCTAGACAAAATGAAATCCGATATCGAGAAGCTAAAGGCAGAAAATGATCAGCTGAATAACGAGCTGGCCAATGCTAAGGCAGAGAACTCGAGGGTCAGAAAGGAGCTTGATGAGTTGAAGGGAGAAATGGAGAAGCtgagagaagagaataaaaaccTTAAGGCAGAAAGAGACGGGCTAGAAGCGGAGCTCAAAAAACTTAGAGGAGAATTAGACAGTCTTAAGGGTGAGAATGAGAGACTTAAAGGTGAGAATGAAAGACTTAAGGCCGAGCTTGACAAACTTAAAAGCGAGAACGAGGCCTTGAAATCAGAAACCGAGAAGTTGAAAAAGAGCTTGAGCGATGCAGAGGCAAAGCTAAAAGAGCTTGAAGCCTCTAACGCTGATCTCGCGAATAAAATATCAGAGCTGAAGAACCAAATTGATAAACTTCAGACCGAGTTGGCGTCAGAAAAAGCTGCCAAAGAAGCAGCTGTTAAGGAATTGGCGGCTATTAAGGCCGAGCTAAAAGCTCTGCTGGCGGAAATGGACAAACTCAAGGCAGACTGCGACAGGCTAACAGGACAAGTAGACGATCTCACTAAACAACTATCGGATTTGAAGAACGATTTTGATCAGCTCAAGTCCAAATATGCCGAATTGGCGACAGAGAGAGACAAGCTCAAGACTGAGCTCGATAAGTTCAAGGCAGAAAACgacaagatgaaaaatgacCTGGAGAAGCTCAAGGCAGAGCTGGACGCGTTAAAAACCGAGAATGGTAAGCTCAAAGAGGAGAACGGTAAGCTTAAGGAGGAAATTGGTAAGCTAAAAGAAGAGAATGGTAAGTTTAAGGAAGAGAACGAGGAGCTGAGGAAAGACTTGCGCGATGCTGAAGCTCGGATAAAAACACTTGAGAACGAGGTAAAGTCGTGCGAGGAGGAAAAATCAAGGCTGCGAAAAGAGATCGAGGGCCTGAAAAATGGGATTGAAGAACTCAAGAAGGAGCTTGATGCAGAGAAAGCAGCGAAAGAGGCAGCCTTGAAGGAGTTATCAGCTCTAAAAACTGAGCTAGCTGCACTGAGAGCAGAGCTGGATAAAGTAAGGGCAGAGAACGCCAAGTTAAAAAGTGAGCTCGATAAACTGAAATCAGAAAACGACGAGCTCAGGaatgacaataataaaatgaagggAGAGATTGATAAGCTCAAAGCAGAGGTTGCGAAATTAGAAAATGATCTAAGTGCCTTGAAGGCAGAGAACACCAAACTCAAAGAAGATCTGGACAAGCTAGATGCAGAAAATAAAGCTCTGAGATCCGATAATAACAAGTTGAAAGGAGAATTGGATCAGCTCAAATCTGAGAACGCGAAATTGAAGGACGATTTAGCTGCGGCTAGAGAAGAAGCTACAAAGCTCAAAAATGATTTGGATAAACTGAAATCCGACTTGGATGCGATGCGAgcagagaataataaaatgaagggTGAGCTTGAAAGGCTGAAGTCAGATATTGCAGGATTACAAAACGACCTGAACACCCTGAAAGCAGATAATGCGAAGCTCAAAGAAGACCTCGACAAACTAAATGCAGAAAATAAGTCTCTGAGAGCTGATAACGACAAGTTGAAAGGCGAATTGGATCAGCTAAAATCTGAGAACGCGAAATTGAAGGACGATTTAGCTGCGGCTAGAGAAGAAGTTACAAAGCTCAAAAATGATTTGGATAAACTAAAATCCGACTTGGATGCGATGCGAGCAGAGAATAATAAGATGAAGGGTGAGCTTGAAAGGCTGAAGTCAGATATTGCAGGATTACAAAACGACCTGAACACCCTGAAAGCAGATAATGCGAAGCTCAAAGAAGACCTCGACAAACTAAATGCAGAAAATAAGTCTCTGAGAGCTGATAACGACAAGTTGAAAGGAGAATTGGATCAGCTAAAATCTGAGAACGCGAAATTGAAGGACGATTTAGCTGCGGCTAGAGAAGAAGCTACAAAGCTCAAAAATGATTTGGATAAACTGAAATCCGACTTGGATGCGATGCGAgcagagaataataaaatgaagggTGAGCTTGAAAGGCTGAAGTCAGATATTGCGGGATTACAAAACGATCTGAACACCCTGAAAGCAGATAATGCGAAGCTCAAAGAAGACCTCGACAAACTAAATGCAGAAAATAAGTCTCTGAGAGCTGATAACGACAAGTTGAAAGGCGAATTGGATCAGCTAAAATCTGAGAACGCGAAATTGAAGGACGATTTAGCTGCGGCTAGAGAAGAAGTTACAAAGCTCAAAAATGATTTGGATAAACTAAAATCCGACTTGGATGCGATGCGAGCAGAGAATAATAAGATGAAGGGTGAGCTTGAAAGGCTGAAGTCAGATATTGCAGGATTACAAAACGACCTGAACACCCTGAAAGCAGATAATGCGAAGCTCAAAGAAGACCTCGACAAACTAAATGCAGAAAATAAGTCTCTGAGAGCTGATAACGACAAGTTGAAAGGAGAATTGGATCAGCTAAAATCTGAGAACGCGAAATTGAAGGACGATTTAGCTGCGGTTGGAGAAGAAGTGTCAAGGCTCAAAAATGACTTGGATAAACTGAAATCCGAACGGGACGCGCTGCAAGCGGAGAATAACAAGATTAAAGGAGAGCTTGAAAGGCTGAAATCCGAGAATGCGAAACTACATAAAGATCTTGATGTCTTGAAGCAGGAGAACGGCAAGCTAAAGTCTGAGGTTGACAAACTGAAAATCGATaatgaaaagttgaataaGGATCTTCAGAAGGCAAACTCGGAGTTAGACAAGCACAAAGCCGAGGTCGATAATCTGAAGAAATTAACCAAGAAACTAAAAGAGACACCGAGAAAACTCGATACCCTGAAGCCAAATTACACCGTGGAAAAAGAACTGCAGATTAAAGTACCCCGCGTTGTTCCATCgggtaaaattattaaaagtGATCAACGACCCTCGGTTATAAAAAAGGACCAAGGTTCACAGGGCGCGGGTTGTGGCGATTACGAAAATGCAAACGAACAGCTGATGAGAAACATGAATATGCAAGAGAGGGCCGTACAGCGTATACgagaatttataaaatacgTACTCGGCGAGAGGAGTTCGCCACCCGAAATGGCGCGGGAATTGATCCATCGCATGTCTTCGGTAACGCGGAATAATTTTCCCGAAGACATAATGCGGATTCTGAAAGAGTCGCAGTTCTTATCGGAATCGATATTCAATGCCGAAACCGAGGTTCAACGTCTGGCCAAGCCCGAGGAGGAATTAAACAACCCGAAGGATGaaGATATTCGGGACGTCGATGGACTCGGTGACGCCTTCGACGCCGAGTCGTGGTTGAAAacacTGACGTTAACCGAATTGGCAGAACTCCACGACAGGATATGCCTCGTAACTTCGTGCATGGTTCAGCAGGACATAAACCCCGAGGATTACGTGGACGGTATTGAAACCGATGGAATTTGCCATCCTTGTAGTACCGTAACGGAGGAATTCACCGACGATTCAATTCCCGATTACGAAGCTTTGAACAGAAGAATCGGAGCTCTTCAAATGCAGATAAACAAGAAACAGGACGAGGCTGCtaaaaaggttaaaaaaatgcGCCAATCTATGTGGCGGGAACAGGAAAACCTGATCAAACTATCCGAGGAAATGAACGCccaaaaacgaagaaatttaTCGATGAAGATTAAGATCAATGAGAACATCGAAGCGGATGTAGAAggagaaacaaataaaaatgaaaagacgGTTCTATGCGGTGGAAAACTTCCCCCAAAgatcggACAGAGGACGGACGATATGTTCGgtgagaaaaattgcaattccGGACGTTTTATCGGCGCCAATTCCAACGTCGCGTTGAATTTGGTCCCTTGCTTTAACCCAGAGGAAAATAAGCCCGATTCAGGATCGTCTTTGACCGCGGTTAGGGTTGGGAGAAAAAAACCACCGCTGTGTGCGGCACCGGTTAGACACGCGGACGTACCTTGCCGTGTTAAGCCCCGCTGTCCCTCGGCTATGAATCGAGTTATTTAG
Protein Sequence
MANRTCQCGCTDPPEMSAADPPHEGSCGCSYNPFADQGRESEITELSFALRKLTLMKCQMKKWRMERLQLESEARGLKQVLQAHGLNDDIVRPDPLLAHLREQNERLENEKAELEESVKSLSEAVSEYENQESSTSDAVNKLREKIRTMKETNAVEKRRLRDLISGLKIRLQEAENESSCAALNRLRAKLREMMKGGQEADQRVSMVVQRSIETLTELTGNVDDLKAEIERLRAEIKRLKDLLKTCEERKDVGVETTVVDVKPVEKPLDEMDDSELLNRIKELEALIAQLRKQLIDKDAAMNDLHNQLFDAGADNKRLSTDLDQMKVSYNAVMQEVKAMKDELKKRDAKVSELLKDLKASAIDMLGLNRLQSEMDAIKPQMYNLEVERGQLLSELGRVRGVVSERNDQIIKILEDRDKHVKALAKASSLMQATVEPMKEKETALKNEVQGFKDRIAELELELSELRKKLAQLEEENSKIPGLVHKIKELEDELEKLRSQLTEANSRVVELEKEVARFKAEKEELESELGEARKELEKLKEELAKEKAAKDAALKDLEDCRAENEKLRAENERLINELNAAKAESDKLRNDLGKLKEEMENLKAENDQLKDQFNVAKNEIDKLRGELENLKAENDKLKNDVEKLNGDMDKLKTENNSLKEELDKLRNKLAGLEDEIEKLKSALAAAEAEAEKLKQDLENTKNELDKLRAESDQLKSQLADAKADNERLKIELEKLKGETEKLKQENSELKAERDKLQADLEKLGKRLDELESENEKLKNALTAAKAEVERLKNDLDKMKSDIEKLKAENDQLNNELANAKAENSRVRKELDELKGEMEKLREENKNLKAERDGLEAELKKLRGELDSLKGENERLKGENERLKAELDKLKSENEALKSETEKLKKSLSDAEAKLKELEASNADLANKISELKNQIDKLQTELASEKAAKEAAVKELAAIKAELKALLAEMDKLKADCDRLTGQVDDLTKQLSDLKNDFDQLKSKYAELATERDKLKTELDKFKAENDKMKNDLEKLKAELDALKTENGKLKEENGKLKEEIGKLKEENGKFKEENEELRKDLRDAEARIKTLENEVKSCEEEKSRLRKEIEGLKNGIEELKKELDAEKAAKEAALKELSALKTELAALRAELDKVRAENAKLKSELDKLKSENDELRNDNNKMKGEIDKLKAEVAKLENDLSALKAENTKLKEDLDKLDAENKALRSDNNKLKGELDQLKSENAKLKDDLAAAREEATKLKNDLDKLKSDLDAMRAENNKMKGELERLKSDIAGLQNDLNTLKADNAKLKEDLDKLNAENKSLRADNDKLKGELDQLKSENAKLKDDLAAAREEVTKLKNDLDKLKSDLDAMRAENNKMKGELERLKSDIAGLQNDLNTLKADNAKLKEDLDKLNAENKSLRADNDKLKGELDQLKSENAKLKDDLAAAREEATKLKNDLDKLKSDLDAMRAENNKMKGELERLKSDIAGLQNDLNTLKADNAKLKEDLDKLNAENKSLRADNDKLKGELDQLKSENAKLKDDLAAAREEVTKLKNDLDKLKSDLDAMRAENNKMKGELERLKSDIAGLQNDLNTLKADNAKLKEDLDKLNAENKSLRADNDKLKGELDQLKSENAKLKDDLAAVGEEVSRLKNDLDKLKSERDALQAENNKIKGELERLKSENAKLHKDLDVLKQENGKLKSEVDKLKIDNEKLNKDLQKANSELDKHKAEVDNLKKLTKKLKETPRKLDTLKPNYTVEKELQIKVPRVVPSGKIIKSDQRPSVIKKDQGSQGAGCGDYENANEQLMRNMNMQERAVQRIREFIKYVLGERSSPPEMARELIHRMSSVTRNNFPEDIMRILKESQFLSESIFNAETEVQRLAKPEEELNNPKDEDIRDVDGLGDAFDAESWLKTLTLTELAELHDRICLVTSCMVQQDINPEDYVDGIETDGICHPCSTVTEEFTDDSIPDYEALNRRIGALQMQINKKQDEAAKKVKKMRQSMWREQENLIKLSEEMNAQKRRNLSMKIKINENIEADVEGETNKNEKTVLCGGKLPPKIGQRTDDMFGEKNCNSGRFIGANSNVALNLVPCFNPEENKPDSGSSLTAVRVGRKKPPLCAAPVRHADVPCRVKPRCPSAMNRVI

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00938403;
90% Identity
iTF_00938403;
80% Identity
iTF_00938403;