Basic Information

Gene Symbol
-
Assembly
GCA_964007535.1
Location
OZ023328.1:26501643-26521753[+]

Transcription Factor Domain

TF Family
TSC22
Domain
TSC22 domain
PFAM
PF01166
TF Group
Basic Domians group
Description
These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Drosophila protein bunched [1] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 36 0.14 1.9e+03 0.2 0.6 19 42 81 108 75 118 0.64
2 36 0.6 7.8e+03 -1.8 0.4 26 49 300 326 285 334 0.53
3 36 1.6 2.1e+04 -3.2 0.0 20 41 332 353 319 360 0.58
4 36 0.0023 30 6.0 0.1 25 49 390 412 388 416 0.85
5 36 0.13 1.7e+03 0.4 0.3 24 49 537 562 530 570 0.75
6 36 0.0095 1.2e+02 4.0 0.7 15 34 574 593 570 606 0.71
7 36 0.0035 46 5.4 0.2 22 46 625 650 615 658 0.82
8 36 6e-05 0.78 11.0 0.0 21 45 664 688 663 691 0.91
9 36 3.6e-05 0.46 11.8 0.0 23 48 706 731 704 738 0.92
10 36 4.2e-06 0.055 14.7 0.1 21 48 742 769 736 777 0.90
11 36 0.00043 5.6 8.3 0.0 21 45 781 805 780 813 0.89
12 36 2.3e-05 0.3 12.3 0.0 23 48 823 848 820 853 0.91
13 36 5.8e-06 0.075 14.3 0.0 21 43 859 881 858 888 0.89
14 36 0.00027 3.5 9.0 0.0 21 45 897 921 896 928 0.89
15 36 3.7e-05 0.48 11.7 0.0 23 47 939 963 937 967 0.92
16 36 1.3e-05 0.17 13.1 0.1 21 45 975 999 968 1008 0.89
17 36 0.00018 2.4 9.5 0.0 21 45 1014 1038 1013 1045 0.89
18 36 2.2e-05 0.28 12.4 0.0 23 48 1056 1081 1053 1090 0.91
19 36 3.1e-06 0.04 15.1 0.2 21 45 1092 1116 1086 1124 0.91
20 36 0.00018 2.4 9.5 0.0 21 45 1131 1155 1130 1162 0.89
21 36 6.4e-05 0.83 10.9 0.0 23 47 1173 1197 1170 1205 0.89
22 36 3.3e-06 0.043 15.1 0.2 21 45 1209 1233 1203 1239 0.91
23 36 0.00018 2.4 9.5 0.0 21 45 1248 1272 1247 1279 0.89
24 36 1.4e-05 0.19 13.0 0.0 23 49 1290 1316 1287 1323 0.89
25 36 1e-06 0.014 16.7 0.1 21 48 1326 1353 1318 1360 0.91
26 36 5.2e-05 0.68 11.2 0.0 22 48 1365 1391 1364 1397 0.88
27 36 1.8e-05 0.24 12.7 0.0 22 49 1403 1430 1402 1435 0.90
28 36 1.6e-05 0.2 12.9 0.2 21 45 1439 1463 1438 1475 0.88
29 36 0.063 8.2e+02 1.3 0.2 19 34 1476 1491 1471 1497 0.81
30 36 0.0082 1.1e+02 4.2 0.8 27 51 1506 1528 1503 1534 0.79
31 36 0.2 2.5e+03 -0.2 0.2 26 47 1530 1552 1524 1561 0.72
32 36 1.9 2.5e+04 -3.4 0.1 28 41 1608 1621 1607 1622 0.54
33 36 0.04 5.2e+02 2.0 0.1 15 37 1635 1657 1630 1664 0.68
34 36 1.2 1.5e+04 -2.7 0.0 25 43 1786 1804 1785 1819 0.73
35 36 0.35 4.5e+03 -1.0 2.9 15 45 1819 1852 1813 1861 0.65
36 36 0.52 6.8e+03 -1.6 0.4 15 31 1891 1907 1889 1911 0.84

Sequence Information

Coding Sequence
ATGGTCCTAATATGGCAATATTGGCCAATTGCTCTATTCCAGTTGTATAAACGTGATGAGGATGGAGCCGACAGCTCGGGAGCCGAGTCTGACATGGACGACAGTGAGTTAAAGAAGTTGCTATTGCGCAAAGAGTCACAGATCGCCTCGTTGAAGAACCAATTGGCCGCTAAGGTCCAGGAATTAGCAGTGAAAGACAAGGCAATGAGTTCTGCTATGGGGAGTGTCGACAACTCGGACCTTCGCAAACAGATAGAAAACTGccagactgaattgaataaaattaaaaatcagtcATCTGATGAGTGTGAGGCCTTGCGCAGAGAAAAAGATGCCTTGGCAGAAAAGCTCAGCGGTCTTGCCAACATCGAGGCCGCTAACCTCGAGCTGAAGCGCAAGTCCGACAGCATGGACGACGTCATAGCTGAGCGTGACATGTTGCGCATGCGCCTCAAGGACTATGAGAAAATAGACGCCGATCTAGACAATCTCAAGTTTCGCGCCAAAGAGGCTGATGTGATGCGCATGGAGCGAGACAGGCTGAAAGATCGCAATGATGAGTTAGCCACCATTGAGACGGCCTATCTGAAGCTACTGGAGGAGACGAAAGATTATGAAACGCTCGCGGCTGAGCGAGAGATGTATAGGTTGAAGAACAATGATGGCCGTAAAGGACTTGACCAAGCAATGAAGGATCTTGAACAAGAGCGAGACACCCTGAACAGGGATCTGAAAGAGTACGAGTGCGCGATAGCTGACCAAGAAGACGAAATAAAGCGTCTTTTGCTGATGGTGGAGAAACTGAGCACGACCACTAAGAGCAAGGAGGATCGTATGCAGAAGGGTTTGTCTGAATTGCGCCTCCAGATAGACAACAAAGATGAGCGCATTAGCCTCCTTGAGAACCAGCTGGCTGAGTTGCCAAACCTTGAGGCAGAGATCAAGTTGTTGAAAGATCGGCTGAAGAATACAATTGTCGGTGTTGCTCAAGAGAAAGATCAGCTTGACAATTTGTCCAAGACGGAGAGCAAACTGCGCGCAGACATGGGCACACTCGAGGCCGAACTGCAGAATGAACGTCGGCTCAGCAAGTCACTCAAAGACAAGCTGAACAATCTCAGTTCGAGCGGTGCTCAGCCAGTGAAAACTGGGCAGTTCAGCGAGGAAGAGCTGGTGGCGAAGATTAATGCTGTGCAAAACGAGAACGATGCTCTGAAAGCTTCGCTAGAGGTGTTGAAGGGGGCTGCTGGCGGTGGTATTGATCTAGCAGAAGAGCTAGCCCGAGAGCGAAAGGCTAGGCTGTATGCCGAAGAGATGTGTAGGGAAGCTGTCGACACATTGGAGAAAATGACAGCAGACAAGGAGAAGGAGTTCAAAGCCAAGCTTGGAGCAGCATCTCCTAGTGAGAAAGAACTCGTACGTCACTCTGAGAAAAAGTTtgaggaggagaagaaggaaAAGGAGATGCTTCAAGGAGTGTTGGATTTCACTCAAGACGAGTTGGAAAACGAAAAAATAGCTCGACGAAAATGTGAGGAGAGGATAAAGCAATTAGAACAAGCCATGTCAATGGACAGTAAGGGATCCGAAGTGAGCAGGAAAGATGCTCTCGACATAGCTAACGAGTTGgaacagctaaaaaaacaaaatgaccTACTCAAGAAGTCAATTGCTGGTAGTCAGACAGATTTGCAGAAAAGtcaatttgaaatgtttgataAAGTCCAAGCGCTGGAGAAACAGATCAAGGAATTACTGGCTGAAAATGCAGAGCTGAGGAAGTCGGATGCCATCCGCGATGAACAGGAAAAGGTACTTCGTTTGCAGTTAGAGAGCATCCAAGATTTGCTGAAAGAAGAACAAAAAAACGCCAAGATTCAAGAACAGTTGAACGATAAAGTGAAATCATTGGAAGCAGAAAATGCAGAATTGAGAAAGGGTCTTCCAACTGGTACAACTTCAAAGCCATCAACCACAGGTCCAGATGACAACAAGGTTAAAGAATTGTTGATGAAAATCCAACAGTTGGAAGCAGAAAATGCAGACTTGAAGAAAGGAGTACCAGCTGGTTCAACTCCAAAACCTTCGACAGCAGGTGCAGATAACAGTTATATAATAAAAGAGTTGGAGATGAAAATCAAAGATTTGGAAGTAGAAAATACAGATTTGAAAAGAGGCTTACCACCAGGTACCACCCCTAAGCCTTCAGTGACAGGGGcagataacaagatgaaagagttagaagagagaataaaaaaattagaGGCAGAAAATGCAGACTTGAAGAAAGCTATACCTCCTGGTACAGCTCCAAAGCCATCATCAACAGTGCCAGATGACAATAGAGTGAAGGATTTGTTAATGAAAATTCAAGACTTGGAAGCAGAGAATGCAGACTTGAAGAAAGGAGTACCTGCTGGTACAACTCCAAAACCTTCAAAGACCGGTGCAGATTATAGTAATATAATTAAAgagttggaaatgaaaatcaaagATTTAGAAGTAGAAAATACAGATTTGAAGAAAGGCTTACCACCAGGTACGACTCCCAAGCCTTCAATGACAGGGGCAGATAACAAAATAAAGGAGTTGGAAAAAAAGATTAAAGATTTGGAGGCCGAAAATTCTGCCTTGAAAGGTTTACCTCCTGGCACGACTCCAAAGCCATCATCAACAGGGCCAGACGACAATAGAGTGAAAGACTTGTTAATGAAAATCCAAGAGTTGGAAGCAGAAAATGCAGACTTGAAGAAAGGTTTACCTGCTGGTACAACTCCAAAACCTTCAAAGACCGGTACAGATAATAGTTATATCATAAAGGAGTTAGAAATGAAAATCAAAGATTTAGAAGTAGAAAATACAGATTTGAAGAAAGGCTTACCACCAGGTACGACCCCCAAGCCTTCAGTCACAGGGgcagaaaacaaaataaaagaGCTGGAAAAGACAATTAAAGATTTGGAGGCTGAAAATTCCGCCTTTAGGAAAGGATTACCTGCAGGTACGACTCCGAAGCCATCATCAACAAGTGCAGACGACAATAGAGTGAAGGATTTGTTAATGAAAATCCAAGAGTTGGAAGCAGAAAATGCAGACTTGAAGAAAGGAGTACCGGCAGGTACAACTCCAAAACCATCGAAGACTGGTCCAGATTATAGTAATATAATTAAAgagttggaaatgaaaatcaaagATTTAGAAGTAGAAAATACAGATTTGAAAAAAGGCTTACCACCAGGTACAACTCCTAAGCCTTCAGTGACAGGGGCGGATAACAAAATAAAGGAGTTGGAAAAGAAGATTAAAGATTTGGAGGCTGAAAATTCAGCTCTCAAAAAAGGTTTACCTGCAGGTACGACTCCAAAGCCATCATCAACAAGTGCAGACGACAATAGAGTGAAGGATTTGTTAATGAAAATCCAAGAGTTGGAAGCAGAAAATGCAGACTTGAAGAAAGGAGTACCGGCAGGTACAACTCCAAAACCATCGAAGACTGGTCCAGATTATAGTAATATAATTAAAgagttggaaatgaaaatcaaagATTTAGAAGTAGAAAATACAGATTTGAAAAAAGGGAAACCACCAGGTACAACTCCTAAGCCTTCAGTGACAGGGGCGGATAACAAAATAAAGGAGTTGGAAAAGAAGATTAAAGATTTGGAGGCTGAAAATTCAGCCCTCAAAAAAGGTTTACCTGCAGGTACGACTCCAAAGCCACCATCAACAGGTGCAGATGACAATAGAGTGAAGGATTTGTTAATGAAAATCCAAGAGTTGGAAGCAGAAAATGCAGACTTAAAGAAAGGAGTACCGGCAGGTACAACTCCAAAACCATCGAAGACTGGTCCAGATTATAGTAATATAATTAAAgagttggaaatgaaaatcaaagATTTAGAAGTAGAAAATACGGATTTGAAAAAAGGCTTACCACCAGGTACGACTTCTAAGCCTTCAGTGACAGGGGCGGATAACAAAATAAAGGAGttggaaaagaaaattaaaGATTTAGAGACAGAAAATGCAAACTTGAAGAAAGGCTCACCTCCTAGTACGACCCCTAAACCTTCAGCCACAGGGGCAGATCCCAAAATGAAGGAGTTGGAAAAGAAAATCAAAGATTTAGAGACAGAAAATGCAAATTTGAAGAAAGGCTCAGCTCCTAGTACAACCCCCAAGACTTCAGGAACAGGGGAagatttcaaattgaaagagTTGGAAAAGAAAATCAAAGATTTAGAGGCAGAAAATGCAAATTTGAAGAAAGGCTTACCTCCTAGTACAACGTCCAAACCTTCAATGGTAGGGGCAGATAAAATAAAGGAGCTTGAAAAGAAAATAAGAGATTTGGAGGCAGAAAATGATGGGCTGAAAAAAGCCCAGTCTGGTTCAAAGACCGATATTTCGAATAAATTTATTGCTTTAGAAAAGAAAATCAAAGATTTGGAGGCTCAGAATGCTGAATTAAGAGCCAGACAATTAACTGAACCAGTCAGCGGACGTCAGGACTCTGAGAAACGAATGAAAGAATTAGAAGCTGAGAATGCAAAACTGAAGAAAACGAGCGAGGACCTTGAAAAGACTCTCAAGAACCGAGCCAATGAACTTCAAACAGCAAATGATAATCTTAAAAGAATCCAACTCTCACCAGAGGATCGTGCCAAATTCGATGAAGAGCTCGATGCTAGGAACAAGGCCGAAAAAGCTAAAATGGAGTCTATTATAGAGCAGCTAGAAGGTCAAGTCAAGTCGTTCAAGAACAAAGATACGGATATTTCGACCTACAGCCAAGGATTGCCAAAGTCTGATGACGATAAAAAACGCATCAAGCAACTTGAAGATGAGGTAAGACTGTTCCGTTCTGGTGATAAGTCAGGAGCTTCTACCGCAGCGTACACAGAAATAGATCGTCTGGAGAAAAAATGTGCAAAATTAGAAGCAGAGATTGAGAAACTCAAAATGGAAAAGACAGGTTTGGTTGATAGTTACGAACATCATATAGCTGGTCTTAcggaaagttttgaaaaccGTCTCAGAATGAAAACAAAAGAGTCGATGTTAAAACTGCAGGAGagcattgaaaagttcaaacaAACGCTGAGTCCTAAAGATCACGCTGCCAATATGAAAGCAGTCGAAGCAAAAAGTGCAAAGAAGTTGGCAACGATGAAAGAAGAATATGAGAAACGTATCGCCGACATCATTGCTGCCAACTCCAAAGAAAAGGAAACAACTAATAAGGTATACAATGGCAACTTAACGACATTGAAAGAAACGTATCAACGCGATATGAGGGCATTGCACGAGCGTCACAATGCTAGCGTAACACAATTGCAGGTCATCCACGAAGAAGAAGTACAAAAGATTAACAGCGAGTATGAATCAGAACTCCAGGTCTTGAAAAATAGTCAGGATGGAGTGATAAGCGAAGTGATGAAGCAGTCAGACTCAGAGATCGAGAAGTTGAAGAAGAATTATGATGACATGGTCAACTCAATCATGAAAAGACATCAAGAGGAGACAGAAAATCTTCAAAAAAAACAGAATGAGAGCACGgcaaatattgagaaaaaattcaAGCGAGAGACAGGTTTAGATGATGACTTTGATAGGGCAAAACTTGAATATGAAGCAGAATTGCAGCGTTTGAAAGatagtaataaaaaaaaaatcgaggtTTTAAAAGTGAGGTATGCAGAACTGATGGAAGATGAGCGTAGAAAGAATAATGGGGAAGTAGGCAGATTAGGAACGACGAGAAACTTGGAGCAATCGCGAACAGAAGTATATGAGTTACGTTCCACAGGTGGCGAGTTTGACAAGCAGGTCGAACTGCGTGGCACGAGCGTTTCTGGTCTCAGCCTTGAAGGAATCCTACGCAAGGCACTGGCACAAGGATTCGAATCGTTGTCGTTTGATGAGCTCTCATTCTTACACGAGCGCTGCTGTTCATCCACGGCCCGAATGATCATGGAGAACCCCGCATTGCAATTGGCACCCATGAAGTTCACGCTGCAGGGTGCCGACAAGCAGCAGCTGTGTAGTCGTATTGCGTGTCTTGAGCAGGAGATTTCACGCAGGCAGTTGGAAACGCAGAAGAGAGTGAGTGCGCTCGAGGGCTTGGTGAAGCATGAGAAGCGACGTCTCAGCGAGGTGAAGGATGCACTGACACAGTCGCTGACTCAGGAGCAACGGCGCTCGCTGGAGCTTCACTCGCGGCTCGGCACTCAGACACAGCAGCAGGCCGAGATCGAGTCCGAGCGCGACTTGCTACGCAGACAGCTTGCTTACCTTGAGGGGAAGGCCAAGATTGCAATTCCAGAGctggagaaagagaaaaagaagaatCGCATCATGCTGGAGGAATTAAACAAAGAGCGTAACAACATCAAACACCTGCAGTCTCTACTTTCAGCTGAACGCATTCGAGCCAAGACTTCCAACCTCAAGGAGTCGGACcttatcgagACTCTGCGCATCAAGCTGGAGACCGCACTTGACAACGAGGCACGCATGAAGTTCAAGTACGAAAAGCTAGAGGCATTATACAACAGCTTGCAGGTGCCTACCAACCAACGCCGTCCCTGTACCACCTCCCGGCCTGGTACCGCTTCCCGTCCTGGTTCCTCGTCATGTTCAGAGTCGTGCCCTGCTCCATGCAATACTCGATGCTTGGATACTTCCTCTACCCCATGCCAACCTATCCGGCTGCAAGATTCGTACAAGCTGCAGGTACGGGATACGAGATACATATAG
Protein Sequence
MVLIWQYWPIALFQLYKRDEDGADSSGAESDMDDSELKKLLLRKESQIASLKNQLAAKVQELAVKDKAMSSAMGSVDNSDLRKQIENCQTELNKIKNQSSDECEALRREKDALAEKLSGLANIEAANLELKRKSDSMDDVIAERDMLRMRLKDYEKIDADLDNLKFRAKEADVMRMERDRLKDRNDELATIETAYLKLLEETKDYETLAAEREMYRLKNNDGRKGLDQAMKDLEQERDTLNRDLKEYECAIADQEDEIKRLLLMVEKLSTTTKSKEDRMQKGLSELRLQIDNKDERISLLENQLAELPNLEAEIKLLKDRLKNTIVGVAQEKDQLDNLSKTESKLRADMGTLEAELQNERRLSKSLKDKLNNLSSSGAQPVKTGQFSEEELVAKINAVQNENDALKASLEVLKGAAGGGIDLAEELARERKARLYAEEMCREAVDTLEKMTADKEKEFKAKLGAASPSEKELVRHSEKKFEEEKKEKEMLQGVLDFTQDELENEKIARRKCEERIKQLEQAMSMDSKGSEVSRKDALDIANELEQLKKQNDLLKKSIAGSQTDLQKSQFEMFDKVQALEKQIKELLAENAELRKSDAIRDEQEKVLRLQLESIQDLLKEEQKNAKIQEQLNDKVKSLEAENAELRKGLPTGTTSKPSTTGPDDNKVKELLMKIQQLEAENADLKKGVPAGSTPKPSTAGADNSYIIKELEMKIKDLEVENTDLKRGLPPGTTPKPSVTGADNKMKELEERIKKLEAENADLKKAIPPGTAPKPSSTVPDDNRVKDLLMKIQDLEAENADLKKGVPAGTTPKPSKTGADYSNIIKELEMKIKDLEVENTDLKKGLPPGTTPKPSMTGADNKIKELEKKIKDLEAENSALKGLPPGTTPKPSSTGPDDNRVKDLLMKIQELEAENADLKKGLPAGTTPKPSKTGTDNSYIIKELEMKIKDLEVENTDLKKGLPPGTTPKPSVTGAENKIKELEKTIKDLEAENSAFRKGLPAGTTPKPSSTSADDNRVKDLLMKIQELEAENADLKKGVPAGTTPKPSKTGPDYSNIIKELEMKIKDLEVENTDLKKGLPPGTTPKPSVTGADNKIKELEKKIKDLEAENSALKKGLPAGTTPKPSSTSADDNRVKDLLMKIQELEAENADLKKGVPAGTTPKPSKTGPDYSNIIKELEMKIKDLEVENTDLKKGKPPGTTPKPSVTGADNKIKELEKKIKDLEAENSALKKGLPAGTTPKPPSTGADDNRVKDLLMKIQELEAENADLKKGVPAGTTPKPSKTGPDYSNIIKELEMKIKDLEVENTDLKKGLPPGTTSKPSVTGADNKIKELEKKIKDLETENANLKKGSPPSTTPKPSATGADPKMKELEKKIKDLETENANLKKGSAPSTTPKTSGTGEDFKLKELEKKIKDLEAENANLKKGLPPSTTSKPSMVGADKIKELEKKIRDLEAENDGLKKAQSGSKTDISNKFIALEKKIKDLEAQNAELRARQLTEPVSGRQDSEKRMKELEAENAKLKKTSEDLEKTLKNRANELQTANDNLKRIQLSPEDRAKFDEELDARNKAEKAKMESIIEQLEGQVKSFKNKDTDISTYSQGLPKSDDDKKRIKQLEDEVRLFRSGDKSGASTAAYTEIDRLEKKCAKLEAEIEKLKMEKTGLVDSYEHHIAGLTESFENRLRMKTKESMLKLQESIEKFKQTLSPKDHAANMKAVEAKSAKKLATMKEEYEKRIADIIAANSKEKETTNKVYNGNLTTLKETYQRDMRALHERHNASVTQLQVIHEEEVQKINSEYESELQVLKNSQDGVISEVMKQSDSEIEKLKKNYDDMVNSIMKRHQEETENLQKKQNESTANIEKKFKRETGLDDDFDRAKLEYEAELQRLKDSNKKKIEVLKVRYAELMEDERRKNNGEVGRLGTTRNLEQSRTEVYELRSTGGEFDKQVELRGTSVSGLSLEGILRKALAQGFESLSFDELSFLHERCCSSTARMIMENPALQLAPMKFTLQGADKQQLCSRIACLEQEISRRQLETQKRVSALEGLVKHEKRRLSEVKDALTQSLTQEQRRSLELHSRLGTQTQQQAEIESERDLLRRQLAYLEGKAKIAIPELEKEKKKNRIMLEELNKERNNIKHLQSLLSAERIRAKTSNLKESDLIETLRIKLETALDNEARMKFKYEKLEALYNSLQVPTNQRRPCTTSRPGTASRPGSSSCSESCPAPCNTRCLDTSSTPCQPIRLQDSYKLQVRDTRYI

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00241926;
90% Identity
-
80% Identity
-