Basic Information

Gene Symbol
-
Assembly
GCA_963966675.1
Location
OZ016545.1:7813190-7823729[-]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 48 2 1.9e+03 -0.4 0.1 26 50 44 68 40 75 0.67
2 48 1.9 1.8e+03 -0.4 0.8 33 55 89 111 86 114 0.66
3 48 7.4 7.1e+03 -2.3 1.0 36 60 137 161 128 164 0.76
4 48 0.00051 0.48 11.1 1.4 30 60 203 233 196 236 0.89
5 48 1.9 1.8e+03 -0.4 2.8 28 63 268 303 266 305 0.91
6 48 0.1 95 3.7 0.3 29 55 355 381 345 389 0.63
7 48 5.9 5.6e+03 -2.0 5.6 27 62 395 434 392 437 0.65
8 48 1.4 1.3e+03 0.1 3.7 33 62 443 472 436 475 0.87
9 48 0.00012 0.12 13.0 3.0 24 63 476 515 474 517 0.92
10 48 0.0083 7.9 7.2 5.8 24 64 504 544 504 545 0.94
11 48 0.00035 0.33 11.6 4.3 26 63 541 578 538 580 0.90
12 48 0.34 3.2e+02 2.0 7.9 24 60 581 617 578 622 0.92
13 48 0.00023 0.22 12.2 2.2 24 63 630 669 629 671 0.95
14 48 0.002 1.9 9.1 8.1 23 63 657 697 656 699 0.95
15 48 0.11 1e+02 3.6 8.4 18 63 673 718 671 725 0.78
16 48 0.0012 1.1 9.9 6.3 33 60 723 750 707 761 0.71
17 48 0.15 1.4e+02 3.2 2.0 22 62 747 787 739 790 0.70
18 48 0.0043 4.1 8.1 8.0 24 62 777 815 773 825 0.57
19 48 0.0024 2.3 8.9 4.6 21 56 830 865 818 870 0.78
20 48 0.028 26 5.5 8.8 27 65 857 895 855 898 0.65
21 48 0.00015 0.14 12.8 6.7 29 63 894 928 893 930 0.90
22 48 0.00012 0.11 13.1 4.4 22 65 929 972 928 972 0.93
23 48 0.00019 0.18 12.4 7.9 22 62 978 1018 977 1021 0.90
24 48 0.00059 0.56 10.9 3.5 24 64 1022 1062 1019 1063 0.89
25 48 0.00038 0.36 11.5 12.7 22 65 1069 1112 1058 1112 0.94
26 48 0.002 1.9 9.1 3.6 26 57 1101 1132 1099 1140 0.87
27 48 0.077 73 4.1 3.5 36 57 1111 1132 1110 1168 0.79
28 48 0.00013 0.12 13.0 4.7 25 63 1170 1208 1159 1210 0.85
29 48 0.01 9.7 6.9 1.7 24 63 1204 1243 1201 1245 0.92
30 48 0.00016 0.16 12.6 7.0 24 64 1253 1293 1252 1294 0.93
31 48 0.053 50 4.6 6.9 26 64 1297 1335 1292 1336 0.81
32 48 0.03 29 5.4 3.2 29 64 1321 1356 1320 1357 0.91
33 48 0.029 27 5.4 7.9 25 58 1338 1378 1336 1381 0.65
34 48 8.7e-05 0.083 13.5 10.8 21 64 1362 1405 1356 1406 0.91
35 48 0.0011 1.1 9.9 7.3 21 56 1376 1411 1373 1418 0.55
36 48 0.0048 4.6 7.9 7.5 23 53 1434 1464 1432 1478 0.86
37 48 1.1e-05 0.011 16.3 7.5 26 65 1482 1521 1479 1521 0.91
38 48 0.021 20 5.9 10.6 24 65 1536 1577 1519 1577 0.91
39 48 2.3e-05 0.022 15.3 12.8 22 64 1562 1604 1561 1605 0.94
40 48 0.34 3.3e+02 2.0 5.0 24 61 1599 1636 1599 1640 0.81
41 48 0.098 93 3.7 11.6 29 63 1635 1669 1633 1678 0.88
42 48 0.0077 7.3 7.3 10.1 26 65 1681 1720 1671 1727 0.70
43 48 0.12 1.2e+02 3.4 7.1 24 61 1721 1758 1718 1762 0.84
44 48 0.27 2.6e+02 2.3 6.6 21 60 1760 1799 1754 1803 0.88
45 48 4.4e-06 0.0042 17.7 6.3 27 64 1805 1842 1798 1843 0.91
46 48 0.0011 1.1 10.0 2.5 25 61 1838 1874 1837 1877 0.92
47 48 1.6 1.5e+03 -0.2 1.0 28 60 1928 1960 1921 1979 0.71
48 48 4.3 4.1e+03 -1.5 1.2 36 51 2062 2077 2041 2085 0.64

Sequence Information

Coding Sequence
ATGGAATCGATGAATCCACCGAACGAGCCATGTTGTTGCTGCAGTTACAATCCCTTCAGCGACAATGCGAAAGAATCGGAGATTTACGACCTCTCGTTTGCCCTGAGGAAACTGGCCGTAATGAAGTGTCAGATGAAGAAATGGCGAATGGAACGACTTCAGTTCGAGAGCGAAAATAGGTCTTTGAAACAAGCCCTCCAGTCATTCGGTGTAAATGTGGATGAGATATTGAAGCCTGATCCGCTGCTCGTGCAGTCCCGGGAAGAAATCGAATGGCTGCAAAATGCAAACGCGGCGCTCGAAGATAAAGTGAGGGATCTAGAAGAAACCCTCGCCGAACGAGATTGCTGCGACGATCCTTGTGCATCGGTACACTATATCAGGGAGAAGATGAGAGATCTGCGGGAGCGCTTTGCACTTGAAAAAAAAGAATTAAGGGACGTGATATCGAACTTGAGATTAAAACTCGCGCAGACTGAGGAGGACGTGAGCTGTCCTGCGATATATCGCCTGAGGGCGAAGCTCCGCGAACTGATGAAAGGAGGTCAAACAGCGGACCAGCAGGTCTCGAAAGTCGTGGAGAGATCGATTGAAACGTTGGTGGATCTGTCGAACAGCTGCGACGATCTGCGTCTTGAAAACGAGCAGCTTTTGGCCCAGATAGCCGAGCTACGTCGTCAATTGGCGGAACTTGAAAAAATGCGACCGACAGAACCTGTGAGAGTAAGGCGAATGGTAGAACCTGTGGAAGAAATGCTACGAACAGCCGAGACAACGACGGTTCCTGAATACATAGACGTCTCCGACCTGCTGCAGAAACTCGAAGACTGCGATCACACCGTTTCCGAATTGAAACGTCAATTAGACGAGAAAGATAAGCTTATCGAGGCGCTGAAGAAGCAACTTGAAAACATGGTAGATCAGCAGGCTTTGCTGGATGAGATCGGTGCCATGAAGGGGGAACTCTCGAAACGAGACGATAAGaTGAGGAACCTTCTGAACGAGATGAGGCAATCAGAGATAGACCTGTTGGGCATGAGCAACCTAAGATCGGAGCTGGACACCTTGAAGCCTCGATTATCTGAACTCGAGGATGAGAGGGACTCGTTGATAGACGAAGTGGCGAAGATGAGAAAGTTATTAGCCGAAAGGAACGATCAAATAATTGATATACTGGAGCACAAGAACAAGTTAGAGCAGGAGCTGGCGGAGAAGGAGGCGGAGGCTCAGCAGATTATCGACAGCTTGAAACAGGAGATGGACGACTTGCAGGCGCGAATGGCGAATTTGCAGGGTGAGCTCGACGAGCGTGATAAGCGAATCGCGGAGCTGGAGAAGTGTTGCTCCGAGAGGGACGAGCTTTTAGAGAAATTACAGGCCGCGCAGGACGAGTTGGCTTCGCTGCGAGACGAACTAGCGTCTGCGAAAGCTACAATAGGGGATCTTAAGGGTGAAGTAGATGCCCTGAAGAAAGACAAGGAGAAGCTGCAGAGAGAGTTGGACGAAATCAAGGAGCAGATGAATGCGTTGAGTGACCAACTAGCGGAGGAGAGAGCCGCGAAGGAGGCTCTCCAAAAGGAGCTAGAATCTGCCCGGAAAGACCTCGACGACCTGCGAAAGGAGAACTCCGATCTGAAGGGTCAGCTGGAGCAAGCAAGGGAGGAGAATGACAAGCTCAAAAAAGAGAACGATGCTTTCAAAGCTGAGCTGGATAACATGACCTCGGAGCTTGATAATCTGAAGAAAGCGAATGACGAGCTGCAGAAGAACTTGGACGCAGCGAAGCTGGAGAACGAAAAGTTGAAGAACGATCTAGAAGAATCTCGACAAGAGATGGAAAGGTTGCGAGCTGATGGTGATGCGTTGAAGAGTCTAGATGCGGATACAAAAGCTAAGATGGACAAACTTCAGTCGCAATTGGACGATCTGAAAGGGGAAAGGGATCGGTTGGCTGGCGAGAATGCGGACATGAAGGCCAAGAATTTGGAATTGGAACGGCAATTGGATGACGCGACGAAAGCGTTGGAAAAAATGAGAGCAGAGAATGCTGATCTGCTGGAGGAGCTGGAACGTTTGAAGGCAGAACTAGCGAGAGCCAATAGCACGATTGCACAATTGAAAGAGGAGATGGAATCCCTGAAGAACGCGTTGGACAAGTGCGTGGGTGAGATGGAGAGGCTGACATCCGAGAACGAGGACCTTAAGGCGAAGAACGAGGCTCTTAAAGCGGAGCTTGATCGATGCAAGGCCGACAGGGACTCGCTGCAGAACGACTTACAGCGAACCAAGGCGGAATTGGACGGTGTGATCAACGAACTGAACAAGCTGAAGGAGGAGCACGGGTTGCTTAAACAAAATTTCGACCGACTGGAAGGTGAAAGGGATAAGCTGCAGGAGGAGCTGAACAATCTGAAGAACGAGATGGGGAAGCTTCGAGAGGAAATAGATGCCGCGAAGCAGAGAGAAGCTGCGTTAAATTCCGAGCTTGATCGTGTTAAGAAGGATGACGATGCGTTGGCATCGGAATTGGGTAAACTGAAAGATGAAAATGCAGCTTTGCAGGACGAGAGGGATCGATTGAAGAAACAATTGAACGACGCGAACGCTGACATCGAGAAGTTGAAGGAAGAGCTGGATAGGTTGAAGGACGAGCTGGCTAGGTTGAAGGACGAACTCGAGAAGCTGGGAAAGGAGAGAGATCAGTTGAAGCAGGACAATGAGAAACTGAATGCCGAAGTTGAGCAACTTCGCAAAGAAAGAGACTCGTTAAACGACGAGCTTGATCGCGTTAAGAAGGAGAAGGATGCGTTAGCATCGGAATTGGGCAAACTGAAAGATGAAAATTCAGCTTTGCAGGGTGAAAGGGATCGATTGGAGAAACAATTGGACGACGCGAAAGCTGAGATCGAGAAGTTGAAGGAAGAGCTGGCTAGGTTGAAGGACGAACTCGAGAAGCTGGGAAAGGAGAGAGATCAGTTGAAGCAGGACAATGAGAAACTGAATGCCGAAGTTGAGCAACTTCGCAAAGAAAGAGACTCGATAAACGACGAGCTTGGTCGTGTTAAGAAGGAGAAGGATGCGTTAGCATCGGAATTGGGCAAACTGAAAGATGAAAATTCAGCTTTGCAGGGTGAAAGGGATCGATTGGAGAAACAATTGGACGACGCGAACGCTGAGATCGAGAAGCTGAAGGAAGAGCTGGCTAGGTTGAAGGACGAGCACGAGAAGCTGGGGAAGGAGAGAGATCAGTTGAAGCAGGACAACGAGAAACTGAACGCCGAAGTTGAACAACTGCGCAAAGAAAGAGACTCGTTGAACGACGATTTAAAGAAGCTGCGGGATGAGAATAGCAGGTTACAAGGCGAGGTGAATGATTTGAGGGAAAAACTGAATGATGAGAGAAAGGCCAACGAGATCTTGAAGAAAGATTTGATGATGTTAGACAGCGAGGTGAAGGATTTGGGTAAGGCTCTTGATGAGGCCAGGGCGAAAAATGCTGCCTTAGCGGAGGAGAATCAAACGCTTCAGTCGAAGTTAAAGGATTTGCAAGATGAGCTTGGCAGCTTGAGGGCAGAGTGCGCTGATTTGAAGTCGGAGATCGAAGACTTGAAGCAGCAAATTGCTGATTTGAAGGCGAAGATCGCTAAATTGGAGGAGGACGTGGAACATTGGAAGTTAGAGAACTGTAAGATCAAGATGGAGGTGGATAAACTGAAGGACGACTTGGAGAACGCGTTAAAGGAGCTGAATGACTGCAAGGCCGCGAAGGAAGCGTGCCAGGCAGAGGTGAACCGTCTCAAAGCCGAGAAAGCCGATCTAGAGAAGAAGCTCGCAGATTTACAGTCAGAAATTGAACAGTTGAAGCAGTCGCTTGAAGCAGAGAAGTCGGCTAGAAGCAAGTGTGAGGCGGAGCTTGCAGCCCTGAGGAATGAGCTGGACGCGTTGAAGAAGGAGTTGGAAAACCTTCGAGACGAGAGCAGAAGGTGCAAAGGGGAATTAGAAGAGCTTAGGAAGAAGATCTCGGAGCTGACCAACGACCTGAACAAGTGCAAAGAAGAGGTCAACGCGTTGAGAGATGCGAACGAAAAATTGAAGACGGAGGTGAACGCGTTGAACAACGAGAAGAACAAACTTCAAGAGGAGCGAGATAGATTGAAGAACGAAGCGGAATCTCTTCGGGCAGAGAACGCAGACCTTCGTCAAGATCGAGACAAGCTAGCGAACGAAACTAACCGGCTGAGGGGCGAGGGTGATGGACAGAAATCGGAGCTCGACAACTTGAAGTCTGACTTATCTGCGCAGAAGGCTGCTGTTCAGAAGCTGCAAGACGATTTGAAGAAATGCCAGGACGAGAACGACAGGTTGAAACAGCAACTGGCTGAGATGGAGAAGTTGAAGGACGAGAAGAATCGATTGGAAAAGGCGTTAGCAGATGCGGAAGCTAAAGCGAAGGCTCTGGAAGACCAGCTGAAAGATCTGCAGGCTGAGAAACAAGAGCTGCTCAATGAGCTGCAACGGCTGCGCGACGAGCTGGGTCGCCTTATGAACGAGATTGACAAACATAAAGCTGCTACTGAAGCAGCCATGAAAGAACTGGCTGCCTTGAAGGAGGAATTAAGCAATTTGAGAGCCACGCTGGATAAAGCCCGCGCCGAAAACGAAGCATTGAACAAAGAGAACGAGAAACTGAAGTCGGACATGGCGAAATTAAATCAGCAGCTGCAGGCATTGCAGGAAGAGAACGCGAAGCTGAAGACGGAGAACGAGCAGTTGGCCTCGGAATTAGCAGCGGCAAAGGAGAAATTAAAAGAAGCCGAAGACATGTTGAACAATCTGCGCATGGAAAATGATGATTTGAAGAAGAGGATCGCTGATCTTGAAAATACTGTGAAGGAGTTGGAGACGCTGAAGAAACAGCTGGAGGATTGCAGGGCCGAGTTGGCGAGGTTGAAGGCGGAATTGGAGAAATTGAAAGAGGAGAATGAGAATTTGAATAACCAGCTAAGGGATGCTCAAAAAGAGTTGAAGGAGGTTAAGGACTTGAAGGATAGGCTGCAGCAGGAACTTGATGCTTTGAAGGCTGATCATGATAAATTGAAGGACGAGTTAGAGAAACTGAAGGAGATTAATGAGAAATTGAAGGACGAGTTGGACGCTTTGAAGGCATTGAATGATAAATTGAAGGATGATTTGAAAAATCAGAAGGCGATCAATGATAAGCTGAAGGAGGAGGCAGAGAAGTTGAAGAAAGAACTTGAAGCGCAGAAGGCTGCCAATGATATGCTGAAGGATGAGGCAGAGAAGATGAAGAAAGAACTTGAAGCGCAGAAGGCTGCCAATGAGAAGTTGAAGGATGATCAGAAGACGTTGTTGGCTGACATTGATAAATTAAAGGATGCGGGTAGAAGTTCTGAGGCAGGGTTAACGAATGAATTGGATAGGTTGAAGAAGGAGAATAATGACTTGAAGTCTGAAATTGAGAGGTTGAAGAAGGAAAATGATGAATTGAAGTCTGAAATTGAGAGGTTGAAGAAGGAATTGAATAATTGCCAGGATTATAATGATAGGCTGCGCAGAGAAATGGAAGGCTGCAAAGACGAACTTAAGAAACTGAAAGCTGCTGCAGATCAAGATACAGTGACTTTAAGAAGCCTAGCAGACGATTGTGGCGATTACATAAGGGCAAATGATTTGTTAAGGGAAAAACTTGACAAGCAAAACGAAGGTGTACAACGTGTCCGTGACTACATAACTTTCCTGGAGGGTAAAGTTGAAAATGAACCCAAAATGGCGGACGAATTGGAGGACGATTCCAGCGTTGACCCAGAAATGAAAAAGGAGATAGAGGAGTTACTGAAAAAGTCGCGGGATTTatctgaaaatatttataacactgaaaaagaaattcagaatatTGCTGCTATTCTGAAACAACTTAAGGATGTTCCAGTTAGCGACAAAGCGTTTGATCCCGATTCGTGGCTTAACTCGTTGAAGTTAACGCAATTGGCAGATCTCCACGACAAGATTTGCCTATTGACATCGGACATGGTGCATCAGGACAAGCCGAGGGCAGTTAGTTGTGTCCCAGGGAGTCCACTAAGTGCggattacaatattttaaatcaGCGGATAGCCGCTCTGCAGAAACAGATAGCGGAGAAGCAAATGGAGACGGGATGGAAGCTAGAAGAGCTGAAGCGAGCTCTTCGGCAGGAGCAGGCTAACCTAATTCGAATCTCCGACGAAATGAATTTAGAGAGAAAACGTAATTTGGCCCTTCAGTTAGGAATGGATGATGCATCTTAA
Protein Sequence
MESMNPPNEPCCCCSYNPFSDNAKESEIYDLSFALRKLAVMKCQMKKWRMERLQFESENRSLKQALQSFGVNVDEILKPDPLLVQSREEIEWLQNANAALEDKVRDLEETLAERDCCDDPCASVHYIREKMRDLRERFALEKKELRDVISNLRLKLAQTEEDVSCPAIYRLRAKLRELMKGGQTADQQVSKVVERSIETLVDLSNSCDDLRLENEQLLAQIAELRRQLAELEKMRPTEPVRVRRMVEPVEEMLRTAETTTVPEYIDVSDLLQKLEDCDHTVSELKRQLDEKDKLIEALKKQLENMVDQQALLDEIGAMKGELSKRDDKMRNLLNEMRQSEIDLLGMSNLRSELDTLKPRLSELEDERDSLIDEVAKMRKLLAERNDQIIDILEHKNKLEQELAEKEAEAQQIIDSLKQEMDDLQARMANLQGELDERDKRIAELEKCCSERDELLEKLQAAQDELASLRDELASAKATIGDLKGEVDALKKDKEKLQRELDEIKEQMNALSDQLAEERAAKEALQKELESARKDLDDLRKENSDLKGQLEQAREENDKLKKENDAFKAELDNMTSELDNLKKANDELQKNLDAAKLENEKLKNDLEESRQEMERLRADGDALKSLDADTKAKMDKLQSQLDDLKGERDRLAGENADMKAKNLELERQLDDATKALEKMRAENADLLEELERLKAELARANSTIAQLKEEMESLKNALDKCVGEMERLTSENEDLKAKNEALKAELDRCKADRDSLQNDLQRTKAELDGVINELNKLKEEHGLLKQNFDRLEGERDKLQEELNNLKNEMGKLREEIDAAKQREAALNSELDRVKKDDDALASELGKLKDENAALQDERDRLKKQLNDANADIEKLKEELDRLKDELARLKDELEKLGKERDQLKQDNEKLNAEVEQLRKERDSLNDELDRVKKEKDALASELGKLKDENSALQGERDRLEKQLDDAKAEIEKLKEELARLKDELEKLGKERDQLKQDNEKLNAEVEQLRKERDSINDELGRVKKEKDALASELGKLKDENSALQGERDRLEKQLDDANAEIEKLKEELARLKDEHEKLGKERDQLKQDNEKLNAEVEQLRKERDSLNDDLKKLRDENSRLQGEVNDLREKLNDERKANEILKKDLMMLDSEVKDLGKALDEARAKNAALAEENQTLQSKLKDLQDELGSLRAECADLKSEIEDLKQQIADLKAKIAKLEEDVEHWKLENCKIKMEVDKLKDDLENALKELNDCKAAKEACQAEVNRLKAEKADLEKKLADLQSEIEQLKQSLEAEKSARSKCEAELAALRNELDALKKELENLRDESRRCKGELEELRKKISELTNDLNKCKEEVNALRDANEKLKTEVNALNNEKNKLQEERDRLKNEAESLRAENADLRQDRDKLANETNRLRGEGDGQKSELDNLKSDLSAQKAAVQKLQDDLKKCQDENDRLKQQLAEMEKLKDEKNRLEKALADAEAKAKALEDQLKDLQAEKQELLNELQRLRDELGRLMNEIDKHKAATEAAMKELAALKEELSNLRATLDKARAENEALNKENEKLKSDMAKLNQQLQALQEENAKLKTENEQLASELAAAKEKLKEAEDMLNNLRMENDDLKKRIADLENTVKELETLKKQLEDCRAELARLKAELEKLKEENENLNNQLRDAQKELKEVKDLKDRLQQELDALKADHDKLKDELEKLKEINEKLKDELDALKALNDKLKDDLKNQKAINDKLKEEAEKLKKELEAQKAANDMLKDEAEKMKKELEAQKAANEKLKDDQKTLLADIDKLKDAGRSSEAGLTNELDRLKKENNDLKSEIERLKKENDELKSEIERLKKELNNCQDYNDRLRREMEGCKDELKKLKAAADQDTVTLRSLADDCGDYIRANDLLREKLDKQNEGVQRVRDYITFLEGKVENEPKMADELEDDSSVDPEMKKEIEELLKKSRDLSENIYNTEKEIQNIAAILKQLKDVPVSDKAFDPDSWLNSLKLTQLADLHDKICLLTSDMVHQDKPRAVSCVPGSPLSADYNILNQRIAALQKQIAEKQMETGWKLEELKRALRQEQANLIRISDEMNLERKRNLALQLGMDDAS

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00860130;
80% Identity
-