Basic Information

Gene Symbol
-
Assembly
GCA_011952205.1
Location
NW:568830-580838[-]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins form homo- or heterodimers. Each monomer contains a highly basic DNA-binding region adjacent to an α-helical region, and the helices of the two monomers fold together as a coiled coil.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 42 0.013 6.7 6.1 0.3 30 59 55 84 47 89 0.83
2 42 1.1 5.7e+02 -0.1 1.2 28 63 108 143 106 145 0.86
3 42 6.9 3.4e+03 -2.6 3.9 24 55 132 162 121 170 0.61
4 42 0.0021 1 8.6 5.7 29 63 188 222 186 225 0.76
5 42 0.0049 2.5 7.4 6.5 24 64 243 283 235 284 0.93
6 42 0.044 22 4.4 3.1 22 64 317 359 315 360 0.90
7 42 0.008 4 6.8 5.0 20 62 357 399 354 402 0.65
8 42 0.0091 4.5 6.6 7.2 24 64 396 436 393 444 0.82
9 42 0.074 37 3.7 3.9 21 64 456 499 455 500 0.90
10 42 0.0011 0.54 9.6 4.9 25 64 516 555 505 556 0.83
11 42 0.013 6.5 6.1 3.2 24 61 553 590 550 608 0.62
12 42 0.0091 4.5 6.6 0.2 41 63 605 627 591 633 0.59
13 42 0.0027 1.3 8.3 1.4 28 59 624 655 623 662 0.89
14 42 2.9e-05 0.015 14.6 3.0 25 65 666 706 655 706 0.88
15 42 0.048 24 4.3 3.2 34 63 703 732 700 752 0.51
16 42 0.019 9.4 5.6 7.0 24 60 739 778 736 783 0.81
17 42 0.011 5.4 6.4 0.6 28 63 767 802 765 804 0.87
18 42 0.0085 4.2 6.7 0.1 33 64 800 831 783 832 0.90
19 42 0.003 1.5 8.1 1.7 28 63 823 858 818 860 0.86
20 42 1.1e-05 0.0056 15.9 3.4 32 62 862 892 854 895 0.88
21 42 0.18 90 2.4 1.1 28 60 890 922 889 926 0.84
22 42 0.0006 0.3 10.4 0.5 21 63 914 956 913 957 0.94
23 42 0.0058 2.9 7.2 1.8 30 62 972 1004 962 1011 0.68
24 42 0.0026 1.3 8.4 1.5 28 60 1012 1044 1001 1048 0.73
25 42 0.61 3e+02 0.7 2.2 27 64 1072 1111 1066 1118 0.69
26 42 0.00095 0.47 9.7 1.0 18 62 1121 1165 1117 1168 0.88
27 42 0.23 1.1e+02 2.1 2.7 31 61 1169 1199 1164 1206 0.89
28 42 0.12 58 3.0 0.3 24 63 1214 1253 1211 1254 0.91
29 42 0.0033 1.7 8.0 4.0 26 62 1272 1308 1269 1311 0.90
30 42 0.48 2.4e+02 1.1 7.9 25 59 1313 1347 1302 1353 0.78
31 42 0.031 15 4.9 7.1 26 64 1370 1408 1353 1409 0.78
32 42 0.00031 0.15 11.3 3.5 29 64 1401 1436 1398 1437 0.90
33 42 0.0012 0.6 9.4 2.2 27 62 1413 1448 1409 1451 0.72
34 42 0.0056 2.8 7.3 6.1 28 59 1449 1480 1444 1486 0.86
35 42 0.14 70 2.8 6.0 30 63 1482 1515 1479 1517 0.83
36 42 0.022 11 5.3 1.7 27 57 1521 1551 1518 1554 0.86
37 42 0.00021 0.1 11.9 8.7 20 64 1549 1593 1549 1606 0.95
38 42 0.011 5.8 6.3 1.4 31 64 1588 1621 1585 1622 0.88
39 42 0.0047 2.4 7.5 1.5 31 64 1609 1642 1595 1643 0.83
40 42 0.012 5.9 6.2 3.4 21 54 1641 1674 1640 1675 0.91
41 42 0.00059 0.29 10.4 7.3 27 60 1668 1701 1665 1705 0.91
42 42 6.8 3.4e+03 -2.6 1.1 42 54 1885 1897 1865 1911 0.46
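The rows above can be read programmatically. The following is a minimal sketch, assuming each row holds exactly 13 whitespace-separated fields in the column order of the header line. The names DomainHit, parse_hit and filter_hits are hypothetical helpers introduced here for illustration, not part of HMMER or of this database, and the i-Evalue cutoff is an arbitrary example value rather than a recommended threshold.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """One row of the per-domain hmmscan table (hypothetical container)."""
    index: int       # "#": domain number
    total: int       # "of": total number of domains reported
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float     # bit score
    bias: float      # bias correction
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float       # mean posterior accuracy of the alignment


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated table row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


def filter_hits(hits, max_i_evalue):
    """Keep hits whose independent E-value is strictly below the cutoff."""
    return [h for h in hits if h.i_evalue < max_i_evalue]


# Three rows copied from the table above.
rows = [
    "2 42 1.1 5.7e+02 -0.1 1.2 28 63 108 143 106 145 0.86",
    "14 42 2.9e-05 0.015 14.6 3.0 25 65 666 706 655 706 0.88",
    "20 42 1.1e-05 0.0056 15.9 3.4 32 62 862 892 854 895 0.88",
]
hits = [parse_hit(r) for r in rows]
kept = filter_hits(hits, max_i_evalue=0.01)  # cutoff is an example value only
print([h.index for h in kept])
```

Filtering on the independent E-value rather than the conditional one is a choice made for this sketch. Either column can be passed through the same pattern.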

Sequence Information

Coding Sequence
ATGATATCGGATTTGAGGTTGAAGCTGGCGCAGACCGAAGAGGACATCAGCTGCCCCGCGATATATCGTTTAAGGGCAAAACTGCGTGATCTTATGAAAGGTGAAGCGGCTGAACAACAGATTTCCAAAGTTGTGGAGAAATCGATTGAAACCTTGGTGGATCTTTCCAAGAACTGCGATGATCTGCGTTTGGAAAACGAACGTCTTCTTGCACAAGTGACAGATTTGCGCAGTGCATTGGTTGATCtcgaaggaaagaaaacgcCAGAAACGATATTGCAAACGGCCGAAACAACCACAGTGCCGGAGTATATTGACATTTCAGATTTATTGCATAAGCTGAACAACTGCGAGGATATCGTGGCTGATCTAAGAAAGCAGTTAGAAGAAAGGGACGAGCAAATCGATGCACTGAACAAAGAACTGGAATCGATGAGCAGTCAAAAAGGTTTAGAGGAACAGATAGAAGCCATGAAAGAGGAACTTAGGAGGAAAGATGACAAGATCGCAGGACTGCTGAACAACCTAAGACAATCAGAAATAGACTTATTGGGGTTATCTTCTCTGAAATCCGAAGTGGAGAACCTGAAATCAGAGTTACATGATCTCAAATCAGAGAAAACTGAATTACTGAACGAACTAAACAAACTGCGAGAAGCACTGAAGGATAGAGACGATCAGATAATAGATTTACTAGAGCAGAGGAACAACTTGGAGAAGGAGTGCAAGAATAAGACGGCAGAATTACAGTCAAAACTCGATGAGGCAAATGACGAGATCGATGATTTGAAAGCTGAGATAACCAAACTGAAGAATGAGTTGGAAGAGTGCAAGACGCTAAACGCGAAGCTGGAACAGTGCTGTCTGGACAAAAACGCACTTTCGGAAAAGCTACACGGCTTCGAAGAGGACCTTGGCGCCAGGACCGCATTGGAGAAGGAACTGGAGAGAGACCGAGATGAGATTGAATTGTTACAGAGGGAGATTTTCGATCTGAAAGATCAGATCGATGCCGAAAGGAAGGAGAACGACGAGCTTCGCGAAACGTTAGAAGCATCGGTCGGCGAGAGGGAAAAGTTGAGGGCTCGGTTAGAGCAGTTGGAGAGTGAGAACGATGATCTGATGAAAAGGATGAAGGAGCTAGACAATTTGAATTACCAGCTAAGGAACGACTACGATAGCATGAAGCAGGCTTTGGACAATTTGCAAGCAGAGATCAACAAACTGGAGGATGAATTGGCCAAGGCGAAGCAAGAACGCGATGCATTGTTGAACGAGAATAATGGTATCAAAAAGCAGCTGGAACAAGCGATGGCGGAGAACGAGAGTCTGATAGCCAAATTGGACGAAACTGGTAAAGAACTTAATAAACTGAAACTACAGAAGGATGAGCTACAGAAGAGCCTCGATGGGATCAATCTTGAGAACGATTCACTGAAACGAGATATGAAAGCGTTAAGGGATGACCTTGAGGATTCCAGAAGGCAAGCGGAGGAACTAAAAGCCGCTGGTGACGCGTTAAAAGCGACGGATAAGGATAAGGTACTTGAACTTGCAAAGCTGCAAGAACAAGTAGAGAACTGCAAGTTCGAAAAGAATCGCTTAACGAAGGAAAATGATGATTTGAAAtctaaaataatagaattacAAGGAAAGCTGGAGGAGATGGATAAGTTAAAGGGAAGAAATACAGATTTACTGGCTGAAGTAGATCTTCTGAcaaaagaattagaaaaagCGTTGGAGGACATTGATCAATTGAAATCCGAAATAGGTTCCTTGAAGGATGGACTCGATAGTTGTGTGGGCGAGATGCAAAAGCTGAGAATCGAAAATGGCGACCTCAAAAAACAGAACGAAACCTTGAAGTCTGAGATGCAAGCAATTACCGATCACTTAATGAAAGACAATGACGATTTAAAAGCAGAAATCTCAGAATTGGAAGAAAAGTTAAGTGAATTGGATAAAACGAAACTAGAAAATGTTGATTTGCTTGATGA
AGTAGATCGTTTGAAACAGGAATTGGCAAAAGCCTGGGAAGAAGTTGATCGATTGAAATCCGAAGTAACATCTCTGAAAAACGCACTCGATAAGTGCGTGGACGAGATGGAAAAGCTGAGAACTGAGAGCGACCAGCTTAAATTGGAGAATCAAGCATTCAAGTCTGATATTCACGGACTTGATGATCGCTTAACGAAAGAAATCGCCAATCTGAAAGCAAAAAACGCGgaattggaagaaaaattagtGGCATTCGATAAATTGAAGTCGGAAAATGAGGATTTACTTGGTGAAGTTGATCGTTTGAGACATGAATTGGAAAAAACCTTAGAGGATATGGATCAATTGAAATCTGAGATAGGTTCTTTGAAAAACGGACTGGATAAATGTGTTGGCGAGATGGATCAGCTGAGAACTGAAAACAGTGGTTTGAAGTCTGAAATTCAGGGAATGAGGGGCGAAGGGGACAGTTTGTCGGCGGAGTTAAATAATCTGAAGAATGAGAATTCTCTTTTGAAAGGCGAAAGAGATCGATTGAGCAAGCAATTGAGCGACTGTAAGAtggaaaacgaaaaattcagAGTGGAGAAGGATCATCTGGAAGCTGAAAATGAGAAGCTGAAAGGAGAGATAAACTCGTGCAAGGAAGAGAATGACAAATTAAAAGACGAACTTGGAAAATCACGGGAACAATTGCAATCATCGAACGACgaattgaataaattaaaggCTAATCTCGACAGAGCTGAGGAGAAAATTCGGTCTCTGGAGCCGCTGATCTCCCGTTTGCATAgtgaaaatgataaattgcGGGACGATTTGACGAGTTTGGAGAACGAGGCCAACGATTTCAAAGCAAAATTGGCTAGAGAAACGGCTGACAATGAAAAGATACAGAACGATCTGAAGATACTGGAGGATCAGGTGCACGATCTAAGTAAGAACCTGGCCAATGCTAGGACAGAAAATGACACTTTGAAACAGGAAAATCAAGGTCTAAAAGCCAAGTTATTGGATATGGATCATGATCTATCGAATTTGAACTCGGAATGTGCGGATCTGAAACGAGAGATTGCTGATCTGAAGAAATTAATCGAtgaattgaaagaaaaaattgctAAACTGGAAGCAAACATAGATCATTGGAAAATGGAGAACTGCAAACTTCAGTTGGACATTGATAAATCGAAAGCTGATCTTGAGAAAGCCTTGAAAGATTTGCTCGAATGCCAGGCTTCGAAGAAAGCACTAGAAGCAGAGATGTACCGTTTCAAGATTGAGAAAGGCGAGCTTGACAAGAAGCTTGTCGATTTAACTTCTCAACTCGAGCAACAGGAAAAAGCATTCGAAGCAGAAAAATCGGCCAGAAATAAGGGTGATTCAGAAATTGCGGCCTTGAAGGAGGAACTGGATGCCTTGAAAAAGGAACTAGGTAAATTGAGAGCTGACAACAACAGATACAGAAACGAAATAGACGACCTAGGAAGACAGCTTGCGGTAACAAAAAATGAACTGGAGAAGTGCAAAGAAGAGGTTTCTATATTAAGAGATGCCAATAACGCACTAAAGTCTCAATTGGATCTCTTGAAAAGTTTAAAGGACGAATATAATAAGTTGATGGCTGATTTAGATTCTCTTAAAGAGGAGAATGTAAACCTTCTGCAAGATAGGAAAAATTTCGAAGATGAGTATACTAGGCTGAAAGGAGAAGGCGATGGACAGAAAGCAGAGATCGATAGATTGAAATCAATCTTGAATGCAGAGGAGGCAGCTGCGGAAAAATTGAGGGCAGATCTGCAAATTTGCCAAACTGAGAACGATAGGTTGCAAAAACAATTAAACGAAGTGAAAAATGAGTTGGATGAACTGACAAAGGGAAACAATCGTATAAAGAACGAGATCGATAAGCTGAAGAAGGCGCTCGCGGACGCGGAAGCAAAGATAAAGTTGCTGGAAAGTGAACTATCCGATTTGCTAGCCGAGAAAAAAGAACTGGTCA
ACGAACTCTATCGTTTTCGCGAACAGCTAAACAATCGTACAAACGAGCTAGAAGAGCAGATGGCCGCAAAAGATGCGGCCAAGAAGGAATTGGCCGACATGAAGGATGAGCTGACCGCTCTAAAAGCGGCGTTGGATAAGGTTCGCAGCGAAAACGATAAGCTGAGAAACGAGAACGAAAAGCTGAATGTGGAATTAACCAAGTTGAACGGGCAATTAGAAACTCTGAAGGACGATAATACGAAGCTGGGAAACGAAAACGCGAATCTGAAGAACGAAAACGCGAATCTAAAGAATGACAATGCAAAGTTGGCGACGGAGTTAACTGGAACGAAAAACAAATTGGCAGAAGCGGAGAAACAGCTAAACGATctagagaaagaaaacgacgACTTGAATAACAAAATAGCCGATCTCGAGAACACAGTGAACGAGCTCGAGCCTTTGAAGAAACAATTAGAAGATGCTAAAAAAGAACTGGATAGGCTGAGGCCAGAGCTAGATAGATTGAAATCAGAGAATGCAGAACTGcaaaacaatttaaataacgCCATAGAGGAATCGAATAGGTTAAGAAATGATTTGGACAAATTAAAAAGCGATTACGACAAATTGAAGTCTGAATTAGCTGACCTGAAGAAGGAGAGAGATAGTCAGAAAGAACGGAACGCAGAATTGGAGAAAGAATTAGCCAAAATAAAGAAGGAGAATGCGAATCTCAAGGGTGAGTTAGCCGATTGTCAAACAGAGAACGAAAGATTGCGTAATGGATTGACAGATTTGAAGTCGCAAAATGCAAAACTGCAGGACGATTTAAACACGGCGAAGAACgaagtgaataaattaaaagccGATTtggataaattgaaaagcGATTATGGTGAATTGCGGTCGGAATTAGGTAAACTAAGGGATGAGAAAAATAGGCACAGAGAACGCGATACTGCGCTAGCCATGGATCTGGATAAATTGAAGAAAGAGAATGACGAGTTAAAAGATGGAAATGAGAAACTGAAAAGCCAGTTATTCGATTGCCAAGAGGAGAGGGAAAGGCTACGCAAGGAATTGGGAAAgctgaaaagagaaaatgcaaaattgaAAGAAGTGAAAAAGGTGGAGCCCAAGGAAACGGAGATGGACAAAGATATTTTGGATGACTGCGGTGATTTCATAAAGGGGAATGAATTACTGGGGAAGAAATTCAAAAAACAGAATGAAgGTGTGCTGCGCATTCGGAATTATATTAGATATTTGGAAGGTAAAGCTACAGAACCGAAAATGGCGGACAGGCCGGAAGAAGAACCAGAAATTGATCCAGTATTAAAGAAAGACATTGCAGATTTGTTGAAGAAATCTCATGATTTATCTAATGATATCTATCAAACGGAacaggaaataaaaaatcttgGAGCTAAACCTGGCGAGGGCATTTTCGATCCTGACAGCTGGCTGAACTCATTGACATTGACACAGTTAGCTGAGCTTCACGATAAGATTTGTCAGTTGACATTGGACATGGTGCACCAGGACAGTAGAGCGGTGCCGTGCGATGGTTATGAACCAGCGACTTCGTCAAAGACggattataatattttgaacCAGCGTATAGCTGCTTTGCAGAAACAGATAGCGGAAAAGCGAACGGAGGCAGATTGGAAGCTTCAGGAGCTGAGGAGGGCTCTTCAAAACGAGCAGGCTAACTTAATTCGAATCTCCGATCAGATGAACTTTGAGAGAAAACGCAATTTGAACCTTCAGCATAGCATAGATGATTCACCGTAA
Protein Sequence
MISDLRLKLAQTEEDISCPAIYRLRAKLRDLMKGEAAEQQISKVVEKSIETLVDLSKNCDDLRLENERLLAQVTDLRSALVDLEGKKTPETILQTAETTTVPEYIDISDLLHKLNNCEDIVADLRKQLEERDEQIDALNKELESMSSQKGLEEQIEAMKEELRRKDDKIAGLLNNLRQSEIDLLGLSSLKSEVENLKSELHDLKSEKTELLNELNKLREALKDRDDQIIDLLEQRNNLEKECKNKTAELQSKLDEANDEIDDLKAEITKLKNELEECKTLNAKLEQCCLDKNALSEKLHGFEEDLGARTALEKELERDRDEIELLQREIFDLKDQIDAERKENDELRETLEASVGEREKLRARLEQLESENDDLMKRMKELDNLNYQLRNDYDSMKQALDNLQAEINKLEDELAKAKQERDALLNENNGIKKQLEQAMAENESLIAKLDETGKELNKLKLQKDELQKSLDGINLENDSLKRDMKALRDDLEDSRRQAEELKAAGDALKATDKDKVLELAKLQEQVENCKFEKNRLTKENDDLKSKIIELQGKLEEMDKLKGRNTDLLAEVDLLTKELEKALEDIDQLKSEIGSLKDGLDSCVGEMQKLRIENGDLKKQNETLKSEMQAITDHLMKDNDDLKAEISELEEKLSELDKTKLENVDLLDEVDRLKQELAKAWEEVDRLKSEVTSLKNALDKCVDEMEKLRTESDQLKLENQAFKSDIHGLDDRLTKEIANLKAKNAELEEKLVAFDKLKSENEDLLGEVDRLRHELEKTLEDMDQLKSEIGSLKNGLDKCVGEMDQLRTENSGLKSEIQGMRGEGDSLSAELNNLKNENSLLKGERDRLSKQLSDCKMENEKFRVEKDHLEAENEKLKGEINSCKEENDKLKDELGKSREQLQSSNDELNKLKANLDRAEEKIRSLEPLISRLHSENDKLRDDLTSLENEANDFKAKLARETADNEKIQNDLKILEDQVHDLSKNLANARTENDTLKQENQGLKAKLLDMDHDLSNLNSECADLKREIADLKKLIDELKEKIAKLEANIDHWKMENCKLQLDIDKSKADLEKALKDLLECQASKKALEAEMYRFKIEKGELDKKLVDLTSQLEQQEKAFEAEKSARNKGDSEIAALKEELDALKKELGKLRADNNRYRNEIDDLGRQLAVTKNELEKCKEEVSILRDANNALKSQLDLLKSLKDEYNKLMADLDSLKEENVNLLQDRKNFEDEYTRLKGEGDGQKAEIDRLKSILNAEEAAAEKLRADLQICQTENDRLQKQLNEVKNELDELTKGNNRIKNEIDKLKKALADAEAKIKLLESELSDLLAEKKELVNELYRFREQLNNRTNELEEQMAAKDAAKKELADMKDELTALKAALDKVRSENDKLRNENEKLNVELTKLNGQLETLKDDNTKLGNENANLKNENANLKNDNAKLATELTGTKNKLAEAEKQLNDLEKENDDLNNKIADLENTVNELEPLKKQLEDAKKELDRLRPELDRLKSENAELQNNLNNAIEESNRLRNDLDKLKSDYDKLKSELADLKKERDSQKERNAELEKELAKIKKENANLKGELADCQTENERLRNGLTDLKSQNAKLQDDLNTAKNEVNKLKADLDKLKSDYGELRSELGKLRDEKNRHRERDTALAMDLDKLKKENDELKDGNEKLKSQLFDCQEERERLRKELGKLKRENAKLKEVKKVEPKETEMDKDILDDCGDFIKGNELLGKKFKKQNEGVLRIRNYIRYLEGKATEPKMADRPEEEPEIDPVLKKDIADLLKKSHDLSNDIYQTEQEIKNLGAKPGEGIFDPDSWLNSLTLTQLAELHDKICQLTLDMVHQDSRAVPCDGYEPATSSKTDYNILNQRIAALQKQIAEKRTEADWKLQELRRALQNEQANLIRISDQMNFERKRNLNLQHSIDDSP
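The protein sequence appears to be the conceptual translation of the coding sequence. A minimal sketch of that check follows, assuming the standard genetic code and treating lowercase bases in the coding sequence the same as uppercase ones. The translate function is a hypothetical helper written for this illustration, and only short fragments copied from the start and end of the coding sequence are used.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Case is ignored, and a trailing partial codon is dropped.
    Codons containing characters other than A, C, G, T become 'X'.
    """
    seq = cds.upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases and last 12 bases of the coding sequence above.
start_fragment = "ATGATATCGGATTTGAGG"
end_fragment = "GATTCACCGTAA"
print(translate(start_fragment), translate(end_fragment))
```

Running the same function over the full coding sequence, with its line breaks removed, would allow a comparison against the whole protein sequence.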

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00232996;
90% Identity
iTF_00216838;
80% Identity
-