Basic Information

Gene Symbol
-
Assembly
GCA_011952255.1
Location
NW:216851-226801[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 42 0.013 6.7 6.1 0.3 30 59 55 84 47 89 0.83
2 42 3.5 1.7e+03 -1.7 2.6 29 62 127 162 108 170 0.61
3 42 0.0014 0.71 9.2 5.2 29 63 188 222 186 226 0.76
4 42 0.0017 0.87 8.9 5.1 24 64 243 283 240 284 0.92
5 42 0.088 44 3.4 3.9 22 53 314 345 295 356 0.59
6 42 0.0087 4.4 6.6 5.1 20 62 354 396 351 399 0.65
7 42 0.012 6.2 6.2 6.7 24 64 393 433 390 441 0.74
8 42 0.066 33 3.8 3.8 21 64 453 496 452 497 0.90
9 42 0.00098 0.49 9.7 4.7 25 64 513 552 503 553 0.83
10 42 0.012 5.9 6.2 2.4 23 61 549 587 547 591 0.86
11 42 0.0072 3.6 6.9 0.2 41 63 602 624 588 632 0.54
12 42 0.0027 1.4 8.3 1.4 28 59 621 652 620 659 0.90
13 42 2.7e-05 0.013 14.7 3.3 25 65 663 703 652 703 0.78
14 42 0.016 8 5.8 0.1 33 62 699 728 697 730 0.84
15 42 0.0089 4.5 6.6 6.9 24 60 736 775 734 780 0.80
16 42 0.0027 1.4 8.3 3.0 37 60 752 775 750 801 0.55
17 42 0.022 11 5.4 0.0 33 62 797 826 795 827 0.89
18 42 0.0025 1.3 8.4 1.9 28 62 820 854 809 857 0.71
19 42 1e-05 0.0053 16.0 3.3 32 62 859 889 851 892 0.88
20 42 0.21 1.1e+02 2.2 1.0 28 59 887 918 886 920 0.83
21 42 0.00056 0.28 10.5 0.5 21 63 911 953 910 955 0.94
22 42 0.0061 3.1 7.1 1.9 30 62 969 1001 958 1008 0.68
23 42 0.0022 1.1 8.6 1.7 28 60 1009 1041 998 1045 0.66
24 42 0.45 2.3e+02 1.2 1.2 28 63 1070 1107 1064 1115 0.58
25 42 0.00092 0.46 9.8 1.0 18 62 1118 1162 1110 1165 0.89
26 42 0.11 54 3.2 1.6 32 61 1167 1196 1163 1200 0.89
27 42 0.43 2.2e+02 1.2 0.2 24 63 1211 1250 1208 1251 0.91
28 42 0.0074 3.7 6.9 2.8 28 59 1257 1288 1254 1293 0.88
29 42 0.0037 1.9 7.8 4.1 27 62 1270 1305 1267 1308 0.91
30 42 0.17 86 2.5 6.3 25 62 1310 1347 1308 1350 0.82
31 42 0.094 47 3.3 2.0 28 64 1338 1377 1334 1378 0.90
32 42 0.062 31 3.9 7.8 29 64 1370 1405 1351 1406 0.73
33 42 0.0015 0.74 9.1 1.7 30 63 1399 1432 1397 1439 0.89
34 42 0.0027 1.3 8.3 5.1 26 59 1437 1470 1434 1475 0.88
35 42 0.23 1.2e+02 2.1 6.8 30 63 1472 1505 1468 1507 0.83
36 42 0.019 9.7 5.5 1.2 27 56 1511 1540 1508 1543 0.87
37 42 4.7e-05 0.024 13.9 5.8 20 64 1539 1583 1539 1584 0.94
38 42 0.012 5.8 6.3 1.4 31 64 1578 1611 1575 1612 0.88
39 42 0.005 2.5 7.4 1.6 31 64 1599 1632 1585 1633 0.83
40 42 0.0041 2 7.7 2.3 21 55 1631 1665 1630 1668 0.91
41 42 0.00057 0.29 10.5 7.6 24 60 1655 1691 1654 1695 0.91
42 42 5.8 2.9e+03 -2.4 0.6 29 54 1861 1887 1854 1898 0.47

Sequence Information

Coding Sequence
ATGATATCGGATTTGAGGTTGAAGCTGGCGCAGACCGAAGAGGACATCAGCTGCCCTGCGATATATCGTTTAAGGGCAAAACTGCGTGATCTTATGAAAGGTGAAGCGGCTGAACAACAGATTTCCAAAGTTGTGGAGAAATCGATTGAAACCTTGGTGGATCTTTCCAAGAACTGCGATGATCTGCGTTTGGAAAACGAACGTCTTCTTGCACAAGTGACAGATTTGCGCAGTGCATTGGTTGATCtcgaaggaaagaaaacgCCAGAAACGATATTGCAAACGGCCGAAACAACCACAGTGCCGGAGTATATTGACATTTCAGATTTATTGCATAAGCTAAACAACTGCGAGGATATCGTGGTTGATCTAAGAAAGCAGTTAGAAGAAAGGGACGAGCAAATCGATGCACTGAACAAAGAACTGGAATCGATGATCAGCCAAAAAGGTTTAGAGGAACAGATAGAAGCCATGAAGGAGGAACTTAGGAGGAAAGATGACAAGATCGCAGGACTGCTGAACAACCTAAGACAATCAGAAATAGACTTATTGGGGTTATCTTCTCTGAAATCCGAAGTGGAGAACCTGAAATCAGAGTTACATGATCTCAAATCAGAGAAAACTGAATTACTGAACGAACTAAACAAACTGCGAGAAGCACTGAAGGATAGAGACGATCAGATAATAGATTTACTAGAGCAGAGGAACAACTTGGAGAAGGAGTACAAGAATAAGACGGCAGAATTACAGTCAAAACTCGATGAGGCAAATGACGAGATCGATGATTTGAAAGCTGAGATAACCAAACTGAAGAATGAGTTGGAAGAGTGCAAGACGCTAAACGCGAAGCTGGAACAGTGCTGTCTGGATAAAAACGCACTTTCGGAAAAGCTACACGGCTTCGAAGAGGCCAGGACCGCATTGGAGAAGGAACTGGAGAGATACCGAGATGAGATTGAATTGTTACAGAGGGAGATTTTCGATCTGAAAGATCAGATCGATGCCGAAAGGAAGGAGAACGACGAGCTTCGCGAAACGTTAGAAGCATCGGTCGGCGAGAGGGAAAAGTTGAGGGCTCGGTTAGAGCAGTTGGAGAGCGAGAACGATGATCTGATGAAAAGGATGAAGGAGCTAGACAATTTGAATTACCAGCTAAGGAACGACTACGATAGCATGAAGCAGGCTTTGGACAATTTGCAAGCAGAGATCAACAAACTGGAGGATGAATTGGCCAAGGCGAAGCAAGAACGCGATGCATTGTTGAACGAGAATAATGGTATCAAAAAACAGGTGGAACAAGCGATGGCGGAGAACGAGAGTCTGATAGCCAAATTGGACGAAACTGGTAAAGAACTTAATAAACTGAAACTACAGAAGGATGAGCTACAGAAGAGCCTCGATGGGATCAATCTTGAGAACGATTCACTGAAACGAGATATGAAAGCGTTAAGGGATGACCTTGAGGATTCCAGAAGGCAAGCGGAGGAACTAAAAGCCGCTGGTGACGCGTTAAAAGCGACGGATAAGGATAAGGTACTTGAACTTGCAAAGCTGCAAGAACAAGTAGAGAACTGCAAGTTCGAAAAGAATCGCTTAACGAAGGAAAATGATGAtttgaaatctaaaataatagaattacAAGGAAAGCTGGAGGAGATGGATAAGTTAAAGGGAAGAAATACAGATTTACTGGCTGAAGTAGATCTTCTGACaaaagaattagaaaaagCGTTGGAGGACATTGATCAATTGAAATCCGAAATAGGTTCCTTGAAGGATGGACTCGATAGTTGTGTGGGCGAGATGCAAAAGCTGAGAATCGAAAATGGCGACCTCAAAAAACAGAACGAAACCTTGAAGTCTGAGATGCAAGCAATTACCGATCACTTAATGAAAGACAATGACGATTTAAAAGCAGAAATCTCAGAATTGGAAGAAAAGTTAAGTGAATTGGATAAAATGAAACTAGAAAATGTTGATTTGCTTGATGAAGTAGATCGTTTGAAACAGGAATTGGCAAAAGCCTGGGAAGAAGTTGATCGATTGAAATCTGAAGTAACATCTCTGAAAAACGCACTCGATAAGTGCGTGGACGAGATGGAAAAGCTGAGAACTGAGAGCGACCAGCTTAAATTGGAGAATCAAGCATTCAAGTCTGATATTCACGGACTTGATGATCGCTTAATGAAAGAAATCGCCAATCTGAAAGCAAAAAACGCGgaattggaagaaaaattagtGGCATTCGATAAATTGAAGTCGGAAAATGAGGATTTACTTGCTGAAGTTGATCGTTTGAGACATGAATTGGAAAAAGTCCTAGAGGATATGGATCAATTGAAATCTGAGATAGGTTCTTTGAAAAACGGACTGGATAAATGTGTTGGCGAGATGGATCAGCTGAGAACTGAAAACAGCGGTTTGAAGTCTGAAATTCAGGGAATGAGGGGCGAAGGGGACAGTTTGTCGGCGGAGTTAAATAATCTGAAGAATGAGAATTCTCTTTTGAAAGGCGAAAGAGATCGATTGAGCAAGCAATTGAGCGACTGTAAGAtggaaaacgaaaaattcaGAGTGGAGAAGGATCATCTGGAAGCTGAAAATGAGAAGCTGAAAGGAGAGATAAACTCGTGCAAGGAAGAGAATGACAAATTAAAAGACGAACTTGGAAAATCACGGGAACAATTGCAATCATCGAACGACgaattgaataaattaaaggCTAATCTCGACAGAGCTGAGGAGAAAATTCGGTCTCTGGAGCCGCTGATCTCCCGTTTGCATagtgaaaatgataaattgcGGGACGATTTGACGAGTTTGGAGAACGAGGCCAACGATTTCAAAGCAAAATTGGCTAGAGAAACGGCTGACAATGAAAAGATACAGAACGATCTGAAGATACTGGAGGATCAGGTGCACGATCTAAGTAAGAACCTGGCCAATGCTAGGACAGAAAATGACACTTTGAAACAGGAAAATCAAGGTCTAAAAGCCAAGTTATTGGATATGGATCATGATCTATCGAATTTGAAATCGGAATGTGCGGATCTGAAACGAGAGATTGCTGATCTGAAGAAATTAATCGatgaattaaaagaaaaaattgctAAACTGGAAGCAAACATAGATCATTGGAAAATGGAGAACTGCAAACTTCAGTTGGACATTGATAAATCGAAAGCTGATCTTGAGAAAGCCTTGAAAGATTTGCTCGAATGCCAGGCTTCGAAGAAAGCACTAGAAGCAGAGATGTACCGTTTCAAGATTGAGAAAGGCGAGCTTGACAAGAAGCTTGTCGATTTAACTTCTCAACTCGACCAACAGGAAAAAGCATTCGAAGCAGAAAAATCGGCCAGAAATAAGGGTGATTCAGAAATTGCGGCCTTGAAGGAGGAACTGGATGCCTTGAAAAAGGAACTAGGTAAATTGAGAGCTGACAACAACAGATACAGAAACGAAATAGACGACCTAGGAAGACAGCTTGCGGTAACAAAAAATGAACTGGAGAAGTGCAAAGAAGAGGTTTCTATATTAAGAGATGCCAATAACGCACTAAAGTCTCAATTGGATCTCTTGAAAAGTTTAAAGGACGAATATAATACCTTGATGGCTGATTTAGATTCTCTTAAAGAGGAGAACGTAAACCTTCTGCAAGATAGGAAAAATTTCGAAGACGACTATACTAGGCTGAAAGGAGAAGGCGATGGACAGAAAGCAGAGATCGATAGATTGAAATCAATCTTGAATGCAGAGGAGGCAGCTGCGGAAAAATTGAGGGCAGATCTGCAAATTTGCCAAACTGAGAACGATAGGTTGCAAAAACAATTAAACGAAGTGAAAAATGAGTTGGATGAACTGACAAAGGGAAACAATCGTATAAAGAACGAGATCGATAAGCTGAAGAAGGCGCTCGCGGACGCGGAAGCAAAGATAAAGTTGCTGGAAAGTGAACTATCCGATTTGCTAGCCGAGAAAAAAGAACTGGTCAACGAACTCTATCGTTTTCGCGAACAGCTAAACAATCGTACAAACGAGCTAGAAGAGCAGATGGCCGCAAAAGATGCGGCCAAGAAGGAATTGGCCGACATGAAGAATGAGCTGACCGCTCTAAAAGCGGCGTTGGATAAGGTTCGCAGCGAAAACGATAAGCTGAGAAACGAGAACGAAAAGCTGAATGTGGAATTAACCAAGTTGAACGGGCAATTAGAAACTCTGAAGGACGATAATACGAAGCTGGGAAACGAAAACGCGAATCTAAAGAATGACAATGCAAAGTTGGCGACGGAGTTAACTGGAACGAAAAACAAATCGGCAGAAGCGGAGAAACAGCTAAACGATctagagaaagaaaacgacGACTTGAATAACAAAATAGCCGATCTCGAGAACACAGTGAACGAGCTCGAGCCTTTGAAGAAACAATTAGAAGATGCTAAAAAAGAACTGGATAGGCTGAGGCCAGAGCTAGATAGATTGAAATCAGAGAATGCAGAACTGcaaaacaatttaaataacgCCATAGAGGAATCGAATAGGTTAAGAAATGATTTGGACAAATTAAAAAGCGATTACGACAAATTGAAGTCTGAATTAGCTGACCTGAAGAAGGAGAGAGATAGTCAGAAAGAACGGGACGCAGAATTGGAGAAAGAATTAgccaaaataaagaaagagaatGCGAATCTCAAGGGTGAGTTAGCCGATTGTCAAACAGAGAACGAAAGATTGCGTAATGGATTGACAGATTTGAAGTCGCAAAATGCAAAACTGCAGGACGATTTAAACACGGCGAAGAACGaagtgaataaattaaaagccGATTTGGATAAATTGAAAAGCGATTATGGTGAATTGCGGTCGGAATTAGGTAAACTAAGGGATGAGAAAAATAGGCACAGAGAACGCGATACTGCGCTAGCCATGGATCTGGATAAATTGAAGAAAGAGAATGACGAGTTAAAAGATGGAAATGAGAAACTGAAAAGCCAGTTATTCGATTGCCAAGAGGAGAGGGAAAGGCTACGCAAGGAATTGGGAAAGctgaaaagagaaaatgcaaaattgaAAGAAGTGAAAAAGGTGGAGCCCAAGGAAACGGAGATGGACAAAGATATTTTGGATGACTGCGGTGATTTCATAAAGGGGAATGAATTACTGGGGAAGAAATTCAAAAAACAGAATGAAGGTGTGCTGCGCATTCGGAATTATATTAGATATTTGGAAGGTAAAGCTACAGAACCGAAAATGGCGGACAGGCCGGAAGAAGAACCAGAAATTGATCCAGTATTAAAGAAAGACATTGCAGATTTGTTGAAGAAATCTCATGATTTATCTAATGATATCTATCAAACGGAacaggaaataaaaaatcttgGAGCTAAACCTGGCGAGGGCATTTTCGATCCTGACAGCTGGCTGAACTCATTGACATTGACACAGTTAGCTGAGCTTCACGATAAGATTTGTCAGTTGACATTGGACATGGTGCACCAGGACAGTAGAGCGGTGCCGTGCGATGGTTATGAACCAGCGACCCCGTCAAAGGCGGATTATAGTATTTTGAACCAGCGTATAGCTGCTTTGCAGAAACAGATAGCGGAAAAGCGAACGGAGGCAGATTGGAAGCTTCAGGAGCTGAGGAGGGCTCTTCAAAACGAGCACGCTAACTTAATTCGAATCTCCGATCAGATGAACTTTGAGAGAAAACGCAATTTGAACCTTCAGCATAGCATAGATGATTCACCGTAA
Protein Sequence
MISDLRLKLAQTEEDISCPAIYRLRAKLRDLMKGEAAEQQISKVVEKSIETLVDLSKNCDDLRLENERLLAQVTDLRSALVDLEGKKTPETILQTAETTTVPEYIDISDLLHKLNNCEDIVVDLRKQLEERDEQIDALNKELESMISQKGLEEQIEAMKEELRRKDDKIAGLLNNLRQSEIDLLGLSSLKSEVENLKSELHDLKSEKTELLNELNKLREALKDRDDQIIDLLEQRNNLEKEYKNKTAELQSKLDEANDEIDDLKAEITKLKNELEECKTLNAKLEQCCLDKNALSEKLHGFEEARTALEKELERYRDEIELLQREIFDLKDQIDAERKENDELRETLEASVGEREKLRARLEQLESENDDLMKRMKELDNLNYQLRNDYDSMKQALDNLQAEINKLEDELAKAKQERDALLNENNGIKKQVEQAMAENESLIAKLDETGKELNKLKLQKDELQKSLDGINLENDSLKRDMKALRDDLEDSRRQAEELKAAGDALKATDKDKVLELAKLQEQVENCKFEKNRLTKENDDLKSKIIELQGKLEEMDKLKGRNTDLLAEVDLLTKELEKALEDIDQLKSEIGSLKDGLDSCVGEMQKLRIENGDLKKQNETLKSEMQAITDHLMKDNDDLKAEISELEEKLSELDKMKLENVDLLDEVDRLKQELAKAWEEVDRLKSEVTSLKNALDKCVDEMEKLRTESDQLKLENQAFKSDIHGLDDRLMKEIANLKAKNAELEEKLVAFDKLKSENEDLLAEVDRLRHELEKVLEDMDQLKSEIGSLKNGLDKCVGEMDQLRTENSGLKSEIQGMRGEGDSLSAELNNLKNENSLLKGERDRLSKQLSDCKMENEKFRVEKDHLEAENEKLKGEINSCKEENDKLKDELGKSREQLQSSNDELNKLKANLDRAEEKIRSLEPLISRLHSENDKLRDDLTSLENEANDFKAKLARETADNEKIQNDLKILEDQVHDLSKNLANARTENDTLKQENQGLKAKLLDMDHDLSNLKSECADLKREIADLKKLIDELKEKIAKLEANIDHWKMENCKLQLDIDKSKADLEKALKDLLECQASKKALEAEMYRFKIEKGELDKKLVDLTSQLDQQEKAFEAEKSARNKGDSEIAALKEELDALKKELGKLRADNNRYRNEIDDLGRQLAVTKNELEKCKEEVSILRDANNALKSQLDLLKSLKDEYNTLMADLDSLKEENVNLLQDRKNFEDDYTRLKGEGDGQKAEIDRLKSILNAEEAAAEKLRADLQICQTENDRLQKQLNEVKNELDELTKGNNRIKNEIDKLKKALADAEAKIKLLESELSDLLAEKKELVNELYRFREQLNNRTNELEEQMAAKDAAKKELADMKNELTALKAALDKVRSENDKLRNENEKLNVELTKLNGQLETLKDDNTKLGNENANLKNDNAKLATELTGTKNKSAEAEKQLNDLEKENDDLNNKIADLENTVNELEPLKKQLEDAKKELDRLRPELDRLKSENAELQNNLNNAIEESNRLRNDLDKLKSDYDKLKSELADLKKERDSQKERDAELEKELAKIKKENANLKGELADCQTENERLRNGLTDLKSQNAKLQDDLNTAKNEVNKLKADLDKLKSDYGELRSELGKLRDEKNRHRERDTALAMDLDKLKKENDELKDGNEKLKSQLFDCQEERERLRKELGKLKRENAKLKEVKKVEPKETEMDKDILDDCGDFIKGNELLGKKFKKQNEGVLRIRNYIRYLEGKATEPKMADRPEEEPEIDPVLKKDIADLLKKSHDLSNDIYQTEQEIKNLGAKPGEGIFDPDSWLNSLTLTQLAELHDKICQLTLDMVHQDSRAVPCDGYEPATPSKADYSILNQRIAALQKQIAEKRTEADWKLQELRRALQNEHANLIRISDQMNFERKRNLNLQHSIDDSP

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00232996;
90% Identity
iTF_00216838;
80% Identity
-