Basic Information

Gene Symbol
-
Assembly
GCA_037178635.1
Location
CM074115.1:240742-249714[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 46 0.14 1.4e+02 2.9 0.5 32 57 53 78 47 85 0.54
2 46 2.4 2.4e+03 -1.0 2.2 33 56 99 122 97 124 0.73
3 46 5.3 5.2e+03 -2.1 0.1 40 60 151 171 148 174 0.86
4 46 0.15 1.5e+02 2.8 0.1 33 55 214 236 208 242 0.81
5 46 2.1 2e+03 -0.8 3.6 29 63 265 299 262 324 0.77
6 46 0.0029 2.9 8.3 5.1 29 63 344 378 342 380 0.89
7 46 0.0022 2.1 8.7 0.1 27 57 398 428 395 435 0.76
8 46 0.083 81 3.6 5.1 25 64 438 477 436 484 0.78
9 46 0.0062 6.1 7.3 6.6 20 61 489 523 486 534 0.53
10 46 0.043 42 4.6 8.0 24 64 528 568 525 576 0.81
11 46 0.068 66 3.9 3.0 28 59 560 591 557 593 0.91
12 46 0.11 1.1e+02 3.2 5.6 21 64 588 631 587 632 0.86
13 46 0.00086 0.84 10.0 5.2 25 65 648 688 638 688 0.84
14 46 0.015 15 6.0 2.5 29 56 697 724 689 740 0.43
15 46 0.013 13 6.2 1.2 38 63 734 759 711 763 0.65
16 46 0.15 1.5e+02 2.8 5.5 27 58 755 786 734 792 0.62
17 46 1.4e-05 0.014 15.7 4.1 22 65 788 838 787 838 0.75
18 46 0.015 15 6.0 0.5 33 63 834 864 833 866 0.87
19 46 0.11 1e+02 3.3 1.8 41 62 860 881 841 884 0.68
20 46 0.029 29 5.1 8.5 24 58 871 908 861 915 0.77
21 46 0.28 2.8e+02 1.9 0.0 42 63 913 934 907 936 0.83
22 46 0.0089 8.7 6.7 0.1 33 64 932 963 916 964 0.90
23 46 0.003 3 8.2 1.8 28 63 955 990 948 992 0.85
24 46 0.0011 1.1 9.6 5.8 25 63 1001 1039 990 1041 0.68
25 46 0.0022 2.2 8.7 0.4 21 62 1046 1087 1045 1089 0.91
26 46 0.00058 0.57 10.5 2.7 25 63 1106 1144 1102 1146 0.82
27 46 0.0067 6.5 7.1 1.2 31 64 1140 1173 1136 1175 0.77
28 46 0.021 21 5.5 0.7 24 57 1168 1201 1164 1209 0.81
29 46 1.4 1.4e+03 -0.3 3.0 29 63 1206 1242 1200 1250 0.56
30 46 0.00083 0.82 10.0 0.7 18 62 1253 1297 1248 1300 0.90
31 46 0.21 2.1e+02 2.3 1.9 31 60 1301 1330 1295 1335 0.81
32 46 0.36 3.6e+02 1.6 0.4 24 54 1332 1362 1330 1369 0.75
33 46 0.31 3e+02 1.8 0.3 24 63 1346 1385 1343 1387 0.91
34 46 0.022 22 5.5 4.7 26 59 1390 1423 1385 1427 0.90
35 46 0.02 19 5.6 5.9 27 62 1405 1440 1402 1448 0.83
36 46 5.6 5.5e+03 -2.2 7.4 25 59 1445 1479 1443 1485 0.78
37 46 0.73 7.2e+02 0.6 3.1 20 62 1472 1510 1469 1513 0.55
38 46 0.037 37 4.7 7.2 25 64 1501 1540 1488 1541 0.80
39 46 1.3e-05 0.013 15.8 6.3 28 63 1539 1574 1536 1576 0.93
40 46 0.001 0.99 9.8 2.0 28 58 1581 1611 1577 1613 0.88
41 46 0.28 2.7e+02 2.0 8.0 30 63 1614 1647 1611 1648 0.92
42 46 0.027 26 5.2 1.3 28 56 1654 1682 1651 1685 0.86
43 46 1.5e-06 0.0015 18.8 3.3 20 64 1681 1725 1681 1726 0.94
44 46 0.0055 5.4 7.4 1.1 24 62 1741 1772 1737 1775 0.56
45 46 0.0025 2.5 8.5 3.3 21 55 1773 1807 1772 1808 0.93
46 46 0.00063 0.62 10.4 6.8 27 60 1800 1833 1797 1836 0.92

Sequence Information

Coding Sequence
atggAGGTTTGTCGCTGTGGATGCGAGGCATCGTCGCaagaatcgatcgatccacCGCACGAGCCGTGCTGTTGCTGCAGTTACGATCCATTTAGCGACAAAGAAGTAGAAATCTACGATCTCTCGTTTGCCCTGAAGAAGCTCACTGTAATGAAGTGTCAGATGAAAAAGTGGCGAATGGAAAGACTTCAACTCGAGAGCGAAAATAGGTCTCTGAAACAAGCCCTCCAATCATTCGGTGTAAATGCGGATGAGATATTGAAGCCCGATCCGCTGCTAGTGCACTCCCGAGAAGAAATCGAAAGGCTGCAAAATGCAAACGCGGTGCTGCAAGATAAAGCGAGGGATCTTGAGGAAACTCTCGCCGAGCGAGATTGCTGTGACGATCCCGACGCCACGATCCACTTTCTCAGAGagaaaataagatatattAGAGAGGGTTTCGCGCTTGAAAAGAAAGAATTGCGAGACATGATATCGGATTTGAGGTTGAAGCTGGCGCAGACCGAGGAGGACATCAGCTGCCCCGCGATATATCGTTTAAGGGCAAAGCTGCGTGATCTTATGAAAGATGAAGCGGCTGAACAGCAGATTTCCAAAGTTGTGgagaaatcgatcgaaaccTTGGTGGACCTTTCTAAGAGTGATGATCTGCGTTTGGAAAACGAACGTCTTCTTGCTCAAGTGACAGACTTGCGCAGTGCATTGGTTGATATCGAAGGAAAGGAAACGCCAGAAACGATATTGCAAACAGCCGAAACAACCACAGTGCCGGAGTATATTGACATTTCAGATTTATTGGATAAGCTGAACAACTGCGAGGATACCGTAACTGATCTAAGAAAGCAGTTAGAACAAAGAGACGAGCAGATCGATGCACTGAACAAAGAACTGGAGTCGATGATCAGTCAAAAATGTTTAGAAGAACAGATAGAAGTCATGAAAGAGGAACTTAAGAGGAAAGATGACAAGATCGCAGGACTCCTGAATAACCTAAGACAATCAGAAATAGATTTGTTGGGGTTATCTTCTCTGAAATCCGAAGTGGAGAACCTGAAATCAGAGTTACATGATCTCAAATCAGAGAGAACTGAATTAATGAACGAACTAAACAAACTGCGAGAAGCACTGAAGGATAGAGACGATcagataataaatttcgaagaGGACTTTGCAGCCACGAATGCGATAATAGCGAATCTCGAAAGTGAGGTGGACACTTTAAGGCGAGATAAGGAAAATTTGTCGAACGAACTGGGCGAGGCAAGGAAACAGATCGAGTCGTTCATTGGACGATTGGAGGATCAAAGGGCGGCCAGGACCGCATTGGAGAAGGAActggagagaaaagaagatgaGATTGAATTGTTACAGAGGGAGATTTTCGATCTGAAAAATCAGATCGATGccgaaagaaaggagaatcACGAGCTTCGCGAAACGTTAGAAGCATCGGTCGGCGAGAGGGAAAAGTTGAGGGCTCGGTTAGAGCAGCTGGAGAACGAGAACGATGATCTGATGGAAAGGATAAAGGAGCTAGACAATTTGAATAACCAGCTAAGGAACGACTACGATAGCATGAAGCAGGCTTTGGACAATTTGCAAGCAGAGATCAATAAACTGGAGGATGAATTGGCCAAGGCGAAGCAAGAAAGCAATGCGTTGTTGAACGAGAATAACGGTATCAAAAAGCAGCTGGAACAAGCGATGGCGGAGAACGAGAGTTTGAGAGCCAAATTGGACGAAGCTGGTAAAGAACTTAATAAACTGAAACTACAGAAGGACGAGCTACAGGAGAGCCTCGATAGGATCAATCTCGAGAACGATTCACTGAAACGAGATATGAAAGCGTTAAGGGATGACCTTGAGGATTCTAGAAGGCAAGCGGAGGAACTAAAAACCGCTGGTGACGCGTTAAAAGCGACGGATAAGGATAAGGTACTTGAACTTGCAAAGCTGCAAGAACAAGTAGAGAACTGCAAGTTCGAAAAGAATCGTTTAACGAAGGAAAATGATGATTtgaaatctaaaataatagaattacaAGGAAAGTTGGAGGAGTTGGATAAGTTAAAGGGAAGAAATACAGATTTACTGGCTGAAGTAGATCGTTTGagaaaagaattagaaaaagcGTTGGAAGACATTGATCAATTGAAATCCGAAATAGGTTCCTTGAAGGATGGACTCGATAATTGCGTGGGCGAGATGCAAAAGCTGAGAATCGAAAATGGCGACCTCAAAAAACAGAACGAAACCTTGAAGTCTGAGATGCAAGCTATTACCGATCGCTTAATGAAAGACAATGACgatttaaaaggaaaaatctCAGAATTGGAAGAAAAGTTAAGTGAATTGGATAAAATGAAACtagaaaatgttaatttgCTTGATGAAGTAGATCGTTTGAAACAGGAATTGGCAAAAGCCTGGGAAGAAGTTGATCGATTGAAATCTGAAGTAGCATCTCTGAAAAATGCACTCGATAAGTGCGTGGACGAGATGGAAAAGCTGAGAACTGAGAGCGATCAGCTTAAATTGGAGAATCAAGCTTTCAAGTCCGATATTCACGGACTTGATGATCGCTTAACGAAGGAAATCGCCAATCTGAAAGCAAAAAACGCggaattagaagaaaaattagtggcATTCGATAAATTGAAATCGGAAAATGAGGATTTACTTGGTGAAGTCGATCGTTTGAGACGTGAATTGCAAAAAGCCTTAGAAGATATCGATCAATTGAAATCTGAGATAGGTTCTTTGAAAAACGGACTGGATAAATGTGTTGGTGAGATGGATCAGCTGAGAACTGAAAACAGCGGTTTGAAGTCTGAAATTCAGGGAATGAGGGGCGAAGGGGACAGTTTGTCAGCGGAGTTAAATAATCTGAAGAATGAGAATTCTCTTTTGAAAGGCGAAAGAGATCGATTGAGCAAGCAATTGAGCGACTGTAagatggaaaatgaaaaattcagaGTGAAGAAGGCTCACCTGGAAGCTGAAAATGAGAAGCTGGAAGGAGAGATAAATTCGTGCAAGGAAGGGAATGACAAATTAAAAGAGGAACTTGGTAAATTACGGGAACAATTGCAGTTATCGAACGAcgaattgaataaattaaaggCTAATCTCGACAGAGCTGAGGACAAAATTCGGACTCTGGAGCCGCTGATTTCCCGTTTGCATagtgaaaatgataaattgcGGGACGATTTGACGAGTTTGAAGAACGAGGCCAATGATTTCAAAACAAAATTGGCTAGAGAAACGACTGACAATGAAAACATGCAGAACGATCTGAAGATACTGGAGGATCAGGTGCACGATTTAAGTAAGAACCTGGTCAATGTTAGGGCagaaaatgaaactttgaaaCAGGAAAATCAAGGTCTAAGAGCCGAGTTATTAAATATGGATCATAATCTGTCGAATTTGAAAGCGGAATATGCGGATCTGAAACGAGAGATTGCtgatttgaagaaattaatcgatgaattaaaagaaaaaattgctaAACTAGAAGCAGACATAGATCATTGGAAAATGGAGAACTGCAAACTTCAGTTGGACATTGATAAATTGAAAGCTGATCTTGAGAAAGCCTTGAAAGATTTGCTCGAATGTCAGgcTTCGAAGAAAGTACTAGAAGCAGAGATGTACCGTTTCAAGATTGAGAAAGACGAGCTTGACAAGAAGCTTGTCGATTTAACGTCTCAACTCGAGCAACAGGAAAAAGTATTCGAAGCAGAAAAATCGGCCAGAAATAAGGGTGATTCAGAAATCGTGGCCTTGAAGGAGGAACTGGATGCCTTGAAGAAGGAACTAGGAAAATTGAGAGCTGACAACAACAGATACAGAAATGAAATAGACGACCTAGGAAGACAGCTTGCGGTAACAAAAAATGAACTGGAGAAATGCAAAGAAGAGGTTTCTGTGTTAAGAGATGCCAATAACGCGCTAAAGTCTCAATTGGATCCCTTGAAAAGTTTAAAGGATGAATATAATAAGTTGAAGGCTGATTTAGATTTTCTTAAGGAGGAGAACGTAAACCTTCTGCAAGATAGGAAAAATTTCGAAGACGAGTATACTAGGCTGACAGGAGAAGGCGATGGACAGAAAGCAGAGATCGATAGACTGAGATCAAACTTGAATGCAGAGGAGGCAGCTGCGGAAAAATTGAGGGCAGATCTTCAAAATTGCCAAATTGAGAACGATAGACTGCAAAAGCAATTAAACGAAGTGAAAAATGAGTTAGATGAACTAACAAAGGGAAACAATCGTATAAAGAACGAGATCGATGAGCTGAAGAAGGCGCTCGCGGACGCGGAAGCACAGATAAAGTTGCTCAAAAGTGAACTATCCGATTTGTTAgccgagaaaaaagaattgatCAACGAACTCTATCGTTCCCGCGAACAGCTAAACAATCGTACAAACGAGCTAGAAGAGCAGATGGCCGCAAAAGATGCGGCCAAGAAGGAATTGACTGACATGAAGGATGAGCTAACCGCGCTAAAAGCGGCGTTGGATAAGGTTCGTAGCGAAAACGATAAGCTGAGGAACGAGAACGAAAAGCTGAATGTGGAATTAACCAAGTTGAACGGGCAATTAGAAACTTTGGAAAACGATAATGCGAAGCTGGAAAATGAAAACGCGAATCTGAAGAACGAAAATGCGAATCTAGAGAATGAAAATGCGAAATTAGTGGCGGAGTTAACCGGAATTAAAAACAAGTTGGCGGAAGCGGAGAAACAGCTGAACGatctagagaaagaaaacgacgaCTTGAATAACAAAATAGGCGATCTCGAGAACACAGTGAACGAACTCGAACCTTTGAAGAAACAATTAGAAGATGCTAAAAAAGAACTGGATAGGCTGAGGCTAGAGCTAGGTAGATTGAAATTAGAGAATGCAGAACttcaaaacaatttaaataactcCATAGAGGAATTGAATAGGTTAAGAAATGATTTGGACAAATTAAAAAGCGATTACGACAAATTGAAGTCGGAATTAGCTGACCTGAAGGAGGAGAGAGATAGTCAGAAAGAACGGAACGCAGAATTGGAGAAAGAAGTAgccaaaataaagaaagagaatgcGAATCTCAAGGGCGAGTTAGCTGATTGTCAAGCGAAGAACGAAGGATTGCGTAATGGATTGACAGATTTGAAATCGCAAAATGTAATACTGCAGGACGATTTAAACAAGGCAAAgcaagaaacgaataaattaaaagccGATTTGGATAAATTGAAAAGCGATTATGGTGAATTGCGGTCGGAATTAGGTAAACTAAGGGATGAGAAGAATAGGCACAAAGAACGCGATACTGCATTAGCCACGGATCTGgataaattgaagaaagagAATGACGAGTTAAAAGATGGGAATGAGAAACTGAAAAGTCAGTTATTCGATTGCCAAGAGGAGAGGGAAAGGCTACGTAAAGAATTGGGAAAgctgaaaagagaaaatgcaaaattgaAAGAAGGTATGACAATTATCCTTTCAATTTGTTTGGAATCCAAATTAGAAGGGCAAGTGCTTGATCTTTTGTTCAAGTCACTTTAA
Protein Sequence
MEVCRCGCEASSQESIDPPHEPCCCCSYDPFSDKEVEIYDLSFALKKLTVMKCQMKKWRMERLQLESENRSLKQALQSFGVNADEILKPDPLLVHSREEIERLQNANAVLQDKARDLEETLAERDCCDDPDATIHFLREKIRYIREGFALEKKELRDMISDLRLKLAQTEEDISCPAIYRLRAKLRDLMKDEAAEQQISKVVEKSIETLVDLSKSDDLRLENERLLAQVTDLRSALVDIEGKETPETILQTAETTTVPEYIDISDLLDKLNNCEDTVTDLRKQLEQRDEQIDALNKELESMISQKCLEEQIEVMKEELKRKDDKIAGLLNNLRQSEIDLLGLSSLKSEVENLKSELHDLKSERTELMNELNKLREALKDRDDQIINFEEDFAATNAIIANLESEVDTLRRDKENLSNELGEARKQIESFIGRLEDQRAARTALEKELERKEDEIELLQREIFDLKNQIDAERKENHELRETLEASVGEREKLRARLEQLENENDDLMERIKELDNLNNQLRNDYDSMKQALDNLQAEINKLEDELAKAKQESNALLNENNGIKKQLEQAMAENESLRAKLDEAGKELNKLKLQKDELQESLDRINLENDSLKRDMKALRDDLEDSRRQAEELKTAGDALKATDKDKVLELAKLQEQVENCKFEKNRLTKENDDLKSKIIELQGKLEELDKLKGRNTDLLAEVDRLRKELEKALEDIDQLKSEIGSLKDGLDNCVGEMQKLRIENGDLKKQNETLKSEMQAITDRLMKDNDDLKGKISELEEKLSELDKMKLENVNLLDEVDRLKQELAKAWEEVDRLKSEVASLKNALDKCVDEMEKLRTESDQLKLENQAFKSDIHGLDDRLTKEIANLKAKNAELEEKLVAFDKLKSENEDLLGEVDRLRRELQKALEDIDQLKSEIGSLKNGLDKCVGEMDQLRTENSGLKSEIQGMRGEGDSLSAELNNLKNENSLLKGERDRLSKQLSDCKMENEKFRVKKAHLEAENEKLEGEINSCKEGNDKLKEELGKLREQLQLSNDELNKLKANLDRAEDKIRTLEPLISRLHSENDKLRDDLTSLKNEANDFKTKLARETTDNENMQNDLKILEDQVHDLSKNLVNVRAENETLKQENQGLRAELLNMDHNLSNLKAEYADLKREIADLKKLIDELKEKIAKLEADIDHWKMENCKLQLDIDKLKADLEKALKDLLECQASKKVLEAEMYRFKIEKDELDKKLVDLTSQLEQQEKVFEAEKSARNKGDSEIVALKEELDALKKELGKLRADNNRYRNEIDDLGRQLAVTKNELEKCKEEVSVLRDANNALKSQLDPLKSLKDEYNKLKADLDFLKEENVNLLQDRKNFEDEYTRLTGEGDGQKAEIDRLRSNLNAEEAAAEKLRADLQNCQIENDRLQKQLNEVKNELDELTKGNNRIKNEIDELKKALADAEAQIKLLKSELSDLLAEKKELINELYRSREQLNNRTNELEEQMAAKDAAKKELTDMKDELTALKAALDKVRSENDKLRNENEKLNVELTKLNGQLETLENDNAKLENENANLKNENANLENENAKLVAELTGIKNKLAEAEKQLNDLEKENDDLNNKIGDLENTVNELEPLKKQLEDAKKELDRLRLELGRLKLENAELQNNLNNSIEELNRLRNDLDKLKSDYDKLKSELADLKEERDSQKERNAELEKEVAKIKKENANLKGELADCQAKNEGLRNGLTDLKSQNVILQDDLNKAKQETNKLKADLDKLKSDYGELRSELGKLRDEKNRHKERDTALATDLDKLKKENDELKDGNEKLKSQLFDCQEERERLRKELGKLKRENAKLKEGMTIILSICLESKLEGQVLDLLFKSL

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00232996;
90% Identity
-
80% Identity
-