Basic Information

Gene Symbol
-
Assembly
GCA_905332935.1
Location
HG995197.1:115301-120527[+]
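
The Location value packs a sequence accession, a coordinate range, and a strand into one string. As a minimal sketch, it could be split into its parts as shown here. The `Locus` type and `parse_locus` function are hypothetical names introduced for illustration, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for one 'seqid:start-end[strand]' location."""
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings such as "HG995197.1:115301-120527[+]".
_LOCUS_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a location string into accession, start, end and strand."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


locus = parse_locus("HG995197.1:115301-120527[+]")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = locus.end - locus.start + 1
```

The span is the genomic extent of the locus, not the coding sequence length.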

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 41 0.0089 3.9 7.8 5.3 29 63 28 62 26 64 0.87
2 41 0.068 29 5.0 8.2 24 64 83 123 70 124 0.93
3 41 0.012 5.2 7.4 0.2 24 56 149 181 147 189 0.84
4 41 0.21 91 3.4 3.9 22 57 203 238 194 245 0.90
5 41 0.0047 2 8.7 4.6 20 64 243 287 238 288 0.93
6 41 0.045 19 5.6 7.9 25 58 290 323 286 330 0.81
7 41 0.15 65 3.9 6.8 21 64 342 385 341 386 0.91
8 41 0.0018 0.77 10.1 5.4 25 65 402 442 392 442 0.84
9 41 0.042 18 5.7 4.6 23 61 438 476 436 494 0.82
10 41 0.12 53 4.2 0.7 31 63 474 513 465 515 0.64
11 41 0.1 44 4.4 2.9 28 59 510 541 502 547 0.88
12 41 0.0026 1.1 9.6 9.6 25 65 552 592 527 592 0.82
13 41 0.02 8.5 6.7 0.4 33 63 588 618 586 620 0.89
14 41 0.18 79 3.6 0.6 42 62 615 635 613 638 0.66
15 41 0.026 11 6.3 7.1 24 58 625 662 622 669 0.81
16 41 0.004 1.7 8.9 1.8 36 58 640 662 639 690 0.69
17 41 0.16 67 3.8 0.0 37 64 690 717 685 718 0.86
18 41 0.0048 2.1 8.7 3.1 28 61 709 742 697 746 0.67
19 41 7.7e-05 0.033 14.4 2.2 32 62 748 778 740 781 0.86
20 41 0.0005 0.22 11.8 4.6 26 62 756 792 750 802 0.73
21 41 0.0028 1.2 9.4 0.2 21 63 800 842 799 844 0.92
22 41 0.018 8 6.8 2.9 26 56 861 891 842 900 0.69
23 41 0.0071 3.1 8.1 2.3 24 64 887 927 886 934 0.76
24 41 0.51 2.2e+02 2.2 6.4 24 64 955 997 952 998 0.85
25 41 0.0016 0.7 10.2 1.2 18 62 1007 1051 1003 1053 0.90
26 41 0.053 23 5.4 1.4 22 64 1039 1081 1038 1089 0.90
27 41 0.3 1.3e+02 2.9 0.8 24 58 1086 1120 1083 1124 0.61
28 41 0.32 1.4e+02 2.9 0.6 24 63 1100 1139 1097 1140 0.91
29 41 0.0043 1.8 8.9 4.9 27 60 1145 1178 1142 1183 0.89
30 41 0.0081 3.5 8.0 4.5 27 62 1159 1194 1157 1202 0.79
31 41 0.22 94 3.4 5.7 25 62 1199 1236 1197 1240 0.86
32 41 0.96 4.2e+02 1.3 2.0 28 63 1227 1265 1224 1267 0.84
33 41 0.21 92 3.4 6.8 30 64 1260 1294 1239 1295 0.72
34 41 0.00021 0.093 13.0 5.0 32 64 1290 1322 1287 1330 0.58
35 41 0.0087 3.8 7.9 6.7 28 61 1335 1368 1330 1373 0.84
36 41 0.88 3.8e+02 1.4 6.1 28 55 1373 1400 1366 1406 0.52
37 41 0.2 85 3.5 0.7 32 56 1412 1436 1404 1438 0.84
38 41 1.3e-05 0.0058 16.9 4.3 20 64 1435 1479 1435 1480 0.95
39 41 0.029 12 6.2 1.0 31 61 1495 1525 1485 1527 0.77
40 41 0.0003 0.13 12.5 6.8 20 55 1526 1561 1523 1564 0.94
41 41 0.031 14 6.1 7.0 29 60 1556 1587 1555 1589 0.93
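
Each row of the domain table above has 13 whitespace-separated fields, so it can be read into a typed record and then ranked or filtered. The following is a rough sketch using four rows copied from the table. The `DomainHit` field names and the 0.01 i-Evalue cutoff are assumptions made for illustration; they are not thresholds used by the database.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """One row of the per-domain table (field names are illustrative)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(line: str) -> DomainHit:
    """Convert one whitespace-separated table row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


# Rows 8, 19, 32 and 38, copied verbatim from the table above.
ROWS = """\
8 41 0.0018 0.77 10.1 5.4 25 65 402 442 392 442 0.84
19 41 7.7e-05 0.033 14.4 2.2 32 62 748 778 740 781 0.86
32 41 0.96 4.2e+02 1.3 2.0 28 63 1227 1265 1224 1267 0.84
38 41 1.3e-05 0.0058 16.9 4.3 20 64 1435 1479 1435 1480 0.95
"""

hits = [parse_domain_row(line) for line in ROWS.splitlines() if line.strip()]

# Highest-scoring domain among the sampled rows.
best = max(hits, key=lambda h: h.score)

# Assumed cutoff: keep domains whose independent E-value is below 0.01.
confident = [h.index for h in hits if h.i_evalue < 0.01]
```

Among these four rows, only domain 38 passes the assumed cutoff, and it is also the highest-scoring one.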

Sequence Information

Coding Sequence
ATGAGATTCGAAACCCTTTTTCAGATCGCAGGACTCCTGAATAATCTAAGACAATCAGAAATAGACTTGTTGGGGTTATCTTCTCTGAAATCCGAAGTGGAGAACCTGAAATCAGAGTTATATGATCTCAAATCAGAGAGAACTGAATTACTGAACGAACTAAACAAACTGCGAGAAGCACTGAAGGATAGAGACGATCAGATAATAGATTTACTGCAGCAGAAGAACAACTTGGAGAAGGAGCACAAGAATAAGACGGCAGAATTACAGTCAACACTCGATGAGGCAAATGACGAGATCGATAATTTGAAAGCTGAGATAACCAAACTGAAGAATGAGTTGGAAGAGTGTAAGAAGATGAACGCGAAGCTGGAACAGTGCTGTCTGGATAACAACGCACTTACGGAAAAGCTACACGGCTTCGAAGAGGACCTTGCAGCCGCGAAAGCGATAATAGCGAATCTCGAAAGTGAGGTGGACATTTTAAGGCGAGACAAGGAAAATTTGTCGAACGAACTGGACGAGGCAAGGAAACAAGTCGAGATGTTCATTGGACGACTGGAGGATGAAAGGGCGGCCAGGACCGCATTGGAGAAGGAACTGGAGAGAAACCGAGATGAGATTGAATTGTTACAGAGGCAGATTTTCGATCTGAAAGATCAGATCGATGCCGAAAGGAAGGAGAACGACGAGCTTCGCGAAACGTTAGAAGCATCGGCCGGCGAGAGGGAAAAGTTGAAGGCTCGGTTAGGGCAGCTGGAGAACGAGAACGATGATCTGATGGAAAGGATGAAGGAGCTAGACAATTTGAATAACCAACTAAGGAACGACTACGATAGTATGAAGCAGGCTTTGAACAATTTGCAAGCAGAGATCAACAAACTGGAGGATGAATTGGCTAAGGCGAAGCAAGAACGCGATGCGTTGTTGAACGAGAATAACAGTATCAAAAAGCAGCTGGAACAAGAGATGGCGGAGAACGAGACTCTGAGAGCCAAATTGGACGAATCTGGTAAAGAACTTAATAAACTGAAACTACAGAAGGACGAACTACAGAAGAACCTCGATGAGATCAATCTCGAGAACGATTCACTGAAACGAGATATGAAAGCGTTAAGGGATGACCTTGAGGATTCCAGAAGGCAAGCGGAGGAACTAAAAGCCGCTGGTGACGCGTTAAAAGCAACGGATAAGGATAAGGTACTTGAACTTGCAAAGCTGCAAGAACAAGTAGAGAACTGCAAGTTCGAAAAGAATCGTTTAACGAAGGAAAATGATGATTTGAAATCTAAAATAATAGAATTACAAGGAAAGTTGGAGGAGTTGGACAAGTTAAAGGGAAGAAATACAGATTTAATGGCTGAAGTAGATCGTTTGAGAAAAGAATTAGAAAAAGCGTTGGAGAACATTGATCAATTGAAATCGGAAATAGGTTCCTTAAAGGATGGACTCGATAATTGTGTGGGCGAGATGCAAAAGTTGAGAATCGAAAATGGTGACCTCAAAAAGCAGAACGAAACCTTCAAGTCTGAGATGCAAACAATTACCGATCGCTTAATAAAAGACAATGACGATTTAAAAGCAAAAATCTCAGAATTGGAAGAAAAGTTAAGTGAATTGGATAAAATGAAACTAGAAAATGTTGATTTGCTTGATGAAGTAGATCGTTTGAAACAGGAATTGGCAAAAGCTTGGGAAGAAGTTGATCGATTGAAATCCGAAGTAGCGTCTCTGAAAAACGCACTCGATAAGTGCGTGAATGAGATGGAAAAGCTGAGAACTGAAAGCGACCAGCTTAAATTGGAGAATCAAGCTTTCAAGTCTGATATTCATGGACTTGATGATCGCTTAACGAAGGAAATCGCCAATTTGAAAGCAAAAAACGCGGAATTGGAAGAAAAATTAGTGGCATTTGATAAATTGAAGTCGGAAAATGAGGATTTACTTGGTGAAGTCGATCGTTTGAGACGTGAATTGGAAAAAGCCTTAAAAGATATCGA
TCAATTGAAATCTGAGATAGGTTCTTTGAAAAACGGACTGGATAAATGTGTTGGCGAGATGGATCTGCTGAGAACTGAAAACAGCGGTTTGAAGTCTGAAATTCAGGGAATGAGGGGCGAAGGGGACAGTTTGTCGGCGGAGTTAAATAATCTGAAGAATGAGAATTCTCTTTTGAAAGACGAAAGAGATCGATTGAGCAAGCAATTGAGCGACTGTAAGATGGAAAACGAAAAATTCAGAGTGGAGAAGGCTCACCTGGAAACTGAAAATGAGAAGCTGGAAGGAGAGATAAATTCGTGCAAGGAAGAGAATGACAAATTAAAAGACGAATTTGGAAAATTACGAGAACAATTGCAGTCATCAAATGACGAATTGAATAGATTAAAGGCTAATCTCGACAAAGCTGAGGACAAAATTCGGTCTCTGGAGCCGCTGATCTCCCGTTTGCATAGTGAAAATGATAAATTGCGGGGCGATTTGACGAGTTTGAAGAATGAGGCTAACGATTTCAAAGCAAAATTGGCTAGAGAAACGGCTGACAATGAAAAGATGCAGAACGATCTGAAGATACTGGAGGATCAGGTGCACGATCTAAGTAAGAACCTGGTCAATGCTAGGGCAGAAAATGACACTTTGAAACAGGAAAATCAAGGTCTAAAAGCCAAGTTATTAAATATGGATCATGATCTATCGAATTTGAGAGAGGAATGTGCGGATCTGAAACGTGAGATTGCTGATTTGAAGAAATTAATCGATGAATTAAAAGAAAAAATTGCTAAACTGGAAGCAGACATAGATCATTGGAAAATGGAGAACTGCAAACTTCAGTTGGACATTGATAAATCGAAAGCTGATCTTGAGAAAGCCTTGAAAGATTTGCTCGAATGCCAGGCTTCGAAGAAAGTACTAGAAGCAGAGATGTACCGCCTCAAGATTGAGAAAGGCGAGCTTGACAAGAAGCTTGTCGAATTATCGTCTCAACTCGAGCAACAGGAAAAAGCATTCGAAGCAGAAAAATCGGCTAGAAATAAGGGTGATTCAGAAATCGTGGCCTTAAAGGAGGAACTGGATGCCTTGAAAAAGGAACTAGGAAAATTGAGAGCTGACAACAACAGATACAGAAATGAAATAGACGAACTAGGAAGACAGCTTGCGGTAACAAAAAATGAACTGGAGAAGTGCAAAGAAGAGGTTTCTGTATTAAGAGATGCCAATAACGCGCTAAAGTTTCAATTGGATCCCTTGAAGAGTTTAAAGGACGAATATAATAAGTTGAAAGCTGATTTAGATTCTCTTAAAGAGGAAAACGTAAACCTTCTGCAAGATAGGAAAAATTTCGAAGATGAGTATACTAGGCTGAAAGGAGAAGGCGATGGACAGAAAGCAGAGATCGATAGATTGAGAGCAAACTTGAATACAGAAGAGGCAGCTGCGGAAAAATTGAGGGCAGATCTTCAAAATTGCCAAACTGAGAACGATAGACTGCAAAAGCAATTAAACGAAATGAAAAATGAGTTGGATGAACTAACAAAGGGAAACAATCGTATAAAGAATGAGATCGATAAGCTGAAGAAGACGCTCGCGGACGCGGAAGCAAAGATAAAGTTGCTGGAAAGTGAACTATCCGATTTGTTAGCCGAGAAAAAAGAATTGGTCAGCGAACTCTATCGTTTCCGCGAACAGCTAAACAATCGCACAAACGAGCTAGAAGAGCAGATGGCCGCAAAAGATGCGGCCAAGAAGGAATTGGCCGACATGAAGGATGAGCTGACCGCTCTAAAAGCGGCGTTGGATAAGGGTCGAAGCGAAAACGATAAGCTGAGAAACGAGAACAAAAAGCTGAATGTGGAATTAACCAAGTTGAACGGGCAATTAGAAACTCTGAAAGACGATAATGCGAAGCTGGAAAACGAAAACGCGAACCTGAAGAACGAAAACGCGAATCTAAAGAATGATAATGCGAAATTGGTGGCGGAATTAACTGGAACGAAAAACAAAT
TGACAGAAGCGGAGAAACAGCTGAACGATCTAGAGAAAGAAAACGACGACTTGAATAACAAAATAGCCGATCTCGAGAACACAGTGAACGAGCTCGAAACATTGAAGAAACAATTAGAAGGTGCTAAAAAAGAACTGGATAGGCTGAGGCCAGAGCTAGATAAATTGAAATCAGAGAATGCAGAACTGCAAAACAATTTAAATAACGCCATAGAGGAATCAAATAGGGTAAGAAATGATTTGGACAAATTAAAAAGCGATTACGACAAATTGAAGTCTGAATTAGCTGACCTGAAGGAGGAGAGAGATAATCAGAAAGAACGGAACGCAGAATTGGAAAAAGAATTAGCCAAAATAAAGAAAGAGAATGCGAGTCTCAAGGGCGAATTAGCCGATTGTCAAGCGAAGAACGAAGGATTACGTAATGGATTGACAGATTTGAAATCGCAAAATACAAAACTGCAGGACGATTTAAACAAGGCGAAGAACGAAGCGAATAAATTAAAAGCCGATTTGGATAAATTGAAAAGCGATTATGGTGAACTGCGGTCGGAATTAGGTAAACTAAGGGATGAGAAGAATAGGCGCAAAGAACGCGATACTGCGTTAGCCACGGATCTGGATAAATTGAAGAAAGAGAATGACGAGTTAAAAGATGAGAATGAGAAACTGAAAAGCCAGTTATTCGATTGCCAAGAGGAGAGGCAAAGGCTACGCAAGGAATTGGGAAAGCTGAAAACAGAAAATGCAAAATTGAAAGAAGGTATGATAATTATCCTTTCAATTTATTTGGAATCCAAATTAGAAGGGCAAGTGCTTAATCTTTTGTTCAAGTCACTTTGCAAATTTAAGCAGAGTGGTTAA
Protein Sequence
MRFETLFQIAGLLNNLRQSEIDLLGLSSLKSEVENLKSELYDLKSERTELLNELNKLREALKDRDDQIIDLLQQKNNLEKEHKNKTAELQSTLDEANDEIDNLKAEITKLKNELEECKKMNAKLEQCCLDNNALTEKLHGFEEDLAAAKAIIANLESEVDILRRDKENLSNELDEARKQVEMFIGRLEDERAARTALEKELERNRDEIELLQRQIFDLKDQIDAERKENDELRETLEASAGEREKLKARLGQLENENDDLMERMKELDNLNNQLRNDYDSMKQALNNLQAEINKLEDELAKAKQERDALLNENNSIKKQLEQEMAENETLRAKLDESGKELNKLKLQKDELQKNLDEINLENDSLKRDMKALRDDLEDSRRQAEELKAAGDALKATDKDKVLELAKLQEQVENCKFEKNRLTKENDDLKSKIIELQGKLEELDKLKGRNTDLMAEVDRLRKELEKALENIDQLKSEIGSLKDGLDNCVGEMQKLRIENGDLKKQNETFKSEMQTITDRLIKDNDDLKAKISELEEKLSELDKMKLENVDLLDEVDRLKQELAKAWEEVDRLKSEVASLKNALDKCVNEMEKLRTESDQLKLENQAFKSDIHGLDDRLTKEIANLKAKNAELEEKLVAFDKLKSENEDLLGEVDRLRRELEKALKDIDQLKSEIGSLKNGLDKCVGEMDLLRTENSGLKSEIQGMRGEGDSLSAELNNLKNENSLLKDERDRLSKQLSDCKMENEKFRVEKAHLETENEKLEGEINSCKEENDKLKDEFGKLREQLQSSNDELNRLKANLDKAEDKIRSLEPLISRLHSENDKLRGDLTSLKNEANDFKAKLARETADNEKMQNDLKILEDQVHDLSKNLVNARAENDTLKQENQGLKAKLLNMDHDLSNLREECADLKREIADLKKLIDELKEKIAKLEADIDHWKMENCKLQLDIDKSKADLEKALKDLLECQASKKVLEAEMYRLKIEKGELDKKLVELSSQLEQQEKAFEAEKSARNKGDSEIVALKEELDALKKELGKLRADNNRYRNEIDELGRQLAVTKNELEKCKEEVSVLRDANNALKFQLDPLKSLKDEYNKLKADLDSLKEENVNLLQDRKNFEDEYTRLKGEGDGQKAEIDRLRANLNTEEAAAEKLRADLQNCQTENDRLQKQLNEMKNELDELTKGNNRIKNEIDKLKKTLADAEAKIKLLESELSDLLAEKKELVSELYRFREQLNNRTNELEEQMAAKDAAKKELADMKDELTALKAALDKGRSENDKLRNENKKLNVELTKLNGQLETLKDDNAKLENENANLKNENANLKNDNAKLVAELTGTKNKLTEAEKQLNDLEKENDDLNNKIADLENTVNELETLKKQLEGAKKELDRLRPELDKLKSENAELQNNLNNAIEESNRVRNDLDKLKSDYDKLKSELADLKEERDNQKERNAELEKELAKIKKENASLKGELADCQAKNEGLRNGLTDLKSQNTKLQDDLNKAKNEANKLKADLDKLKSDYGELRSELGKLRDEKNRRKERDTALATDLDKLKKENDELKDENEKLKSQLFDCQEERQRLRKELGKLKTENAKLKEGMIIILSIYLESKLEGQVLNLLFKSLCKFKQSG
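
One way to check that the protein sequence corresponds to the coding sequence is to translate the nucleotides with the standard genetic code. The sketch below does this for two short fragments, the first 24 and the last 15 nucleotides of the coding sequence above. The `translate` helper is a hypothetical name, and using the standard code is an assumption, since the record does not say which translation table was applied.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X';
    a trailing incomplete codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nucleotides of the coding sequence listed above.
head = translate("ATGAGATTCGAAACCCTTTTTCAG")

# Last 15 nucleotides of the coding sequence, ending in the TAA stop codon.
tail = translate("AAGCAGAGTGGTTAA")
```

Here `head` and `tail` can be compared with the first eight and last four residues of the protein sequence above.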

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00232996;
90% Identity
iTF_00218871; iTF_00225657;
80% Identity
-