Basic Information

Gene Symbol
-
Assembly
GCA_951804975.1
Location
OX638119.1:12260663-12271096[+]
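
The Location value packs the contig accession, start, end, and strand into one string. A minimal sketch of splitting it into fields is shown below. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(loc: str) -> dict:
    """Split a 'contig:start-end[strand]' string into its fields."""
    m = re.fullmatch(r"(\S+):(\d+)-(\d+)\[([+-])\]", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    contig, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "contig": contig,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so span = end - start + 1.
        "span": end - start + 1,
    }

loc = parse_location("OX638119.1:12260663-12271096[+]")
print(loc["contig"], loc["strand"], loc["span"])
```

Under that assumption the gene region spans 10,434 bp on the plus strand, which covers introns as well as the coding sequence listed further down.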

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 34 0.11 5.7e+02 3.3 0.7 30 57 54 81 48 88 0.62
2 34 0.2 1e+03 2.5 1.5 33 56 102 125 100 127 0.76
3 34 6.3 3.2e+04 -2.3 0.8 31 59 145 173 140 177 0.61
4 34 0.00012 0.61 12.8 0.8 30 57 215 242 207 244 0.90
5 34 0.079 4e+02 3.8 4.5 25 57 347 379 345 387 0.84
6 34 0.12 6.2e+02 3.2 2.6 25 57 403 435 399 442 0.78
7 34 0.25 1.3e+03 2.2 7.0 26 62 463 499 433 501 0.84
8 34 0.0017 8.8 9.1 6.3 24 64 503 543 501 544 0.93
9 34 0.0017 8.5 9.1 5.6 32 63 549 580 542 582 0.84
10 34 0.028 1.4e+02 5.2 8.7 25 62 605 642 594 645 0.80
11 34 0.31 1.6e+03 1.9 3.3 26 61 655 690 640 694 0.70
12 34 3.5e-05 0.18 14.5 6.9 23 64 708 749 706 750 0.93
13 34 0.0014 6.8 9.4 4.7 32 64 752 784 748 785 0.89
14 34 0.0063 32 7.3 6.8 30 63 792 825 783 834 0.61
15 34 0.045 2.3e+02 4.6 6.3 26 64 830 868 826 869 0.87
16 34 0.00025 1.3 11.8 8.7 21 64 867 910 864 911 0.74
17 34 0.59 3e+03 1.0 2.1 29 56 893 920 891 925 0.71
18 34 0.00092 4.6 10.0 4.0 22 63 910 951 909 959 0.59
19 34 0.0037 19 8.0 5.9 21 61 965 1012 964 1016 0.72
20 34 0.00047 2.4 10.9 0.6 28 62 1014 1048 1011 1050 0.91
21 34 0.58 3e+03 1.0 10.3 30 63 1079 1112 1063 1114 0.67
22 34 0.0038 19 8.0 6.9 26 64 1131 1169 1108 1170 0.90
23 34 0.067 3.4e+02 4.0 0.0 29 64 1162 1197 1159 1198 0.78
24 34 0.016 80 6.0 6.6 24 58 1202 1236 1199 1241 0.66
25 34 0.055 2.8e+02 4.3 3.8 24 63 1216 1255 1214 1271 0.91
26 34 0.00046 2.3 10.9 4.6 24 61 1258 1295 1255 1299 0.91
27 34 0.00052 2.6 10.8 6.1 24 62 1272 1310 1269 1313 0.93
28 34 6.2e-05 0.31 13.7 5.6 26 64 1316 1354 1311 1355 0.93
29 34 0.062 3.1e+02 4.1 10.5 24 65 1370 1411 1353 1411 0.84
30 34 7.2e-05 0.36 13.5 6.5 22 63 1396 1437 1395 1439 0.94
31 34 1 5.2e+03 0.2 8.0 24 64 1471 1511 1455 1512 0.68
32 34 7.3 3.7e+04 -2.5 6.0 33 61 1501 1529 1488 1537 0.55
33 34 2.6 1.3e+04 -1.1 8.0 21 62 1510 1558 1503 1559 0.60
34 34 5.5e-05 0.28 13.9 9.7 24 60 1548 1584 1540 1585 0.93
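
Each row of the Hmmscan Out table has 13 whitespace-separated fields, so it can be read into typed records and filtered. The sketch below uses four rows copied from the table. The names `DomainHit` and `parse_row` are hypothetical, and the independent E-value cutoff of 1.0 is an illustrative assumption, not a threshold used by the database.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    index: int       # "#": domain number
    total: int       # "of": number of domains reported
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float

def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated row of the per-domain table."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )

# Four rows copied from the table above.
ROWS = """
4 34 0.00012 0.61 12.8 0.8 30 57 215 242 207 244 0.90
12 34 3.5e-05 0.18 14.5 6.9 23 64 708 749 706 750 0.93
17 34 0.59 3e+03 1.0 2.1 29 56 893 920 891 925 0.71
34 34 5.5e-05 0.28 13.9 9.7 24 60 1548 1584 1540 1585 0.93
"""
hits = [parse_row(line) for line in ROWS.strip().splitlines()]

# Illustrative cutoff (an assumption): keep hits with i-Evalue below 1.0.
kept = [h for h in hits if h.i_evalue < 1.0]
best = max(hits, key=lambda h: h.score)
print([h.index for h in kept], best.index)
```

Among these four rows, domains 4, 12, and 34 pass the example cutoff, and domain 12 has the highest score (14.5).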

Sequence Information

Coding Sequence
ATGGAGGGTTGCCGATGCGGATGCGGCGCATCGTCGTCTCCAGATTCGATCAATCCGCCGAACGAGCCATGCTGTTGCTGCAGTTACAATCCCTTCAGTGACAATTCCAAAGAATCAGAGCTCTACGACCTCCCGTTTGCCCTTAGGAAGCTCAGCGTAATGAAGTGTCAGATGAAGAAGTGGCGAATGGAACGACTTCAGCTCGAGAGCGAAAATAGGTCTCTGAAACAAGCCCTCCAGTCATTCGGTGTAAATGCGGATGAGATATTGAAGCCTGATCCTCTGCTAGTGCACTCCCGGGAAGAAATCGAAAGGTTGCAAAATGCAAACGCGGCGCTTGAAGATAAAGTGAGGGATCTAGAGGAAACTCTAGGTGAACGAGATTGCTGCGACGATCCTGGCGTCACGATTCACTTCCTTAGGGAAAAGATGCAACATCTCAGAGAGCGTTTTGCGCTTGAAAAGAAAGAATTACGAGACATGATATCTTATTTGAAGTTAAAGGTGGCACAAACTGAAGAGGACGTTAGCTGCCCCGCGATATATCGTCTAAGGGCGAAGCTGCGTGAACTGATGAAGGGTCGAACAGTAGATCAACAGGTCTCGAAGGTTGTCGAAAGATCGATAGAAACTTTGACGGATCTTTCTAAGGATTGTGATGAACTACGAAAGGAGAATGAACGTTTGTTGGCTGAATTAGCTGAGTTGCGTAGCGCGTTAGGACAAGCACCTCCACAGACGATGTTAAGAGCAGCCGAGACTGTTACCGTTCCGGAGTACATAGATGTCTCAGAGCTattggaaaaattgaataattgcgAGGAAGTTGTGACGGACTTGAGAAAACAGTTGGAGGAGAAGGACAGTCTAATTGAATCGCTGCGTAAGGAGTTGAAAGGGATGGGCGATCAGCAAGATCTATTGGATCAGCTCGAGGCCTTGAAAGCGGAACTCAAGAAGAAAGATGACAAGATGGCAGAATTACTGGACAGCCtaagacaaagtgaaatagacaGTCTAGCACTGAACGACTTGAGAGCCGAACTCGATAATCTGAAGCCCCAACTAAACGATCTGAAGAAAGAGAGGGACGACTTATTGGCGGAGCTTGCTAGACTGCAAGAAGCACTGAAAAATAAAGAGGATCAGATAAACAGCATGCTGGAACAGAAGAATAAAGACGACAACCTGTACAACGAAGCGTTAGCACGTGAAGCAAACTTAAACCAGCAAATCACTGATCTAAACTCTCAAATAGACAATCTGAAGAACGAGCTCGAACAGTGTAAAACACGTAGCGCAGAGCTCGAGAAGTGTTGCTTGGACAAGGATGCACTTGCAGAGAAGCTACTCACTCTGGAGAAAGAACTCGCGTCCGCGAACGAGACAATAGCTGATCTCAAAACGGAGATGAATAGTTTGAACAACGATAAGGAAAAATTGTTAGAAGAGCTTGAGAAAATGAGGGAACAGGTCTGGGCGTTGACCGGTGAGCTGGAAAACGAGAAGCAGGCGAGGAGCGCGTTGCAGAAAGAGCTGGAAAGTAACCGAGATGAAATTGAAAAGTTACGAAAGGAGAATACGGATCTGAAAGATCAGCTAGACGACATGAAGAAGGAAAATGATAAACTTCGCGAAACGGCAGAAGCGGCGAAGAGGCTGgcagaagaaaacgaaaagttgAAAGAGCAGCTCGATCAtttgagaaaagaaaacgacgaTATGACGAAGAGGATGAAAGAGTTGGATGATTTGAATAACGTGCTGAAGTCCGATTACGAGAACATGAAGCAAGCGTTGAATAATCTCGAGGCTGAGATCAGTAGATTGGAAAATCAATTGAACAAGACGCTGGAAGAGCGTGACGCGTTGTTAAACGAGAACACCAACATGAAGAAACAACTTGACCAGGCGTTGGCCGAGAATGAGGTACTGAAAAGTGATATTGGTAAACTGATATCGGAGAAGGACAGCctgcagaagaatctcgacgcgaTCAT
GCTTGAGAACGAATCGCTGAAGGCAGACGTGAAGGCATTAACAGACAATCTCGAGGAGTCGAAGAGGCTAGCGGAAGAGTTGAAAGCTGCTGGCGATGCGTTAAGAGCGGCGGATGAGGGCAAAAAGTCCGAGGTCGAGAGACTCGAACAGGAGTTGGGTAGTTTGAAGTCCGAAAGGGATCGTTTAACGAACGAGAATGCCGATCTGAAAGCTAAAAACGCCGAATTGGAACGAAAATTAGAGGACGTTGCGAAAGAATTGGAGAAAACGAAAACGGAAAATGCTGATTTACTAGCTGAGGTCGATCGTCTGAGGCGAGAATTGGAGAAAACTGAGAACGAGATCAGCCGATTAGAAGCTGAGCTGGACGCGTTGAAAAAAGCGCTCGATCATTGCGCCGATGAGCTGGAGAAACTGAAAGCTGAAAACAACGAGCTGAAAACGGAGAATCAAGAGATAAAGTCGAAAATGGAAGAGGCTAAGGGGCAAGGTGATAGTTTAGAGGTGGAATTGAACAACTTGAAGAACGAACATTTGGCATTAAAAAATGAGAGAGACCAGTTGAGTAAGCAACTGAGCGATTGCAATGCGGAGAAACAGAAGCTGAAGAGTGAAAACGAGAAGTTGCAAGCAGAAATTAACACATGCAGAGATGAGAACGACAAGCTGAAGGCTGAACTTGAGAACATTCGTGGACAGTTACAATCGTTGAACGACGAATTAAAGAAGTTGAAAGATCAACTCGACGAAGCTGAGAACAAGATTCGTTCCCTGGAGCCTTTGGTCTCACGTTTGCAAAGTGAAAACGATCAAATGCGTAGTGATCTCGCAGCTTTGCAGAACGAGGCGAATGATTTGAAAGCGAAACTGAGCCAAGAATCGGGCGACAATCAAAAGATGAGGAACGACATGATGATGTTAGAGAATCAGGCGAAAGAGTTGATAGAAAAATTAGACAGCGCAAGAGCAGAAAACGAAGCTTTGAAAGCGGAGAATAAAGATCTGAAGGCGAAGTTGTTAGATCTGGATGGAGAATTATCAAGTTTACGGGCTGAGTGTGCGGACATGAAGGCGGAGAACGctgatttgaaaaaattaatcgaCGAACTGAAGGCAAAGATTGCTCAACTGGAGGCAGACGTCGAACATTGGAAATTGGAGAGTTGCAAGCACCAACTGGATATGGATAAGCTGAAAGCTGATCTTGAGAAGGCGCTAAAGGATTTAAGCGCCTGTCAggCCAAGAACAAAGAGCTAGAGGCAGAACTGAGACGTCTGCAGAACGAGAAAGCCGAAGCCGAGAAAAAACTGGCCGCCATAACGTCGCAACTCGAGGAGCAAAAGAAAGCTCTGGAATTAGAAAGATCAGCGAAAGGTAAAGGCGACACGGAGATCTCTAACTTGAAGTCCGAACTAGAAGCATTGAAGAAAGAACTAGAAAAATTAAGAGCTGAGAACAACAAATACAAAGGTGAAATAGACGATCTAGGAAGACAGCTCTCAGCAGTGAAGAACGAATTGAACAGCTGTAGAGACGAGGTTGCCGCATTAAGAGAAGCCAACGATGGGTTAAAGTCTGACTTGAACGCTTTGAAAAGTCTGAAAGACGAACACGAGAAGTTAAAAGCTGAATTGAACGCTTTAAAAGCGGAGAACGCTAACCTCCAACAGGACAAAAAGAAACTGGAGGAAGATTATAGTAAACTGAGAGGCGAGGGTGATGGTCAGAAGGtagaaattgataaattaaagTCAGAGTTAAACGCGGAGAGAGCAGCCGTGGGAAAATTACAGTCAGACCTGCAAAATTGCAAAGCTGAAAATGATAAACTGCAAGCGCAGTTGAACGAGATGCAGAAGGACTTGGACAAACTGAAAAATGAAACTGATCTACTGAAAGGTGAGACTGATCAACTGAAGAAGGACCTTGCAGCTGCTGAGACTAAGGTGAAGTCCCTAGAAAGTGAATTTTCCGCTCTGTTATCCGAGAAGGAGGAGC
TGGTTAACGAAGTCTATCGTCTTCGCGAAGAGCTGAACAATCTAAGGAACGAACTGGAGAAACAGAAAACCTTGAGGGACTCGGCCATGAAGGAGTTGGCCGAGCTGAAGGAAGAGCTAGCCGCTCTAAAGGCTACGCTGGATAAAGCTCGGGCTGAAAACGAAGCCCTGTCGAATGAGAACGAGAAGCTGAGAGCGGAGATGGGAATGTTGAATAAGCAGCTGCAGGCACTGAATGACGAAAACGCGAATTTGAAGAACGAGAAAGCGAAATTGGCGACGGAGTTAGCCGAGACGAAGGGCAAACTAACTGACGCCGAGAATCGGCTGAACGATATGAAGAATAAAATTGCCAATCTTGAGAACGCGCTAAAGGAGATTGAAGTATTGAAGAAACAACTAGAAGATGCTAAGAAGGAGTCCGACGGGCTGAGGAAGGAATTAGATAAATTGAATACGCTGAACGCGAAATTGCAGAACGACTTAAACCAGGCGAAGGATGAATCGAACAAATTAAAGGATAGTTTGGACAagctgaaaaataattataacagtTTGCAATCTGAATTAGCTAAAGCAAAGGAGGAGATAGAGAAACAGAAAGATAGTGAAGGTAAAGCAAAGAAGGAGAACGATACTCTGCGGGACGAGAATGCGAAGCTGAAAAGCCAGTTAAACGATTGTCAGGAGGAGAACGAGAGGCTGCGCAGGGAATTGGAAAACTTGAGAAGCGAAAATGCCAAATTGAAAGACGTCCCAGGTCTGATAATTACCACAGTAGCGTTTCGATTGTTTGAACACTCGTTATTCAAATTAGACAGCACAGAACGTTATTGCGCATCTGTCGAAGTCACTTTGCACGTTCAAACGGCACGGCTAATGGAGTCGTCCATTTCATACTTATCTTAA
Protein Sequence
MEGCRCGCGASSSPDSINPPNEPCCCCSYNPFSDNSKESELYDLPFALRKLSVMKCQMKKWRMERLQLESENRSLKQALQSFGVNADEILKPDPLLVHSREEIERLQNANAALEDKVRDLEETLGERDCCDDPGVTIHFLREKMQHLRERFALEKKELRDMISYLKLKVAQTEEDVSCPAIYRLRAKLRELMKGRTVDQQVSKVVERSIETLTDLSKDCDELRKENERLLAELAELRSALGQAPPQTMLRAAETVTVPEYIDVSELLEKLNNCEEVVTDLRKQLEEKDSLIESLRKELKGMGDQQDLLDQLEALKAELKKKDDKMAELLDSLRQSEIDSLALNDLRAELDNLKPQLNDLKKERDDLLAELARLQEALKNKEDQINSMLEQKNKDDNLYNEALAREANLNQQITDLNSQIDNLKNELEQCKTRSAELEKCCLDKDALAEKLLTLEKELASANETIADLKTEMNSLNNDKEKLLEELEKMREQVWALTGELENEKQARSALQKELESNRDEIEKLRKENTDLKDQLDDMKKENDKLRETAEAAKRLAEENEKLKEQLDHLRKENDDMTKRMKELDDLNNVLKSDYENMKQALNNLEAEISRLENQLNKTLEERDALLNENTNMKKQLDQALAENEVLKSDIGKLISEKDSLQKNLDAIMLENESLKADVKALTDNLEESKRLAEELKAAGDALRAADEGKKSEVERLEQELGSLKSERDRLTNENADLKAKNAELERKLEDVAKELEKTKTENADLLAEVDRLRRELEKTENEISRLEAELDALKKALDHCADELEKLKAENNELKTENQEIKSKMEEAKGQGDSLEVELNNLKNEHLALKNERDQLSKQLSDCNAEKQKLKSENEKLQAEINTCRDENDKLKAELENIRGQLQSLNDELKKLKDQLDEAENKIRSLEPLVSRLQSENDQMRSDLAALQNEANDLKAKLSQESGDNQKMRNDMMMLENQAKELIEKLDSARAENEALKAENKDLKAKLLDLDGELSSLRAECADMKAENADLKKLIDELKAKIAQLEADVEHWKLESCKHQLDMDKLKADLEKALKDLSACQAKNKELEAELRRLQNEKAEAEKKLAAITSQLEEQKKALELERSAKGKGDTEISNLKSELEALKKELEKLRAENNKYKGEIDDLGRQLSAVKNELNSCRDEVAALREANDGLKSDLNALKSLKDEHEKLKAELNALKAENANLQQDKKKLEEDYSKLRGEGDGQKVEIDKLKSELNAERAAVGKLQSDLQNCKAENDKLQAQLNEMQKDLDKLKNETDLLKGETDQLKKDLAAAETKVKSLESEFSALLSEKEELVNEVYRLREELNNLRNELEKQKTLRDSAMKELAELKEELAALKATLDKARAENEALSNENEKLRAEMGMLNKQLQALNDENANLKNEKAKLATELAETKGKLTDAENRLNDMKNKIANLENALKEIEVLKKQLEDAKKESDGLRKELDKLNTLNAKLQNDLNQAKDESNKLKDSLDKLKNNYNSLQSELAKAKEEIEKQKDSEGKAKKENDTLRDENAKLKSQLNDCQEENERLRRELENLRSENAKLKDVPGLIITTVAFRLFEHSLFKLDSTERYCASVEVTLHVQTARLMESSISYLS
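
The coding sequence and the protein sequence can be checked against each other by translating codons. The sketch below does so for the first 30 and the last 15 nucleotides of the coding sequence. It assumes the standard genetic code, and the helper name `translate` is hypothetical. Input is uppercased because the coding sequence above mixes upper and lower case.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds: str) -> str:
    """Translate a DNA string in frame 0; '*' marks a stop codon,
    'X' marks a codon not in the table (for example one containing N)."""
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(c, "X") for c in codons)

cds_start = "ATGGAGGGTTGCCGATGCGGATGCGGCGCA"  # first 30 nt of the coding sequence
cds_end = "TCATACTTATCTTAA"                   # last 15 nt of the coding sequence

print(translate(cds_start))  # MEGCRCGCGA
print(translate(cds_end))    # SYLS*
```

Both fragments agree with the protein sequence above, which begins MEGCRCGCGA and ends SYLS, followed by the TAA stop codon.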

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-