Basic Information

Gene Symbol
-
Assembly
GCA_963924025.1
Location
OZ001243.1:15715701-15722643[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 42 3.5 3e+03 -1.4 0.2 31 57 62 88 57 95 0.68
2 42 0.37 3.2e+02 1.7 0.1 40 63 109 132 103 134 0.84
3 42 1.3 1.1e+03 -0.0 0.2 36 57 157 178 150 183 0.84
4 42 4.8e-05 0.042 14.1 0.0 30 58 223 251 215 254 0.89
5 42 7.8 6.7e+03 -2.6 0.0 31 59 358 386 354 391 0.52
6 42 0.017 15 5.9 0.3 11 60 411 461 409 465 0.80
7 42 1.8e-05 0.016 15.5 4.4 21 63 474 516 470 518 0.93
8 42 0.007 6 7.2 0.8 29 54 503 528 500 535 0.66
9 42 0.1 86 3.5 2.7 24 63 533 572 531 573 0.92
10 42 0.032 28 5.1 7.2 26 64 591 629 588 630 0.92
11 42 0.0083 7.1 7.0 5.3 26 63 633 670 631 672 0.88
12 42 0.017 15 5.9 3.7 24 64 673 713 670 714 0.91
13 42 0.0017 1.5 9.2 6.7 26 59 724 757 712 770 0.57
14 42 0.0037 3.2 8.1 1.4 29 63 762 796 756 798 0.88
15 42 0.0086 7.4 6.9 1.6 26 63 773 810 769 812 0.85
16 42 0.0029 2.5 8.4 1.4 32 64 821 853 817 854 0.80
17 42 0.037 32 4.9 0.2 24 59 869 904 865 909 0.74
18 42 0.028 24 5.3 0.3 27 59 886 918 875 920 0.66
19 42 0.4 3.4e+02 1.6 0.0 42 64 929 951 927 952 0.88
20 42 0.00017 0.15 12.4 0.3 21 63 957 999 953 1000 0.96
21 42 0.026 22 5.4 9.5 24 65 1016 1057 1008 1057 0.88
22 42 0.0015 1.3 9.4 6.3 30 64 1050 1084 1040 1085 0.88
23 42 4.1e-05 0.036 14.3 2.3 26 60 1067 1101 1064 1106 0.85
24 42 0.13 1.1e+02 3.1 6.3 21 62 1125 1166 1124 1168 0.92
25 42 0.0017 1.4 9.2 5.5 25 56 1178 1216 1170 1225 0.64
26 42 0.92 7.9e+02 0.4 0.2 30 60 1218 1248 1215 1260 0.74
27 42 0.18 1.5e+02 2.7 0.5 28 58 1258 1281 1230 1288 0.49
28 42 0.075 65 3.9 0.3 27 63 1257 1293 1251 1302 0.78
29 42 0.99 8.6e+02 0.3 2.8 28 62 1286 1320 1280 1337 0.61
30 42 0.27 2.3e+02 2.1 3.7 32 57 1339 1364 1315 1370 0.72
31 42 0.00056 0.48 10.7 2.9 25 62 1367 1404 1363 1406 0.88
32 42 0.038 33 4.8 6.1 27 64 1411 1448 1408 1449 0.82
33 42 0.00033 0.29 11.4 0.8 27 63 1446 1482 1442 1484 0.87
34 42 0.0031 2.7 8.3 0.6 27 62 1523 1558 1520 1560 0.90
35 42 0.38 3.3e+02 1.6 0.9 25 62 1563 1600 1561 1602 0.70
36 42 0.00065 0.56 10.5 3.7 21 64 1601 1644 1600 1645 0.91
37 42 0.39 3.4e+02 1.6 2.9 21 63 1629 1671 1628 1680 0.81
38 42 0.56 4.8e+02 1.1 6.6 21 63 1685 1727 1684 1729 0.91
39 42 0.12 1e+02 3.3 2.4 24 52 1751 1779 1748 1788 0.87
40 42 2e-05 0.017 15.3 2.3 27 64 1796 1833 1793 1843 0.92
41 42 2 1.7e+03 -0.7 0.1 28 49 1963 1984 1960 1995 0.78
42 42 4.6 4e+03 -1.8 1.9 44 56 2067 2079 2041 2114 0.57

Sequence Information

Coding Sequence
ATGGCTCGCGATCATTGTCCCGGATCCTGTTGTCTTTGCGGTTGCGATTCTCCGATGACAACGGGAGTTATGGAGCCACCTCACGAGCCATGCTGCTGCTGTAATTACAATCCCTTCAGCGACGATTCAAAGGAATCGGAGATTCACGACTTGTCGTTCGCGTTGAGAAAGCTGACGGTGATGAAGTGCCAGATGAAGAAATGGCGCATGGAACGACTACAATTGGAGAGCGAATGCAGATCTATGAAAGAAACCCTTCAATCATTCGGGGTAAACACGGAAGACGGCCTGAAGACAGACCCTCTGATAGTTCACTATCGGGGCGAGAGCATGCGATTACAAACTGCAAATGATttgctcgacgaaaaaattcaatatctcGAAGAGACACTGGCCGAGCGTGATCTCTATGAGGATCCTTGCGAGGGAGTTCAGATTATGAGGACCAAGATGGGAAAATTGCGGGATCAATTTCTTGCAGAGAAGAAAACgttGAAAAGCACTATAACTCAATTGCGAATAGAATTGGCTGAAGCGGAGGACGATACATCGTGCGCGGCGTTAAATCGCTTGAGAGCGAAGATCCGAGAGTTGACGAAGGGTGGCCAGAAAGCCGATCAGCAAGTGTCTAAGGTCGTAGAGAAATCGATCGAGACGCTCGTCGATTTGTCAAAAAATTGCGACGATCTTCGCGCGGAGAATGAACGTTTACTTACCGAAATCGATAATCTGCGGCGCGCTCTGGATGACATCGGCGTTGATGACGGTATGCTGAAATCATACAAGGAATCCGATCAGAAGTTGAACTATGATGATTTGGTGGCTGATCTGAAGCGACGATTGGCTGAGAGAGATGCCACGATAGAGGATTTGCATAATCAACTAGAGGCCGGAGCTAACGCTGCCGCGAGATTGTCCGAGAGTCTCGACAGAATGACCGTCAGTCACAAGGCCTTGATGGCCGAGGTCACTGACGCTAAGGCTGAATTAATGAAGCGCGACAAGAAGGTCGAGGAGCTTTTGGAGGACGTGGGCAGAGCCTCACTCGACGTATTGGGCATGAATGATTTGAGAAACGAAATCGACACGATGAAACCTGAACTCTACGACCTTCGTATCGAAAGGGATCAGCTTTTGGGGGAATTGCACAAAATTCGGAACGTTGTGTCCGATCGTAACGATCAGATAATCCAAGTTTTAGAGCAGCGAGACAAAACGATCGATGATCACAAAGGAAAAATCGCTGCTATGCAAGCGAGAATTGACGCGTTGCTCGCGCAAGAAGCTATTACGAGTAAAGAgctcgaagaatcgaaaataCAGATTGCACAGCTCGAGGATGAAATTGCTCGGCTAAAAGAGCATATAGACgacctcgaaaaatgttgcgaCGATTCCGCGGTGCTGAACGAAGCGTTGAAGAGACTGAGAGAAGAGAACGATGCGCTCATAGCCGAGAAGGCAAATCTGACTGCGGATAACGAAAAGCTGAAGTACGAAAATGCGAAGCTTTACGCGGAGATTGATAAGCTACGAGAGGAAAACGACAGACTCGATGCAACGGTTGAGAGCTTGAAAAAAGATTTGGTCGATGCGAAAGAGGCAAACGACATGCTGAATAAACAGCTGAGCGGCATGATGGAGgaggatgaaaaattgaagaataatGAACATAATCTGATGGAACGCTTGGGTACTGTCGCTGAGCAATTAGAATTGATGAACGGGGAGAATTCGAAACTAAAATATGACCTGAATGAAGCTAATAGCGAAATCGAGAGATTGAAACATGAATTAGAAAAGTTAACGGAGCAGTTACGCAACttggaagaaaataataagcTTCTTAATGAAGAACTCGGAAAAGTAACTAATGAATTATCGGCAGTCACGATGCAAAATGATGCGCTGAGAGAAGAATTGGAGGCATTGAACAGAGAAAACTATGAGCTTAAAAAGGACGCGGAAAAATTACGGaaagaatttgaaaatatCAGTGAAGATCTTGAGAAATGTCGAAAACAGAATGCAGAATTGGTACAAAAATTGGAACGCTTAACCGCGGACAATAAGTCTTTGATGGACGATCTTGAGAGAACCAAGAACGCAAACGATGCATTGGAGAGCTCATTGATCGAATTACAGAAGGACTTGGACAATCTGAACGCTGAAGTCAATGAACTTCgccaaaaaaattccgaactCGAAGCCGAAGTTGGGGTATTGAAActagaaaataatgaattgaAGCGATTCGTGGGTGAACTTAAAGCGGACTTAGAAAGCGCTCAGGGCAAATTAGCCGAACTTCGGAATGAAAATCGTGCACTGCGGGCAGATGTTGTGAAACTCGAAGAAACTAATGCAACTTTGAAGGGCGATCTCGAACGCGCAAATTTAGCTAACGATCAATTGAGGGAGCAAGTGATTTCTTTGACTGGCGATCTCGACGCGGCAGCGAAAAAGATCGCTGAATTAGATGTCGAAAATAGTGCTATGAGAAACGAAGTCGAAGCGTTGAAAACTGAAAACGAGAAGCTTCGAGCGGATAACACAAAGTTGGGTGCAGATCTCGAAAGCGCCGTTGGAGAATTAGATAAGATTAAAGAAGAGAAtttcgttttgaaaaatgagATGGACAAAATTTTGACCGATAATGCGAGTTTGAAGGACAAAGTTGACAAGCTCATAGAAGAAAACGACAAATTGACAAACGACGGGGATGCGTGGAAAATAGAAAACGAAAAGCTGAAAGCAGCCGGCGATGGGCAGCGGTTGGATGGCATGCGCTTACAGGGCGAAGTCGATGATTTGAGACACGAAAGGGATGGAATGAAATTGGAGCTAGAAAAATCACAATCCGAGCTCGACAAAGCAAAGAACACGATCGATGCTCTCGAATCGCAGCTTGCagattcgaatttcgaaaaatcgaagcttCAAAAAGATATAGACGCCCTTGCTGCCGCTTGCGATGACCTTCGAAGCGAATTGCATTCGGAAAAACTAGCGAAGGAAGAAGCCCAGAAAGAATTGTCAgctttaaaaaatgaattgaaagaATTACTAAAGAAGCTGGACGAATTGAAGCACGAGAACGATGATCTGAACGTCGAGAAAGAGGCTCTCAAGAAACAGCTTATTGACATGAAGAATGAAGTCACGAAGCTTGCCGCGGAGAACATCGACCTCACGAATGAGATAAAACGTTTAAACTCCGAGCTGGAAAAGCTCAAGATGGAGAACGAAGCACTTGTTCGCGAAGTTGGTCAATTGCAGGAAGACGTTGATCATTGGAAAATGGAGAATTGTAAAATGAAGATGGAAATCGACAAGCTTAAAGGCGACGTTGATAGGCTGAAGAAAGATTGTGATatggaacgagagaaaataaaagagctGGAGCATCAGTTGACTCTCTTGGAAGACGCGAAGAATGAATCTTCGCGAGATGTCGAACATCTGGAGAAGGAACTATCGAAACTAAACGAAGATCTCGAGTCCGAGAAAACTGCGAAGGAAAATGCCCTTAGAAAAATCTCCGCTATGGAGGAGGAGCTCGAAGCTTTGAAAGCTGAACTGGCCAAGTGCAGAAACGACAACGAACGGCTCTCGAGGGAAGTCGACGATTTAAAACGACACCTATCGTTTCTGGAGAGCGAGGCGAAGAAAGCGCACGACGAAATATTCGCTCTTCGAGAGCTTAATGATAATTTAAAGTCTGATCTCGATCGTTTGAAAACTGAATCCACCGGCGCCAAAGCAGATTTGGATGCTTTGCATGGCACGGTCGATTCTCTGAATTCGGAGAATGCAAATCTCAAGAACGATCTCGAAAAGTTCAAGTTGTCGAATAGCAGACTAATGGACGAGCTCGAGTCTTGCAAGTCCGATAATGATAAACTAAAAACCGATTTGAACGAAGCGAAGGCTGATGCTGTAAGATTGAAGGACAATCTTGCGCAGTGTCGTTCGGAACTCGAATCTGTAAAATCTGAGCTGAATAAAGCGAATGGCGAgttggaaaaattgaagattgaCAACGACAAGCTAAAACACGAATTAGCAGACATGCGAAAAGCTCTGGATGATGCGGAGGCTAGAGTCGCAGCTTTGCAGTCTCAGTTGGTTGGGTTGCAGCAAGAAAAATCGCGGCTCGTTGAAGACCTCGATAGTTTGAGAGACGAAGCGGATAAGCTTCGGAAGCAACTTTTAGCTGAGAATCTTGCGAAGGAAAATCTTCAAAAGGAAGCTGCTGCATTGAAAACAGAATTGGCAAGTTTGAAATTGGAGCTTGACAAGGCACGGGCTGATAatgaaagtacgaaaaaagaattagagGCATTGAAGAAACACTTTGACGATTTGCAATCAGCGAACGCGGGTTTAAATACCGAGAACGATAAGATGAGGAGAGAGCTTAATTCTTTAAAACCagaaatcgataaattgaGGATTGAACTCGAGAGGTTGAACGGTGAACTATTAGAGAAAGAGGGTGAGATCGAATCGATGAGAGTCGCGAATGAGAAACTCGACAAATCACTCGCCGATACCGAGAATAATGTCAAAGCTCTTGAAAGTCAGCTCGCCGCGTCGAGAGCCGAGAAAGACAAGCTCGCGAAAGAAGTTGCTGACTTGATAGGAGAGATCAGTAAACTTGGAAAGAATTTAGCTAATGAAAAGGCGGCCCGAGAAGCTGTCGCGAAAGAGCTCGAAACGCTCAAGGATACTCTTGCCGGACTGAGAAGCGAGTTGGACAAGTGTAGAGCAGAATCGGAAATTATGAGAAAGGAACTCGATCGTGCTAAGggagaaatatcgaatttgcacgacgaattgaataaattgaaagCGGAAAATAGTCGCCTACAGGACGAACTACAGAAATCCAAAGCGGATGGAAATGAATTGCGTTCTAATTTAGCAAAGTTGGAAGCAGCGAACGGCAAATTGTTGGCTGATCTCGAGATTGCGAATCTGGAAGCcgagagattaaaaaatgacTTGGAAACGACGAGAGTTGACAACGAGCGGTTGAAGAACGAACTTGATAGAGAAAAGGCCGAGGCTAAGAAGCTACAGGATGCGTTCGATGACATAAAAGCACAGAATGAGCAATTGAAGAATCATCTGGAACAATCCCAAGCCAATGAGGAAAGACTTAAGTCTGACCTGGAGAAGGCGAAAGTTGAAATCGACAATTTAACGGGCGATGGTGCAAAGTTAAGAATGGATTTGAACAATGCCAAAATGGAGAATGATAAATTAAAAGAAGATTTGAAACAAGCGACCAACGATAATGAAGCCCTGAGAAGTGATGTCGAGAAACTGCAACTCGGAATCGCAGATAAGCAAAGACAGAGTAAGAACTTCGATGCGGAAGCGAATGCGCTGAGAAACGAGATTGATCGTTTGGAAGCGGAAAATGCAAAGTTAAAGAGCGAAATTGAAGAATGTCACACGGAAAAAGCCAATTTAATGGATAAGAACAACGAGCTTCGAGCTGATATCGATCGGTTGAAAAGGGACTCAAAAGATAATACCGATAccgatgagaaaaagaaactggAGGATGCCTTAAGGTCGGCAAAAGGCCCGAGGTCTAGTATCGACGAAATGCCGCACATTCATCCAGACGAGTGTGGTGACTACGTGAGATCGAACGAGTTGCTGGACAGTAGACTGGCCAAGCAGAACGAAgGTGTGACGCGTGTCCGTGAATACATTGGTCATATCGATGGCAGAGTTCCAACGAAACCTGCGATGACTGGGCACCCTGACGAGGCTCTGGACATCAGCGCTTTCATCTCGCCCATCGTCGTCCAGACGTTGCTAAAGTCTGAGGAACtctcgaataatattttcggcGCTGAATTGGAGATACAGAAGATTGGGGATAAACTGAACGATTTTAGCAAGGAGAACGATAGGTTGAGAAAGTTACTTGAGGATCTCGGCGTCGACGGTCGCGAGATGAAGGACTTTCGTGAGGAAATGGGCGATGCCTTCGATGCGGAAACTTGGCTGAGATCGTTAACGCTTACGCAGTTGGCCGAGCTGCACGATCGAATTTGCTTATTAACCTCGGGAATGGTGAAACAAGACAGCAGCGGCGTGGCTTGTACAACGGGAAAGACCGAACGCTCGAATGCAGCGGCGGAAGAGCAGGATACGTTGACGAAGAATTATGACATTCTTAATAAACGAATTGCCGCGTTACAGAAGCAAATTGCGGACAAACAGGCAGAAGCGACAATGAAGGTGTACGAAATGAGACAATCGCTTCGTCAAGAGCAATTACAGCTCATCAGGATATCGGAGGAGATgaatcgcgagagaaaacgaaatttggCACTTCATCTCACCATCGACGGTTCACCGTGCTGGATCAACAAATGCACTGCCGAATGCAGCGGCGATTTCGATCATTTGACGGACTTCTGA
Protein Sequence
MARDHCPGSCCLCGCDSPMTTGVMEPPHEPCCCCNYNPFSDDSKESEIHDLSFALRKLTVMKCQMKKWRMERLQLESECRSMKETLQSFGVNTEDGLKTDPLIVHYRGESMRLQTANDLLDEKIQYLEETLAERDLYEDPCEGVQIMRTKMGKLRDQFLAEKKTLKSTITQLRIELAEAEDDTSCAALNRLRAKIRELTKGGQKADQQVSKVVEKSIETLVDLSKNCDDLRAENERLLTEIDNLRRALDDIGVDDGMLKSYKESDQKLNYDDLVADLKRRLAERDATIEDLHNQLEAGANAAARLSESLDRMTVSHKALMAEVTDAKAELMKRDKKVEELLEDVGRASLDVLGMNDLRNEIDTMKPELYDLRIERDQLLGELHKIRNVVSDRNDQIIQVLEQRDKTIDDHKGKIAAMQARIDALLAQEAITSKELEESKIQIAQLEDEIARLKEHIDDLEKCCDDSAVLNEALKRLREENDALIAEKANLTADNEKLKYENAKLYAEIDKLREENDRLDATVESLKKDLVDAKEANDMLNKQLSGMMEEDEKLKNNEHNLMERLGTVAEQLELMNGENSKLKYDLNEANSEIERLKHELEKLTEQLRNLEENNKLLNEELGKVTNELSAVTMQNDALREELEALNRENYELKKDAEKLRKEFENISEDLEKCRKQNAELVQKLERLTADNKSLMDDLERTKNANDALESSLIELQKDLDNLNAEVNELRQKNSELEAEVGVLKLENNELKRFVGELKADLESAQGKLAELRNENRALRADVVKLEETNATLKGDLERANLANDQLREQVISLTGDLDAAAKKIAELDVENSAMRNEVEALKTENEKLRADNTKLGADLESAVGELDKIKEENFVLKNEMDKILTDNASLKDKVDKLIEENDKLTNDGDAWKIENEKLKAAGDGQRLDGMRLQGEVDDLRHERDGMKLELEKSQSELDKAKNTIDALESQLADSNFEKSKLQKDIDALAAACDDLRSELHSEKLAKEEAQKELSALKNELKELLKKLDELKHENDDLNVEKEALKKQLIDMKNEVTKLAAENIDLTNEIKRLNSELEKLKMENEALVREVGQLQEDVDHWKMENCKMKMEIDKLKGDVDRLKKDCDMEREKIKELEHQLTLLEDAKNESSRDVEHLEKELSKLNEDLESEKTAKENALRKISAMEEELEALKAELAKCRNDNERLSREVDDLKRHLSFLESEAKKAHDEIFALRELNDNLKSDLDRLKTESTGAKADLDALHGTVDSLNSENANLKNDLEKFKLSNSRLMDELESCKSDNDKLKTDLNEAKADAVRLKDNLAQCRSELESVKSELNKANGELEKLKIDNDKLKHELADMRKALDDAEARVAALQSQLVGLQQEKSRLVEDLDSLRDEADKLRKQLLAENLAKENLQKEAAALKTELASLKLELDKARADNESTKKELEALKKHFDDLQSANAGLNTENDKMRRELNSLKPEIDKLRIELERLNGELLEKEGEIESMRVANEKLDKSLADTENNVKALESQLAASRAEKDKLAKEVADLIGEISKLGKNLANEKAAREAVAKELETLKDTLAGLRSELDKCRAESEIMRKELDRAKGEISNLHDELNKLKAENSRLQDELQKSKADGNELRSNLAKLEAANGKLLADLEIANLEAERLKNDLETTRVDNERLKNELDREKAEAKKLQDAFDDIKAQNEQLKNHLEQSQANEERLKSDLEKAKVEIDNLTGDGAKLRMDLNNAKMENDKLKEDLKQATNDNEALRSDVEKLQLGIADKQRQSKNFDAEANALRNEIDRLEAENAKLKSEIEECHTEKANLMDKNNELRADIDRLKRDSKDNTDTDEKKKLEDALRSAKGPRSSIDEMPHIHPDECGDYVRSNELLDSRLAKQNEGVTRVREYIGHIDGRVPTKPAMTGHPDEALDISAFISPIVVQTLLKSEELSNNIFGAELEIQKIGDKLNDFSKENDRLRKLLEDLGVDGREMKDFREEMGDAFDAETWLRSLTLTQLAELHDRICLLTSGMVKQDSSGVACTTGKTERSNAAAEEQDTLTKNYDILNKRIAALQKQIADKQAEATMKVYEMRQSLRQEQLQLIRISEEMNRERKRNLALHLTIDGSPCWINKCTAECSGDFDHLTDF

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00397813;
90% Identity
-
80% Identity
-