Basic Information

Gene Symbol
-
Assembly
GCA_963583965.1
Location
OY757221.1:7536926-7550891[+]
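
The location string packs four fields into one token: sequence accession, start, end, and strand. A minimal sketch of splitting it follows, assuming the coordinates are 1-based and inclusive (the record does not state this). `Location`, `parse_location`, and `span_length` are illustrative names, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "ACCESSION:START-END[STRAND]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end, and strand."""
    m = _LOCATION_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m["seqid"], int(m["start"]), int(m["end"]), m["strand"])


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("OY757221.1:7536926-7550891[+]")
print(loc, span_length(loc))
```

The span covers the whole locus, introns included, so it is expected to be longer than the coding sequence listed further down.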

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 46 0.12 1.3e+02 3.1 0.1 25 50 56 81 52 87 0.61
2 46 0.25 2.7e+02 2.1 0.6 39 62 101 124 97 127 0.65
3 46 0.5 5.4e+02 1.1 0.2 40 56 200 216 183 220 0.87
4 46 4.1 4.4e+03 -1.8 5.6 26 63 267 307 262 309 0.67
5 46 0.027 29 5.2 0.1 27 63 321 357 319 359 0.89
6 46 0.82 8.8e+02 0.5 4.8 30 54 359 383 354 403 0.64
7 46 0.11 1.2e+02 3.2 1.2 26 51 407 436 387 445 0.61
8 46 0.032 34 5.0 0.6 24 57 444 477 442 482 0.86
9 46 0.0048 5.2 7.6 4.4 26 63 481 518 476 520 0.87
10 46 0.013 14 6.2 8.0 25 59 536 570 525 576 0.63
11 46 0.0014 1.5 9.4 1.5 31 61 577 607 569 618 0.53
12 46 0.00072 0.77 10.3 1.3 28 61 588 621 583 632 0.57
13 46 1.1e-06 0.0012 19.3 3.0 23 63 646 686 644 688 0.95
14 46 0.0014 1.5 9.3 6.3 27 63 678 714 676 730 0.86
15 46 0.0052 5.6 7.5 0.2 22 63 736 777 728 779 0.75
16 46 5.4e-05 0.058 13.9 3.9 26 60 782 816 779 819 0.92
17 46 1.1e-05 0.012 16.1 3.0 25 64 816 855 815 856 0.89
18 46 0.0012 1.3 9.5 0.9 26 63 866 903 864 905 0.95
19 46 4.4e-05 0.047 14.1 1.1 25 64 921 960 911 961 0.88
20 46 0.002 2.1 8.8 0.5 28 62 952 986 948 989 0.88
21 46 0.00034 0.37 11.3 5.6 20 60 972 1012 969 1016 0.88
22 46 0.014 15 6.1 10.3 21 63 987 1029 984 1045 0.70
23 46 2e-05 0.022 15.2 1.2 24 64 1046 1086 1044 1087 0.96
24 46 0.0035 3.8 8.0 4.2 29 64 1100 1142 1091 1143 0.65
25 46 0.019 20 5.7 0.8 29 62 1135 1168 1131 1171 0.81
26 46 0.0026 2.8 8.5 3.4 24 63 1151 1190 1144 1191 0.93
27 46 0.00024 0.26 11.8 5.7 25 62 1180 1217 1177 1220 0.90
28 46 0.0037 4 8.0 0.3 24 59 1228 1263 1225 1268 0.87
29 46 0.03 32 5.1 5.9 25 57 1285 1317 1267 1325 0.61
30 46 0.0015 1.6 9.2 1.1 24 64 1312 1352 1309 1353 0.88
31 46 0.009 9.7 6.7 0.9 32 62 1327 1357 1323 1366 0.61
32 46 0.068 74 3.9 2.4 24 64 1340 1380 1339 1388 0.86
33 46 0.032 34 5.0 8.2 27 63 1392 1428 1386 1430 0.87
34 46 0.00014 0.15 12.5 5.8 17 63 1431 1477 1428 1479 0.91
35 46 2.4 2.5e+03 -1.0 0.1 31 60 1480 1509 1474 1514 0.73
36 46 0.00011 0.12 12.9 3.5 24 61 1511 1548 1508 1551 0.91
37 46 0.00038 0.41 11.1 11.5 23 63 1580 1620 1565 1621 0.87
38 46 5.4e-05 0.059 13.8 8.5 25 64 1621 1660 1617 1661 0.92
39 46 0.49 5.3e+02 1.2 0.6 38 63 1669 1694 1659 1696 0.80
40 46 2.4 2.6e+03 -1.0 5.1 34 62 1693 1721 1673 1731 0.70
41 46 0.06 65 4.1 2.7 26 65 1706 1745 1700 1745 0.88
42 46 0.0012 1.3 9.5 4.2 24 63 1739 1778 1738 1780 0.89
43 46 0.011 12 6.5 0.8 28 64 1788 1824 1783 1825 0.87
44 46 0.028 30 5.2 4.5 41 63 1871 1893 1837 1898 0.61
45 46 0.0029 3.1 8.3 1.5 24 60 1882 1918 1880 1922 0.91
46 46 2.1 2.3e+03 -0.8 0.8 29 59 1991 2021 1980 2026 0.62
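
Each row above carries thirteen whitespace-separated fields in the order given by the header. The sketch below shows one way to load such rows and keep only the domain hits under an independent E-value cutoff. The cutoff of 0.1 is an arbitrary illustration rather than a threshold used by the database, and `DomainHit`, `parse_rows`, and `filter_hits` are hypothetical names.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int        # "#": rank of this domain hit
    total: int        # "of": total number of domain hits
    c_evalue: float   # conditional E-value
    i_evalue: float   # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_rows(text: str) -> List[DomainHit]:
    """Parse whitespace-separated rows; lines starting with '#' are skipped."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if not fields or fields[0].startswith("#"):
            continue
        if len(fields) != 13:
            raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
        hits.append(DomainHit(
            int(fields[0]), int(fields[1]),
            float(fields[2]), float(fields[3]),
            float(fields[4]), float(fields[5]),
            *(int(f) for f in fields[6:12]),
            float(fields[12]),
        ))
    return hits


def filter_hits(hits: List[DomainHit], max_i_evalue: float) -> List[DomainHit]:
    """Keep hits whose independent E-value is at or below the cutoff."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


# Four rows copied from the table above (hits 1, 13, 16, 17).
sample = """\
1 46 0.12 1.3e+02 3.1 0.1 25 50 56 81 52 87 0.61
13 46 1.1e-06 0.0012 19.3 3.0 23 63 646 686 644 688 0.95
16 46 5.4e-05 0.058 13.9 3.9 26 60 782 816 779 819 0.92
17 46 1.1e-05 0.012 16.1 3.0 25 64 816 855 815 856 0.89
"""

hits = parse_rows(sample)
kept = filter_hits(hits, max_i_evalue=0.1)  # illustrative cutoff only
print([h.index for h in kept])
```

Filtering on the independent E-value rather than the conditional one is a design choice for this sketch; either column can be passed through the same function with a one-line change.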

Sequence Information

Coding Sequence
ATGGAGGTGTGTCGATGCGGATGCGCTTCGTCGACGTCCCTAGAATCGTTCAATCCGCCGAACGAGCCATGCTGTTGCTGCAGTTACAATCCCTTCAGCGACAATTCCAAAGAATCAGAGATCTATGATCTCTCGTTCGCCCTGAGGAAGCTCGGCGTAATGAAGTGTCAGATGAAGAAGTGGCGAATGGAACGACTTCAGCTCGAGAGCGAAAATAGGTCTTTGAAACAAACCCTGCAGTCATTCGGTGTAAATGCGGATGAGATATTGAAGCCTGATCCACTGCTAGTGCACTCCAGAGAAGAAATCGAAAGGCTGCAAAATGCAAACGCGAGTCTCGAAGATAAAGTGAGGGATCTGGAGGAAATTCTTGCTGAACGAGATTGCTGCGACGATCCTGACGCCACGATTCACTTCCTCAGAGAGAAGATGAGACATCTCAGGGAGCGTTTCGCGCTTGAAAAGAAAGACATGCGGGACATAATATCGAATTTGAAGTTGAAGCTGGCGCAAACCGAGGAGGACGTCAGTTGTCCGGCGATATATCGTCTTCGGGCGAAGTTGCGTGACTTGATGAAGGGTCAAACGAACCTTGGCGAAAACGATCATCTCCTTGCCGAGATAGACGAGCTGCGGCGCGGGTTGGGTGGCCGCGAGAAAAGGGAAGTAGTAGAGACGAGGGCAATGGATACGGCCACTGTGCCTGAATACGCTGACGTCTCGGATCTAttgaaaaaattgaacgactgCGAAGACGTTGTGGGCGACCTGGAGAGGAAGTTAGAAGAGAAGGACGACAAGATCGCCGAGCTCCAGAACAGACTGAAGCAGTTCGACCTACTGCAGGCCGAACTCGAAGATCTTAAACGTAAACTGGGAGAACGCGAATCGGAGAATGGCAAGCTCCAGGAAGAATACCACGATAGAACTTCCATCCTGCAGTTGGAACACGACGGCAGAACTTCCATGTTGCAGATACAAATGGACGAATTGTTGGCCGAAATAATGGGTCTGAATCAGACCATCGACACCTTGAACAAAAACATCGACGATTTGAATCAAGAGATCGCCGATCTGAAGTCGCGAGTAGCCGATCTGGAGGCCCAACTGGAGCAGTGTAACAAGCGAAACGCCGAACTGGAAGAGTGTTGCCAGGACAGGGACTCCGTTTTAGAGAGGAACGGGGCCCTGGAGCAAGAGCTCGCGTCAGCAATGGCCAGGATAAAAGACCTCGAGGCACAGGTGGACGCTTTGAGGGCGCAGATCGAGGCAGAGGTGGACGCTTTGAAGGCGCAGATCGAGGCACTGGCCGGGCAACTGGGGAACGAGAAGGCGGCGAGGAGCTCGCTGGAAAAAGAACTGCAGGATAGCCGGGGTGAGATCGAAAGTCTGCGCAAGGAGAATTCGGATCTGAAGGGCCAGATAGACGCAGCGAACGCTGATAACGATGAACTTCGCGACGCGTTAAGGAAATTGTCCGAGGAGGACGAACAGCTGAAGGCGCAGATCGAGCAACTGCAGAACAATATCAAGGACCTGGAGGGCCTGAATAACGCCCTGAGGGGCGACTACGACAACATGAAACAAGCGCTGGATAATTTGGAGGCGGAGATCAACAGACTGGAGGAGGAATTAAGGAGGGCAGAGGATGAACGGCAGGCATTGTTGAATGAGATCGACAGCACTAGAAAGCAGCTGGAGGAGGCGTTGGCAGTGAACGAGACGCTGAAGGCTGAACTGGACGAGGCCGGCGAGCAACTCGACAAACTGAGAGCGGAGAAGGACGAGCTGCAGAAGAGTGTCGACGCGCTCCGGCACGAGAACGATTCGTTCAAACGGGATATGAGGACGTTAAGGGAGGATCTCGATGACGCGCGGAGGCAGGTGGACGAGCTGAAAGCATCTGGCGACGCGCTGAGAGCCGCGGACGACGATAAGAAGTCTGAAGTCGAGAGGCTGACACAGGAGGTGGAGGGTTTGATGTCGGAAAGGGATCGCCT
GACGAGCGAGAACGCCGATCTGAAAGCCGAAAACGCGGAACTGGGGCGAAAACTGGAGGATGCCGTGAAGGAGTTGGAACAATTGAGAGCGGAAAATGCTGATTTACGCGCGGAGGTGGATCGTTTGGGGCAGGAATTGGAGAGAGCGAGAGGCGAGCTGGATCAATTGAAATCGGAAATGGCTGCCGTGAAGGACGCGCTCGATAAGTGCGTCGACGAGTTGGAAAAGTTGAAGGGTGAAAACAGTGATCTTAAATCGCTGAACGAGGCCTTGAAGACGGAGATTCAAGGGGTTAGGGGCGAAGGTGATGGTTTGGCGGTCGAGTTGAACTCTCTGAAGGGCGAGAGGGATCAATTGAAGGACCAACTGAGCGAGTGTAATGCGGAGAACGAGAGATTGAAAACTGAGAAGAGCCAGTTGGAGGGCGAAATCGATAGCTTGAAGACGGAGAAGGGTCAATTGGAGGCCGAAGTCGAGAAGCTGAAGGGAGAGAATGCCGGGCTGAAGGATGAGATTGAGAAGCTGCGGGGACAATTGCAATCGTTGAACGATGAAATGAATGCGTTGAAGACGCAGCTTGAGGAGGCCGGGAACAAAATTCAGTCACTGGAGTCATCGCTCTCGAGTCTTCAAGGTGAAAATgagaaatttcgaaatgatTTAAGCTCGCTGCAGGGAGAGATAAACAATTTGAGAGCACAATTGGAGAGGGAAGCCGCTGATAATCTAAAGTTGAGGAACGATCTCAAGGTGCTCGAGGACCAGGTGCAAGATCTGACCAAGCAGGTGGAAAATGCCAGGGCAGAAAACGAGGCCTTGAAGTCGGAGATTCAAGGGGTTAGAGGTGAAGGTGACGGTTTGGCGGCCGAATTGAACGCTCTGAGGGATGAAAAGTCCGCTTTGACGGGCGAGAGGGATCGATTGAAGGAACAACTAAGCGAATGCAACGCGGAAATCGACAGATTGAAAACAGAGAAGGACCAATTGGAAGCCGAAGTCGACAGGCTGAAGGGAGAGTTAAATACCTCCAAAGAAGAAAATGGCAGGCTGAAGGATGAAATGGAAAAACTGCGAGGACAATTGCAGTCATTGAACGATGAAATGAATGCATTGAAGGCGCAGCTTGAGGAGGCCAGAGACAAAATTCAGTCACTGGAGTCAACGGTCTCGAGCCTTCAAGGCGAGAATGAGAAGTTTCGAAATGATTTGAGCTCTCTGCAGGGCGAGGTGCAAAATTTGAGAGCGCAATTGGAGAAGGAAGCTGGCGATAATCTGAAGTTGAGAAACGATCTGAAGCAACTGGAGGACCAAGTGCAAGATTTGGCCAAGCAGTTGCAAAATTCCAGGGCGGAAAACGAGGCACTGAAGTCGGAGATTCAAGGGGTCAGAGGCGAAGGTGACGGTTTGGCGGTCGAATTGAACGCTCTAAGGGACGAAAATTCTGTTCTGAAGGATGAGAGAGATCGATTGAGGAATCAATTTAACGAGTCCAACGCGGAGAACGAGAGActgagagctgggaaggggCAATTGGAGGCTGAAATCGAGAGACTGAAGGGAGAATTAAATACTTGCAAAGAAGAAAATGCCAGGCTGAAAGACGAAATTGAACAACTGCGAGGACAGTTGCAGTCATTGAACGATGAAGTTAATGGATTGAAGACGCAGCTTGAGGAGGCCAGGAACAAAATTCAGACACTCGAGTCAACGGTCTCGAGTCTCCAAGGTGAAAATgagaaatttcgaaatgatttgAGCTCTCTGCAGACCGAGATAAAGAATTCGAGAGCGCAATCGGACAGGGACGCCGATGAAAATCTGAAGCTGAAGAATGATCTGAAGATGCTGGAGGACCAGGTGACGGATTTGATCAAGAAATTGGAAAATGCCAGGGCAGAAAACGACGCCTTGAAGGAGGAAAATCAAAATTTGAAAGTACAGGTGAACGATATGAATCATGAATTAGCCGGTTTAAGAACAGAGTGCGCGGACCTGAAGA
CAGAGATCGCGGATTTGAAGAAGTCAATCGAAGAATTGAAAGCAAAGATTGCTAAACTGGAGGAAGATTTGGAGTACTGGAAATTGGAGAACTGTAAATTTCAGATGGAGGTGGACAAAGTGAAAGCTGAACTTCAGAAAACTCTGAAGGACTTGAGTGAAGCTCAGTCTCTGAAACAGCAGCTAGAGGCAGAGGTGAATCGTCTCAAGGGCGAGAAGGCCGAGCTTGAGAAAAAACTTGCAGACGCAATGACACAGCTCGAGCAACTGAAGAAAGCACTCGAAGCGGAAAAATCGTCGAAAGATAAGGCCGAGGCACAAGTCGCGGCCCTAAGGTCGGAACTCGATGCATTGAAAGCGGAACTAGAAAAGCTGAGGGCCGAGAACAGCAAGTGCAAAGGTGAGCTGGACGATCTTGCAAGACAGCTGTCAGCCACGAAGAACGAACTTGACGCGTGCAAACAAGAGTTGTTCGCATTGAGGGAGGTAAACAACGCTCTGAAGTCGGATTTCGACGCGTTGAAGGGCATAAAGGACGAGCACAATAAATTAAAGGCGGACGCGGATGCAATGAAAGCGGAGAACGCGAATCTTCAACAAGAGAGACAAAGGCTGCAAGACGAGTACGCCAAGTTGAAAGCGGAGGCTGATGGTCAAAAGCTGGAACTGGACAAGCTGAGATCGGACTTGGCTTCGGAGAGAGCAGCCGCAGAAAAGCTTAGAACAGATCTGCAGAACCGGCAAGCGGAAAACGCCAAGCTGCAGTCAGAACTGGACGAACTGAAAGCTGAATTGGACAGATTGAAAAACGAGAGCGCCAGTTTGAGAAACGAATTGAACGAGCTGAAGAAAGCGCTCGAAGCCAAGATCAAGTCCTTGGAGGATCAACTATCGAACCTTCTGGCCGAGAAGGAGGAGCTGCTGAAGGAACTGGATCGTCTGCGCGACGAGGTGGCCAGTCTGAAAAACGAAGTCGAGAAACAGAAAGCAGCCAAGGAAGCAGCGTTGAGCGAACTGGCGGGTCTGAAGAAGGAGCTGGCCGACCTAGAGAATGTCTTGGCCGGCCTCAGAGCGACGCTGGACAAGGCTAACAGTGAAAATGAAGCGCTGCGCAAGGACAACGAGAAGATGAAAGCAGACCTGGCGAACGCGAGCAACGATCTGGAAGCGcttagaaacgagaactcgcAGTTGGCGGCTGATTTAGCTGGGGCGAAGGAGAAATTAACAGACTTGGAGAAACAGTACGAAGACTTGAAGAAGGAGAACGGCGATCTGAAGTACAAAGCCGCCCGCCTCGATGACACGTTGGCAGAGCTAGAAGCTTTGAAGAAACAATTGGACGGTGCTAATGCGGAGCTGGATAAGCTGCAGTCAGGATTGGATAGATTGAATTCGGAAAATGCAAAACTGCGGGAAGATTTCAATAGGGCGACGGACGAAGCGAATAAGTTGAAGGACAGTTTGGATAAACTGCGGAACGAATACAACGGGCTGCAGGATGATTTGTCTAAGGCGCAGGATGAATTATCTAAGACACAGAATGATTTGTCTAAGCTGCAGGACGAAGCTGCGAAGCAGAAGGCACTTGGGGCATCGCAAGTAGGGGATCTGGATAAATTGCAGAAGGAAAATAACGATCTGCGGGATCAAATCGCGAAGCTGAAGAGTCAGTTGCACGATTGCCGCGAGGAGAGCGACGGGCTGCACAAGGAACTGGGAAATTTGCAGAGGAAGAATGCCAAGCTGATGGAGGCAGCAGAGAGGCCGATGGAGCCAAAGGAGACGGAAATGGACAAAGATATTCTGGATGAATGCGGCGACTTCATTAAGGCGAACGAATTACTGGGGGACAAGTTTCAAAAGCAGAACGaagGCGTTCAACGTGTTCGAGATTATATATCATATTTGGAAGGTAAAGGCGAGGAGCCCAAAATGGCTGATCAATTAGAAGAATCGCATATTGAACCGCAATTGAAGGAAGACATTGCTGACTTGTTG
AAGAAATCTCAGGCGTTATCGAAATCTATTTATGAAACGGAGCAGACAATACAAAATCTTACTGATATCCTGAAAAAACCTGCGGAGGCCGTGGCTCCAGGAGCCTTCGATCCCGATTCTTGGCTGAGTGCGTTGACGTTGACGCAGTTAGCGGAGCTTCACGATAAGATTTGCCTGTTGACATCGGACATGGTGCAGCAGGATAGCAAAGCGGTGCCATGCAGGTCAGCCCCCGGACCGGCGAATCCGCAAGCGGATTACAATGTTTTGAACAAGCGGATAGCCGCTTTGCAGAAACAGATAGCAGAGAAGCAAATGGAGGCAGCGTGGAAGATGCAGGAATTGAAGCGGTCTCTTCGAATGGAGCAGGCTAACCTAATCGAAATCTCCGACCATATGAATTTGGAGAGAAAACGAAACTTGGCCCTTCAGTTCACCATTGATGAAGCACCATAG
Protein Sequence
MEVCRCGCASSTSLESFNPPNEPCCCCSYNPFSDNSKESEIYDLSFALRKLGVMKCQMKKWRMERLQLESENRSLKQTLQSFGVNADEILKPDPLLVHSREEIERLQNANASLEDKVRDLEEILAERDCCDDPDATIHFLREKMRHLRERFALEKKDMRDIISNLKLKLAQTEEDVSCPAIYRLRAKLRDLMKGQTNLGENDHLLAEIDELRRGLGGREKREVVETRAMDTATVPEYADVSDLLKKLNDCEDVVGDLERKLEEKDDKIAELQNRLKQFDLLQAELEDLKRKLGERESENGKLQEEYHDRTSILQLEHDGRTSMLQIQMDELLAEIMGLNQTIDTLNKNIDDLNQEIADLKSRVADLEAQLEQCNKRNAELEECCQDRDSVLERNGALEQELASAMARIKDLEAQVDALRAQIEAEVDALKAQIEALAGQLGNEKAARSSLEKELQDSRGEIESLRKENSDLKGQIDAANADNDELRDALRKLSEEDEQLKAQIEQLQNNIKDLEGLNNALRGDYDNMKQALDNLEAEINRLEEELRRAEDERQALLNEIDSTRKQLEEALAVNETLKAELDEAGEQLDKLRAEKDELQKSVDALRHENDSFKRDMRTLREDLDDARRQVDELKASGDALRAADDDKKSEVERLTQEVEGLMSERDRLTSENADLKAENAELGRKLEDAVKELEQLRAENADLRAEVDRLGQELERARGELDQLKSEMAAVKDALDKCVDELEKLKGENSDLKSLNEALKTEIQGVRGEGDGLAVELNSLKGERDQLKDQLSECNAENERLKTEKSQLEGEIDSLKTEKGQLEAEVEKLKGENAGLKDEIEKLRGQLQSLNDEMNALKTQLEEAGNKIQSLESSLSSLQGENEKFRNDLSSLQGEINNLRAQLEREAADNLKLRNDLKVLEDQVQDLTKQVENARAENEALKSEIQGVRGEGDGLAAELNALRDEKSALTGERDRLKEQLSECNAEIDRLKTEKDQLEAEVDRLKGELNTSKEENGRLKDEMEKLRGQLQSLNDEMNALKAQLEEARDKIQSLESTVSSLQGENEKFRNDLSSLQGEVQNLRAQLEKEAGDNLKLRNDLKQLEDQVQDLAKQLQNSRAENEALKSEIQGVRGEGDGLAVELNALRDENSVLKDERDRLRNQFNESNAENERLRAGKGQLEAEIERLKGELNTCKEENARLKDEIEQLRGQLQSLNDEVNGLKTQLEEARNKIQTLESTVSSLQGENEKFRNDLSSLQTEIKNSRAQSDRDADENLKLKNDLKMLEDQVTDLIKKLENARAENDALKEENQNLKVQVNDMNHELAGLRTECADLKTEIADLKKSIEELKAKIAKLEEDLEYWKLENCKFQMEVDKVKAELQKTLKDLSEAQSLKQQLEAEVNRLKGEKAELEKKLADAMTQLEQLKKALEAEKSSKDKAEAQVAALRSELDALKAELEKLRAENSKCKGELDDLARQLSATKNELDACKQELFALREVNNALKSDFDALKGIKDEHNKLKADADAMKAENANLQQERQRLQDEYAKLKAEADGQKLELDKLRSDLASERAAAEKLRTDLQNRQAENAKLQSELDELKAELDRLKNESASLRNELNELKKALEAKIKSLEDQLSNLLAEKEELLKELDRLRDEVASLKNEVEKQKAAKEAALSELAGLKKELADLENVLAGLRATLDKANSENEALRKDNEKMKADLANASNDLEALRNENSQLAADLAGAKEKLTDLEKQYEDLKKENGDLKYKAARLDDTLAELEALKKQLDGANAELDKLQSGLDRLNSENAKLREDFNRATDEANKLKDSLDKLRNEYNGLQDDLSKAQDELSKTQNDLSKLQDEAAKQKALGASQVGDLDKLQKENNDLRDQIAKLKSQLHDCREESDGLHKELGNLQRKNAKLMEAAERPMEPKETEMDKDILDECGDFIKANELLGDKFQKQNEGVQRVRDYISYLEGKGEEPKMADQLEESHIEPQLKEDIADLL
KKSQALSKSIYETEQTIQNLTDILKKPAEAVAPGAFDPDSWLSALTLTQLAELHDKICLLTSDMVQQDSKAVPCRSAPGPANPQADYNVLNKRIAALQKQIAEKQMEAAWKMQELKRSLRMEQANLIEISDHMNLERKRNLALQFTIDEAP
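
The protein sequence should be the translation of the coding sequence. As a hedged consistency check, the sketch below translates DNA with the standard genetic code, which is an assumption here because the record does not name a translation table. `translate` is an illustrative helper; it upper-cases the input because the coding sequence contains lower-case stretches, and it strips whitespace so that a sequence wrapped over several lines can be passed in directly.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 0.

    Whitespace is removed and case is ignored. Codons containing
    characters outside ACGT become 'X'. Trailing bases that do not
    fill a codon are ignored. With to_stop=True, translation ends
    before the first stop codon.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases of the coding sequence in this record.
print(translate("ATGGAGGTGTGTCGATGCGGATGCGCTTCG"))
print(translate("GAAGCACCATAG", to_stop=False))
```

To check the whole entry, join the coding-sequence lines into one string, translate it, and compare the result with the joined protein-sequence lines.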

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2