Basic Information

Gene Symbol
-
Assembly
GCA_952773225.1
Location
OX731550.1:79030733-79042996[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 42 0.24 3e+02 2.0 0.4 30 55 60 85 54 92 0.60
2 42 0.37 4.6e+02 1.4 1.7 33 56 108 131 106 133 0.79
3 42 0.00013 0.16 12.5 2.5 30 60 222 252 217 255 0.91
4 42 0.43 5.4e+02 1.2 1.0 28 50 282 304 280 326 0.75
5 42 0.0032 4 8.0 1.7 24 57 364 397 362 404 0.71
6 42 0.12 1.5e+02 3.0 0.1 26 62 419 455 416 461 0.77
7 42 0.34 4.2e+02 1.6 0.6 32 62 463 493 459 495 0.90
8 42 0.04 50 4.5 2.3 31 63 504 536 498 537 0.85
9 42 0.016 19 5.8 3.3 24 57 539 572 538 578 0.89
10 42 0.0019 2.4 8.8 7.6 22 64 558 600 557 601 0.95
11 42 0.00062 0.77 10.3 4.7 25 65 575 615 572 615 0.90
12 42 0.036 45 4.7 2.3 24 64 602 642 601 643 0.91
13 42 0.18 2.2e+02 2.4 1.9 26 61 639 674 634 684 0.80
14 42 0.0039 4.8 7.8 0.9 26 62 695 731 690 734 0.87
15 42 0.0016 1.9 9.1 0.7 26 62 751 787 748 790 0.73
16 42 0.0045 5.5 7.6 5.0 25 59 792 826 788 831 0.52
17 42 1.1e-05 0.013 16.0 4.6 21 61 837 877 833 881 0.91
18 42 0.00051 0.63 10.6 7.0 21 64 907 950 906 951 0.94
19 42 0.00049 0.61 10.6 5.6 22 57 922 957 921 961 0.91
20 42 0.0038 4.8 7.8 1.3 22 64 943 985 941 986 0.92
21 42 3.3e-06 0.0041 17.6 3.8 27 60 997 1030 994 1033 0.92
22 42 0.001 1.2 9.7 3.2 29 65 1027 1063 1026 1063 0.93
23 42 0.00012 0.14 12.7 2.3 26 60 1073 1107 1070 1109 0.91
24 42 5e-05 0.062 13.8 6.1 25 64 1128 1167 1117 1168 0.75
25 42 0.17 2.1e+02 2.6 0.8 28 50 1180 1202 1170 1224 0.55
26 42 0.16 2e+02 2.6 3.4 38 63 1239 1264 1222 1272 0.69
27 42 0.033 41 4.8 1.6 34 61 1284 1311 1281 1315 0.75
28 42 0.0007 0.87 10.2 1.1 32 64 1310 1342 1305 1343 0.91
29 42 0.002 2.5 8.7 6.3 21 59 1355 1393 1353 1395 0.82
30 42 0.0055 6.8 7.3 6.4 23 59 1413 1449 1411 1455 0.92
31 42 0.076 94 3.6 1.3 31 63 1442 1474 1439 1476 0.75
32 42 1.8e-05 0.023 15.2 8.1 25 63 1471 1509 1467 1511 0.92
33 42 0.66 8.2e+02 0.6 9.3 21 64 1523 1566 1509 1567 0.64
34 42 0.00057 0.7 10.5 0.7 26 64 1556 1594 1551 1595 0.87
35 42 0.049 61 4.2 2.3 30 58 1595 1623 1592 1633 0.79
36 42 0.052 64 4.2 7.4 21 63 1638 1680 1625 1682 0.73
37 42 0.041 51 4.5 3.3 28 63 1680 1718 1676 1724 0.79
38 42 0.8 1e+03 0.4 8.2 22 64 1733 1775 1725 1776 0.79
39 42 0.0043 5.3 7.6 0.2 28 60 1767 1799 1764 1801 0.90
40 42 0.0032 4 8.0 1.7 21 56 1802 1837 1802 1841 0.92
41 42 0.0044 5.4 7.6 2.0 39 64 1841 1866 1838 1867 0.91
42 42 4.5 5.6e+03 -2.0 1.1 36 51 2075 2090 2055 2097 0.78

Sequence Information

Coding Sequence
ATGGAGACGCGGAAGGCTCTGTCGGTTTGCCGGTGCGGCTGCAACGCCTCGTCGTCCCGCGAATCGATTGATCCACCGAACGAGCCGTGCTGTTGCTGCAGTTACAACCCCTTCAGCGACAATTGCAAAGAGTCGGAGATCTACGACCTCTCGTTTGCCCTGAGGAAGCTGAGCGTAATGAAGTGTCAGATGAAGAAGTGGCGAATGGAGCGGCTGCAGCTCGAGAGCGAAAATAGGTCTCTGAAACAAGCCCTCCAGCCATTCGGCGTAAACGCGGATGAGATATTGAAGCCTGATCCGCTGCTGGTGCACTCGCGGGAAGAAATCGAAAGGCTGCAGAATGCAAACGCGGCCCTCGAAGATAAAGTGGTGGATCTGGAAGAAACCCTCGCCGAACGAGATTGCTGCGACGATCCGTGCTCGTCGGTGCACTGTCTCCGGGAGAAGATGAGATTCTTGCGCGAGGGTTTTGCtctcgaaaagaaagaATTCAGAGACATGATATCGCATCTGAAGTTGAAGATCGTGGAGGTAGAGGAGGACGTCAGCTGTCCCGCGATATACCGCCTGAGGGCGAAGCTCCGCGACCTGATGAAGGGCGGTCAAATGCCGGAACAGCCTGTCTCGAAGGTCGTGGAGAGATCGATAGAGACGCTGGGTGACCTTTCCAAGAGCTGCGACGATTTGCGTGCGGAGAACGATCGTCTCCTGGCCGAGCTGGAGGAGCTGCGTCGCGCGCTGGCGGGCCTTGAAGAGAAGGAGCTGCCCGCTCCCGACATCCAGTATCGAGCGATCCCACAGAAAACGGCCGAAACCACCACGGTACCGGAGTACATAGACGTCTCAGAGCTGACGCAGAAGCTGAAGGACTGCGAGGAGGCCGTGGCCGACTTGAGGAGGCAGCTGGACGAGAAGAATAGTATCATCGATTCGTTGAATAAAGACCTGGAGACTATGGTCAGCCAGAGGGATCTGCTTGACGAGATCGCGGCCATGAAGGCGGAGCTCCAGAGGAGAGACGACAAGATCGGCGATCTGCTGAACCAGTTGAGACAGTCAGAGATAGACCTGCTCGGCCTGGGCAACTTAAAATCTCAGGTCGACGAGCTGCAGCCACATTTACAGGGTCTCGAGGCGGAGAAGGGCCAGCTTCTAGACGAGCTGAATAAATTACGGAACGCGTTGGCTGAGAGGAGCGACCAGATAATCGACATACTGGAGCAGAAGCAAAAGCTCGAGAAGGATTGCGCGGACAAGATGGCGGCCCTGCAGGCCAAGATGGATGAGTGGTTGGCTCGCGAGGCTGCCTTGAACACGGAGATCGACGATCTCAAGGGGCGGGTAGCGGGTTTGCTGGAAGAAATAGAGCGGTGTAAAGCGCGAGACGCGGAGCTGGAGCAGTGCTCCGTGGACAGGGACGCCCTCGCGAAGACGCTGCAGGTGATGGAGGAGGAGCTGGCGGCGTTGCGAGACGACCTCGCGTCCGCGGCAGCCACGACAGCGAGTCACCAGACAGAGCTAGACACATTGAGGGGCGAGAAGGAGGAGTTACTGAAGGAGCTGAACGAGATCAGGGGGCAAGTGGAGGCACTGACCAGCCAGCTGGCAGAGGAGAGGGCGGCCAAGGAAGCTCTGCAAAGGTACCTGGAAAGTGGCCGGGACGAGCTTGAAAAGCTACGGGAGGAAAATTCGAGCCTGAAGGAGCAGCTCGGCGCAGCGACGGAGGACAACGCGAAACTTCAGGCAGCGATGAAGAGCCTGGCTGAGGAGAACGAAAGTTTGAAAGGGGAGCTCGAGAGCCTGAAGACGGAAAACAAGGGGCTGGCGGACAGACTGGAGGAGCTGGAGAATTTGAATAACGTCTTGAGGAACGATTACGAAAACGTGAAAAGGGCGCTGGACAATCTCGAGGCAGAGATCAACAGACTGGAGGGTCTGTTGAGTAAGACGGCCGAGGAACGCGACGCTCTGCTGAATGAAAATGGGGATATTAAAAGACGATTGGAAGAGGCCGTGGCGGAGAACGATGGTCTGAAAGCTGAATTGAACGACATTGGCCAACGACTCGATAAATTTGGATTAGAGAACGAGGAGCTGAAGAAGAACCTCGAGGCGATGAACACTGAGAACGATTCGCTGAAGCAGGATATTGGGGGGTTGAGAAGCGAGCTGGATGGTGCGAGGAAGGAAGCAGAGTCGCTGAAAGCTGCCGGGGACTCGCTGAGGGCGGCGGACGAGGGCGCGAAGGCGGAAGCTGAAAGAGTGGCGGCAGAAATCGGCCACCTAAGGAACGAGAAGGATCGCTTGACGAACGAGAATGCAGATCTGACGGCGAAGAATTTGCAGCTGGGAAGTGAGCTCGATGACGCCAAGCAGAACGTTGAGAAATTCAAGGCCGACAATGCTGCCTTGCAGGAAGAGCTGGATCGTCTGAGGCAGGAATTGGAGAAGTCGAAGAATGCGATGGACCAGTTGAAGTCAGAGCTGGCGACGTCGAAAGAAGCCCTCGACAGGTGCGTGGCCGAGATGGAGAAGCTGAAAAACGAAGCCAACGATCTGAAGTCGCGGAACGAGGCTCTGCAGGCGGAAATCGAAAAGTGCAAGGCGGAAATCGAGAAGTGCAAGGCGGAGGGAGACGCTCTGAGGGGCGATCTAGAGAAAACGAGGTTAGAATTGAGCGATGCGAAAGGGCAGCTGGACAAGGCGACCGACGAGCTGAAGAAACTAAAGGAAGAGAACAGTTCCCTGGACGATAACTTGAAGCGACTGATTGCTGAGAACGACGCGCTGCAATCAGAGGCTGAGGGGCTGATGAAAGAGGCGGAGAAGCTGAAGGAGGAAGCTGCCAAGCTCCAGAAGCAGACGGCGCAAGGGCAGGGCGAAGGCGCCGCGTTGAAGGCTGAAATTGATGGGATTAAGAGCGAAAGAGACGCTCTAGCGGGGGAATTAAGCAGGCTGAAGGACGAGCATGGTGCTATCCTGAGCGAGAGGGATAAGTTGAGCAAACAAGTGGCCGATTGTAATGCGGAGAACGAGAAACTGAGAGCAGAGAGGGGCCAGCTGGAGGCTGAAAATGAGAAACTCAAGGGAGACATGAAGACGTGCGAGGAGGAAAACGGGAGGATGAAAGCTGAGTTGGAGCAGCTGGCGGAGCAAGTGGGTTCGTTGAATGACCAGTTAAATAAACTGAGAGCTCAACTGGCTGACGCGGAGAAGAAAATTCAGTCCCTCGAGCCTCTGGTGACTCGTTTGCAGAGCGAGAATGATAAATTGCGAAACGACGTGCAAGCCTTGCAGGGCGAAATGAACGAGCTGCAGGCAAACATGGGGATAGAAAGGGATGACAGTCAGAAAATGCAGGCCGACATGGTGATGTTGGAGGACAAGGTGCAGGATCTGATCGCGAAACTCGACGAGGCCAGGGCGCAGAATCAGGCTCTGAAACAGGAGAATCAGGACCTTCGTGCAAAGCTGCAGGATGTGGAGCAGGAATTGTCCAGTTTGAAAACGGAATGTGCTGATGCGAAAACGGAGATCGCGAATCTGAAGAAGCTAGTTGCTGACCTGCAAGCGCAAATCGCTAAGCTGCAGGCAGATGTAGAGTATTGGAAAATGGAGAACTGCAAAGACAAGCTGGAAGTGGATAAACTGAAGGCCGATCTGGAGAAAGCGTTGAAGGACTTGAGCGAGTGTCAGACACTAAGAGCAGCACAAGAGGCAGAATTGAATCGTCTCCTGAGCGAAAAGTCCGCGCTCGATAAAAAGCTCGGCGACTTGACGTCGCAAGTCGAGGAGCAGAAGAAGAACCTGGCAGACGGAGAGGCAGCAAAGGGCAAGGGCGACGTAGTGGTTGCAGCCCTTAAAAGTGAGCTCGCAGCATTGAAAAAGGAACTGGAGAACCTGAGAGCGGACAACAAGAGATACAAAAGCCAAGCGGACGATCTGGCGAAGCAGCTCGCGGCGCTGAAGAACGAGCTCGATGCCTGCAAAGAAGAGGTGGCCGCGCTGAGAGACAGCAACAGCGCCTTGAGGGCAGAAGTAAACGGGCTGGGCGTCGAGAAGAACAAGTTACAGGACGAAAGGAACCAACTCAAGTCCCAGGTGGACGCCCTGAAAGCAGAGAACGCGACTCTCGACGAGGATCGGAAGAAGCTGGCAGAGGAGTCCAACAAGCTGAGAGGCGCTGGAGATGGCCAAAGACTAGAAATCGATACCCTCAGGTCGGACCTGGCTGCGCAGAAAGCAGCCACCCAGAAGCTTCAGGCAGACCTGCAGGACTGCAAAACAGAGAACGACAGGTTACAGTTGCAGTTGAACGAAACGAGGAGCGAGTTGGAGAGGCTGGAAGGAGATGCCGATCGGCTGAACGCCGAGGCGGAGCGGCTGAAGGCGGCGCTCGCGGCCGCTGAGGCGCGAGTGAAAGCTCTGGAGGACCAGTTGGCCGAGCTGACCGATGAGAAACAGGAGTTGCTGAACGAGCTCTACCGTTTAAGGGAAGAGGTGGACAATCTGCGGAAGGAAATGGAGAAGGAGAAGCTGGCGAGGGACGCGGCCGCGAAGGAGCTGGACGCTTTGAGGGCGGAGCTGAACGCCCTGAAGGCGGAATTGAGCAAGACTCGCGGGGAAAATGAAAAGTTGAAGAACGAGATGAAGAAGCTGAAGGGGGATCTGGAGAATGTGAACGACCAGCTGGAGACGTTGAGGGGCGAGAACGCTGCATTGAATGGTGAGAACGGAAGGCTGTCGTCCGAGTTGGCTGCGATCAAGGTCAAGCTGACAGAGGCGGAAAATCGAATGGAGAATCTCGAGGGCGAGAGGGACGGCTTGCAGAAGAGGATCGCGGATCTGGAGGAGGCCGCGAAGGAACTGGGAATTCTGAAGAAACAATTAGAGGACGCCACGAACGAGCTGAATAGATTGAGGCCGGAGCTCGAGAAACTGAAGACGGAGCTGGAGACGTTGAAGGGCGAGAATGCTGCATTGAAGGATGAGAATGGGAAGCAGGCGTCCGAGTTGACTGCGATCAGGAGCAAGCTGAACGAGGCAGAAAATCGAATGAAGGACCTCGAGGGCGAGAGGGACGGCTTGCAGAAGAGGGTCGCGGATCTGGAGAAGGTCGCGAAGGAACTGGGAATTTTGAAGGAACAATTAGAGGACGCCACGAACGAGCTGAATAGATTGAGGCCGGAGCTGAAGAAAGCGAAGGTCGAGCTGGAGAAAGCGAAGACGGAGCTGGAGAACTTGAAGACAGAGAACGGGAGGCTGCAGGACGAGCTAAACGAGGCGGAAAATAAATCGAAAGGATTAAGGGATGATTTGGCGAAGCTGAGAGCGGCCAACGATAAGTTGAAGAATGATAACGATAGTTTGCAGTCCGATATCAGTGCGTTGAAGGCGGAGGGGGATAAACAGAAGGATTATGACGCTCAGTTGCAGAACGGCCTGGGCGAGTTGAGGAAGGAGTATGACGAGCTGCAATCCGCGAACGCAGAATTGAAAGACGAGTTGAACCTCTGCTGGGAGGAGAACGAAAAATTAAAAGACGAGTTGAACCACTGCCGGAAGGAGAACGAAAAATTGCGCGGGGTTCTGGACGAGTTGCAGAATGAGCATACCAGGCTGAAGCGGGCTTCGGTGAGGACGGTGGCGCCAAAGGAGACCGAAATGGACAAAGATATTGTGGATGACTGCGGGGATTTCATTAAGGCGAACGAATTGCTCAGGGGAAAATTTGAAAACCAAAATCAAGGTGTACAACGGGTTCGAGATTACATCAGTTTTTTGCAAGGTAAAGTTAAGGAGGAACCCAAAATGGCGGCCAAACCAGAGGAAGGTATGGCGACAGATCCACAAATGAGGCAAGATATAGCAGACCTGTTGGAGAAATCTCATATGCTGTCCGAAAATATTTATCGAACAGAAAAGGAGATACAAAATATCGCTAACTTATTGAAGCAACTGCAGGGGGTTGGGGACCAACCAGTGGCCTTCGATGCTGATTCTTGGCTAAACTCACTAACTCTGACACAGTTAGCGGAGCTTCACGATAAAATTTGCCTGCTGACATCGGGCGTGGTGCAGCAAGATAGCGGCGCCGTGATGTGCGACTCCGTGGCTAACCGGGAGCCAGCGGATCCACTGAATGCAGATTACAATATTTTGAACAAACGGATAGCCGCTTTGCAGAGACAGATAGCGGAGAAGCAAATAGAAGCTGGATTGAAGCTGCAGGAGCTGAAGCGAGCTCTCCGCCAGGAGCAGGCTAACCTGATTCGATTCTCCGAGGAAATGAATTTAGAGAGAAGACGCAATTTAGCCCTTCAGCTCGCCATTGATGATTCAGCCTAA
Protein Sequence
METRKALSVCRCGCNASSSRESIDPPNEPCCCCSYNPFSDNCKESEIYDLSFALRKLSVMKCQMKKWRMERLQLESENRSLKQALQPFGVNADEILKPDPLLVHSREEIERLQNANAALEDKVVDLEETLAERDCCDDPCSSVHCLREKMRFLREGFALEKKEFRDMISHLKLKIVEVEEDVSCPAIYRLRAKLRDLMKGGQMPEQPVSKVVERSIETLGDLSKSCDDLRAENDRLLAELEELRRALAGLEEKELPAPDIQYRAIPQKTAETTTVPEYIDVSELTQKLKDCEEAVADLRRQLDEKNSIIDSLNKDLETMVSQRDLLDEIAAMKAELQRRDDKIGDLLNQLRQSEIDLLGLGNLKSQVDELQPHLQGLEAEKGQLLDELNKLRNALAERSDQIIDILEQKQKLEKDCADKMAALQAKMDEWLAREAALNTEIDDLKGRVAGLLEEIERCKARDAELEQCSVDRDALAKTLQVMEEELAALRDDLASAAATTASHQTELDTLRGEKEELLKELNEIRGQVEALTSQLAEERAAKEALQRYLESGRDELEKLREENSSLKEQLGAATEDNAKLQAAMKSLAEENESLKGELESLKTENKGLADRLEELENLNNVLRNDYENVKRALDNLEAEINRLEGLLSKTAEERDALLNENGDIKRRLEEAVAENDGLKAELNDIGQRLDKFGLENEELKKNLEAMNTENDSLKQDIGGLRSELDGARKEAESLKAAGDSLRAADEGAKAEAERVAAEIGHLRNEKDRLTNENADLTAKNLQLGSELDDAKQNVEKFKADNAALQEELDRLRQELEKSKNAMDQLKSELATSKEALDRCVAEMEKLKNEANDLKSRNEALQAEIEKCKAEIEKCKAEGDALRGDLEKTRLELSDAKGQLDKATDELKKLKEENSSLDDNLKRLIAENDALQSEAEGLMKEAEKLKEEAAKLQKQTAQGQGEGAALKAEIDGIKSERDALAGELSRLKDEHGAILSERDKLSKQVADCNAENEKLRAERGQLEAENEKLKGDMKTCEEENGRMKAELEQLAEQVGSLNDQLNKLRAQLADAEKKIQSLEPLVTRLQSENDKLRNDVQALQGEMNELQANMGIERDDSQKMQADMVMLEDKVQDLIAKLDEARAQNQALKQENQDLRAKLQDVEQELSSLKTECADAKTEIANLKKLVADLQAQIAKLQADVEYWKMENCKDKLEVDKLKADLEKALKDLSECQTLRAAQEAELNRLLSEKSALDKKLGDLTSQVEEQKKNLADGEAAKGKGDVVVAALKSELAALKKELENLRADNKRYKSQADDLAKQLAALKNELDACKEEVAALRDSNSALRAEVNGLGVEKNKLQDERNQLKSQVDALKAENATLDEDRKKLAEESNKLRGAGDGQRLEIDTLRSDLAAQKAATQKLQADLQDCKTENDRLQLQLNETRSELERLEGDADRLNAEAERLKAALAAAEARVKALEDQLAELTDEKQELLNELYRLREEVDNLRKEMEKEKLARDAAAKELDALRAELNALKAELSKTRGENEKLKNEMKKLKGDLENVNDQLETLRGENAALNGENGRLSSELAAIKVKLTEAENRMENLEGERDGLQKRIADLEEAAKELGILKKQLEDATNELNRLRPELEKLKTELETLKGENAALKDENGKQASELTAIRSKLNEAENRMKDLEGERDGLQKRVADLEKVAKELGILKEQLEDATNELNRLRPELKKAKVELEKAKTELENLKTENGRLQDELNEAENKSKGLRDDLAKLRAANDKLKNDNDSLQSDISALKAEGDKQKDYDAQLQNGLGELRKEYDELQSANAELKDELNLCWEENEKLKDELNHCRKENEKLRGVLDELQNEHTRLKRASVRTVAPKETEMDKDIVDDCGDFIKANELLRGKFENQNQGVQRVRDYISFLQGKVKEEPKMAAKPEEGMATDPQMRQDIADLLEKSHMLSENIYRTEKEIQNIANLLKQLQGVGDQPVAFDADSWLNSLTLTQLAELHDKICLLTSGVVQQDSGAVMCDSVANREPADPLNADYNILNKRIAALQRQIAEKQIEAGLKLQELKRALRQEQANLIRFSEEMNLERRRNLALQLAIDDSA

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00085909;
90% Identity
-
80% Identity
-