Basic Information

Gene Symbol
-
Assembly
GCA_963932275.1
Location
OZ010731.1:47054594-47069092[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 43 0.27 4.4e+02 2.0 0.4 30 55 60 85 54 92 0.60
2 43 0.41 6.8e+02 1.4 1.7 33 56 108 131 106 133 0.79
3 43 0.0039 6.4 7.9 1.4 30 60 222 252 217 255 0.91
4 43 0.42 6.9e+02 1.4 0.8 28 50 298 320 295 335 0.73
5 43 2.3 3.8e+03 -1.0 0.3 36 58 344 366 321 370 0.56
6 43 0.0016 2.7 9.1 1.4 24 57 380 413 377 420 0.85
7 43 0.0073 12 7.0 1.7 25 59 448 482 434 488 0.75
8 43 3.8 6.3e+03 -1.7 0.2 33 60 480 507 475 510 0.77
9 43 0.0004 0.66 11.1 3.1 25 63 514 552 512 553 0.91
10 43 0.009 15 6.7 4.8 24 57 555 588 554 596 0.89
11 43 0.00071 1.2 10.3 4.8 27 65 593 631 589 631 0.91
12 43 0.016 26 5.9 2.2 24 64 618 658 617 659 0.93
13 43 0.065 1.1e+02 4.0 5.5 28 63 650 685 645 701 0.73
14 43 0.00082 1.4 10.1 1.1 26 64 711 749 705 750 0.87
15 43 0.00073 1.2 10.2 1.8 28 62 762 796 755 806 0.59
16 43 0.00093 1.5 9.9 6.6 25 64 794 833 792 834 0.92
17 43 0.0036 6 8.0 7.8 27 62 810 845 804 848 0.67
18 43 0.011 18 6.4 4.2 29 64 826 861 825 862 0.91
19 43 5.1e-05 0.084 13.9 2.5 26 63 858 895 853 897 0.89
20 43 4.4e-05 0.072 14.1 7.2 21 64 916 959 915 960 0.93
21 43 0.00093 1.5 9.9 0.7 22 64 959 1001 958 1002 0.93
22 43 1.2e-05 0.02 15.9 4.1 27 60 1013 1046 1008 1050 0.91
23 43 0.0013 2.1 9.4 4.5 27 65 1041 1079 1040 1079 0.92
24 43 3.7e-05 0.061 14.4 1.5 26 61 1089 1124 1086 1127 0.93
25 43 1.3e-05 0.021 15.9 6.2 25 64 1144 1183 1131 1184 0.73
26 43 0.23 3.8e+02 2.2 1.1 25 64 1200 1239 1197 1240 0.85
27 43 0.6 9.9e+02 0.9 4.5 32 64 1247 1281 1239 1288 0.63
28 43 0.006 9.9 7.3 2.7 32 58 1298 1324 1290 1328 0.58
29 43 0.00089 1.5 10.0 1.2 31 64 1325 1358 1318 1359 0.88
30 43 0.0019 3.2 8.9 6.0 21 59 1371 1409 1370 1411 0.85
31 43 0.00052 0.86 10.7 6.9 24 61 1430 1467 1427 1484 0.82
32 43 6.2e-06 0.01 16.9 5.9 25 63 1487 1525 1485 1527 0.93
33 43 0.31 5.1e+02 1.8 4.9 23 64 1527 1568 1525 1569 0.78
34 43 2.7e-05 0.045 14.8 2.0 21 64 1567 1610 1566 1611 0.94
35 43 0.011 19 6.4 1.9 29 64 1610 1645 1607 1649 0.82
36 43 0.64 1.1e+03 0.8 10.2 22 63 1655 1696 1641 1705 0.63
37 43 0.028 47 5.1 2.4 29 63 1704 1738 1699 1740 0.86
38 43 0.81 1.3e+03 0.5 7.7 29 65 1735 1771 1730 1771 0.91
39 43 0.031 51 5.0 6.4 21 64 1755 1798 1754 1799 0.92
40 43 0.0019 3.1 8.9 0.5 28 60 1790 1822 1786 1827 0.90
41 43 0.0003 0.49 11.5 7.8 28 63 1839 1874 1825 1876 0.92
42 43 0.02 33 5.6 2.8 36 63 1868 1895 1867 1897 0.73
43 43 9.4 1.6e+04 -2.9 1.0 36 51 2103 2118 2084 2124 0.69

Sequence Information

Coding Sequence
ATGGAGACGCGGATGGCTCTGTCGCTTTGCCGGTGCGGCTGCAACGCCTCGTCGTCCCGCGAATCGATTGGTCCGCCGAACGAGCCGTGCTGTTGCTGCAGTTACAACCCCTTCAGCGACAATTGCAAAGACTCGGAGATCTACGACCTCCCGTTTGCCCTGAGGAAGCTGAGCGTAATGAAGTGTCAGATGAAGAAGTGGCGAATGGAGCGGCTGCAGCTCGAGAGCGAAAATAGGTCTCTGAAACAAGCCCTCCAGCCATTCGGTGTAAACGCGGATGAGATATTGAAGCCTGATCCGCTGCTGGTGCACTCGCGGGAAGAAATCGAAAGGCTGCAGAATGCAAACGCGGCCCTCGAAGATAAAGTGGTGGATCTAGAAGAAACCCTCGCCGAACGAGATTGCTGCGACGATCCGTGCTCGTCGGTGCACTGTCTCCGGGAGAAGATGAGATTCCTGCGGGAGGGTTTCGCTCTCGAGAAGAAAGAACTCAGGGACATGATATCGTATCTGAAGTTGAGACTCGTGGAGGTAGAGGAGGACGTCAGCTGCCCGGCGATATACCGCCTGAGGGCGAAGCTCCGGGACCTGATGAAGGGCGGTCAAATGCCGGAACAACCTGTCGCGAAGGTCGTGGAGAGATCGATAGAGACGTTGGGGGACCTCTCCAAGAGCTGCGACGATTTGCGCGTGGAGAACGATCGTCTTCTGGCCGAGCTGGAGGTGCTGCGTCGCGCGCTGGCGGGCCTCGAAGAGAAGGAGCTGGCGGGCCCTGAAGAGAAGGAGCTGGCGGGCCTTGAAGAGAAGGAGCTGTCCGCTCCCGATATTCAGTATCGAGCGATGCCGCAGAAAACGGCCGAAACCACCACGGTACCGGAGTACATAGACGTCTCAGAGCTGACGCAGAAGCTGAAGGACTGCGAGGAGGCCGTGGCCGACTTGAGGAGGCAGTTGGACGAGAAGAACAGCATCATCGATTCGTTGAATAAAGACCTGGAGACTATGGTCAGCCAGAAGGATCTGCTTGACGAGATCGCGGCCATGAAAGCGGAGCTGCATAGGAGAGAGGACAAGATCGCCGAGCTGCTGAACCAGTTGAGACAGTCAGAGATAGACCTGCTCGGTCTGGACCACTTGAAATCCCAGATCGACGACCTGCAGCCACAATTACAGGATCTCGAGGGGGAGAAGGGTCAGCTTCTAGACGAGCTGAATAAATTACGGAACGCGTTGGCCGAGAGGAACGACCAGATAATCAACATACTGGAGCAGAAGCAAAAGGTCGAGAAGGACTGcgcggacaagatggcggccctGCAGGCCAAGTTGGACGAGTCGTTGGCTCGCGAGGCTGCCTTGAACACGGAGGTCGACGACCTGAAGTCGCGGGTGGCGGCTTTGCTGGACGAAATAGAGCAGTGTAAAGCGCGAAACGCGGAGCTGGAGGAGTGCTGCGTGGACAGGGACGCCCTCGCGAAGACGCTGCAGGTGATGGAGGAGGAGCTGGCGTCGTTGCGAGGCGACCTCGCGTCCGCGACAGCTAGAATAGCGAATCTCCAAACAGAGCTAGACACGTTGAAGGGCGAGAAAGAGGAGTTGCTGAAGGAGCTGGACGAGATCAGGGGGCAAGTGGAGGCACTGACCAGCCAGCTGGCAGAAGAGAGGGCGGCCAAGGAAGCCCTGCAGAGGGACCTGGAAAGTGGCCGGGACGAGCTTGAACAGCTACGGGCGGAAAATGCGAGCCTGAAGGAGCAGCTCGGCGCAGCGGCGGAGGACAACGCGAAACTTCAGGCAGCGATGAAGAGCCTGGCTGAGGAGAACGAAAGTTTGAAAGGGGAGCTCGAGAGCCTGAAGGCGGAAAACAAGGGGCTGGCGGACAGGCTGGAGGAGCTGGAGAATCTGAATAACGTCTTGAGGAGCGATTACGAAAACGTGAAAAGGGCGCTGGACAATCTCGAGGCAGAGGTCAACAGACTGGAGGATCTGTTGAGCAAGGCGACCGAGGAACGCGACGCTCTGCTGAATGAAAATGGGGATCTTAAGAGACGATTGGAAGAGTCCCTGGCGGAGAACGATGGTCTGAAGGCTGAATTGAACGACATTGGCCAACGACTCGATAAATTGGGATTAGAGAACGAGGGGCTGAAGAAGAACCTCGAGGCGATGAAGACTGAGAACGATTCTCTGAAGCAGGATATTGGGGCGCTGAGAAGCGAGCTGGATGGTGCGAGGAAGGAAGCAGAGGCGCTGAAAGCTGCTGGGGACTCGCTGAGGGCGGCGGACGAGGGGATGAAGGCGGAAGCTGAAAGACTGGCGGCAGAAATCGGCGACCTAAGGAATGAGAAGGATCGTTTGACGAACGAGAATGCGGATCTGATGGCGAAAAATTTGGAGCTGGGACGTGAGCTCGACGAGGCCAAGAAGGGCGTGGAGAAATTGAAGGCCGACAATGCTGCTTTGCAGGAAGAGTTGGATCGTCTGAGGCAGGAATTGGAGAAGTCGAAGAATGAGATGGACCAACTGAAGTCAGAGCTGGCGACGTTGAAAGAAGCCTTGGACAAGTGCGTGGGCGAGATGGAGAAGCTGAAGAACGAAGCCGACCAGCTCAGATCGAAGAACGACGCTCTGCAGGCGGAGATCGAGAAGTGCAAGGCGGAGGCAGACGCTCTGAGGGGCGACCTCGAGAAAACGAGGTTAGAATTGAGCGATGCGAAAGGGCAGCTGGAAAAGGCGAACGACGAGCTGAAGAAGCTAAAAGTAGAGAACAGTTCCCTGGACGCAAACTTGAAGCGACTGATCGCTGAGAACGACGCGCTGCAATCAGAGGCTGAGGCGCTGAGGAAAGAGGCGGAGAAGCTGAAGGAGGAAGCTGCGAAGCTGAAGGAGGAAGCTGCGGAGCTCCAGGCGCAGGCGGCACAAAGGCAGGGCGAAGGGGCCGCGTTGAAGGCTGAAATTGATGGGATTAAGAGCGAAAGAGACGCTCTAGCGGGGGAATTAAGCAGGCTGAAGGACGAGCATGGTGCTTTGCAGAGCGAGAGGGATAAATTGAGCAAACAAGTGGCCGATTGTAATGCGGAGAACGAGAAACTGAGAGCAGAGAAGGGCCAGTTGGAGGTTGAAAATGAGAAACTCAAGGGAGACATGAAGGCGTGCGAGGAAGAAAACGGGAGGTTGAAGGCTGAGTTGGAGCAGCTGGCGGGGCAAGTGGCGTCGTTGAATGACGAGTTGAATAAACTGAGAGCTCAACTGGCTGACGCGGAGAAGAAAATCCAGTCCCTCGAGCCTCTGGTGACTCGTTTGCAGAGCGAGAATGATAAATTGCGAAGCGACGCGCAAGCTTTGCAGGGCGAAATGAACGAGCTGAAGGCAAACATGGGGAAAGAACGGGATGGCAATCAGAAAATGCAGGCGGACATGGTGATGTTGGAGGACAAGGTGCAGGATCTGATCGCGAAACTGGACGAGGCCAGGGCGCAGAACGAGGCTCTGAAACAGGAGAATCAGGAGCTTCGTGCAAAGCTGCAGGATATGGAGCAGGAATTAGCTAGCTTGAGAACGGAATGTGCTGATGCGAACACGGAGATCGCGAATCTGAAGAAGCTAGTAGCTGAGCTGCAAGCACAAATCGCTAACCTCGAGGCAGATGTAGAGTATTGGAAAATGGAGAACTGCAAAGACAAGCTAGAAGTGGATAAACTGAAGGCTGATCTGGAGAAAGCGTTGAAGGACTTGAGCGAGTGTCAGGCACTAAGAGCAGAacaagaggcagaattgaatcGTCTCCGCGGCGAAAAGTCCGCGCTGGATAAGAAGCTCGCCGACTTGACGTCGCAAGTGGAGCAGCAGAAGAAGGACCTGGCAGACGGAAAGGCAGCGAAGGATAAGGGCTCTGCAGAGGTCGAAGCCCTTAAAAGTGAGCTCGCAGCATTGAAGAAGGAACTGGAGAACCTGAGAGCAGATAACAACAAATACAAAACCCAAGCGGACGATCTGGCGAAGCAGCTCGCGGCGCTGAAGAACGAGCTCGATGCCTGCAAAGAAGAGGTGGCCGCGCTGAGAGACAGCAACAGCGCCTTGAGGGCAGAGGTGAACGCGCTGGGCATCGAGAAGAACAAGTTACAGGACGAGAGGAACCAACTCAAGTCCCAGGTGGACGCTCTGAAAGCAGAGAACGCGACTCTCGACGAGGATCGTAAAAAGCTGGCAGAGGAGTCGAACAAGCTGAGAGGCGCTGGGGATGGCCAAAAAGTAGAAATCGATACCCTCCGGTCGGACCTGGCTGCACAGAAAGCAGCTTACCAGAAGCTGCAAACAGACCTGCAGAACTGCAAAACGGAGAACGAGAGGTTGCAATCGCAATTGAACGAACTGAGGAGCGAGTTGGAGAGACTGAAAGGAGAAGCCGATCGGCTGAGCGCCGAGGCGGAGCGGCTGAAGGCGGCGCTCGCAGACGCTGAGGCTCGAGTGAAAGCTCTGGAGGGCCAGTTGGCTGACCTGACGGCTGAGAAACAGGAGTTGCTGAACGAGCTCTATCGTTTAAGGGAAGAGGTGGACAATCTGCGAAAGGAAATGGAGAAGGAGAAGCTGGCGAGGGACGCGGCCGCGAAGGAGCTGGACGCTTTGAGGGCGGAGCTGAACGCCCTGAAGGCGGAATTGGGCAGGACTCGCGGGGAGAACGAGAAGTTGAAGAACGAGATGAAGAAACTGAAGGAGGATCTGGAGAATCTGAACGACCAGCTGGAGACGTTGAGGGGCGAGAACGACGCATTGAAGGGTGAGAATGGGAGGCTGTCGTCCGAGTTGACTGCGATCAAGGGCAAGCTGAAGGAGGCGGAAGATCGAATAAAGGATCTCGAGGGCGAGAGGGACGGCTTGCAGAAGAGGATCGCGGATCTGGAGGAGGCCGCGAAGGAGCTGGGAACGTTGAGGAAGCAATTAGAGGATGCCGAGAACGAGCTGAATAGATTGAGGCCGGAGCTGGAGAAGCTGAAGACGGAGCTGAAGAACGCGAAGACGGAGCTGGAGGCGTTGAAGGGCGAGAACGCTGCATTGAAGGATGAGAATGGGAAGCTGTCGTACGAGTTGACAGCGATCAGGGGCAAGCTGAAGGAGGCGGAAGATCGAACGAAGGATCTCGAGGGCGAGAGGGACGGCTTGAGGAAGAAGATCGCGGATCTGGAGGAGGCCGCGAAGGAACTGGAAATGTTGAGGAAACAATTAGAGGATTGCGAGAACGAGCTGAATAGATTGAGGCCGGAGCTGAAGAAAGCGAAGGCGGACCTGAAGAAAATGAAGACCGAGCTGGAGAAATTGAAGACGGAGAACGGGAGGCTGCAGGACGAGCTAACCGAGGCGGAAAATAAATCGAAGGGATTAAGGGATGACTTGGCGAAGCTGAGGGCGGCCAACGATAAGTTGAAGAATGATAACGCCAGTTTGCAGTCGGATATAGCTGCGTTGAAGGCGGAGGGAGATAAACAGAAGGATCATGACGCTCAGTTGCAGGGCGGCTTGGACGAGCTGAAGAAGGAGTATGACGAGCTGCAAGCCGCGAATGCGCAATTGAAAGACGAGCTGAACCTCTGCCGGGAGGAGAACGAAAAACTGAAAGACGAGTTGAACCACTGCCGGAAGGAGAACGAACGATTGCGCGAGGAGCTGGACGGTTTGAAGGATGAGCATACCAGGCTGAAGCGGGCTTCGGTGAGAACGGTGGCGCCAAAGGAGACGGAAATGGACAAAGATATTGTGGATGACTGCGGGGATTACATTAAGGCGAATGAATTGCTCAAGGGAAAATTTAGAAAGCAAAATCAAGGTGTACAACGGGTTCGAGATTACATCACTTTCTTGGAAGGTAAAGTTAAGGAGGAACCCAAAATGGCGGATAAACTAGAGGAAGATCTGGCGACAGATCCACAAATGAAGCAAGACATAGCAGACCTGTTCGAGAAATCTCATATACTGTCCGAAAATATTTATCGAACAGAGCAGGACATACAAAATATCGCTGACTTATTGAAGCAACCGCAGGAGGGTGAAGTCACGGACCAACCGACGCCAACAACGGCCTTCGATCCTGATTCTTGGCTAAACTCACTAACTCTGACACAGTTGGCGGATCTTCACGATAAAATTTGTCTGCTGACATCAGGCGTGGTGCAGCAAGATACCGCCGCCGTGCTGTGCGACTACGTGGCTGACCGGGACCCGGCGGATCCACTGAATACAGATTACAATATTTTGAACAATCGGATAGCCGCTTTGCAGAGACAGATAGCGGAGAAGCAAATAGAAGCTGGATCGAGGCTGCAGGAGCTGCAGCGAGCTCTCCGCCAGGAGCAGGCTAACCTGATTCGAATCTCCGAGGAAATGAATTTAGAGAGAAGACGCAATTTAGCCCTTCAGCTCACCATGGATGATGCAGCCTAA
Protein Sequence
METRMALSLCRCGCNASSSRESIGPPNEPCCCCSYNPFSDNCKDSEIYDLPFALRKLSVMKCQMKKWRMERLQLESENRSLKQALQPFGVNADEILKPDPLLVHSREEIERLQNANAALEDKVVDLEETLAERDCCDDPCSSVHCLREKMRFLREGFALEKKELRDMISYLKLRLVEVEEDVSCPAIYRLRAKLRDLMKGGQMPEQPVAKVVERSIETLGDLSKSCDDLRVENDRLLAELEVLRRALAGLEEKELAGPEEKELAGLEEKELSAPDIQYRAMPQKTAETTTVPEYIDVSELTQKLKDCEEAVADLRRQLDEKNSIIDSLNKDLETMVSQKDLLDEIAAMKAELHRREDKIAELLNQLRQSEIDLLGLDHLKSQIDDLQPQLQDLEGEKGQLLDELNKLRNALAERNDQIINILEQKQKVEKDCADKMAALQAKLDESLAREAALNTEVDDLKSRVAALLDEIEQCKARNAELEECCVDRDALAKTLQVMEEELASLRGDLASATARIANLQTELDTLKGEKEELLKELDEIRGQVEALTSQLAEERAAKEALQRDLESGRDELEQLRAENASLKEQLGAAAEDNAKLQAAMKSLAEENESLKGELESLKAENKGLADRLEELENLNNVLRSDYENVKRALDNLEAEVNRLEDLLSKATEERDALLNENGDLKRRLEESLAENDGLKAELNDIGQRLDKLGLENEGLKKNLEAMKTENDSLKQDIGALRSELDGARKEAEALKAAGDSLRAADEGMKAEAERLAAEIGDLRNEKDRLTNENADLMAKNLELGRELDEAKKGVEKLKADNAALQEELDRLRQELEKSKNEMDQLKSELATLKEALDKCVGEMEKLKNEADQLRSKNDALQAEIEKCKAEADALRGDLEKTRLELSDAKGQLEKANDELKKLKVENSSLDANLKRLIAENDALQSEAEALRKEAEKLKEEAAKLKEEAAELQAQAAQRQGEGAALKAEIDGIKSERDALAGELSRLKDEHGALQSERDKLSKQVADCNAENEKLRAEKGQLEVENEKLKGDMKACEEENGRLKAELEQLAGQVASLNDELNKLRAQLADAEKKIQSLEPLVTRLQSENDKLRSDAQALQGEMNELKANMGKERDGNQKMQADMVMLEDKVQDLIAKLDEARAQNEALKQENQELRAKLQDMEQELASLRTECADANTEIANLKKLVAELQAQIANLEADVEYWKMENCKDKLEVDKLKADLEKALKDLSECQALRAEQEAELNRLRGEKSALDKKLADLTSQVEQQKKDLADGKAAKDKGSAEVEALKSELAALKKELENLRADNNKYKTQADDLAKQLAALKNELDACKEEVAALRDSNSALRAEVNALGIEKNKLQDERNQLKSQVDALKAENATLDEDRKKLAEESNKLRGAGDGQKVEIDTLRSDLAAQKAAYQKLQTDLQNCKTENERLQSQLNELRSELERLKGEADRLSAEAERLKAALADAEARVKALEGQLADLTAEKQELLNELYRLREEVDNLRKEMEKEKLARDAAAKELDALRAELNALKAELGRTRGENEKLKNEMKKLKEDLENLNDQLETLRGENDALKGENGRLSSELTAIKGKLKEAEDRIKDLEGERDGLQKRIADLEEAAKELGTLRKQLEDAENELNRLRPELEKLKTELKNAKTELEALKGENAALKDENGKLSYELTAIRGKLKEAEDRTKDLEGERDGLRKKIADLEEAAKELEMLRKQLEDCENELNRLRPELKKAKADLKKMKTELEKLKTENGRLQDELTEAENKSKGLRDDLAKLRAANDKLKNDNASLQSDIAALKAEGDKQKDHDAQLQGGLDELKKEYDELQAANAQLKDELNLCREENEKLKDELNHCRKENERLREELDGLKDEHTRLKRASVRTVAPKETEMDKDIVDDCGDYIKANELLKGKFRKQNQGVQRVRDYITFLEGKVKEEPKMADKLEEDLATDPQMKQDIADLFEKSHILSENIYRTEQDIQNIADLLKQPQEGEVTDQPTPTTAFDPDSWLNSLTLTQLADLHDKICLLTSGVVQQDTAAVLCDYVADRDPADPLNTDYNILNNRIAALQRQIAEKQIEAGSRLQELQRALRQEQANLIRISEEMNLERRRNLALQLTMDDAA

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00085909;
90% Identity
-
80% Identity
-