Basic Information

Gene Symbol
-
Assembly
GCA_033822845.1
Location
JAOPTO010001043.1:411607-422814[-]
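
The Location value above packs contig, start, end and strand into one string. A minimal sketch of splitting it into fields follows. The names `Location` and `parse_location` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "contig:start-end[strand]".
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'contig:start-end[strand]' string into typed fields."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("JAOPTO010001043.1:411607-422814[-]")

# Assumption: coordinates are 1-based and inclusive at both ends.
span = loc.end - loc.start + 1
print(loc, span)
```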

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 44 0.47 5.6e+02 1.3 0.2 25 48 56 79 53 86 0.62
2 44 0.63 7.5e+02 0.9 1.5 33 55 102 124 99 127 0.61
3 44 0.00014 0.17 12.6 2.9 28 62 214 248 208 250 0.90
4 44 0.27 3.2e+02 2.1 2.6 27 63 268 304 267 306 0.93
5 44 0.017 20 6.0 3.4 24 57 351 384 348 391 0.77
6 44 0.0014 1.6 9.5 5.2 24 64 404 444 403 445 0.96
7 44 0.12 1.5e+02 3.2 6.3 26 64 489 527 472 528 0.85
8 44 0.11 1.3e+02 3.4 1.7 15 62 557 608 537 611 0.72
9 44 0.084 99 3.7 0.8 26 60 607 641 604 663 0.74
10 44 0.45 5.4e+02 1.4 3.8 32 63 665 699 657 700 0.83
11 44 0.0027 3.2 8.5 8.4 24 65 702 743 701 743 0.94
12 44 0.0013 1.6 9.5 8.0 30 64 729 763 728 764 0.94
13 44 0.0011 1.4 9.7 5.4 22 64 770 812 768 813 0.94
14 44 0.0002 0.24 12.1 4.7 23 64 799 840 797 840 0.93
15 44 0.00033 0.39 11.5 13.9 20 64 824 868 821 869 0.95
16 44 0.55 6.5e+02 1.1 8.6 30 64 862 896 861 897 0.86
17 44 0.00076 0.9 10.3 5.7 23 64 904 945 902 946 0.93
18 44 0.0064 7.5 7.3 4.7 24 62 947 985 944 988 0.88
19 44 0.029 34 5.2 6.9 21 65 965 1009 965 1019 0.94
20 44 0.0039 4.6 8.0 9.4 24 63 1031 1070 1023 1071 0.85
21 44 0.0023 2.7 8.8 6.7 28 63 1063 1098 1060 1100 0.87
22 44 0.00019 0.23 12.2 4.6 21 62 1098 1139 1097 1144 0.93
23 44 8.7e-05 0.1 13.3 4.0 28 64 1147 1183 1143 1184 0.91
24 44 4.3e-05 0.051 14.3 8.2 26 63 1166 1203 1164 1205 0.92
25 44 0.0064 7.6 7.3 14.6 24 65 1178 1219 1178 1233 0.95
26 44 0.0022 2.6 8.8 3.9 25 61 1235 1271 1231 1274 0.63
27 44 0.00051 0.6 10.8 6.7 24 64 1290 1330 1281 1331 0.88
28 44 0.00028 0.34 11.7 1.1 24 63 1318 1357 1317 1359 0.90
29 44 0.002 2.3 9.0 2.8 24 57 1353 1386 1346 1394 0.69
30 44 0.058 69 4.3 9.7 27 62 1398 1433 1386 1436 0.68
31 44 0.041 49 4.7 11.2 24 65 1444 1485 1425 1485 0.87
32 44 0.023 27 5.5 2.0 32 61 1487 1516 1483 1537 0.63
33 44 0.15 1.8e+02 2.9 0.5 24 60 1531 1567 1528 1572 0.80
34 44 0.07 83 4.0 10.2 24 57 1594 1627 1570 1634 0.61
35 44 0.00023 0.28 11.9 6.3 25 64 1630 1669 1628 1670 0.94
36 44 0.011 13 6.6 10.6 35 62 1703 1730 1672 1733 0.79
37 44 0.0007 0.83 10.4 6.9 29 64 1725 1760 1724 1761 0.82
38 44 0.02 24 5.7 4.8 29 64 1767 1802 1760 1805 0.79
39 44 0.11 1.3e+02 3.4 6.2 28 63 1797 1832 1794 1834 0.91
40 44 0.033 40 5.0 0.8 32 60 1836 1864 1832 1866 0.89
41 44 0.0023 2.7 8.7 4.8 19 46 1865 1892 1861 1894 0.87
42 44 0.00056 0.66 10.7 7.0 30 63 1890 1923 1889 1925 0.91
43 44 0.71 8.4e+02 0.8 0.5 29 63 1995 2029 1990 2030 0.90
44 44 6.4 7.6e+03 -2.3 1.7 36 51 2114 2129 2093 2137 0.59
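
Each row of the table above has 13 whitespace-separated fields, so it can be read into typed records and filtered. The sketch below uses three rows copied from the table. The names `DomainHit` and `parse_hit` and the cutoff `MAX_I_EVALUE` are hypothetical and chosen only for illustration; the record itself does not state a significance threshold.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int       # "#": hit number
    total: int       # "of": total number of hits
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated row of the per-domain table."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


# Three rows copied verbatim from the table above.
ROWS = """\
3 44 0.00014 0.17 12.6 2.9 28 62 214 248 208 250 0.90
24 44 4.3e-05 0.051 14.3 8.2 26 63 1166 1203 1164 1205 0.92
44 44 6.4 7.6e+03 -2.3 1.7 36 51 2114 2129 2093 2137 0.59
"""

hits = [parse_hit(row) for row in ROWS.splitlines() if row.strip()]

# Hypothetical cutoff, not taken from the record.
MAX_I_EVALUE = 1.0
kept = [hit for hit in hits if hit.i_evalue <= MAX_I_EVALUE]

# Highest-scoring hit among the parsed rows.
best = max(hits, key=lambda hit: hit.score)
print([hit.index for hit in kept], best.index)
```

Keeping the fields in a named tuple makes later steps, such as sorting by `score` or checking whether `ali_from`/`ali_to` ranges overlap, straightforward.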

Sequence Information

Coding Sequence
ATGGAGGTTTGCCGATGCGGATGCGGCGCATCCTCGTCGACGGAATCGATCAATCCGCCGAACGAGCCGTGCTGTTGCTGCAGTTACAATCCCTTCAGCGACAGTTCCAAAGAATCAGAGATTTACGACCTCTCGTTTGCCCTGAGGAAGCTCGGCGTAATGAAGTGTCAGATGAAGAAGTGGCGAATGGAAAGACTTCAGCTCGAGAGCGAAAATAGGTCTCTGAAACAAGTCCTCCAGTCATTCGGTGTAAACGCGGATGAGATATTGAAGCCCGATCCGCTGCTAGTGCACTCCCGGGAAGAAATCGAAAGGCTGCAAAATGCAAACGCGATGCTTGAAGATAAAGTGAGGGATCTGGAGGAAACTCTCGCGGAACGAGATTGCTGCGACTACCCCAGCGCCACGATCCACTTTCTCAGAGAGAAGATGAGACATCTCAGAGAACGTTTCGCACTTGAAAAGAAAGAATTACGGGATATAATATCAGATTTGAGGCTGAAACTGGTGCAAGCCGAGGAGGACGTCAGCTGCCCTGCGATATTTCGTTTAAGGGCGAAGCTGCGCGAACTGATGAAAGGCGTGCAACCCGCGGAACCACAGGTCACCAAAGTTGTGGAGAAATCGATCGAGACGTTGGTGGAGCTCTCGAGAAGTTGCGACGATTTACGGCTGGAGAACGATCGTCTTAAGGCCGAGGTGGCCGAGCTGCGTGCGGCGTTGGCCAAGCTTCAGGAGCAGGTAGCTCAGTCGGATATGATGAGAACGGTCGAAACAGCAACCGTGCCGGAGTACATTGATGTTTCCGAATTAACGCGAAAACTGAACGAGTGCGAGGACGTCGTGGCAGATCTAAGAAGGCAGCTGCAGGAGAAAGACAGGCTGATCGATGAGCTGAAGACAGAGCTGGAGTCGATGGTCAGCCTGGCCGGTGTGCAGCAACAGATAGAGGCTATGAAGGCGGAACTTAGAAAGAAAGATGATAAGATCACAGGCCTGCTGAATACTCTAAGACAGTCTGAGATAAATATGCTGGGGTTATCTAACCTGAAGTCCGAAATGGAGGATCTGAAGCCGCAATTAGAAGATCTTCAGGTGGAGAAGAACGAGCTGCTGATAGAGCTTGACAGATTGCGGAAGGCGTTGGAGGACAGGAACAACCAGATAATAGACATACTGGAACAGAAGAATAAGTTGGAGCAGGACTGCAAGGAGAAGATAGCAGCGTTACAGGCGAAACTGGACGAAGCGTCGGATCGAGAGGCGAACTTGATGCAGGAGATCGAGAATCTGAAACAGCAGATAGCCGACCTGAAGAACGAGCTGCAAGAGTGTCACAAACGAAACGCAGAGCTCGAAGAGTGTTGCAAGGACAGAGATGCCCTTTCGGAGAAATTGAGCGGGTTAGAGGGTGAACTCGCGGTCGCTAAGGCCCGAATAGCGGACCTCGAGAGGGAAGTGAGCGACAAAGAAGATTTGTTGAAGCAACTGGAAGAGGCGTTAAACCGAGAAGCGGGCTTGAAACAGGAGATCGAGGATCTGAAAGGACAAATAACCGATCTGAAGAACGAGCTGCAAGAGTGTCACAAACGAAGCGCAGAGCTCGAAGAGTGTTGCAAGGACAGGGATGCCCTTTTGGAAAAATTGAGCGGGTTAGAGGGTGAACTCGCGGCCGCTAAGGCCCGAATAGCGGACCTCGAAAGGGAAGTGAGCGACAAAGACGATTTGTTGAAGCAACTGGAAGAGACGTTAATTCGGGAAGCGGGCTTGAAACAGGAGATCGAGGATCTAAAAGGGCAAATATCCGATATGAAAGGGCAAATATCCGATATGAAAGGGCAAATATCCGATCTGTTAGGGCAAATAGCCGATCTGAAGAACGAGCTGGAAGAGTGTAAAAAGCGAAATGCACAGCTCGAAGATTGTTGCCTTGACAAGGATGCTCTTTCGGAGAAGTTACGCGGTTTACAGGATGAACTCGCGGCTGCCAAGGCCAGAAT
AGCGGATCTCGAGAAGGAGGTGAGCGACAAGGAAGACCTGTTAAAGCAGCTGGAGGATGCAAGGAAACAGATCGAGGCGCTGACCGCACAACTGGAGGACGAAAAGGCTGCTAGAAAAGCGTTAGAGGAGGAGCTGAGGGCTAGTCGAGATGAAAATGAGAAATTGCGAAACGAGAATGCAGATCTGAAGGATCAGGTCAACGCCGCGAAGGCGGAGAACGAGAAGCTTCGCGAGGAACTGAAGAAGATGGCCGAGGAGAATGATAAACTTCGCGAGGAGCTGAAGAAGAAGGACGAGGAGAACGAGAAGCTGAAGGCACTTGTCGAACAGTTGATGGACAAGATGAGTGAGCTTGAGGACTTGAATAAACAGCTGAGGGACGAATACGACAACATGAAGCAAGCGTTAGACAGTTTGGAGGCAGAGGTCAGCAGATTGGCGAACGAATTGAGCAAGATGACCCAAGAACGCGACGCATTGCTGAACGAGAACGACAACATGAAAAAGCAGCTGGAGCAAGCGCAGGCGGAGAACGCTAAACTTCGCGAAGAGTTGAAGAAGATGGCCGAGGAGAATGATAAACTTCGCGAGGAGCTGAAGAAGAAAGACGAGGAGAACGAGAAGTTGAAATCACAGATCGAACAGATGATGGAGAAGATGAAGGAGCTTGAGGACTTGAATAAACAGCTGAGGGATGATTACGATAACATGAAACAAGCGTTAGACAACTTGGAGGCAGAGATCAGCAGACTGGCGAACGAATTGAGCAAGATGACGCAGGAGCGCGACGCGTTGCTGAACGAGAACGACAACATGAAGAAACAGTTGGAGCAAGCGCTGGCTGAGAACCAGGGTCTAAGAGCTCAATTGGACGACGTTAGTAAGCAACTCGACAAATTGAAAGCGGAAAAGGACGAGTTACAGAAGAATCTGGATGCGATCAGGCTCGAGAACGATGCGCTGAAACGAGATATGAAGGCACTAAACGACGAGTTGGAGGAGTCGAGGAAACAGATAGAGCAGCTGAAAGCTGCTGGAGATGCGTTAAGGGCTGCGGACGAGGGCAAAGCTTCCGAGTTACAAAGCTTGCGACAAGCGTTGGAGAACCTACAAGCCGAAAAGGATCGGTTAACGAGCGAAAATGCCGATTTGAAGgctaaaaatttggaattgcAAGAAAGATTAGACAACGTTGCGAAGGAGTTGGAGAAAATGAAGGCGGAGAATGCCGATTTACTGGCCGAGGTGGCTCGTTTGAAAGAGGAATTGGAGAAAGCAAGAAACGAAATCGATCGATTGAAATCCGAGATGGATGCATTGAGGAAGGCGCTCGACAAGTGCACCGAAGAGATGGAAAGATTGAAGATCGAGAACAATGACCTTAAATCGGAGAATCAGGCTCTGAAGACTGAGATGCAGGGAATTAAAGGTGAAGGGGACGGTTTGGCGGTGGAGGTGAGAAATCTAAAGGACGAGAACGCTGCTTTgagaaacgagaaagaaaaattgagcAAACAATTAAGCGATTGCAAGGCGGAGAACGAAAGATTGAAAAGTGAAAAGGATCAGCTGGAGGCTGAGAACAAGAAGCTGCGAGATGATCTAAACTCGTGCAGCAAGGAGAACGCTCAGCTGAAGGACGAACTCGAGAAGTTGCGTGGACAATTGCAAGCGTTGAATGAGGaactaaataaagtgaagGATCAGCTTGCTCAAGCGGAGGAGAAGATTCGGTCTTTAGAGTCGTTGGTTGCTCGTTTGCAAAACGAACTTGCGGCGTTGAAAAAAGAGGCGGACGATCTGCGAGCGAAAGCCGACCGAGAAATGGGCGAAAATTCGAAAATGCGAAACGATTTGAAGATGTTGGAGGATCAGCTGCAAGATCTGAACAACAAATTGGACAACGCTAGAGCGGAGAACGAAGCATTGAAGAAGGAAAATGAAAATCTGAAAGCAAAGATGGCGGACATGAAGGATGAATTGGCGACCTTGAGGGCCG
AGTGCGAGTCTCTCAAGTCAGAGATCGCCgatttgaagaaattaatcgAGGACTTGAAAGCACAGATCGCTAGGCTAGAAGCGGACGTGGATCACTGGAAGATGGAGAACTGCAAATTGCAAATGGAAGTAGATAAACTAAAATCTCAACTGGAGAAGGCACTGAAGGATTTGAATGAATGTCAGGCTCTAAAAAAAGCGTTGGAGGAAGATTTGGCCCGACTGCAGAAAGAGAAAGCCGAGCTCGATAAGAAGCTTGCCGATCTAACGGCACAGATTGAGCAACAGAAGAAAGCACTGGAAGCTGAAAAATTGGCCAAAGGCAAGGACGCTGCAGAAATTGAGGCCCTGCGGAATGAGCTCGACGCGTTAAAGAAGGAACTTGAGAAACTGAGAGCGgaaaataacaaatacaagAACGAAATAAACGATCTAGAAAGACAACTTTCGGCTCTGCAAGGAGACCTTGACGCGTGCAGAGAAGAGGTGGCCGCGTTGAGAGACGCTAATATGACGTTGAAGTCCGAATTGGAATcactgaaaaatttgaagaacgaATACGACAAGTTGAAGGGTGAATTAGATGCTCTTAAAGTGGCGTACGAGAATCTGTTGCAAGACAGAAAGAATCTGGAGGACGAGGTCGGTAAATTGAGAGGGGAAGGTGACGGACAGAGAGCAGAGATCGACAGACTGCAGAAAGACTTGGCCGCGGAGAGAGCAGCTGCGAAGAAATTGCAGTCGGACTTGCAAAATTGTCAAGCTGAGAACGACAGAGTGCAAGCTCAACTGAACGGATTGAGAAACGAGTTGGATAAAATGAAGGCTGAGAATGATCGTCTGAAAAATGAGCTCGATGAACTGAAAAAACGGTTATCCGAAGCTGAAGCCAAGGTGAAGACCCTTGAAAAGGATCTTGCGGATTTGCTGGCCGAGAAGCAGGACTTGCTCAGCGAACTCTATCGTCTTCGCGATGAGTTGAATAACCTCAAGGCCGAATTGGAGAAAGTTAGAGCCGCGAAAGACGCGGCACTGAACGAGTTAAATGACCTGAAGAAGGAGTTAGCCGATCTGAGGGCGTTACTGGAGAAGGCTCGCGCTGACAACGAAGCTCTCCAGAGCGAGAACGAGAGGCTGAAAGCGGAGTTAGAGCCGTTGAGGAAACAGCTGAATGCGTTGAAGAACGAGAACGCTAATCTGAAGGACGAGAATGCTAATCTGAAGAAGGAGCTAGAGAACTTGAAGAATGAGATCGCAAAATTGGAAGCGGAATTAGCCGCGTTGAAGGATAAATTGAAGATGGCAGAGAAACGGTTGAACGATATGAAGAAAGAGAACGACGACCTGAGGAAGAAAATCGCCGATCTCGAGGAAACAGCGAAAGAGCTTGATGCTTTGAGGAAAAAGCTAGAAGATGCTGAGAACGAATTGAATAGACTTAGGCCGGAACTGAATAAACTGAAGGCAGAGAACGCCAAGCTTAAAGATGACTTGGACAAGGCCAACAAACAAGTGAAGCAGCTAACGGCTGATttggataaattgaaaaaggatTATAACTCTTTGACGGCTGAATTAGCCAAGGCGAGAGACGAGAGGGATCGTTTGAAGGAGGGTGATGCGGCGTTGGAAGAAGAGTTGGCGAAATTGAAGAAGGAAAACGACAAGCTGAAAACTCAGTTATTCGATTGCCAGGCGCAGAACGAACAGCTGAGGAAGGAATTGGAAGACTTGAAGAACAAGAATGCCGAACTGAAGGCCGCTCTCGCAGCGAAAGAGTCGGTGCCGAAGGAGACGGAAGTGGAGAAAGATTTGCTGGATGAATGCGAGGACTTCGTTAAGGCGAATAAATTATTGGGAGAGAAATTTGAAAAGCATAACGAAGGTGTGCATCGTGTTCGAGATTATATTACGTATTTGGAGGGTAAAGGTGATGAACCAAAAATGGCGGATCAAGTGGACGAGGAGCCAGAAATTGATCCAGAATTAAAGAAAGATATT
GCGGATTTGTTGAACAAATCTGAGATGTTATCTAAGAATATTTATGAAGCGGAACAGGAAATACAAAATCTTGCTGACTTACTGCAGAAGGATACCGGAGAAGAACCGATCTTCGATCCTGACTCTTGGCTAAGCTCGTTGACGCTGTCGCAGCTGGCGGATCTTCATGACAAGATTTGCATGCTAACATCGGACATGGTCCAACACGATACCAAGGCTGTGGTGTGCGATCAGGCCCGCGGGGAGGGGAACCCGCTAAGTTCGGACTATGACATTTTGAACAGACGGATAGCCGCTCTGCAGAAGCAGATAGCGGAGAAGCAAATGGAGGCAGGCTGGAAACTGCAAGAACTCAAGCAGGCTCTTCGAGCCGAGCAGGCTAACCTGATTCAACTCTCCGACCAAATGAACTTAGAGAGAAAACGCAATCTGAATATCCAGTACGACATTGACGATTCACTTCAGTAA
Protein Sequence
MEVCRCGCGASSSTESINPPNEPCCCCSYNPFSDSSKESEIYDLSFALRKLGVMKCQMKKWRMERLQLESENRSLKQVLQSFGVNADEILKPDPLLVHSREEIERLQNANAMLEDKVRDLEETLAERDCCDYPSATIHFLREKMRHLRERFALEKKELRDIISDLRLKLVQAEEDVSCPAIFRLRAKLRELMKGVQPAEPQVTKVVEKSIETLVELSRSCDDLRLENDRLKAEVAELRAALAKLQEQVAQSDMMRTVETATVPEYIDVSELTRKLNECEDVVADLRRQLQEKDRLIDELKTELESMVSLAGVQQQIEAMKAELRKKDDKITGLLNTLRQSEINMLGLSNLKSEMEDLKPQLEDLQVEKNELLIELDRLRKALEDRNNQIIDILEQKNKLEQDCKEKIAALQAKLDEASDREANLMQEIENLKQQIADLKNELQECHKRNAELEECCKDRDALSEKLSGLEGELAVAKARIADLEREVSDKEDLLKQLEEALNREAGLKQEIEDLKGQITDLKNELQECHKRSAELEECCKDRDALLEKLSGLEGELAAAKARIADLEREVSDKDDLLKQLEETLIREAGLKQEIEDLKGQISDMKGQISDMKGQISDLLGQIADLKNELEECKKRNAQLEDCCLDKDALSEKLRGLQDELAAAKARIADLEKEVSDKEDLLKQLEDARKQIEALTAQLEDEKAARKALEEELRASRDENEKLRNENADLKDQVNAAKAENEKLREELKKMAEENDKLREELKKKDEENEKLKALVEQLMDKMSELEDLNKQLRDEYDNMKQALDSLEAEVSRLANELSKMTQERDALLNENDNMKKQLEQAQAENAKLREELKKMAEENDKLREELKKKDEENEKLKSQIEQMMEKMKELEDLNKQLRDDYDNMKQALDNLEAEISRLANELSKMTQERDALLNENDNMKKQLEQALAENQGLRAQLDDVSKQLDKLKAEKDELQKNLDAIRLENDALKRDMKALNDELEESRKQIEQLKAAGDALRAADEGKASELQSLRQALENLQAEKDRLTSENADLKAKNLELQERLDNVAKELEKMKAENADLLAEVARLKEELEKARNEIDRLKSEMDALRKALDKCTEEMERLKIENNDLKSENQALKTEMQGIKGEGDGLAVEVRNLKDENAALRNEKEKLSKQLSDCKAENERLKSEKDQLEAENKKLRDDLNSCSKENAQLKDELEKLRGQLQALNEELNKVKDQLAQAEEKIRSLESLVARLQNELAALKKEADDLRAKADREMGENSKMRNDLKMLEDQLQDLNNKLDNARAENEALKKENENLKAKMADMKDELATLRAECESLKSEIADLKKLIEDLKAQIARLEADVDHWKMENCKLQMEVDKLKSQLEKALKDLNECQALKKALEEDLARLQKEKAELDKKLADLTAQIEQQKKALEAEKLAKGKDAAEIEALRNELDALKKELEKLRAENNKYKNEINDLERQLSALQGDLDACREEVAALRDANMTLKSELESLKNLKNEYDKLKGELDALKVAYENLLQDRKNLEDEVGKLRGEGDGQRAEIDRLQKDLAAERAAAKKLQSDLQNCQAENDRVQAQLNGLRNELDKMKAENDRLKNELDELKKRLSEAEAKVKTLEKDLADLLAEKQDLLSELYRLRDELNNLKAELEKVRAAKDAALNELNDLKKELADLRALLEKARADNEALQSENERLKAELEPLRKQLNALKNENANLKDENANLKKELENLKNEIAKLEAELAALKDKLKMAEKRLNDMKKENDDLRKKIADLEETAKELDALRKKLEDAENELNRLRPELNKLKAENAKLKDDLDKANKQVKQLTADLDKLKKDYNSLTAELAKARDERDRLKEGDAALEEELAKLKKENDKLKTQLFDCQAQNEQLRKELEDLKNKNAELKAALAAKESVPKETEVEKDLLDECEDFVKANKLLGEKFEKHNEGVHRVRDYITYLEGKGDEPKMADQVDEEPEIDPELKKDI
ADLLNKSEMLSKNIYEAEQEIQNLADLLQKDTGEEPIFDPDSWLSSLTLSQLADLHDKICMLTSDMVQHDTKAVVCDQARGEGNPLSSDYDILNRRIAALQKQIAEKQMEAGWKLQELKQALRAEQANLIQLSDQMNLERKRNLNIQYDIDDSLQ
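
The protein sequence corresponds to the coding sequence read codon by codon. A minimal sketch of that translation follows, assuming the standard genetic code (the record does not state which table was used). The function name `translate` is hypothetical. Lowercase bases, which appear in parts of the coding sequence, are upper-cased before lookup.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (assumption: the record uses the standard code).
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Unknown codons (for example, ones containing N) become 'X'.
    Trailing bases that do not form a full codon are ignored.
    """
    sequence = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE.get(sequence[i:i + 3], "X")
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 24 bases and last 15 bases of the coding sequence above.
start_peptide = translate("ATGGAGGTTTGCCGATGCGGATGC")
end_peptide = translate("GATTCACTTCAGTAA")
print(start_peptide, end_peptide)
```

The two fragments translate to the first eight residues and the last four residues of the protein sequence listed above.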

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-