Basic Information

Gene Symbol
hook
Assembly
GCA_963675205.1
Location
OY776080.1:30721866-30742790[+]
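
The Location value packs a sequence accession, a coordinate range, and a strand into one string. A minimal sketch of splitting it into fields follows; the `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the span length assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for the fields of a Location string."""
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings of the form "<accession>:<start>-<end>[<strand>]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a Location string into accession, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("OY776080.1:30721866-30742790[+]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc.seqid, loc.strand, span)  # OY776080.1 + 20925
```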

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 26 0.0014 3.3 9.6 0.2 22 55 182 215 177 220 0.86
2 26 1.4 3.3e+03 -0.1 1.6 31 60 273 305 261 312 0.62
3 26 2.1 4.9e+03 -0.6 0.5 40 58 326 344 307 351 0.58
4 26 0.00021 0.51 12.2 1.5 33 62 385 414 369 428 0.71
5 26 0.0074 18 7.2 0.0 33 60 438 465 436 469 0.86
6 26 0.0074 18 7.2 0.0 33 60 477 504 475 508 0.86
7 26 0.0074 18 7.2 0.0 33 60 516 543 514 547 0.86
8 26 0.0086 20 7.0 0.1 33 61 555 583 553 586 0.88
9 26 0.0053 13 7.7 0.1 32 61 593 622 592 625 0.89
10 26 0.0056 13 7.6 0.1 32 60 632 660 631 664 0.89
11 26 0.0065 15 7.4 0.1 32 60 671 699 670 703 0.89
12 26 0.0074 18 7.2 0.0 33 60 711 738 709 742 0.86
13 26 0.0074 18 7.2 0.0 33 60 750 777 748 781 0.86
14 26 0.0053 13 7.7 0.1 32 61 788 817 787 820 0.89
15 26 0.0053 13 7.7 0.1 32 61 827 856 826 859 0.89
16 26 0.0053 13 7.7 0.1 32 61 866 895 865 898 0.89
17 26 0.0056 13 7.6 0.1 32 60 905 933 904 937 0.89
18 26 0.0074 18 7.2 0.0 33 60 945 972 943 976 0.86
19 26 0.0074 18 7.2 0.0 33 60 984 1011 982 1015 0.86
20 26 0.0074 18 7.2 0.0 33 60 1023 1050 1021 1054 0.86
21 26 0.0053 13 7.7 0.1 32 61 1061 1090 1060 1093 0.89
22 26 0.0056 13 7.6 0.1 32 60 1100 1128 1099 1132 0.89
23 26 0.0074 18 7.2 0.0 33 60 1140 1167 1138 1171 0.86
24 26 3.3 7.9e+03 -1.3 0.5 30 47 1192 1209 1189 1217 0.74
25 26 1.3 3.2e+03 -0.0 0.9 30 58 1244 1272 1223 1279 0.74
26 26 6.7 1.6e+04 -2.2 0.9 22 47 1267 1292 1266 1310 0.56
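
Each row of the hmmscan output above has 13 whitespace-separated fields: the domain index and total, the conditional and independent E-values, the bit score and bias, the start and end coordinates on the HMM, alignment, and envelope, and the alignment accuracy. A minimal sketch of reading such rows and ranking them follows; `DomainHit` and `parse_hit` are hypothetical names, and the i-Evalue cutoff of 1.0 is an arbitrary illustration, not a threshold used by this record.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """One per-domain row of the hmmscan table (hypothetical container)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# Three rows copied from the table above (domains 1, 2, and 4).
rows = [
    "1 26 0.0014 3.3 9.6 0.2 22 55 182 215 177 220 0.86",
    "2 26 1.4 3.3e+03 -0.1 1.6 31 60 273 305 261 312 0.62",
    "4 26 0.00021 0.51 12.2 1.5 33 62 385 414 369 428 0.71",
]
hits = [parse_hit(r) for r in rows]

# Highest-scoring domain among the sampled rows.
best = max(hits, key=lambda h: h.score)

# Assumption: 1.0 is an illustrative i-Evalue cutoff only.
passing = [h.index for h in hits if h.i_evalue < 1.0]
print(best.index, passing)  # 4 [4]
```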

Sequence Information

Coding Sequence
ATGGAGGCAAATGGCGCGGTTTTATGTGACAATTTGATAAAGTGGTTGCAGACATTAGATCTTAAGGCAAAACATGCAAATCCATCAGAACTGTCAGACGGTGTGGCAATAGCCGAAGCTCTCACAGAAATAGCGCCAGAATATTTTACACCAACATggaattctaaaataaagacagATGTAGGACATAACTGGCGATTAAAAGTTAGCAACCTGAAGAAAATACTGGAAGGTGTTGTGGATTATCACCAGGACATTCTAAATTTAAGTCTACAAGAGTTTTCAAGGCCAGATGTAGTAAATATAGCCGAAACTGCCGACCCATCTGACTTAGGGAGATTGCTTCAACTTGTTTTGAGTTGTGCAGTCAACTGTGTCAAAAAGGAAGAATATATAACAAGAATAATGGAAATGGAGTTGTCATTCCAGAGGTCTATAATGCAAGCTATACAGgaaCTGCAAACCTTGACGCTGGGCGTGAACCGTGGCAGTGTACACTTGGACACGCCGGCGGCGGAGCAGTCCGATGCGGACATGAGAGAGGCGCTGGCGCAGAGGTGCCATGAACTGGACACACAGGTGAAGATCCTGCAAGAGGAGAAGATGACGTTGCTGGGCGAGGTGGCGCGGCTGTCTGCGCGGGTGGCGGCGGCGCCTGACGCGGCGGAGCTGGACGAGGCGGGCGCGTCGCTGGGCCCGGCGCACGCGGGCACGCTGCGCTACAGCACCATGCGCGCGCAGCTCGACGCGCTCAAGGACGAGCTCGACAAGGTGGAACTGCAGCGGGATGATCAGCACGCGCGCGCCGATGCCGCTGAGCGAGAACTTGCGCTATACAAGCTGAGGAATGAAGAGCTGCagATGGCAGCATCAGAAAACGTGGCGCTAAAAGATGAATTAGACGCACTACGTGAGACGGCGGCAAAGGCGGCTGCGCTGGAGGCCACCGTTGCCTCATACAAGAAGCGTATGGAGGAGCACGTCGACTTGCGACGGCAGGTGAAGCTCCTGGAGCGCGCGAACACGGAGCACGTGCAGCGCGCCATCGAGCACGAGCAGGCCGCCGCCAAGGCCAACGCCGTGCGCGCGCAGCTCGACATATACAAGAAACAGGTAATGGATTTAAACGAGAAGCTAGACGCGGAGATCACGAAGGCGGACAAGTTGGAGATCGAGAACAAGAAGCTCAGCAGCCGCGCAGCATCTTTACAGCGTGAGCGCGACGCTCTGCTACACGAGCGCGACACGCTGCGCGACACCGTCGACGAATTGCGCTGCTCCACCATCACTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGTGCGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAATTGCGCTGCTCCACCATCACTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGTGTGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAATTGCGCTGCTCCACCATCACTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGTGTGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAGCTGCGCTGCTCCACCATCACTGCAGGTATGCCATCCAACGTCCGCGCGGTGTCGCTGCAGCGCGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAGCTGCGCTGCTCCACCATCGCTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGCGCGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAGCTGCGCTGCTCCACCATCGCTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGCGCGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAGCTGCGCTGCTCCACCATCACTGCAGGTAT
GCCATCCAACGCCCGCGCGGTGTCGCTGCAGCGCGAGCGCGACACGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAGCTGCGCTGCTCCACCATCACTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGTGTGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAGCTGCGCTGCTCCACCATCACTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGTGTGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAGCTGCGCTGCTCCACCATCACTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGCGCGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAGCTGCGCTGCTCCACCATCGCTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGCGCGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAGCTGCGCTGCTCCACCATCGCTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGCGCGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAGCTGCGCTGCTCCACCATCGCTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGCGCGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAGCTGCGCTGCTCCACCATCACTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGTGTGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAGCTGCGCTGCTCCACCATCACTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGTGTGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAGCTGCGCTGCTCCACCATCACTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGTGTGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAGCTGCGCTGCTCCACCATCACTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGCGCGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAGCTGCGCTGCTCCACCATCGCTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGCGCGAGCGCGACGCTCTGCTACACGAGCGCGACACGCTGCGCGACACCGTCGACGAATTGCGCTGCTCCACCATCACTGCAGGTATGCCATCCAACGCCCGCGCGGTGTCGCTGCAGTGCGAGCGCGACGCGCTGCTGCACGAGCGCGACACGCTGCGCGACACCGTCGACGAATTGCGCTGCTCCACCATCACTGCAGGCGCGAATGAAGGCAACGTGTCGCGGGAGCTGAGCGAGTTGCCGTCGCCCAACGACATGAAGGAAAAACTCATCCGATTGGAGCACGAGAACAAACTGCTGCGCCAGAACCAGGGGGCGCAGGCCGACCAGGCTTCCGTGCAGGCACTGCTAGAAGACTACGTGACGCGGCTAGAGAAACAGCGCGCTATCAACCGGGAAGCGAATCAGCGCATCATGCAGCTGGAGGCCACGCTCGAAACGCCCAACCCCCGACTCGCGTCGGCCATCGAAGACAGTCAGAGGAAATCTCTGCAAGTGGAAGAGTTGCAAGCGGCGCTGGCGGAGGAGCGGCGCCGCGCCACCAAGCTGCAGGAGGCGCTGGCGGCCAGGGACCACGACCTGCTGGCCACGGAGGACAAGTACAAGAAGTGCCTCGAGAAGGCCAAGGACGTCATCAAGTCGCTCGACCCCAGGGCCTCGGCTCAACAGTCGCTATCAGATA
TAACCCTGGGCTACTCCCGAGCAGCGACGGCTGGCGCGGGTCCCTCGCGGCCAGACGCCGCACCCGCGCCAAACAACCGACACGAGTGa
Protein Sequence
MEANGAVLCDNLIKWLQTLDLKAKHANPSELSDGVAIAEALTEIAPEYFTPTWNSKIKTDVGHNWRLKVSNLKKILEGVVDYHQDILNLSLQEFSRPDVVNIAETADPSDLGRLLQLVLSCAVNCVKKEEYITRIMEMELSFQRSIMQAIQELQTLTLGVNRGSVHLDTPAAEQSDADMREALAQRCHELDTQVKILQEEKMTLLGEVARLSARVAAAPDAAELDEAGASLGPAHAGTLRYSTMRAQLDALKDELDKVELQRDDQHARADAAERELALYKLRNEELQMAASENVALKDELDALRETAAKAAALEATVASYKKRMEEHVDLRRQVKLLERANTEHVQRAIEHEQAAAKANAVRAQLDIYKKQVMDLNEKLDAEITKADKLEIENKKLSSRAASLQRERDALLHERDTLRDTVDELRCSTITAGMPSNARAVSLQCERDALLHERDTLRDTVDELRCSTITAGMPSNARAVSLQCERDALLHERDTLRDTVDELRCSTITAGMPSNARAVSLQCERDALLHERDTLRDTVDELRCSTITAGMPSNVRAVSLQRERDALLHERDTLRDTVDELRCSTIAAGMPSNARAVSLQRERDALLHERDTLRDTVDELRCSTIAAGMPSNARAVSLQRERDALLHERDTLRDTVDELRCSTITAGMPSNARAVSLQRERDTLLHERDTLRDTVDELRCSTITAGMPSNARAVSLQCERDALLHERDTLRDTVDELRCSTITAGMPSNARAVSLQCERDALLHERDTLRDTVDELRCSTITAGMPSNARAVSLQRERDALLHERDTLRDTVDELRCSTIAAGMPSNARAVSLQRERDALLHERDTLRDTVDELRCSTIAAGMPSNARAVSLQRERDALLHERDTLRDTVDELRCSTIAAGMPSNARAVSLQRERDALLHERDTLRDTVDELRCSTITAGMPSNARAVSLQCERDALLHERDTLRDTVDELRCSTITAGMPSNARAVSLQCERDALLHERDTLRDTVDELRCSTITAGMPSNARAVSLQCERDALLHERDTLRDTVDELRCSTITAGMPSNARAVSLQRERDALLHERDTLRDTVDELRCSTIAAGMPSNARAVSLQRERDALLHERDTLRDTVDELRCSTITAGMPSNARAVSLQCERDALLHERDTLRDTVDELRCSTITAGANEGNVSRELSELPSPNDMKEKLIRLEHENKLLRQNQGAQADQASVQALLEDYVTRLEKQRAINREANQRIMQLEATLETPNPRLASAIEDSQRKSLQVEELQAALAEERRRATKLQEALAARDHDLLATEDKYKKCLEKAKDVIKSLDPRASAQQSLSDITLGYSRAATAGAGPSRPDAAPAPNNRHE
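
The protein sequence corresponds to a codon-by-codon translation of the coding sequence. A minimal sketch for checking this on short fragments follows, assuming the standard genetic code (the record does not name a translation table). The `translate` helper is a hypothetical name; it uppercases the input first because the coding sequence mixes upper- and lowercase bases.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    # Codons containing anything other than A/C/G/T become 'X'.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 18 bases and last 18 bases of the coding sequence in this record.
print(translate("ATGGAGGCAAATGGCGCG"))  # MEANGA
print(translate("AACAACCGACACGAGTGa"))  # NNRHE*
```

The two fragments reproduce the first six and last five residues of the protein sequence, with the final TGA read as the stop codon.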

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-