Basic Information

Gene Symbol
-
Assembly
GCA_963989295.1
Location
OZ022401.1:10287750-10293857[+]
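
The Location value packs the sequence accession, start, end, and strand into one string. A minimal sketch of splitting it into fields follows; the helper name `parse_location` and the pattern are assumptions inferred from this single example, not a documented format, and the span calculation assumes 1-based inclusive coordinates.

```python
import re
from typing import NamedTuple


class GenomicLocation(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Hypothetical pattern inferred from the one Location string in this record.
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> GenomicLocation:
    """Split 'SEQID:START-END[STRAND]' into typed fields."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return GenomicLocation(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("OZ022401.1:10287750-10293857[+]")
# Assumes 1-based inclusive coordinates; the record does not state the convention.
span = loc.end - loc.start + 1
```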

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins form homo- or heterodimers. Each monomer contains a highly basic DNA-binding region adjacent to an α-helical region, and the helices of the two monomers fold together as a coiled coil.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 21 0.44 4.9e+02 1.5 0.2 37 56 292 311 289 316 0.74
2 21 0.00092 1 10.1 1.8 28 61 600 633 594 634 0.86
3 21 0.00081 0.9 10.3 3.5 25 61 646 682 643 683 0.89
4 21 0.00015 0.17 12.7 4.7 25 62 695 732 694 737 0.88
5 21 0.00071 0.79 10.5 6.7 25 62 751 788 750 790 0.91
6 21 0.00055 0.61 10.9 6.6 25 64 807 846 806 847 0.90
7 21 0.0024 2.7 8.8 5.6 25 61 828 864 827 865 0.85
8 21 0.00036 0.4 11.4 5.2 26 62 878 914 876 920 0.77
9 21 0.00016 0.18 12.6 5.6 26 63 934 971 932 976 0.80
10 21 0.02 22 5.9 1.8 32 59 989 1016 984 1019 0.63
11 21 0.026 29 5.5 2.3 32 60 1031 1059 1025 1061 0.64
12 21 0.028 32 5.4 1.4 32 59 1073 1100 1067 1103 0.64
13 21 0.002 2.2 9.0 3.2 25 60 1115 1150 1113 1153 0.88
14 21 0.0058 6.4 7.6 5.3 25 57 1213 1245 1211 1246 0.65
15 21 0.00028 0.31 11.8 3.9 25 58 1234 1267 1229 1272 0.80
16 21 0.23 2.6e+02 2.4 0.2 32 57 1297 1322 1287 1324 0.76
17 21 0.013 14 6.5 4.3 31 60 1342 1371 1336 1374 0.55
18 21 8.3 9.2e+03 -2.5 3.6 26 57 1386 1417 1384 1442 0.71
19 21 0.021 23 5.8 1.6 32 63 1455 1486 1450 1496 0.81
20 21 0.094 1e+02 3.7 4.1 25 51 1518 1544 1510 1562 0.49
21 21 0.021 23 5.8 0.3 32 61 1581 1610 1578 1616 0.87
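
The domain table above can be read programmatically by splitting each row on whitespace. The sketch below parses four rows copied from the table and keeps hits under an independent E-value cutoff. The column names, the `parse_row` helper, and the cutoff of 1.0 are illustrative assumptions, not values used by the database.

```python
# Column names are shorthand for the 13 fields of each row in this record.
COLUMNS = (
    "index", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
)
INT_FIELDS = {
    "index", "of", "hmm_from", "hmm_to",
    "ali_from", "ali_to", "env_from", "env_to",
}


def parse_row(line: str) -> dict:
    """Convert one whitespace-separated domain row into a typed dict."""
    values = line.split()
    if len(values) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} fields, got {len(values)}")
    return {
        name: int(value) if name in INT_FIELDS else float(value)
        for name, value in zip(COLUMNS, values)
    }


# Rows 1, 4, 9 and 18, copied verbatim from the table.
rows = [
    "1 21 0.44 4.9e+02 1.5 0.2 37 56 292 311 289 316 0.74",
    "4 21 0.00015 0.17 12.7 4.7 25 62 695 732 694 737 0.88",
    "9 21 0.00016 0.18 12.6 5.6 26 63 934 971 932 976 0.80",
    "18 21 8.3 9.2e+03 -2.5 3.6 26 57 1386 1417 1384 1442 0.71",
]
hits = [parse_row(row) for row in rows]

I_EVALUE_CUTOFF = 1.0  # hypothetical threshold, chosen only for illustration
kept = [hit for hit in hits if hit["i_evalue"] < I_EVALUE_CUTOFF]
best = max(hits, key=lambda hit: hit["score"])
best_ali_length = best["ali_to"] - best["ali_from"] + 1
```

With a stricter or looser cutoff the set of retained hits changes, so the threshold should be chosen to suit the analysis rather than taken from this sketch.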

Sequence Information

Coding Sequence
atggCTCCCGCAGCGGTGGCAATATTCGTAATCTCCTTATCCGTGGGAGCGTTTTCAGCTCCAAATCGGTGTGCCAGTTGCCAAACATGGGAATCGCATGGCGGATATCAAGCGGAAAGTGGGTCAGGCGGATGGATAAATCAAGACAATTTATCCCAGAGATCATCGAACTTGGAAGATTTAACACAACAAGCAGAGACCGAGCTGCACAGATCTCCCGGTCAATTAGCTTTTGATAACACGAGATCTGGAAACTGGACAGATGTAAACCATTACAGAACTGCCGACGGCCATGGAAGGGTTTACGAAGAGCAAGGTCAACGCGTAGATGGGCCAACGCGGATAAGATTCTACAGAAAGAATTTCACTTCGAGTTACAACAACGGAAACTCGGGTGCTTTGAGAGAAACTGATTTAGGAGGATTCAACTCTTTGGGTAGAGAAGATGCAGGCCAGTTAATAAGCCACGGAACTGCTGATCATTCACTGAATTCAGCTTATGATCAATCCGTCACACGAGGAAATTCTTATACCTCGCAAGGCTCCCAATATTCCAGGGACAGAGCTAACGGTCGCATCGATGGATATTATCAAAATCGCGACAATAGCGGTCGGTACAGAGAAGATAGTTCGGCATTAAATGGCCAAGCGACGCAGCAAGGATCAACCTCTTGGGATCAAGGAAGAGGAGGAAATTGGACGACGGCTAATACTTATAGAACCGACGGAGAAAATGGACAAGTTTACGAAGAACGAGGGCAAGTTGTGACAGGGCCAAGGCAAGTTCGTTTTTACAAAAAGAATTATACTTCAAGTTACACCTCAGGTGGAGCTATTCCTACTTTTGATTTGGGGGTAGACGGAAGGAGGAGCTTTGAAAGCGAGGTACAGCAGCTGCAAAGAGAGCTTGAAAGTGCGACAAGGGACAGTCATCGAATTAATCAAGATTCTATAAATGGTGGTTACACTCAAGAGACTTCAGGACATTATAGACAGCCTGGTTCGACTGTGGGTCAAACAACTTATAGATATGTAGTGAGGCCCGATAACTACAGACCGTTAGATCAGACTACTTCAGGGATAAATTCTCACGGAACGTACCAGCATACAACTaactttgaaaatcaacaTGTGTCTCAAACTGGCGGTAGTTCTGTTGACCAGACAGAACAAGATACCAGAAGAAATATCAATTCGGGATATTCAACAAGCCATTTTGGCAATAGTGACGGATACGGTACTCAGAGACCACAAAGAATATACACAATTGTGGAACAGTCTGGTAGAACAACTCTGCCTGTATACTCTGATTCACAAACTCGGCAATCTTATTATGATCGTCTTTCGCAGTCACAGCAGGCAGAATACAGACGTATTTATGGATCCGAAGGCTTGAGTCAGAGAGAAGTATCTATCGGTAATGTCAACAACCAACAACAATCACATTACACAGGTTCTCATGGTACCGCGAGACCATCAGGCCAAATGCAACATTACGAACAGTTTCaaacttcttcttcctcctcggTACGTCACCCTGATATTGATGTAAGAACTGCTCAATCTGGTATGGATCAGGAAATCCAGAGACTACATAATGCACATGCTAACTTCGGTCAAGCCAGCAGTAATGGTTATGGTAGCAGCACTTATCGGGTTCAGAATGGACAACTTATGACACAGGGACTTGACTTGGGACAAGTCGCAGATACCGTAGATTGTGCAGATGGTACAAGTGGACATTCCACATTCGAACAAACCCAGTATCACAGAGTGTACAGAGGAGTTCACCAGCCTGAAGATCTTACTCAACAGACTGAGGATCTTACTCAGCAAACTGAGGATCTTACACAGCAAACAGAAGATCTAACACAACAAACACAGCATTTTGGGCAGCAAAGTCCTTGGAGAACTGGTAAATTGGAGGCTGAAAGCCAACGGGTCCATGATCTCACTCAACAGACTGAGGATCTTACTCAACAAACTGAGGATCTCACTCA
GCAAACAGAAGATCTAACGCAACAGACTCAACATTTTGGACAGCAAACTCCTTGGAGAACTGGTAAATTAGAGGTTGGAACCCAGCGTGTCCATCATCTCACTCAACAGACTGAGGATCTCACTCAACAAACTGAGGACCTCACTCAACAGACTCAGGATCTTACTCAGCAAACAGAAGATCTAACGCAACAGACTCAACATTTTGGACAGCAAACTCCTTGGAGAACTGGTAAATTAGAGGTTGGAACCCAGCGTGTCCATCATCTCACTCAACAGACTGAGGATCTCACTCAACAAACTGAGGACCTCACTCAACAGACTCAGGATCTTACTCAGCAAACAGAAGATCTAACGCAACAGACTCAACATTTTGGACTGCAAGCTCCTTGGAGACCTGGTAAATTAGAGGTTGAAAGCCAGCGGGTTCATCATCTCACTCAACAGACTGAGGATCTCACCCAACAAACCGAAGACCTCACTCAACAGACTGAGGATCTTACTCAGCAAACAGAAGATCTAACGCAACAGACTCAGGATCTTACTCAGCAAACAGAAGATCTAACGCAACAGACTCAACATTTTGGACAGCAAGCTCCTTGGAGACCTGGGAAATTGGAAGTTGAAAGCCAGCGGGTGCATGATCTCACTCAACAGACTGAGGATCTTACTCAGCAAACAGAAGATCTAACGCAACAGACTCAGGATCTTACTCAGCAAACAGAAGATCTAACGCAACAGACTCAACATTTTGGACAGCAAGCTCCTTGGAGACCTGGGAAATTGGAAGTTGAAAGCCAGCGGGTCCATGATCTCACTCAACAGACTGAGGATCTTACTCAGCAAACAGAAGATCTAACGCAACAGACTCAGGATCTTACTCAGCAAACAGAAGATCTAAGGCAACAGACTCAACATTTTGGACAGCAAGCTCCTTGGAGACCTGGGAAATTGGAAGTTGAAAGCCAGCGGGTCCATGATCTCACTCAACAGACTGAGGATCTTACTCAGCAAACAGAAGATCTAACGCAACAGACTCAGCATTTTGGACAGCAATCTCCTTGGAGATCTGGTAAATTGGAGACTGAAAGCCAACGGGTCCATGATCTCACTCAACAGACTGAGGATCTTACTCAGCAAACAGAAGATCTAACGCAACAGACTCAGCATTTTGGACAGCAATCTCCTTGGAGATCTGGTAAATTGGAGACTGAAAGCCAACGGGTCCATGATCTCACTCAACAGACTGAGGATCTTACTCAGCAAACAGAAGATCTAACGCAACATACTCAGCATTTTGGACAGCAATCTCCTTGGAGATTTGGTAAATTGGAGACTGAAAGCCAACGGGTCCATGATCTCACTCAACAGACTGAGGATCTTACGCAACAAACCGAGGATCTTACTCAGCAGACTCAGCATTTTACTCAACAAATTGAGGGTCTTGCACAGCGAGATCAAGATTTTGGACAGCAATCTTCTTGGAGACCTGGTAAGTTAGAAGCTCAAACTCAGCGGGTAGAAGACCTGACGTATCAAGTTGGTCAAACTGATGATTACTACTTGCAAAAACGAGATGGAGATCGTGATTTTGGACAAGAATCTTCTCGGGGACCGAGTAGATTGGAAGTTGGAAATCAGCAGGTGGAGGATCTCACTCAACAAACGGAGGATCTCACTCAACAAACGGAGGATCTCACTCAACGGACAGAGGATCTCACTCAACAAACAGAGGATCTCACTCAGCAGACAGAAGATCTTACTCAACAAACACAACATCTAACACAGCATGTTGGTCAAACTGATGATTATTACTTGCAAAAACCAGTTGGAGATCATGACTTTGGTCAAGAATCTTACAGAAAGCCAGGTAAATTGGAAGTTGGAAGTCAACAAATAGCAGATCTTACTCAACAAACGGAAGATCTCACTCAACAGACGGAAGATATAACTCAACATGTTGGTCAAGAATCTTCCTGGAAACCTGGCAAATTGGAAGTTG
GAGGTCAACAGGTAGAAGATCGTACTCAACAAACGGAAGACCTTACTCAGCAAATAGAAGATCTGACTCAACAATCTGAGGATCTGACCCAACAAACAGAACACCTAGGCCAGCAATCTTCGTGGCAACCAGGACAACTAGAATTCGGAAGCCAGCGAGTCGAAGATCTGACTCAACAAACAGAAAATCCTGGCCAGCAAACGGAAGATCTTGGCCAGCAGACACAGGATCTTGGGCAGCTAACAGAAGATCATGGCCAACAGACACAGGATCTTGGGCAGCTAACAGAAGATCTTGGCCAACAGACAGAGGATCTAAGCCAGCAATCTTCGTGGCAACCAGGACAACTAGAAATCGCAAGTCAGCGAGTCGAAGATCTGACTCAGCAAACAGAAAATCTTGGCCAGCGAACAGAAGATCTGGGCCAGCAAACACAGGATCTTGGGCAGCTAACAGAAGATCTTGGCCAACAGATAGAGGATCTAGGCCAGCAATCTTCGTGGCAACCAGGACAACTAGAATTCGGAAGCCAGCGAGTTGAAGATCTGACTCAACAAACAGAAAATCTTGGCCAGCAAACAGAAGATCTGGGCCAGCAGACACAGGATCTTGGGCAGCTAACAGAAGATCTTGGCCAACAGACACAGGATCTTGGGCAGCTAACAGAAGATCTTGGCCAACAGACAGAGGATCTAGGCCAGCAATCTTCGTGGCAACCAGGACAACTAGAATTCGGAAGCCAGCGAGTCGAAGATTTGACTCAACAAACAGAAAATCTTGGCCAGCAAACAGAAGATCTTGGCCAGCAGACACAGGATCTGGGGCAGCTAACAGAAGACTTTGGCCAACAAACAGGAGGTCTCCCTCAACAAACCGAAGACATTTGGCAACTAAGTGGTAAAATTGACCATTATATAGGACAAACAACAGGAAACTCAGCATTTGGACAAGAGTCTTCTTGGAAGTCCCCCGAGCAGGGAATAATAAGTCAACGAGCTGAGAACCTCGATCAAGGAAACCAATTTGGCACACACCAAACACCTATCTATCCTGGACAAGTTTCGACAATAGCACCAAAGCCCGTCCCGAGGCCAAAACGTCCAAGACCTGCAAATTTCTATCCCACTCAACAAATTAATGTGGAACTCGAAGAATCAACTACCGACAGTCATTCGGCGATAATAATTGATCAGCAAGATCATAAAAAATGGGAAGTAACTGGCAATACAGATCCAAGTACAACAATTCCCCACACTCCACACAGAGGTGATCAAGGTATCGACCCGAATTCTGACGAACCAGAACAAACAGATGTTCAGATTGAAACGATAACGCACAAAGTACTTGAACAACAGAGCGGATATGTGCCAGTGCATCCAAGTTCATTTAATCATGAAACTACAACTGAGGAACAAGTCAGAAAAGTTCAACCCACTAAAACTAAGACAGGACGCCGAACTTATCAAGGTGGACAACGCTTGCCCAAAGTTAACGATTGGTATCCTACTCGTGACGTAGTCTACAGTCAACAACCAACCACCAGATTAGTTGATGGCCGTCCTTCTGGCCTAGGAAGTCCAGTCAACTCAAACCTGGAGCAATCGACAAGTAGCGGACAATTTACAGAAGACATTCAACAACATTGGACTAACACTGAAGAGACCCAACAGGTTGAATCCAGTCAGACGTTCCAAAGAATTCAACACGTTGGCTCTGGTATAGAAAACAGACAACAGAGCGGTGTTCAACCGAACGAAGaagttcgaattcaacctCGAATCTTGGAGGCATGTGGAGCAAACGGACCATACGATATAAATAATTACGATCTACTCCCGACTCCGAAACCATTTCCTGATACAGCTCTAACACCGCCTGATGGCCGCGATGCTTGGGATATTCGCGTGGTCGATCGTAGAGTTACGACCACGACAGAGGCCTCTGTTCCTACATCAACGATACCGATAACAACAACGACAacaacgacaacaacaacaacg
ccAATTCCTCCTCCGACTCCAGCTCCTTCATTTTGGCGTAGAATTGGAAATTCTGTCACTAATACTTATGACAAGGCTAAAGAAAAAGCCAAGGAATTATTTGGCTAG
Protein Sequence
MAPAAVAIFVISLSVGAFSAPNRCASCQTWESHGGYQAESGSGGWINQDNLSQRSSNLEDLTQQAETELHRSPGQLAFDNTRSGNWTDVNHYRTADGHGRVYEEQGQRVDGPTRIRFYRKNFTSSYNNGNSGALRETDLGGFNSLGREDAGQLISHGTADHSLNSAYDQSVTRGNSYTSQGSQYSRDRANGRIDGYYQNRDNSGRYREDSSALNGQATQQGSTSWDQGRGGNWTTANTYRTDGENGQVYEERGQVVTGPRQVRFYKKNYTSSYTSGGAIPTFDLGVDGRRSFESEVQQLQRELESATRDSHRINQDSINGGYTQETSGHYRQPGSTVGQTTYRYVVRPDNYRPLDQTTSGINSHGTYQHTTNFENQHVSQTGGSSVDQTEQDTRRNINSGYSTSHFGNSDGYGTQRPQRIYTIVEQSGRTTLPVYSDSQTRQSYYDRLSQSQQAEYRRIYGSEGLSQREVSIGNVNNQQQSHYTGSHGTARPSGQMQHYEQFQTSSSSSVRHPDIDVRTAQSGMDQEIQRLHNAHANFGQASSNGYGSSTYRVQNGQLMTQGLDLGQVADTVDCADGTSGHSTFEQTQYHRVYRGVHQPEDLTQQTEDLTQQTEDLTQQTEDLTQQTQHFGQQSPWRTGKLEAESQRVHDLTQQTEDLTQQTEDLTQQTEDLTQQTQHFGQQTPWRTGKLEVGTQRVHHLTQQTEDLTQQTEDLTQQTQDLTQQTEDLTQQTQHFGQQTPWRTGKLEVGTQRVHHLTQQTEDLTQQTEDLTQQTQDLTQQTEDLTQQTQHFGLQAPWRPGKLEVESQRVHHLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTQDLTQQTEDLTQQTQHFGQQAPWRPGKLEVESQRVHDLTQQTEDLTQQTEDLTQQTQDLTQQTEDLTQQTQHFGQQAPWRPGKLEVESQRVHDLTQQTEDLTQQTEDLTQQTQDLTQQTEDLRQQTQHFGQQAPWRPGKLEVESQRVHDLTQQTEDLTQQTEDLTQQTQHFGQQSPWRSGKLETESQRVHDLTQQTEDLTQQTEDLTQQTQHFGQQSPWRSGKLETESQRVHDLTQQTEDLTQQTEDLTQHTQHFGQQSPWRFGKLETESQRVHDLTQQTEDLTQQTEDLTQQTQHFTQQIEGLAQRDQDFGQQSSWRPGKLEAQTQRVEDLTYQVGQTDDYYLQKRDGDRDFGQESSRGPSRLEVGNQQVEDLTQQTEDLTQQTEDLTQRTEDLTQQTEDLTQQTEDLTQQTQHLTQHVGQTDDYYLQKPVGDHDFGQESYRKPGKLEVGSQQIADLTQQTEDLTQQTEDITQHVGQESSWKPGKLEVGGQQVEDRTQQTEDLTQQIEDLTQQSEDLTQQTEHLGQQSSWQPGQLEFGSQRVEDLTQQTENPGQQTEDLGQQTQDLGQLTEDHGQQTQDLGQLTEDLGQQTEDLSQQSSWQPGQLEIASQRVEDLTQQTENLGQRTEDLGQQTQDLGQLTEDLGQQIEDLGQQSSWQPGQLEFGSQRVEDLTQQTENLGQQTEDLGQQTQDLGQLTEDLGQQTQDLGQLTEDLGQQTEDLGQQSSWQPGQLEFGSQRVEDLTQQTENLGQQTEDLGQQTQDLGQLTEDFGQQTGGLPQQTEDIWQLSGKIDHYIGQTTGNSAFGQESSWKSPEQGIISQRAENLDQGNQFGTHQTPIYPGQVSTIAPKPVPRPKRPRPANFYPTQQINVELEESTTDSHSAIIIDQQDHKKWEVTGNTDPSTTIPHTPHRGDQGIDPNSDEPEQTDVQIETITHKVLEQQSGYVPVHPSSFNHETTTEEQVRKVQPTKTKTGRRTYQGGQRLPKVNDWYPTRDVVYSQQPTTRLVDGRPSGLGSPVNSNLEQSTSSGQFTEDIQQHWTNTEETQQVESSQTFQRIQHVGSGIENRQQSGVQPNEEVRIQPRILEACGANGPYDINNYDLLPTPKPFPDTALTPPDGRDAWDIRVVDRRVTTTTEASVPTSTIPITTTTTTTTTTT
PIPPPTPAPSFWRRIGNSVTNTYDKAKEKAKELFG
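
One way to check that the coding sequence and protein sequence agree is to translate the former with the standard genetic code. The sketch below is a minimal translator applied to the first and last codons of the coding sequence above. It assumes the standard code (NCBI translation table 1) applies and treats lowercase bases the same as uppercase; `translate` is a hypothetical helper, not part of any tool named in this record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    sequence = cds.upper()  # lowercase bases are treated like uppercase ones
    if len(sequence) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(sequence), 3):
        amino_acid = CODON_TABLE[sequence[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First seven codons and last six codons of the coding sequence in this record.
cds_start = "atggCTCCCGCAGCGGTGGCA"
cds_end = "AAGGAATTATTTGGCTAG"

protein_start = translate(cds_start)
protein_end = translate(cds_end)
```

Here `protein_start` and `protein_end` can be compared with the first and last residues of the protein sequence listed above; translating the full coding sequence works the same way once its lines are joined into one string.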

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-