Basic Information

Gene Symbol
-
Assembly
GCA_030762935.1
Location
CM060880.1:18209397-18220187[-]
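
The Location field appears to follow a `sequence:start-end[strand]` layout. The sketch below splits it into its parts, assuming that layout and assuming the coordinates are 1-based and inclusive (the record does not state this). The names `Location` and `parse_location` are hypothetical and introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # sequence accession, e.g. a chromosome record
    start: int   # leftmost coordinate as written in the record
    end: int     # rightmost coordinate as written in the record
    strand: str  # "+" or "-"


# Assumed layout: <seqid>:<start>-<end>[<strand>]
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Parse a location string; raise ValueError if the layout differs."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("CM060880.1:18209397-18220187[-]")
# Genomic span in bases, under the 1-based inclusive assumption.
span = loc.end - loc.start + 1
print(loc.seqid, loc.strand, span)
```

The span computed this way covers the whole locus, which may include introns, so it need not equal the length of the coding sequence.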

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to regions of α-helix that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 42 0.021 1e+02 5.7 0.1 28 62 70 104 67 107 0.78
2 42 0.0034 16 8.3 0.2 25 60 88 123 79 128 0.71
3 42 0.0078 37 7.1 0.4 26 60 124 158 115 163 0.62
4 42 0.007 33 7.3 0.4 25 61 144 180 136 184 0.78
5 42 0.018 84 6.0 0.2 30 61 170 201 163 212 0.54
6 42 0.013 61 6.4 0.3 26 51 201 226 185 240 0.48
7 42 0.0056 27 7.6 0.6 24 58 234 268 221 275 0.60
8 42 0.016 74 6.1 0.7 25 61 256 292 250 303 0.74
9 42 0.02 95 5.8 0.2 25 61 340 376 337 380 0.82
10 42 0.032 1.5e+02 5.2 0.1 27 62 379 414 375 417 0.84
11 42 0.0087 42 6.9 0.2 26 59 399 432 389 438 0.56
12 42 0.0059 28 7.5 0.2 25 62 447 484 445 487 0.88
13 42 0.13 6.1e+02 3.2 0.1 24 61 488 525 487 529 0.84
14 42 0.017 80 6.0 0.6 24 61 509 546 506 550 0.70
15 42 0.0048 23 7.8 1.6 25 51 538 564 521 578 0.49
16 42 0.012 56 6.5 0.2 31 62 551 582 545 592 0.54
17 42 0.0086 41 7.0 0.1 25 61 594 630 586 634 0.66
18 42 0.0039 19 8.1 0.2 27 59 631 663 622 676 0.64
19 42 0.033 1.6e+02 5.1 0.1 26 60 672 706 663 711 0.61
20 42 0.033 1.6e+02 5.1 0.1 26 59 700 733 691 739 0.60
21 42 0.044 2.1e+02 4.7 0.1 26 58 714 746 705 753 0.53
22 42 0.0034 16 8.3 0.2 24 62 754 792 750 795 0.86
23 42 0.0025 12 8.7 0.3 25 61 769 805 761 809 0.82
24 42 0.042 2e+02 4.8 0.1 32 59 804 831 790 844 0.51
25 42 0.026 1.3e+02 5.4 0.3 32 62 825 855 816 871 0.56
26 42 0.038 1.8e+02 4.9 0.1 30 61 875 906 868 924 0.61
27 42 0.055 2.7e+02 4.4 0.5 27 59 921 953 903 959 0.57
28 42 0.0098 47 6.8 0.2 26 61 969 1004 966 1008 0.75
29 42 0.0086 41 7.0 0.2 25 60 1017 1052 1008 1057 0.65
30 42 0.0045 21 7.9 0.1 28 60 1055 1087 1050 1099 0.67
31 42 0.028 1.4e+02 5.3 0.2 30 61 1099 1130 1092 1141 0.52
32 42 0.048 2.3e+02 4.6 0.1 26 58 1137 1169 1129 1176 0.55
33 42 0.0039 18 8.1 0.4 24 62 1177 1215 1176 1221 0.82
34 42 0.0028 13 8.5 0.2 25 61 1192 1228 1190 1232 0.89
35 42 0.066 3.1e+02 4.1 0.1 25 58 1220 1253 1213 1258 0.59
36 42 0.027 1.3e+02 5.4 0.4 31 62 1247 1278 1235 1294 0.58
37 42 0.051 2.4e+02 4.5 0.1 26 61 1294 1329 1290 1333 0.68
38 42 0.02 95 5.8 0.1 26 60 1322 1356 1314 1361 0.60
39 42 0.019 91 5.9 0.0 25 52 1356 1383 1354 1393 0.84
40 42 0.0028 13 8.5 0.4 25 64 1411 1450 1399 1451 0.89
41 42 0.028 1.4e+02 5.3 0.2 28 63 1449 1484 1445 1492 0.71
42 42 0.022 1.1e+02 5.6 0.0 19 62 1548 1592 1547 1602 0.71
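
Each row of the table above holds 13 whitespace-separated values. The sketch below reads rows into named fields and picks the hit with the lowest independent E-value; it is a minimal example, not the database's own parser. The names `DomainHit` and `parse_hit` are hypothetical, and only three rows copied from the table are used to keep the example short.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int       # "#": ordinal of this domain hit
    total: int       # "of": number of hits reported for the sequence
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# Positions of the integer-valued columns; all others are parsed as floats.
_INT_COLUMNS = {0, 1, 6, 7, 8, 9, 10, 11}


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated row of the domain table."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    values = [
        int(f) if i in _INT_COLUMNS else float(f)
        for i, f in enumerate(fields)
    ]
    return DomainHit(*values)


rows = [
    "1 42 0.021 1e+02 5.7 0.1 28 62 70 104 67 107 0.78",
    "23 42 0.0025 12 8.7 0.3 25 61 769 805 761 809 0.82",
    "42 42 0.022 1.1e+02 5.6 0.0 19 62 1548 1592 1547 1602 0.71",
]
hits = [parse_hit(row) for row in rows]

# Lower i-Evalue means a stronger hit.
best = min(hits, key=lambda h: h.i_evalue)
print(best.index, best.ali_from, best.ali_to)
```

Sorting or filtering on `i_evalue` in this way makes it easy to see which of the many overlapping hits, if any, stands out from the rest.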

Sequence Information

Coding Sequence
ATGTTCTTAACTCTATTTAGATACTACTCCTCTCTCCGATCAGCACGCACAAAATTTACCTATTGGATAGCTGATGATGACTATTATATCTTGCTCTTCTTCCTCTCACAGACTCTGTATGATGGGGCTCATGAAAACATTCATCTACTGTATGTTGCAGCTAATGATGCTACCATTGAGTGCATAATCCATGAATATGAAGAGACCGCCGacttgacagcacagaccgccgacctgagtgcacagaccgccgacctgacagcacagaccgccgacctgacagcacagaccgcagacctgagtgcacagaccgccgacctgacaACAGAGACCGCCGACCTaacagcacagaccgccgacctgacaACACAGACCGCAGacctgagtgcacagaccgccgacctgagtgcacagaccgcccacctgacagcacagaccgccgaactgagtgcacagaccgccgacctgagtgcacagaccgcagacctgagtgcacagaccgccgacctgacagcacagaccgcagacctgagtgcacagaccgcagacctgagtgcacagaccgccgacctgacagcacagaccgcAGACCTGAGcgcacagaccgccgacctgacagcacagaccgccgacctgagtgcacagaccgccgacctgagtgcacagaccgccgacctgacagcacagaccgccgacctgaaaccacagaccgccgacctgagtgcacagaccgccgacctgacaACAGAGACCGCCGACCTAACAGCACAGACCGCCGAActgagtgcacagaccgccgacctgagtgcacagaccgccgacctgagtgcacagaccgcagacctgagtgcacagaccgcagacctgagtgcacagaccgccgacctgacagcacagaccgcagacctgagtgcacagaccgcagacctgagtgcacagaccgccgacctgacagcacagaccgcAGACCTGAGcgcacagaccgccgacctgacagcacagaccgccgacctgagtgcacagaccgccgaactgagtgcacagaccgccgacctgagtgcacagaccgccgacctgagtgcacagaccgccgacctgagtgcacagaccgccgacctgacagcacagaccgcagacctgaacctgacagcacagaccgccgacctgagtgcacagaccactgacctgacagcacagaccgccgacctgagtgcacagaccgccgacctgacagcacagaccgcagacctgacagcacagaccgccgacctgacagcacagaccgccgacctgacagcacagaccgccgacctgagTCCACAGACCGCCGACATTACAGCACAGACCGCAGAACTGACAGCACAGACCGCAGacctgagtgcacagaccgccgacctaTCAGCACAGACCGCAgacctgacagcacagaccgccgacctgagtgcacagaccgcAGACCTGAGAGCACAGACCGCAGacctgagtgcacagaccgccgacatgacagcacagaccgccgacctgaaaccacagaccgccgacctgacagcacagaccgccgacctgaaaccacagaccgccgacctgacagcacagaccgcAGAACTGACAGCACAGACCGCAGAGCTGAATGCACAGACCGCAGacctgagtgcacagaccgccgacctgacagcacagaccgccgacctgacagcacagaccgccgacctgacagcacagaccgccgacctgagtgcacagaccgccgacctgatAGCACAGACCTccgacctgacagcacagacctccgacctgagtgcacagaccgccgacctgacagcacagaccgccgacctgagtgcacagactgccgacctgacagcacagaccgccgacctgagtgcacagaccgccgacctgacagctcagaccgccgacctgagtgcacagaccgccgacctgacagcagagaccgccgacctgacagcacagaccgccgacctgacaGCACACACCGC
CGacctgagtgcacagaccgccgacctgacagcacagaccgccgacctgagtgcacagaccgccgacctgagtgcacagaccgcagacctgagtgcacagaccgccgacctaTCAGCACAGACCGCAGacctgagtgcacagaccgcagacctgagtgcacagaccgccgacctgacagcacagaccgccgaccttAAAccacagaccgccgacctgagtacacagaccgccgacctgacagcacagaccgccgacctgaaaccacagaccgccgacctgagtgcacagaccgccgacctgacagcacagaccgcagaactgacagcacagaccgcagacctgagtgcacagaccgccgacctgacagcacagaccgccgacctgacagcacagaccgACGACCTGAtagcacagaccgccgacctgagtgcacagaccgccgacctgacagcacagaccgccgacctgagtgcacagaccgccgaccttACAGCACAGACCCCCTACctgactgcacagaccgccgacctgagtgcacagaccgccgaactgagtgcacagaccgcagacctgacagcacagaccgcAGACCTCATAGacctgagtgcacagaccgccgacctgacagcacagaccgcagacctgagtgcacagaccgcAGACCTGACAGCACATACCGCAGacctgagtgcacagaccgccgacctgacagcacagaccgcagacctgagtgcacagaccgccgacatgacagcacagaccgccgacctgaaaccacagaccgccgacctgacagcacagaccgccgacctgaaaccacagaccgccgacctgacagcacagaccgcAGAACTGAATGCACAGACCGCAGacctgagtgcacagaccgcagacctgagtgcacagaccgccgacctgacagcacagaccgccgacctgacagcacagaccgccgacctgacagcacagaccgccgacctgacagcacagaccgccgacctgagtgcacagaccgccgacctgatAGCACAGACCTccgacctgacagcacagacctccgacctgagtgcacagaccgccgacctgacagcacagaccgccgacctgagtgcacagactgccgacctgacagcacagaccgccgacctgagtgcacagaccgccgacctgacagctcagaccgccgacctgagtgcacagaccgccgacctgacagcagagaccgccgacctgacagcacagaccgccgacctgacaGCACACACCGCCGacctgagtgcacagaccgccgacctgacagcacagaccgccgacctgagtgcacagaccgccgacctgagtgcacagaccgcagacctgagtgcacagaccgccgacctaTCAGCACAGACCGCAGacctgagtgcacagaccgcagacctgagtgcacagaccgccgacctgacagcacagaccgccgaccttAAAccacagaccgccgacctgagtacacagaccgccgacctgacagcacagaccgccgacctgaaaccacagaccgccgacctgagtgcacagaccgccgacctgacagcacagaccgcagaactgacagcacagaccgcagacctgagtgcacagaccgccgacctgacagcacagaccgccgacctgacagcacagaccgACGACCTGAtagcacagaccgccgacctgagtgcacagaccgccgacctgacagcacagaccgccgacctgagtgcacagaccgccgaccttACAGCACAGACCCCCTACctgactgcacagaccgccgacctgagtgcacagaccgccgaactgagtgcacagaccgcagacctgacagcacagaccgcAGACCTCATAGacctgagtgcacagaccgccgacctgacagcacagaccgcagacctgagtgcacagaccgcAGACCTGACAGCACATACCGCAGacctgagtgcacagaccgccgacctgacagcacagaccgcagacctga
gtgcacagaccgccgacctgacagcacagaccgccgacctgagcgcacagaccgccgacctgacaGCTCAGACCGCCGACTTGAGTACACAGGCCgccgacttgactgcacagaaCACCGAACTGACTGCAATGATCGCCGACTTGAGTGCAATGACCGCCGACCAGACAGCACGGACCGCCGACTTGAGTACACGGACCGCAgacctgacagcacagaccgccgacctgacacagaccgccgacttgagtaaacagaccgccgacctgactgcacagaccgccgatCTGAGTGCACAGACCGCAGACCTGCGTGCACAGACCGCCGCCCTGAGTGCACAGACCAccgacctgacagcacagaccgctgacttgagtgcacagaccaccgacctgagtgcacagaccgccgacctgaaagcacagaccgccgacctgacaGCTCAGACCTCCGacctgaATCGCAGACCTGACAGCACAGACGGCCGACCTGACAGCATAGACCGCCGATTTGACTTCACAGACCAccgacctgacagcacagaccgccgacctgacagcacagaccgccgacctgagtgcacagaccaccgaccagacagcacagaccgccgacaTGACTGCAAAGACCTCCGACCTGattgcacagaccgccgacctgacaGCTCAGACCGCAGACTGACAGCACGGACCGCCGACTTAAGTGCACAGACGACCGacttgagtgcacagaccgccgacttgattGAACAGACCGCAGACTTGGGTGCACAGACTGCCGacttgagtgcacagaccgcagacctgagtgcacagaccgcAGAATTGACTGCATAG
Protein Sequence
MFLTLFRYYSSLRSARTKFTYWIADDDYYILLFFLSQTLYDGAHENIHLLYVAANDATIECIIHEYEETADLTAQTADLSAQTADLTAQTADLTAQTADLSAQTADLTTETADLTAQTADLTTQTADLSAQTADLSAQTAHLTAQTAELSAQTADLSAQTADLSAQTADLTAQTADLSAQTADLSAQTADLTAQTADLSAQTADLTAQTADLSAQTADLSAQTADLTAQTADLKPQTADLSAQTADLTTETADLTAQTAELSAQTADLSAQTADLSAQTADLSAQTADLSAQTADLTAQTADLSAQTADLSAQTADLTAQTADLSAQTADLTAQTADLSAQTAELSAQTADLSAQTADLSAQTADLSAQTADLTAQTADLNLTAQTADLSAQTTDLTAQTADLSAQTADLTAQTADLTAQTADLTAQTADLTAQTADLSPQTADITAQTAELTAQTADLSAQTADLSAQTADLTAQTADLSAQTADLRAQTADLSAQTADMTAQTADLKPQTADLTAQTADLKPQTADLTAQTAELTAQTAELNAQTADLSAQTADLTAQTADLTAQTADLTAQTADLSAQTADLIAQTSDLTAQTSDLSAQTADLTAQTADLSAQTADLTAQTADLSAQTADLTAQTADLSAQTADLTAETADLTAQTADLTAHTADLSAQTADLTAQTADLSAQTADLSAQTADLSAQTADLSAQTADLSAQTADLSAQTADLTAQTADLKPQTADLSTQTADLTAQTADLKPQTADLSAQTADLTAQTAELTAQTADLSAQTADLTAQTADLTAQTDDLIAQTADLSAQTADLTAQTADLSAQTADLTAQTPYLTAQTADLSAQTAELSAQTADLTAQTADLIDLSAQTADLTAQTADLSAQTADLTAHTADLSAQTADLTAQTADLSAQTADMTAQTADLKPQTADLTAQTADLKPQTADLTAQTAELNAQTADLSAQTADLSAQTADLTAQTADLTAQTADLTAQTADLTAQTADLSAQTADLIAQTSDLTAQTSDLSAQTADLTAQTADLSAQTADLTAQTADLSAQTADLTAQTADLSAQTADLTAETADLTAQTADLTAHTADLSAQTADLTAQTADLSAQTADLSAQTADLSAQTADLSAQTADLSAQTADLSAQTADLTAQTADLKPQTADLSTQTADLTAQTADLKPQTADLSAQTADLTAQTAELTAQTADLSAQTADLTAQTADLTAQTDDLIAQTADLSAQTADLTAQTADLSAQTADLTAQTPYLTAQTADLSAQTAELSAQTADLTAQTADLIDLSAQTADLTAQTADLSAQTADLTAHTADLSAQTADLTAQTADLSAQTADLTAQTADLSAQTADLTAQTADLSTQAADLTAQNTELTAMIADLSAMTADQTARTADLSTRTADLTAQTADLTQTADLSKQTADLTAQTADLSAQTADLRAQTAALSAQTTDLTAQTADLSAQTTDLSAQTADLKAQTADLTAQTSDLNRRPDSTDGRPDSIDRRFDFTDHRPDSTDRRPDSTDRRPECTDHRPDSTDRRHDCKDLRPDCTDRRPDSSDRRLTARTADLSAQTTDLSAQTADLIEQTADLGAQTADLSAQTADLSAQTAELTA
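
The protein sequence is presumably the translation of the coding sequence. The sketch below checks this for the first seven codons only, assuming the standard genetic code and treating lowercase bases the same as uppercase (mixed case in the record may mark soft-masked repeats, but that is an assumption). The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order at each position.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon.

    Case is ignored, a trailing partial codon is dropped, and a codon
    containing a character outside ACGT is rendered as "X".
    Stop codons are rendered as "*".
    """
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 21 bases of the coding sequence above.
prefix = "ATGTTCTTAACTCTATTTAGA"
print(translate(prefix))
```

Applying the same function to the full coding sequence and comparing the result with the listed protein would flag any position where the two disagree.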

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-