Basic Information

Gene Symbol
-
Assembly
GCA_030762935.1
Location
CM060885.1:26324496-26329602[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 32 0.017 83 6.0 0.0 30 64 10 44 3 52 0.64
2 32 2.9 1.4e+04 -1.1 0.0 30 64 52 86 46 87 0.73
3 32 0.79 3.8e+03 0.7 0.0 30 61 66 97 60 101 0.82
4 32 0.00045 2.1 11.1 0.5 25 63 96 134 94 136 0.90
5 32 0.87 4.1e+03 0.6 0.1 24 53 158 187 151 198 0.61
6 32 0.039 1.9e+02 4.8 0.1 27 62 213 248 209 251 0.79
7 32 0.63 3e+03 1.0 0.1 30 64 251 285 244 286 0.85
8 32 4.1 1.9e+04 -1.6 0.0 33 59 303 329 295 333 0.63
9 32 0.026 1.3e+02 5.4 0.0 22 56 338 372 337 381 0.85
10 32 1.5 7e+03 -0.2 0.0 25 50 432 457 427 463 0.71
11 32 0.039 1.9e+02 4.9 0.1 19 61 522 565 514 569 0.61
12 32 0.12 5.6e+02 3.3 0.1 26 63 565 602 563 608 0.79
13 32 0.026 1.2e+02 5.4 0.0 28 63 648 683 637 691 0.56
14 32 0.59 2.8e+03 1.1 0.2 26 59 688 725 678 731 0.58
15 32 0.19 8.9e+02 2.7 0.2 27 60 749 782 738 787 0.82
16 32 0.26 1.2e+03 2.2 0.2 27 61 763 797 759 800 0.82
17 32 0.17 7.9e+02 2.8 0.1 30 61 811 842 805 846 0.69
18 32 0.038 1.8e+02 4.9 0.1 24 64 826 866 822 867 0.91
19 32 0.2 9.5e+02 2.6 0.0 27 59 871 903 868 909 0.85
20 32 0.52 2.5e+03 1.3 0.0 27 59 913 945 909 951 0.84
21 32 0.41 1.9e+03 1.6 0.0 26 59 933 966 930 972 0.86
22 32 0.42 2e+03 1.6 0.0 26 59 954 987 950 992 0.86
23 32 0.51 2.4e+03 1.3 0.0 27 59 997 1029 992 1035 0.83
24 32 0.4 1.9e+03 1.6 0.0 26 59 1017 1050 1013 1056 0.86
25 32 0.4 1.9e+03 1.6 0.0 26 59 1038 1071 1034 1077 0.86
26 32 0.39 1.9e+03 1.6 0.0 26 59 1059 1092 1054 1098 0.86
27 32 0.52 2.5e+03 1.3 0.0 27 59 1102 1134 1098 1140 0.84
28 32 0.86 4.1e+03 0.6 0.0 26 58 1122 1154 1119 1158 0.83
29 32 1.6 7.8e+03 -0.3 0.1 19 52 1176 1210 1174 1235 0.82
30 32 1.9 9.1e+03 -0.6 0.1 20 61 1247 1289 1245 1293 0.74
31 32 0.079 3.8e+02 3.9 0.3 26 57 1289 1320 1271 1325 0.62
32 32 0.067 3.2e+02 4.1 0.1 30 58 1362 1390 1343 1404 0.58

Sequence Information

Coding Sequence
atgactgcacagaccgccACCGCCTACTTGAGTACACAGACTGCCGACTTGACGGCACAGACACCCGACTTGAgtacacagaccgccgacttgaggACACAGACAGCCGACTTGACAGCACAGATCaccgacttgactgcacagaccaccgacttgaTTGCACAGACCACCGACCTGATAGCACAGCCCaccgacttgactgcacagaccaCCGACCTGACAGCAGAGACTGCCAAATTCAGTGCACAGATCAccgacctgacagcacagaccgccgacatgactgcacagaccgccgaccttacagcacagaccgccgacttgacagcacagaccgccgaATTAAAAGCAGAGGCCACCGACTTGACTGCTCAGTCCGCCGAATTGAGTGCACAGATCACCTACTTGACAACACAGACCACCGACTTGAAAGCACAGACCACCGTATTAAGTGCACAGACCACCGAAATGAAGGCACAGACCACCGACCTGATAGCACAGACCACCGACATGACTGCACAAACCGCCGACCTGACAGCAAAGACCGCCGAATTGACAGCACAATCCGCCGACGCCACCGACCTGAAAGCACAGCGCACCGTCCTGATATCACATACCACCGACTTGGCTGCACAGACCAccgacctgacagcacagaccgccgaATTGATGGCACAGACCACCGACCTGACACCACAGGCCACCCACTTGACTGCAGAGACCACCGACTTGAGAGCACAGACCACCGATTTCAgtgcacagaccaccgacttgaCTGCTCAGACCACCGACTTGAGAGCACGGACCACCGACCTGAGTCCACAGACCACTTACTTGACAGCACAGAACACCGACTTGAGTACACAGACCACCAACTTGAGTACACAGACTGCCGACTTTAcagcacagaccaccgacttCACAGCACAGATCGCCGACTTGACAGCACAGACAGCCGACTTGAATCGCCGACTTGAGTACACATACCGTCGACTTGACATCACAGACAGCCGACTTAGAGCACAGACCGCCCACTTGACAACAAAGACATTCGACTTGACAGCACAGAACGCCGACTTGACAGCACAGACAGCCGACTTGAGTACACAGACCACATGCAtgacagcacagaccgccgacttgacagCGCATACCGACGACTTGAGTACACAGACCGATGACTTGAgtacacagaccgccgactttAGTATAGAGACTGCCGACTTGAGTACACAGACCGACGAATTGAGTACACAGACAACCAACTTTAgtacacagaccgccgacttgacagCACAGACAGCCAACATGAgtacacagaccgccgacttgagtTCACAGACCGACGACTTGAGTACACAGAACGCCTACTTAGTACACACCCATCGACATGAAtacacagaccgccgacttgagtacacagaccgccgacttgacaTCACAGACAGCCGAATTGACAGCACAGACAGCCGACTTGACAGCACAGAACGCCGACTTGAGGACACAGACAGCCAACTTAATTACACAGAATGCCGACTTGAgtacacagaccgccgacttgacagcacagaccgccgacttgagtACACAACCGCCTTCTTGAGTACACAGACAGCCGACTTGAATACAGAGACCGCCGACTTGACAGCACAGACAGCCGACTTGAATACACAGATTGCTGACTTGACAGCACAGACAGCGGACTTGAGTACACAGACAGCCGACTTGACAGCACAGACAGCCGATTTGAgtacacagaccgccgacttgagtacacagaccgccgacttgagtACACAGACCACCGACTTGAACCGCCCACTTAAGTACACAGACCTTCGACTTGAATACGCAGACAACCAAAGTGACAGCAAGGACCACCGATATGACTGCACAGACCGCCCTcctgacagcacagaccgccgacgTACGAGCACATACCACAACCTCActgcacagaccaccgacttgactgcacagaccgccgacttgagtgCACAGCCCACCGACCTGACAGCACAGATTACCGATTTGActgcacagaccaccgacttgaCGGCACAGACCACAGACATGACTGCACAGACCACTGACTTAAcagcacagaccaccgacttgaaccgccgacttgaggCACACACCAccgacctgacagcacagaccaCAGACTTGAATGCACAGACCACTgacctgacagcacagaccaccgacccaacagcacagaccaccgacttgaGAGCACTGACCGCCGACTTGACAGCACAGACCACCGATTTAAgtgcacagaccaccgacttgagaacacagaccaccgacttgactgcacagactGCCAACTTGAGTGCACGGACCGCCGAATTGAGTGCACAGACCATCGACTTGAGTGCACAGACCACCAACCCGATAGAACAGACCACCGACACCACCGACCTGACAGAAAAGACCGCCGACTTGACAGCAGAGACCACCGACTTGAGAGCACAGACCACCGATTTGAGTGCACAGACCATCGACTTGACTGCTCAGACCACCGACTTGAGAGCAAGGACCACCGACCTGAGTGCACAGACCACCTACTTGACAGCACAGAACACCGACTTGAGTACACAGACCACCAAATTGAGTACACAGACAGCCGACTTTACAGCACAGACCGCCGATTTGACAGCACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAgtacacagaccgccgacttgacagCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAgtacacagaccgccgacttgacagCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACGCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAgtacacagaccgccgacttgacagCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCATCTTGAGTACACATACTCCCGACTTGAGTACACAGACCGCCTACTTAGTACACACGCATCGACATGAATACACTGACCGCCGACTTGAgtacacagaccgccgacttgacaTCACAGTCAGCCGACTTGACAGCACAGACAGCCGACTTGACAGAACAGAACGCCGACTTGACAGCATAGACAGCCGACTTGACAGCACAGAACGCCGACTTGACAACACAAACAGCCGACTTGAGTACACAGTCCACATACTTGatagcacagaccgccgacttgacagCACATACCGACGACTTGAgtacacagaccgccgacttgacagCACAGTGAGCCGAATTGAAAACACAGAGAGCCGACTTGAGTACACAGACGCCGACTTGAGTACACAGACCGACGAATTGAGTACACAGACAGCCAACTTGAGTACAAAGACCGCCGACTTGACAGCACAGACAGCCAACTTGAgtacacagaccgccgacttgagttcgcagaccgccgacttgactaCACAGACTCCCGACTTTAcacagaccgccgacttgagtacacagaccgccgacttgagtACACAGACAGCCGACATGAATACAGAGACCGCCGACTTGACAGCACAGACAGTCGACTTGAGTACACAGATTGCTGacttgacagcacagaccgcGGACTTGAGTACACAAACAACCGACTTGACAGCACAGACTGCCGATTTGAgtacacagaccgccgacttgagtACACAGACCATCGACTTGAACCGCCCACTTAAGTACACAGACCTCCGACTTGAGTACGTAGACCACCGACGTGACAGCACAGACCACCGACATGACTGCACAGACCGCCCacctgacagcacagaccgccgacgTATGAGCACAGACCACAACCTCAATGCACATACCACCGACTTGACTTGCACAGATTaccgacttgactgcacagaccaccgacttgaGAGCACAGACCACAGACATGACTGCACAGACCGCCAAATTGAGGCACACACCAccgacctgacagcacagacTTGA
Protein Sequence
MTAQTATAYLSTQTADLTAQTPDLSTQTADLRTQTADLTAQITDLTAQTTDLIAQTTDLIAQPTDLTAQTTDLTAETAKFSAQITDLTAQTADMTAQTADLTAQTADLTAQTAELKAEATDLTAQSAELSAQITYLTTQTTDLKAQTTVLSAQTTEMKAQTTDLIAQTTDMTAQTADLTAKTAELTAQSADATDLKAQRTVLISHTTDLAAQTTDLTAQTAELMAQTTDLTPQATHLTAETTDLRAQTTDFSAQTTDLTAQTTDLRARTTDLSPQTTYLTAQNTDLSTQTTNLSTQTADFTAQTTDFTAQIADLTAQTADLNRRLEYTYRRLDITDSRLRAQTAHLTTKTFDLTAQNADLTAQTADLSTQTTCMTAQTADLTAHTDDLSTQTDDLSTQTADFSIETADLSTQTDELSTQTTNFSTQTADLTAQTANMSTQTADLSSQTDDLSTQNAYLVHTHRHEYTDRRLEYTDRRLDITDSRIDSTDSRLDSTERRLEDTDSQLNYTECRLEYTDRRLDSTDRRLEYTTAFLSTQTADLNTETADLTAQTADLNTQIADLTAQTADLSTQTADLTAQTADLSTQTADLSTQTADLSTQTTDLNRPLKYTDLRLEYADNQSDSKDHRYDCTDRPPDSTDRRRTSTYHNLTAQTTDLTAQTADLSAQPTDLTAQITDLTAQTTDLTAQTTDMTAQTTDLTAQTTDLNRRLEAHTTDLTAQTTDLNAQTTDLTAQTTDPTAQTTDLRALTADLTAQTTDLSAQTTDLRTQTTDLTAQTANLSARTAELSAQTIDLSAQTTNPIEQTTDTTDLTEKTADLTAETTDLRAQTTDLSAQTIDLTAQTTDLRARTTDLSAQTTYLTAQNTDLSTQTTKLSTQTADFTAQTADLTAQIADLTAQTADLSTHTPDLSTQTADLTAQTADLSTHTPDLSTQIADLTAQTADLSTHTPDLSTQIADLTAQTADLSTHTPDLSTQIADLTAQTADLSTHTPDLSTQTADLTAQTADLSTHTPDLSTQIADLTAQTADLSTHTPDLSTQIADLTAQTADLSTHTPDLSTQIADLTAQTADLSTHTPDLSTQIADLTAQTADLSTHTPDLSTQTADLTAQTADLSTHTPDLSTQIADLTAQTADLSTHTPDLSTQIADLTAQTAILSTHTPDLSTQTAYLVHTHRHEYTDRRLEYTDRRLDITVSRLDSTDSRLDRTERRLDSIDSRLDSTERRLDNTNSRLEYTVHILDSTDRRLDSTYRRLEYTDRRLDSTVSRIENTESRLEYTDADLSTQTDELSTQTANLSTKTADLTAQTANLSTQTADLSSQTADLTTQTPDFTQTADLSTQTADLSTQTADMNTETADLTAQTVDLSTQIADLTAQTADLSTQTTDLTAQTADLSTQTADLSTQTIDLNRPLKYTDLRLEYVDHRRDSTDHRHDCTDRPPDSTDRRRMSTDHNLNAHTTDLTCTDYRLDCTDHRLESTDHRHDCTDRQIEAHTTDLTAQT

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-