Basic Information

Gene Symbol
-
Assembly
GCA_001263575.1
Location
NW:19941-25523[-]
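The Location field packs a sequence identifier, a coordinate range, and a strand into one string. A minimal sketch of splitting it apart follows. The function name `parse_location` and the returned keys are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for strings shaped like "NW:19941-25523[-]":
# identifier, colon, start-end, then the strand in square brackets.
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text):
    """Split a Location string into its parts (hypothetical helper)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    return {
        "seqid": match["seqid"],    # kept exactly as written in the record
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }


location = parse_location("NW:19941-25523[-]")
print(location)
```

Under the inclusive-coordinate assumption, the locus spans 5583 bases on the minus strand.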

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 28 1.9 8.3e+02 -0.6 0.9 46 64 66 84 52 85 0.58
2 28 0.00024 0.11 11.9 1.6 35 64 99 128 98 129 0.89
3 28 0.00069 0.31 10.4 1.3 28 59 216 247 209 252 0.67
4 28 1.6 7e+02 -0.4 0.2 41 62 277 298 268 300 0.69
5 28 6 2.7e+03 -2.2 0.0 25 47 303 325 301 330 0.82
6 28 0.77 3.4e+02 0.6 1.4 29 62 373 406 371 409 0.79
7 28 0.00026 0.12 11.7 5.0 28 63 442 477 434 479 0.55
8 28 0.078 35 3.8 1.0 25 45 477 497 474 504 0.85
9 28 2.9e-05 0.013 14.8 6.4 25 63 505 543 501 544 0.94
10 28 0.0093 4.2 6.7 1.5 32 64 554 586 546 587 0.86
11 28 0.77 3.5e+02 0.6 0.1 47 63 583 599 581 612 0.67
12 28 1.4e-07 6.2e-05 22.2 3.1 23 65 615 657 613 657 0.94
13 28 7.7e-06 0.0035 16.6 3.6 21 62 648 689 648 692 0.91
14 28 0.0039 1.7 8.0 5.9 22 65 670 713 670 720 0.71
15 28 0.035 16 4.9 7.8 24 64 693 733 684 734 0.82
16 28 0.0074 3.3 7.1 2.4 30 63 720 753 716 755 0.90
17 28 0.0005 0.23 10.8 2.5 26 61 779 814 756 815 0.73
18 28 1.1e-05 0.0049 16.1 5.4 28 64 809 845 807 846 0.94
19 28 0.021 9.4 5.6 3.9 32 61 848 877 844 880 0.81
20 28 4.2e-07 0.00019 20.7 2.4 25 63 876 914 874 916 0.93
21 28 0.089 40 3.6 6.1 27 62 934 969 916 970 0.83
22 28 0.0041 1.8 7.9 7.8 22 63 957 998 942 1000 0.74
23 28 0.0025 1.1 8.6 5.7 26 62 989 1025 984 1034 0.60
24 28 7.2e-06 0.0032 16.7 4.9 24 65 1029 1070 1026 1070 0.94
25 28 0.00092 0.41 10.0 4.1 28 63 1068 1103 1064 1105 0.85
26 28 0.00094 0.42 9.9 3.3 23 57 1077 1111 1074 1118 0.66
27 28 8.9 4e+03 -2.8 0.0 39 59 1188 1208 1184 1211 0.79
28 28 0.054 24 4.3 1.3 34 57 1268 1291 1243 1295 0.77
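Each row of the table above has 13 whitespace-separated fields, so it can be read into a small record type and filtered. The sketch below parses four rows copied from the table and keeps the hits whose independent E-value passes a cutoff. The names `DomainHit`, `parse_hit`, and `MAX_I_EVALUE` are hypothetical, and the 0.01 cutoff is an illustrative assumption, not a threshold stated by this record.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One per-domain row of the hmmscan table (hypothetical record type)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line):
    """Parse one whitespace-separated table row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


# Four rows copied verbatim from the table above.
ROWS = """\
1 28 1.9 8.3e+02 -0.6 0.9 46 64 66 84 52 85 0.58
9 28 2.9e-05 0.013 14.8 6.4 25 63 505 543 501 544 0.94
12 28 1.4e-07 6.2e-05 22.2 3.1 23 65 615 657 613 657 0.94
20 28 4.2e-07 0.00019 20.7 2.4 25 63 876 914 874 916 0.93
"""

hits = [parse_hit(line) for line in ROWS.splitlines() if line.strip()]

# Assumption: an illustrative cutoff on the independent E-value.
MAX_I_EVALUE = 0.01
confident = [hit for hit in hits if hit.i_evalue <= MAX_I_EVALUE]

# The hit with the smallest independent E-value.
best = min(hits, key=lambda hit: hit.i_evalue)

for hit in confident:
    print(hit.index, hit.i_evalue, hit.ali_from, hit.ali_to)
```

With this cutoff, rows 12 and 20 are retained from the sample; row 9 (i-Evalue 0.013) falls just outside it, which shows how sensitive the selection is to the chosen threshold.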

Sequence Information

Coding Sequence
ATGATGGCGGGTCGCACATGTCAATGCGGATGCACTGACCCACCGAAAATGACGGGAGGTGATCCTCCGAACGAAGGATCCTGCGGGTGTAGCTACAATCCGCTGGGAGAGGGCGGGAGGGATGCGGAAATAACGGACCTATCTTACGCCCTGCGGAAACTGACCTCGATGAAATGCCAGATGAAGAAATGGAGGATGGAACGTCTGCAGCTTGAGAGTGAGGCGAGGGCTTTGAAGCAGGTGCTGCAGGCCCACGGTCTTAACGACGACATCGTGAGGCCTGATCCACTGCTTGCTCATCTTCGAGAGGAGAATGCCAGGCtggaaaacgaaaacgaagaaCTTCAGGACAAGGTTAAAGGACTCGAGGACACCATAACCGAGTACGAGTATGTCGAGTCACCGTGCGAACTGGTCAGCAAACTTCGCGAAAAGATGAGGAACATGAAGGAGGCTCATGCTGGTGAAAAACGAAGATTGAGAGAGTTAATTTCCGGGCTGAAGATCCGGCTCCAGGAGGCGGAGGCCGAGTCATCTTGTGCAGCATTGAATCGTCTGCGAGCAAAGCTTCGCGAGCTAACTGAAGGCGGACAGGAGGCAGACCAACGGGTTTCGAAAGTGGTTCAGCGTTCGATAGAAACGTTGGTCGAGCTGACGGATAACGTTGATGACCTAAAGGCGGAGATCGAGAGACTTCGTGCGGAGATCAAGCGTCTAAAGGACCTGCTGGACGCCTGTGAAGAGCGGCGAAGAACGGCGACTGATGTCGCTGTCGAAACAACCCTCCCGGAGGTGAAACCACCAGAAAAACCGCTGGTGGAAATGGACGTTTCAGATCTACTCAACAGGATCAAGGAGCTCGAAGCACTCATAGCTCAGCTGAGGAAGCAGCTCGTGGACAAGGATGCCGCCATCAATGACCTCCAAAACAAACTGTTCAACGTCACATCGGACAATAAGAGACTCAGCACTGACCTAGATCAAATGATGGTCAGCTACAGAGCCGTCATGGACGAGGTAAAAGCTATGAAGGATGAGCTCAAAAAGAGGGACGTGAAGGTTTCGGATCTCTTGCGTGAGCTCCAAGCGTCGGCGATCGATATGCTGGGATTGAACAGGCTGCAGAGTGAGATTGAATCGGTCAAGCCACAATTGTACAACCTTGAACTGGAGAGAGAACAGCTGTTGTCAGAGCTCGGTAAGGTTCGGGGAGTAGTTTCGGAGAGGAACGATCAGATAATTAAGATCCTGGAGGAAAGAGACAAGCATGCTCGAGCTCTGGGCAAAGTCGCGAGCACGATACAGGAAACGGCCGAACGGGAGGAGGCGCTGAAACGCGAGATTGATCGGCTGAAGGATCGAATAGCTGAGCTTGAGAAGGAGATAGCTGAGCTTAAGAAAAAGGTGGCTGAGCTCGCGGcggagaacgaaaaaattcctggacttgagaaaaaaattaaggagcTCGAAGACGAGCTAGCGAAGCTCAGAGGCGATCTTGCCGCTGCTAACACGAAGATGAACGACCTTGAGAAAGAAATAGCCGATCTAAAGGCAGAAAAAGATGCATTAGCAAGAGAGCTAGCAAAGGCAAAGGAGCAGGTGGAGAAGCTGAAAGAGGAGCTTGCTGCTGAGAGATCTGCAAAAGAAGCGGCCATGAAAGAACTCGAGGTCTGTAGAGCTGAGAACGAGAAACTGAGAGGAGACAACGAGCGAATGAGCAACGAACTCAACGCGGCAAAGGGCGAAATTGAAAGACTGAAAAATGAGCTTGACAAGGTTAAGGGCGAGTTGGATAAATCGAGAGCCGAAAACAGTGAGCTCAAGGACCTGCTCGCTGCAGCTAAGGCGGAAATCGACAAGCTCAGAAGCGAGGTCGAGGGGTGCAAAGCTGAGAATGCCAAGCTTAAAGGTGAGATTGTGCGATTAAATGAGGAAGTACAAAAGTTGAAGGCAGAGAATAGCGAGCtcaagaaagagagagacacgcTGCAGGCTGA
AGTGGGAAAGCTGAAAGAAAAGATCGACGGAATGCAAGCTGAAATTGATAAGCTGAAGAACGATCTGGCCGCATCTAAAAGCGAGATGGAAAAGCTCAAGAATGATTTGGACGCTTTGAAATCGGAGAACGAGAAGCTCAAGAACAGTTTACGCGAAGCCGAGGCGAAGATAAAGGCATTGGAAGCGGAGAACTCAGATCTTGCTAATAAATTAGCCGATCTGAAGATCAAGAtagaaaatcttgaaaaacagCTTGCAGATGAAAAAGCCGCGAAAGAAGCGGCGCTAAAGGAATTGGCAGCGTTAAAGTCGGACCTCAAAGCGTTACTCGGAGAGATGGACAAGCTCAAAGCCGAGCGGGACAAACTGAAAGGGGAAGTGGATGACCTGACGAAGCGGATGGCAGACTTGACCAATGAGCTAAATCAGCTGAAATCAAAGTGTGCTGCCCTCGCGGCAGAGAACGAGAAGTTGAAAGCAGAAGTCAACGGTCTTAAAACAGAGAATGAGAGGCTGAAGAACGACCTGGAGAAGGTCAAGGCTGACCTTGAGGCAGCGAAATCAGAGAACGCGAAGTTAAAAGCGGAAAATGAGAAGCTGAAGAAAGATTTGATTGATGCTGAGGCAAAGGTCAAGGCGCTCGAGGATAAGGTCAAGGCATGTGAGGACGAAAAGGCGAAGCTTCGTCAGGAGATCGAGGGGCTCAAAAGTCAGATTGACAAACTCAACAGTGAGCTCGCAGCAGAGAAAGCGGCGAAAGAGGCGGCTTTGAAAGAACTGGCAGCGACCAAGGCCGAGCTAGCTGCGCTCAGAACAGAGCTGGACAAAGTGAGAGCCGAGAACGCGAGACTGAATGGTGAGCTCGAAAAGTTGAAGTCGGAGAATGAGAAGATGAAGGGGGAGCTTGACCGACTCAAAGCGGAAAACGCGAAGCTACAAGGTGACCTGGACGCCCTAAGGGCGGAGAATTCGAAGCTCAAAGGAGATCTGGATAAATTGAATTCCGAACTGAGCGCGTTGCGAGCTGAGAACGATAAGTTGAAGgccgaaaattcgaaattgaagGATGATTTGGCGGCCGCAAAAGAGGAAGCTGCGCGTCTCAAGAGTGATctggaaaaactaaaatccGAGAATGACGCCCTGAGAGCTGAGAACGACAAGGTGAAGGGAGAACTTGAGGGGCTCAAAGCAGAGCTTAACAAACTACGCGGGGATTTAGACGCCATGAAGGACGAGAATGCGAGGCTCAGGTCTGAGGTTGACAAACTAAAAAGCGATAATGAGAATCTAAAGAACGAGCTTGCGAAGGCCAACGCCGAGTTGGAAAATTCTAAGAAAGCAGTTGACAAACCAAAAGCTGCTGTGGCTGCCACTACTGGTCCTCTTCCTGAGAAAGTTCGTCGATCTATTCCAATGGAAGTCCCACCCACTGCTGCCAAGCCTGCAGTGAAGAAGGAACCGCCCCGAATGCCCTCAAAAACTCCAAAAGTTGAACGGCGAGCTTCGGTTGCAAAGAGAGACCAAGGATCCCAAGGCGAGGGCTGCGGTGATTACGTAAGCGCGAATGAACAGCTCAGGAAGAACATAAACAATCAAGACAGAGCTGTTCAGCGGATACGAAACTTCGTGAAGTACGTGCtgggagagagagaatcaCCTCCGGAGATGGCTGACTCACAAACTCACCGTATGTCATCAGTGATGAGGAACAAATTTGCGGAAGACATAATGGAGTTGCTTAAGGAGTCGCAGTATCTCTCAGAGAGTATATTCAACGCTGAGGCGGACGTTCAGCGGCTTCTGAAGATCCTCGAAGAGCtagaaaagttgagaaatgAGAACGCTGCGCTCAGAGACAAACTCGAAGTGACGGAGGAGCCCGTAGGCTTCGGGGACGCTTTCGACGCCGAATCGTGGCTGAAGACGCTGACGTTGACCGAGCTCGCCGAGCTCCACGACAGGATCTGCCTCGTGACGTCGAGCATGGTTAAGCAGG
ACATAAACCCGGAGGACTACGTTGACGACTCTACTTCACCCGACGGAGTCTGCAAGCCCTGTTCAGGTCTGCAGGACGCGGACGATGAGATGATCGGCGGGGACTACGAAGCTCTGAATAAGAGGATAGCAGCGCTTCAAATGCAGATTAACGAGAAGCAGAACGAGGCAGCAGCCAAGGTTCAAGAGATGCGGAAGGCCATGTGGCGAGAGCAGGACCGGCTGATACGCCTATCGGAGGAGATGAATGCCCAGAAAAGGAACAACCTGacgatgaagatgaagatcGGTGGTGAGCTCGATCCTTTCGGGGTTCCCGAAGGAGTCGCCGTCCTTTGCGGTGGGAGTCGGAGCTCTGGGGAGCTTGACGATTATCCGGGGACGAAATGGAACCCGCGATTCTCTGCCATCGGAGAGAAAGATGACGGAAAGTGGAAATCTGGCTGCGTTAGGTCGAAGAGAAAAAGCTCTCTCCAAGCTCCGGCACCTTGCGCTCTTCCTGTTAAACACGCAGACGTCCCGTGCTGTCAGAAACTCTGCTGTCCGTCTACCCTAAATCCGAAAACTCTTTTCCCGAAGAAGAGGAAGCAGGATGATGACGATAATATGCAGGAGGTCGCGGTTCTTTGTGGGGAAAATCGACGGGCCACGAATACTTATgatagataa
Protein Sequence
MMAGRTCQCGCTDPPKMTGGDPPNEGSCGCSYNPLGEGGRDAEITDLSYALRKLTSMKCQMKKWRMERLQLESEARALKQVLQAHGLNDDIVRPDPLLAHLREENARLENENEELQDKVKGLEDTITEYEYVESPCELVSKLREKMRNMKEAHAGEKRRLRELISGLKIRLQEAEAESSCAALNRLRAKLRELTEGGQEADQRVSKVVQRSIETLVELTDNVDDLKAEIERLRAEIKRLKDLLDACEERRRTATDVAVETTLPEVKPPEKPLVEMDVSDLLNRIKELEALIAQLRKQLVDKDAAINDLQNKLFNVTSDNKRLSTDLDQMMVSYRAVMDEVKAMKDELKKRDVKVSDLLRELQASAIDMLGLNRLQSEIESVKPQLYNLELEREQLLSELGKVRGVVSERNDQIIKILEERDKHARALGKVASTIQETAEREEALKREIDRLKDRIAELEKEIAELKKKVAELAAENEKIPGLEKKIKELEDELAKLRGDLAAANTKMNDLEKEIADLKAEKDALARELAKAKEQVEKLKEELAAERSAKEAAMKELEVCRAENEKLRGDNERMSNELNAAKGEIERLKNELDKVKGELDKSRAENSELKDLLAAAKAEIDKLRSEVEGCKAENAKLKGEIVRLNEEVQKLKAENSELKKERDTLQAEVGKLKEKIDGMQAEIDKLKNDLAASKSEMEKLKNDLDALKSENEKLKNSLREAEAKIKALEAENSDLANKLADLKIKIENLEKQLADEKAAKEAALKELAALKSDLKALLGEMDKLKAERDKLKGEVDDLTKRMADLTNELNQLKSKCAALAAENEKLKAEVNGLKTENERLKNDLEKVKADLEAAKSENAKLKAENEKLKKDLIDAEAKVKALEDKVKACEDEKAKLRQEIEGLKSQIDKLNSELAAEKAAKEAALKELAATKAELAALRTELDKVRAENARLNGELEKLKSENEKMKGELDRLKAENAKLQGDLDALRAENSKLKGDLDKLNSELSALRAENDKLKAENSKLKDDLAAAKEEAARLKSDLEKLKSENDALRAENDKVKGELEGLKAELNKLRGDLDAMKDENARLRSEVDKLKSDNENLKNELAKANAELENSKKAVDKPKAAVAATTGPLPEKVRRSIPMEVPPTAAKPAVKKEPPRMPSKTPKVERRASVAKRDQGSQGEGCGDYVSANEQLRKNINNQDRAVQRIRNFVKYVLGERESPPEMADSQTHRMSSVMRNKFAEDIMELLKESQYLSESIFNAEADVQRLLKILEELEKLRNENAALRDKLEVTEEPVGFGDAFDAESWLKTLTLTELAELHDRICLVTSSMVKQDINPEDYVDDSTSPDGVCKPCSGLQDADDEMIGGDYEALNKRIAALQMQINEKQNEAAAKVQEMRKAMWREQDRLIRLSEEMNAQKRNNLTMKMKIGGELDPFGVPEGVAVLCGGSRSSGELDDYPGTKWNPRFSAIGEKDDGKWKSGCVRSKRKSSLQAPAPCALPVKHADVPCCQKLCCPSTLNPKTLFPKKRKQDDDDNMQEVAVLCGENRRATNTYDR
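The coding and protein sequences can be checked against each other by translating the coding sequence. The sketch below assumes the standard genetic code and frame 0, neither of which is stated by this record. The function name `translate` is hypothetical, and the two inputs are the first 30 and last 9 nucleotides of the coding sequence above. Lowercase bases are uppercased before lookup.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a coding sequence in frame 0 (hypothetical helper).

    Whitespace is removed and case is ignored. Codons that are not in the
    table (for example, ones containing N) become 'X'. A trailing partial
    codon is dropped.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides and last 9 nucleotides of the coding sequence.
n_terminus = translate("ATGATGGCGGGTCGCACATGTCAATGCGGA")
c_terminus = translate("gatagataa")
print(n_terminus, c_terminus)
```

The two fragments translate to `MMAGRTCQCG` and `DR`, which agree with the start and end of the protein sequence listed above.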

Similar Transcription Factors

Clusters based on sequence similarity, computed with MMseqs2

100% Identity
iTF_01048633;
90% Identity
iTF_01048633;
80% Identity
-