Basic Information

Gene Symbol
-
Assembly
GCA_021155775.2
Location
CM037746.1:2947536-2953097[-]
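
The location string above packs a sequence accession, a start, an end, and a strand into one field. A minimal sketch of how it could be split apart follows. The `Location` type and `parse_location` helper are hypothetical names, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'seqid:start-end[strand]' field."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    # Expected shape: accession, colon, start-end, strand in square brackets.
    m = re.fullmatch(r"(\S+):(\d+)-(\d+)\[([+-])\]", text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("CM037746.1:2947536-2953097[-]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc, span)
```

The `[-]` suffix indicates the minus strand, so the coding sequence would be read as the reverse complement of this genomic interval.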

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 25 2.1 2.6e+03 -0.6 0.9 46 64 66 84 52 85 0.58
2 25 0.00027 0.34 11.8 1.6 35 64 99 128 98 129 0.89
3 25 0.00079 0.98 10.3 1.3 28 59 216 247 209 252 0.67
4 25 1.8 2.2e+03 -0.4 0.2 41 62 277 298 268 300 0.69
5 25 0.87 1.1e+03 0.6 1.4 29 62 373 406 371 409 0.79
6 25 0.00023 0.28 12.1 4.2 25 60 439 474 437 479 0.84
7 25 0.058 73 4.4 0.3 25 45 477 497 474 504 0.86
8 25 4.7e-05 0.059 14.3 7.3 26 63 506 543 501 544 0.94
9 25 0.023 29 5.7 2.6 32 64 554 586 546 608 0.89
10 25 1.7e-07 0.00021 22.1 3.2 23 65 615 657 613 657 0.94
11 25 2e-05 0.025 15.4 3.4 21 62 648 689 648 692 0.84
12 25 0.01 13 6.8 4.3 25 65 673 713 672 720 0.49
13 25 0.038 47 5.0 7.7 24 64 693 733 684 734 0.82
14 25 0.0091 11 6.9 2.4 30 63 720 753 718 755 0.89
15 25 0.00024 0.31 12.0 3.1 26 64 779 817 759 818 0.86
16 25 3e-05 0.037 14.9 6.9 28 64 809 845 805 852 0.94
17 25 0.022 28 5.7 4.6 32 64 848 880 844 881 0.83
18 25 0.0016 2 9.3 1.4 32 62 876 906 874 908 0.49
19 25 9.8e-07 0.0012 19.7 3.1 25 63 897 935 895 937 0.93
20 25 0.77 9.6e+02 0.8 2.5 25 65 939 979 935 983 0.72
21 25 0.051 64 4.5 8.3 28 58 970 1000 952 1013 0.60
22 25 0.0017 2.2 9.2 5.5 24 62 1008 1046 1005 1053 0.72
23 25 9.1e-06 0.011 16.5 5.0 24 65 1050 1091 1047 1091 0.94
24 25 0.0013 1.6 9.7 3.6 23 57 1098 1132 1095 1139 0.66
25 25 0.099 1.2e+02 3.6 1.3 35 56 1290 1311 1261 1315 0.73
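
Each row of the table above has 13 whitespace-separated fields, so it can be read into a typed record and filtered. The sketch below uses four rows copied from the table. The `DomainHit` class, its field names, and the 0.01 i-Evalue cutoff are illustrative assumptions, not settings taken from this record.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """Hypothetical record for one per-domain row; field names are assumed."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *(int(x) for x in fields[6:12]),  # the six coordinate columns
        float(fields[12]),
    )


# Rows 1, 10, 19 and 23 copied from the table above.
rows = """\
1 25 2.1 2.6e+03 -0.6 0.9 46 64 66 84 52 85 0.58
10 25 1.7e-07 0.00021 22.1 3.2 23 65 615 657 613 657 0.94
19 25 9.8e-07 0.0012 19.7 3.1 25 63 897 935 895 937 0.93
23 25 9.1e-06 0.011 16.5 5.0 24 65 1050 1091 1047 1091 0.94
"""

hits = [parse_row(line) for line in rows.splitlines()]
# Assumed cutoff for illustration only: keep domains with i-Evalue below 0.01.
significant = [h for h in hits if h.i_evalue < 0.01]
best = max(hits, key=lambda h: h.score)
print([h.index for h in significant], best.index)
```

Filtering on the independent E-value rather than the conditional one is a design choice here; a different threshold or column may suit other analyses.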

Sequence Information

Coding Sequence
ATGATGGCGGGTCGCACATGTCAATGCGGATGCACTGACCCACCGAAAATGACGGGAGGTGATCCTCCGAACGAAGGATCCTGCGGGTGCAGCTACAATCCGCTGGGAGAGGGTGGAAGGGATGCGGAAATAACAGACCTATCTTACGCCCTGCGGAAACTGACCTCGATGAAATGCCAGATGAAGAAATGGAGGATGGAACGTCTGCAGCTTGAGAGTGAGGCGAGGGCTTTGAAGCAGGTGCTGCAGGCCCACGGTCTTAACGACGACATCGTGAGGCCCGATCCACTGCTTGCTCATCTTCGAGAGGAGAATGCCAGGCTGGAAAACGAAAACGAAGAACTTCAGGACAAGGTTAAAGGACTCGAGGACACCATAACCGAGTACGAGTATGTCGAGTCACCGTGCGAACTGGTCAGCAAACTTCGCGAAAAGATGAGGAACATGAAGGAGGCTCATGCTGGTGAAAAACGAAGATTGAGAGAGTTAATTTCCGGGCTGAAGATCCGGCTCCAGGAGGCGGAGGCCGAGTCATCTTGTGCAGCATTGAATCGTCTGCGAGCAAAGCTTCGCGAGCTAACTGAAGGCGGACAGGAGGCAGACCAACGGGTTTCGAAAGTGGTTCAGCGTTCGATAGAAACGTTGGTCGAGCTGACGGATAACGTTGATGACCTAAAGGCGGAGATCGAGAGACTTCGTGCGGAGATCAAGCGTCTAAAGGACCTGCTGGACGCCTGTGAAGAGCGGCGAAGAACGGCGACTGATGTCGCTGTCGAAACAACCCTCCCGGAGGTGAAACCACCAGAAAAACCGCTGGTGGAAATGGACGTTTCAGATCTACTCAACAGGATCAAGGAGCTCGAAGCACTCATAGCTCAGCTGAGGAAGCAGCTCGTGGACAAGGATGCCGTCATCAATGACCTCCAAAATAAACTGTTCAACGTCACATCGGACAATAAGAGACTCAGCACTGACCTAGATCAAATGATGGTCAGCTACAGAGCCGTCATGGACGAGGTAAAAGCTATGAAGGATGAGCTCAAAAAGAGGGACGTGAAGGTTTCGGATCTCTTGCGTGAGCTCCAAGCGTCGGCGATCGATATGCTGGGATTGAACAGGCTGCAGAGTGAGATTGAATCGGTCAAGCCACAATTGTACAACCTTGAACTGGAGAGAGAACAGCTGTTGTCAGAGCTCGGTAAGGTTCGGGGAGTAGTTTCGGAGAGGAACGATCAGATAATTAAGATCCTGGAGGAAAGAGACAAGCATGCTCGAGCTCTGGGCAAGGTCGCGAGCACGATACAGGAAACGGCCGAACGGGAGGAGGCGCTGAAACGCGAGATTGATCGGCTGAAGGATCAAATAGCTGAGCTTGAGAAGGAGATAGCTGAGCTTAAGAAAAAGGTGGCTGAGCTCGCGGCGGGGAACGAAAAAATTCCTggacttgagaaaaaaattaaggagcTCGAAGACGAGCTAGCGAAGCTCAGAGGCGATCTTGCCGCTGCTGACACGAAGATGAACGACCTTGAGAAAGAAATAGCCGATCTGAAGGCAGAAAAAGATGAATTAGCAAGAGAGCTAGCAAAGGCAAAGGAGCAGGTGGAGAAGCTGAAAGAGGAGCTTGCTGCTGAGAGATCTGCAAAAGAAGCGGCCATGAAAGAACTCGAGGTCTGTAGAGCTGAGAACGAGAAACTGAGAGGAGACAACGAGCGAATGAGCAACGAACTCAACGCGGCAAAGGGAGAAATTGAAAGACTGAAAAATGAGCTTGACAAGGTTAATGGCGAGTTGGATAAATCGAGAGCCGAAAACAGTGAGCTCAAGGACCTGCTCGCTGCAGCTAAGGCGGAAATCGACAAGCTCAGAAGCGAGGTCGAGGGGTGCAAGGCTGAGAATGCCAAGCTTAAAGGTGAGATTGTGCGATTAAATGAGGAAGTACAAAAGTTGAAGGCAGAGAATAGCGAGCtcaagaaagagagagacacgCTGCAGGCTGA
AGTGGGAAAGCTGAAAGAAAAGATCGACGGAATGCAAGGTGAAATTGATAAGCTGAAGAACGATCTGGCCGCATCTAAAAGCGAGATGGAAAAGCTCAAGAATGATTTGGACGCTTTGAAATCGGAGAACGAGAAGCTCAAGAACAGTTTACGCGAAGCCGAGGCGAAGATAAAGGCATTGGAAGCGGAGAACTCAGATCTTGCTAATAAATTAGCCGATCTGAAGATCAAGAtagaaaatcttgaaaaacagCTTGCAGATGAAAAAGCCGCGAAAGAAGCGGCGCTAAAGGAATTGGCAGCGTTAAAGTCGGACCTCAAAGCGTTACTCGGAGAGATGGACAAGCTCAAAGCCGAGCGGGACAAACTGAAAGGGGAAGTGGATGACCTGACGAAGCGGATGGCAGACTTGACCAATGAGCTAAATCAGCTGAAATCAAAGTGTGCTGCCCTCGCGGCAGAGAACGAGAAGTTGAAAGCAGAAGTCAACGGTCTTAAAACAGAGAATGAGAGGCTGAAGAACGACCTGGAGAAGGTCAAGGCTGACCTTGAGGCAGCGAAATCAGAGAACGCGAAGTTAAAAGCGGAAAATGAGAAGCTGAAGAAAGATTTAATTGATGCTGAGGCAAAGGTCAAGGCGCTCGAGGATAAGGTCAAGGCGCTCGAAGACAAGGTGAAGACGCTCGAAGACAAGGTGAAAACACTTGAAGACAAGGTCAAGGCATGTGAGGACGAAAAGGCGAAGCTTCGTCAGGAGATCGAGGGGCTCAAAAGTCAGATTGACAAACTCAACAGTGAGCTCGCAGCAGAGAAAGCGGCGAAAGAGGCGGCTTTGAAAGAACTGGCAGCGACCAAGGCCGAGCTAGCTGCGCTCAGAACAGAGCTGGACAAAGTGAGAGCCGAGTACGCGAGACTGAATGGTGAGCTCGAAAAGTTGAAGTCGGAGAATGAGAAGATGAAGGGGGAGCTTGACCGACTCAAAGCGGAAAACGCGAAGCTACAAGGCGACCTAGACGCCCTAAAGGCGGAGAATTCGAAGCTCAAAGGAGATCTGGATAAATTGAATTCCGAACTGAGCGCGTTGCGAGCTGAGAACGATAAGTTGAAGgccgaaaattcgaaattaaaGGATGATTTGGCGGCCGCAAAAGAGGAAGCTGCGCGTCTCAAGAGTGATCTGGAAAAACTAAAATCCGAGAATGACGCCCTGAGAGCTGAGAATGACAAGGTGAAGGGAGAACTTGAGGGGCTCAAAGCAGAGCTTAACAAACTACGCGGGGATTTAGACGCCATGAAGGACGAGAATGCGAGGCTCAGGTCTGAGGTTGACAAACTAAAAAGCGATAATGAGAATCTAAAGAACGAGCTTGCGAAGGCCAACGCCGAATTGGAAAATTCTAAGAAAGCAGTTGACAAACCAAAAGCTGCTGTGGCTGCCACTACTGGTCCTCTTCCTGAGAAAGTTCGTCGATCTATTCCAATGGAAGTCCCACCCACTGCTGCTAAGCCTGCAGTGAAGAAGGAACCGCCCCAAATGCCCTCAAAAACTCCAAAAGTTGCACGGCGAGCTTCGGTTGCAAAGAGAGACCAAGGATCCCAAGGCGAGGGCTGCGGTGATTACGTAAGCGCGAATGAACAGCTCAGGAAGAACATAAACAATCAAGACAGAGCTGTTCAGCGGATACGAAACTTCGTGAAGTACGTGCTGGGAGAGAGAGAATCACCTCCGGAGATGGCTGACTCACAAACTCACCGTATGTCATCAGTGATGAGGAACAAATTTGCGGAAGACATAATGGAGTTGCTTAAGGAGTCGCAGTATCTCTCAGAGAGTATATTCAACGCTGAGGCGGACGTTCAGCGGCTTCTGAAGATCCTCGAAGAGCTAGAAAAGTTGAGAAATGAGAACGCTGCGCTCAGAGACAAACTCGAAGTAGCGGAGGAGCCCGTAGGCTTCGGGGACGCTTTCGATGCCGAATCGTGGCTGAAGACGCTGACGT
TGACCGAGCTCGCCGAGCTCCACGACAGGATCTGCCTCGTGACGTCGAGCATGGTTAAGCAGGACATAAACCCGGAGGACTACGTTGACGACTCTACTTCACCCGACGGAGTCTGCAGGCCCTGTTCAGGTCTGCAGGACGCGGACGATGAGATGATCGGCGGGGACTACGAAGCTCTGAATAAGAGGATAGCAGCGCTTCAAATGCAGATTAACGAGAAGCAGAACGAGGCAGCAGCCAAGGTTCAAGAGATGCGGAAGGCCATGTGGCGAGAGCAGGACCGGCTGATACGCCTATCGGAGGAGATGAATGCCCAGAAAAGGAACAACCTGACGATGAAGATGAAGATCGGTGGTGAGCTCGATCCTTTCGGGGTTCCCGAAGGAGTCGCCGTCCTTTGCGGTGGGAGTCGGAGCTCTGGGGAGCGTGACGATTATCCGGGGACGAAACGGAACCCGCGATTCTCTGCCATCGGAGAGAAAGATGACGGAAAGTGGAAATCTGGCTGCGTTAGGTCGAAGAGAAAAAGCTCTCTCCAAGCTCCGGCACCTTGCGCTCTTCCTGTTAAACACGCAGACGTCCCGTGCTGTCAGAAACTCTGCTGTCCGTCTACCCTAAATCCGAAAACTCTTTTCCCGAAGAAGAGGAAGCAGGATGATGACGATAATATGCAGGAGGTCGCGGTTCTTTGTGGGGAAAATCGACGGGCCACGAATACTTAG
Protein Sequence
MMAGRTCQCGCTDPPKMTGGDPPNEGSCGCSYNPLGEGGRDAEITDLSYALRKLTSMKCQMKKWRMERLQLESEARALKQVLQAHGLNDDIVRPDPLLAHLREENARLENENEELQDKVKGLEDTITEYEYVESPCELVSKLREKMRNMKEAHAGEKRRLRELISGLKIRLQEAEAESSCAALNRLRAKLRELTEGGQEADQRVSKVVQRSIETLVELTDNVDDLKAEIERLRAEIKRLKDLLDACEERRRTATDVAVETTLPEVKPPEKPLVEMDVSDLLNRIKELEALIAQLRKQLVDKDAVINDLQNKLFNVTSDNKRLSTDLDQMMVSYRAVMDEVKAMKDELKKRDVKVSDLLRELQASAIDMLGLNRLQSEIESVKPQLYNLELEREQLLSELGKVRGVVSERNDQIIKILEERDKHARALGKVASTIQETAEREEALKREIDRLKDQIAELEKEIAELKKKVAELAAGNEKIPGLEKKIKELEDELAKLRGDLAAADTKMNDLEKEIADLKAEKDELARELAKAKEQVEKLKEELAAERSAKEAAMKELEVCRAENEKLRGDNERMSNELNAAKGEIERLKNELDKVNGELDKSRAENSELKDLLAAAKAEIDKLRSEVEGCKAENAKLKGEIVRLNEEVQKLKAENSELKKERDTLQAEVGKLKEKIDGMQGEIDKLKNDLAASKSEMEKLKNDLDALKSENEKLKNSLREAEAKIKALEAENSDLANKLADLKIKIENLEKQLADEKAAKEAALKELAALKSDLKALLGEMDKLKAERDKLKGEVDDLTKRMADLTNELNQLKSKCAALAAENEKLKAEVNGLKTENERLKNDLEKVKADLEAAKSENAKLKAENEKLKKDLIDAEAKVKALEDKVKALEDKVKTLEDKVKTLEDKVKACEDEKAKLRQEIEGLKSQIDKLNSELAAEKAAKEAALKELAATKAELAALRTELDKVRAEYARLNGELEKLKSENEKMKGELDRLKAENAKLQGDLDALKAENSKLKGDLDKLNSELSALRAENDKLKAENSKLKDDLAAAKEEAARLKSDLEKLKSENDALRAENDKVKGELEGLKAELNKLRGDLDAMKDENARLRSEVDKLKSDNENLKNELAKANAELENSKKAVDKPKAAVAATTGPLPEKVRRSIPMEVPPTAAKPAVKKEPPQMPSKTPKVARRASVAKRDQGSQGEGCGDYVSANEQLRKNINNQDRAVQRIRNFVKYVLGERESPPEMADSQTHRMSSVMRNKFAEDIMELLKESQYLSESIFNAEADVQRLLKILEELEKLRNENAALRDKLEVAEEPVGFGDAFDAESWLKTLTLTELAELHDRICLVTSSMVKQDINPEDYVDDSTSPDGVCRPCSGLQDADDEMIGGDYEALNKRIAALQMQINEKQNEAAAKVQEMRKAMWREQDRLIRLSEEMNAQKRNNLTMKMKIGGELDPFGVPEGVAVLCGGSRSSGERDDYPGTKRNPRFSAIGEKDDGKWKSGCVRSKRKSSLQAPAPCALPVKHADVPCCQKLCCPSTLNPKTLFPKKRKQDDDDNMQEVAVLCGENRRATNT
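
The coding and protein sequences above can be checked against each other by translating the nucleotides codon by codon. The sketch below assumes the standard genetic code and only translates the first 54 and last 30 bases of the coding sequence, copied from this record. The `translate` helper is a hypothetical name.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 0; lowercase bases are accepted."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 54 and last 30 bases of the coding sequence in this record.
cds_start = "ATGATGGCGGGTCGCACATGTCAATGCGGATGCACTGACCCACCGAAAATGACG"
cds_end = "GGGGAAAATCGACGGGCCACGAATACTTAG"

print(translate(cds_start))
print(translate(cds_end))
```

Uppercasing the input means the lowercase stretches in the coding sequence are translated like any other bases.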

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01047992;
90% Identity
iTF_01047992;
80% Identity
-