Basic Information

Gene Symbol
-
Assembly
GCA_029784135.1
Location
CM056649.1:244103872-244184070[-]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 20 0.0093 22 6.8 1.5 31 56 100 125 95 133 0.81
2 20 0.0093 22 6.8 1.5 31 56 225 250 220 258 0.81
3 20 0.0093 22 6.8 1.5 31 56 350 375 345 383 0.81
4 20 0.0093 22 6.8 1.5 31 56 475 500 470 508 0.81
5 20 0.0093 22 6.8 1.5 31 56 600 625 595 633 0.81
6 20 0.0093 22 6.8 1.5 31 56 725 750 720 758 0.81
7 20 0.0093 22 6.8 1.5 31 56 850 875 845 883 0.81
8 20 0.0093 22 6.8 1.5 31 56 975 1000 970 1008 0.81
9 20 0.0093 22 6.8 1.5 31 56 1100 1125 1095 1133 0.81
10 20 0.0093 22 6.8 1.5 31 56 1225 1250 1220 1258 0.81
11 20 0.0093 22 6.8 1.5 31 56 1350 1375 1345 1383 0.81
12 20 0.0093 22 6.8 1.5 31 56 1475 1500 1470 1508 0.81
13 20 0.0093 22 6.8 1.5 31 56 1600 1625 1595 1633 0.81
14 20 0.014 31 6.3 1.3 31 55 1725 1749 1720 1754 0.84
15 20 0.0093 22 6.8 1.5 31 56 1810 1835 1805 1843 0.81
16 20 0.0093 22 6.8 1.5 31 56 1935 1960 1930 1968 0.81
17 20 0.0093 22 6.8 1.5 31 56 2060 2085 2055 2093 0.81
18 20 0.0093 22 6.8 1.5 31 56 2185 2210 2180 2218 0.81
19 20 0.0093 22 6.8 1.5 31 56 2310 2335 2305 2343 0.81
20 20 0.03 69 5.2 2.6 31 55 2435 2459 2430 2461 0.84

Sequence Information

Coding Sequence
ATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGGTCTTGGGCCAACCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGCCAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCTAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGAAGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGGCTGCGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGCTCTTGGGCCAACCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGCCAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGATGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGATTACGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGGTCTTGGGCCAAGCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGccAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGAAGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGGCTGCGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGGTCTTGGGCCAACCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGccAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGAAGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGGCTGCGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGGTCTTGGGCCAAGCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGccAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGAAGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGGCTGCGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGGTCTTGGGCCAACCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGccAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGAAGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGGCTGCGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGGTCTTGGGCCAACCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGccAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGAAGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGGCTGCGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTTCAAAACGCTCTTGGGCCAACCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGCCAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGATGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGATTACGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGGTCTTGGGCCAAGCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGccAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGAAGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGGCTGCGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGGTCTTGGGCCAACCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGccAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGAAGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGGCTGCGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGCTCTTGGGCCAACCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGCCAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGATGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGATTACGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGGTCTTGGGCCAACCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGccAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGAAGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGGCTGCGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGGTCTTGGGCCAACCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGCCAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGATGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGATTACGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGCTCTTGGGCCAACCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGCCAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGATGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGATTACGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGccAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGAAGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGATTACGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGCTCTTGGGCCAACCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGccAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGAAGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGATTACGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGCTCTTGGGCCAACCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGCCAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGATGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGATTACGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGCTCTTGGGCCAACCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGCCAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGATGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGATTACGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGCTCTTGGGCCAACCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGccAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGAAGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGGCTGCGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGATGCAGCGAGCGGCCTTCACATCGAAGATTCCCACCGGGTCGTTGGTTCCAAAACGCTCTTGGGCCAACCGAGAACCCGGAGTTCATGGGGAATCAAGTGGAAAAGGATCTTTAATCAAGCCAATTTCAAAGCCAAACCAAATACAGCGGAAGCAGCCAATGATTGAAGATGAAGTTCCTCCGCCTCTAGTCCCCAGTGAACCGATTCTCGTTGATGATGACGAGGATTTGTTCGACTGGGAGCCATTGGAGGACTGGACGGAGATGGAGATTACGGTGTTCCAGAGGATGAAGGCTGAGAGAGACCTTCGGAAGGAGCAAGCAGAGAATATTAGACTGCGGCAGGAAAACCAACGACTGAAAGCAACAAAAGGGGCGAGGTAG
Protein Sequence
MQRAAFTSKIPTGSLVPKRSWANREPGVHGESSGKGSLIKPISKPNQIQRKQLMIEDEVPPPLVPSEPILVEDDEDLFDWEPLEDWTEMEAAVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWANREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVDDDEDLFDWEPLEDWTEMEITVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWAKREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVEDDEDLFDWEPLEDWTEMEAAVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWANREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVEDDEDLFDWEPLEDWTEMEAAVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWAKREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVEDDEDLFDWEPLEDWTEMEAAVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWANREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVEDDEDLFDWEPLEDWTEMEAAVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWANREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVEDDEDLFDWEPLEDWTEMEAAVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVSKRSWANREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVDDDEDLFDWEPLEDWTEMEITVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWAKREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVEDDEDLFDWEPLEDWTEMEAAVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWANREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVEDDEDLFDWEPLEDWTEMEAAVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWANREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVDDDEDLFDWEPLEDWTEMEITVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWANREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVEDDEDLFDWEPLEDWTEMEAAVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWANREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVDDDEDLFDWEPLEDWTEMEITVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWANREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVDDDEDLFDWEPLEDWTEMEITVFQRMKAERDLRKEQAENIRLRQENQRLKATKGPISKPNQIQRKQPMIEDEVPPPLVPSEPILVEDDEDLFDWEPLEDWTEMEITVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWANREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVEDDEDLFDWEPLEDWTEMEITVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWANREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVDDDEDLFDWEPLEDWTEMEITVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWANREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVDDDEDLFDWEPLEDWTEMEITVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWANREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVEDDEDLFDWEPLEDWTEMEAAVFQRMKAERDLRKEQAENIRLRQENQRLKATKGMQRAAFTSKIPTGSLVPKRSWANREPGVHGESSGKGSLIKPISKPNQIQRKQPMIEDEVPPPLVPSEPILVDDDEDLFDWEPLEDWTEMEITVFQRMKAERDLRKEQAENIRLRQENQRLKATKGAR

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-