Basic Information

Gene Symbol
zfh2
Assembly
GCA_963966105.1
Location
OZ014535.1:11681911-11720452[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 17 0.56 1.9e+02 4.8 0.6 2 23 123 145 122 145 0.94
2 17 0.00022 0.075 15.5 1.4 2 23 715 737 714 737 0.96
3 17 5.4e-05 0.019 17.4 0.7 1 23 769 793 769 793 0.92
4 17 0.017 5.9 9.5 0.4 1 22 835 856 835 859 0.91
5 17 2.1 7.3e+02 2.9 1.9 1 23 1161 1185 1161 1185 0.91
6 17 3.1 1.1e+03 2.4 1.0 1 23 1538 1561 1538 1561 0.92
7 17 0.065 23 7.7 0.2 2 23 1679 1701 1678 1701 0.95
8 17 0.039 13 8.4 1.3 1 21 1791 1811 1791 1815 0.92
9 17 0.0026 0.89 12.1 2.3 1 23 1861 1883 1861 1883 0.96
10 17 0.00066 0.23 14.0 1.5 2 23 1890 1912 1889 1912 0.94
11 17 0.0029 1 11.9 3.1 2 23 2090 2113 2089 2113 0.90
12 17 0.034 12 8.6 0.6 1 22 2134 2155 2134 2158 0.89
13 17 0.0084 2.9 10.5 0.1 1 23 2242 2266 2242 2266 0.93
14 17 0.41 1.4e+02 5.2 2.9 1 19 2362 2380 2362 2385 0.92
15 17 0.0002 0.069 15.6 1.8 1 23 3083 3105 3083 3105 0.97
16 17 0.00049 0.17 14.4 1.4 1 23 3222 3244 3222 3244 0.98
17 17 0.011 3.7 10.1 0.6 2 23 3779 3801 3778 3801 0.96

Sequence Information

Coding Sequence
ATGCCACCACCTTTAATAACACAATCGGAGTCGACACATAGTGAGCCAACACCAAGAAAGCGCCGACGCAAGCGTGATGATCCACAATCGTGTTTCACCAATTCGGAGgaaTACGAATCGGATGACTGCTCCCCAATGTCCTGTTCTGATGTTGAAAGTTTTCAAGGCAAGATTGTATATAATCCAGATGGCAGTGCGTATATAATCGATTCGGAAAATGAATCACTTTCGAATATATCGGAAAATTGTATGAGCGTAGGGGCAACTACAACGAATAACCCAAAAATCCATTCGTTTCGTGTGGTTACCGCTCGCGATGCCAGTGTCAATATTTCCGaaccaaataaaatacaaaagccAATATTGATGTGTTTCATTTGTAAATTGAGCTTTGGTAACACGAAATCGTTTAGCTTGCATGCAAACAGTGAACATACCCTTAATCTTcaagaatcggaaaaattactACTGAATCGAGAATATTCGAGTGCCATTATACAGCGGAATGTTGATGAGAAGCCACAGATATCATTTTTGGAACCATTGGATATACAAAAGCAACATCAATTGATGAAACAATCATCGCTGGCGCAACAGTCACAatcacagtcacagtcacattcacagcaacagcaacagcaacaacaacagcaacagcaacagcttaTTGGATCAACATTATCTTTGGTGAATAGTAGCAATAATAATAGCTGTAGTACACCATCATCGATACCATCGGCGACGTCGACATCATCATTGTCTTTTTCATCATCATCGCCATCGTCAGCGTCGTCTATAACACCGGTGTCAACGGCGGCGGCAGTAGCTGCGGCTGCAAATGCCGCTTTGGTGGCTGCAATAGCAGCTAGttgcagcaacagcaacaatagtATAAATTCACCAGCGTCAATATCAATGAATACATCACATTTGGAAAGTGATCTTATAATGGCGAGCATTGGTCCAACTGGCACAAGCTGCAGTGGTAATAGCAATCCTGGTTATGGCAACACAACTAACATCAATAGTAATAATCCAAATAGCAGCAGTAGTATTTGTAGTCAAATGAATCACCAATTAAGATCAGATAATTTTGATAATTTGAACACATTAGACTTAAGCGCAGCAACTGCGGCGGCACAAGCCGCCACCGTATCTGTTGTAGCTGCAACATCAATTGATGGTCACCGTACACCACCACCATTGTCACCAGCATCGACGACCTCGTCTTCACCATCATCAACATCCGCGTCTTCTTCAACCTCAGCGTCGCTATCATTTGTCCAACAACCACCGGCGACAACAATCATCACATCAGGGCTACAAGTAGTCACAACAACAAGTTCAGCTGCCTCTAAGCATTCGTCACATTCCTTAGCAATTTCCGATACAATTCCAATAACATCGTCAACGAGCTCACCAACAACTAAATCAACTAGcaatagtagtagtagtattagTAATATTAATAACACCAGTAGCAATAAGAACAGTTCATCTGCATTATCACCGCCGACACCAACAACAGTAGCGGATTTCCTACAGCAGCAATTTCAACAAATGCAAAATCAAATCCGAATAACATCACCAGCCTCTGCAGCGGCTGTCGTCAGCACTAGTGGTAGCAATACAGATGGCAATGCTATGTCTTTAGTGACATCGAATACAACATCATTGAATTCGTTAACAGCATCATTAGCAGCGGCTGCTGCAGCTGTTGGTAGTGGTACACCTGGTGATTTATCTGGTAATAGTAGCAGTGTGAAATTAATTAATGATTTCCTGCAACATCAattgcaacagcaacagcaacagcaacagcaacagcatgcCGCATTTGCTACATGTCCCGAACATCCAGATATTAAGGGTATCGATTGTAAAACTTGCGAAATGATCGAGATCAATATGAAATCGCCAATGACACCGACGCGATCACCAAACAGCATCAATCTATTCCCATCAAATTCGACAATGTCACCTACTGCAGCTGCTGCGCCTAGCTTTACAATAGGTGCCTGTCCAGAGCATATTAATGGTCGACCATTGGGTGTGGATTGTTCTAGATGCGAAATGATATTGAACTCAGCACGGCTTAATAGTGGCGTACAGATGTCTACACGAAATTCATGCAAGACACTGAAATGTCCTCAATGTAATTGGCACTACAAGTATCaggaaacattagaaattcataTGAGGGAAAAGCATCCGGACGGAGAGAGTGCATGTGGCTATTGTCTGGCGGGTCAGCAACACCCTCGATTGGCACGAGGTGAGTCCTACACCTGCGGCTACAAGCCATACCGATGTGAAATTTGTAATTATTCAACAACTACAAAGGGTAATTTATCGATTCACATGCAAAGCGACAAACATTTAAATAATATGCAAGAGTTAAATAGCTCGCAAAATATGGCGAATACAGCGGCAGAGATTCGTGAATCGCCTAAAATTATAATGCCAAATATGCAGCAGCAGGCTTCTAAGCCAAAGCCTAGTTTTCGCTGTGACGTGTGTTCCTATGAGACAAGTGTGGCGCGTAATCTTCGCATTCATATGACTAGTGAAAAGCATACTCATAATATGGCTGTACTACAAAATAATATCAAGCACATACAAGCGTTTAGTTTTCTGCAATCACAAAATCTTGGTCAGCTAAGCGCGGCTCAAAGTGCAGCGGTAGCCGCTTCAAACTTGCCCAATATGCCgaacttgcaaaatttcctaccCGAAGCTGCACTAGCTGATATTGCCTATAATCAGGCCTTGATGATTCAGCTATTACATCAGAATTCTGCTGCGGGAGCATTAAGTGCGGCGGCAGCAGCAGCGGCCGCTGCTAATCCTCTAACTCTTGCCCCACCACAACAGTCACCAGTTGGAAGTACTGGAGCAGTATCACAGTCGCAAAATGCCTCAACGTCACAACAGTCGTCATCACAGCAGTCGTATCAATTGTCACAGCCATCAAAACTCAACCATCCATCACAATCGCCACATATTTTAACCACATCGGCTGCAACAGTATCAACACtgtcatcatcgtcaacaacattATCgtcacagcagcaacaacaacagcaacaacaacaacaacaacagcaacaacaacagcaacaacaacagcaatctGCTGTTGCGGCAGCTCTTCTAGCTGaagcggcagcagcagcagcagcagcagcggcagCTGCAGCAGCAACCACCGATACAACAACTTGTCTgcagcaacatcagcaacaacaacaacaacaacagcaacaacaacaacaacaacaacagcagcaacaacaacaagactCCTCGCTCGATCCACCAATCGATCCTGATCCAAAACCAACCACAGCCTTCAGCTGTCTCATATGCGCCAACTACAATACCAACAGCATTGATGAACTAAACAATCACTTAATGATCGATCGATCGCGCAACACCAATAACAACTGCAGCGACATCATGATGATCATCAACAACAATTACATTTGTCGTCTTTGCAATtacaaaacaaatttgaaggCCAATTTTCAATTGCATAGCAAAACGGACAAGCATCTTCAAAAGCTCAACTATATTAATCACATTCGTGAGGGTGGCGtcaaaaatgaatacaaattaaaatacaaCCAAACAAATACAGTTCAGTTAAGATGCAATTGTTGTGACTTTTATACAAATTCCATACAGAAGCTAAATCTCCATACGCAACATATGCGTCATGACACCATGAAGATGATCTTCAATCATTTACTTTATTTAGTGAATAGTTTTAATGCGTCCTTAGGCAGTAGTAGCAGTGATTCCTTAACCAGTGAAAACAATGAATTTCAATTGATGGACAAAAATAAGGTGCTCATGTGTCAATTGTGTAACTTTAGTGCGATGAATATACTGCAAATGGTGCAGCACGTGAAAAGTTTACGTCACATACAGGTGGAGCAGTTTATATGTTTACAACGTCGTAGTGAAAATCTCGAATCTTTGGGCTTGGACGATGTCTTCAAAATTTCCGATAATAATGGCGCTATCGATTTTAATTATACCGTTCGCATGATAGATCCGAATTTGCCTTCTTCGGCACCTATTCGCCCGGAGAAATTCGCTGTTCTGACTGCTGTTCCGACTGCTGTTCGGACTGCTGTTTCGACTGCTGTTCCGACTGCTGTTCCCACTGCTGTTCCTACTGCTGTTCGGATTGCTGTTCCCACTGCAGTTCCGACTGCTGTTCGGATTGCTGTTCCGACTGCTGTTCCGACTGCTGTTCCCACTGCTGTTTGGCGTTCGGTGGTTGATCCATGTTTCGTCAACAGTGCAAAAACTCATCCCAAATTGATACATGAAATTTTGACTTTCGGAGATTGTATTAAATCGGGACGTTCGAGTCCTGAACAATCCTTAGAAACCCATCAATCGGTTTTAAAAACAGATAATGATAAAAACGTGCAAATGACACCGAAAATTTGTTCCACTATGGCTGGTGGAAATCGCGATAGTTACATAGATTCAACGGAACAACATAAGAATCTAAGTGGATTAAGCGCCACCAGCTCTACATGTTCCAATAATAAAGAATTAGCTGATCTCGATATTTCAACATTGCCATCTTTGATATACAAATGCAACAATTGTGATTATTTTGCTCAAATAAAACCTGAAATGGAACATCACATTTCCAGTATGCATCCAAATGTATCAGAACAGGATTATCTAACAATACCGACAAATCCTGCTGCTTTGCATGCATTTCATGCAGCTGTTGCTGCAGCGGCTGTAGCAGCTAATGCTGCAGCATCACAATCTCGAAGTAAATCATCTTCACCCGTAATGGAAGGTAATCGTCAACATCATGATAAAGGTTTGTCGTCTTCGGTGTGCGGTAGTGATTTATTAGCTGAGGTTAAAATCGAACGCATGGATACGACAGATGATGCTCAATCAAATGATGAGTGTGAACAATTTGATGATACAATTGAATTATCGTCGAGCTTAAGAGGCAcaacaaatttaacaaattctCCAGCAGTCAATAATGTTATGTGTCCACTTTGTCAGGATACGTTTAATGAGAAAAAATCTCTTGAAATGCATCTTATGAGTGTGCATAGCGTTAATAGCGATGGGCTAACAAGACTACTTCAACTCGTTGACAATAGTCAATGGTTAAATACCAGTCGTCGaagtagtacaagtactaccccAGAGCCTCGGAATTCTAGTACACCTCATTCCGATGTTGGACAATTGACaccacagcaacaacaacagcaacaacagcaacaacaacagcagcagcagcagcaccaacaacaacatcagcaacaacaatcGAATTCCCTTTCTGGAGTTCATCAGCCAGTAGCCAGCGAAGAATACACATGTTTGCAATGCGGCCAAGGATTTAAGTTGcagcaacattttttaaagcatGCCAACGATGCTCAACACTATCAAATGGTAAATGAACAATATCAATGTTTGGCTAAACATTGTCAACAGCTATTTAGTAGCTTATTACAAATGTTTGGCCATTACAAAGACAGCCACATGAATATTGTAATCTCTGAGCGTCACGTTTACAAATATCGTTGCAAACAATGTTCACTGGCTTTTAAGACTCAGGAGAAGCTAAACACTCATTCTTTATATCATACGATGCGTGATGCTACCAAGTGCATGATTTGCAATCGCAATTTTCGCAGTACACAATCATTGCAAAAACACATGGAACAGGCGCATAATCAGTTACAAACAACCGGAAGTCCTATTCATTCTCCCAATAGTAGTGGAGAATTGACAACAAATACTCTTGGATCTGCATCTGGGCAAGCAGCTGTTAGTAGAGGAGGAGACGAAGAAACAGGGGCAAATATCACAGCCACAACCGCAAGTATTTCTGCCGCAGCAGCAGCCGTTACTACAGGTACGATCAATCAATTCACTCCTGACTTAACTCCTCAAGATAACGataacgatgatgatgatgatgatgatattggTGATAATCAGACTAaagcaattttcaaaaacaatgtcTCTAATAACTCGATAAATGAATCAGTACTTTCAACACCACCCCCACCACccccaccaccaccaccgccgccaccaccaccaccaccaccgccgcTACTACAACCACCATCACCGGCCATGCTTGATATTATATGCAATGACGCTGTTACCAGCAGTACCAATGTCCAAATGGATGATATTCTTAACTCACAACAAATATCCGAAGAACGTTATAACGACATTGAACGTAAGCTCAAATGTCATAAATGTAAAGTTGCTTACACAAACCAGAGCTACTTAGTCAAGCACTATAAATCAAATCAACATCGTCGCAATGAAAAACTGAGTATTTATCCTTTGGAAAAGTATCTCGATCCAAATCGACCATTCAAGTGTGAAGTGTGCCGCGAAAGTTTTACCCAGAAGAATATTCTTTTGGTGCATTACAACAGTGTGTCTCACTTACATaaagcaaaaaagcaaaaatgcgAAAATACAGCAGCAGTTGCAACGAGCATGCCATCCCCATTGATAATGTCTGATTTGGTTGAaggtaactttgacagcagcAGGGCGCCGATAATGGAATCACCTGGTGGGAGTGGTAGCACTTGTTGTGGTGGTGCTGGGGGTGTTGCATCCATCAATAAATCATACTATTCAAAGCGAAAGACAACTCTCGAATCCGATTATGAGAGCTCGAAAAAACGCTTCAAATGTGACATATGCAAAGTGGCTTATGCTCAAGGCAGCACTCTGGATATTCACATGAGAAGTGTTCTACATCAAACACGTGCATGTCGCTTACAAGAGCAACAATGTCAATCAAAATCAATGACGCCACCTTCATCAGAAAGTCCGACATCGACGTCCATACAAATGTTAATTCCAACGGTgactacaacaccaacaccaacaccaacacaaacgcCGACGCCAACGTTAACGCCAACGTCAACACCAACGCTTAATGATCAAATGTACAAATCTCTTCTAGAAACGTATGGTTTTGATATTGTCAAGCAATTTAATGAGATCAATAAATTATGTCCCGTTATTGAAGGTGGAAACTATTACTGTCGTTACTGTAGTAAAGTGTTCTCATCGATTTTTGTCCTAAAGACACACTGTGAAGAAGTGCATAATGAGAAAATTCCATTAGAGTTATTAGAGAAATTTGCTGAAAAATTGAAGAATTTTTATCTCGATCATCAACCgccatcaccatcatcgtcatcaaaTACAAATAATCCGTATATATCTTTAAATGAATTCGAAATTAATTTAGGTGCAACAACATCAGCTGCTGACAGTGAAAGCAATTCTCCATCTCCAGTGCCAGATTCTGTTGTTGGGGGTAGTGAAACTTGTTCATTACTGCCCTCATCATCACAGCCTGTTGTATCTACCAGTCCAAATCCAGTTTCAGCTGTGGCAAGTGTGctattaaaacaacaacaacaacaacaacaacagcaacaacaacaacagcagcaacaacagcagcaacaacaacaacagcagcatcaacaaTCCCTTACCCCAGATCTAGTGCAAAAACTAAACTTAGATCCAACTATGCTCGCTCAAAAGATaatggaacaaaattttgctagTTTTCCACCAAACTTCCCAGGTCTGcctcaaaatcttcaaaatttaCAGAGCTTACAGAGTCtgcaaaatttacaaaatatgcAACAAAATCTACCGAATATGAGCAATCTCCCAATGAATACATTGGACATGCTGAATTTGATGCAGTTCCATCACCTTATGTCacttaattttatgaatttagcTCCCCCTTTAATATTTGGGACATCAGGTACTGGCCCAAATCAATCAACTGCATCTACATCATCTGCTGAGCTACCACCGACGCCATCTACACAATTaatacaacaacaaacagcAGCAGCTGCTTCACAGCAGAATACCAGTAATCAAAAACGTGCCCGCACTCGTATAACGGATGATCAACTGAAAATTCTACGTGCACATTTTGACATCAATAACTCGCCCAGTGAAGAAAGCATAATGGAAATGTCGAAGAAAGCTAATCTCCCAATGAAGgtcgTCAAACATTGGTTCCGCAACACGCTGTTCAAGGAAAGACAAAGGAACAAAGATTCTCCTTACAACTTTAATAATCCACCATCTACAACGTTAAATCTTGAAGAATACGAGCGTACTGGACAAGCTAAGGTGACCCCTTTATCAGAAAGCGGAGGAGGAACTAGTCTAAGCAGTTTTCATCttcatcaacaacagcagcaacagcaacagcaacagcagcaacagcataGAGAACATCAGCAGCgtgaacaacaacagcaacaacaacaaattcaaCAGCAAATTCAATCGCAACATctacaacagcagcaacaacgcTCACCTTCTTCCCAATCGAATGAGTTGAATTTTCCGCAACTGAGCTTCCACCAGCAACCCCAACAACAACATGATCTCTCCcgccaacaacagcaacaacaacagcaacaacaacaacaacaacaacaacaacagcatcatcaGCAACATCAGGACAATCGTCCTCTGTCGCATCCATCGAGTGCGGCCAGCGATCGTGGTGATATTCATATTAAATCAGAACCAACTGATGACATTGGTAGTTCTGATTGCGATCAACAAATGATGTTGTCTAAAGATCAAGACACTGAACAATCAATGATGCAATCACATCATCAACAATCAATGTTTTACAATAATTTCGAAACGAAATCCGAAAGTGGAAGCTCTGAGATACTCTCACGTCCCCAAACTCCAAATAGTACATCAACACCATACAGCAGTAATATTTCTGACATACTAGGTCAACAAATTGACAATCTTCCCATCAATAGTATGACCAACATAAGCAATCTTAATAATATGGGTCcaccgaaaaaatttcaaatgaacaaGATGTTTGAGAAAAGTAGCAATTTCGAGACAAATTCCAATTCATCGAACAGCTCAACATCGAGTGGAAAACGAGCAAACCGTACACGCTTTACTGACTATCAAATCAAAGTGTTGCAGGAGTTCTTCGAAAACAACTCCTATCCCAAAGACAGTGACTTAGAGTACTTAAGCAAATTGTTGCTGCTGTCACCAAGAGTAATTGTCGTTTGGTTTCAGAATGCCCGTCAAAAACAACGTAAAATATATGAGAATCAGCCAAACAATACCTTTTACGAATCTGAAGAGAAAAAGCAAAACATCAATTATGCCTGCAAAAAATGTAACTTAGTGTTTCAGCGTTACTATGAGCTGATTCGCCATCAGAAAAATCACTGTTTTAAAGAGGAAAATAATAAGAAATCAGCTAAAGCACAGATTGCTGCAGCTCAAATAGCTCAAAGTCTTAGTAGTGAAGATTCCAATTCTAGTATCGATATTAACAGTGCTAATTTGTTATCTTCGAATTTGGCTGGTCAACAAGCAGCAGCTGCTGCTGCAGCAGTAGCTGTTGCAGCAGCTGTAGGTGGTACATCAGTACCATCTATACCACCAGTAATTCCTGGCCTATCAACAAGTCCAGGAATGAGCCTGCTTACTTCACCACAACATAtatttaaacaacaacaaacagcgTCATCAGTAGCTGGTGCTCATGTGGACAGCGCTTCGCCATTGCAAAAATTTGAATGTGACAAATGCCAACTGACTTTTACGCGTTATGAATTATTTAAGGAACATCAGCTCATACATCTTATGAATCCGAGTTTATTTATGAATCAGAACTACAATGAGGGTTCACCATTTGGaatattacaaaatttacaAGCCAATAATCAGCAACAGGACGCTTCCATGGATTTGAGCCGACAGAAAAAGCGCAAATATTCTGATACGCAAAATTCACCCGAAGACTTGCAACAACAAAATGAATATGAAGCATTtaataagaaatttaaaaacgatcaatatgattttttgtatcAGTATTTTATGCAGAATGAAACAAATGCAGATCTTAAGAAACAAtttcaacaacagcaacaacaacagcccgATTTGGATTTAGAATATTTGGCAAATTTTTATCAGCAAAATGAGCTGAAGAAGCGCAGtaattatgattttttatttcaatattacCTACGAAATGAGTCAAAGCAATCGAATAGTGCTAGCAGTCTTTTGCTGTTAAACGATGACGCTAATAAACCAAATATGGACTTTCTCCTACAGTATTATCAACTCAGTGAATCGAAGAAGTTTTTTCAGTTAGATGCCTCGCCCCAACGAATACATGATTTTCCACCATTGCTAAATTTAGCCagcgcagcagcagcagcagtcaTAAACAATGGTGTTACAGTAAAACCgccacaacagcaacagcaacagcaacagcaacagcaacaacaacagcatcaccagcaacagcaacagcaacagcagcaacaacagcaacagcaacagcaacaagtTTCGTTAACACCACCAGCGGAAACTACAATGAGTACTATAACCAACGCCATTGCAACAACGCCCAGCAGTTCACCAGTGTtgcagaacaacaacaacaattgtgGTGGTATTGGCAACAAAGATaccaacagcaacatcaacaacaacaacatcagcgccAATCAGCCACAATCGATTTTGCCGGCATTAACAGCAGAAGCCAAACTTTTGTCAGCAGCAGTGGCCGCAGCTGGTGGTGGGAGCGGGGTACAAACACTAACGCTGCCGATAGTGAATGCAGCAACTTCTGGTAGTACTAGTTTTGTAGCGAATTCCATGAGCAATAATCGACATTGCTCCATCCCTTCACAgcatcatcaacaacaacaacaacaacaacaacaacaacaacaacaagaaaattcttctctcATCTCAAATCGTTTGGATGCGTCGCCAatcgttgctgctgctgctgcatcTGATGCTGCCGTTCAATCCCTCAAACTATCCATCAATAGTTTAACAACCAATGTTACGTCGAAAAACTCTCCGCTCAATAATAACAACATAGACATTGCTATCGACtctggaatagcacatcattcacaAGAATCATCCGGTATAGTTTCCACCACTGTTCATCAGCTAGAGGAAACTACAATTACAACAaccgaaaaacaaaacagcaaaAGACTTCGCACCACTATTCTACCCGagcagttgaattttttatacgAATGCTATCAGAATGAGTCGAATCCAAGTCGTAAAATGCTCGAGGAGATCTCAAAGAAAGTCAACCTTAAGAAACGTGTGGTCCAGGTCTGGTATCAGAATTCACGTGCCCGTGAACGTAAAGGTCAGTTCCGGCAGAATATTCAAATAATCAATAAAAAGTGTCCGCATTGTGCTgccatatttaaaattaaatcagCTTTAGAATGGCATCTACAGTCCAAACATGGCGATAAGCAAGCCATCAATGTTGATCAAATACCCGATTTGAAATTTGCCGATGGTCTATTGAATTTGTCGAGTGCTGCTACTCCATTCGAAACGAAAAATGATGAAATCGAACAACAGCcgaaacaacagcaaaaacaacatgAACAAAATTCACCGacaacaacaccaccaccaacagcTATTGTGTCTTGTGGTAGGCCTATTTCATCATCTATGCCAGCATCATCTGCAAGCTCATCGGTATCAGCTGCTGCATCAATTTTAGCTCTAGTTAAAGGCAGAGGATCACCTCTTGATTTAAGTAAGGCACAACCAACATCACTTGTTCATCAAACTAACAGCAACAAATATGAACAAAGTGAAAGTGAAATTAGTTTTTCCGATTCAAATAATGATCATGACGagtcaaatgatttttttacaaCCTCGTCATCATTAAATAACGGTAGTCTAACAAATAGACCCACAAATACGAATGGTAACAATCAGCGTACATTTGATGCAAATTTATTAGAAACAACTAACAATGGTGTTGTTGTCGGTTGTGGTTTCAATGAATATAATATCACAACCAATGGCAATGTATCTAGTGATTATTTTGGCAACGATCGTGACAATACTAATTCGCCGTTTAGTCAAACTTCAAGCAATCACAGCGTACAACAAAAGAAACGCTTTCGCACACAAATGAGCAATTTACAAGTGCGCATATTAAAGACTTTATTTCGAGACGTTAAAACACCATCAATGACAGATTGTTCTAATATTGGCCGTGAGATCGGACTACAAAAGCGTGTCATTCAGGTGTGGTTTCAAAATGCCCGAGCCAAAGAGAAGAAGTTCCGCAATCAACGTTTTCTTCACGACGAGAATACCTTCGAAAATGATACATCGACAAAATCCAATGTTGACTCAACAACAAGCACAATAACCGCAGAGCTACGTGATTGCAGTATTTGTCATTTACAGAGTGTTAACATTCAGGAACATGCATTTTCGGCCCAACATATAGCACAGGTGCGCATGCTTCTCGAATCGAATAGCAGTAAAAATGATGACCACCAACATGTTGATAATAGTACGGAACACGAATTTAATGGCATCTACTCACAATTGTGCGccttacaacaaaataaaaccaacaacaCTGACATAATCGAACAGCGATCGAATACCAGCATTCATGATGAAGATGATACAGATGTTGATAATCATCATTTTTCCAATATTACTGATATTGATGGCAATATTGAACAGAATGACGATGAAGATGATAATACAgctaataaacaaaatgaagatGTTGCTTTAAATGAAGCCAATAAAGCTGCCTTAGCATtaagaaatttcaataaattacaacaacattttgcagcagcaacagcattagcaaaacaacaacaccatcaacaacaacatcagcaacaacatcaaaccCAATGTGACAAACTTACATCAGTATCACCACCAACAATGGAGAATAATTTAATGCTAAAAATTAACACGGACAATCCTTTAACATCTAATCATTCGGAAATGTTGCAACAGCTCTTCAACTATAGTCAAATGAGTGGTGAGTTAATTTTATATTAG
Protein Sequence
MPPPLITQSESTHSEPTPRKRRRKRDDPQSCFTNSEEYESDDCSPMSCSDVESFQGKIVYNPDGSAYIIDSENESLSNISENCMSVGATTTNNPKIHSFRVVTARDASVNISEPNKIQKPILMCFICKLSFGNTKSFSLHANSEHTLNLQESEKLLLNREYSSAIIQRNVDEKPQISFLEPLDIQKQHQLMKQSSLAQQSQSQSQSHSQQQQQQQQQQQQQLIGSTLSLVNSSNNNSCSTPSSIPSATSTSSLSFSSSSPSSASSITPVSTAAAVAAAANAALVAAIAASCSNSNNSINSPASISMNTSHLESDLIMASIGPTGTSCSGNSNPGYGNTTNINSNNPNSSSSICSQMNHQLRSDNFDNLNTLDLSAATAAAQAATVSVVAATSIDGHRTPPPLSPASTTSSSPSSTSASSSTSASLSFVQQPPATTIITSGLQVVTTTSSAASKHSSHSLAISDTIPITSSTSSPTTKSTSNSSSSISNINNTSSNKNSSSALSPPTPTTVADFLQQQFQQMQNQIRITSPASAAAVVSTSGSNTDGNAMSLVTSNTTSLNSLTASLAAAAAAVGSGTPGDLSGNSSSVKLINDFLQHQLQQQQQQQQQQHAAFATCPEHPDIKGIDCKTCEMIEINMKSPMTPTRSPNSINLFPSNSTMSPTAAAAPSFTIGACPEHINGRPLGVDCSRCEMILNSARLNSGVQMSTRNSCKTLKCPQCNWHYKYQETLEIHMREKHPDGESACGYCLAGQQHPRLARGESYTCGYKPYRCEICNYSTTTKGNLSIHMQSDKHLNNMQELNSSQNMANTAAEIRESPKIIMPNMQQQASKPKPSFRCDVCSYETSVARNLRIHMTSEKHTHNMAVLQNNIKHIQAFSFLQSQNLGQLSAAQSAAVAASNLPNMPNLQNFLPEAALADIAYNQALMIQLLHQNSAAGALSAAAAAAAAANPLTLAPPQQSPVGSTGAVSQSQNASTSQQSSSQQSYQLSQPSKLNHPSQSPHILTTSAATVSTLSSSSTTLSSQQQQQQQQQQQQQQQQQQQQQQSAVAAALLAEAAAAAAAAAAAAAATTDTTTCLQQHQQQQQQQQQQQQQQQQQQQQQDSSLDPPIDPDPKPTTAFSCLICANYNTNSIDELNNHLMIDRSRNTNNNCSDIMMIINNNYICRLCNYKTNLKANFQLHSKTDKHLQKLNYINHIREGGVKNEYKLKYNQTNTVQLRCNCCDFYTNSIQKLNLHTQHMRHDTMKMIFNHLLYLVNSFNASLGSSSSDSLTSENNEFQLMDKNKVLMCQLCNFSAMNILQMVQHVKSLRHIQVEQFICLQRRSENLESLGLDDVFKISDNNGAIDFNYTVRMIDPNLPSSAPIRPEKFAVLTAVPTAVRTAVSTAVPTAVPTAVPTAVRIAVPTAVPTAVRIAVPTAVPTAVPTAVWRSVVDPCFVNSAKTHPKLIHEILTFGDCIKSGRSSPEQSLETHQSVLKTDNDKNVQMTPKICSTMAGGNRDSYIDSTEQHKNLSGLSATSSTCSNNKELADLDISTLPSLIYKCNNCDYFAQIKPEMEHHISSMHPNVSEQDYLTIPTNPAALHAFHAAVAAAAVAANAAASQSRSKSSSPVMEGNRQHHDKGLSSSVCGSDLLAEVKIERMDTTDDAQSNDECEQFDDTIELSSSLRGTTNLTNSPAVNNVMCPLCQDTFNEKKSLEMHLMSVHSVNSDGLTRLLQLVDNSQWLNTSRRSSTSTTPEPRNSSTPHSDVGQLTPQQQQQQQQQQQQQQQQHQQQHQQQQSNSLSGVHQPVASEEYTCLQCGQGFKLQQHFLKHANDAQHYQMVNEQYQCLAKHCQQLFSSLLQMFGHYKDSHMNIVISERHVYKYRCKQCSLAFKTQEKLNTHSLYHTMRDATKCMICNRNFRSTQSLQKHMEQAHNQLQTTGSPIHSPNSSGELTTNTLGSASGQAAVSRGGDEETGANITATTASISAAAAAVTTGTINQFTPDLTPQDNDNDDDDDDDIGDNQTKAIFKNNVSNNSINESVLSTPPPPPPPPPPPPPPPPPPPLLQPPSPAMLDIICNDAVTSSTNVQMDDILNSQQISEERYNDIERKLKCHKCKVAYTNQSYLVKHYKSNQHRRNEKLSIYPLEKYLDPNRPFKCEVCRESFTQKNILLVHYNSVSHLHKAKKQKCENTAAVATSMPSPLIMSDLVEGNFDSSRAPIMESPGGSGSTCCGGAGGVASINKSYYSKRKTTLESDYESSKKRFKCDICKVAYAQGSTLDIHMRSVLHQTRACRLQEQQCQSKSMTPPSSESPTSTSIQMLIPTVTTTPTPTPTQTPTPTLTPTSTPTLNDQMYKSLLETYGFDIVKQFNEINKLCPVIEGGNYYCRYCSKVFSSIFVLKTHCEEVHNEKIPLELLEKFAEKLKNFYLDHQPPSPSSSSNTNNPYISLNEFEINLGATTSAADSESNSPSPVPDSVVGGSETCSLLPSSSQPVVSTSPNPVSAVASVLLKQQQQQQQQQQQQQQQQQQQQQQQQHQQSLTPDLVQKLNLDPTMLAQKIMEQNFASFPPNFPGLPQNLQNLQSLQSLQNLQNMQQNLPNMSNLPMNTLDMLNLMQFHHLMSLNFMNLAPPLIFGTSGTGPNQSTASTSSAELPPTPSTQLIQQQTAAAASQQNTSNQKRARTRITDDQLKILRAHFDINNSPSEESIMEMSKKANLPMKVVKHWFRNTLFKERQRNKDSPYNFNNPPSTTLNLEEYERTGQAKVTPLSESGGGTSLSSFHLHQQQQQQQQQQQQQHREHQQREQQQQQQQIQQQIQSQHLQQQQQRSPSSQSNELNFPQLSFHQQPQQQHDLSRQQQQQQQQQQQQQQQQQHHQQHQDNRPLSHPSSAASDRGDIHIKSEPTDDIGSSDCDQQMMLSKDQDTEQSMMQSHHQQSMFYNNFETKSESGSSEILSRPQTPNSTSTPYSSNISDILGQQIDNLPINSMTNISNLNNMGPPKKFQMNKMFEKSSNFETNSNSSNSSTSSGKRANRTRFTDYQIKVLQEFFENNSYPKDSDLEYLSKLLLLSPRVIVVWFQNARQKQRKIYENQPNNTFYESEEKKQNINYACKKCNLVFQRYYELIRHQKNHCFKEENNKKSAKAQIAAAQIAQSLSSEDSNSSIDINSANLLSSNLAGQQAAAAAAAVAVAAAVGGTSVPSIPPVIPGLSTSPGMSLLTSPQHIFKQQQTASSVAGAHVDSASPLQKFECDKCQLTFTRYELFKEHQLIHLMNPSLFMNQNYNEGSPFGILQNLQANNQQQDASMDLSRQKKRKYSDTQNSPEDLQQQNEYEAFNKKFKNDQYDFLYQYFMQNETNADLKKQFQQQQQQQPDLDLEYLANFYQQNELKKRSNYDFLFQYYLRNESKQSNSASSLLLLNDDANKPNMDFLLQYYQLSESKKFFQLDASPQRIHDFPPLLNLASAAAAAVINNGVTVKPPQQQQQQQQQQQQQQHHQQQQQQQQQQQQQQQQQVSLTPPAETTMSTITNAIATTPSSSPVLQNNNNNCGGIGNKDTNSNINNNNISANQPQSILPALTAEAKLLSAAVAAAGGGSGVQTLTLPIVNAATSGSTSFVANSMSNNRHCSIPSQHHQQQQQQQQQQQQQENSSLISNRLDASPIVAAAAASDAAVQSLKLSINSLTTNVTSKNSPLNNNNIDIAIDSGIAHHSQESSGIVSTTVHQLEETTITTTEKQNSKRLRTTILPEQLNFLYECYQNESNPSRKMLEEISKKVNLKKRVVQVWYQNSRARERKGQFRQNIQIINKKCPHCAAIFKIKSALEWHLQSKHGDKQAINVDQIPDLKFADGLLNLSSAATPFETKNDEIEQQPKQQQKQHEQNSPTTTPPPTAIVSCGRPISSSMPASSASSSVSAAASILALVKGRGSPLDLSKAQPTSLVHQTNSNKYEQSESEISFSDSNNDHDESNDFFTTSSSLNNGSLTNRPTNTNGNNQRTFDANLLETTNNGVVVGCGFNEYNITTNGNVSSDYFGNDRDNTNSPFSQTSSNHSVQQKKRFRTQMSNLQVRILKTLFRDVKTPSMTDCSNIGREIGLQKRVIQVWFQNARAKEKKFRNQRFLHDENTFENDTSTKSNVDSTTSTITAELRDCSICHLQSVNIQEHAFSAQHIAQVRMLLESNSSKNDDHQHVDNSTEHEFNGIYSQLCALQQNKTNNTDIIEQRSNTSIHDEDDTDVDNHHFSNITDIDGNIEQNDDEDDNTANKQNEDVALNEANKAALALRNFNKLQQHFAAATALAKQQHHQQQHQQQHQTQCDKLTSVSPPTMENNLMLKINTDNPLTSNHSEMLQQLFNYSQMSGELILY

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2