Basic Information

Gene Symbol
zfh2
Assembly
GCA_949320155.1
Location
OX439486.1:1363848-1407504[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 17 0.35 92 5.9 0.7 2 23 123 145 122 145 0.95
2 17 0.0003 0.077 15.6 1.4 2 23 693 715 692 715 0.96
3 17 7.3e-05 0.019 17.5 0.7 1 23 747 771 747 771 0.92
4 17 0.023 6 9.6 0.4 1 22 813 834 813 837 0.91
5 17 2.9 7.5e+02 3.0 1.9 1 23 1112 1136 1112 1136 0.91
6 17 0.051 13 8.5 0.9 1 23 1393 1416 1393 1416 0.93
7 17 0.17 43 6.9 0.1 2 23 1541 1563 1540 1563 0.93
8 17 0.044 11 8.7 0.2 1 21 1640 1660 1640 1664 0.90
9 17 0.0035 0.91 12.2 2.3 1 23 1710 1732 1710 1732 0.96
10 17 0.00089 0.23 14.0 1.5 2 23 1739 1761 1738 1761 0.94
11 17 0.0054 1.4 11.6 3.7 2 23 1906 1929 1905 1929 0.90
12 17 0.11 29 7.4 0.6 1 21 1950 1970 1950 1974 0.85
13 17 0.011 2.9 10.6 0.1 1 23 2056 2080 2056 2080 0.93
14 17 1.2 3.2e+02 4.2 4.1 1 19 2152 2170 2152 2175 0.92
15 17 0.00027 0.07 15.7 1.8 1 23 2853 2875 2853 2875 0.97
16 17 0.0019 0.48 13.0 0.8 1 23 2993 3015 2993 3015 0.98
17 17 0.015 3.8 10.2 0.6 2 23 3521 3543 3520 3543 0.96

Sequence Information

Coding Sequence
atgccacCACCCTCAACACAATCGGAATCCCAACATAATGAACCAACACCTAGAAAACGTCGACGTAAACGTGATGATCCCCAATCGTGTTTCACCAATTCGGAGGAATACGAATCGGATGACTGCTCCCCAATGTCCTGTTCTGATGTTGAAAGTTTTCAAGGCAAGATCGTTTATAATCCAGATGGCAGTGCTTATATAATCGATTCTGAAAATGAATCACTTTCGAATATACCGGAGAATTGCATGAGTGCTgggacaacaacaacaacaaacaacccAAAAATTCACTCATTTCGTGTGGTAACGGCTCGTGATGCTTGTGTTAATATTTCCGAGctaaataaaatccaaaaaccCATATTAATGTGTTTCATTTGTAAGTTGAGTTTTGGTAATACGAAATCGTTTAGCTTGCATGCAAACACTGAACACACATTAAATCTCCAAGAATCGGAAAAGCTATTATTGAATCGAGAATATTCAAGTGCCATTATACAGCGTAATGTTGATGAAAAACCGCAGATATCGTTTTTGGAACCGTTGGATATACAGAAGCAGAACCAATTGAATAAATTAGCATCACcacaaacacaacaacaacatcaacaacaacacccACAACAACAGCAACTACTTATAAGTTCAACATCATCATTATCGGTGAACAgtaacagcaacagcaacaataacaataacgtAAGTTGTAGTAAATTACCAACGACACTATCACCATCATCGTCGTCGGCAATAGGATCGGTGTCTTCGGTAgtggcagcagcagcagcagccaaTGCGGCGTTGGTGGCTGCAATAGCAGCAAGTTGCAGTAACACCAGTTTGAATACGTCAACATCATTGTCGATGAACGCACCCCATATTGATAGTGATTTAATTATGGCGAGTCTTGGCGGTCCTAGTGGTGGTGGTGGTAATAGCAATCCTGGCTATGGTAACACAACTAGCACTATTAGCAACAATCCAAGTAGTAGCAGCAGCAGCAATTGTAGTCAAATGATTCATCAGCAGCAACAAAGATCGGATAATTTTGATAATCTGAATACTTTAGATTTAAGTGCAGTGACAGCTGCAGCACAAGCGGCTGCTTCAACATCAATAGTCAATAGCCGGCACACACCACCGCCGTCATCACCTACATCGACTACCTCGTCGTCCCCATcatcttcttcatcttcatcaGCTTCAACCTTGTCATTATCTGCCCAGCAACCGTCAACAACAATCAGTGCATCCGGACTACCAGCACCCACAAGCACCATTGCCACTGAACCTAAATCCTCGCATTCCCTAGCGATGGACACTATCTCATCAACATTTGCAACGAGCTCCCCAGCAACAAAACTATCTAGCAGTAATAGTAGTAGTAGTGTAAGCAATACCTTAAGCAATAAGAACAATTCGTCAGCAGTGTTACCTCCGACACCTACAACGGTGGCGGATTTCCTTCATCAGCAATTCCAGCATATGCAGAATCAAATCCGAATAACATCGCCCGCCTCTGCGATAGTGGTAAGTGCTAGCGGCAGCAATACAGAAATGAATGCTCTGCCATCTTCGGTGACAACGAGTGCACCATCGCTAAGTTCGTTAACAGCCTCTTTGGCAGCAGCGGCAGCAGCTGTTGGTAGTGGTTCTGTAACTGATCTTAGCAGCAATAGCAGTGGTGTAAAGCTGATCAATGATTTTCTTCAGCATCAGTtccagcagcagcaacagcaacaacaccaacaacagcaGCCACATTCATCATTTTCAACGTGTCCCGAACATCCCGACGTAAAAGGTATCGATTGCAAAACATGCGAAATGATCGAAATCAATATTAAATCACCAATGACGCCAACACGATCGCCAAATAGTATTAATCTGTTTCCATCGAATCTGACTTTGTCACCGACGGCAGCTGCAGCGCCTAGTTTTACAATTGGTGCATGTCCGGAGCATATAAATGGCAGGCCGCTGGGTGTTGACTGTTCGAGATGCGAAATGATATTGAATTCAGCTCGGCTTAATAGCGGCGTACAGATGTCCACACGTAACTCATGCAAAACTCTGAAATGTCCCCAATGTAATTGGCACTACAAGTATCAGGAAACATTAGAAATTCATATGAGGGAGAAGCATCCGGATGGGGAGAGTGCATGTGGCTATTGTCTGGCCGgtcAACAACACCCGCGATTGGCACGTGGAGAATCATACACCTGCGGCTATAAACCGTACCGCTGTGAGATCTGCAActattcaacaacaacaaaaggcAATCTATCGATTCATATGCAAAGCgataaacatttaaataataTGCAAGAGTTGAATAGCTCGCAGAACATGGCAAATACCCCGGCGGAAATTCGTGAATCGCCAAAAATAATAATGCCAAATATGCAGCAGCAGGCCTCGAAGCCGAAGCCGAGCTTTCGCTGTGACGTGTGTTCCTATGAAACAAGTGTGGCGCGAAATTTGCGTATACATATGACAAGTGAGAAGCACACCCATAATATGGCAGTGCTGCAGAATAACATCAAACACATTCAAGCATTTAGTTTCCTGCAATCACAAAATATCGGTCAACTGAGTGCAGCACAGAGTGCGGCGGTTGCTGCATCAAACTTGCCTAATATTCCGAATTTGCAAAATTTTCTACCTGAAGCGGCTCTAGCTGACATCGCATACAATCAAGCTTTGATGATTCAGTTGCTGCATCAGAATTCAGCAGCGGGGGCATTGAGTGCAGCGGCAGCAGCTGCAGCGGCTGCAAGTCCACTAACCCTGCCCCCACCTCAGCAGTCTCCTGCTGGATCTGGGGGAACAGTTTCGCAGCCACAAACGACTCCAACGTCACAACAACCACCGCAACAATCCTTTCAGTCGTTGCAGACATCGAAGTTAAACCATCAGTCTTCACCGCAGCAAAGTATGCCAACAGTTGCAACAACAGGAGCTCCCTCActgtcatcatcatcaacaacagCATCGTCACATCCATCCTCGTCGCAACAGCATTCTGCAGTGGCGGCAGCAGCTCTACTAGCTGAAGCAGCCGCTGCAGCTGCAGCAGCAGCCACCTCTGATCCAGCATCAATATGCCTtcagcagcaacagcaacaacaacagcaacaaattCAACAACAAGACTCATCCCTCGATCCACCAATCGATCCCGATCCAAAACCCACAACAGCCTTCAGCTGCCTCATATGCGCCAACTACAACACCAATAGCATCGACGAACTCAACAATCACCTAATGATCGACCGATCACGCAATACCAACAACAATTGCAGCGATATCATGATGATTATCAACAACAATTACATCTGTCGACTTTGCAACTACAAGACTAACCTCAAAGCCAACTTTCAATTGCATAGCAAAACGGACAAACATCTGCAGAAGCTCAACTACATCAATCACATCCGCGAAGGTGGTGTCAAAAACGAGTACAAACTCAAGTACAACCAAACCAATACCGTTCAACTAAAATGCAATTGTTGCGACTTTTATACGAACTCTATACAAAAACTGAATATGCATACTCAACATATGCGACACGATACCATGAAGATGATCTTCAATCATCTCTTGtatttagtgaatagttttaaTGTTTCTATGGGAGGTGATTCGATGGCCGTTGTAAGTGAAAACAGCGAATATCAACTTATGAGCAAAAACAAAGTTCTAATGTGTAAACTGTGTAACTATAGTGCAGTGAATATTTTACAAATTGTGCAACATGTCAAAAGTTTGCGTCATATACAAGTGGAACAGTTTATATGCTTGCAGCGGCGCAGTGAAAATCTCGAATCGCTTGGACTGGATGATGTCTTTAAAATTGCCGATAATACTGATTGTATCAAATCAGAACGTTCAAGTCCTGTACCGTCATTAGAATCTCACCAATTGTTGGTTAAAGAAGAAAACGAAAAGAGTTCCCCAATggcaacgacaacaacaactacaacaactacAGCACTAGCAGTAACACCACCAGCTACAACCACAACAGCGAAGAGCAGTCATGATATTTACATTGATTCTTCTTCCGATCACCAACAGAATGTCTCTTCATCTTCTGCTATTGCCGTCAGCAATATCGCCAACTGCAACAAAGAGCTTGCTGATGTCGATCTCTCTTCATTGCCATCAATCATATATAAGTGCAACAATTGTGATTATTTTGCTCAAAACAAGTCCGAAATGCAAAATCACATTGTCAATACTCATCAGAATGTTTCGGAAGAAGACTTTTTAACAATTCCTACAAATCCAGCGGCTTTGCATGCCTTTCATGCAGCTGTAGCAGCTGCAGCTGTTGCTGTAGCCGAAAATGCGGCGGCATCACAATCTAGAAGTAAATCATCATCACCAGTGCATATGCACATGCAGGATGATCggcaacagcaacaacatcatGATCAAGGATTTGCCACATCTGTGCGTGGGAATGAGTTGATGCCTCAAGTCAAAACCGAGCAGATGGACATCATGGATGATGCTCAGTCTAATGATGAATGTGAACAATTGGAAGATCCAATTGAACTACTGAGCTCGAGGAACTCTGCCAATGTGACAAGCTCACCGGCTGTGAAGAGTGTAATGTGTCCACTGTGTCAAGATACTTTTAGCGAAAAGAAAGCACTTGAGATGCATCTTATGGGAGTGCATAGTGTAAACAGTGATGGCTTGGCAAGACTCCTCCAGCTAGTCGATAGCAGTCATTGGTTGAATAGCAGCAGGCGAAGCAGTACAAGCACAACGCCAGAGCCCAGAAATTCGAGCACACCACTTTCAGAAGCCGGAGGGCAGTTAGCATTTCAACAaccacaacagcaacaacaaaagcaacaacagcCTGGGTCTTCTCCTGTTAACCAACATGTATTCAGTGACGAATACGCATGCGCACAGTGCGGGATTACATTCAAGCTGCAGCAACATTTGCTAATGCACGCCAACGATGCGCAGCATTATCAAATGGTCAACGAGCAATATCAGTGCTTGGCAAAACATTGCCAACAACTGTTTGGCAACTTGCTGCAAATGTTGACCCACTACAAAGATAGTCACATGAATATAGTGATTTCTGAGCGTCATGTTTACAAATATCGCTGCAAACAATGTTCTTTAGCATTCAAGACCCAGGAAAAACTCAATACTCATTCCTTGTATCACACAATGCGTGATGCTACCAAGTGCATGATATGCAACCGGAACTTTCGGAGTACACAATCCTTGCAGAAACATATGGAACAGGCACACAGTCAATTGCAACCAACCAGCCCAATTCCATCACCAAGTAGTAGTGGGGAAATCGCAAATGCCCCTGGATCTCTGTCCGTAACCCCTTCATCCAGGGCAGAAGATGAAGAAGTTACTGCCAACGCTGCCGTTACCACATCCATGACTACAGGAGCAACAACAGGTACGAATAATAAATTCAGCAACAATCAATACAATGTTGATGGTGTTGGTGATGGTCAGGACATTGTTTCtctaaaaagtagcatttatgATTCGATATCAGTATCAGTATCAGCAcctccaccaccaccaccaccaccactacCACCGCCACTGCCACCACCACCCACCGTTCTTGATATCTTGCCCAGTGACAAAAGTCAACTTGAGGACTACCTCAACTCCCAACAAATGTCTGAAGTATATTACACAGATGTCGAACGCAAACTCAAATGCCACAAATGTAAAGTTGCATACACAAATCAAAGCTATCTTTCGAAGCACTACAAGTCCAATCAGCATCGTCGCAATGAAAAGTTGAGTATTTACCCGCTGGAAAAGTTCCTCGATCCAAATCGTCCTTTTAAGTGTGAAGTTTGTCGCGAGAGTTTCACCCAAAAGAATATTCTTTTGGTGCACTTTAATAGCGTGTCGCATTTGCACAAGGCCAAGAAGCAACAAGCTGGAGGTAGCGCATCGCCAGCAAAGTCTATTCCGATGTTCTCTGACGCTATAGAAGGTCTTAGTGAAAGCAAAGGACAAACTGTGGAAATAGCTGGTGGCAGTGGCAGTGGTAGCAATGCTTGTGGTGGCGATGGAGGTATCACAGCTGTAAATAAGTCGCCATTTTCTAAGCGAAAAGTCAGCCTTGAATCTGACTATGAAAGTCCGAAGAAACGTTTCAAATGCGAAATCTGCAAAGTGGCATATGCGCAGGGCAGTACACTCGATATTCACATGAGAAGTGTCTTGCACCAGACACGCTCCTGCCGCTTGCAGGAGCAACAGGCTCAGCCATTAAAACCAATGACACCGCCTTCGTCATCGGAGAGCCCAACATCTACGACGGCAACTACCCCAACCTTAAACGATCATATGTACAAGTCCCTGCTAGAAACGTTCGGCTTTGATATTGTTAAGCAATTCAATGAGATTAACAAACTTTGTACAGTTTCAAACTCTAGAAATTATTATTGCCGTCACTGCAACAAAGAGTTCTCCTCTATTTTTGTGCTGAAAACACATTGTGAGGAGATACACAGTGAGCAGATCCCTCTTGAATTATTAGAAAAATTCGCTGAAAAGTTTAAGCATATTTATCTCGATCAAGAACCTACATCACCAACATCGTTGTTAACAATAACAAATTCAGACCAAAATCCAAaccaatgtttttcttttaatgaatCCGAAAATAATTTAGGTGCAACAACATCAACGGCATGCAGTGATAGCAATTCTCCATCACCAGTGGGAGGCGAGCCAGCAGCCGGATCAAATGAGACTTCTCTTATAGTTCCTTCTACATCAACAATTGCTGCTCCCACCAGCCCCAATACATCAGCCGTTACGGCTGTAGCTGCTGCATTGCTCAAACAGCAACAGCACCAGCAGCAACAATCTCTTACACCAGATCTTGTGCAAAAGCTCAACCTAGATCCAACAATGTTGGCTCAAAAGATAATGGAACAGAACTTTTCCAACTTTCCACCGAACTTCCCAGGACTGCCACAGAATTTACAAAGTTTACAAAGCCTTCAAAGCCTACAAAATCTGCAAAACATGCAACAAAATTTACCAAACATGGGTAACATGCCCATGAATACTCTGGATATGCTTAACCTAATGCAGTTTCATCATTTAATGTCTTTGAATTTTATGAACTTGGCACCGCCCCTGATATTTGGGGCTGGAGGAACGGGCCCAAGCCCTTCGGTAACAGGCACTACATCGGCAACACCATCTGATCTACCACCAACAGCTGCCACCCAAATTATTCAGCAGCAAACGTCATCTTCATCACAGAAGGGGTTTACATTCTCTTCACAGAATACCAGCAATCAAAAAAGAGCACGAACGCGCATTACTGATGATCAGCTGAAGATCCTTCGAGCACATTTTGATATCAATAATTCTCCCAGTGAAGAGAGCATCatggaaatgtcaaaaaaagcCAATCTCCCAATGAAGGTCGTGAAACATTGGTTCCGCAACACCCTGTTCAAGGAGAGGCAACGTAACAAAGATTCACCTTACAATTTCAACAATCCCCCGTCGACAACACTGAATCTTGAGGAGTACGAACGAACTGGCCAGACAAAGGTGACTCCTTTATCGGAAAGCGGAGGTGGCAGCATCAGTGGCTTTCACCtccagcaacagcaacaacaaagaGAACAACAGCAACGTGAACAACAGCAGCGTGAACACCAACAGCAACAAAtccagcagcagcaacaacttCAACAGCAAATTCAATCACAACATCtccagcagcaacaacaacagcgcCCGCCATCATCACAATCTAGTGATCTGAATTTTCCCCAACTGAGCttccaccaacaacaacacgaTCTCTCCCGCCAACAATTGCACCAGCAACAGGACAATCGTCCATTGTCCCATCCGTCAAGCGTTACCAGCGATCGTGGCGATATCCATATTAAGCCTGAACCGGCCGATGACATTGGTAGTTCCGACTGTGATCAACAAATGGCTATGAGCAAAGATCACGATAACGAACAATCAATAATGCAATCACACCATCAACAGTCAATGTTCTACAACAACTTCGAGACTAAATCCGAAAGTGGAAGCTCTGAGATCCTATCACGTCCCCAAACCCCAAATAGTACATCTACACCGTACAGCAGTAATATATCAGATATCCTGGGTCAGCAAATGGACAGTTTGCCGCTAAACAACATGGCCAATATAAGTAACTTGAACAATATGGGACcgccaaaaaaatttcaaatgaaCAAGATGTTCGAGAAGAGTGGGAACTTTGAAACCAATTCCAATTCGTCTAATAGTTCGACGTCGAGTGGAAAGCGGGCCAATCGTACTCGTTTTACGGATTACCAAATAAAAGTGTTGCAGGagtttttcgaaaataattcttATCCCAAAGACAGTGACTTGGAATACTTGAGCAAGTTGTTGCTGTTGTCGCCGAGAGTAATTGTCGTTTGGTTTCAGAACGCACGTCAAAAACAGCGAAAAATCTATGAGAATCAACCGAATAATACGTTCTACGAGTCCGAGGAAAAGAAACCCAACATCAACTACGCTTGCAAGAAATGTAACCTGGTGTTCCAGCGTTACTATGAACTTATCCGACATCAAAAGAATCATTGCTTCAAGGAGGAAAATAACAAGAAATCTGCGAAAGCACAAATAGCTGCTGCTCAAATTGCTCAAAGCCTCAGCAGTGAAGATTCGAATTCTAGCATCGATATCAACAGTACCAATATGTTGTCATCAAATTTGGTTGGCCCGCAAGCTGCTGCAGCGGCTGCCGCAGTTGCtgtagcagcagcagcagttgGTGGCAATGTTGGTACGGCAATTCCACCGGTGATGCCAGGCCTGGCCTCCAGTCCAGGCATTAACTTGCTTGCATCACCCCAGCACATTTTCAAGCAACAACAGGCAGTGACTTCTGTCGGAGGAAGTCATGCGGATAGCACTTCACcgcttcaaaaatttgaatgtgACAAATGTCAACTGGTCTTCAATCGCTATGAACTCTACAAAGAACATCAGCTTATCCATCTCATGAACCCCAATTTGTTTATGAACCAAAACTACAATGAATCTTCACCCTTTGGAATCCTACAAAATCTGCAGGGTAACCACAATAGTCAACAAGACACATCAATTGACTTGAGTCGACAGAAAAAACGCAAGTATTCCGATACGCAAAACTCACCCGATGAACTGCACCATCAGCAAACTGACTACGAAGCTttcaataagaaattcaaaaacgaTCAATATGAATTTCTGtatcaatattttttacaaaacgaCTCCAATTCTGATTTGAAGAAACAAtttcagcagcagcaacaacagccCGAAATGGATCTAGATTATCTGGCTAATTTTTATCAgcaaaatgaatacaaaaagctCAGCAATTACGATTTTTTATTGCAATATTACATGCGAAACGAGTCAAAACAGCCTAGTAACGCGTCCAACCTCATGTTGCTGAACGATGATGCTAATAAACCAAATATGGACTTTCTCCTTCAGTATTATCAACTCAGTGAATCGAAGAAGTTTTTTCAGTTAGAAGCCTCGCCCCAACGAATACATGATTTCCCACCGTTGCTGAATCTGACCAGCAacgcagcagcagcagccgttGCAGCAATAAACAATGGTGTATCAGCCCAGCAACATCAAGAGCAGCAACAAGTTTCAGTAATATCACCTACAGAGACACCAATGAATACCACAACCAATACCAATGTAGCGATATCATCGCCAAACAGCTCACCAGTACGGCAACAGAGCAACAACAGTGGTTGTGCTATTGGCAACAAAGATACgaccagcaacaacaaaaattgtcaaacacATCCAACAGTATTGACAGAAAATGTCAAACTTCTGTCAGCGACGGTGTCGGCATTTGGTGGTGGCAGCGGGAACGGGAGCGGGGTACCAACAGCGACATTACCAATAACAACACCGAATACAACAAATACTGGTAGTACGTCTTTCAGTACGAATTCGATAAACAACAACCGCCATTGTCCCATACTTTCACCATTGCAAACGAAACAGCAAGAAAACTCATCATTGATATCGAATCGTTTGGATGCAGctcctgctgctgctgctgccgcCACTGCATCCGAGTTCGCTGTGCAATCAATAAAACTTTCTGTTAACAACATAAGCACAAATGCGGTGTCAAAAACTTCGCCCATTGCAAATAGCATCGACATGATGGATGCCGAGCTAGGAATTACTCATCACCACCAACAGCACGACTCATCAGCAACATCCGTTTCAAGTGCTAGCCACCAACTTGAAGAAACTGTCACAACAACTGAGAAGCAAAATAGCAAAAGGCTTCGTACTACTATATTGCCAGAGCAGCTGAATTTTTTATATGAATGCTACCAAAATGAGTCAAATCCAAGTCGCAAAATGCTCGAAGAAATTTCAAAGAAGGTCAATCTCAAGAAACGTGTAGTTCAGGTCTGGTATCAAAACTCAAGAGCTCGCGAACGCAAAGGACAATTTCGTCAGAATATCcagataataaataaaaaatgtccgCATTGTGCGgccatatttaaaataaagtctgCTCTGGAGTGGCATCTGCAATCCAAGCATGGAGATAAGCAAGCTATAAATGTTGATCAAATTCCTGACTTGAAATTCTCCGATGGTTTACTAAATTTTTCAAGTTCGACACCGTACGGTATAAAAGTCGATGAGCAACACAAGGAACAGCAGAAGCTAAAAGAACAACCAAgttcaccaacaacaacaccaacaactaCAACACCACCTCCAGCAACATTGTCCCCCGTCAAATCAGCATTATGTGTCGCATCTGTTGTCACTACaaatgctgctgctgctgcttcttcGCCAGCAGTTTCGTCCGCAGTATCCTCTGTAGCCTCTTCAGCATCGATCCTTACCCTTGCCAAAGGTGGAGGAACTACTCCACTTGATCTTAGCAAGACACCTACGCCACTAGTCAATAACTTTAGCAAGTATGAACAGAGTGAAAGCGATATAAGCTTCTCTGATTCCAACAATGACCATGACGAgtctaatgatttttttaccCCTTCGTCGTCATTGAATAATGGAAATCTTGGAAATCTAGTGCCGAATAATAGGGCCAACAACTGCATAAACAACCGACGTCAGTATGATACTAATTTGATAGACGGAAATAATGTAGCAGGAGGATGTAGCTACAACGATAATAACACCATGAGCAACATAAGCGATTATCTAGGCAACGAACGTGAAAATACTAATTCACCAGTTAGCCAAACTTCAAGCAACAATAGTGCTCAACAGAAAAAGCGTTTTCGGACACAAATGAGTAACTTGCAGGTGCGCATACTTAAAACATTGTTCCATGACGTCAAGACGCCTTCCATGACGGATTGTTCAAATGTTGGACGAGAAATTGGACTGGGGAAACGTGTTATTCAGGTTTGGTTCCAAAATGCCAgagccaaagaaaaaaaatctcgcaaCCAACGATATATACACGATGAAAATACTTTTGAAAATGACAAACCAAATGTGGATTCAACTACATCAAATAACATACTCGAGATACGTGAATGCAACATTTGTCAATTGCCAAGCGTAAACATTCAAGAGCATGCTTTCTCAGCTCAACATATAGCACAGGTACGAGTGCTGCTCGAAGCAAACAGTAGCAATAAAAGTGATGACAACCAGCAATATGTTGAAAATAGTATGGAACATGAATTCAATGGTATTTATTCGAAGCTGTATACACAGCAGCAGCATCATAATAACCATAGTCATcatcataaaaatgaaaatgctgaTTCGGGACGAGCCGATACTAGTCACCATGATAAAGACGACAGTGATGTTGATTATCATTGTCCAAATATTATGACTAATGTTGATTGTGTCAACAACGACAAAAACTACGAaaacgacgatgatgatgacgatgaacatgatgatgatgatcatGATCACGATGCCGATGATCACGATGACGATGCCAAAACGAAACTTGGCATACAAAACGAGGATATGGCCTTGAGTGAGGCTAATAAGGCAGCATTGGCATTGAGAAATTTCAACAAGTTGCAGCAACATTTCGCTGCAGCTACTGCTTTTGCTAAGCAGCAACACCAATATCCCCACCAGCAACagctacaacaacaacagcagtgTGATACAATATCTTCAGCTTCACCACCAACAATGGAAAATAATTTGATGTTGAAAATTAACACGGATAATCCGTTGGCAACAAATCATTCAGAAATGTTGCAACAACTGTTTAGCTACAGTCAGATGAGTGGTGAGTCTTATATGTAG
Protein Sequence
MPPPSTQSESQHNEPTPRKRRRKRDDPQSCFTNSEEYESDDCSPMSCSDVESFQGKIVYNPDGSAYIIDSENESLSNIPENCMSAGTTTTTNNPKIHSFRVVTARDACVNISELNKIQKPILMCFICKLSFGNTKSFSLHANTEHTLNLQESEKLLLNREYSSAIIQRNVDEKPQISFLEPLDIQKQNQLNKLASPQTQQQHQQQHPQQQQLLISSTSSLSVNSNSNSNNNNNVSCSKLPTTLSPSSSSAIGSVSSVVAAAAAANAALVAAIAASCSNTSLNTSTSLSMNAPHIDSDLIMASLGGPSGGGGNSNPGYGNTTSTISNNPSSSSSSNCSQMIHQQQQRSDNFDNLNTLDLSAVTAAAQAAASTSIVNSRHTPPPSSPTSTTSSSPSSSSSSSASTLSLSAQQPSTTISASGLPAPTSTIATEPKSSHSLAMDTISSTFATSSPATKLSSSNSSSSVSNTLSNKNNSSAVLPPTPTTVADFLHQQFQHMQNQIRITSPASAIVVSASGSNTEMNALPSSVTTSAPSLSSLTASLAAAAAAVGSGSVTDLSSNSSGVKLINDFLQHQFQQQQQQQHQQQQPHSSFSTCPEHPDVKGIDCKTCEMIEINIKSPMTPTRSPNSINLFPSNLTLSPTAAAAPSFTIGACPEHINGRPLGVDCSRCEMILNSARLNSGVQMSTRNSCKTLKCPQCNWHYKYQETLEIHMREKHPDGESACGYCLAGQQHPRLARGESYTCGYKPYRCEICNYSTTTKGNLSIHMQSDKHLNNMQELNSSQNMANTPAEIRESPKIIMPNMQQQASKPKPSFRCDVCSYETSVARNLRIHMTSEKHTHNMAVLQNNIKHIQAFSFLQSQNIGQLSAAQSAAVAASNLPNIPNLQNFLPEAALADIAYNQALMIQLLHQNSAAGALSAAAAAAAAASPLTLPPPQQSPAGSGGTVSQPQTTPTSQQPPQQSFQSLQTSKLNHQSSPQQSMPTVATTGAPSLSSSSTTASSHPSSSQQHSAVAAAALLAEAAAAAAAAATSDPASICLQQQQQQQQQQIQQQDSSLDPPIDPDPKPTTAFSCLICANYNTNSIDELNNHLMIDRSRNTNNNCSDIMMIINNNYICRLCNYKTNLKANFQLHSKTDKHLQKLNYINHIREGGVKNEYKLKYNQTNTVQLKCNCCDFYTNSIQKLNMHTQHMRHDTMKMIFNHLLYLVNSFNVSMGGDSMAVVSENSEYQLMSKNKVLMCKLCNYSAVNILQIVQHVKSLRHIQVEQFICLQRRSENLESLGLDDVFKIADNTDCIKSERSSPVPSLESHQLLVKEENEKSSPMATTTTTTTTTALAVTPPATTTTAKSSHDIYIDSSSDHQQNVSSSSAIAVSNIANCNKELADVDLSSLPSIIYKCNNCDYFAQNKSEMQNHIVNTHQNVSEEDFLTIPTNPAALHAFHAAVAAAAVAVAENAAASQSRSKSSSPVHMHMQDDRQQQQHHDQGFATSVRGNELMPQVKTEQMDIMDDAQSNDECEQLEDPIELLSSRNSANVTSSPAVKSVMCPLCQDTFSEKKALEMHLMGVHSVNSDGLARLLQLVDSSHWLNSSRRSSTSTTPEPRNSSTPLSEAGGQLAFQQPQQQQQKQQQPGSSPVNQHVFSDEYACAQCGITFKLQQHLLMHANDAQHYQMVNEQYQCLAKHCQQLFGNLLQMLTHYKDSHMNIVISERHVYKYRCKQCSLAFKTQEKLNTHSLYHTMRDATKCMICNRNFRSTQSLQKHMEQAHSQLQPTSPIPSPSSSGEIANAPGSLSVTPSSRAEDEEVTANAAVTTSMTTGATTGTNNKFSNNQYNVDGVGDGQDIVSLKSSIYDSISVSVSAPPPPPPPPLPPPLPPPPTVLDILPSDKSQLEDYLNSQQMSEVYYTDVERKLKCHKCKVAYTNQSYLSKHYKSNQHRRNEKLSIYPLEKFLDPNRPFKCEVCRESFTQKNILLVHFNSVSHLHKAKKQQAGGSASPAKSIPMFSDAIEGLSESKGQTVEIAGGSGSGSNACGGDGGITAVNKSPFSKRKVSLESDYESPKKRFKCEICKVAYAQGSTLDIHMRSVLHQTRSCRLQEQQAQPLKPMTPPSSSESPTSTTATTPTLNDHMYKSLLETFGFDIVKQFNEINKLCTVSNSRNYYCRHCNKEFSSIFVLKTHCEEIHSEQIPLELLEKFAEKFKHIYLDQEPTSPTSLLTITNSDQNPNQCFSFNESENNLGATTSTACSDSNSPSPVGGEPAAGSNETSLIVPSTSTIAAPTSPNTSAVTAVAAALLKQQQHQQQQSLTPDLVQKLNLDPTMLAQKIMEQNFSNFPPNFPGLPQNLQSLQSLQSLQNLQNMQQNLPNMGNMPMNTLDMLNLMQFHHLMSLNFMNLAPPLIFGAGGTGPSPSVTGTTSATPSDLPPTAATQIIQQQTSSSSQKGFTFSSQNTSNQKRARTRITDDQLKILRAHFDINNSPSEESIMEMSKKANLPMKVVKHWFRNTLFKERQRNKDSPYNFNNPPSTTLNLEEYERTGQTKVTPLSESGGGSISGFHLQQQQQQREQQQREQQQREHQQQQIQQQQQLQQQIQSQHLQQQQQQRPPSSQSSDLNFPQLSFHQQQHDLSRQQLHQQQDNRPLSHPSSVTSDRGDIHIKPEPADDIGSSDCDQQMAMSKDHDNEQSIMQSHHQQSMFYNNFETKSESGSSEILSRPQTPNSTSTPYSSNISDILGQQMDSLPLNNMANISNLNNMGPPKKFQMNKMFEKSGNFETNSNSSNSSTSSGKRANRTRFTDYQIKVLQEFFENNSYPKDSDLEYLSKLLLLSPRVIVVWFQNARQKQRKIYENQPNNTFYESEEKKPNINYACKKCNLVFQRYYELIRHQKNHCFKEENNKKSAKAQIAAAQIAQSLSSEDSNSSIDINSTNMLSSNLVGPQAAAAAAAVAVAAAAVGGNVGTAIPPVMPGLASSPGINLLASPQHIFKQQQAVTSVGGSHADSTSPLQKFECDKCQLVFNRYELYKEHQLIHLMNPNLFMNQNYNESSPFGILQNLQGNHNSQQDTSIDLSRQKKRKYSDTQNSPDELHHQQTDYEAFNKKFKNDQYEFLYQYFLQNDSNSDLKKQFQQQQQQPEMDLDYLANFYQQNEYKKLSNYDFLLQYYMRNESKQPSNASNLMLLNDDANKPNMDFLLQYYQLSESKKFFQLEASPQRIHDFPPLLNLTSNAAAAAVAAINNGVSAQQHQEQQQVSVISPTETPMNTTTNTNVAISSPNSSPVRQQSNNSGCAIGNKDTTSNNKNCQTHPTVLTENVKLLSATVSAFGGGSGNGSGVPTATLPITTPNTTNTGSTSFSTNSINNNRHCPILSPLQTKQQENSSLISNRLDAAPAAAAAATASEFAVQSIKLSVNNISTNAVSKTSPIANSIDMMDAELGITHHHQQHDSSATSVSSASHQLEETVTTTEKQNSKRLRTTILPEQLNFLYECYQNESNPSRKMLEEISKKVNLKKRVVQVWYQNSRARERKGQFRQNIQIINKKCPHCAAIFKIKSALEWHLQSKHGDKQAINVDQIPDLKFSDGLLNFSSSTPYGIKVDEQHKEQQKLKEQPSSPTTTPTTTTPPPATLSPVKSALCVASVVTTNAAAAASSPAVSSAVSSVASSASILTLAKGGGTTPLDLSKTPTPLVNNFSKYEQSESDISFSDSNNDHDESNDFFTPSSSLNNGNLGNLVPNNRANNCINNRRQYDTNLIDGNNVAGGCSYNDNNTMSNISDYLGNERENTNSPVSQTSSNNSAQQKKRFRTQMSNLQVRILKTLFHDVKTPSMTDCSNVGREIGLGKRVIQVWFQNARAKEKKSRNQRYIHDENTFENDKPNVDSTTSNNILEIRECNICQLPSVNIQEHAFSAQHIAQVRVLLEANSSNKSDDNQQYVENSMEHEFNGIYSKLYTQQQHHNNHSHHHKNENADSGRADTSHHDKDDSDVDYHCPNIMTNVDCVNNDKNYENDDDDDDEHDDDDHDHDADDHDDDAKTKLGIQNEDMALSEANKAALALRNFNKLQQHFAAATAFAKQQHQYPHQQQLQQQQQCDTISSASPPTMENNLMLKINTDNPLATNHSEMLQQLFSYSQMSGESYM

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2