Basic Information

Gene Symbol
zfh2
Assembly
GCA_001188975.4
Location
LGAM02021656.1:515373-604126[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 13 0.082 7.1 7.5 0.4 2 23 243 265 242 265 0.95
2 13 0.00021 0.018 15.6 1.4 2 23 872 894 871 894 0.96
3 13 5.3e-05 0.0046 17.5 0.7 1 23 926 950 926 950 0.92
4 13 0.0093 0.8 10.5 0.4 1 22 1049 1070 1049 1073 0.91
5 13 1.2 1e+02 3.9 1.9 1 23 1321 1345 1321 1345 0.91
6 13 6.2 5.4e+02 1.6 0.7 2 22 1485 1505 1484 1508 0.88
7 13 2.2 1.9e+02 3.0 0.8 1 23 1645 1668 1645 1668 0.91
8 13 0.00054 0.046 14.4 0.8 2 23 1821 1843 1820 1843 0.95
9 13 0.25 21 6.0 2.7 2 22 1941 1961 1940 1964 0.88
10 13 0.0011 0.095 13.4 3.9 1 23 2016 2038 2016 2038 0.98
11 13 0.0097 0.84 10.4 3.3 2 23 2045 2067 2044 2068 0.93
12 13 0.0002 0.017 15.7 2.6 1 23 2772 2794 2772 2794 0.97
13 13 0.014 1.2 10.0 3.8 1 23 2918 2940 2918 2940 0.97

Sequence Information

Coding Sequence
atgttaacagACAGTGAAAATAGTCAGTTCTCGTATAATTATTCATCGGAAAACAGTGTTAATAATGTCTTCGAAAAAAGGGACGGTACGGGAACCAGTTATCGTGGAAGATGTTTCGCCAGTAATGAACGAAAACGAGGAATATCCTCTACCTGCGCAGgatattttaaaagatttagACCAGACGCAGATGATAGCGAATACACAAGCAGTGTAGACAGCGATTATGAAGGAAAAGCAGCAAAAAAGGAAAAGGAAGCAGGATTCCAACGTTCTGAATTTCATGCCAATGAATTAACAGCAAAAACCGTACAAAAAACGAGTGAATCTAAGCAATTACTGACGTCAAATGTGGTTGGGGAAACAAGTGCTACAACTCAAAACATAACATCAACAACTGTAGCAagtgcgacaacaacaacattgagtACAACATCGACTACGTTAAAAGAAGCACCAGGTGGTCAAGATGGACAAGGACCCGATGCCTTAAAATGTAGCGGAAGCCCACGTATACATTCATTTCGAATTGTGAGTGCTCAGGATGCTACAACCGCCATCGTTGCTAACACTACCACCATCGACAAAAGCAGTGTGGAGAAAACGGAGCTCAGAAAATCGTATATCGAACAGCAACAGCGTCGCCGATATAGCACCTTCGACGATAACGATAGCGATAGTAGCAACGACCACAAAAGATCTTCCAAAATACAAAAGCCGATATTGATGTGCTTTATATGCAAACTGTCATTTGGGAACGCAAAATCTTTTAGCTTACACGCTAACACAGAACATCAATTGACACTACAAACAAAAGAACAGCATTTATTAAACTGCGAATACTCGAGCGTAATTATACAGCCGCAAAATATGGATGAACGGCCGCAAATATCATTCCTTGAGCCAATTGATGTGCACAATGCAATCCAAAATGATAAATTAAATTCTGAGACTGATATTCAATTTAATGAGGGTACATCGCAAGTATTTTCTTGTAGACTGGATAGTGCGGTTGATACTTCCATCTCTGAAAGTTCGAGTACAGGATGTGTAAGAGCTGAAAAATCGGACAAACCAGCGCCAATATCGTCAAACTCTAACTCATTATCGCCATCTTCATCTTCAGCTTCCCCCGTGTCATCAGCAGTAGTACCAATAGCACATACCGTGTTGTCACCATCACAGATAACATCAACTACGTTGTCCAGCGCAATCGAATGTATTAACGATAGCCATCTATTTCAAACAACCTCATTTTCAAAATGTATGCTCCAGGGGCAGCGAAATGAGGTGAGCGATACTGCAGCAACAACTGCAGTAACAACACTGTTATCAAGCCCTAAATTTGAAACCAGtgaacaacagcagcagcaaaatatAAAGGCAAATGTAGCCATAGAATCGAACGACAGTACTGAGTCCAAAATGCAAAGTGCATCGATATCACGATCACTGATTCCCTTAACAGAGAACATGTTAGCACTAACCCCACCCTTATTACCAAAAAAAGaatcagaaaatattattaaacttgACGAACCATATAGTATGGACAATAGAACTGCATCTGGCGGCGTTTTATGCTCAAGTGCTGATGTTCAACTGAAGCCAAGATTGATAGCGTCTCCGATCCCTACAAATAAAAACACCGCTATGACCTGTGATATGGGCAGTTATGACGAACGCGGTTTATACTCAGATTACATCCGTAGCCCGCCTTTGACAACATTTAGGAAAACTCATGATAAAATGATAGAAATGTCCTCATTAACAGTTAATGCTCCAACTACTCATGGTGACTTTGTACATTTTGAAGCATCTTCAGGAACGGTATCAGAGGAAGGTAATATGTTGAATCCAACAGATACACTGGCAGAAACAGTTACATTTTTgaagcagcagcagcgacaAATTGCAACAATGACCAGTACCTCCTATGCCACCCCTCAACCTGCATTAATGTCCATTGTACCCACGCACACTCAACTCAGCTGCTTACATGCCTCTTTGGCAGCTTTATCTGATGACAGAAATATCAACTGCAGTACTACAGATGCACAAAAAACGAACGCTAAATTGTTCACCGACTTTTTGCAACAGCATcttaatttacaacaacaaaaaacgtatTCCGATGTAAGTGGCAATTGTTCTGAACATGCCGACTATAAGGATAGCGACTGTAAGAATTGCGAAATTCAACAATTAAAGTCGTCTCCGTATCACAGTTCAATACACCAGCTTACCAACAACACGTCGCAATGCTCGCCAAATCGCAATAATGGCAGTTGTACCAGCAGCGCTATTATGAAATCTCCTGCACACATGGCCACATCTCCTACAGCTGTCAATGTCTCAACTGCGACAGTAGCGGCGACAGCGACAACCCAGCAACAAAATGCAGCTGCTGCTGTAGCTGTGgctgctgcagcagcagcagctgcggCAGCTTCAGTGGCTAACAATACCTCAAGTTTTACTATCGGCGCATGTTCGGACCATATCAACGGACGGTCTTTAGGCGTAGAATGCGCTCGGTGCGAAATGATTTTAAATTCAACGCGTCTTAACACAGGCGTTCAAATGTCAACACGTAATTCTTGTAAAACATTGAAATGTCCTCAATGTAATTGGCACTACAAATATCAGGAGACACTCGAGATTCATATGCGTGAAAAACACCCGGATGGCGAGAGTGCTTGTGGCTACTGTTTGTCCGGTCAACAGCATCCCCGTCTAGCACGCGGCGAGTCATATTCTTGTGGATACAAACCATATCGCTGTGAAATTTGTAACTACTCGACAACGACAAAAGGCAATCTTTCCATTCATATGCAGTCCGATAAGCATTTGAATAATATGCAGGAGTTGAATAGTTCACAAAGTATGTTAGCAGCCGCCGTCGTTGCTGCTAATAGTGGAAAGGCGGATGTTGCATCCAAAATGTTGCTATCCAACAGTGCTGCAGCTGCtgctcaacaacaacagcaagcactACCGCAGTCTCAATCCCAACAACCACACTCTACGCTTGTCGCCAGTTCGTCAATGTGTAGTACCTCCGCTTCTGTGAGTGCCAGTGGTCTCGCCAGCGTCGATGGCATTGGAACTAATTGCAACATGCTCAATAAAAATAAGCCGTCCTTTCGTTGTGACATTTGCAGTTATGAAACATCTGTAGCACGAAATCTTCGTATACATATGACCAGCGAAAAGCATACGCATAATATGGCTGCACTGCAAAATAATATCAAGCACATACAAGCTTACAGCTTTTTACAGCAGCATTCGCAAGTTGTAGCCgcacatcagcaacaacaattagcaGCAGCAACACAGGTATCACAATCACAATTACCAGCACTGGCCAACAGTTTTTTGCCAGAAATCGCTTTAGCTGATTTAGCTTATAATCAAGCGATAATGATCCAGTTGCTGCAGCACAACTCTGTtagtcaacaacagcaacaacatgaaTCTGTTGCCAAAATGCAGGTAAAAACTTCACCATGTTCCTCACCGCGTTCCATGCAAATAGAACAGCAATCAACAGCtcaccagcagcaacaacagcaactccAACAATTACGTCAACATTTACAGCAGTCTTATTCACCGAACAATAGTTCATCAATGTTATTACTTTCGGCTGATCCGTTCGGCACCAGCGATGTAAATTTGCCTTCCCACGTTAGTGGTAGCTGCAGTAACGACGAAACGTTGGAGCCACCCATACATGCCGATCCATGTCCAACAAACCTGTATAGTTGCTTAGTATGTGAGGTGTTCGGTACAAATAGTTTAGACGAATTAAACCAACATTTGCTGATTGATCGTTCACGTTGCTGCAACAAGCAGCAAGCTACACCAACAAATGCGGTCGGCAACAGTTGCACCGAGAATGCTAATGTCACTAATACAGTTAATAGCAATGATATAATGGTAATTTTAAACAACAATTACGTTTGCCGATTGTGCAACTATAAGACTAATCTAAAGGCAAACTTTCAGCTGCACAGCAAAACGGACAAACATCTGCAAAAATTGAACTATATAAATCATATACGTGAAGGCGGTACACAAAACGAATACAAGCTAAAACATTTACATCTAGCTACTAACACAGTACAACTTAAATGCAACTGTTGTGATTTCTACACAAATTCCATCCAAAAGCTGTCATTGCACACGCAGCATATGCGTCATGATACGATGCGTATGATTTTCCAACATATACTGCATATCATGGAACAGCAAAATTTTGAGCGAAACAAAACCAAAGAGATGGCACCACACTCAAAAACTCATTCACCTAAAGATGCAACTGAAATTGTTCCAAGTATTCGTGAGTCGCACAGTGTTGTGCCTGTTCATGAAAATGATGGGAGTGACACCAACCACAGCAGCAGCATTGCAGTGGAAGACACGCCGCAGTCGATGCAAAAGTCTTTAACATGTCAATTGTGTGATTTTAGCACGTTTACGTTGCTTAATATGATCCAGCACGTGAAAAGTATGCGTCATATGCAAATAGAACAGTTTGTAAATCTACAACGACGCAGCGAGCAACTTGATCCTCCTAGTCTGGATGACATTTTCAAGATGGTAGAACGACCTACACTACAGTCTTCGGCACTAACAGCTGCTGCAAGACAAGAAGaAAATAACAGAATGGAAAATGTTACATCTGTTTCGCGATTCGGCAATTTTCCAATGCCACTGGCACGTTTATCAAACGATGATTTCTCTAATAGAGATCACAACTCTTTTTCACCGTCATCTAACTCCTCAACTGCTTCAGGAAACACTAGTGTGCGCAGCAAAAGCTATCAgagtgttaatatttttaactttcaaaCAGCAAGTGATGACAAAAATGCTTCACCATCAGCACTTCCGGGAGGTAGTCCCGCCGTCATGCCCTCAGTTGTATTTAAATGCAACAattgtgatttttttgcacattcCAAAGCTGAAATGGAACTACACCTGAGTGCAGTACATCCTCAAACTGAGCCAGACTACATTAGCATCCCAACGAACTCTGCTGCCATACAAGCATTTCAAGCAGCTGTAGCGGCAGCGACAGCAGCAGCTTCTGCAGTGACTGCCACCCGTGCTCCTAGTAACAAATCTAATACAGAAAATGATGATTTCCCTACTATAATTAAAAGAGAGCGCCTCTGTACGACAGAAGATGAAGAAACCGTAAAGTCCGCAGATGTGCAGAATACAATGTCGCTTCAGGATGTGTCGACAATTACACCTTGGTTCACAAAGACAacgaatacaataaataattacgaAACTTCAGCAGCTATGGTTCAACCTAAATCAATTCATAATGTATTACATGAGTTGGAAAAGACCGAGGAACAGAAAGCACAGCAAATTGAAGATACAAGTGGGGATGCATTGATTGAAAGTACCAACAAATATTCCGGAGAAGCAGTCAATGTTCAGTGTCCCCTCTGTACAGAAACATTTGATAGCAAACATACATTGGAAACGCATTTGATGAATATACATAGTGTGAATCACGATGGTTTATCACGTCTCCTACAATTAGTAGACACTAGCGCTTGGGATTTGACTGGTAAAACCGCAACAACGCCTACTATAGCTAAAGACTGTAAGGACTCAAATAGCGCCTCCgaaacaacaactaaacaaaCTATCACCGGTACTATTAATATTAAACCATCGACAGAATTAGAGTTATCGCTAATCGAGGTTTCAAATGACATACCTTTAAATCTCAATGCTTCTCATACCTCGACTTACATGCATTCCATAGAAtcgttaaataataataaattgtcaTGTGAACACTGTGGTTCAAAATTCAAGCATGAATTACAATTATTGCAACATGCGCAAAAAATGCAACATTTCATAATCTTGCCAAATGGTGGTCACCGTTGTTTGGCTGCTAGTCATCCCAGCCGACCGTGCCATTCGACGTTTCCGACTCAAGCTTCAATGGTCATCCATTATAAAAATACTCATATTAGCTTAATCATATCCGAGCGTCATGTGTATAAATACCGATGTAAACATTGTTCACTTGcatttaaaactcaagaaaaacTATCTACACATTTGCTGTATCATACGATGCGGGAAGCCACCAAATGCACGCTCTGCCAACGAAACTTTCGTACGACTCAAGCGTTACAAAAGCACATCGATCAAACACATCATAGTAGCGGCGATCAACATGTCGTCAGTAACAGTCCACCGACAGTATTAAGCTATAGCGGCCGTGGATCGCCAGCCTTACAGATGAGTCACGTGCAAAACAAATCCCAATTAAATGACAATGGCAATGCAGCTAAAAACCAAGACATCAatgTCCGTGCATCGCCCCCTACGACACCGAAAACAAATGAATCTTCAAAGGTATCGTCTTCCCCATTACCAGCTAACATATTGCAAgaccaacagcaacaagaacatGCTGCTGCTTACACCGCTGCGTTATTAAATCAGTCAACAcctctacagcaacaacaacatatcagCGGCGAAGAACTAACAGAGAGCGGCTGTCATCTTATGCAGCATACACAAATGAAACCACGTAGTCCCCTCTTAACACAAAAATATCTTCAACAACAGAATCTTCAAAACCTTCAACAATTGCCACAATTGACGGCAGCAGCAGCTACATCCGGCTTTCAATTGAATCCTGTCGAGATATTTAATCTTATGCAGTTTCATCACATAATGTCAATGAACTTTATGAATCTTGCACCGCCGCTTATATTTGGGGTCGGCACTAATTCTGGCTCTAGTGACAATGTTGATTTGTTAACACCCACTCACATACCTAAAGTTACTTACAATCCCAGTGCCAATACACTTCTATGTGGGAACGATATGTCGGTACCAGGCACACCAGTGGCGCCGCGAGCGGACCTAATAGGTGCATTACAGCAACAGCCTCAGTCCCAATCACAACAATCTGCACCAACTAGTACCGTTCAGATGGTAAATAATCAGAAACGTGCAAGGACGCGTATCACTGACGACCAGCTTAAAATTCTGCGAGCTCATTTTGATATTAATAATTCGCCAAGTGAAGAAAGTATAATGGAAATGTCACAAAAAGCGAATTTGCCAATGAAGGTCGTCAAGCATTGGTTTCGGAATACGCTGTTTAAAGAACGTCAAAGAAACAAAGATTCCCCTTACAACTTTAATAATCCACCATCTACAACACTAAATTTAGAAGAATACGAACGCACTGGACAAGCAAAAGTTACGCCGCTGAACGAGGATTTACCTACGGCATCGaaTAACTctatgcagcaacaacaaaagcaaaatattcatGAATCGAATcccaaaaataattcaaaagaaaaaccaACCACTATGACAGACATAAGTGGCGATTCTTCGTTATTAATTGACATCAAAGCTGAACCCCGAGATGATAATATTGAGGCATACATTACAACTCCAGGGAGTTCACAACGTCAAAATGAACAGCAGCAACCTCAAGAGCAAAGGCAACGAGAACTGCATAAGAACGACGAAAAAGATTTATTGAGCACGTCATCGGCATTGCTGCTGCATAAACGACAACAGCATTTATCCGCATTGTCAGCGTCtgaacatcaacaacaacttcTTGAAATACCGGGAAACACAGTAGTGGGAACGGCTCCTGTTACGGCGTTGCAGCAGcatcatcaacagcaacaacatttgcatggtcaatcaaatcaaaatattcACCAGCAACATCCACAGCATATCAACCTATACAGTTATGAAACCAAATCAGAAAGCGGCAGTTCGGACATCCTTTCCAGACCTCAGTCACCAAACAACAGTTCTACAGTTGTACCGACACACTATGCCAGCATTAATGAGTTAATAAATCAACAATTGGATAATTTGCCGCTCGGCCATAATATCAGCAATATAAATGTTGGCAACATGCACGGTAACAACATGGGCCCgccgaaaaattttcaaaccagcAAATCTTTCGACAAAAATTCACCGACTTCTCAGTTTGACACCAATTCGAATTCATCGAATGCATCGTCCACATCGAGCGGAAAGCGCGCAAATCGCACACGCTTTACGGACTACCAAATTAAGGTACTGCAGGAATTCTTTGAGAATAACTCCTATCCAAAGGATAGCGATTTGGAGTATTTAAGCAAGCTATTATTGCTTTCTCCACGAGTTATTGTTGTCTGGTTTCAGaatgcTCGCCAAAAACAGCGTAAGATTTATGAAAATCAACCAAATAATTCACTTTACGAAACTGAAGAGAAAAAGCATAATATTAACTACGCCTGCAAAAAGTGTAATATGACATTTCAGCGCTATTACGAGTTGATACGCCATCAAAAAAATCATTGctttaaagaagaaaataacaaaaagtcgGCCAAGGCACAAATAGCCGCAGCACAGATCGCACAAAATCTCAGCTCAGAAGACTCCAATTCTTCGATGGACATTAATAATAGCAGTGCGTATCAATTACAGCATCAACATATTTCTAATGCTGCAGCAACAGCGGTGTTGTCAAGTACCGCCAACGTTTCCAGCACATCACCTAGCAGCACAGCTCCTGGCGTTACATCGCCACAGCATTTATATGGTAAATCGTCAATGTCAATGACTGATTTTAGTCCATCCACTACGCCCACTCCACCACAGACACAGCGTGAGCGCAGCGATAGCAGTGAATTGCTGCCGCAAGGTCCAGTGCACAAGTCAAAATACGAATGTGACAAATGCAAATTACAATTCAGTTACTACGAACATTTCCGTGAGCATCAGCTACTACACCTAATGAATCCCAGCTTATTTACCAcacaaattacaaacataccAGAGGCTTACGGTTCATTTGGCAGCATATTGCAAAGCTTACAGCAAGTTGCAGCTGCCAGTGCCACGCATCAGCAGCACCACCAATTCCTAGAACAGCAAGACCAACCACCCGCAAAAAAACGTAAGTGCTCTGAAACATCTTCAATAGCAGATGATGTGTCATCAATTTTTGGCACCGGTGATGGTGAAATCAGCAACCCTGTCTCATTTTCGTTATCAAACAGcaaaaaatacgaatttttatatCAGTATTTTATGCAAAACGAGAGCAACAACGAATTGAAACAACAATTTCaagcacaacaaaaaaagtCACATGAACCAGAAATCGAAATggaatatttaacaaatttttaccaCCAAAGTGAGTTACGAAAGCGTAGTAATTATGATTTCTTGTatcaatattatcaaaaaaatgaGCACACACAGCAAATGAGTGCTTTGCCGTCCCAAGCGGGCGTTTTCGGTAGTGAAAACAAACCAAATATAGACGTTCTACTCCAGTACTATCAACTGAATGAATCAAAAAGGTTTTTTCAGTTAAATGCCTCGAACCAAGAACTAAATGACCTTACGGCAGCGTCGCCAACATCCACATCACAATACCAACATGTAAATAATCCGATACCAATATCGAATAGTGATCATACAACCACACCTAGAGTTGATCGGTATGATATAACCGATATGAGTTTATTGAAAAACTCTCTGGATGTCCCTAatagtagtattaattttacAGATTGCGATAATGACAACAACGACCACGAAATAAGTAGTAATGGTTGTGGCATGGATGTTGACGACGCTGAAACTTGTACTGACACTGATGATCATATTAATGACAATAATCAAAAATCGAATGAAATCTACGGTCACAATAGCCGAATCAACATAGTGAAGAGTAGTAACGATAATGATAATAAGGATAATGAACAAGCAGCAAATAAATTTGCGAAAATGGATCCAATTAATGAACTCCCTACTAGTCTGCAATCTCCATTTTCTTATAGCAATGAGCAACATAAGCTCAAGGACTTAAGTAAATCTCTTGCCAGATgcggaaaaaatcaaaaacataatTTGACAAAGCGTAAAGCATCAAATGTGTCATCAAAATCAAAtacgacaacaacaattaataatGAGTATATTGATCATTTAAACGATTTTCTTCATGTCAATCAAAAATACGAATCCAAAGAGAAAAGTAAACAGCAGCAGCATTATAACGATTTACTACAAGCACAAAATGAAACCAACGGTCTTGGGAATAATGGATCAGCCCGACAAGCAATCGATAAGAGACAATTAGCAGAGACTACATCATCATCACCGAATTCcactacaaatttaaataacaaaccTACCGTTGGGTCttctaccacaacaacaaacacagtcACTGGTATTtctgaaaaacaacaaagcaaacgATTACGCACTACAATACTGCCGgagcaattaaattttctatatgaATGCTATCAAAATGAATCGAATCCAAGTAGAAAAATGCTTGAGGAGATCGCAAAAAAAGTTAATCTTAAAAAACGGGTTGTACAAGTTTGGTTCCAAAACTCACGTGCCAAAGATAAAAAGTCACGCAATCAACGCTTTACTGCAATATCCGATGACAGTAACTACGAGGATGCTTCGCAAAATAATCGAGACTACGGActctttcaaaaaaatacattgAAGTCCACTACAATCCACGGCAACAAGGGCAACATGAACTTTTTACCTGATGCTAATATAAGTAATAACACCAACACTGGAATGGAATTAGGCGGCTGTAATTTATGCCAACTGTCTCAAGTAAATGTTCAGCAACATGCCTTATCTATTGAGCACATATGTAAAGTAAAAAAGCTGTTGGAGCAAACCGCAGTTGATGTCAAACTTAATATCGATTCGAAAGTTTTTTCAACTGCAGTGACTGTGGAAAATGAAGaattcacaaataaacaaagtgCAACTGGCCAGCCAAATACTAACGCCGATTTAAAAGAAGGTAAATATGTCACGAGTGACGAAGCTGCTCTTGCCCAAGATAGTGATTTCAAGACGTATGTGAATACTGATATTACCGGCAACGATAGACTTGTCACTTTTACGAGAATATATAAGGAGCtgaaattacaaaatacaaTGAAACGAGCCATCGATTATACATCCAATAACGAAAACAGTGAATTAATGGAATGGTATGAATGTGgcaatattaataatgaaagcAATGGAGCAAAAACAAACGACATAGGTGACAATCGATGTAAAGGGCGAAAACCACAGGTTGCTAAACAGacttcaaaagaaaataatgaaaaacaagaaaggGATAAAGTTATTAATAGGTCATACAGTGCTGAGAATGTTGAGAACACTGATATTGGAAGCGTCAACAATATTGTCAATCACAAATTAATGGAAactcaatataatattaaatatagtgAACGCAGCAGCAAAAGTGCcatttttgtttcaaatgcACCGCTGGCAACAAGTATTACTGATGATGAAGATAATTATATAGACGAATACAAGGATAGTAGTGATGGGAATACGATAGATCTAAATGATAATAATGAGACCATACAAAACCCATCAACCAGAAATGAAACAATAACAGCAAGCAGTAACAATAGCAGCACATCTTGTACATTAACAACCAGTACAACGCATACGCAATTAACTAATACGCATGCTCAAGATATTattcaacaattatttaattgtaatcaAATAACTGTTTCGAGCGGCAAATAA
Protein Sequence
MLTDSENSQFSYNYSSENSVNNVFEKRDGTGTSYRGRCFASNERKRGISSTCAGYFKRFRPDADDSEYTSSVDSDYEGKAAKKEKEAGFQRSEFHANELTAKTVQKTSESKQLLTSNVVGETSATTQNITSTTVASATTTTLSTTSTTLKEAPGGQDGQGPDALKCSGSPRIHSFRIVSAQDATTAIVANTTTIDKSSVEKTELRKSYIEQQQRRRYSTFDDNDSDSSNDHKRSSKIQKPILMCFICKLSFGNAKSFSLHANTEHQLTLQTKEQHLLNCEYSSVIIQPQNMDERPQISFLEPIDVHNAIQNDKLNSETDIQFNEGTSQVFSCRLDSAVDTSISESSSTGCVRAEKSDKPAPISSNSNSLSPSSSSASPVSSAVVPIAHTVLSPSQITSTTLSSAIECINDSHLFQTTSFSKCMLQGQRNEVSDTAATTAVTTLLSSPKFETSEQQQQQNIKANVAIESNDSTESKMQSASISRSLIPLTENMLALTPPLLPKKESENIIKLDEPYSMDNRTASGGVLCSSADVQLKPRLIASPIPTNKNTAMTCDMGSYDERGLYSDYIRSPPLTTFRKTHDKMIEMSSLTVNAPTTHGDFVHFEASSGTVSEEGNMLNPTDTLAETVTFLKQQQRQIATMTSTSYATPQPALMSIVPTHTQLSCLHASLAALSDDRNINCSTTDAQKTNAKLFTDFLQQHLNLQQQKTYSDVSGNCSEHADYKDSDCKNCEIQQLKSSPYHSSIHQLTNNTSQCSPNRNNGSCTSSAIMKSPAHMATSPTAVNVSTATVAATATTQQQNAAAAVAVAAAAAAAAAASVANNTSSFTIGACSDHINGRSLGVECARCEMILNSTRLNTGVQMSTRNSCKTLKCPQCNWHYKYQETLEIHMREKHPDGESACGYCLSGQQHPRLARGESYSCGYKPYRCEICNYSTTTKGNLSIHMQSDKHLNNMQELNSSQSMLAAAVVAANSGKADVASKMLLSNSAAAAAQQQQQALPQSQSQQPHSTLVASSSMCSTSASVSASGLASVDGIGTNCNMLNKNKPSFRCDICSYETSVARNLRIHMTSEKHTHNMAALQNNIKHIQAYSFLQQHSQVVAAHQQQQLAAATQVSQSQLPALANSFLPEIALADLAYNQAIMIQLLQHNSVSQQQQQHESVAKMQVKTSPCSSPRSMQIEQQSTAHQQQQQQLQQLRQHLQQSYSPNNSSSMLLLSADPFGTSDVNLPSHVSGSCSNDETLEPPIHADPCPTNLYSCLVCEVFGTNSLDELNQHLLIDRSRCCNKQQATPTNAVGNSCTENANVTNTVNSNDIMVILNNNYVCRLCNYKTNLKANFQLHSKTDKHLQKLNYINHIREGGTQNEYKLKHLHLATNTVQLKCNCCDFYTNSIQKLSLHTQHMRHDTMRMIFQHILHIMEQQNFERNKTKEMAPHSKTHSPKDATEIVPSIRESHSVVPVHENDGSDTNHSSSIAVEDTPQSMQKSLTCQLCDFSTFTLLNMIQHVKSMRHMQIEQFVNLQRRSEQLDPPSLDDIFKMVERPTLQSSALTAAARQEENNRMENVTSVSRFGNFPMPLARLSNDDFSNRDHNSFSPSSNSSTASGNTSVRSKSYQSVNIFNFQTASDDKNASPSALPGGSPAVMPSVVFKCNNCDFFAHSKAEMELHLSAVHPQTEPDYISIPTNSAAIQAFQAAVAAATAAASAVTATRAPSNKSNTENDDFPTIIKRERLCTTEDEETVKSADVQNTMSLQDVSTITPWFTKTTNTINNYETSAAMVQPKSIHNVLHELEKTEEQKAQQIEDTSGDALIESTNKYSGEAVNVQCPLCTETFDSKHTLETHLMNIHSVNHDGLSRLLQLVDTSAWDLTGKTATTPTIAKDCKDSNSASETTTKQTITGTINIKPSTELELSLIEVSNDIPLNLNASHTSTYMHSIESLNNNKLSCEHCGSKFKHELQLLQHAQKMQHFIILPNGGHRCLAASHPSRPCHSTFPTQASMVIHYKNTHISLIISERHVYKYRCKHCSLAFKTQEKLSTHLLYHTMREATKCTLCQRNFRTTQALQKHIDQTHHSSGDQHVVSNSPPTVLSYSGRGSPALQMSHVQNKSQLNDNGNAAKNQDINVRASPPTTPKTNESSKVSSSPLPANILQDQQQQEHAAAYTAALLNQSTPLQQQQHISGEELTESGCHLMQHTQMKPRSPLLTQKYLQQQNLQNLQQLPQLTAAAATSGFQLNPVEIFNLMQFHHIMSMNFMNLAPPLIFGVGTNSGSSDNVDLLTPTHIPKVTYNPSANTLLCGNDMSVPGTPVAPRADLIGALQQQPQSQSQQSAPTSTVQMVNNQKRARTRITDDQLKILRAHFDINNSPSEESIMEMSQKANLPMKVVKHWFRNTLFKERQRNKDSPYNFNNPPSTTLNLEEYERTGQAKVTPLNEDLPTASNNSMQQQQKQNIHESNPKNNSKEKPTTMTDISGDSSLLIDIKAEPRDDNIEAYITTPGSSQRQNEQQQPQEQRQRELHKNDEKDLLSTSSALLLHKRQQHLSALSASEHQQQLLEIPGNTVVGTAPVTALQQHHQQQQHLHGQSNQNIHQQHPQHINLYSYETKSESGSSDILSRPQSPNNSSTVVPTHYASINELINQQLDNLPLGHNISNINVGNMHGNNMGPPKNFQTSKSFDKNSPTSQFDTNSNSSNASSTSSGKRANRTRFTDYQIKVLQEFFENNSYPKDSDLEYLSKLLLLSPRVIVVWFQNARQKQRKIYENQPNNSLYETEEKKHNINYACKKCNMTFQRYYELIRHQKNHCFKEENNKKSAKAQIAAAQIAQNLSSEDSNSSMDINNSSAYQLQHQHISNAAATAVLSSTANVSSTSPSSTAPGVTSPQHLYGKSSMSMTDFSPSTTPTPPQTQRERSDSSELLPQGPVHKSKYECDKCKLQFSYYEHFREHQLLHLMNPSLFTTQITNIPEAYGSFGSILQSLQQVAAASATHQQHHQFLEQQDQPPAKKRKCSETSSIADDVSSIFGTGDGEISNPVSFSLSNSKKYEFLYQYFMQNESNNELKQQFQAQQKKSHEPEIEMEYLTNFYHQSELRKRSNYDFLYQYYQKNEHTQQMSALPSQAGVFGSENKPNIDVLLQYYQLNESKRFFQLNASNQELNDLTAASPTSTSQYQHVNNPIPISNSDHTTTPRVDRYDITDMSLLKNSLDVPNSSINFTDCDNDNNDHEISSNGCGMDVDDAETCTDTDDHINDNNQKSNEIYGHNSRINIVKSSNDNDNKDNEQAANKFAKMDPINELPTSLQSPFSYSNEQHKLKDLSKSLARCGKNQKHNLTKRKASNVSSKSNTTTTINNEYIDHLNDFLHVNQKYESKEKSKQQQHYNDLLQAQNETNGLGNNGSARQAIDKRQLAETTSSSPNSTTNLNNKPTVGSSTTTTNTVTGISEKQQSKRLRTTILPEQLNFLYECYQNESNPSRKMLEEIAKKVNLKKRVVQVWFQNSRAKDKKSRNQRFTAISDDSNYEDASQNNRDYGLFQKNTLKSTTIHGNKGNMNFLPDANISNNTNTGMELGGCNLCQLSQVNVQQHALSIEHICKVKKLLEQTAVDVKLNIDSKVFSTAVTVENEEFTNKQSATGQPNTNADLKEGKYVTSDEAALAQDSDFKTYVNTDITGNDRLVTFTRIYKELKLQNTMKRAIDYTSNNENSELMEWYECGNINNESNGAKTNDIGDNRCKGRKPQVAKQTSKENNEKQERDKVINRSYSAENVENTDIGSVNNIVNHKLMETQYNIKYSERSSKSAIFVSNAPLATSITDDEDNYIDEYKDSSDGNTIDLNDNNETIQNPSTRNETITASSNNSSTSCTLTTSTTHTQLTNTHAQDIIQQLFNCNQITVSSGK

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2