Mqua068899.1
Basic Information
- Insect
- Melangyna quadrimaculata
- Gene Symbol
- zfh2
- Assembly
- GCA_949320155.1
- Location
- OX439486.1:1363848-1407504[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 17 0.35 92 5.9 0.7 2 23 123 145 122 145 0.95 2 17 0.0003 0.077 15.6 1.4 2 23 693 715 692 715 0.96 3 17 7.3e-05 0.019 17.5 0.7 1 23 747 771 747 771 0.92 4 17 0.023 6 9.6 0.4 1 22 813 834 813 837 0.91 5 17 2.9 7.5e+02 3.0 1.9 1 23 1112 1136 1112 1136 0.91 6 17 0.051 13 8.5 0.9 1 23 1393 1416 1393 1416 0.93 7 17 0.17 43 6.9 0.1 2 23 1541 1563 1540 1563 0.93 8 17 0.044 11 8.7 0.2 1 21 1640 1660 1640 1664 0.90 9 17 0.0035 0.91 12.2 2.3 1 23 1710 1732 1710 1732 0.96 10 17 0.00089 0.23 14.0 1.5 2 23 1739 1761 1738 1761 0.94 11 17 0.0054 1.4 11.6 3.7 2 23 1906 1929 1905 1929 0.90 12 17 0.11 29 7.4 0.6 1 21 1950 1970 1950 1974 0.85 13 17 0.011 2.9 10.6 0.1 1 23 2056 2080 2056 2080 0.93 14 17 1.2 3.2e+02 4.2 4.1 1 19 2152 2170 2152 2175 0.92 15 17 0.00027 0.07 15.7 1.8 1 23 2853 2875 2853 2875 0.97 16 17 0.0019 0.48 13.0 0.8 1 23 2993 3015 2993 3015 0.98 17 17 0.015 3.8 10.2 0.6 2 23 3521 3543 3520 3543 0.96
Sequence Information
- Coding Sequence
- atgccacCACCCTCAACACAATCGGAATCCCAACATAATGAACCAACACCTAGAAAACGTCGACGTAAACGTGATGATCCCCAATCGTGTTTCACCAATTCGGAGGAATACGAATCGGATGACTGCTCCCCAATGTCCTGTTCTGATGTTGAAAGTTTTCAAGGCAAGATCGTTTATAATCCAGATGGCAGTGCTTATATAATCGATTCTGAAAATGAATCACTTTCGAATATACCGGAGAATTGCATGAGTGCTgggacaacaacaacaacaaacaacccAAAAATTCACTCATTTCGTGTGGTAACGGCTCGTGATGCTTGTGTTAATATTTCCGAGctaaataaaatccaaaaaccCATATTAATGTGTTTCATTTGTAAGTTGAGTTTTGGTAATACGAAATCGTTTAGCTTGCATGCAAACACTGAACACACATTAAATCTCCAAGAATCGGAAAAGCTATTATTGAATCGAGAATATTCAAGTGCCATTATACAGCGTAATGTTGATGAAAAACCGCAGATATCGTTTTTGGAACCGTTGGATATACAGAAGCAGAACCAATTGAATAAATTAGCATCACcacaaacacaacaacaacatcaacaacaacacccACAACAACAGCAACTACTTATAAGTTCAACATCATCATTATCGGTGAACAgtaacagcaacagcaacaataacaataacgtAAGTTGTAGTAAATTACCAACGACACTATCACCATCATCGTCGTCGGCAATAGGATCGGTGTCTTCGGTAgtggcagcagcagcagcagccaaTGCGGCGTTGGTGGCTGCAATAGCAGCAAGTTGCAGTAACACCAGTTTGAATACGTCAACATCATTGTCGATGAACGCACCCCATATTGATAGTGATTTAATTATGGCGAGTCTTGGCGGTCCTAGTGGTGGTGGTGGTAATAGCAATCCTGGCTATGGTAACACAACTAGCACTATTAGCAACAATCCAAGTAGTAGCAGCAGCAGCAATTGTAGTCAAATGATTCATCAGCAGCAACAAAGATCGGATAATTTTGATAATCTGAATACTTTAGATTTAAGTGCAGTGACAGCTGCAGCACAAGCGGCTGCTTCAACATCAATAGTCAATAGCCGGCACACACCACCGCCGTCATCACCTACATCGACTACCTCGTCGTCCCCATcatcttcttcatcttcatcaGCTTCAACCTTGTCATTATCTGCCCAGCAACCGTCAACAACAATCAGTGCATCCGGACTACCAGCACCCACAAGCACCATTGCCACTGAACCTAAATCCTCGCATTCCCTAGCGATGGACACTATCTCATCAACATTTGCAACGAGCTCCCCAGCAACAAAACTATCTAGCAGTAATAGTAGTAGTAGTGTAAGCAATACCTTAAGCAATAAGAACAATTCGTCAGCAGTGTTACCTCCGACACCTACAACGGTGGCGGATTTCCTTCATCAGCAATTCCAGCATATGCAGAATCAAATCCGAATAACATCGCCCGCCTCTGCGATAGTGGTAAGTGCTAGCGGCAGCAATACAGAAATGAATGCTCTGCCATCTTCGGTGACAACGAGTGCACCATCGCTAAGTTCGTTAACAGCCTCTTTGGCAGCAGCGGCAGCAGCTGTTGGTAGTGGTTCTGTAACTGATCTTAGCAGCAATAGCAGTGGTGTAAAGCTGATCAATGATTTTCTTCAGCATCAGTtccagcagcagcaacagcaacaacaccaacaacagcaGCCACATTCATCATTTTCAACGTGTCCCGAACATCCCGACGTAAAAGGTATCGATTGCAAAACATGCGAAATGATCGAAATCAATATTAAATCACCAATGACGCCAACACGATCGCCAAATAGTATTAATCTGTTTCCATCGAATCTGACTTTGTCACCGACGGCAGCTGCAGCGCCTAGTTTTACAATTGGTGCATGTCCGGAGCATATAAATGGCAGGCCGCTGGGTGTTGACTGTTCGAGATGCGAAATGATATTGAATTCAGCTCGGCTTAATAGCGGCGTACAGATGTCCACACGTAACTCATGCAAAACTCTGAAATGTCCCCAATGTAATTGGCACTACAAGTATCAGGAAACATTAGAAATTCATATGAGGGAGAAGCATCCGGATGGGGAGAGTGCATGTGGCTATTGTCTGGCCGgtcAACAACACCCGCGATTGGCACGTGGAGAATCATACACCTGCGGCTATAAACCGTACCGCTGTGAGATCTGCAActattcaacaacaacaaaaggcAATCTATCGATTCATATGCAAAGCgataaacatttaaataataTGCAAGAGTTGAATAGCTCGCAGAACATGGCAAATACCCCGGCGGAAATTCGTGAATCGCCAAAAATAATAATGCCAAATATGCAGCAGCAGGCCTCGAAGCCGAAGCCGAGCTTTCGCTGTGACGTGTGTTCCTATGAAACAAGTGTGGCGCGAAATTTGCGTATACATATGACAAGTGAGAAGCACACCCATAATATGGCAGTGCTGCAGAATAACATCAAACACATTCAAGCATTTAGTTTCCTGCAATCACAAAATATCGGTCAACTGAGTGCAGCACAGAGTGCGGCGGTTGCTGCATCAAACTTGCCTAATATTCCGAATTTGCAAAATTTTCTACCTGAAGCGGCTCTAGCTGACATCGCATACAATCAAGCTTTGATGATTCAGTTGCTGCATCAGAATTCAGCAGCGGGGGCATTGAGTGCAGCGGCAGCAGCTGCAGCGGCTGCAAGTCCACTAACCCTGCCCCCACCTCAGCAGTCTCCTGCTGGATCTGGGGGAACAGTTTCGCAGCCACAAACGACTCCAACGTCACAACAACCACCGCAACAATCCTTTCAGTCGTTGCAGACATCGAAGTTAAACCATCAGTCTTCACCGCAGCAAAGTATGCCAACAGTTGCAACAACAGGAGCTCCCTCActgtcatcatcatcaacaacagCATCGTCACATCCATCCTCGTCGCAACAGCATTCTGCAGTGGCGGCAGCAGCTCTACTAGCTGAAGCAGCCGCTGCAGCTGCAGCAGCAGCCACCTCTGATCCAGCATCAATATGCCTtcagcagcaacagcaacaacaacagcaacaaattCAACAACAAGACTCATCCCTCGATCCACCAATCGATCCCGATCCAAAACCCACAACAGCCTTCAGCTGCCTCATATGCGCCAACTACAACACCAATAGCATCGACGAACTCAACAATCACCTAATGATCGACCGATCACGCAATACCAACAACAATTGCAGCGATATCATGATGATTATCAACAACAATTACATCTGTCGACTTTGCAACTACAAGACTAACCTCAAAGCCAACTTTCAATTGCATAGCAAAACGGACAAACATCTGCAGAAGCTCAACTACATCAATCACATCCGCGAAGGTGGTGTCAAAAACGAGTACAAACTCAAGTACAACCAAACCAATACCGTTCAACTAAAATGCAATTGTTGCGACTTTTATACGAACTCTATACAAAAACTGAATATGCATACTCAACATATGCGACACGATACCATGAAGATGATCTTCAATCATCTCTTGtatttagtgaatagttttaaTGTTTCTATGGGAGGTGATTCGATGGCCGTTGTAAGTGAAAACAGCGAATATCAACTTATGAGCAAAAACAAAGTTCTAATGTGTAAACTGTGTAACTATAGTGCAGTGAATATTTTACAAATTGTGCAACATGTCAAAAGTTTGCGTCATATACAAGTGGAACAGTTTATATGCTTGCAGCGGCGCAGTGAAAATCTCGAATCGCTTGGACTGGATGATGTCTTTAAAATTGCCGATAATACTGATTGTATCAAATCAGAACGTTCAAGTCCTGTACCGTCATTAGAATCTCACCAATTGTTGGTTAAAGAAGAAAACGAAAAGAGTTCCCCAATggcaacgacaacaacaactacaacaactacAGCACTAGCAGTAACACCACCAGCTACAACCACAACAGCGAAGAGCAGTCATGATATTTACATTGATTCTTCTTCCGATCACCAACAGAATGTCTCTTCATCTTCTGCTATTGCCGTCAGCAATATCGCCAACTGCAACAAAGAGCTTGCTGATGTCGATCTCTCTTCATTGCCATCAATCATATATAAGTGCAACAATTGTGATTATTTTGCTCAAAACAAGTCCGAAATGCAAAATCACATTGTCAATACTCATCAGAATGTTTCGGAAGAAGACTTTTTAACAATTCCTACAAATCCAGCGGCTTTGCATGCCTTTCATGCAGCTGTAGCAGCTGCAGCTGTTGCTGTAGCCGAAAATGCGGCGGCATCACAATCTAGAAGTAAATCATCATCACCAGTGCATATGCACATGCAGGATGATCggcaacagcaacaacatcatGATCAAGGATTTGCCACATCTGTGCGTGGGAATGAGTTGATGCCTCAAGTCAAAACCGAGCAGATGGACATCATGGATGATGCTCAGTCTAATGATGAATGTGAACAATTGGAAGATCCAATTGAACTACTGAGCTCGAGGAACTCTGCCAATGTGACAAGCTCACCGGCTGTGAAGAGTGTAATGTGTCCACTGTGTCAAGATACTTTTAGCGAAAAGAAAGCACTTGAGATGCATCTTATGGGAGTGCATAGTGTAAACAGTGATGGCTTGGCAAGACTCCTCCAGCTAGTCGATAGCAGTCATTGGTTGAATAGCAGCAGGCGAAGCAGTACAAGCACAACGCCAGAGCCCAGAAATTCGAGCACACCACTTTCAGAAGCCGGAGGGCAGTTAGCATTTCAACAaccacaacagcaacaacaaaagcaacaacagcCTGGGTCTTCTCCTGTTAACCAACATGTATTCAGTGACGAATACGCATGCGCACAGTGCGGGATTACATTCAAGCTGCAGCAACATTTGCTAATGCACGCCAACGATGCGCAGCATTATCAAATGGTCAACGAGCAATATCAGTGCTTGGCAAAACATTGCCAACAACTGTTTGGCAACTTGCTGCAAATGTTGACCCACTACAAAGATAGTCACATGAATATAGTGATTTCTGAGCGTCATGTTTACAAATATCGCTGCAAACAATGTTCTTTAGCATTCAAGACCCAGGAAAAACTCAATACTCATTCCTTGTATCACACAATGCGTGATGCTACCAAGTGCATGATATGCAACCGGAACTTTCGGAGTACACAATCCTTGCAGAAACATATGGAACAGGCACACAGTCAATTGCAACCAACCAGCCCAATTCCATCACCAAGTAGTAGTGGGGAAATCGCAAATGCCCCTGGATCTCTGTCCGTAACCCCTTCATCCAGGGCAGAAGATGAAGAAGTTACTGCCAACGCTGCCGTTACCACATCCATGACTACAGGAGCAACAACAGGTACGAATAATAAATTCAGCAACAATCAATACAATGTTGATGGTGTTGGTGATGGTCAGGACATTGTTTCtctaaaaagtagcatttatgATTCGATATCAGTATCAGTATCAGCAcctccaccaccaccaccaccaccactacCACCGCCACTGCCACCACCACCCACCGTTCTTGATATCTTGCCCAGTGACAAAAGTCAACTTGAGGACTACCTCAACTCCCAACAAATGTCTGAAGTATATTACACAGATGTCGAACGCAAACTCAAATGCCACAAATGTAAAGTTGCATACACAAATCAAAGCTATCTTTCGAAGCACTACAAGTCCAATCAGCATCGTCGCAATGAAAAGTTGAGTATTTACCCGCTGGAAAAGTTCCTCGATCCAAATCGTCCTTTTAAGTGTGAAGTTTGTCGCGAGAGTTTCACCCAAAAGAATATTCTTTTGGTGCACTTTAATAGCGTGTCGCATTTGCACAAGGCCAAGAAGCAACAAGCTGGAGGTAGCGCATCGCCAGCAAAGTCTATTCCGATGTTCTCTGACGCTATAGAAGGTCTTAGTGAAAGCAAAGGACAAACTGTGGAAATAGCTGGTGGCAGTGGCAGTGGTAGCAATGCTTGTGGTGGCGATGGAGGTATCACAGCTGTAAATAAGTCGCCATTTTCTAAGCGAAAAGTCAGCCTTGAATCTGACTATGAAAGTCCGAAGAAACGTTTCAAATGCGAAATCTGCAAAGTGGCATATGCGCAGGGCAGTACACTCGATATTCACATGAGAAGTGTCTTGCACCAGACACGCTCCTGCCGCTTGCAGGAGCAACAGGCTCAGCCATTAAAACCAATGACACCGCCTTCGTCATCGGAGAGCCCAACATCTACGACGGCAACTACCCCAACCTTAAACGATCATATGTACAAGTCCCTGCTAGAAACGTTCGGCTTTGATATTGTTAAGCAATTCAATGAGATTAACAAACTTTGTACAGTTTCAAACTCTAGAAATTATTATTGCCGTCACTGCAACAAAGAGTTCTCCTCTATTTTTGTGCTGAAAACACATTGTGAGGAGATACACAGTGAGCAGATCCCTCTTGAATTATTAGAAAAATTCGCTGAAAAGTTTAAGCATATTTATCTCGATCAAGAACCTACATCACCAACATCGTTGTTAACAATAACAAATTCAGACCAAAATCCAAaccaatgtttttcttttaatgaatCCGAAAATAATTTAGGTGCAACAACATCAACGGCATGCAGTGATAGCAATTCTCCATCACCAGTGGGAGGCGAGCCAGCAGCCGGATCAAATGAGACTTCTCTTATAGTTCCTTCTACATCAACAATTGCTGCTCCCACCAGCCCCAATACATCAGCCGTTACGGCTGTAGCTGCTGCATTGCTCAAACAGCAACAGCACCAGCAGCAACAATCTCTTACACCAGATCTTGTGCAAAAGCTCAACCTAGATCCAACAATGTTGGCTCAAAAGATAATGGAACAGAACTTTTCCAACTTTCCACCGAACTTCCCAGGACTGCCACAGAATTTACAAAGTTTACAAAGCCTTCAAAGCCTACAAAATCTGCAAAACATGCAACAAAATTTACCAAACATGGGTAACATGCCCATGAATACTCTGGATATGCTTAACCTAATGCAGTTTCATCATTTAATGTCTTTGAATTTTATGAACTTGGCACCGCCCCTGATATTTGGGGCTGGAGGAACGGGCCCAAGCCCTTCGGTAACAGGCACTACATCGGCAACACCATCTGATCTACCACCAACAGCTGCCACCCAAATTATTCAGCAGCAAACGTCATCTTCATCACAGAAGGGGTTTACATTCTCTTCACAGAATACCAGCAATCAAAAAAGAGCACGAACGCGCATTACTGATGATCAGCTGAAGATCCTTCGAGCACATTTTGATATCAATAATTCTCCCAGTGAAGAGAGCATCatggaaatgtcaaaaaaagcCAATCTCCCAATGAAGGTCGTGAAACATTGGTTCCGCAACACCCTGTTCAAGGAGAGGCAACGTAACAAAGATTCACCTTACAATTTCAACAATCCCCCGTCGACAACACTGAATCTTGAGGAGTACGAACGAACTGGCCAGACAAAGGTGACTCCTTTATCGGAAAGCGGAGGTGGCAGCATCAGTGGCTTTCACCtccagcaacagcaacaacaaagaGAACAACAGCAACGTGAACAACAGCAGCGTGAACACCAACAGCAACAAAtccagcagcagcaacaacttCAACAGCAAATTCAATCACAACATCtccagcagcaacaacaacagcgcCCGCCATCATCACAATCTAGTGATCTGAATTTTCCCCAACTGAGCttccaccaacaacaacacgaTCTCTCCCGCCAACAATTGCACCAGCAACAGGACAATCGTCCATTGTCCCATCCGTCAAGCGTTACCAGCGATCGTGGCGATATCCATATTAAGCCTGAACCGGCCGATGACATTGGTAGTTCCGACTGTGATCAACAAATGGCTATGAGCAAAGATCACGATAACGAACAATCAATAATGCAATCACACCATCAACAGTCAATGTTCTACAACAACTTCGAGACTAAATCCGAAAGTGGAAGCTCTGAGATCCTATCACGTCCCCAAACCCCAAATAGTACATCTACACCGTACAGCAGTAATATATCAGATATCCTGGGTCAGCAAATGGACAGTTTGCCGCTAAACAACATGGCCAATATAAGTAACTTGAACAATATGGGACcgccaaaaaaatttcaaatgaaCAAGATGTTCGAGAAGAGTGGGAACTTTGAAACCAATTCCAATTCGTCTAATAGTTCGACGTCGAGTGGAAAGCGGGCCAATCGTACTCGTTTTACGGATTACCAAATAAAAGTGTTGCAGGagtttttcgaaaataattcttATCCCAAAGACAGTGACTTGGAATACTTGAGCAAGTTGTTGCTGTTGTCGCCGAGAGTAATTGTCGTTTGGTTTCAGAACGCACGTCAAAAACAGCGAAAAATCTATGAGAATCAACCGAATAATACGTTCTACGAGTCCGAGGAAAAGAAACCCAACATCAACTACGCTTGCAAGAAATGTAACCTGGTGTTCCAGCGTTACTATGAACTTATCCGACATCAAAAGAATCATTGCTTCAAGGAGGAAAATAACAAGAAATCTGCGAAAGCACAAATAGCTGCTGCTCAAATTGCTCAAAGCCTCAGCAGTGAAGATTCGAATTCTAGCATCGATATCAACAGTACCAATATGTTGTCATCAAATTTGGTTGGCCCGCAAGCTGCTGCAGCGGCTGCCGCAGTTGCtgtagcagcagcagcagttgGTGGCAATGTTGGTACGGCAATTCCACCGGTGATGCCAGGCCTGGCCTCCAGTCCAGGCATTAACTTGCTTGCATCACCCCAGCACATTTTCAAGCAACAACAGGCAGTGACTTCTGTCGGAGGAAGTCATGCGGATAGCACTTCACcgcttcaaaaatttgaatgtgACAAATGTCAACTGGTCTTCAATCGCTATGAACTCTACAAAGAACATCAGCTTATCCATCTCATGAACCCCAATTTGTTTATGAACCAAAACTACAATGAATCTTCACCCTTTGGAATCCTACAAAATCTGCAGGGTAACCACAATAGTCAACAAGACACATCAATTGACTTGAGTCGACAGAAAAAACGCAAGTATTCCGATACGCAAAACTCACCCGATGAACTGCACCATCAGCAAACTGACTACGAAGCTttcaataagaaattcaaaaacgaTCAATATGAATTTCTGtatcaatattttttacaaaacgaCTCCAATTCTGATTTGAAGAAACAAtttcagcagcagcaacaacagccCGAAATGGATCTAGATTATCTGGCTAATTTTTATCAgcaaaatgaatacaaaaagctCAGCAATTACGATTTTTTATTGCAATATTACATGCGAAACGAGTCAAAACAGCCTAGTAACGCGTCCAACCTCATGTTGCTGAACGATGATGCTAATAAACCAAATATGGACTTTCTCCTTCAGTATTATCAACTCAGTGAATCGAAGAAGTTTTTTCAGTTAGAAGCCTCGCCCCAACGAATACATGATTTCCCACCGTTGCTGAATCTGACCAGCAacgcagcagcagcagccgttGCAGCAATAAACAATGGTGTATCAGCCCAGCAACATCAAGAGCAGCAACAAGTTTCAGTAATATCACCTACAGAGACACCAATGAATACCACAACCAATACCAATGTAGCGATATCATCGCCAAACAGCTCACCAGTACGGCAACAGAGCAACAACAGTGGTTGTGCTATTGGCAACAAAGATACgaccagcaacaacaaaaattgtcaaacacATCCAACAGTATTGACAGAAAATGTCAAACTTCTGTCAGCGACGGTGTCGGCATTTGGTGGTGGCAGCGGGAACGGGAGCGGGGTACCAACAGCGACATTACCAATAACAACACCGAATACAACAAATACTGGTAGTACGTCTTTCAGTACGAATTCGATAAACAACAACCGCCATTGTCCCATACTTTCACCATTGCAAACGAAACAGCAAGAAAACTCATCATTGATATCGAATCGTTTGGATGCAGctcctgctgctgctgctgccgcCACTGCATCCGAGTTCGCTGTGCAATCAATAAAACTTTCTGTTAACAACATAAGCACAAATGCGGTGTCAAAAACTTCGCCCATTGCAAATAGCATCGACATGATGGATGCCGAGCTAGGAATTACTCATCACCACCAACAGCACGACTCATCAGCAACATCCGTTTCAAGTGCTAGCCACCAACTTGAAGAAACTGTCACAACAACTGAGAAGCAAAATAGCAAAAGGCTTCGTACTACTATATTGCCAGAGCAGCTGAATTTTTTATATGAATGCTACCAAAATGAGTCAAATCCAAGTCGCAAAATGCTCGAAGAAATTTCAAAGAAGGTCAATCTCAAGAAACGTGTAGTTCAGGTCTGGTATCAAAACTCAAGAGCTCGCGAACGCAAAGGACAATTTCGTCAGAATATCcagataataaataaaaaatgtccgCATTGTGCGgccatatttaaaataaagtctgCTCTGGAGTGGCATCTGCAATCCAAGCATGGAGATAAGCAAGCTATAAATGTTGATCAAATTCCTGACTTGAAATTCTCCGATGGTTTACTAAATTTTTCAAGTTCGACACCGTACGGTATAAAAGTCGATGAGCAACACAAGGAACAGCAGAAGCTAAAAGAACAACCAAgttcaccaacaacaacaccaacaactaCAACACCACCTCCAGCAACATTGTCCCCCGTCAAATCAGCATTATGTGTCGCATCTGTTGTCACTACaaatgctgctgctgctgcttcttcGCCAGCAGTTTCGTCCGCAGTATCCTCTGTAGCCTCTTCAGCATCGATCCTTACCCTTGCCAAAGGTGGAGGAACTACTCCACTTGATCTTAGCAAGACACCTACGCCACTAGTCAATAACTTTAGCAAGTATGAACAGAGTGAAAGCGATATAAGCTTCTCTGATTCCAACAATGACCATGACGAgtctaatgatttttttaccCCTTCGTCGTCATTGAATAATGGAAATCTTGGAAATCTAGTGCCGAATAATAGGGCCAACAACTGCATAAACAACCGACGTCAGTATGATACTAATTTGATAGACGGAAATAATGTAGCAGGAGGATGTAGCTACAACGATAATAACACCATGAGCAACATAAGCGATTATCTAGGCAACGAACGTGAAAATACTAATTCACCAGTTAGCCAAACTTCAAGCAACAATAGTGCTCAACAGAAAAAGCGTTTTCGGACACAAATGAGTAACTTGCAGGTGCGCATACTTAAAACATTGTTCCATGACGTCAAGACGCCTTCCATGACGGATTGTTCAAATGTTGGACGAGAAATTGGACTGGGGAAACGTGTTATTCAGGTTTGGTTCCAAAATGCCAgagccaaagaaaaaaaatctcgcaaCCAACGATATATACACGATGAAAATACTTTTGAAAATGACAAACCAAATGTGGATTCAACTACATCAAATAACATACTCGAGATACGTGAATGCAACATTTGTCAATTGCCAAGCGTAAACATTCAAGAGCATGCTTTCTCAGCTCAACATATAGCACAGGTACGAGTGCTGCTCGAAGCAAACAGTAGCAATAAAAGTGATGACAACCAGCAATATGTTGAAAATAGTATGGAACATGAATTCAATGGTATTTATTCGAAGCTGTATACACAGCAGCAGCATCATAATAACCATAGTCATcatcataaaaatgaaaatgctgaTTCGGGACGAGCCGATACTAGTCACCATGATAAAGACGACAGTGATGTTGATTATCATTGTCCAAATATTATGACTAATGTTGATTGTGTCAACAACGACAAAAACTACGAaaacgacgatgatgatgacgatgaacatgatgatgatgatcatGATCACGATGCCGATGATCACGATGACGATGCCAAAACGAAACTTGGCATACAAAACGAGGATATGGCCTTGAGTGAGGCTAATAAGGCAGCATTGGCATTGAGAAATTTCAACAAGTTGCAGCAACATTTCGCTGCAGCTACTGCTTTTGCTAAGCAGCAACACCAATATCCCCACCAGCAACagctacaacaacaacagcagtgTGATACAATATCTTCAGCTTCACCACCAACAATGGAAAATAATTTGATGTTGAAAATTAACACGGATAATCCGTTGGCAACAAATCATTCAGAAATGTTGCAACAACTGTTTAGCTACAGTCAGATGAGTGGTGAGTCTTATATGTAG
- Protein Sequence
- MPPPSTQSESQHNEPTPRKRRRKRDDPQSCFTNSEEYESDDCSPMSCSDVESFQGKIVYNPDGSAYIIDSENESLSNIPENCMSAGTTTTTNNPKIHSFRVVTARDACVNISELNKIQKPILMCFICKLSFGNTKSFSLHANTEHTLNLQESEKLLLNREYSSAIIQRNVDEKPQISFLEPLDIQKQNQLNKLASPQTQQQHQQQHPQQQQLLISSTSSLSVNSNSNSNNNNNVSCSKLPTTLSPSSSSAIGSVSSVVAAAAAANAALVAAIAASCSNTSLNTSTSLSMNAPHIDSDLIMASLGGPSGGGGNSNPGYGNTTSTISNNPSSSSSSNCSQMIHQQQQRSDNFDNLNTLDLSAVTAAAQAAASTSIVNSRHTPPPSSPTSTTSSSPSSSSSSSASTLSLSAQQPSTTISASGLPAPTSTIATEPKSSHSLAMDTISSTFATSSPATKLSSSNSSSSVSNTLSNKNNSSAVLPPTPTTVADFLHQQFQHMQNQIRITSPASAIVVSASGSNTEMNALPSSVTTSAPSLSSLTASLAAAAAAVGSGSVTDLSSNSSGVKLINDFLQHQFQQQQQQQHQQQQPHSSFSTCPEHPDVKGIDCKTCEMIEINIKSPMTPTRSPNSINLFPSNLTLSPTAAAAPSFTIGACPEHINGRPLGVDCSRCEMILNSARLNSGVQMSTRNSCKTLKCPQCNWHYKYQETLEIHMREKHPDGESACGYCLAGQQHPRLARGESYTCGYKPYRCEICNYSTTTKGNLSIHMQSDKHLNNMQELNSSQNMANTPAEIRESPKIIMPNMQQQASKPKPSFRCDVCSYETSVARNLRIHMTSEKHTHNMAVLQNNIKHIQAFSFLQSQNIGQLSAAQSAAVAASNLPNIPNLQNFLPEAALADIAYNQALMIQLLHQNSAAGALSAAAAAAAAASPLTLPPPQQSPAGSGGTVSQPQTTPTSQQPPQQSFQSLQTSKLNHQSSPQQSMPTVATTGAPSLSSSSTTASSHPSSSQQHSAVAAAALLAEAAAAAAAAATSDPASICLQQQQQQQQQQIQQQDSSLDPPIDPDPKPTTAFSCLICANYNTNSIDELNNHLMIDRSRNTNNNCSDIMMIINNNYICRLCNYKTNLKANFQLHSKTDKHLQKLNYINHIREGGVKNEYKLKYNQTNTVQLKCNCCDFYTNSIQKLNMHTQHMRHDTMKMIFNHLLYLVNSFNVSMGGDSMAVVSENSEYQLMSKNKVLMCKLCNYSAVNILQIVQHVKSLRHIQVEQFICLQRRSENLESLGLDDVFKIADNTDCIKSERSSPVPSLESHQLLVKEENEKSSPMATTTTTTTTTALAVTPPATTTTAKSSHDIYIDSSSDHQQNVSSSSAIAVSNIANCNKELADVDLSSLPSIIYKCNNCDYFAQNKSEMQNHIVNTHQNVSEEDFLTIPTNPAALHAFHAAVAAAAVAVAENAAASQSRSKSSSPVHMHMQDDRQQQQHHDQGFATSVRGNELMPQVKTEQMDIMDDAQSNDECEQLEDPIELLSSRNSANVTSSPAVKSVMCPLCQDTFSEKKALEMHLMGVHSVNSDGLARLLQLVDSSHWLNSSRRSSTSTTPEPRNSSTPLSEAGGQLAFQQPQQQQQKQQQPGSSPVNQHVFSDEYACAQCGITFKLQQHLLMHANDAQHYQMVNEQYQCLAKHCQQLFGNLLQMLTHYKDSHMNIVISERHVYKYRCKQCSLAFKTQEKLNTHSLYHTMRDATKCMICNRNFRSTQSLQKHMEQAHSQLQPTSPIPSPSSSGEIANAPGSLSVTPSSRAEDEEVTANAAVTTSMTTGATTGTNNKFSNNQYNVDGVGDGQDIVSLKSSIYDSISVSVSAPPPPPPPPLPPPLPPPPTVLDILPSDKSQLEDYLNSQQMSEVYYTDVERKLKCHKCKVAYTNQSYLSKHYKSNQHRRNEKLSIYPLEKFLDPNRPFKCEVCRESFTQKNILLVHFNSVSHLHKAKKQQAGGSASPAKSIPMFSDAIEGLSESKGQTVEIAGGSGSGSNACGGDGGITAVNKSPFSKRKVSLESDYESPKKRFKCEICKVAYAQGSTLDIHMRSVLHQTRSCRLQEQQAQPLKPMTPPSSSESPTSTTATTPTLNDHMYKSLLETFGFDIVKQFNEINKLCTVSNSRNYYCRHCNKEFSSIFVLKTHCEEIHSEQIPLELLEKFAEKFKHIYLDQEPTSPTSLLTITNSDQNPNQCFSFNESENNLGATTSTACSDSNSPSPVGGEPAAGSNETSLIVPSTSTIAAPTSPNTSAVTAVAAALLKQQQHQQQQSLTPDLVQKLNLDPTMLAQKIMEQNFSNFPPNFPGLPQNLQSLQSLQSLQNLQNMQQNLPNMGNMPMNTLDMLNLMQFHHLMSLNFMNLAPPLIFGAGGTGPSPSVTGTTSATPSDLPPTAATQIIQQQTSSSSQKGFTFSSQNTSNQKRARTRITDDQLKILRAHFDINNSPSEESIMEMSKKANLPMKVVKHWFRNTLFKERQRNKDSPYNFNNPPSTTLNLEEYERTGQTKVTPLSESGGGSISGFHLQQQQQQREQQQREQQQREHQQQQIQQQQQLQQQIQSQHLQQQQQQRPPSSQSSDLNFPQLSFHQQQHDLSRQQLHQQQDNRPLSHPSSVTSDRGDIHIKPEPADDIGSSDCDQQMAMSKDHDNEQSIMQSHHQQSMFYNNFETKSESGSSEILSRPQTPNSTSTPYSSNISDILGQQMDSLPLNNMANISNLNNMGPPKKFQMNKMFEKSGNFETNSNSSNSSTSSGKRANRTRFTDYQIKVLQEFFENNSYPKDSDLEYLSKLLLLSPRVIVVWFQNARQKQRKIYENQPNNTFYESEEKKPNINYACKKCNLVFQRYYELIRHQKNHCFKEENNKKSAKAQIAAAQIAQSLSSEDSNSSIDINSTNMLSSNLVGPQAAAAAAAVAVAAAAVGGNVGTAIPPVMPGLASSPGINLLASPQHIFKQQQAVTSVGGSHADSTSPLQKFECDKCQLVFNRYELYKEHQLIHLMNPNLFMNQNYNESSPFGILQNLQGNHNSQQDTSIDLSRQKKRKYSDTQNSPDELHHQQTDYEAFNKKFKNDQYEFLYQYFLQNDSNSDLKKQFQQQQQQPEMDLDYLANFYQQNEYKKLSNYDFLLQYYMRNESKQPSNASNLMLLNDDANKPNMDFLLQYYQLSESKKFFQLEASPQRIHDFPPLLNLTSNAAAAAVAAINNGVSAQQHQEQQQVSVISPTETPMNTTTNTNVAISSPNSSPVRQQSNNSGCAIGNKDTTSNNKNCQTHPTVLTENVKLLSATVSAFGGGSGNGSGVPTATLPITTPNTTNTGSTSFSTNSINNNRHCPILSPLQTKQQENSSLISNRLDAAPAAAAAATASEFAVQSIKLSVNNISTNAVSKTSPIANSIDMMDAELGITHHHQQHDSSATSVSSASHQLEETVTTTEKQNSKRLRTTILPEQLNFLYECYQNESNPSRKMLEEISKKVNLKKRVVQVWYQNSRARERKGQFRQNIQIINKKCPHCAAIFKIKSALEWHLQSKHGDKQAINVDQIPDLKFSDGLLNFSSSTPYGIKVDEQHKEQQKLKEQPSSPTTTPTTTTPPPATLSPVKSALCVASVVTTNAAAAASSPAVSSAVSSVASSASILTLAKGGGTTPLDLSKTPTPLVNNFSKYEQSESDISFSDSNNDHDESNDFFTPSSSLNNGNLGNLVPNNRANNCINNRRQYDTNLIDGNNVAGGCSYNDNNTMSNISDYLGNERENTNSPVSQTSSNNSAQQKKRFRTQMSNLQVRILKTLFHDVKTPSMTDCSNVGREIGLGKRVIQVWFQNARAKEKKSRNQRYIHDENTFENDKPNVDSTTSNNILEIRECNICQLPSVNIQEHAFSAQHIAQVRVLLEANSSNKSDDNQQYVENSMEHEFNGIYSKLYTQQQHHNNHSHHHKNENADSGRADTSHHDKDDSDVDYHCPNIMTNVDCVNNDKNYENDDDDDDEHDDDDHDHDADDHDDDAKTKLGIQNEDMALSEANKAALALRNFNKLQQHFAAATAFAKQQHQYPHQQQLQQQQQCDTISSASPPTMENNLMLKINTDNPLATNHSEMLQQLFSYSQMSGESYM
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00974464; iTF_00663877; iTF_00664363; iTF_00335368; iTF_00334789; iTF_00426954; iTF_00427563; iTF_00665459; iTF_00666004; iTF_00694966; iTF_00694414; iTF_00426171; iTF_00426598;
- 90% Identity
- iTF_00974464;
- 80% Identity
- iTF_00974464;