Mpol036819.1
Basic Information
- Insect
- Microchrysa polita
- Gene Symbol
- zfh2
- Assembly
- GCA_949715475.1
- Location
- OX454422.1:8737431-8785633[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 18 0.97 98 4.9 0.6 2 23 120 142 119 142 0.94 2 18 0.00038 0.038 15.6 1.4 2 23 613 635 612 635 0.96 3 18 9.4e-05 0.0095 17.5 0.7 1 23 667 691 667 691 0.92 4 18 0.03 3 9.6 0.4 1 22 738 759 738 762 0.91 5 18 3.7 3.7e+02 3.0 1.9 1 23 982 1006 982 1006 0.91 6 18 1.1 1.1e+02 4.7 1.3 1 23 1276 1299 1276 1299 0.94 7 18 2.7 2.7e+02 3.5 0.0 3 23 1388 1409 1387 1409 0.94 8 18 2.7 2.7e+02 3.5 0.1 2 21 1481 1500 1480 1504 0.90 9 18 0.72 73 5.3 0.3 1 23 1512 1537 1512 1537 0.94 10 18 0.0032 0.33 12.6 2.4 1 23 1550 1572 1550 1572 0.97 11 18 0.00085 0.086 14.5 1.6 2 23 1579 1601 1578 1601 0.94 12 18 0.69 70 5.3 9.3 1 23 1700 1724 1700 1724 0.91 13 18 0.12 12 7.8 0.4 1 22 1745 1766 1745 1769 0.89 14 18 0.015 1.5 10.6 0.1 1 23 1845 1869 1845 1869 0.93 15 18 2.9 3e+02 3.3 5.0 1 19 2005 2023 2005 2028 0.91 16 18 0.00022 0.022 16.3 2.8 1 23 2693 2715 2693 2715 0.98 17 18 0.0015 0.15 13.7 1.4 1 23 2841 2863 2841 2863 0.98 18 18 0.00034 0.034 15.7 1.0 2 23 3371 3393 3370 3393 0.97
Sequence Information
- Coding Sequence
- ATGCCGCCTCCTGATTCACCACATGCCGATTTGGCACCGAGAAAAAGACGTCGTAAACGTGACGATCCGCAAAGTTATTTTACAAATTCAGAGGAGTACGAAGAATCGGAAGAAGGCTCGCCCGTGTCGTGTTCGGATGTGGAAAGTTTTCAGGGTAAAATCGTGTATAATCCGGACGGTAGTGCGTATATAATTGATTCGGAGAATGAATCGTTGTCGAATCTATCGGAAAGTGGTTTAACAGCTGGTCTGCCGACAACAAATAATCCAAAAATTCATTCATTTCGTGTGGTCACGGCACGTGATGCAACTATTAATAAAATCTGCGaaccaaataaaattcaaaagccCATTTTAATGTGTTTCATCTGCAAATTGAGCTTTGGCAATACAAAATCATTTAGTTTGCATGCCAACAGCGAACATACACTGGATTTGCATGAATCGGAAAAGTTACTATTGAATCGTGAATATTCGAGTGCCATTTTACAACGTAACGTTGACGAGAAGCCACAAATATCATTTTTGGAGCCTCTCGACGTGCATAAGCAAATGtttcagcaacaacaacaacaacaaataatgTCGTCGTCGTCACAATCGTCGCAAGGACATCCAGCGGTGACGCAACATTATGCAAATTCCGTCGTATCGTCGTTGATTGATGAGTGTACAACAGCGGCAGCATCAGCGACCGCCGCGGCTGCTGCATCATCGTCATCCAGCTGTAAAACAACAACCGTCACATCCAGTAATAATCCTAGTTCGACAACAATATCGGCTGCCGCCAATAGTCAAATGTTGGATACAACAACCGCAACAATAGCATCATCCTTACTATCATCAACAACAACACCAGCATCATCCTCATCATCCGCGTCATCATCGACGTCATCATCAACGTCATCACCAACGTCATCGGCGGTGTCGcccatacaacaacaacaacaagccaCACTTACGGCCGCCGCCTCCGCCGTGACTTCACCAACAATTTCATCACCCTCGACATCATcaacaataataaataaattaaatagtaGTGATATAACGACTAGTAGTACGTCTATTCTAAATACAAAGACTAATGCCGATGGTTGTAGTAGTACGTCAAATATTGCCGGACAAAATTTGAGCGATTACGTTCTtcatcaacaacaacaaaagacACCGCTCTCGTTGTCACCAACAACAACACCAGCAATCGGATTGGGTACGAGTCAAATatcgccatcatcatcatcatcattgtcaGCTCCATCGACAGCAACACAACATCATACACCAACGACAACAACCACAACACAACATGCCGATATCAGTGGCACAAATCCGGGTGATTTATGTACCagtaatagtaataataatagtaatagtAGCAATACGTTAATGTTAAATGAGttcctacaacaacaacaacaaatacttcaacaacaacaacaccacCAACAATCGTCTACGTCCGCCGCGTCCGTCGCGACCACGCCCACATTGCTAGCATGTCCCGAACATCAAGGCTACAATAAAATGTCGGCGGGTCTGGTGGATTGTAAATCATGCGAGCTGATCAATATCAATAATATGACAACAAAAGTTTCACCAATGACACCAACAAAATCACCAAACAGTATTAATCTTGCCACGCCGCCATCATCGACATCGCTAACATCCATTTCACCAACAACACCGGCGCCTAGCTTTACAATTGGTGCATGTCCGGATCATATTAACGGACGACCAATTGGTGTTGATTGTGCACGATGCGAAATGATCCTAAGCTCGGCACGACTGAATAGTGGCTCACAAATTTCCACCCGAAATTCGTGCAAGACATTGAAATGTCCGCAATGCAATTGGCACTATAAATATCAGGAGACGCTGGAAATACATATGCGCGAAAAACATCCGGACGGTGAGAGTGCATGCGGCTATTGTTTGGCAGGTCAACAACATCCGCGTCTGGCACGTGGCGAATCGTACACGTGCGGCTACAAGCCGTATCGCTGTGAAATTTGCAACTACTCCACAACAACGAAAGGCAATCTCTCCATACACATGCAAAGTGATAAACATCTCAACAATATGCAGGAGCTGAACAACTCACAAAACCTTTCAGCCGCGGCCGCGGTGAGTGGCCAAGATCTACGCGATAGTAGTGCAAAAATCGTTATGCCAAATATGACACAGCCATCGTCCAAGCCGAAGCCCAGTTTTCGGTGTGACGTGTGCTCGTACGAGACAAGTGTTGCGCGCAATTTGCGCATACACATGACCAGTGAAAAGCACACTCACAACATGGCCGTGCTGCAGAACAACATGAAACACTTTCAGGCGCTCAGCTTTTTGCAAAGTCAAAATTTGGGTGCTGCGGCGAATTTAACGCAATTACCCAGCATGCAACAGAACATGCCCAATTTGCAGAATTTCTTACCGGAAGCCGCTCTCGCCGATCTGGCCTACAATCAGGCACTGATGATACAATTGTTGCATCAAAATTCCGCCGCCGGAGCATTGGGTGCAGCCGCAGCGGCCGCTGCAGCAGCTGCACAAATGCCACAACCAGCGCAGCAATCCGGTCAACAGTCGACCGCGAATCAGCCGCAATTGAACGTGCCGTCATCGACGACGGTTAGTGATCGATCTGGAAGTGCGGGATCTGCACCATCCGCTACGTCAGCTTCAAACGCTGCCGCCGTAGCAGCCGCTGCACTTCTTGGCAGTTCGCTCGAACCCGATCAAGGATTGAACGTCGAAACGCTGGAGCCGCCAATTGAACCCGATCAACGTCCAACAACCGTTTTCAGTTGCCTCATTTGCTCCGCCTACAACACGAACAGCATCGACGAACTGAACAATCATCTGATGATTGATCGTTCGCGTACGTCCAACAATTCGCCCAACAGTTGCAACACCAGCGACATTATGCTGATCATCAACAACAATTACATCTGCCGGCTGTGCAACTACAAGACCAATTTGAAAGCCAACTTTCAACTGCACAGCAAAACGGACAAACACATCCAGAAATTGAACTACATCaatcatatcaaagagggcgGTCCGAAAAATGAGTACAAACTAAAGTACAACAACACGAATACCGTTCAACTGAAATGCAACTGTTGCGATTTCTATACGAATTCAATACAAAAGTTGAATTTGCATACGCAACATATGCGTCACGATACgatgaaaatgattttcaatCATTTGATTTATATGGAACAGTTGCATGAGTTGACGCAACAGCATCATCGCGGTGCCACGTCGACGCCAAACAATTTAGTTGACGGTCAACAGCAACGATTGAATTCAACTTTGGTCGATCTAGCGACTAGTGCGTGCGGTGCAGGTGGTGGTGATAGTGTCAGTGAATCGAATCAGCATCAATGCACGATTGTGTGTGAATTGTGTAATTTTAAGGCGCCAAGTGTTTTGCAAATGATACAGCATGTGAAAAGTTTACGACATGTACAAATTGAACAGTTCATATGTTTGCAGCGACGCAGCGAGAATTTGGAGCCGTTGGAGTTGAGTGATGTGTACAAGATTGTTGATGGTGGTCTCATTAAATCGGAAAAATCATCGCCAATCAATTCACCGATAGATCCACTAATGGGCGTCCGTACAACCGGTCCCCACACTCCACTGAATACGCCGCGTGACACAATCACGGCACGCTCCTTGGCCGGTCTGCTTCAGTCACCGTCGGCCTCAAGTCTTCCATCGGTctcatcgtcatcgtcatcatcgATCACGACAACACCCACGACGACGGCTACACTGGCTGCCAGTGGTGGCCCGCCATCTCTAACTACGACAATTTTCAAATGCCAGACATGCGACTATTTTGCCGAAACCAAACACGAAATGGAACTGCATTTGGACAACATTCATCCGGAACGTCCGGACACGGACTATATTGCAATTCCGACCCATCCCGAGGCACTGCAAGCATTCCAAGCGGCTATGGCAGCTGCGGCCGCTGCAGCGGCAGCTTCCAGTCGCATTACGCCGCCCAATAATTTGATTGGAGATATCAAGACGGAACGTATGGATGCCGATGCGGATTCGGTGACGGATGAAAGTCCGGCCGATGGCGATCCGAATGATATGGATGCGGACGCATCGGAAACGACCGTGATGCAACCACGAATTCTGTGCCCGTTGTGTCAGGATGGCTTTGGCGATCGCAAGTCGTTGGAGACGCACGTGATGGGTGTGCACAGTGTGAATAGTGACGGACTGACACGCTTGTTGCAATTGGTCGATACTAGTCATTGGTTAAATAGTACAAGCAGCGCAACAGCGACGAAAAAGGACAGCAGCGGGACCAGCGAAGATAGTGCCCTCGATCGTCCCGCCAATCATGCAGCCAACAATTTCTATATGCAACAATTGCGCACGGGTAGCAGTGCTGCCGGCTACATTGACGAGCTTGAGTGCGCCCAGTGCGGGGTCGCAGTGAAGACGCACCAGGAATTGCTTGCACACGCCCAGGACACACAACACTACCCGACCATCAACGATCAGTATCAGTGCTTGCTGAAGCAATGCCAGCAGAATTTCGTGAATGTTGCCAACATGATTGCCCACTACAAGGACTCGCATATGAATATTGTGATATCGGAGCGTCATGTGTACAAGTACCGGTGCAAGCAATGTTCGTTGGCGTTCAAGACGCAAGAAAAGCTCAACACGCACTCGATGTATCATATGATGCGCGACGCGACCAAATGCATGATCTGCAATCGTAGCTTCCGTAGCACGCAATCGCTGCAGAAGCACATCGAGCAGTCCCACAGTCAGCAACAGTCCGGCGGCGGCAGTCCATCTCCTCCGCCAGCCACCGATGCTCTGGACCAGGCCCGATCCAACGAAGATTTCCTTACATCGCCCACGCCCAGTCTGAAGCAGGACGATGACATGAACATGACCCAACCGCAGGATCTTATCATGCGTGACGATCACATGCTGCATCATGGCTCGTTCATGGACGGCAGCAGTAGTACGTCCACCACGCCCACGACTGAAATGGATGAATGTCTGAACACGGCCCAAATGGCTGAGGACGGCTACAATGACCCGGGCCGTAAGCACAAGTGCCACAAGTGCAAGATGGCCTTCACACACCAACACTTCCTCGCGCTACACTACAAGTCCAATCAGCATCGACGCAACGAAAAGATGAACAATTACCCGATGGAGAAATATCTCGATCCGAATCGTCCCTTCAAGTGTGAAATTTGCTTGGAGAGTTTCACGCAAAAGAACATCCTGCTGGTGCACTACAACAGCGTATCGCATTTGCACAAGCTCAAGAAGCAGAGCGAGAACAACAACACGCCCTCGTCGTCGCCGAACAATAATCCGTCGCCGATGGATTGTGATCGTAAGAGCGTCGATTCGGATCGACGCAGCGTCGAGCACGATCGCAAGTCGGTTGATCTCGAGCTGGAGATGCTGGGCGGTGGCGATCAGCAGAAACgcaaactgagtcctgaaaatgaCTATGACAGTCCAAAGAAGCGCTTCAAGTGTGATATATGCAAAGTGGCATATGCACAGGGCAGCACTCTTGACATACATATGCGAAGCGTGTTGCATCAGACACGCGCGTGTCGTCTGCAAgaacagcagcaacaacaacagcagcaacagcaacagTTGCAAAATGCCATGCAATTTAATCCGGCTCTTACGTCGAGCCTGGCGCGCATCGTCGAACAACAACAGCTGCAGCAAAGCATCAGCCCGACGCCGTcgaacttgagcggcgcgggcGGTAGTCTGGCTGACCAAGACTCAAacatgccgaaattcaataaccAGATCTATAAAACTCTGCTGGAAAATTTCGGCTTTGATATCGTCAAACAATTCAACGAACTGAAACATGCCGGACCGGGTGGCACCGGGCCGATAAGTACACTGGGTCCGGTCAGCACTCAACTGGCGGCCGATACACGCGCCGATCTGACACGCGCCACCAATGAACTGTCCGACGAAAAGTACTTTTGTCGTCACTGCAAAAAGACCTTCTCGTCAATTTTCGTGCTAAAGTCCCACTGCGAGGAGATTCATAATGACAAAATTCCGCTCGACGTGCTCGAAAAATTTGCCGAGAAATTCAAGAATTACTATCTGGAAACGGGCGAAGTTGACAATGAGATTTTGGACTTTTCGGCGAAAAAGCTTTCGTCGGAATCGTCCGGCTCGCCGGTGTCCTCTGTAAACGGAGCCACGGCCGGTGGTGGTGTCAAGGAGCCAAAATCTTCAACGACACCACTGTTGGCAGCCGGTCTTTCGCCGCCATTGACCGCCACTTCGCCGGCACTCGTGAAGCAACAtccgcaacaacaacaaccgccACCACTACCACCACCGATTGCATCGTCCCCGTCGccgcaacaacaacaagcacaCTCAACATCGCAGTCTGCCGCCGCGGCTGCTGCCGAACTGGCCAATCAGGCCAGTAAGTTCAACATCGATCCCGCCGTACTTGCGCAAAAGATTATGGAACAGAATTTGGCCAATGTTCCGCCAAATTTCGCTCAATTGCCGCAAAACTTGCAAAGTTTGCAAAACTTGCAGAGCTTGCAGAATTTGCAAAATTTGCAGAATATGCCGAATTTGCCGCTCAACACACTCGATATGATCAGTTTGATGCAGTTTCATCATTTGATGtcgttgaattttatgaatttggcaCCGCCATTGATTTTCGGAGCGACCGGAGCGGCGGCTGCGGCCGCAGCAATGGCTGGCGGTAGTGTAGCGGGTCCAGTGACGCCAGAGTTGCCGGTTCCAGGGACGCCACAACAGCAAGTGCAATTGTTGCAACAGCAGGCTGCCGCGGCAGCGGCAGTGCAGCAGCAAGCGACGAACAACCAAAAGCGTGCCCGTACCCGCATCACAGACGAACAATTGAAAATTCTTCGCGCCCATTTCGATATCAACAACTCTCCCAGCGAAGAGAGCATCATGGAAATGTCCAAGAAAGCCAATTTGCCCATGAAGGTGGTCAAGCACTGGTTTCGTAATACACTTTTCAAGGAGCGTCAACGCAACAAAGACTCACCCTACAATTTCAACAATCCACCCTCGACCACTCTCAACCTCGAGGAGTACGAACGTACTGGTCAGGCAAAAGTGACGCCACTGGTCGAACAGGATCGCGATCACCAAATGCGCGAACTTCAATCCGAACAGCGACCACATTCGCAGCCGTCCAGCACGGCCAGTGAGCGTGGTGGCATTGGTGAGCGTGGGGGCGGCAACAGTGGCAGCATCATCGGTGAGATTCAAATCAAAACCGAACCAACCGATGATTTAGGATCGACCGATTCCGATCAACTGcaccatcatcatcagcatCAGATGCATCAACAGTCACATCATTTGTTCAAGCACGACCAACAGTCGCACGACTCGGATTTGAACACGGGAGCTGGTGGGGGTGGCTCGGGCTCGTCCGGAGGTGGCGGCCCGGTCAATTCGTATCACCACAGTCAACAGCAACAGCTCTTCTGCAATAATTTCGAAACCAAGTCGGAGTCGGGCAGTTCGGAAATCTTATCGCGACCGCAAACTCCGAATAGCACATCATTTCCCGGCCTAGGCGATATGGGCAATCCACAAATGGATGCTATGACGCTCAGTGCACTGGGTGCAGCCGGTGCTGGCACCGGTTCCGGCTCGGCGAATAGTATGGCCGGTGTTGTCGGCAACTTGAACAATATGGGACCGCCAAAAAAGTTTCAAATGAACAAAATGTTTGACAAGAACAACTTTGAGACAAACTCGAATTCGTCGAGTGGCTCGTCGTCGAGCGGTAAACGTGCGAATCGTACCCGCTTCACCGACTATCAGATCAAAGTGTTgcaggagtttttcgaaaacaaTTCGTATCCGAAGGACAGCGATCTGGAGTACTTGAGCAAATTGCTGTTGCTCTCGCCCCGAGTCATTGTCGTTTGGTTTCAGAATGCTCGCCAGAAACAACGGAAAATCTACGAAAATCAGCCAAACAACTCGTTCTACGAGTCGGAAGAGAAGAAACCAAACATCAACTACACCTGCAAGAAATGCAATCTCGTGTTCCAGCGCTACTACGAGCTGATACGACATCAGAAGAATCACTGCTTCAAGGAGGAGAACAACAAAAAGTCGGCCAAAGCTCAAATAGCCGCTGCACAAATTGCTCAATCGCTGAGCAGCGAAGATTCCAATTCCAGTATTGATATAAACAGTGCCAATCTGCTGACCACAAATCTGTCGTCAGCTCAGCAAGCAGCGGCTGCCGCTGCGGCCGCCGCGGCCGCTCTAGGCGGTCAAGTGTCCAGTCTCGCGATGCCTCAATCGCCATCCAGTTCGGCGGCCAGTCCCGGACTAGGTATCCTCTCGTCGTCGCACAATCTTTTCGGCGGAAAACATGCGGCCGCAACGACGGCCGGTCAAGCGACCAGCGCCGGCGGCGGCGGTGCATCTGTCAAGGACAACAGCAACAGTCAAAAGTTTGAATGCGACAAATGTCAATTGACGTTTTCGCGCTTCGAGCTCTTCAAGGAGCATCAACTCATCCACATCATGAATCCAAATTTATTTCTGAATCACAGCTATGCTGAAAACACGCCATTTGGCATACTGCAAAATATGCAGAACAGTTTGATGAATGCCGGCGGCGTTATGTCGGCCGCCGATCAGCAAGATGCCAGTATGGATTTGTCGCAATCCGCCTCCTCCACGACAACGACCAACAAGAAGAAGCGCAAATATTCGGAGACTCACTCACAGTCCGGCGACGGTGATCTGCAATCAATGATGGCCGGCGCCGGCGGTCTTCAGTCCGCTGCGGAGTACGAAATGGCGAAAAAATTCCGTTCGGATCAGTATGATTTTTTATATCAATATTTTCTGCAAAATGAGCCCAACGACGAGCTGAAAAAGCAATTTCAGCAGCAGCAACATCAGCAAAACAtcgacattgaatatctggcgCATTTCTACCAAATGAACGAGCTGAAGAAGAAGGGTAACTATGACTTCTTCTATCAGTACTATCTGCAAAACGAGGGCAAACAACAGCAGCCCGACAGCCCCGGCTCAATGCTGGACGTGAAGCCACAAATGGAGTTTCTTCTGCAATACTATCAACTCAACGAATGCAAGAAGTTTTTTCAGTTAGGTGCCTCGCCCCAACGAATACATGATTTTCCATTGCTCAATCTGACAGCGGCAGCAGCGGTGTCAGGATCGGATACAACAGCACAATGCTCAACGACATCATCCACAACGAACAACGTTCACGATCTCGTTTCAACGACATCGACGAATCATATTCGTGGCAACGGAATAAATCACGTGACGAAGGATATCGGCTCGACAACAACTGCATGCGTTACATCGCCCACAACAGCGGCATTAGCAGTAGCATCGGTCGAGACTGCTTCAACAACACCAACATCCACTTTcaaccacaacaacaacaataccaACAGCAACAGCTGCACTATCAATTCGCCACAACATCAGGCAGCACAAGCTGCAGCCGCCGTTGTTGCAGCAACGATACTAGCAGCTGGTGGGAGCGGGGTAGCCGGTCAACAACAAAAAGATCAACAACAACAGCTTCAGCAACAACTACAACAGTCACACAATTCAATCCTACCAGCCACaacaccaacaacaacaacagccacAACCATTGGTGGTAGGCTTAGTCCACAATCGAGTAAATGTCTTCCAATCGTTTCATCGTTGGCATCGAGCCTGCAAAAAGCTGCCAACAACAATCAATCGTCGGCCCAATCGTCGCCGAGCTCATCGGTCAATTTGGCGGACTACGGAAACGCCGCAGGGATCGGGGAACACATCACGACAATGGAGAAGCAGACAAATAAGCGTTTGCGAACCACAATCTTGCCGGAACAGCTCAACTTTTTGTACGAATGCTATCAGAATGAGTCGAATCCTAGTCGAAAAATGCTTGAGGAGATCTCGAAGAAGGTCAATCTGAAGAAACGTGTTGTACAGGTTTGGTATCAAAACTCTCGGGCACGCGAACGTAAAGGACAGTTTCGCCAAAACTTGCAAATTATCAACAAAAAGTGTCCACATTGCGGGGCCATATTCAAAATAAAATCCGCTCTCGAGTCGCATTTACAAACAAAACATCCGGAAAAGCAACCGATCAATATCGATCAAATACCGGACGCTACACCAACGGTCACAACAACGCACGCGCAGCCGCAACAGCAACCATCGCTGGAGAGTAGTGGCTCATCGCCATCGCCATTTGCTTCGTTGCAAGCCAATCATCATCATGATTTGCTCAATAATTTGCATCGAACCGCCACTTCGGCGAATAGTTTCACCAACGCGATGACGAACAGTGCCGCGACGCATCGCTCCGAAATGAGTTTCACGACGAATAACGATGATAATGTTACGTCAACGGTGCTTGAGACGATGATCACCAACACCAATGCTGGTCGACAGCTGGGCACGCCCCTCGATCTGAGCAAGACTACGCCAAAATATGAGCCCAGCGAGAGTGATGTCAGCTTTTCGGATTCCAACAACGATCACGATGAGGCCgatgatttttttattacaacCACAACTACACCAACACCGGTCAACAACAATCATTCAATTCTCAACAATAATCACGTGAAAATTCAGGATCTTGTTAATTATCGTCACAATATGAATTCGACACCACCTCCTCCGTCATCTGCGGTGACCACAACGACGACCACTCGATCTTCACCAATGATTGGTGAAACTTGTGGTGGCACGAACAACAACGTGTACTACGACGTTGATAGTCGCGCGTCGCATACGAATGTTGGTGAAAATGTTGGTGAAACATTGTTTGGTGTTGGCTCGGGTGTGGGCGTCGATCTTATGGATCATGCACCCTCGAGTCCGGCCAGTAGCACACAAAGTCTGCAGAAAAAACGATTTCGCACACAAATGAGCAATATTCAGGTGCGCATATTGCGTGCACTCTTTTCGGACGTGAAAACACCAACGATGACCGATTGTGAGAATATTGGACGTGAAATCGGTTTGCCGAAACGTGTCGTACAGGTCTGGTTCCAAAATGCCCGTGCCAAAGAGAAGAAGAGTCGCAACAATCGCAACTATTCCGACGATTCTTACGAAATGATCAATGGTGGATCGGATCTGACGACCGGCAGCGATTTGACCAATAGTAACGCTGCAACCATGTCGGACGATTGTCGCGTGTGTAACGTGACAAAAGTGAATATGCAAGAGCACGTCTTCTCCGCCTCGCACATAGCCCAGGTACGCCTCGCGTTGGAGAATGGTGAAAATCTTTTCGCCACGAACGGCAACAATGCCGGCGCCGGCGGTTGTTATAGCGGTAGCGAATCAAATGATCAACACGATGATCAGAATAATTCAGCACGACATCATCTCAAGTCACCAAATCACCAATCCCACCAGTCGACTGGCCATCAACGCCACTCGTCCTCGTCATCGTCCGATGCCAGTGGCACCGGCGCCGGTACAGAACCGCATGACGCAACCAGTGTGCAAATTTACAATCAACTTCTTATGCAAAATCCAATGTTTGCCGCTGCTGCGGCAGCATCGTCGTCATCAAATCAAAACGGATCGACGTCTTCGGATCCGATTGCCAAGAAAATGTTGTTTGCCAACGGTCTTGAGTCGTCGGACAATAGCATGGGTGGCGGCGAGATGGACGGTGGAGAATTGAGTAGTGCAGCAAATGCAGCCATGACTGCCGCCGCGGCCATGATGAATTTCAGTCAAATTCAACAACATCTGGCAAGTGTGCAGCAGCAGAACCAGCAAACATCATCGGACAAATTGTCGGCCGATTTACCAAACGGTGTCACGGATGGTGGCGATTCAAATGGAACTATACAAGCGGCAGCCGCTGCAGCGGCCGCTGCACATCGTCTTCTTATGGGCAACAATCTACTGTTGCAAATCAATGCCGATCAACGATCGTTGACGCAAGCCGCCAATCCGTCGGAGAGTCTGTTGCAGCAAATCTACAATTACAATCAGATGAGCGGTGAGTGTTGA
- Protein Sequence
- MPPPDSPHADLAPRKRRRKRDDPQSYFTNSEEYEESEEGSPVSCSDVESFQGKIVYNPDGSAYIIDSENESLSNLSESGLTAGLPTTNNPKIHSFRVVTARDATINKICEPNKIQKPILMCFICKLSFGNTKSFSLHANSEHTLDLHESEKLLLNREYSSAILQRNVDEKPQISFLEPLDVHKQMFQQQQQQQIMSSSSQSSQGHPAVTQHYANSVVSSLIDECTTAAASATAAAAASSSSSCKTTTVTSSNNPSSTTISAAANSQMLDTTTATIASSLLSSTTTPASSSSSASSSTSSSTSSPTSSAVSPIQQQQQATLTAAASAVTSPTISSPSTSSTIINKLNSSDITTSSTSILNTKTNADGCSSTSNIAGQNLSDYVLHQQQQKTPLSLSPTTTPAIGLGTSQISPSSSSSLSAPSTATQHHTPTTTTTTQHADISGTNPGDLCTSNSNNNSNSSNTLMLNEFLQQQQQILQQQQHHQQSSTSAASVATTPTLLACPEHQGYNKMSAGLVDCKSCELININNMTTKVSPMTPTKSPNSINLATPPSSTSLTSISPTTPAPSFTIGACPDHINGRPIGVDCARCEMILSSARLNSGSQISTRNSCKTLKCPQCNWHYKYQETLEIHMREKHPDGESACGYCLAGQQHPRLARGESYTCGYKPYRCEICNYSTTTKGNLSIHMQSDKHLNNMQELNNSQNLSAAAAVSGQDLRDSSAKIVMPNMTQPSSKPKPSFRCDVCSYETSVARNLRIHMTSEKHTHNMAVLQNNMKHFQALSFLQSQNLGAAANLTQLPSMQQNMPNLQNFLPEAALADLAYNQALMIQLLHQNSAAGALGAAAAAAAAAAQMPQPAQQSGQQSTANQPQLNVPSSTTVSDRSGSAGSAPSATSASNAAAVAAAALLGSSLEPDQGLNVETLEPPIEPDQRPTTVFSCLICSAYNTNSIDELNNHLMIDRSRTSNNSPNSCNTSDIMLIINNNYICRLCNYKTNLKANFQLHSKTDKHIQKLNYINHIKEGGPKNEYKLKYNNTNTVQLKCNCCDFYTNSIQKLNLHTQHMRHDTMKMIFNHLIYMEQLHELTQQHHRGATSTPNNLVDGQQQRLNSTLVDLATSACGAGGGDSVSESNQHQCTIVCELCNFKAPSVLQMIQHVKSLRHVQIEQFICLQRRSENLEPLELSDVYKIVDGGLIKSEKSSPINSPIDPLMGVRTTGPHTPLNTPRDTITARSLAGLLQSPSASSLPSVSSSSSSSITTTPTTTATLAASGGPPSLTTTIFKCQTCDYFAETKHEMELHLDNIHPERPDTDYIAIPTHPEALQAFQAAMAAAAAAAAASSRITPPNNLIGDIKTERMDADADSVTDESPADGDPNDMDADASETTVMQPRILCPLCQDGFGDRKSLETHVMGVHSVNSDGLTRLLQLVDTSHWLNSTSSATATKKDSSGTSEDSALDRPANHAANNFYMQQLRTGSSAAGYIDELECAQCGVAVKTHQELLAHAQDTQHYPTINDQYQCLLKQCQQNFVNVANMIAHYKDSHMNIVISERHVYKYRCKQCSLAFKTQEKLNTHSMYHMMRDATKCMICNRSFRSTQSLQKHIEQSHSQQQSGGGSPSPPPATDALDQARSNEDFLTSPTPSLKQDDDMNMTQPQDLIMRDDHMLHHGSFMDGSSSTSTTPTTEMDECLNTAQMAEDGYNDPGRKHKCHKCKMAFTHQHFLALHYKSNQHRRNEKMNNYPMEKYLDPNRPFKCEICLESFTQKNILLVHYNSVSHLHKLKKQSENNNTPSSSPNNNPSPMDCDRKSVDSDRRSVEHDRKSVDLELEMLGGGDQQKRKLSPENDYDSPKKRFKCDICKVAYAQGSTLDIHMRSVLHQTRACRLQEQQQQQQQQQQQLQNAMQFNPALTSSLARIVEQQQLQQSISPTPSNLSGAGGSLADQDSNMPKFNNQIYKTLLENFGFDIVKQFNELKHAGPGGTGPISTLGPVSTQLAADTRADLTRATNELSDEKYFCRHCKKTFSSIFVLKSHCEEIHNDKIPLDVLEKFAEKFKNYYLETGEVDNEILDFSAKKLSSESSGSPVSSVNGATAGGGVKEPKSSTTPLLAAGLSPPLTATSPALVKQHPQQQQPPPLPPPIASSPSPQQQQAHSTSQSAAAAAAELANQASKFNIDPAVLAQKIMEQNLANVPPNFAQLPQNLQSLQNLQSLQNLQNLQNMPNLPLNTLDMISLMQFHHLMSLNFMNLAPPLIFGATGAAAAAAAMAGGSVAGPVTPELPVPGTPQQQVQLLQQQAAAAAAVQQQATNNQKRARTRITDEQLKILRAHFDINNSPSEESIMEMSKKANLPMKVVKHWFRNTLFKERQRNKDSPYNFNNPPSTTLNLEEYERTGQAKVTPLVEQDRDHQMRELQSEQRPHSQPSSTASERGGIGERGGGNSGSIIGEIQIKTEPTDDLGSTDSDQLHHHHQHQMHQQSHHLFKHDQQSHDSDLNTGAGGGGSGSSGGGGPVNSYHHSQQQQLFCNNFETKSESGSSEILSRPQTPNSTSFPGLGDMGNPQMDAMTLSALGAAGAGTGSGSANSMAGVVGNLNNMGPPKKFQMNKMFDKNNFETNSNSSSGSSSSGKRANRTRFTDYQIKVLQEFFENNSYPKDSDLEYLSKLLLLSPRVIVVWFQNARQKQRKIYENQPNNSFYESEEKKPNINYTCKKCNLVFQRYYELIRHQKNHCFKEENNKKSAKAQIAAAQIAQSLSSEDSNSSIDINSANLLTTNLSSAQQAAAAAAAAAAALGGQVSSLAMPQSPSSSAASPGLGILSSSHNLFGGKHAAATTAGQATSAGGGGASVKDNSNSQKFECDKCQLTFSRFELFKEHQLIHIMNPNLFLNHSYAENTPFGILQNMQNSLMNAGGVMSAADQQDASMDLSQSASSTTTTNKKKRKYSETHSQSGDGDLQSMMAGAGGLQSAAEYEMAKKFRSDQYDFLYQYFLQNEPNDELKKQFQQQQHQQNIDIEYLAHFYQMNELKKKGNYDFFYQYYLQNEGKQQQPDSPGSMLDVKPQMEFLLQYYQLNECKKFFQLGASPQRIHDFPLLNLTAAAAVSGSDTTAQCSTTSSTTNNVHDLVSTTSTNHIRGNGINHVTKDIGSTTTACVTSPTTAALAVASVETASTTPTSTFNHNNNNTNSNSCTINSPQHQAAQAAAAVVAATILAAGGSGVAGQQQKDQQQQLQQQLQQSHNSILPATTPTTTTATTIGGRLSPQSSKCLPIVSSLASSLQKAANNNQSSAQSSPSSSVNLADYGNAAGIGEHITTMEKQTNKRLRTTILPEQLNFLYECYQNESNPSRKMLEEISKKVNLKKRVVQVWYQNSRARERKGQFRQNLQIINKKCPHCGAIFKIKSALESHLQTKHPEKQPINIDQIPDATPTVTTTHAQPQQQPSLESSGSSPSPFASLQANHHHDLLNNLHRTATSANSFTNAMTNSAATHRSEMSFTTNNDDNVTSTVLETMITNTNAGRQLGTPLDLSKTTPKYEPSESDVSFSDSNNDHDEADDFFITTTTTPTPVNNNHSILNNNHVKIQDLVNYRHNMNSTPPPPSSAVTTTTTTRSSPMIGETCGGTNNNVYYDVDSRASHTNVGENVGETLFGVGSGVGVDLMDHAPSSPASSTQSLQKKRFRTQMSNIQVRILRALFSDVKTPTMTDCENIGREIGLPKRVVQVWFQNARAKEKKSRNNRNYSDDSYEMINGGSDLTTGSDLTNSNAATMSDDCRVCNVTKVNMQEHVFSASHIAQVRLALENGENLFATNGNNAGAGGCYSGSESNDQHDDQNNSARHHLKSPNHQSHQSTGHQRHSSSSSSDASGTGAGTEPHDATSVQIYNQLLMQNPMFAAAAAASSSSNQNGSTSSDPIAKKMLFANGLESSDNSMGGGEMDGGELSSAANAAMTAAAAMMNFSQIQQHLASVQQQNQQTSSDKLSADLPNGVTDGGDSNGTIQAAAAAAAAAHRLLMGNNLLLQINADQRSLTQAANPSESLLQQIYNYNQMSGEC
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00999398;
- 90% Identity
- iTF_00999398;
- 80% Identity
- iTF_00999398;