Aros005748.1
Basic Information
- Insect
- Athalia rosae
- Gene Symbol
- Zfhx4
- Assembly
- GCA_000344095.2
- Location
- NW:276599-448892[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 21 0.055 3.6 7.5 0.2 2 23 323 345 322 345 0.95 2 21 0.00023 0.015 15.0 1.8 2 23 575 597 574 597 0.95 3 21 5.6e-05 0.0037 16.9 0.7 1 23 629 653 629 653 0.92 4 21 0.0083 0.54 10.1 0.4 1 22 709 730 709 733 0.91 5 21 7.9 5.2e+02 0.7 0.8 2 20 961 979 960 984 0.88 6 21 4 2.6e+02 1.6 1.5 1 23 1008 1032 1008 1032 0.91 7 21 0.1 6.7 6.7 0.6 3 23 1126 1147 1124 1147 0.95 8 21 0.31 20 5.2 0.1 3 23 1169 1190 1168 1190 0.95 9 21 0.002 0.13 12.0 2.6 2 23 1254 1276 1254 1276 0.96 10 21 0.0041 0.27 11.1 1.3 1 23 1293 1315 1293 1315 0.97 11 21 0.00039 0.026 14.3 0.1 2 23 1322 1344 1321 1344 0.92 12 21 0.0023 0.15 11.8 3.3 1 23 1433 1457 1433 1457 0.91 13 21 0.0065 0.43 10.4 0.6 1 22 1477 1498 1477 1501 0.89 14 21 0.00088 0.058 13.2 0.2 1 23 1542 1566 1542 1566 0.93 15 21 0.14 9 6.3 0.0 2 21 1623 1642 1622 1643 0.94 16 21 0.2 13 5.8 2.0 2 19 1799 1816 1798 1821 0.91 17 21 0.00011 0.0072 16.0 2.1 1 23 2434 2456 2434 2456 0.99 18 21 0.0045 0.29 10.9 0.5 1 23 2571 2593 2571 2593 0.98 19 21 0.013 0.83 9.5 0.3 2 23 2736 2758 2735 2758 0.95 20 21 0.35 23 5.0 0.2 2 21 3034 3053 3033 3057 0.89 21 21 0.0037 0.24 11.2 0.1 2 22 3243 3263 3242 3263 0.94
Sequence Information
- Coding Sequence
- ATGCCCTCCAGTGAGCCCCATCCTCCCCAGTACCACCTTCAACACCACAACATTCCTCCCTCCCAGCTTCAGATTCAAACTCAAGCCCAACAGCAGAGCGTGCATACGTCGCCGGCAGGAGTTATCGgacagcatcagcatcagctaTATCAACAGCTGGTAGCAGCCCAGCACCAGCATCTAGGTTTAACCACCGGTGGTGCGGcgagtaattttcaaaattcctcGTTCACGTCACAGAAGCAGGAGATGAGCCCCGAGGAGGAGGGGGGCCGTGGGGGTGGGTCCCCTCCGGCAGCAGGGGCCGCTCTCCATCAGCCACATCACCCCCGGACAGCGAGTCCGCCCTCGGGAACCGAACCCTGTACCCGGGACGCGATACCCACACCGACACCGATAGTCGTCGATACCACCGCCACGAACGCGATGACGACTACCGCTGTCGCCGTTGGTCAACAGCAAcaggaacaacaacaacaacaacagcagcagcaacagcagcaacagcaacaaggGTCGAGTCCGAGTCCCAGTCCTACGGGAGGCGACGTTGAAAAGTTCGACGGTAAGATCGTCTACAACCCCGACGGTTCCGCCTACATAATCGAGGGTGAAAGTGAGCTGAGCGAAGACGATTCGCACTCGCTACCCGACGGTTGTATAGTAGACGGCAGGGGGGTGTCGGTACCGCATTCCTTGGTGTTCCCCCAAATAGCGAGCGCGTATTATGTTTCGAGACTTTACGCGCATCAGGCTTaccagcaacaacaacaacaccaacagcagcaacagacACTTCAACAAAGGTCGACGTCCCAACAACAGTTGCAACAAAATCCGGACCTCCCCGTCATGCACAGCTACAGGGTGATTAGTTATAGAAGTGCGGAAGGCGGAAAACAACCCCCGCCTCCCGTAACGACGCCTCCATCTCCCGCGGCATCCGTCCCCGTTAAACCGATACTCATGTGTTTCATCTGTAAACTCAGTTTCGGATACGCGAAAAGTTTCGTCGCTCATGCACAGACCGAGCATCAGCTCACCTTAATGGAAGACGAAAGGCAGGTATTATCTCATTCGACGGCCTCGGCGATAATACAGGAGGTGGGACGGGGTAAGCAAGCCCTCGTCAGTTTTTTGGAACCGGTGACGAGCTCGACGTGCTCCTCGCAGTCCTCGCCCAGTCAGTCGCACTCCCAACAGCAACAGAGGAGCGATTCCCTCGGCGAACACGACGCACCCACGACAACGAGCACCCCGGCTTCCACACCGGGAGTACCCAGCAGTCCCCAACAGTCCCAACAACAGCAAAGGCCATCGCCTAGCACCCCGACGACGCCCACATCCCACACGAGCCACTCTCTCATGTACAATCATCAGCAAACGCAGCAACAACCATCCCAGACACAGTCCCAACACGGGCAGTGGACCGGTGCCCAAGTTACCGCTGCCGCTTGGGCGAAAACCGAGCCCGCTACCGCCCTGCATTACACGTCGCCACCGCCTAGTACGAAAGGCTCGCCCTCGTCGTACGCCAGTTTGACCCAACAACCTCCGAATTTTCTTACGGGTACGACGATCGGCGTGTGTCCCGAGCACATGCAGGGGCGGCCGAGCGGCGTCGAATGTCCGAAGTGCGAGTTGATTTTAGCGAGCAGTCGCCTGGCCGGTCCCGGTGGTCCCCTCGCTGGAATTCACAGTCGTAATTCCTGCAAAACACTGAAATGTCCGAAATGCAATTGGCATTACAAGTATCAAGAGACGTTGGAAATTCATATGAAGGAAAAACACCCGGAGAGCGAGACGTCCTGCATTTATTGCATAGCCGGACAACCACATCCGAGGTTAGCGAGGGGTGAAACTTATACCTGTGGCTATAAACCTTACAGGTGTGAAGTGTGCAACTATTCAACGACGACCAAAGGTAACCTCAGCATTCACATGCAGAGCGACAAACACCTCAACAATATGCAGGAACTCCAGCAGGGTGGCGGCGGCACGAGTACGAGTAACCCCTCGTCGTCGCAGGACGCTCCCATGCCGACGAGGAGTCCGCACCATCAACAGAACCACAGTCCGCATTTAGCCAGCCAAACGAGTCAGCAGGGTAAACCGAAACCTACGTTTCGCTGTGACGTTTGCAATTACGAGACGAACGTCGCTAGAAATTTACGGATACACATGACTAGCGAAAAACACACGCACAATATGCTGGTTCTTCAGCAGAACGTTAAGCACATGCAGACTCTGTCGGCCCTCCAATCCCATCATCAACAACAAGCTCAACAACATCATcaccaacaacagcaacaacaacagcatcAGCAACAATTGGAACAACTTTTGCATCTCGGTGGTCTCGACAAACCGCAACATGCCGAAGCGGCTTTGGCCGACATGGCCTATAATCAGGCGTTGCTCATCCAAATGATGACCGGCGGTCAACTTCCGCCTCAACTTCCCCCCGAATTGATGGGCGGCATGACGAGTATGGGGGCGATGGGAAACCTCGCCGGCGACGTCGGTCTATCACCCGAGAGTATGGAGCCTCCTCCGGAGCCAGCGGATCCCGATCCATCGCATTTGTACCATTGCTGCGTGTGCAACAGTTTCGCGACAGATTCCCTCGAAGCGCTCGGTCACCACTTGGCGGCGGACAGGACGCGAACAAGAGAAGGTGAAATCCTGGCTCTTGTCGCCGGACATTTCGTCTGCAAATTATGTTCCTACAAAACAAATCTAAaagcaaattttcaattgcatTGTAAAACGGACAAACACCTTCAGAGACTTCAACACGTTAATCACGTTAAAGAAGGTGGTCCgagaaacgaatggaaattgaaatacttAGCGTCTCCTACCGGTGCCGCACAGCTAAGATGTCACGCTTGTGATTATTACACAAATAGCGCGCATAAATTAGCCCTTCACGCCGCTTCGCCGAGACACGAAGCCGCCTCACTTCTTTTAAGACACTTGCTCGAAGCCAGCGCCAACACTCCATCACCGGGTAAATTGTACCACTGTGCCCTTTGCGGGTTTAGCGCTAGGCATCGACTACCGTTGCTCCAACACGTCAGGTCGCTAAGGCATCTGCAAATGGAACAATTGCACCAGTTGCACAGAAGAAGCGGTGTACAAGGTACCGAAGCACCTCACACGGACATCGGCGATGTCTTCCAAGTCGTTCCCGACCCAGACGTACCGCCTGCTCAACAACCAAGCCCCACCGCACCCAATACTCCCAACGCCGCAAACGCTACTAACGAGCGGCGAGAGGAGGGCAGCGATTGCGGCAGCGAAGTAAAGCAAGAACCGGACGTCGACCAAGACGGTGATCAGGAGGTGGATAACGACCCGGAAGACATTTGCTGCCCGTACTGCACCTATCAGCCGACATCGAGGGATGAACTACGTCAGCATTTGCAAGTTGCTCACGTTCAGGACGTCGAGGAGAAGCCGGCTGACCTTGTGAAAGAGGAATCCACGCCCGATTTGCTCTGCCCACTTTGTCAGGACGGTTTTAAAGAACGTACCGCGTTAGAGAGGCACGTGATGCAGATACATTCGGTTAACGCTGACGGACTGCAAAGACTTTTATTACTCGTTGACCAGAGCCACTGGCTAAATAATAACCCGAGAAACACCTCGACGCCTGCGGTTACGCCGACGtccccgacgacgacgagacaACAGCATCAGCAAGACGAAGAGGTCAACGAACGTGGTGGTAACGACGAAGTTGAGGAAATTGCCAGGTGTAATGTCTGCGGGAGAATTTGCCGCTCCCTCGAGGAACTTCAGCAACACCACCGTGAAGCTCACCCGGCCACTACGCCGACCTTGGCGGTGAGCGAGAAACACGTTTACAAGTACAGGTGTGGACAGTGTAGTCTCGCGTTtaaaacgttggaaaaactTCAGCAACATTCGCAGTATCACGCGATAAGGGACGCTACGAAGTGCGCCCTTTGCGGGAGATCCTTCCGATCGGTGCAAGCTCTTCAACGACACCTCGAAGCGGCACATTCCGATCTCCACGAGGACGAACTGGCGCAGTATAAACAAAGTCTTCTTCACGCACACCCGCTTCTTCAAGCACTTACCGAAGAAGCATTGAAAAGACAAGCCATAATGGGAGAACAGACTGGCGACGACGAATGTACGAGAGGAGATGAGGAAGAAAGCGACGCGAGCGATTCGTCGCCGCTTCACAAGGAACAGAGACTCCTGGAGGATTATCTGAACAGTCAACCGGTTGCCGAGGACTCTTACCACGACCctggaagaaaattcaagTGCCATCGGTGCAAGGTCGCCTTTACGAGACAGAGCTATCTTACCGGGCACAACAAGACTCTCCTTCACAGGAAAGGGGAAAAGATGTCCTATCCTATGGAAAAGTATCTGGACCCGAACAGACCGTACAAGTGTGACGTTTGCAAGGAGAGTTTCACGcagaagaatattttattgGTTCATTACAACAGCGTGAGTCACCTTCACAAGCTGAAAAGAGCGATGCAGGAACAGGGTAACAACAATACCCTGATTTCTGTTGTTCCCCCCGCGAGTCCCACGGAGTCTCCCGACTCCCAACAAGACCAGGATAAAAAGCCGTACAAATGTAACATCTGCAAGGTAGCTTACTCCCAAGGAAGCACCCTCGATATCCACATGCGAAGCGTTCTTCATCAGACGCGAGCGAGTAAACTTCCCGACCTCGCCGCGAGCGGTCAGCTTGACCTCGGACGACCTCTGATCGAACAACCATCTCCTACGAGTCCGAATAGTCCGCCGTGTAATACCGGAACTACAACGATAAACTCGAACAATAATAGCAGCAATAATGGGAACATGTTATCGTGTCCGAGATGTAGCGCGTTGTTTGTCAGTCAGGAACAACTGGCGACGCATCAGCAGCTCTATTGTATATTCAGTAATCCGTTGGCGCTGTTCCAGCAACTTGCGGCTTCGCAGCAGATCGTTTCCTCAGCTCCTTCGAAATCACCACCACCCATGACCACGACCCCAGGTCCCCAGTTGCACgttcaacaacaacaacagcaacaacaacagcaacaacaccaacaacaacaaacacaGCCACAAATCGCTCAAGACATTCTGTCACAACCGAGACATAAGACTTCACAAATGTACAAACACCTTCTCGAGAGCTTTGGATTTGACCTCGTTATGCAATTCAATGAGAATCATCAGAGGCGGCaaaggaaggaggaagaagCCGCGGCCGCGCTTCAAGCGCAACAGGAGCAACAGAAGCAAGAGCAACAGAAACAAGCCTTGGCTGCACAGGCGGCtcaagaaagagaagaagaaggtgatGAACCTGGAGATGAGGAACCTATCCCCGAGCTTACCCGCAGCACCTGCCAACACTGCAACAAAGAATTCAGCAGCGTTTGGGTGCTGAAGGCACACTGCGAGGAAGTCCACAGGGATCTCGTGCCCCGTGAATTCCTTGAAAAATACGCGCAGCAATTCAAGTGCGAATACGAGAAGAAATCGGTCGTTGTCACTGCTGCGACTTCCTCGTCGACTACCACTGCACCGAGAAGTTCAACGCCGGCGTCAAGCCAGCCTCAAGATCTTAGCCCTGACAAAGATCATCGAgtggagaaggaaaaggacgATATTCCTGAGATCAAGGAGCCCACGAGCAGGACTCCCGAAGCCACCTCAACTACACCCACTACTACTCCGGCCCTGAGCAATACCCCGGTGTCCAGCACCGATTCCACTACTCCAACAGCTCCTTCTAACAACCAACACCAAGtccagcaacaacaacaacaacagcaacaacagcagcaacaccaacagcagcagcaacaacaacaacaacaacaacaacaacaacaacagcaacagcagcaacatgCACAGATGACACTCGCTCAACAAATGTCCGAGATGCAAGCGGCATTGAATGCCATGGCGGCTTCTCAACTTCAGCAACAACTTCAACAGTATCCTGGGCTAATGATGGGAATGATGGGACTTCCATTAGGACTCAATGTTCCCGCTCTCGCTGCCATGAACCTCCAACCTCCTTTGGTTCCTATCATGTTGCCTCCACCACCTTACGAAGGAGCGTCCAGCGGCTATCCTCAGATAAATTCACAGGCCGACATCCTTGCGAAACAACACCTGGCcttgcaacaacaacaagcCGCTGCGGCAAACGCTGCAGCATCCCAGAAGCGGGCGCGCACCAGGATAACGGACGAACAGTTGAAGATTTTACGTGCTCATTTTGACATCAATAACTCGCCGGGCGAGGAACAAATTCTCGACATGGCCGCGCAGAGTGGTCTACCTCCAAAAGTGATAAAACACTGGTTTAGAAATACATTGTTCAAAGAACGACAGCGTAACAAGGACAGTCCGTACAACTTCAACAATCCTCCGAGCACCACGTTGAATCTCGAAGAGTACGAGAAGACCGGAGAGGCGAAAGTTACTCCTCTGAACTCAAGTGTTTCGGGTAGTAGTTCCGCGGACGACAAGAGTCCGAACAAACAAGCGTCGCCTCCGCCACCAGTTCCATCTACGGTAGTCACGACATCCCAGTTACCAGAAGTGAAACAGGAAATTCCTGAGCCCCCGCATTGTCAACAACAACCGCAAAATCTTCCGCAACAACTCATGCAGCATCAAGACGAACCGCAACATCATTCCCCGGGGAGTTCCGGCGGTCAACACTCCAGACCTCATTCGCCCGCGCTCAGTATGAGTTCCGTATTCTCGGGACTTCATCACGACCTTTCCGCCCTCACACCTACCACCGCGACGAATATCCCAACCACCCCTATGTTACCGCCAAAATTAGCGCCGCAAAATTTTGCCAGTCCTACTCCGGGTGCCGGTGGCGTAGTACCCGCGGCGATGGCAGCTATGGCTTTGACGCCGCAGAGGTCGTTGAGTCCGGGTAGAGGACCCGCGGATTATTCCTTCGGTGGTAACAGTAACGGAAGTAATTCATCGGGCGGTAGTTCCGGTAAAAGAGCGAACCGGACGAGGTTCACCGATTACCAGATAAAAGTGCTGCAGGAATTTTTTGAGAACAACGCCTATCCGAAAGACGACGATCTTGAATACCTCAGCAAGCTACTCAGCCTCAGTCCGAGGGTGATAGTTGTTTGGTTTCAAAATGCCAGACAAAAGGCACGAAAGGTTTACGAAAATCAACCGGCCGCTGAACCCGTCACTCCGGGCGGAAGGGACGGAGACGACGGCGCCGGACGATTTCAAAGGACTCCGGGTTTAAATTATCAGTGTAACAAGTGTTTGCTAGTTTTTCAACGATACTACGAGTTGATACGACATCAGAAGACTCACTGTTTCAAGGAGGAGGATGCCAAGAGAAGCGCCCAGGCGCAAGCTGCAGCCGCGCAAGTCGCTGCGGTTTTAAGCTCCGAGGACAGCAACAGCAGttcaacgacaacgacgaatACCGTTCCCACGAATCCGCCGAACGCTCCCGCCCTGGCCGAACAACTCCAACAACCTCTGAACGCGGTAATAACACCCCCGCTTCATCAGGTCCAACAAATCGCCGCTCAACTCCAACAACAAGTTCAACATCAACCGGTTCAACAAAATCAACAGGCTCAACAACaaccgcaacaacaacagcaaacGCAAGTACAAGGTGCCCAAAGTGAACCGAAGGAGGGTAGTTTTCAATGCGACAAGTGTAATCTAATGTTCGGGAGGTTCGAATTGTGGCGTGAACATCAGTTGGTTCACATAATGAACCCGTCGTTGTTCCCCCCCGCTTATCCGCCAGATTCACCATTCGGGATACTGCAACAACAGGCTCTCAATGCTACCAGTGGTGTTACCAGTGAAACTCCCCATCCGCTTATTGCTATGATGCAAGACCGGAAGCGGAAGTTTGAAGATTTCGAAGAAGGTACCGGTAGCGATGGCCGTGCGACATCCGAGCACTCCGAACAACCGAAAGACAAGCGGCTGAGAACGACGATTCTCCCGGAGCAGCTGGATTACCTCTATCAAAAGTATCAAGTGGAATCGAACCCGTCgagaaaaatgttggaaacCATTGCTCGCGAAGTCGGACTAAAGAAACGCGTTGTTCAAGTTTGGTTCCAGAACACTCGTGCCCGCGAGAGGAAGGGTCAGTTCAGAGCGCACAGTCAGGTAATAAACAAACGTTGTCCTTTTTGTCCGGCGTTGTTCAAAGTGAAATCCGCACTCGAATCTCACCTGAGTAGTAAACACGCCGATCAAGTTGCTCGGGGCGAAGTGAACATCGACAACATTCCCGACGAAGAACTATCGATGGAATCAGCCCCCTCGAATCCCAGTACGCCGAACATGATGCCACCGCTGTTCCCACCGTTCAACACCGAAATGGAAGCTTCccttaaaaaatattacgaggaGTCCATGAAACGGTATATCAGCGAACTCCAAGCCCATACCAGTAACGGAAAACAGGAGACTACGAGCACCCCGCAAACAACAAGCACCGGAGAGTCCCCTTTGGATCTCAGCAAACCCGTAGACCTCAGCAGACCGGTGAAATTGAGTCTCGGCGGTCTCGGCGGACTCCTCGATGAACACCACGGTGGTCATTTGAGAAGCGGAAGCGATTGCGGACCACTCACCGATTTGTCGGAGCGGAGCATATGCGACGAGGACAGTATGAGCGAGACAACAGAGTTCCTAGACGACGAAAGTGGTCCGGCAAGTCCGGCCTCGAGTACTCAGAGTTCGAGACACGGTGCCGCCGGAGGCGGAAGTCAGATCGGAAGTACCAACCTCGGGGGTCAGAGCGGGGGTGGTGGAGGTAGCGGAGGTAAACGTTACCGAACGCAGATGTCCGCTACCCAGGTGAAGGTGATGAAGTCTCTGTTCTCCGACTACAAAACACCGACGATGGCCGAGTGCGAGATGCTCGGGCGTGAGATCGGACTCCCAAAACGCGTAGTCCAGGTATGGTTCCAGAACGCCCGAGCTAAGGAAAAGAAGGCCCGTTTAGCGGCGGGTTTGCCAGCCGAAGGATCGGCGGTTCCGCCTCATCGCGGGCCAACGGGACCCGACGAATGTCGCCTCTGCGGGGTCAGGTACTCCCCGAAATCCCCACTTCAGGAACACGTATTTTCCAGACGGCACATCGAGTCGGTTAGATTGGCGGTAGAAGACGGTACTTTGGTTCCTCCTACTCCCGGTGCTCCCATATTGCCCGGTAGTATCGCTGCGGCGACGTTAGTCGCCAATGCGAATTCCGTCACTCCCGTCGgacaccaacaacaacaacagcagcaacaacaacaacagcagcagcagcagcagcaacaacaacagtcCGACGAGAACATGATGTACGGTTCTCTCTTCCTTCATCCGACGGCAATGTTTCCACCTCAACAACAGCAACACCCCAGCGGGAATGTTCAGGCAACTCCGGCTACCACAACCGCCGTAGCGACTTCGACCGTGACAAATCCAGGAAATTCAGGAATTTCACCATTGATGTCTCTGCAAGTCGAAGGTGGTACAAATGTCCAAGTACCACGTGTCCTAATGCAGGCATTTCTTCAGCAAGATCCAAACCATCCGGGCCTCGAAACGGTCCGTCTTCCGTTACCTTCGGGGTGTGGAGAATTGAATTCCCCAAAACCGGAAACGCCTCCACACTGCCGCGAAGTAGAAACAGAGGTATGCCTCGTTTGTCGTCGTTGCGGCCGAGCGTATCCGCAAGAGTCATCGTTATTGTCCCATCAAAGATCGTGTTATCTGGGGAACCAGCAGCGTCGGGGTGCCTTGCGTCTGGTCGAGTCCCGTTGGTCCTGCACGTTGTGCGTAGGAGATGCGAAAACCGTTTTTACGTTGCCGGAAATACGTCGTCACGTCGAAACTCCTCAGCATCGCTCGAACCTGGAGTCGCTGAACCAGCGTACGAATTTCCAATCGGGATCGGTTCGGGTTGCCGCATCCGCTTCGGAAAGAGCGTCGGGGTCGGGCtgcgaagatgaagaagaggaggaagacgaagaggaggaagaggagcgAGGAGAGGATGAGGCTGCGAATCCGTTGACCAACGAAATGGAGGACGTCGTGAACCAAATAACACTTTTGGCGGCTAGAGCAGCGGCCGAAAGCACGACGACCGTTACGACGACTACCAATTCAACTACCGCGATTATGACCTCCGCGGGGTCAAATTTCAACCTTCAACAATCAACCGATCGAAGCGCGGCTGCTACCGCCGGAGGACAAGATAACAATAACGCTCCTGATACTAAAAGGCAAAAACTAATACAAGAAATTCCCAAAGCTCTCGCCGGAGCGCGATAA
- Protein Sequence
- MPSSEPHPPQYHLQHHNIPPSQLQIQTQAQQQSVHTSPAGVIGQHQHQLYQQLVAAQHQHLGLTTGGAASNFQNSSFTSQKQEMSPEEEGGRGGGSPPAAGAALHQPHHPRTASPPSGTEPCTRDAIPTPTPIVVDTTATNAMTTTAVAVGQQQQEQQQQQQQQQQQQQQQGSSPSPSPTGGDVEKFDGKIVYNPDGSAYIIEGESELSEDDSHSLPDGCIVDGRGVSVPHSLVFPQIASAYYVSRLYAHQAYQQQQQHQQQQQTLQQRSTSQQQLQQNPDLPVMHSYRVISYRSAEGGKQPPPPVTTPPSPAASVPVKPILMCFICKLSFGYAKSFVAHAQTEHQLTLMEDERQVLSHSTASAIIQEVGRGKQALVSFLEPVTSSTCSSQSSPSQSHSQQQQRSDSLGEHDAPTTTSTPASTPGVPSSPQQSQQQQRPSPSTPTTPTSHTSHSLMYNHQQTQQQPSQTQSQHGQWTGAQVTAAAWAKTEPATALHYTSPPPSTKGSPSSYASLTQQPPNFLTGTTIGVCPEHMQGRPSGVECPKCELILASSRLAGPGGPLAGIHSRNSCKTLKCPKCNWHYKYQETLEIHMKEKHPESETSCIYCIAGQPHPRLARGETYTCGYKPYRCEVCNYSTTTKGNLSIHMQSDKHLNNMQELQQGGGGTSTSNPSSSQDAPMPTRSPHHQQNHSPHLASQTSQQGKPKPTFRCDVCNYETNVARNLRIHMTSEKHTHNMLVLQQNVKHMQTLSALQSHHQQQAQQHHHQQQQQQQHQQQLEQLLHLGGLDKPQHAEAALADMAYNQALLIQMMTGGQLPPQLPPELMGGMTSMGAMGNLAGDVGLSPESMEPPPEPADPDPSHLYHCCVCNSFATDSLEALGHHLAADRTRTREGEILALVAGHFVCKLCSYKTNLKANFQLHCKTDKHLQRLQHVNHVKEGGPRNEWKLKYLASPTGAAQLRCHACDYYTNSAHKLALHAASPRHEAASLLLRHLLEASANTPSPGKLYHCALCGFSARHRLPLLQHVRSLRHLQMEQLHQLHRRSGVQGTEAPHTDIGDVFQVVPDPDVPPAQQPSPTAPNTPNAANATNERREEGSDCGSEVKQEPDVDQDGDQEVDNDPEDICCPYCTYQPTSRDELRQHLQVAHVQDVEEKPADLVKEESTPDLLCPLCQDGFKERTALERHVMQIHSVNADGLQRLLLLVDQSHWLNNNPRNTSTPAVTPTSPTTTRQQHQQDEEVNERGGNDEVEEIARCNVCGRICRSLEELQQHHREAHPATTPTLAVSEKHVYKYRCGQCSLAFKTLEKLQQHSQYHAIRDATKCALCGRSFRSVQALQRHLEAAHSDLHEDELAQYKQSLLHAHPLLQALTEEALKRQAIMGEQTGDDECTRGDEEESDASDSSPLHKEQRLLEDYLNSQPVAEDSYHDPGRKFKCHRCKVAFTRQSYLTGHNKTLLHRKGEKMSYPMEKYLDPNRPYKCDVCKESFTQKNILLVHYNSVSHLHKLKRAMQEQGNNNTLISVVPPASPTESPDSQQDQDKKPYKCNICKVAYSQGSTLDIHMRSVLHQTRASKLPDLAASGQLDLGRPLIEQPSPTSPNSPPCNTGTTTINSNNNSSNNGNMLSCPRCSALFVSQEQLATHQQLYCIFSNPLALFQQLAASQQIVSSAPSKSPPPMTTTPGPQLHVQQQQQQQQQQQHQQQQTQPQIAQDILSQPRHKTSQMYKHLLESFGFDLVMQFNENHQRRQRKEEEAAAALQAQQEQQKQEQQKQALAAQAAQEREEEGDEPGDEEPIPELTRSTCQHCNKEFSSVWVLKAHCEEVHRDLVPREFLEKYAQQFKCEYEKKSVVVTAATSSSTTTAPRSSTPASSQPQDLSPDKDHRVEKEKDDIPEIKEPTSRTPEATSTTPTTTPALSNTPVSSTDSTTPTAPSNNQHQVQQQQQQQQQQQQHQQQQQQQQQQQQQQQQQQQQHAQMTLAQQMSEMQAALNAMAASQLQQQLQQYPGLMMGMMGLPLGLNVPALAAMNLQPPLVPIMLPPPPYEGASSGYPQINSQADILAKQHLALQQQQAAAANAAASQKRARTRITDEQLKILRAHFDINNSPGEEQILDMAAQSGLPPKVIKHWFRNTLFKERQRNKDSPYNFNNPPSTTLNLEEYEKTGEAKVTPLNSSVSGSSSADDKSPNKQASPPPPVPSTVVTTSQLPEVKQEIPEPPHCQQQPQNLPQQLMQHQDEPQHHSPGSSGGQHSRPHSPALSMSSVFSGLHHDLSALTPTTATNIPTTPMLPPKLAPQNFASPTPGAGGVVPAAMAAMALTPQRSLSPGRGPADYSFGGNSNGSNSSGGSSGKRANRTRFTDYQIKVLQEFFENNAYPKDDDLEYLSKLLSLSPRVIVVWFQNARQKARKVYENQPAAEPVTPGGRDGDDGAGRFQRTPGLNYQCNKCLLVFQRYYELIRHQKTHCFKEEDAKRSAQAQAAAAQVAAVLSSEDSNSSSTTTTNTVPTNPPNAPALAEQLQQPLNAVITPPLHQVQQIAAQLQQQVQHQPVQQNQQAQQQPQQQQQTQVQGAQSEPKEGSFQCDKCNLMFGRFELWREHQLVHIMNPSLFPPAYPPDSPFGILQQQALNATSGVTSETPHPLIAMMQDRKRKFEDFEEGTGSDGRATSEHSEQPKDKRLRTTILPEQLDYLYQKYQVESNPSRKMLETIAREVGLKKRVVQVWFQNTRARERKGQFRAHSQVINKRCPFCPALFKVKSALESHLSSKHADQVARGEVNIDNIPDEELSMESAPSNPSTPNMMPPLFPPFNTEMEASLKKYYEESMKRYISELQAHTSNGKQETTSTPQTTSTGESPLDLSKPVDLSRPVKLSLGGLGGLLDEHHGGHLRSGSDCGPLTDLSERSICDEDSMSETTEFLDDESGPASPASSTQSSRHGAAGGGSQIGSTNLGGQSGGGGGSGGKRYRTQMSATQVKVMKSLFSDYKTPTMAECEMLGREIGLPKRVVQVWFQNARAKEKKARLAAGLPAEGSAVPPHRGPTGPDECRLCGVRYSPKSPLQEHVFSRRHIESVRLAVEDGTLVPPTPGAPILPGSIAAATLVANANSVTPVGHQQQQQQQQQQQQQQQQQQQQSDENMMYGSLFLHPTAMFPPQQQQHPSGNVQATPATTTAVATSTVTNPGNSGISPLMSLQVEGGTNVQVPRVLMQAFLQQDPNHPGLETVRLPLPSGCGELNSPKPETPPHCREVETEVCLVCRRCGRAYPQESSLLSHQRSCYLGNQQRRGALRLVESRWSCTLCVGDAKTVFTLPEIRRHVETPQHRSNLESLNQRTNFQSGSVRVAASASERASGSGCEDEEEEEDEEEEEERGEDEAANPLTNEMEDVVNQITLLAARAAAESTTTVTTTTNSTTAIMTSAGSNFNLQQSTDRSAAATAGGQDNNNAPDTKRQKLIQEIPKALAGAR
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01498017;
- 90% Identity
- iTF_00173988;
- 80% Identity
- iTF_00175582;