Csor026022.2
Basic Information
- Insect
- Cheilosia soror
- Gene Symbol
- zfh2
- Assembly
- GCA_948107745.1
- Location
- OX443565.1:10237131-10260671[+]
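The location field packs the sequence accession, the coordinates, and the strand into one string. A minimal Python sketch for unpacking it follows. It assumes the layout `accession:start-end[strand]` seen in this record and 1-based inclusive coordinates, which the record does not state. The names `Location` and `parse_location` are illustrative, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def span(self) -> int:
        # Length of the locus, assuming 1-based inclusive coordinates.
        return self.end - self.start + 1


# Assumed layout: accession:start-end[strand], e.g. OX443565.1:10237131-10260671[+]
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end, and strand."""
    m = _LOCATION_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m["seqid"], int(m["start"]), int(m["end"]), m["strand"])


loc = parse_location("OX443565.1:10237131-10260671[+]")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.span)
```

Under the inclusive-coordinate assumption, `span` is the genomic extent of the locus (introns included), not the length of the coding sequence.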
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The finger is described by the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers in brackets indicate the number of residues. The positions marked # are important for the stable fold of the zinc finger, and the final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix; in DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The consensus binding sequence for Sp1 is usually defined as the asymmetric hexanucleotide core GGGCGG, but this definition does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  17      0.58        68    4.9   0.6         2      23        78     100        77     100  0.94
 2  17   0.00023     0.026   15.6   1.4         2      23       644     666       643     666  0.96
 3  17   5.6e-05    0.0066   17.5   0.7         1      23       698     722       698     722  0.92
 4  17     0.018       2.1    9.6   0.4         1      22       764     785       764     788  0.91
 5  17       2.2   2.6e+02    3.1   1.9         1      23      1050    1074      1050    1074  0.91
 6  17       7.4   8.7e+02    1.4   3.2         1      23      1320    1343      1320    1343  0.93
 7  17     0.079       9.2    7.6   0.1         2      23      1462    1484      1461    1484  0.93
 8  17     0.017         2    9.7   0.1         1      21      1560    1580      1560    1584  0.90
 9  17    0.0027      0.32   12.2   2.3         1      23      1630    1652      1630    1652  0.96
10  17   0.00068      0.08   14.1   1.5         2      23      1659    1681      1658    1681  0.94
11  17    0.0097       1.1   10.5   4.8         2      23      1841    1864      1840    1864  0.90
12  17     0.035       4.1    8.7   0.6         1      22      1885    1906      1885    1909  0.89
13  17    0.0087         1   10.6   0.1         1      23      1993    2017      1993    2017  0.93
14  17       0.4        47    5.4   2.9         1      19      2084    2102      2084    2107  0.92
15  17   0.00021     0.024   15.7   1.8         1      23      2774    2796      2774    2796  0.97
16  17   0.00071     0.083   14.1   1.3         1      23      2913    2935      2913    2935  0.98
17  17     0.011       1.3   10.3   0.6         2      23      3418    3440      3417    3440  0.96
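The pattern quoted in the description, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], can be written as a regular expression for a quick scan of a protein sequence. The Python sketch below is an illustration only: it treats the # positions as "any residue" because the description does not say which residues are allowed there, and it is far cruder than the Pfam profile HMM (PF00096) that produced the hmmscan hits. The function name `find_c2h2_candidates` is hypothetical.

```python
import re

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# Assumption: '#' (fold-stabilising positions) is matched as any residue,
# since the description does not list the residues permitted there.
C2H2_PATTERN = re.compile(
    r"."        # '#'
    r"."        # X
    r"C"
    r".{1,5}"   # X(1-5)
    r"C"
    r".{3}"     # X3
    r"."        # '#'
    r".{5}"     # X5
    r"."        # '#'
    r".{2}"     # X2
    r"H"
    r".{3,6}"   # X(3-6)
    r"[HC]"
)


def find_c2h2_candidates(protein: str) -> list[tuple[int, int, str]]:
    """Return (start, end, segment) for non-overlapping pattern matches.

    Coordinates are 1-based and inclusive.
    """
    seq = "".join(protein.split()).upper()
    return [(m.start() + 1, m.end(), m.group()) for m in C2H2_PATTERN.finditer(seq)]


# A 25-residue stretch copied from this record's protein sequence.
fragment = "YRCEICNYSTTTKGNLSIHMQSDKH"
print(find_c2h2_candidates(fragment))
```

Because `re.finditer` reports non-overlapping matches and the pattern ignores residue preferences, the hits are candidates to compare against the hmmscan coordinates, not a replacement for them.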
Sequence Information
- Coding Sequence
- ATGTCCTGTTCTGATGTTGAAAGTTTTCAAGGCAAGATTGTTTATAATCCAGATGGCAGTGCTTTTATAATCGATTCTGAAAATGAATCACTTTCGAATATATCGGATAATTGTATGAACGTTGGGGCAACCGCAACGAATAACCCAAAAATCCACTCATTCCGTGTGGTTACCGCTCGCGATGCCAGTGTTAATATTTCCGAgccaaataaaattcaaaagccCATATTAATGTGTTTCATCTGTAAACTGAGCTTTGGAAATACGAAATCGTTTAGCTTGCACGCAAACAGTGAGCACACCCTTAATCTCCAAGAATCCGAAAAGTTACTATTGAATCGAGAGTATTCGAGTGCCATTATACAGCGGAATGTTGATGAGAAGCCGCAGATATCGTTTTTGGAACCGTTGGATATACAAAAGCAACATCAATTTATGAAACAATCATCCCAAACGccacaacatcaacaacaactgCTTATTGGATCCACATTGTCGTCGGCGGCGAGCAACAATAACTTTAGCAGCAATAACTGTAgcaacaataatattaatattaattgtAGTACATCCACAGCGATACCACCGTCGAcgtcgtcctcgtcgtcgtcgtcgtcagcaTCATCGGTGTCATCGGCGGCAGTAGCAGCAGCTGCGGCGGCTAATGCGGCGTTGGTGGCTGCAATAGCAGCCAGTTGTAGCAACACCAACAGCGTGAATACATCAGCATCCATATCGATGAACACGCCGCACTTGGATAGTGATCTTATAATGGCGAACCTCGGCGCTGCCGGCGGTGGCGGTAATAACAATCCTAGTTATGGCAACACGACGGGCACCATTAGTAACAATCCTAGTAGTAGCAGTAGTAATTGTAGTCAAATTAATCATCAACAAAGATCAGAGAACTTTGATAATCTGAGCACTTTAGATTTAAGTGCGGCGACAGTGGCAGCGCAGGCGGCAGCCGCGGCCGTAGCAGCTGCAATAACAGTTGATGGCCGGCACACACCGCCACCGTCGTCGCCTACATCGACCACCTCGTCCTCACCTTCATCATCTTCTTCGTCTTCGTCAATATCAACATCGTCGTCATCTGCCCAGCAACAAGCCACAACAATTATCGCTTCAGGAGTACCAGTAGCCACAGCACCCGGTGGGCCAAAACAGAGCTCGCATTCCTCAGCAGCTATTGACGCTATTCCAGCAACATCGTCAATAAGTTCCACAGCAACAAAATCAACTAGTAGTAATAGTAGTAGTaatattaacaataccagtAGCAATAAGATCAGTGCATCTGCAGTGTCGCCCCCAACACCCACAACAGTGGCGGATTTCCTGCAGCAACAGTTCCAACAAATGCAGAATCAAATTCGAATAACATCCCCCACATCCGCATCGGCAGCCGTGGTGAGTACCAGTGGAAGCAATATGGAAAGCAATATATTGTCCTCGGTGACCTCAAGTGCGCCAACGCTAAGTTCATTAACCGCATCATTGGCAGCGGCGGCAGCAGCTGTTGGTAGCGGCACTCCGGGTGATTTAACTAGCAATAGTAATAGTGTGAAGTTGATCAATGATTTCCTGCAACATCAATTGCAACATCAGCAACATCCTTCGTATGCAACATGTCCCGAGCATCCAGACGTGAAGGGCATAGATTGCAAGACATGCGAAATGATTGAGATCAACATGAAATCGCCGATAACACCGACCCGTTCGCCAAATAGTATTAATCTGTTCCCATCGAGTTCGACAATGTCCCCGACAGCAGCTGCAGCTCCCAGCTTTACAATCGGCGCCTGTCCGGAGCATATCAACGGGCGGCCCTTAGGCGTGGATTGTTCTAGATGTGAAATGATATTGAATTCCGCCCGGCTCAACAGTGGCGTACAGATGTCAACACGTAACTCGTGCAAAACTCTCAAGTGTCCCCAATGTAATTGGCACTACAAGTATCAGGAAACATTAGAAATTCATATGAGGGAGAAGCAT
CCAGATGGGGAGAGTGCATGTGGCTATTGTCTGGCCGGTCAACAACACCCGCGATTGGCACGCGGCGAGTCCTACACCTGCGGCTACAAGCCGTATCGCTGTGAGATTTGCAACTACTCGACAACTACTAAGGGAAACCTATCGATTCACATGCAAAGTGACAAGCATTTGAATAATATGCAGGAGTTGAACAGTTCGCAGAGTATGGCGAATACGGCAGCCGAAATTCGTGAGTCGCCGAAAATCATCATGCCAAATATGCAGCAGCAGGCTTCGAAGCCGAAGCCGAGCTTTCGCTGTGACGTGTGTTCCTATGAGACAAGTGTGGCGCGTAATCTCCGTATCCACATGACCAGCGAGAAGCACACCCACAACATGGCTGTCCTGCAGAACAATATTAAACACATTCAAGCGTTCAGTTTCCTGCAATCACAGAATCTCGGCCAACTGAGTGCGGCTCAGAGCGCTGCCGTCGCAGCTTCTAACTTGCCCAATATGCCGAACTTGCAAAATTTTCTACCAGAGGCAGCTCTGGCTGATATTGCTTACAATCAGGCCCTAATGATTCAGCTATTACATCAGAACTCTGCTGCGGGAGCTTTAAGCGCGGCGGCCGCAGCCGCTGCCGCAGCCAATCCTTTAACCTTGGCCCCACCACAACAGTCTCCGGCTGGAAGTGGAGTAACAACAACGTTGCCGCAGAGTGCATCAACATCACAGCAGCCATCCCAACAACCCTACCAAACGTCACAGCCATCGAAACTAAATCATCCACCAACGCCACAAGTTTTGCCAACAACAGGAGCGACTCCAGTATCAGTGTCGTCAACAACGTTATCGGTTCATCAATCCTCGCCACAGCAGCAATCTGCAGTAGCGGCGGCGGCTCTTTTAGCCGAGGCGGCCGCTGCTGGAGCGGCTACCACCGATTCAGCAACATGtatccaacaacaacaacaacaacaacaggacTCATCTCTCGATCCGCCAATTGATCCCGATCCAAAGCCAACAACTGCTTTTAGCTGCCTCATTTGCGCCAACTACAATACGAACAGCATCGATGAACTGAACAATCATCTTATGATTGATCGATCGcgcaacaccaacaacaattGCAGCGACATCATGATGATCATCAACAATAATTATATATGTCGCCTGTGCAACTACAAGACGAATCTGAAGGCTAACTTCCAATTGCATAGCAAAACGGACAAGCATTTGCAAAAGCTCAACTACATCAATCACATCCGCGAGGGTGGTGTTAAGAACGAGTACAAACTGAAGTACAACCAAACGAATACGGTACAGCTGAAGTGCAACTGCTGCGATTTTTACACAAACTCCATACAGAAGCTAAATCTTCATACTCAGCACATGCGTCACGATACCATGAAGATGATATTCAATCATCTCTTGTACTTGGTGAATAGTTTTAACGCGTCTATGGGCAATAATAGTGATGTCGGTGTCGGTAGTGAAATCAGTGAATTTCAACTAATGAGCAAGAACAAAGCGCTCATGTGTCAATTGTGTAATTTCAGTGCGGTGAACATTCTGCAAATGGTGCAGCACGTAAAGAGTTTGCGTCACATTCAAGTGGAACAGTTCATATGTCTGCAGCGAAGGAGTGAAAACCTCGAATCGTTAGGCCTGGATGATGTCTTTAAAATCGCTGATAACACTGATTGCGTTAAATCGGAACGATCAAGTCCCGAACAATGCTTAGAATCCCACCAATTATCGCTTAAACAGGACATTGAGAAAAACGCTACAACGACACCGAGGAGTTGTTCTGCTATGGGCGATGGACGTCACGATAGTTACGCAGACTCCTCGCAATACCAACAGAACCTGGGCATATCGTCAGTACCAAGCGCAACTGGAAGCAGTAGTTCCAGTTCCAATAACAAGGAACTTTCTGACGTTGACCTCTCTACGTTGCCatcaataatatataaatgCAACAATTGTGATTATTTCGCTCAAATCAAACA
CGAAATGGAACATCATATTTCGAACATGCATTCAAATGTATCTGAGCATGATTACCTTACGATTCCGACAAATCCGGCTGCTTTGCATGCTTTTCATGCAGCTGTCGCAGCGGCAGCCGTAGCAGCAAATGCTGCAGCCTCACAATCTCGAAGTAAATCATCTTCGCCAGTGCAAACGCAGAGTAAACGACAGCATCATGAAGAACAGGCATTCGCAACATCTGTATGCGGGAATAGTATGTTGCCTGAAGTAAAAACTGAACGAATGGATGCCATGGACGATCCTCAGTCAAATGATGAATGCGATCAGTTTGAAGATCCAACTGAGTCGTCGAACTCTAGAAGCGCGATTAACGTCGCGGGTTCTACGGCACTCAACTCGGTTATGTGTCCTCTTTGCCAGGATACATTCAGTGAGAAGAAGTCTCTTGAAATGCATCTCATGGGTGTGCATAGTGTGAACAGTGATGGATTGGCAAGACTGCTGCAACTGGTGGACAATACTCAATGGTTAAATAGTAGCCGGCGCAGTAGTACTAGTACTACTCCAGAGCCTCGCAGTTCAAGCACACCTCATTCAGATGTTGGAGCATTAACgttgcaacagcaacaacaaatgCATCAACAACAATCCTCTTCATCTGCTTCATCTGCTCATCAAAATGTTCCTAGCGAAGAATACGCCTGCCCTCAGTGTGGTGCTGCATTCAAGTTGCAGCAGCATTTGTTGATGCATGCTAATGACGCACAACATTATCAATTAGTAAACGAACAATATCAATGTTTGGCAAAGCATTGTCAGCAGCTCTTTGGCAATTTGCTGCAAATGTTAACGCACTATAAAGATAGCCACATGAATATAGTGATTTCTGAGCGTCATGTGTACAAGTATAGATGCAAGCAATGTTCGCTGGCCTTTAAAACTCAAGAAAAACTAAACACTCATTCCTTATATCATACAATGCGTGATGCTACCAAGTGTATGATCTGCAATCGCAATTTTCGCAGCACTCAATCATTACAGAAACACATGGAACAGGCGCACAATCAATTGCAAGCAACTGGAAGTCCCATCCCATCTCCTAATAGTAGTGTAGAACTGACAAATACTCCTGCATCCGTAACGCCTATTACCAGTCGGGGAGATGAAGATACGGCTGTTAACACCACAGCCATaacggcagcagcagcagttaCAGCAGCGACTACAGGTACGACAAATCAATTCAGCAAAACTTACGTTAagaattatgatgatgatgatgatgatgatgttggtaATGGTCAGTTCGTTGTTTCGATACAAGAAATCTCTAATAACTCCACATTAATAAAAGCACCTCCACCACCACAACCGCCagcaccgccaccaccaccaccaccaccaccaccaccaccactgccGCCACAAACCGCGATCGAAATTTTAAGCAGCGATAGCAGCAACAAGATCCCATTAGATGACTATCTGAATTCCCAACAAATGTCGGAAGAAAATTACACCGATGTGGAACGTAAGCTTAAGTGTCACAAGTGCAAAGTTGCCTACACAAATCAAACCCATCTAGCGAAACATTACAAATCGAATCAACATCGTCGCAATGAAAAGTTGAGTATATATCCATTGGAAAAGTATCTCGATCCCAACCGTCCTTTCAAGTGCGAAGTGTGCCGCGAAAGTTTCACCCAAAAAAACATACTTTTGGTGCACTACAACAGTGTATCGCACTTACATAAGGCAAAAAAGCAACAAAGCGGTGGTATGGCATTACAAGTAACGCCGATCACATCCCCCGTACTTCTAACTGATTTTGTGGATAGTACTGTCGACAGTAGGGGACAAGTAGTGGAACCAGTTGTTGGTGGTAGTGGTAGCACTTGCTGTGGTGGTGCTGGGTGTGTAGCGGCAATCAGCAAATCGCTGAATTCCAAACGAAAAATATCTCTCGAATCCGACTATGAGAGTCCTAAGAAGCGGTTCAAATGTGACATCTGCAAAG
TGGCATATGCCCAGGGAAGTACACTTGATATTCACATGAGAAGTGTATTGCACCAAACACGCGCTTGCCGTTTTCAGGAACAGCAGAAACCGATGACGCCGTCCTCATCATCGAACAGTCCAACATCAACAATGACAACGCCGACATTCAATGACCAAGTATATAAATCCCTTCTGGAAACGTACGGCTTTGACATTGTAAAGCAATTCAATGAGATAAATAAGATTTGCCCAGCTACTGACTCTGGCAATTATTATTGCCGTTACTGCAGTAAGGTATTCTCGTCAGTGTTTGTATTAAAGACACACTGCGAGGAAATACACAATGAGAAGATTCCCCTAGAGTTATTAGAAAAATTtgcagaaaaatttaaaatatattatcTCGACCAACaaccaccatcaccatcatcgcTGTTATCGTTAAGCATTCCAAATCAGAATGAATCTActatttctttaaataaatctGAAACTAATTTAGGTGCAACAACATCCACCGCATGCAGCGAAAGTAGCTCTCCAGCGCCAATTGCAGACTTGGGAGCAGGAAGAAGCGAAACCTCCCCCATATTGCCTTCAACTTCGCAGCCAGTAGCGGCGACCAGCCCCAACCCCGTCACGGTATCAGCCGTGGCATCTGCACTACTAAAACAGCAACAGCAAACCCTAACTCCAGATATTGTGCAAAAAATCAGTCTAGACCCTACCATGTTGGCCCAAAAAATTATggagcaaaattttcaaaattttccaccaaATTTTACAGGTCTGCCACAGAATCTCCAAAATTTACAGAGCTTGCAAAGCCtacaaaatttgcaaaacatGCAACAAAATCTACCGAATATGAGTAACCTACCAATGAATACACTAGACATGCTAAACATAATGCAATTCCATCACTTGATGTCGCTGAATTTTATGAATTTGGCGCCACCTTTGATTTTCGGGGGAGCTGGAACCGGCCAAAGTCCATCGGTTTCAGGAACTACCCCATCAACAGTCACTGAACTACCGCCAACACCGGCCACGCAATTAATACAGCAGCAACAGACTGCAGCCGCTTCGCAGAATACCAGCAATCAAAAGCGAGCTCGCACTCGTATCACAGATGATCAGCTGAAAATCCTCCGTGCCCATTTCGATATAAACAATTCGCCCAGTGAAGAGAGCATCATGGAAATGTCCAAGAAGGCCAATCTCCCAATGAAGGTCGTCAAACATTGGTTCCGCAACACGCTGTTTAAGGAAAGGCAACGCAACAAAGACTCGCCTTATAACTTCAACAACCCACCATCAACAACGCTGAATCTAGAAGAATATGAACGTACTGGACAGACGAAGGTGACCCCTTTATCGGAAAGCGGAGGTGGTGCCATTAGCGGCTTCCATCTCcttcaacaacagcagcaacagcagcagcaacaacgagaacatcaacagcagcagcagcagcagcagcaacaacaacttcAGCAGCAACAACTGCAGCAGCAGATTCAGTCGCAACATCTTCAACAGCAACGCCCGCCTTCTTCTCAATCCAGTGATTTGAATTTTCCCCAGCTGAGCTTccaccagcaacaacaacagctcgATCTCTCCCGCCAACAACAGcatcaacaacagcagcaggacAATCGTCCTTTGTCTCACCCATCAAGCGTTACCAGCGATCGTGGAGACATCCATATTAAATCAGAACCAACTGATGATATTGGCAGTTCCGATTGTGATCAGCAGATGATGCTGAACAAAGATCACGATAGCGAACAATCATTGATGCAATCACACCATCAGCAATCAATGTTCTACAACAACTTCGAGACGAAATCCGAAAGTGGAAGCTCTGAGATTCTATCACGCCCCCAAACCCCGAACAGCACGTCGACACCATACAGCAGTAACATTTCAGACATCCTAGGTCAGCAAATGGACAATCTACCGCTGAATAATATGTCAAACATCAGCAATCTGAACAATATGGGTCCTCCAAAA
AAGTTTCAAATGAACAAGATGTTTGAAAAGAATAGTAACTTTGAGACGAACTCTAATTCATCGAATAGCTCCACATCGAGTGGCAAACGTGCCAATCGCACCCGTTTCACCGACTACCAAATCAAAGTGTTGCAAGAGTTTTTCGAAAACAACTCTTATCCTAAAGACAGCGACTTGGAGTATTTAAGTaaattgctgctgctgtcacCAAGAGTTATTGTCGTTTGGTTTCAGAACGCACGTCAGAAACAGCGCAAGATCTACGAGAATCAACCGAACAATACCTTCTATGAGTCCGAAGAGAAAAAGCAAAACATCAACTACGCCTGCAAGAAGTGTAACCTAGTTTTCCAGCGCTACTATGAACTCATTCGCCACCAAAAGAATCATTGCTTCAAGGAGGAGAACAATAAGAAATCAGCAAAAGCACAAATAGCTGCTGCTCAGATTGCGCAAAGTCTTAGCAGTGAGGACTCCAACTCAAGCATCGATATCAACAATGCTAATTTGATGTCTTCGAACATAGTTGGACAACAGGCAGCTGCAGCGGCCGCTGCAGTAGCTGTGGCGGCAGCAGTTGGTGGGGCTGCTGTTCCACCTGTGCCACCAGCGATTCCTGGCCTCTCCACCAGTCCAGGAATGAACTTGCTTTCATCGCCACAACATATTTTCAAGCAACAGCAATCAGCGACTTTAGCAACTGGAAGTCACATTGATAGCACCTCGCCGCTACAGAAATTCGAATGCGACAAATGCCAGTTAACATTTACCCGTTATGAGCTGTACAAGGAACATCAACTAATACATCTTATGAATCCAAATCTTTTTATGAATCAGAATTACAGTGAATCGTCGCCCTTCGGAATCCTGCAAAATCTGCAGGGAAATCACAGCCAACAGCAAGACACTTCGATGGATTTAAGCAGGCAAAAGAAACGTAAATATTCTGATACACAAATCTCACCCGATGATCTACAGCAACAACAAGACTATGAATCTTTcaataagaaattcaaaaatGATCAGTACGACTTCCTCTACCAGTATTTTATGCAAAATGAGACCAACGCCGATCTCAAGAAGCAGATTACACAGCAACAACAACCAGAAGTCGATCTGGAATACTTGGCCAACTTTTACCAGCAAAATGAACTGAAAAAGCGCAGCAATTacgattttttatttcaatactaCCTGCGAAATGAATCAAAACAACCGAGTAGTGCGGCCAGCCTCATGATGCTGAACGATGATGCTAATAAACCAAATATGGAATTTCTCCTACAATATTATCAACTCAGTGAATCGAAAAAGTTTTTTCAGTTAGATGCCTCGCCCCAACGAATACATGATTTTCCACCATTGCTGAATATGGGCAGCGCAGCAGCAGCGTCCGTAAACAATGGTGTAGGaatacagcagcagcagcaagttTCGGCAACATCCCCAACAGAAACTATAATGAGTTCCACTACCAAAGTCACTGCAGCAACGCCCAGCACTTCACCAGTGTTGCTGACCAGCAGCCATGGTGGTGGTATTGGGAGCAAAGATACCAGCAGCAACATCAGCGGCAACAACCAGATTTCGATGCCTTTGTCAGCAACAGAAGCCAAACTGCTGTCAGGGGCAACGGCAGCTGGTGGTGGGAGCGGGGTACAAACAGTAACGTTACCAACTACGACAGTGAACACAATAATTTCTGGTAATAGTTCTTTTATAGCCAATTCCATGAGCATTAATCGCCATTGCCCGATGCCTCCGCATCATCCGAAACAACAAGAAAATTCTTCCCTTATCTCGAATCGTTTGGATCTATCACCTATCGCTGCTGCTGCCAGCGCATCTGATGTTGCTGTTCAATCACTGAAACTTTCCATTAAcaatttaacaacaaattctGCGTCAAAAAGTTCTCCactcaacaataacaacaacaatactGAAATGGATATGGATTCTGGTATCGCTCAACATCAAGAGTCCGTCAATAGTGTTGG
CCATCAACTCGAAGAGACCATAACAACCACTGAGAAGCAGAACAACAAGAGGTTGCGCACGACCATATTGCCTGAGCAGTTAAATTTTTTATACGAATGCTATCAGAATGAGTCGAACCCAAGTCGCAAAATGCTCGAGGAGATTTCGAAAAAGGTCAACCTTAAGAAACGTGTTGTACAGGTCTGGTATCAAAACTCACGCGCGCGTGAACGCAAAGGTCAGTTCCGTCAGAAtattcaaataataaataaaaaatgtccgcACTGTGCCGCCATATTTAAAATCAAGTCAGCTCTCGAGTGGCATCTGCAATCCAAACATGGTGATAAGCAAGCCATTAACGTTGATCAAATTCCAGACTTGAAGTTTCCTGACGGCCTGTTGAATCTGTCGGGTGCTACACCTTCAGCGAAtttcgaaatgaaaaatgacGAAAGTGACCACATGCAAAAACAAGAGAAAAAAGAACAAATTCTATCCACAGCAACACCACCCGCAGTGTCTGTCTCATCAGCTGTTGCGCCATCTCTGGGTTCATCTACGGTATCGCCTACCGTAACTGCTGCATCAATCCTTGCTCTAGTTAAAGGGGAAGGAACTCCACTTGATCTTAGCAAGGCGCCTCCATCAACAAATCACCACAACAGCAAGTACGAAcagagtgaaagtgaaataagCTTCTCTGACTCCAATAATGATCACGATGAGTCCAATGATTTTTTTAGTCCCTCATCGTCATTAAATAATGGTAGCGTAGTTGTGAACCGACTTGCAAATGGAAGTAATCAACGTCAGTATGACTCAAGTGCACTCGAAAATGGTGTCAGTTGTATCTTTAATGAGTACAACACCACAACCAATGGTAATAATTCAAATGATTACATTGCAAATGAGCGTGACAACACCAACTCACCGGCTAGTCAGACTTCAAGTAACCACAGCGCACAACAAAAGAAACGCTTTCGCACGCAGATGAGCAATATGCAGGTGCGCATACTAAAGACGCTGTTCTTTGACGTTAAAACGCCATCGATGACTGACTGCGCAAATATTGGCCGGGAGATAGGATTGCAAAAACGTGTCATTCAGGTTTGGTTCCAGAACGCCAGAGCCAAGGAGAAGAAATCCCGTAATCAACGATTTTTACACGACGAAAACACATTCGAGAACGACACATCAACAAAGTCAAATATTGACTCAACAACAACCGCCATTCCAGAAATACGCGATTGCAATATTTGCCATTTGCAAAGTGTCAACATTCAGGAGCATGCTTTCTCATCCCAACATATCGCACAGGTGCGAATTTTACTCGAATCAAGTAGCAGTAAAAATGATGACCAGCAACATGCTGACAACAGCATGGAGCACGAATTTACTAGCATATATTCACAGCTGTATAATCAACAGCATAAGAACAACCACCCGGATGTAGAGCGTTCGGATACAAATCACCATGATGAAGACGACGCTGATATTGATGATCATAGTTTCAATATTACCGAAGTTGATGGCAACATGGATCAAAATGAAGATGACGGCGATAGCACAACTCAGAAACAAAACGAAGATCTGGCTTTAAATGAGGCCAATAAAGCTGCTTTAGCATTGAGAAACTTTAACAAGTTGCAGCAACATTTCGCTGCAGCTACTGCATTAGctaaacagcagcagcagcaacaacaacaacatcagcaaCAACAGCACCCGCAATGTGATGCATTATCATTGGCTTCGCCACCaacaatggaaaacaatttattgCTGAAAATAAACACAGATAATCCGTTAACATCAAATCATTCTGAAATGTTGCAACAGCTCTTTAACTACAGTCAAATGAGTGTTATCGGCGGAAAGTGA
- Protein Sequence
- MSCSDVESFQGKIVYNPDGSAFIIDSENESLSNISDNCMNVGATATNNPKIHSFRVVTARDASVNISEPNKIQKPILMCFICKLSFGNTKSFSLHANSEHTLNLQESEKLLLNREYSSAIIQRNVDEKPQISFLEPLDIQKQHQFMKQSSQTPQHQQQLLIGSTLSSAASNNNFSSNNCSNNNININCSTSTAIPPSTSSSSSSSSASSVSSAAVAAAAAANAALVAAIAASCSNTNSVNTSASISMNTPHLDSDLIMANLGAAGGGGNNNPSYGNTTGTISNNPSSSSSNCSQINHQQRSENFDNLSTLDLSAATVAAQAAAAAVAAAITVDGRHTPPPSSPTSTTSSSPSSSSSSSSISTSSSSAQQQATTIIASGVPVATAPGGPKQSSHSSAAIDAIPATSSISSTATKSTSSNSSSNINNTSSNKISASAVSPPTPTTVADFLQQQFQQMQNQIRITSPTSASAAVVSTSGSNMESNILSSVTSSAPTLSSLTASLAAAAAAVGSGTPGDLTSNSNSVKLINDFLQHQLQHQQHPSYATCPEHPDVKGIDCKTCEMIEINMKSPITPTRSPNSINLFPSSSTMSPTAAAAPSFTIGACPEHINGRPLGVDCSRCEMILNSARLNSGVQMSTRNSCKTLKCPQCNWHYKYQETLEIHMREKHPDGESACGYCLAGQQHPRLARGESYTCGYKPYRCEICNYSTTTKGNLSIHMQSDKHLNNMQELNSSQSMANTAAEIRESPKIIMPNMQQQASKPKPSFRCDVCSYETSVARNLRIHMTSEKHTHNMAVLQNNIKHIQAFSFLQSQNLGQLSAAQSAAVAASNLPNMPNLQNFLPEAALADIAYNQALMIQLLHQNSAAGALSAAAAAAAAANPLTLAPPQQSPAGSGVTTTLPQSASTSQQPSQQPYQTSQPSKLNHPPTPQVLPTTGATPVSVSSTTLSVHQSSPQQQSAVAAAALLAEAAAAGAATTDSATCIQQQQQQQQDSSLDPPIDPDPKPTTAFSCLICANYNTNSIDELNNHLMIDRSRNTNNNCSDIMMIINNNYICRLCNYKTNLKANFQLHSKTDKHLQKLNYINHIREGGVKNEYKLKYNQTNTVQLKCNCCDFYTNSIQKLNLHTQHMRHDTMKMIFNHLLYLVNSFNASMGNNSDVGVGSEISEFQLMSKNKALMCQLCNFSAVNILQMVQHVKSLRHIQVEQFICLQRRSENLESLGLDDVFKIADNTDCVKSERSSPEQCLESHQLSLKQDIEKNATTTPRSCSAMGDGRHDSYADSSQYQQNLGISSVPSATGSSSSSSNNKELSDVDLSTLPSIIYKCNNCDYFAQIKHEMEHHISNMHSNVSEHDYLTIPTNPAALHAFHAAVAAAAVAANAAASQSRSKSSSPVQTQSKRQHHEEQAFATSVCGNSMLPEVKTERMDAMDDPQSNDECDQFEDPTESSNSRSAINVAGSTALNSVMCPLCQDTFSEKKSLEMHLMGVHSVNSDGLARLLQLVDNTQWLNSSRRSSTSTTPEPRSSSTPHSDVGALTLQQQQQMHQQQSSSSASSAHQNVPSEEYACPQCGAAFKLQQHLLMHANDAQHYQLVNEQYQCLAKHCQQLFGNLLQMLTHYKDSHMNIVISERHVYKYRCKQCSLAFKTQEKLNTHSLYHTMRDATKCMICNRNFRSTQSLQKHMEQAHNQLQATGSPIPSPNSSVELTNTPASVTPITSRGDEDTAVNTTAITAAAAVTAATTGTTNQFSKTYVKNYDDDDDDDVGNGQFVVSIQEISNNSTLIKAPPPPQPPAPPPPPPPPPPPPLPPQTAIEILSSDSSNKIPLDDYLNSQQMSEENYTDVERKLKCHKCKVAYTNQTHLAKHYKSNQHRRNEKLSIYPLEKYLDPNRPFKCEVCRESFTQKNILLVHYNSVSHLHKAKKQQSGGMALQVTPITSPVLLTDFVDSTVDSRGQVVEPVVGGSGSTCCGGAGCVAAISKSLNSKRKISLESDYESPKKRFKCDIC
KVAYAQGSTLDIHMRSVLHQTRACRFQEQQKPMTPSSSSNSPTSTMTTPTFNDQVYKSLLETYGFDIVKQFNEINKICPATDSGNYYCRYCSKVFSSVFVLKTHCEEIHNEKIPLELLEKFAEKFKIYYLDQQPPSPSSLLSLSIPNQNESTISLNKSETNLGATTSTACSESSSPAPIADLGAGRSETSPILPSTSQPVAATSPNPVTVSAVASALLKQQQQTLTPDIVQKISLDPTMLAQKIMEQNFQNFPPNFTGLPQNLQNLQSLQSLQNLQNMQQNLPNMSNLPMNTLDMLNIMQFHHLMSLNFMNLAPPLIFGGAGTGQSPSVSGTTPSTVTELPPTPATQLIQQQQTAAASQNTSNQKRARTRITDDQLKILRAHFDINNSPSEESIMEMSKKANLPMKVVKHWFRNTLFKERQRNKDSPYNFNNPPSTTLNLEEYERTGQTKVTPLSESGGGAISGFHLLQQQQQQQQQQREHQQQQQQQQQQQLQQQQLQQQIQSQHLQQQRPPSSQSSDLNFPQLSFHQQQQQLDLSRQQQHQQQQQDNRPLSHPSSVTSDRGDIHIKSEPTDDIGSSDCDQQMMLNKDHDSEQSLMQSHHQQSMFYNNFETKSESGSSEILSRPQTPNSTSTPYSSNISDILGQQMDNLPLNNMSNISNLNNMGPPKKFQMNKMFEKNSNFETNSNSSNSSTSSGKRANRTRFTDYQIKVLQEFFENNSYPKDSDLEYLSKLLLLSPRVIVVWFQNARQKQRKIYENQPNNTFYESEEKKQNINYACKKCNLVFQRYYELIRHQKNHCFKEENNKKSAKAQIAAAQIAQSLSSEDSNSSIDINNANLMSSNIVGQQAAAAAAAVAVAAAVGGAAVPPVPPAIPGLSTSPGMNLLSSPQHIFKQQQSATLATGSHIDSTSPLQKFECDKCQLTFTRYELYKEHQLIHLMNPNLFMNQNYSESSPFGILQNLQGNHSQQQDTSMDLSRQKKRKYSDTQISPDDLQQQQDYESFNKKFKNDQYDFLYQYFMQNETNADLKKQITQQQQPEVDLEYLANFYQQNELKKRSNYDFLFQYYLRNESKQPSSAASLMMLNDDANKPNMEFLLQYYQLSESKKFFQLDASPQRIHDFPPLLNMGSAAAASVNNGVGIQQQQQVSATSPTETIMSSTTKVTAATPSTSPVLLTSSHGGGIGSKDTSSNISGNNQISMPLSATEAKLLSGATAAGGGSGVQTVTLPTTTVNTIISGNSSFIANSMSINRHCPMPPHHPKQQENSSLISNRLDLSPIAAAASASDVAVQSLKLSINNLTTNSASKSSPLNNNNNNTEMDMDSGIAQHQESVNSVGHQLEETITTTEKQNNKRLRTTILPEQLNFLYECYQNESNPSRKMLEEISKKVNLKKRVVQVWYQNSRARERKGQFRQNIQIINKKCPHCAAIFKIKSALEWHLQSKHGDKQAINVDQIPDLKFPDGLLNLSGATPSANFEMKNDESDHMQKQEKKEQILSTATPPAVSVSSAVAPSLGSSTVSPTVTAASILALVKGEGTPLDLSKAPPSTNHHNSKYEQSESEISFSDSNNDHDESNDFFSPSSSLNNGSVVVNRLANGSNQRQYDSSALENGVSCIFNEYNTTTNGNNSNDYIANERDNTNSPASQTSSNHSAQQKKRFRTQMSNMQVRILKTLFFDVKTPSMTDCANIGREIGLQKRVIQVWFQNARAKEKKSRNQRFLHDENTFENDTSTKSNIDSTTTAIPEIRDCNICHLQSVNIQEHAFSSQHIAQVRILLESSSSKNDDQQHADNSMEHEFTSIYSQLYNQQHKNNHPDVERSDTNHHDEDDADIDDHSFNITEVDGNMDQNEDDGDSTTQKQNEDLALNEANKAALALRNFNKLQQHFAAATALAKQQQQQQQQHQQQQHPQCDALSLASPPTMENNLLLKINTDNPLTSNHSEMLQQLFNYSQMSVIGGK
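The coding sequence and the protein sequence can be checked against each other by translating the former. The Python sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly. It removes whitespace, so a sequence wrapped over several lines is rejoined first, and it uppercases the input because the coding sequence mixes upper- and lowercase letters. The function name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop, 'X' an unknown codon."""
    seq = "".join(cds.split()).upper()  # rejoin wrapped lines, ignore case
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First seven codons and last four codons of the coding sequence above.
print(translate("ATGTCCTGTTCTGATGTTGAA"))
print(translate("GGCGGAAAGTGA"))
```

The two calls cover the first seven and last four codons of the coding sequence; their output can be compared with the start and end of the protein sequence, where the final `*` is the stop codon that the protein sequence omits.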
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00313269;
- 90% Identity
- iTF_00315474;
- 80% Identity
- iTF_00313269;