Bole019743.1
Basic Information
- Insect
- Bactrocera oleae
- Gene Symbol
- zfh2
- Assembly
- GCA_001188975.4
- Location
- LGAM02021656.1:515373-604126[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 13 0.082 7.1 7.5 0.4 2 23 243 265 242 265 0.95 2 13 0.00021 0.018 15.6 1.4 2 23 872 894 871 894 0.96 3 13 5.3e-05 0.0046 17.5 0.7 1 23 926 950 926 950 0.92 4 13 0.0093 0.8 10.5 0.4 1 22 1049 1070 1049 1073 0.91 5 13 1.2 1e+02 3.9 1.9 1 23 1321 1345 1321 1345 0.91 6 13 6.2 5.4e+02 1.6 0.7 2 22 1485 1505 1484 1508 0.88 7 13 2.2 1.9e+02 3.0 0.8 1 23 1645 1668 1645 1668 0.91 8 13 0.00054 0.046 14.4 0.8 2 23 1821 1843 1820 1843 0.95 9 13 0.25 21 6.0 2.7 2 22 1941 1961 1940 1964 0.88 10 13 0.0011 0.095 13.4 3.9 1 23 2016 2038 2016 2038 0.98 11 13 0.0097 0.84 10.4 3.3 2 23 2045 2067 2044 2068 0.93 12 13 0.0002 0.017 15.7 2.6 1 23 2772 2794 2772 2794 0.97 13 13 0.014 1.2 10.0 3.8 1 23 2918 2940 2918 2940 0.97
Sequence Information
- Coding Sequence
- atgttaacagACAGTGAAAATAGTCAGTTCTCGTATAATTATTCATCGGAAAACAGTGTTAATAATGTCTTCGAAAAAAGGGACGGTACGGGAACCAGTTATCGTGGAAGATGTTTCGCCAGTAATGAACGAAAACGAGGAATATCCTCTACCTGCGCAGgatattttaaaagatttagACCAGACGCAGATGATAGCGAATACACAAGCAGTGTAGACAGCGATTATGAAGGAAAAGCAGCAAAAAAGGAAAAGGAAGCAGGATTCCAACGTTCTGAATTTCATGCCAATGAATTAACAGCAAAAACCGTACAAAAAACGAGTGAATCTAAGCAATTACTGACGTCAAATGTGGTTGGGGAAACAAGTGCTACAACTCAAAACATAACATCAACAACTGTAGCAagtgcgacaacaacaacattgagtACAACATCGACTACGTTAAAAGAAGCACCAGGTGGTCAAGATGGACAAGGACCCGATGCCTTAAAATGTAGCGGAAGCCCACGTATACATTCATTTCGAATTGTGAGTGCTCAGGATGCTACAACCGCCATCGTTGCTAACACTACCACCATCGACAAAAGCAGTGTGGAGAAAACGGAGCTCAGAAAATCGTATATCGAACAGCAACAGCGTCGCCGATATAGCACCTTCGACGATAACGATAGCGATAGTAGCAACGACCACAAAAGATCTTCCAAAATACAAAAGCCGATATTGATGTGCTTTATATGCAAACTGTCATTTGGGAACGCAAAATCTTTTAGCTTACACGCTAACACAGAACATCAATTGACACTACAAACAAAAGAACAGCATTTATTAAACTGCGAATACTCGAGCGTAATTATACAGCCGCAAAATATGGATGAACGGCCGCAAATATCATTCCTTGAGCCAATTGATGTGCACAATGCAATCCAAAATGATAAATTAAATTCTGAGACTGATATTCAATTTAATGAGGGTACATCGCAAGTATTTTCTTGTAGACTGGATAGTGCGGTTGATACTTCCATCTCTGAAAGTTCGAGTACAGGATGTGTAAGAGCTGAAAAATCGGACAAACCAGCGCCAATATCGTCAAACTCTAACTCATTATCGCCATCTTCATCTTCAGCTTCCCCCGTGTCATCAGCAGTAGTACCAATAGCACATACCGTGTTGTCACCATCACAGATAACATCAACTACGTTGTCCAGCGCAATCGAATGTATTAACGATAGCCATCTATTTCAAACAACCTCATTTTCAAAATGTATGCTCCAGGGGCAGCGAAATGAGGTGAGCGATACTGCAGCAACAACTGCAGTAACAACACTGTTATCAAGCCCTAAATTTGAAACCAGtgaacaacagcagcagcaaaatatAAAGGCAAATGTAGCCATAGAATCGAACGACAGTACTGAGTCCAAAATGCAAAGTGCATCGATATCACGATCACTGATTCCCTTAACAGAGAACATGTTAGCACTAACCCCACCCTTATTACCAAAAAAAGaatcagaaaatattattaaacttgACGAACCATATAGTATGGACAATAGAACTGCATCTGGCGGCGTTTTATGCTCAAGTGCTGATGTTCAACTGAAGCCAAGATTGATAGCGTCTCCGATCCCTACAAATAAAAACACCGCTATGACCTGTGATATGGGCAGTTATGACGAACGCGGTTTATACTCAGATTACATCCGTAGCCCGCCTTTGACAACATTTAGGAAAACTCATGATAAAATGATAGAAATGTCCTCATTAACAGTTAATGCTCCAACTACTCATGGTGACTTTGTACATTTTGAAGCATCTTCAGGAACGGTATCAGAGGAAGGTAATATGTTGAATCCAACAGATACACTGGCAGAAACAGTTACATTTTTgaagcagcagcagcgacaAATTGCAACAATGACCAGTACCTCCTATGCCACCCCTCAACCTGCATTAATGTCCATTGTACCCACGCACACTCAACTCAGCTGCTTACATGCCTCTTTGGCAGCTTTATCTGATGACAGAAATATCAACTGCAGTACTACAGATGCACAAAAAACGAACGCTAAATTGTTCACCGACTTTTTGCAACAGCATcttaatttacaacaacaaaaaacgtatTCCGATGTAAGTGGCAATTGTTCTGAACATGCCGACTATAAGGATAGCGACTGTAAGAATTGCGAAATTCAACAATTAAAGTCGTCTCCGTATCACAGTTCAATACACCAGCTTACCAACAACACGTCGCAATGCTCGCCAAATCGCAATAATGGCAGTTGTACCAGCAGCGCTATTATGAAATCTCCTGCACACATGGCCACATCTCCTACAGCTGTCAATGTCTCAACTGCGACAGTAGCGGCGACAGCGACAACCCAGCAACAAAATGCAGCTGCTGCTGTAGCTGTGgctgctgcagcagcagcagctgcggCAGCTTCAGTGGCTAACAATACCTCAAGTTTTACTATCGGCGCATGTTCGGACCATATCAACGGACGGTCTTTAGGCGTAGAATGCGCTCGGTGCGAAATGATTTTAAATTCAACGCGTCTTAACACAGGCGTTCAAATGTCAACACGTAATTCTTGTAAAACATTGAAATGTCCTCAATGTAATTGGCACTACAAATATCAGGAGACACTCGAGATTCATATGCGTGAAAAACACCCGGATGGCGAGAGTGCTTGTGGCTACTGTTTGTCCGGTCAACAGCATCCCCGTCTAGCACGCGGCGAGTCATATTCTTGTGGATACAAACCATATCGCTGTGAAATTTGTAACTACTCGACAACGACAAAAGGCAATCTTTCCATTCATATGCAGTCCGATAAGCATTTGAATAATATGCAGGAGTTGAATAGTTCACAAAGTATGTTAGCAGCCGCCGTCGTTGCTGCTAATAGTGGAAAGGCGGATGTTGCATCCAAAATGTTGCTATCCAACAGTGCTGCAGCTGCtgctcaacaacaacagcaagcactACCGCAGTCTCAATCCCAACAACCACACTCTACGCTTGTCGCCAGTTCGTCAATGTGTAGTACCTCCGCTTCTGTGAGTGCCAGTGGTCTCGCCAGCGTCGATGGCATTGGAACTAATTGCAACATGCTCAATAAAAATAAGCCGTCCTTTCGTTGTGACATTTGCAGTTATGAAACATCTGTAGCACGAAATCTTCGTATACATATGACCAGCGAAAAGCATACGCATAATATGGCTGCACTGCAAAATAATATCAAGCACATACAAGCTTACAGCTTTTTACAGCAGCATTCGCAAGTTGTAGCCgcacatcagcaacaacaattagcaGCAGCAACACAGGTATCACAATCACAATTACCAGCACTGGCCAACAGTTTTTTGCCAGAAATCGCTTTAGCTGATTTAGCTTATAATCAAGCGATAATGATCCAGTTGCTGCAGCACAACTCTGTtagtcaacaacagcaacaacatgaaTCTGTTGCCAAAATGCAGGTAAAAACTTCACCATGTTCCTCACCGCGTTCCATGCAAATAGAACAGCAATCAACAGCtcaccagcagcaacaacagcaactccAACAATTACGTCAACATTTACAGCAGTCTTATTCACCGAACAATAGTTCATCAATGTTATTACTTTCGGCTGATCCGTTCGGCACCAGCGATGTAAATTTGCCTTCCCACGTTAGTGGTAGCTGCAGTAACGACGAAACGTTGGAGCCACCCATACATGCCGATCCATGTCCAACAAACCTGTATAGTTGCTTAGTATGTGAGGTGTTCGGTACAAATAGTTTAGACGAATTAAACCAACATTTGCTGATTGATCGTTCACGTTGCTGCAACAAGCAGCAAGCTACACCAACAAATGCGGTCGGCAACAGTTGCACCGAGAATGCTAATGTCACTAATACAGTTAATAGCAATGATATAATGGTAATTTTAAACAACAATTACGTTTGCCGATTGTGCAACTATAAGACTAATCTAAAGGCAAACTTTCAGCTGCACAGCAAAACGGACAAACATCTGCAAAAATTGAACTATATAAATCATATACGTGAAGGCGGTACACAAAACGAATACAAGCTAAAACATTTACATCTAGCTACTAACACAGTACAACTTAAATGCAACTGTTGTGATTTCTACACAAATTCCATCCAAAAGCTGTCATTGCACACGCAGCATATGCGTCATGATACGATGCGTATGATTTTCCAACATATACTGCATATCATGGAACAGCAAAATTTTGAGCGAAACAAAACCAAAGAGATGGCACCACACTCAAAAACTCATTCACCTAAAGATGCAACTGAAATTGTTCCAAGTATTCGTGAGTCGCACAGTGTTGTGCCTGTTCATGAAAATGATGGGAGTGACACCAACCACAGCAGCAGCATTGCAGTGGAAGACACGCCGCAGTCGATGCAAAAGTCTTTAACATGTCAATTGTGTGATTTTAGCACGTTTACGTTGCTTAATATGATCCAGCACGTGAAAAGTATGCGTCATATGCAAATAGAACAGTTTGTAAATCTACAACGACGCAGCGAGCAACTTGATCCTCCTAGTCTGGATGACATTTTCAAGATGGTAGAACGACCTACACTACAGTCTTCGGCACTAACAGCTGCTGCAAGACAAGAAGaAAATAACAGAATGGAAAATGTTACATCTGTTTCGCGATTCGGCAATTTTCCAATGCCACTGGCACGTTTATCAAACGATGATTTCTCTAATAGAGATCACAACTCTTTTTCACCGTCATCTAACTCCTCAACTGCTTCAGGAAACACTAGTGTGCGCAGCAAAAGCTATCAgagtgttaatatttttaactttcaaaCAGCAAGTGATGACAAAAATGCTTCACCATCAGCACTTCCGGGAGGTAGTCCCGCCGTCATGCCCTCAGTTGTATTTAAATGCAACAattgtgatttttttgcacattcCAAAGCTGAAATGGAACTACACCTGAGTGCAGTACATCCTCAAACTGAGCCAGACTACATTAGCATCCCAACGAACTCTGCTGCCATACAAGCATTTCAAGCAGCTGTAGCGGCAGCGACAGCAGCAGCTTCTGCAGTGACTGCCACCCGTGCTCCTAGTAACAAATCTAATACAGAAAATGATGATTTCCCTACTATAATTAAAAGAGAGCGCCTCTGTACGACAGAAGATGAAGAAACCGTAAAGTCCGCAGATGTGCAGAATACAATGTCGCTTCAGGATGTGTCGACAATTACACCTTGGTTCACAAAGACAacgaatacaataaataattacgaAACTTCAGCAGCTATGGTTCAACCTAAATCAATTCATAATGTATTACATGAGTTGGAAAAGACCGAGGAACAGAAAGCACAGCAAATTGAAGATACAAGTGGGGATGCATTGATTGAAAGTACCAACAAATATTCCGGAGAAGCAGTCAATGTTCAGTGTCCCCTCTGTACAGAAACATTTGATAGCAAACATACATTGGAAACGCATTTGATGAATATACATAGTGTGAATCACGATGGTTTATCACGTCTCCTACAATTAGTAGACACTAGCGCTTGGGATTTGACTGGTAAAACCGCAACAACGCCTACTATAGCTAAAGACTGTAAGGACTCAAATAGCGCCTCCgaaacaacaactaaacaaaCTATCACCGGTACTATTAATATTAAACCATCGACAGAATTAGAGTTATCGCTAATCGAGGTTTCAAATGACATACCTTTAAATCTCAATGCTTCTCATACCTCGACTTACATGCATTCCATAGAAtcgttaaataataataaattgtcaTGTGAACACTGTGGTTCAAAATTCAAGCATGAATTACAATTATTGCAACATGCGCAAAAAATGCAACATTTCATAATCTTGCCAAATGGTGGTCACCGTTGTTTGGCTGCTAGTCATCCCAGCCGACCGTGCCATTCGACGTTTCCGACTCAAGCTTCAATGGTCATCCATTATAAAAATACTCATATTAGCTTAATCATATCCGAGCGTCATGTGTATAAATACCGATGTAAACATTGTTCACTTGcatttaaaactcaagaaaaacTATCTACACATTTGCTGTATCATACGATGCGGGAAGCCACCAAATGCACGCTCTGCCAACGAAACTTTCGTACGACTCAAGCGTTACAAAAGCACATCGATCAAACACATCATAGTAGCGGCGATCAACATGTCGTCAGTAACAGTCCACCGACAGTATTAAGCTATAGCGGCCGTGGATCGCCAGCCTTACAGATGAGTCACGTGCAAAACAAATCCCAATTAAATGACAATGGCAATGCAGCTAAAAACCAAGACATCAatgTCCGTGCATCGCCCCCTACGACACCGAAAACAAATGAATCTTCAAAGGTATCGTCTTCCCCATTACCAGCTAACATATTGCAAgaccaacagcaacaagaacatGCTGCTGCTTACACCGCTGCGTTATTAAATCAGTCAACAcctctacagcaacaacaacatatcagCGGCGAAGAACTAACAGAGAGCGGCTGTCATCTTATGCAGCATACACAAATGAAACCACGTAGTCCCCTCTTAACACAAAAATATCTTCAACAACAGAATCTTCAAAACCTTCAACAATTGCCACAATTGACGGCAGCAGCAGCTACATCCGGCTTTCAATTGAATCCTGTCGAGATATTTAATCTTATGCAGTTTCATCACATAATGTCAATGAACTTTATGAATCTTGCACCGCCGCTTATATTTGGGGTCGGCACTAATTCTGGCTCTAGTGACAATGTTGATTTGTTAACACCCACTCACATACCTAAAGTTACTTACAATCCCAGTGCCAATACACTTCTATGTGGGAACGATATGTCGGTACCAGGCACACCAGTGGCGCCGCGAGCGGACCTAATAGGTGCATTACAGCAACAGCCTCAGTCCCAATCACAACAATCTGCACCAACTAGTACCGTTCAGATGGTAAATAATCAGAAACGTGCAAGGACGCGTATCACTGACGACCAGCTTAAAATTCTGCGAGCTCATTTTGATATTAATAATTCGCCAAGTGAAGAAAGTATAATGGAAATGTCACAAAAAGCGAATTTGCCAATGAAGGTCGTCAAGCATTGGTTTCGGAATACGCTGTTTAAAGAACGTCAAAGAAACAAAGATTCCCCTTACAACTTTAATAATCCACCATCTACAACACTAAATTTAGAAGAATACGAACGCACTGGACAAGCAAAAGTTACGCCGCTGAACGAGGATTTACCTACGGCATCGaaTAACTctatgcagcaacaacaaaagcaaaatattcatGAATCGAATcccaaaaataattcaaaagaaaaaccaACCACTATGACAGACATAAGTGGCGATTCTTCGTTATTAATTGACATCAAAGCTGAACCCCGAGATGATAATATTGAGGCATACATTACAACTCCAGGGAGTTCACAACGTCAAAATGAACAGCAGCAACCTCAAGAGCAAAGGCAACGAGAACTGCATAAGAACGACGAAAAAGATTTATTGAGCACGTCATCGGCATTGCTGCTGCATAAACGACAACAGCATTTATCCGCATTGTCAGCGTCtgaacatcaacaacaacttcTTGAAATACCGGGAAACACAGTAGTGGGAACGGCTCCTGTTACGGCGTTGCAGCAGcatcatcaacagcaacaacatttgcatggtcaatcaaatcaaaatattcACCAGCAACATCCACAGCATATCAACCTATACAGTTATGAAACCAAATCAGAAAGCGGCAGTTCGGACATCCTTTCCAGACCTCAGTCACCAAACAACAGTTCTACAGTTGTACCGACACACTATGCCAGCATTAATGAGTTAATAAATCAACAATTGGATAATTTGCCGCTCGGCCATAATATCAGCAATATAAATGTTGGCAACATGCACGGTAACAACATGGGCCCgccgaaaaattttcaaaccagcAAATCTTTCGACAAAAATTCACCGACTTCTCAGTTTGACACCAATTCGAATTCATCGAATGCATCGTCCACATCGAGCGGAAAGCGCGCAAATCGCACACGCTTTACGGACTACCAAATTAAGGTACTGCAGGAATTCTTTGAGAATAACTCCTATCCAAAGGATAGCGATTTGGAGTATTTAAGCAAGCTATTATTGCTTTCTCCACGAGTTATTGTTGTCTGGTTTCAGaatgcTCGCCAAAAACAGCGTAAGATTTATGAAAATCAACCAAATAATTCACTTTACGAAACTGAAGAGAAAAAGCATAATATTAACTACGCCTGCAAAAAGTGTAATATGACATTTCAGCGCTATTACGAGTTGATACGCCATCAAAAAAATCATTGctttaaagaagaaaataacaaaaagtcgGCCAAGGCACAAATAGCCGCAGCACAGATCGCACAAAATCTCAGCTCAGAAGACTCCAATTCTTCGATGGACATTAATAATAGCAGTGCGTATCAATTACAGCATCAACATATTTCTAATGCTGCAGCAACAGCGGTGTTGTCAAGTACCGCCAACGTTTCCAGCACATCACCTAGCAGCACAGCTCCTGGCGTTACATCGCCACAGCATTTATATGGTAAATCGTCAATGTCAATGACTGATTTTAGTCCATCCACTACGCCCACTCCACCACAGACACAGCGTGAGCGCAGCGATAGCAGTGAATTGCTGCCGCAAGGTCCAGTGCACAAGTCAAAATACGAATGTGACAAATGCAAATTACAATTCAGTTACTACGAACATTTCCGTGAGCATCAGCTACTACACCTAATGAATCCCAGCTTATTTACCAcacaaattacaaacataccAGAGGCTTACGGTTCATTTGGCAGCATATTGCAAAGCTTACAGCAAGTTGCAGCTGCCAGTGCCACGCATCAGCAGCACCACCAATTCCTAGAACAGCAAGACCAACCACCCGCAAAAAAACGTAAGTGCTCTGAAACATCTTCAATAGCAGATGATGTGTCATCAATTTTTGGCACCGGTGATGGTGAAATCAGCAACCCTGTCTCATTTTCGTTATCAAACAGcaaaaaatacgaatttttatatCAGTATTTTATGCAAAACGAGAGCAACAACGAATTGAAACAACAATTTCaagcacaacaaaaaaagtCACATGAACCAGAAATCGAAATggaatatttaacaaatttttaccaCCAAAGTGAGTTACGAAAGCGTAGTAATTATGATTTCTTGTatcaatattatcaaaaaaatgaGCACACACAGCAAATGAGTGCTTTGCCGTCCCAAGCGGGCGTTTTCGGTAGTGAAAACAAACCAAATATAGACGTTCTACTCCAGTACTATCAACTGAATGAATCAAAAAGGTTTTTTCAGTTAAATGCCTCGAACCAAGAACTAAATGACCTTACGGCAGCGTCGCCAACATCCACATCACAATACCAACATGTAAATAATCCGATACCAATATCGAATAGTGATCATACAACCACACCTAGAGTTGATCGGTATGATATAACCGATATGAGTTTATTGAAAAACTCTCTGGATGTCCCTAatagtagtattaattttacAGATTGCGATAATGACAACAACGACCACGAAATAAGTAGTAATGGTTGTGGCATGGATGTTGACGACGCTGAAACTTGTACTGACACTGATGATCATATTAATGACAATAATCAAAAATCGAATGAAATCTACGGTCACAATAGCCGAATCAACATAGTGAAGAGTAGTAACGATAATGATAATAAGGATAATGAACAAGCAGCAAATAAATTTGCGAAAATGGATCCAATTAATGAACTCCCTACTAGTCTGCAATCTCCATTTTCTTATAGCAATGAGCAACATAAGCTCAAGGACTTAAGTAAATCTCTTGCCAGATgcggaaaaaatcaaaaacataatTTGACAAAGCGTAAAGCATCAAATGTGTCATCAAAATCAAAtacgacaacaacaattaataatGAGTATATTGATCATTTAAACGATTTTCTTCATGTCAATCAAAAATACGAATCCAAAGAGAAAAGTAAACAGCAGCAGCATTATAACGATTTACTACAAGCACAAAATGAAACCAACGGTCTTGGGAATAATGGATCAGCCCGACAAGCAATCGATAAGAGACAATTAGCAGAGACTACATCATCATCACCGAATTCcactacaaatttaaataacaaaccTACCGTTGGGTCttctaccacaacaacaaacacagtcACTGGTATTtctgaaaaacaacaaagcaaacgATTACGCACTACAATACTGCCGgagcaattaaattttctatatgaATGCTATCAAAATGAATCGAATCCAAGTAGAAAAATGCTTGAGGAGATCGCAAAAAAAGTTAATCTTAAAAAACGGGTTGTACAAGTTTGGTTCCAAAACTCACGTGCCAAAGATAAAAAGTCACGCAATCAACGCTTTACTGCAATATCCGATGACAGTAACTACGAGGATGCTTCGCAAAATAATCGAGACTACGGActctttcaaaaaaatacattgAAGTCCACTACAATCCACGGCAACAAGGGCAACATGAACTTTTTACCTGATGCTAATATAAGTAATAACACCAACACTGGAATGGAATTAGGCGGCTGTAATTTATGCCAACTGTCTCAAGTAAATGTTCAGCAACATGCCTTATCTATTGAGCACATATGTAAAGTAAAAAAGCTGTTGGAGCAAACCGCAGTTGATGTCAAACTTAATATCGATTCGAAAGTTTTTTCAACTGCAGTGACTGTGGAAAATGAAGaattcacaaataaacaaagtgCAACTGGCCAGCCAAATACTAACGCCGATTTAAAAGAAGGTAAATATGTCACGAGTGACGAAGCTGCTCTTGCCCAAGATAGTGATTTCAAGACGTATGTGAATACTGATATTACCGGCAACGATAGACTTGTCACTTTTACGAGAATATATAAGGAGCtgaaattacaaaatacaaTGAAACGAGCCATCGATTATACATCCAATAACGAAAACAGTGAATTAATGGAATGGTATGAATGTGgcaatattaataatgaaagcAATGGAGCAAAAACAAACGACATAGGTGACAATCGATGTAAAGGGCGAAAACCACAGGTTGCTAAACAGacttcaaaagaaaataatgaaaaacaagaaaggGATAAAGTTATTAATAGGTCATACAGTGCTGAGAATGTTGAGAACACTGATATTGGAAGCGTCAACAATATTGTCAATCACAAATTAATGGAAactcaatataatattaaatatagtgAACGCAGCAGCAAAAGTGCcatttttgtttcaaatgcACCGCTGGCAACAAGTATTACTGATGATGAAGATAATTATATAGACGAATACAAGGATAGTAGTGATGGGAATACGATAGATCTAAATGATAATAATGAGACCATACAAAACCCATCAACCAGAAATGAAACAATAACAGCAAGCAGTAACAATAGCAGCACATCTTGTACATTAACAACCAGTACAACGCATACGCAATTAACTAATACGCATGCTCAAGATATTattcaacaattatttaattgtaatcaAATAACTGTTTCGAGCGGCAAATAA
- Protein Sequence
- MLTDSENSQFSYNYSSENSVNNVFEKRDGTGTSYRGRCFASNERKRGISSTCAGYFKRFRPDADDSEYTSSVDSDYEGKAAKKEKEAGFQRSEFHANELTAKTVQKTSESKQLLTSNVVGETSATTQNITSTTVASATTTTLSTTSTTLKEAPGGQDGQGPDALKCSGSPRIHSFRIVSAQDATTAIVANTTTIDKSSVEKTELRKSYIEQQQRRRYSTFDDNDSDSSNDHKRSSKIQKPILMCFICKLSFGNAKSFSLHANTEHQLTLQTKEQHLLNCEYSSVIIQPQNMDERPQISFLEPIDVHNAIQNDKLNSETDIQFNEGTSQVFSCRLDSAVDTSISESSSTGCVRAEKSDKPAPISSNSNSLSPSSSSASPVSSAVVPIAHTVLSPSQITSTTLSSAIECINDSHLFQTTSFSKCMLQGQRNEVSDTAATTAVTTLLSSPKFETSEQQQQQNIKANVAIESNDSTESKMQSASISRSLIPLTENMLALTPPLLPKKESENIIKLDEPYSMDNRTASGGVLCSSADVQLKPRLIASPIPTNKNTAMTCDMGSYDERGLYSDYIRSPPLTTFRKTHDKMIEMSSLTVNAPTTHGDFVHFEASSGTVSEEGNMLNPTDTLAETVTFLKQQQRQIATMTSTSYATPQPALMSIVPTHTQLSCLHASLAALSDDRNINCSTTDAQKTNAKLFTDFLQQHLNLQQQKTYSDVSGNCSEHADYKDSDCKNCEIQQLKSSPYHSSIHQLTNNTSQCSPNRNNGSCTSSAIMKSPAHMATSPTAVNVSTATVAATATTQQQNAAAAVAVAAAAAAAAAASVANNTSSFTIGACSDHINGRSLGVECARCEMILNSTRLNTGVQMSTRNSCKTLKCPQCNWHYKYQETLEIHMREKHPDGESACGYCLSGQQHPRLARGESYSCGYKPYRCEICNYSTTTKGNLSIHMQSDKHLNNMQELNSSQSMLAAAVVAANSGKADVASKMLLSNSAAAAAQQQQQALPQSQSQQPHSTLVASSSMCSTSASVSASGLASVDGIGTNCNMLNKNKPSFRCDICSYETSVARNLRIHMTSEKHTHNMAALQNNIKHIQAYSFLQQHSQVVAAHQQQQLAAATQVSQSQLPALANSFLPEIALADLAYNQAIMIQLLQHNSVSQQQQQHESVAKMQVKTSPCSSPRSMQIEQQSTAHQQQQQQLQQLRQHLQQSYSPNNSSSMLLLSADPFGTSDVNLPSHVSGSCSNDETLEPPIHADPCPTNLYSCLVCEVFGTNSLDELNQHLLIDRSRCCNKQQATPTNAVGNSCTENANVTNTVNSNDIMVILNNNYVCRLCNYKTNLKANFQLHSKTDKHLQKLNYINHIREGGTQNEYKLKHLHLATNTVQLKCNCCDFYTNSIQKLSLHTQHMRHDTMRMIFQHILHIMEQQNFERNKTKEMAPHSKTHSPKDATEIVPSIRESHSVVPVHENDGSDTNHSSSIAVEDTPQSMQKSLTCQLCDFSTFTLLNMIQHVKSMRHMQIEQFVNLQRRSEQLDPPSLDDIFKMVERPTLQSSALTAAARQEENNRMENVTSVSRFGNFPMPLARLSNDDFSNRDHNSFSPSSNSSTASGNTSVRSKSYQSVNIFNFQTASDDKNASPSALPGGSPAVMPSVVFKCNNCDFFAHSKAEMELHLSAVHPQTEPDYISIPTNSAAIQAFQAAVAAATAAASAVTATRAPSNKSNTENDDFPTIIKRERLCTTEDEETVKSADVQNTMSLQDVSTITPWFTKTTNTINNYETSAAMVQPKSIHNVLHELEKTEEQKAQQIEDTSGDALIESTNKYSGEAVNVQCPLCTETFDSKHTLETHLMNIHSVNHDGLSRLLQLVDTSAWDLTGKTATTPTIAKDCKDSNSASETTTKQTITGTINIKPSTELELSLIEVSNDIPLNLNASHTSTYMHSIESLNNNKLSCEHCGSKFKHELQLLQHAQKMQHFIILPNGGHRCLAASHPSRPCHSTFPTQASMVIHYKNTHISLIISERHVYKYRCKHCSLAFKTQEKLSTHLLYHTMREATKCTLCQRNFRTTQALQKHIDQTHHSSGDQHVVSNSPPTVLSYSGRGSPALQMSHVQNKSQLNDNGNAAKNQDINVRASPPTTPKTNESSKVSSSPLPANILQDQQQQEHAAAYTAALLNQSTPLQQQQHISGEELTESGCHLMQHTQMKPRSPLLTQKYLQQQNLQNLQQLPQLTAAAATSGFQLNPVEIFNLMQFHHIMSMNFMNLAPPLIFGVGTNSGSSDNVDLLTPTHIPKVTYNPSANTLLCGNDMSVPGTPVAPRADLIGALQQQPQSQSQQSAPTSTVQMVNNQKRARTRITDDQLKILRAHFDINNSPSEESIMEMSQKANLPMKVVKHWFRNTLFKERQRNKDSPYNFNNPPSTTLNLEEYERTGQAKVTPLNEDLPTASNNSMQQQQKQNIHESNPKNNSKEKPTTMTDISGDSSLLIDIKAEPRDDNIEAYITTPGSSQRQNEQQQPQEQRQRELHKNDEKDLLSTSSALLLHKRQQHLSALSASEHQQQLLEIPGNTVVGTAPVTALQQHHQQQQHLHGQSNQNIHQQHPQHINLYSYETKSESGSSDILSRPQSPNNSSTVVPTHYASINELINQQLDNLPLGHNISNINVGNMHGNNMGPPKNFQTSKSFDKNSPTSQFDTNSNSSNASSTSSGKRANRTRFTDYQIKVLQEFFENNSYPKDSDLEYLSKLLLLSPRVIVVWFQNARQKQRKIYENQPNNSLYETEEKKHNINYACKKCNMTFQRYYELIRHQKNHCFKEENNKKSAKAQIAAAQIAQNLSSEDSNSSMDINNSSAYQLQHQHISNAAATAVLSSTANVSSTSPSSTAPGVTSPQHLYGKSSMSMTDFSPSTTPTPPQTQRERSDSSELLPQGPVHKSKYECDKCKLQFSYYEHFREHQLLHLMNPSLFTTQITNIPEAYGSFGSILQSLQQVAAASATHQQHHQFLEQQDQPPAKKRKCSETSSIADDVSSIFGTGDGEISNPVSFSLSNSKKYEFLYQYFMQNESNNELKQQFQAQQKKSHEPEIEMEYLTNFYHQSELRKRSNYDFLYQYYQKNEHTQQMSALPSQAGVFGSENKPNIDVLLQYYQLNESKRFFQLNASNQELNDLTAASPTSTSQYQHVNNPIPISNSDHTTTPRVDRYDITDMSLLKNSLDVPNSSINFTDCDNDNNDHEISSNGCGMDVDDAETCTDTDDHINDNNQKSNEIYGHNSRINIVKSSNDNDNKDNEQAANKFAKMDPINELPTSLQSPFSYSNEQHKLKDLSKSLARCGKNQKHNLTKRKASNVSSKSNTTTTINNEYIDHLNDFLHVNQKYESKEKSKQQQHYNDLLQAQNETNGLGNNGSARQAIDKRQLAETTSSSPNSTTNLNNKPTVGSSTTTTNTVTGISEKQQSKRLRTTILPEQLNFLYECYQNESNPSRKMLEEIAKKVNLKKRVVQVWFQNSRAKDKKSRNQRFTAISDDSNYEDASQNNRDYGLFQKNTLKSTTIHGNKGNMNFLPDANISNNTNTGMELGGCNLCQLSQVNVQQHALSIEHICKVKKLLEQTAVDVKLNIDSKVFSTAVTVENEEFTNKQSATGQPNTNADLKEGKYVTSDEAALAQDSDFKTYVNTDITGNDRLVTFTRIYKELKLQNTMKRAIDYTSNNENSELMEWYECGNINNESNGAKTNDIGDNRCKGRKPQVAKQTSKENNEKQERDKVINRSYSAENVENTDIGSVNNIVNHKLMETQYNIKYSERSSKSAIFVSNAPLATSITDDEDNYIDEYKDSSDGNTIDLNDNNETIQNPSTRNETITASSNNSSTSCTLTTSTTHTQLTNTHAQDIIQQLFNCNQITVSSGK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00193133; iTF_00192637; iTF_00192638;
- 90% Identity
- iTF_00193133; iTF_00192637; iTF_00192638;
- 80% Identity
- iTF_00192637;