Plon041521.1
Basic Information
- Insect
- Pachydiplax longipennis
- Gene Symbol
- ZFHX3
- Assembly
- GCA_036926295.1
- Location
- JAVFKF010000009.1:71111049-71129398[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 19 1.6 6.5e+02 3.3 0.2 2 22 442 462 441 464 0.87 2 19 0.00042 0.17 14.6 1.8 2 23 924 946 923 946 0.95 3 19 0.0001 0.042 16.5 0.7 1 23 978 1002 978 1002 0.92 4 19 0.015 6.2 9.7 0.4 1 22 1076 1097 1076 1100 0.91 5 19 0.55 2.2e+02 4.8 0.1 2 23 1622 1644 1621 1644 0.95 6 19 0.059 24 7.9 0.2 1 23 1685 1708 1685 1708 0.94 7 19 0.25 1e+02 5.9 0.3 2 21 1803 1822 1802 1826 0.88 8 19 0.29 1.2e+02 5.7 0.4 3 23 1842 1865 1840 1865 0.88 9 19 0.011 4.2 10.2 1.6 1 23 1903 1925 1903 1925 0.97 10 19 0.014 5.7 9.8 0.5 2 23 1932 1954 1931 1954 0.94 11 19 0.05 20 8.1 3.2 1 23 2077 2101 2077 2101 0.91 12 19 0.017 6.8 9.6 0.7 1 22 2121 2142 2121 2145 0.89 13 19 0.0037 1.5 11.6 0.2 1 23 2204 2228 2204 2228 0.93 14 19 0.28 1.1e+02 5.7 0.2 1 21 2447 2467 2447 2468 0.92 15 19 0.78 3.1e+02 4.3 1.4 2 19 2678 2695 2677 2700 0.91 16 19 0.00026 0.11 15.3 2.5 1 23 3566 3588 3566 3588 0.99 17 19 0.0018 0.72 12.6 0.3 1 23 3697 3719 3697 3719 0.98 18 19 0.028 11 8.9 0.2 2 23 3906 3928 3905 3928 0.95 19 19 4.9 2e+03 1.8 3.0 3 22 4319 4338 4318 4341 0.88
Sequence Information
- Coding Sequence
- ATGCCAACACCGACCACAGGCGAAACGAGCCAACCGTCTCCGCCCCTTCCAACAGCCAAGCTAGACGAGGGGTGTGGAGTGAGGGAAGGCGAAGAGGCCGTCACAGACGAGAGAAACTGCCTCTCATCCCTTCAGCGGGGTCGGAGGTCATCCGGGGGCAGGGAAGAAGGAGGAATCCGCAGGGGCCAGGAGGAAGGCGATGAGGAGGAcgaagaagaggaggaggaggaaagtGTATTAGCGGAAGAATCAACGGAAGGAATGAACAACGGTCCTCTGCAAGGTCTTCAGAACCTTTCCGCGTCGTCCCTCGCGGCGGCGGCATTGGCGGTCAATCGGACGGCCAACAGCCTCGTCGGAGGCACCAGCGCGGTGGTGACTTGCGGCAGCGGACTATCCCTCCTGCAAGGCGGCTTGGATAGGGTGAAGAGAATAAGATTGGAAGGCGAGGAAGAAGAAGGCTTACCCATGAGCCCGGAAGAAGACGCAGAGGCCGCGGTGGCCATGGTACGTGGGGAGGCATCCCCAAAGTCCGCCTTGTCCTTGAGAGGAATCTCTCAGATCAGGTGCAAAGAAGAGGTGTTAGAGGACGAAGACACGGAAATGAGGGAGGAGGACGAGGCTCGATCCCTGAACTCCCAGGCCAACGAGATGGAGGAGGAGATGGACATGGAGGATGGCAGGAGTAAAAGGGCGGCGGGATCGTGTGCGGGAGGCGACGAGGAGTCGTCGACGAGCGACGTCGAGAGGTTTGACGGCAGAATAGTTTACAACCCCGACGGTTCGGCTTACATCATCGAGGAGAGCGAGTTGAGCGATGAAGAGAGCGGTGGTGCGAGCAGCGTGGTGCTGCCGCGAATTCTGGGAGATGGTTGCATCGTAGACGGTAGGGGCGTCTCCCTCTCTCAGTTCCAGGTGTTTCCGCAGATCGCCAACGCTTTCTACGTGTCGAGGAGCAGCGTGGCGTTGTACAGCGCGCTCTACGGTTCCGCCGCCGGTGGGGCCGCCGTCCTCCAAGGGGAGAAGAAAATAGTTCCTGAAGTACCAATAATGCATAGTTACAGGGTTTACACCGTTCGTGATAATAAAACTGGTGATAGCGGGAAGGACGGGAGCGGTAGCTCGGCAGAGGAATCCCGCCTTGAAGAGGATGCAAGGCCAAGTGACCTCTCCAGTAAGGGTGCGAGGGCGAAACGGAAGGCGGGGCCCGGAAGGGGCCTGGAAGCCGAGTCAGCCCCAGAGGATGACGAAAGAGAAGGAGGAGATGGAAAGGAGGGCAAGGGTCCGATGGACTGTGCCTCCGTACCCGTCAAGCCCATCCTCATGTGCTTCATCTGCAAGCTGTCCTTTGGTTACGCCAAATCATTCGTTTCCCACGCGACCGCGGACCACGGCGTTTCGCTTCTCGCAGATGAGAGGGCGCTACTGGGTCACCGAAATGCCTCCGCCATCATCCAGTGTGTTGGGAGGGAGAAGGAGCCCTTAGTGTCATTCCTAGAGCCCGTATCTCCCTTCCCTGAAGTGGTAGGAATCCCTCCCCCGCTTCTCCCCGCCCTCATGGGGCAGCAGAGGCAGGAAGGCTCTCCGGCCTCCGCCGCCGCCGCCATGGCGACTATGGCGACGATGGCGATGGCCGCGGCGGCGGCGGCTTACACCAACGCGGCCGCGGGGGCGTCGACCACTCCAGTCTCGGCCCATGGCACTCCAAAGCCCACGCCGTCGCCCGTGGGACGCAACTCCACGTCGCCCTGTTCGGCTGCCAGCCAGCAAGTTCCTAAAGACCACCAAAATTCTGGTACCAACCGGCAGCACGGTGAGGGTCAGACAGGAGGCGATGATACGGCGGAAACGGGTTTGCCGAGGCCCTCGCCACCTTCTTCGTGTTCTTCGGCGGCGTCTGCATCTTCGGCGTCCTCGTCATCGTCGTTGGCATGCTCTCCTCCGATGCAGCACCCGCGGCTTCAGGCGAGCCTGGAAGGAGGCGCGGAAGATGGGCAGGAAGGAGGCAATGGAGCGGCGAGTGAATCTGCGGAGAAGGAGCACGCGGAAGCAATGAGAGTTGGAGAATCCTTCGGAGATGAGGCGGTGGGAGTGGTGCAGGGTCGGCCTTTTGGCCGTTGCCCATCGCGGCAGGACGAGATGGCCTCGCTGAGGTCCACGAATCCTCAACCGCATCAAGTTCCTGAGACTTCTGTCTCCCTACCTCAAAACGGtagtaatagtagtaataatataaatgtaaatgttaataacCCTGGTTTGAACCCTGGAAGCATTGTGAACAGTAACAACGGCGGCAGCAATGGAAGCCGCGGAATGGACTTGACAAGGAAGAGACCTAATAGTGTTTCACCTGTATCGACGGCCAACATGGTGGGAGGGGGCTCCCCTCCTACGATGAGTCCTGTGGCCCTCATGGGATCACATCATCACCTACACCACTCCATGAACGGACCCGGAGGCGGAAATTTCGTGTCACCTCAGCCACCCCTTCCGCCCCCTCCACCACCGCTAGGACCTCCTCCACCGCCTCCTAGTTTCCTCACGGGGACCACCATCGGTGTTTGTCCTGAGCACATAACCGGAAGACCAAGCGCGGTAGATTGTGCCAAGTGCGAGATGATTCTCCAGACTGCAAGGCTCAGTGCTGGGGGACCCGGAGGCCCCGGAGCTGGACTCCTGGGTCCAGGCGCCGGGGGACTTGGTGGTGCCGGCGGGCCTGGAGGAATATTCTCCGGAATGCACTCAAGAAATTCGTGTAAAACCTTGAAATGTCCAAAATGCAACTGGCATTACAAATATCAAGAGACCCTGGAGATACATATGAAGGAGAAACATCCCGAGAGCGAGACTTCCTGCGTTTATTGCGTTGCCGGTCAACCACACCCACGATTAGCACGCGGAGAGACGTACACCTGTGGATACAAGCCGTACAGGTGCGAAGTTTGTAATTATTCCACGACCACCAAAGGAAACCTCAGCATACACATGCAGTCAGACAAGCATCTAAATAATATGCAAGAGCTTCAACAAAACGGAGGTGTGGTTTCCAGTCAGGGCGGAGGTGGAGGTGCAGGAGGCGTCTCCGACGTTCCCACATCCTCCGTATCAGTAGCGACCTCCTCTCCGTCCCATATGGCGAAAGGAGGCGCGGGGGCTTCGGCGCTTAGCCCAAGTCTTTCGTCCACGCAAGCCCAACAACAGCAAGCGAAGCCTAAACCGACCTTCCGATGCGATGTCTGCAATTACGAGACCAACGTCGCGAGAAATCTCAGGATACACATGACTAGTGAGAAGCACACGCACAACATGATGGTGCTCCAGCAAAATATGAAGCACATGCAGCAACTGAGTGTGCTTCAACAGCAGCAACAGCAGATGGGTGGAGCCGGCGGCGGAGCGGGAGGATTCGATCCAGCTACGGCAGCAGCAGCGGCCGCGGCACTCCTCCACTTTCATCCGGGATTGACTCTTCCGGGAGAGAAGCCCCCGCCGCACACGGAAGCCGCCCTCGCGGACATGGCTTACAATCAGGCGCTACTCATTCAGATGATGACGGGAGGTCATATGTCTGCGCACGGCCCGCCTCCGCAGTCTCTCCCTCCGCCCCCTCACCCGGCCGATTCGAGACATCAACATCACGGACATCCCGGACACCACCACGGATATCCATCGCCCCATCACCATGCGCACCATCACGCCGCGGCGGCAGCAGCCGCACAGCAGCATCACATACATCCACACTTCGCTGCTGCAGCCGCTGCCGCAGCCGCCGCGGCCGCCGCCGCCGATTTGACCGGAGGAGGAGACGCCGGACTCAACCCTGAGACGATGGAACCTCCCCCGGAGCCTCCGGAGCAGAATCCCGCCATGCTATTCACCTGCTGCGTCTGCAACGCATTCGCGACGGACTCTTTAGAGGCTCTCTCATGTCATTTGGCGACGGACAGGACGAAACTAAGAGAGCAGGAAGTTCTTATGTTGGTGGCCGGCAGCTATGTCTGCAAGTTGTGCGCTTACCGAACGAATCTCAAGGCGAACTTCCAGCTTCACTGCAAGACGGACAAACATCTGCAAAGGCTTCAGCACGTTAACCACGTGAAAGAAGGAGGCCCTCGGAACGAATGGAAGCTGAAATACCACCTCAGCAGCGTCACCACAAGCAATCCAGTTCAGGTGCGCTGCAACGCATGTGACTATTACACAAACAGTGCCCACAAGCTGCAATTGCATTGCGCGGGTGCGAGACACGAGGCTAGTTGCGCCCTCTTCAGGCATCTTTTATCTTGTGAAGAAGCCCTGGCTGATAATCCATCGTCTCCCACCGCATTGGTGCCCGTTTCCGGAGGGTCGCCGAACACTCAGCTGCAACAGAATAATAAGATATATCATTGCGCGCTATGCAACTTTTCTGCCAGAGGATCACGTCTTCAGTTGCTGCAGCACGTCCGTACGTTAAAGCACATGCAAATGGAGCAATTACATCAGCTTCAGCGACGTAGCGAAGGCAGAGAGCTACAGGTTGACATTTCGGAGGTGTTCCAAGTTGTTGCTGAGCCTCAGGAAGGAGAGAACGATAAGAAAGAGGGACAAGTCCTCACGCAGCAGCAAAAAGAAGAGGCAGCTAAGGAATTGTTGAGTCAACAGCAACAGCAGGACAAGCAGAGTATGCTCAAGTATGCTCTTGAACAGCAGGGACTTACGGAGGAGTGTTCTACGACTACTGCTCCGAGTGAAACCCCACAGACTCCGACTGGCCAACAGCAGCAAAAGAATGCCGGTAATGTCACTCCAGTCACTACCTCGTCGACTTCACCAGCATCTTCATCCTCCGCTTCGGCAGCTGCTTCTCCTCCCATACAAGTATGTCCTTATTGTAACTATAACAGCACATCTGAAATGCGTATACAAGCCCACATTATCACACAACATTCACAACATATTACTACCTCCCAGCAGCAAGGCAGCCAAACGCAGCAGGCGGTGCAACAGCAGCAGAGCCAGGATAACTCTGGACAGCAGCAGCAAGGCCAGCAACAACAGCAGCCCGAATTCCTCTGTCCATTGTGCCAGGATggattcaaagaaaaatccTTGCTGGAACAACACGTCATGAAAATTCATAGTGTGAACGCTGAAGGCTTGCAGAGACTTTTACTGCTTGTGGATCAATCTCACTGGCTAAATGCTGTCTCACGTCCGCAGGGTCAGCAAAAAGGCTCGACGACGAATAGCAGCCCTCAGTCAAGCAATGAGCAGGAAAACGCTAACAAGAGTATTTCCTCCGATGGTTCAAAAGAGCCAAGCAGCGAGGGAGAAGTATCTTGCAAAGAAGAACCGCTGATGGTATTGTCGCCACCTTCAGATACTCAGAATCAGTCCATGGAAGAGGGCGAGGAAACACTTCGTTGCCAGCCTTGCAATAGGGCATTCCGTAACATCGATGACTTTATGAATCATCAGTTGGAAACTGGCCATATGGACTCTGCTAAGGCACCAGGGTTGGGAAGCGGAGGATACCTGTGCTGGAAGAAAGGATGTAATCAATATTTCCCAAATGCAATGAGTCTTCAGACTCATTTCAGAGAGATTCATGGACGGCTTACGAATGCTCAACAAACGACACCAACTCCCACTCAGCATACTATTGGAACGAGCACATCGAATACGCCTCCAGCGGCAGTATCTGAGAAGCATGTTTATAAGTATCGCTGCAGTCAGTGCTCACTAGCATTTAAAACATTAGAAAAGCTCCAACTTCACTCTCAGTACCACATAATCCGTGATGCAACGAAATGCGCTCTTTGTGGACGCAGCTTCAGATCCATCTTAGCCCTCCACAAGCATGTAGAGAGTGCTCATACTGACTTACCTGAAGAAGAATTGGCTCAGTATAAACAGAGCTTGCTTTCAAATCCATTACTTTTGGCTGGACTGGGCGGTGGTCCCGTACCAGGATTGGGAATGTTTTTGAGCGGAATGAAGTCAGAAACCGCATCAGCTCCACCCATGGAAGTTGATGAGGAAACAAGTGCTGCTACTGCCGCAGCGGATGAGGAAATGATGATGGTGATGGAAGCTGAAAGGAACAAAGAGGATGGAGGAATCGGTTCTGGTGGCGGAGAAGAGAACAGTGACGAGTCAGGAGGAAAAGAACAGCAATTATTAGAGGACTACTTAAATTCACAGGCCGTGGCCGAAGATGGATATAATGACCCAAACAGGAAATACAAATGTCATCGGTGCAAGGTGGCTTACACGAAACAGTCTTACCTGACAGGACACAACAAGACTTTATTACACCGTAAGggagaaaaattatcatatccAATGGAGAAGTATTTGGACCCCAATAGACCATTCAAGTGTGATGTATGCAAGGAATCATTCACTCAGAAGAATATTCTATTGGTACATTATAACTCCGTTAGTCATTTGCATAAGCTTAAAAGAGCTATGCAAGAACAGCAACAGCCAAACAATAATGGAGTTATGAATGTATCTTCACCTACCACTCCAACCAATCAGAAGCTTCTGAATGTTAATGCATCCACTCCAACTGCCATGACCACTTCCTCATGCACAACCGAGGAAGATGACAAGAAGCCATACAAATGTAACATATGCAGGGTAGCCTATAGTCAGGGATCCACCCTCGACATACACATGAGATCAGTTCTGCATCAAACAAGGGCTTCAAAATTACAGGACCTTGCCCTATCGGGACAAATAGATTTAAGTAGACCTCTCATAGAGCAGCCCATGCAGAATGCATCTCAGGCAGCCGAGAAGGCATCAAATCTTCTGCACGACATTCTGGGTACAGCTCAAGCAGCGGTTGCTGCATCAAACAACGCGACTGGTATGTCGACTCCTAAGAGCATCCAAGGACAGCATATGGGTCAGGCCATGCATGGGCAACAGCAGTTTCATTCACCTCACCCTCAGCATTTTGGACAGCAGCTTCATTCCATGCATCAAGGTTCGATGCTGCAAACGCAGCAATCACCAAGCTCGCTATCCTCTGCGTCTTCCACCGCGTCCTCATCCACCGCTCCTTCTCCTACCCAAGTACCAAGTGGTAGTGCAGCTCAATTGGCATCTTCAGTTCCTTCATCGCAGGCAACTACAGGTCCATCCTCTCTATCGAATGCCTCAGTCTCCGCCAATTCTCCTTCTACAGTTCCTAATACCATCTCAGCTGGCACAGGCGGAGGAGCTGGCAGTGAGACTACTCAACAGAGTGGAAATGGAAGCGGTGGATCATCCAATGCCCAAACATCTGTGATATTGAACTCAGCCAATTCCCAAGGGCAACAGCAGCAGGCCGGAGGGATGCACGCTTGCCCACGTTGCAATGCGCTCTTTTCGAGTCAAGAACAGATGATTCAGCACACGCAGCTTTACTGCATGTTTACTCAACAGCCTGCTTTAATACAATCCAATTCTGGAAATACCCCTACGACTGCCTCTACAGGAAATCAGACCGTAGGATTTCATCATCATGGTCATTTTCATCATGGccaccaccaccaccatcaTCATGTTAACAATCAACAGCAACAAGTGGTTTTTGGGGTTCACCATAAGGTTCCCTCTCCATCTCCTTCACCATCGATGTCTGTTGGCACTAATACTGTCGGTGTGTCTACTTCATCTGGAAGTGGAACACCATCAACGCCAACAGGAACTAGTCTTCACCATCATCATCACCACCATCAAATGTTGGATGATGCTTTCGCACGTTTCTCCATGCCAACAAGAAAGTCATCTCAGATGTATAAGCATCTGTTGGAGAGTTATGGATTTGAACTTGTCATGCAGTTTAATGAGAATCATCACCAGAGAAGACAAAAGAGGGAGGAAACGGCGGACGTATCTTCCAGCACATCAAATAAAGTGAACCAACAAATCACTTCCACCACCGATGGTGGTGATGTTCCATCTTCTGGAACAGTGGAGACACAGACTGATGATGGCAAATTGGATGACGAGACGAAGAGCGACTTATTACCGGAAGTGTCAAAATCTGTTTGTGGTCATTGCAAGAAGGAGTTCTCCAGCGTTTGGGTGCTAAAAGCACACTGCGAGGAAGTCCATCGCGACTTAGTCCCATTGGATTTCCTTGAAGAATACGCCATGAGATATAAGAACGAATACGAGCGTAAGACAGCAGCTGTGATTGCAGCTGAGCTGAAGCAACAACAGCAGCATCAACAGAAACAACAGCAGATAGCCCAACAGTATCAGCAGCAGCAACTCCAATTGCAGCAACAACAGCAATTGCATCAGCAACATAACCTTCATATGCAATTCAACCAACATCATCAGACTCAGCAACAACAGGGAATGGTTGTGCAACAGGGATCAAATAACCAATCCACTAAGATAGAGGCGCCAATGGCCTCGCCGGACAAAGATGTTCAACCTGTCACTACAACAACTCAGGGCACTATAACTGGAGATGAAGAGCAGGATGATAAGATCGCCATGATGATGATGTTGATGAAGGGGGAGAAAGCGTCTTCAAATACGGTCAGTACATCAACTTCTACGAACCCGGTAAATCCTGGGGCTAGGATGTCATCAACTCCGAATCCTCTGAGAAGAGCCGACAGTACTTCACCCGATGTGGTTCCCATTCCTCCTTCAACTCCTACCACTCCCACCTCATCATCGACTCCGGCACCTGCGGGAGAAACGATCTCCTCAACCACCAATAATCAAGGAGGTAATAATGCCTCTCCTACTACGCCCAATCCCACGTCTACGACTCCAAGACCAGTGTCTAATCCTAGAGATGACCAGCAGTCGCACCATCTTCAACAGCAAGGACATGGTAATACCGGAACCCAAGAAGGATCCACTCAGGGGTCAAACAGTGGCCCTCAGAATGCAGCAAACGCGGCTGCGGCAGCATCTGCGATGCAACTTTCCTTGGCCGCACAGATGAACGAAATGCAGGCTGCAATCAATTTAATGGCCATGCAACAATTTCACAACCCAGCGACTGCCATGTCGATGATGCAAATGGCTGCTATGGGTCTCTCTCCCCTTGGTTTGATGAACCTACAGCCTCCTCTAGTGCCGATGATGATGGCACCACCTCCACCACCTCCTCCGGCAAATCAAAATCAACCTCCAGCGACTCACTCTCCTTCGCCGAACGATCAACTTGCGGCGGCCGCTGCAGCAGCGAATTCCTTGTTTTCTCCCGCTGGATTGCTCGTTGCCAAGCAGCAAGCTCTTTTGCAGCAACAACAGCATATCCCTCATGTACCTCCTCATCATCACATCTCCACAGTTACTGCGCAGATGCaacagcaacaacaacaacagcaGCAGAAGAGAGCGCGCACTCGCATCACAGACGACCAGCTGAAGATTCTTCGCGCTCATTTTGACATCAACAACTCTCCATCGGAGGAACAGATAGCCGAGATGGCATCGAGATCTGGTTTACCACCAAAAGTCATCAAACATTGGTTCAGAAACACCCTATTCAAGGAGAGACAGAGAAATAAGGATTCTCCGTATAATTTCAATAATCCTCCTTCCACGACTCTCAACCTGGAGGAATATGAAAAGACAGGAGAGGCGAAAGTTTCTCCATTGAATCCCGAAGAGCAGAAGGAATTGTTCAACGCCGTTGCTCAGGCGACTCAAGCCACTTCATCAAACAAAAGGAAGCAAGCTCAACCCACTCAGACTCCTCAGCCACCTCCCACGCCTCCTCAAGTGGAGAAGTCCCCTGCTGAGTCGTCGATGATAGGCTATCGGCAAAGTGAGGAGCAATTTGGAGAAGATTCTGGTAATCGACAAGTGAAGATGGAGGTGGATGAGGACGTTAAGCCAATTATAAATCTCGCGGACGAGAACACTCGCGAGGAACCGGATTCTCCAGCTCCGAATAATGCGCCGACATCCGTCGGAACGCAGATTGGCATGCCCGCCTCCACTACACTAAACATGCAAACATCCAGAAATAACTCACCCGGTAGCATAACTCTCACTTCGATCATCGCGTCTCAGTTGGGACCACAAATGGAAGGAGGGCCTGTGGCTCCGGTCACATCGTCATCAACTTCACCACATTCTGCTTCTGGAGGTGGCGCAGCGTCCGCCAACATCCTCGCACCACCCAAGGCCGCTCCATGCGCCACCATCTTCTCCTCTCCTGTTCAGAATCAGATTCCTCAGAGTCCTACTTCCATGATGGGAATGTCTCTTACCAGCACACCCAATACGGCCATGTCAGGAGGATCTTTGATGGGAATGTCCACTCCAGTATCCACCATGAGTCCGGGTAGGTCACCATCTTCCTCGATGGATTGTATGACTTCAGGCATGGGAAGTGGAGGTAGCGGAAGTTCCTCTGGAAAGAGAGCGAACAGAACAAGGTTTACGGATTACCAAATTAAAGTACTTCAGgaattttttgaaaacaatgCATATCCAAAGGATGATGACTTGGAATATTTATCCAAGCTCCTTAGCCTGAGCCCGAGAGTGATTGTCGTTTGGTTCCAAAATGCGAGACAGAAGGCCAGGAAAGTTTATGAGAACCAACCGCCAGTGGAAGCGACCGCTGGAGTTGTGGCAGGAGGCGCCATCGGTGCCGGTCTGCCCGCTCTCGCTAACATGACACCTGGAGAAGAAGGCGCAGGAAGATTCCAGCGAACCCCTGGACTTAACTACCAATGCAAGAAATGCTTACTAGTCTTTCAGAGATATTACGAATTAATTCGCCATCAGAAGACGCATTGTTTCAAGGAGGAGGACGCAAAGAGGTCTGCGCAAGCGCAGGCCGCCGCAGCACAAATTGCGGCTGTTTTGAGTTCCGAAGATTCTAACTCAAGTTCGGTGGCGGAATCACGGGGGGTCGCCGATCTCTCATCGGTGTCCAACACCAATGCccctcctccaccaccaccaccGCCTCCTCCACCGCCACCACCTCCTCCGCCTCCATCCTTCGCATTGGCCCCCTCTCCGCCTGCCGTCTCATCGGACGGCCGCAAGAGCGCTCAGCCCGCGGAGGCCTCACCGCCTCCCTCGCAATCCCATCAAGAGAGTTCGGAAGACAGCTTCCAGTGTGACAAATGCAACCTAGTCTTCCCGCGCTTCGAACTCTGGCGGGAACACCAATTAGTCCACCTGATGAATCCCAACCTATTCCCATCTTACCCTCCAGACTCTCCCTTCGGAATCCTCCAGCAGCATGCTCAGATGCAGCAACATCAACAGCAGCAGCATCAACAGCAGCAACAACAGCAGCAGCAACAGCAACAGCCCCAACAACAGCAACAGCATAATCCGCAACATATGGGAGGAATCTCCGGTGCGGCCGTCGCCGCCGCAGCGGTTGCTGCTGCCGCTGCTGCTGGCTTAACTTATGATGCCGGAATGGGAACTCTGTCGCCTCCGATTGGTCAGGTGAAGAGGAAGTTTGATGAAATGGAGGAGGAGATGAGAACAGGAAGCGAAGAAAATGCGGCACCCAAAGACAAGAGGCTGAGGACAACCATTTTGCCGGAGCAGCTGGACTACTTGTATCAGAAATATCagGTGGAGAGCAATCCAAGCCGTAAGATGCTGGAGTCAATCGCGCGCGAAGTTGGACTGAAGAAGCGAGTCGTGCAGGTGTGGTTCCAAAATACGCGGGCGAGGGAGCGTAAAGGACAGTTTAGAGCTCATGCACAGGTGATCAACAAGCGTTGCCCATTCTGTCCTGCACTCTTCAAAGTGAAATCCGCCCTCGAATCGCATTTGGCGTCGAAACACGCTGATCAAGTGGCCGCAAGAGGAGGGGACGTCAACATCGACGCACTCCCGGACGAGGAACTCAGCGTCGATTCCTCAGCGTCCTTCTCCACCGCAGGAGATGGCAACACCACCGGAGGAACTCCCGGAAGCACGACTCCTGGCGGCAAAGGCATTTTATCCGGTGCATTCCCCGGCGCCACATCCATGATACCACCCAATATGATCTCGCCGCTCTTCTCGCCTTTCGCCGCCGCGGCTGCAGCGGCCGCAGCCTCACAGGACGCGGAGAACTCTCTTAAGAAGTATTATGAAGAATCGATGAAGAGGTACATCAGCGAGCTGCAGTCCCACGCTGCCCAGAACGGCGTCACCGCCGGAGGAGCTGCCGGCGGAGGACAGAACAACCCCGCAGGGAACGCCAGCAACGTCGGAGGGAGCGGAGGAGGATCTTCCGGAGGAGCGGCACAAGCGACCAAAGAATCATCCGCTAGTGTAGCCACGGATTTAAGCATGAAGATCAAGCAGGAGATGGGGCAGTCGCCTCCCGAATCGGCCGGCGATGCCCCCCTTGATTTAAGCAAGCCGGTCGATCTATCTCGTCCGGTTCGAATCAGCACGGAAAGCGGGGCCCCTTTGTCTTGCGACCAGGGCCCCTTGACGGACGTTAGCGAGAGAAGTGGAAGTGTGGGCCTGGCGTCTTTTGCAGGCGCCATGGGCATCGGAATCGAAAGAGCCGTGGGCGTTTTGGGTTTGGGCCTCGGCGCTGAAGACGCCAGGTCGGACAGTTTGTCGGAAACGTGTACTGACAACATGGACGGAGATGAAAGCAACCCCACGTCTCCCGCATCCAGCACCCAGTCGCAGAGACACCAAGTGACTCCGGTGGGAACCCCTGGTGCGGCGGGCACTCCTACTGGAGGGAGCGGGACGGGGTCTGGAAAGAGATTCCGCACCCAGATGTCTAGTTTACAGGTCAAGGTGATGAAGTCCCTGTTCGCGGACTACAAGACTCCAACGATGGCCGAGTGCGAGATGCTTGGAAGGGAGATTGGACTGCCCAAGAGAGTGGTCCAAGTTTGGTTCCAAAACGCCCGTGCCAAGGAGAAGAAGAGCAAGCTGGCCATGCAGAAAGCGGCCGGATTCACCGGGAACACCGTCGAGGCGGAGACCCAGAGACCACCCGAAGACTGCCGCCTCTGCAACTTCAAGTACTCTCACAAGTACTCGGTTCAGGACCACGTATTCACCAGAAGGCATATCGACGCCGTAAAAGCGGCAATCGAGGGCGGCACCGGAGGCGGAGGACCCATCGGGGGCGTCGGCGGAGGAGGAACCGTCCCCGGAGGGATGAGGCAGGAGACGCAGGACTCTGCCCCCTCGACTCCTTTCACAGTGCCACCAACGCCTCCACTTGGGGGAGGCAACAACAACCCCCCTAACGAACCATCTACGACACCACCATCGGGCCAGCAACAAGGGCAACAGGGACAACAGCAGCAGCACACTTCTCAGCAGCAACAACTAGCCCAACTTCAGATGCTGCAGATGGCCGCTGCTGCGGCGTCCTTCCCCAACTCCCTCTCCGCCGCTGCCATGGCGATGGCCGCCAAGGATGCTGGGAAGTCTCCGAATGCTGCGGCGGCCGCGGCTGCTGCACTTCTCGGCGGCGGCAACGGAGGCGGCGGCGGAGGACAAGGTGGAGGCCAAGGAGGGATGCCCGGGGCGGAAGATATGGCGCTATTCCATCAACTATACGGACTAGGATTGGCCGGATTCCCTCAAGGAAACCTGTTCCTACACCCGGCCATGTTCTCAGCTGCCAGTGAGTGCTCCTTTCCATTTTAG
- Protein Sequence
- MPTPTTGETSQPSPPLPTAKLDEGCGVREGEEAVTDERNCLSSLQRGRRSSGGREEGGIRRGQEEGDEEDEEEEEEESVLAEESTEGMNNGPLQGLQNLSASSLAAAALAVNRTANSLVGGTSAVVTCGSGLSLLQGGLDRVKRIRLEGEEEEGLPMSPEEDAEAAVAMVRGEASPKSALSLRGISQIRCKEEVLEDEDTEMREEDEARSLNSQANEMEEEMDMEDGRSKRAAGSCAGGDEESSTSDVERFDGRIVYNPDGSAYIIEESELSDEESGGASSVVLPRILGDGCIVDGRGVSLSQFQVFPQIANAFYVSRSSVALYSALYGSAAGGAAVLQGEKKIVPEVPIMHSYRVYTVRDNKTGDSGKDGSGSSAEESRLEEDARPSDLSSKGARAKRKAGPGRGLEAESAPEDDEREGGDGKEGKGPMDCASVPVKPILMCFICKLSFGYAKSFVSHATADHGVSLLADERALLGHRNASAIIQCVGREKEPLVSFLEPVSPFPEVVGIPPPLLPALMGQQRQEGSPASAAAAMATMATMAMAAAAAAYTNAAAGASTTPVSAHGTPKPTPSPVGRNSTSPCSAASQQVPKDHQNSGTNRQHGEGQTGGDDTAETGLPRPSPPSSCSSAASASSASSSSSLACSPPMQHPRLQASLEGGAEDGQEGGNGAASESAEKEHAEAMRVGESFGDEAVGVVQGRPFGRCPSRQDEMASLRSTNPQPHQVPETSVSLPQNGSNSSNNINVNVNNPGLNPGSIVNSNNGGSNGSRGMDLTRKRPNSVSPVSTANMVGGGSPPTMSPVALMGSHHHLHHSMNGPGGGNFVSPQPPLPPPPPPLGPPPPPPSFLTGTTIGVCPEHITGRPSAVDCAKCEMILQTARLSAGGPGGPGAGLLGPGAGGLGGAGGPGGIFSGMHSRNSCKTLKCPKCNWHYKYQETLEIHMKEKHPESETSCVYCVAGQPHPRLARGETYTCGYKPYRCEVCNYSTTTKGNLSIHMQSDKHLNNMQELQQNGGVVSSQGGGGGAGGVSDVPTSSVSVATSSPSHMAKGGAGASALSPSLSSTQAQQQQAKPKPTFRCDVCNYETNVARNLRIHMTSEKHTHNMMVLQQNMKHMQQLSVLQQQQQQMGGAGGGAGGFDPATAAAAAAALLHFHPGLTLPGEKPPPHTEAALADMAYNQALLIQMMTGGHMSAHGPPPQSLPPPPHPADSRHQHHGHPGHHHGYPSPHHHAHHHAAAAAAAQQHHIHPHFAAAAAAAAAAAAAADLTGGGDAGLNPETMEPPPEPPEQNPAMLFTCCVCNAFATDSLEALSCHLATDRTKLREQEVLMLVAGSYVCKLCAYRTNLKANFQLHCKTDKHLQRLQHVNHVKEGGPRNEWKLKYHLSSVTTSNPVQVRCNACDYYTNSAHKLQLHCAGARHEASCALFRHLLSCEEALADNPSSPTALVPVSGGSPNTQLQQNNKIYHCALCNFSARGSRLQLLQHVRTLKHMQMEQLHQLQRRSEGRELQVDISEVFQVVAEPQEGENDKKEGQVLTQQQKEEAAKELLSQQQQQDKQSMLKYALEQQGLTEECSTTTAPSETPQTPTGQQQQKNAGNVTPVTTSSTSPASSSSASAAASPPIQVCPYCNYNSTSEMRIQAHIITQHSQHITTSQQQGSQTQQAVQQQQSQDNSGQQQQGQQQQQPEFLCPLCQDGFKEKSLLEQHVMKIHSVNAEGLQRLLLLVDQSHWLNAVSRPQGQQKGSTTNSSPQSSNEQENANKSISSDGSKEPSSEGEVSCKEEPLMVLSPPSDTQNQSMEEGEETLRCQPCNRAFRNIDDFMNHQLETGHMDSAKAPGLGSGGYLCWKKGCNQYFPNAMSLQTHFREIHGRLTNAQQTTPTPTQHTIGTSTSNTPPAAVSEKHVYKYRCSQCSLAFKTLEKLQLHSQYHIIRDATKCALCGRSFRSILALHKHVESAHTDLPEEELAQYKQSLLSNPLLLAGLGGGPVPGLGMFLSGMKSETASAPPMEVDEETSAATAAADEEMMMVMEAERNKEDGGIGSGGGEENSDESGGKEQQLLEDYLNSQAVAEDGYNDPNRKYKCHRCKVAYTKQSYLTGHNKTLLHRKGEKLSYPMEKYLDPNRPFKCDVCKESFTQKNILLVHYNSVSHLHKLKRAMQEQQQPNNNGVMNVSSPTTPTNQKLLNVNASTPTAMTTSSCTTEEDDKKPYKCNICRVAYSQGSTLDIHMRSVLHQTRASKLQDLALSGQIDLSRPLIEQPMQNASQAAEKASNLLHDILGTAQAAVAASNNATGMSTPKSIQGQHMGQAMHGQQQFHSPHPQHFGQQLHSMHQGSMLQTQQSPSSLSSASSTASSSTAPSPTQVPSGSAAQLASSVPSSQATTGPSSLSNASVSANSPSTVPNTISAGTGGGAGSETTQQSGNGSGGSSNAQTSVILNSANSQGQQQQAGGMHACPRCNALFSSQEQMIQHTQLYCMFTQQPALIQSNSGNTPTTASTGNQTVGFHHHGHFHHGHHHHHHHVNNQQQQVVFGVHHKVPSPSPSPSMSVGTNTVGVSTSSGSGTPSTPTGTSLHHHHHHHQMLDDAFARFSMPTRKSSQMYKHLLESYGFELVMQFNENHHQRRQKREETADVSSSTSNKVNQQITSTTDGGDVPSSGTVETQTDDGKLDDETKSDLLPEVSKSVCGHCKKEFSSVWVLKAHCEEVHRDLVPLDFLEEYAMRYKNEYERKTAAVIAAELKQQQQHQQKQQQIAQQYQQQQLQLQQQQQLHQQHNLHMQFNQHHQTQQQQGMVVQQGSNNQSTKIEAPMASPDKDVQPVTTTTQGTITGDEEQDDKIAMMMMLMKGEKASSNTVSTSTSTNPVNPGARMSSTPNPLRRADSTSPDVVPIPPSTPTTPTSSSTPAPAGETISSTTNNQGGNNASPTTPNPTSTTPRPVSNPRDDQQSHHLQQQGHGNTGTQEGSTQGSNSGPQNAANAAAAASAMQLSLAAQMNEMQAAINLMAMQQFHNPATAMSMMQMAAMGLSPLGLMNLQPPLVPMMMAPPPPPPPANQNQPPATHSPSPNDQLAAAAAAANSLFSPAGLLVAKQQALLQQQQHIPHVPPHHHISTVTAQMQQQQQQQQQKRARTRITDDQLKILRAHFDINNSPSEEQIAEMASRSGLPPKVIKHWFRNTLFKERQRNKDSPYNFNNPPSTTLNLEEYEKTGEAKVSPLNPEEQKELFNAVAQATQATSSNKRKQAQPTQTPQPPPTPPQVEKSPAESSMIGYRQSEEQFGEDSGNRQVKMEVDEDVKPIINLADENTREEPDSPAPNNAPTSVGTQIGMPASTTLNMQTSRNNSPGSITLTSIIASQLGPQMEGGPVAPVTSSSTSPHSASGGGAASANILAPPKAAPCATIFSSPVQNQIPQSPTSMMGMSLTSTPNTAMSGGSLMGMSTPVSTMSPGRSPSSSMDCMTSGMGSGGSGSSSGKRANRTRFTDYQIKVLQEFFENNAYPKDDDLEYLSKLLSLSPRVIVVWFQNARQKARKVYENQPPVEATAGVVAGGAIGAGLPALANMTPGEEGAGRFQRTPGLNYQCKKCLLVFQRYYELIRHQKTHCFKEEDAKRSAQAQAAAAQIAAVLSSEDSNSSSVAESRGVADLSSVSNTNAPPPPPPPPPPPPPPPPPPSFALAPSPPAVSSDGRKSAQPAEASPPPSQSHQESSEDSFQCDKCNLVFPRFELWREHQLVHLMNPNLFPSYPPDSPFGILQQHAQMQQHQQQQHQQQQQQQQQQQQPQQQQQHNPQHMGGISGAAVAAAAVAAAAAAGLTYDAGMGTLSPPIGQVKRKFDEMEEEMRTGSEENAAPKDKRLRTTILPEQLDYLYQKYQVESNPSRKMLESIAREVGLKKRVVQVWFQNTRARERKGQFRAHAQVINKRCPFCPALFKVKSALESHLASKHADQVAARGGDVNIDALPDEELSVDSSASFSTAGDGNTTGGTPGSTTPGGKGILSGAFPGATSMIPPNMISPLFSPFAAAAAAAAASQDAENSLKKYYEESMKRYISELQSHAAQNGVTAGGAAGGGQNNPAGNASNVGGSGGGSSGGAAQATKESSASVATDLSMKIKQEMGQSPPESAGDAPLDLSKPVDLSRPVRISTESGAPLSCDQGPLTDVSERSGSVGLASFAGAMGIGIERAVGVLGLGLGAEDARSDSLSETCTDNMDGDESNPTSPASSTQSQRHQVTPVGTPGAAGTPTGGSGTGSGKRFRTQMSSLQVKVMKSLFADYKTPTMAECEMLGREIGLPKRVVQVWFQNARAKEKKSKLAMQKAAGFTGNTVEAETQRPPEDCRLCNFKYSHKYSVQDHVFTRRHIDAVKAAIEGGTGGGGPIGGVGGGGTVPGGMRQETQDSAPSTPFTVPPTPPLGGGNNNPPNEPSTTPPSGQQQGQQGQQQQHTSQQQQLAQLQMLQMAAAAASFPNSLSAAAMAMAAKDAGKSPNAAAAAAAALLGGGNGGGGGGQGGGQGGMPGAEDMALFHQLYGLGLAGFPQGNLFLHPAMFSAASECSFPF
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01131385; iTF_01385177; iTF_01384469;
- 90% Identity
- iTF_01131385;
- 80% Identity
- iTF_01131385;