Curb032119.2
Basic Information
- Insect: Cheilosia urbana
- Gene Symbol: zfh2
- Assembly: GCA_946477595.1
- Location: CAMLCJ010000374.1:180808-205010[-]
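The Location value packs the scaffold accession, the coordinate range and the strand into one string. A minimal sketch of splitting it apart is given below. The names `Location` and `parse_location` are hypothetical, and the span length assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'seqid:start-end[strand]' string."""
    seqid: str
    start: int
    end: int
    strand: str


# Layout inferred from the single example in this record (an assumption).
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into scaffold, start, end and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("CAMLCJ010000374.1:180808-205010[-]")
# Genomic span covered by the gene model, assuming 1-based inclusive coordinates.
span = loc.end - loc.start + 1
print(loc, span)
```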
Transcription Factor Domain
- TF Family: zf-C2H2
- Domain: zf-C2H2 domain
- PFAM: PF00096
- TF Group: Zinc-Coordinating Group
- Description: The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines coordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
  #   of  c-Evalue  i-Evalue  score  bias  hmm coord from  hmm coord to  ali coord from  ali coord to  env coord from  env coord to  acc
  1   17  0.61      1e+02     4.9    0.6   2               23            78              100           77              100           0.94
  2   17  0.00024   0.04      15.6   1.4   2               23            642             664           641             664           0.96
  3   17  5.8e-05   0.0098    17.5   0.7   1               23            696             720           696             720           0.92
  4   17  0.019     3.1       9.7    0.4   1               22            762             783           762             786           0.91
  5   17  2.3       3.9e+02   3.1    1.9   1               23            1040            1064          1040            1064          0.91
  6   17  4.4       7.5e+02   2.2    1.8   1               23            1307            1330          1307            1330          0.92
  7   17  0.082     14        7.6    0.1   2               23            1449            1471          1448            1471          0.93
  8   17  0.087     15        7.6    0.6   1               20            1549            1568          1549            1573          0.91
  9   17  0.0028    0.47      12.2   2.3   1               23            1619            1641          1619            1641          0.96
  10  17  0.00071   0.12      14.1   1.5   2               23            1648            1670          1647            1670          0.94
  11  17  0.035     5.9       8.8    1.5   2               23            1814            1837          1813            1837          0.90
  12  17  0.037     6.1       8.7    0.6   1               22            1858            1879          1858            1882          0.89
  13  17  0.0091    1.5       10.6   0.1   1               23            1966            1990          1966            1990          0.93
  14  17  0.42      70        5.4    2.9   1               19            2057            2075          2057            2080          0.92
  15  17  0.00022   0.036     15.7   1.8   1               23            2740            2762          2740            2762          0.97
  16  17  0.00053   0.089     14.5   1.4   1               23            2877            2899          2877            2899          0.98
  17  17  0.012     2         10.3   0.6   2               23            3381            3403          3380            3403          0.96
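The hmmscan output lists 17 candidate zf-C2H2 domains with widely varying i-Evalues, so a reader will usually want to separate the stronger hits from the marginal ones. The sketch below shows one way to do that on a hand-copied subset of the rows. The `DomainHit` type, the `confident_hits` helper and the 0.1 cutoff are illustrative assumptions, not thresholds used by this database.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record holding a few columns of one hmmscan domain row."""
    index: int       # value of the '#' column
    i_evalue: float  # independent E-value
    score: float     # bit score
    ali_from: int    # alignment start on the protein
    ali_to: int      # alignment end on the protein


# Subset of the rows listed above, copied by hand for illustration.
HITS = [
    DomainHit(1, 1e+02, 4.9, 78, 100),
    DomainHit(2, 0.04, 15.6, 642, 664),
    DomainHit(3, 0.0098, 17.5, 696, 720),
    DomainHit(15, 0.036, 15.7, 2740, 2762),
    DomainHit(16, 0.089, 14.5, 2877, 2899),
    DomainHit(17, 2.0, 10.3, 3381, 3403),
]


def confident_hits(hits: List[DomainHit], max_i_evalue: float = 0.1) -> List[DomainHit]:
    """Keep hits whose i-Evalue is at or below the cutoff (cutoff is an assumption)."""
    return [hit for hit in hits if hit.i_evalue <= max_i_evalue]


selected = confident_hits(HITS)
for hit in selected:
    print(hit.index, hit.ali_from, hit.ali_to, hit.i_evalue)
```

Tightening `max_i_evalue` to 0.01 would leave only domain 3 from this subset, which shows how sensitive the domain count is to the chosen cutoff.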
Sequence Information
- Coding Sequence
- ATGTCCTGTTCTGATGTTGAAAGTTTCCAAGGCAAGATTGTTTATAATCCAGATGGCAGTGCTTTTATAATCGATTCTGAAAATGAATCACTTTCGAATATATCGGATAACTGTATGAGCGTTGGGGCAACTACAACGAATAACCCAAAAATCCACTCATTTCGTGTGGTTACCGCTCACGATGCCAGTGTTAATATTTCCGAGCCAAGTAAAATTCAAAAGCCAATATTAATGTGTTTCATCTGTAAACTGAGCTTTGGAAATACGAAATCGTTTAGCTTGCATGCAAACAGCGAGCACACCCTTAATCTTCAAGAATCAGAAAAGTTACTATTGAATCGCGAGTATTCAAGTGCCATTATACAGAGGAATGTTGATGAGAAGCCGCAGATATCGTTTCTGGAACCGTTGGATATTCAAAAGCAAGATCAATTTATGAAACAATCGTCACAAACgccacaacatcaacaacaactgcTTATTGGATCTACATCTTCATCGGGGCCGAATAACAATAACTTAAGCAACAATAACTGTAGCaacaataatattaatattaattgtAACACATCTACAGCGATACCACCGCCGACGTCATTGTCCCCATCATCATCGGTGTCTTCGGCTGCAGTAGCAGCAGCTGCCGCGGCTAATGCGGCGTTTGTGGCTGCAATAGCAGCTAGTTGTAGCAACACCAACAGTGTGAATACATCAGCATCTATATCGATGAAGAAGTCGCATTTGGAAAGTGATATTATAATGGCGAGCATCGGTGCTGCTGCCGGTGGAGGCAATAACAATCCTAGCTATGGTAACACGACGGGCACCATTAGTAGCAATCCTAGTAGTAGCAGTAATAATTGTAGTCAAATTAATCATCAACAAAGATCAGAGAACTTTGATAATCTGAGCACTTTAGATCTAAGTGCGGCGACAGTAGCAGCGCAGGCGGCAGCCGCGGCCGTAGCAGCTGCAATAACAGTTGATGGCCGGCACTCACCGCCTCCGTCGTCGCCTACATCTACCACTTCGTCCTCACCTTTATCATCGTCTTCGTCTTCGTCAACATCAACATCGTCGTCATCTGCGCAGCAACAAGCCACAACAATTATTGTATCAGGAATACCATTGGCTACAGCACCAATGCCCGGCGGGTCAACGCAAAGCTCGCATTCCTCAATAGCTATTGATGCTATTCCAGCAACATCCTCAATAAGTTCCACAGCAACAAAATCAACTAGTAGTAATAGCAGTAGTAATTTTATTAATGCCAGTAGTAATAAGATCAGCTCATCAGCAGTGTCGTCTCCGACACCCACAACCATGGCGGATTTTCTGCAGCAACAATTCCAACAAATGCAGAATCAAATACGGATAACATCCCCCACTTCTGCATCCTCAGTGGTTGTGAGTAACAGTGGGAGCAATATGGAAGCAAATATCTTGTCCTCGGTAACCTCAAGTGGGCCAACGCTAAGTTCATTAACCGCATCCTTGGCAGCGGCGGCAGCAGCCGTTGGTAGCGGCACTCCGGGTGATTTAACTAGCAATAGTAATAGTGTAAAATTGATCAATGATTTCCTGCAACATCAACTGCAACATCAGCAGCATCCTTCGTACGCAACATGTCCCGAGCATCCAGACGTGAAGGGCATTGATTGCAAGACTTGTGAAATGATTGAGATTAACATGAAATCACCGATAACACCGACCCGTTCGCCAAATAGTATTAATTTGTTCCCATCGAATTCGACAATGTCGCCAACTGCAGCTGCAGCTCCAAGCTTTACAATCGGCGCCTGCCCAGAGCACATCAACGGGCGCCCATTAGGTGTGGATTGTTCTAGATGTGAAATGATATTGAATTCAGCCCGGCTAAACAGTGGCGTACAGATGTCAACACGTAACTCGTGCAAAACTCTGAAATGTCCCCAATGTAATTGGCACTACAAATATCAGGAAACATTAGAAATTCATATGAGGGAGAAGCATCCAGAT
GGGGAGAGTGCATGTGGCTATTGTCTGGCCGGTCAACAACACCCGCGATTAGCACGCGGCGAGTCCTACACCTGCGGCTACAAGCCGTATCGCTGTGAGATTTGTAACTACTCGACAACTACAAAAGGAAACCTATCGATTCACATGCAAAGTGACAAGCATTTGAATAACATGCAGGAGTTGAACAGTTCGCAGAGTATGGCTAACACTGCAGCCGAAATTCGTGAATCACCGAAAATCATTATGCCAAATATGCAACAGCAGGCTTCCAAACCGAAGCCGAGCTTTCGCTGTGACGTGTGTTCCTATGAGACGAGTGTAGCGCGTAATCTTCGTATCCACATGACCAGCGAGAAGCACACCCACAATATGGCTGTCTTGCAAAACAACATCAAACACATTCAAGCTTTCAGTTTCCTGCAATCCCAGAACCTCGGCCAACTGAGTGCAGCACAGACCGCTGCCGTTGCAGCCTCCAATTTGCCCAATTTGCCCAATTTGCCTAATTTGCAAAATTTTTTACCGGAGGCAGCTCTGGCTGATATTGCTTACAATCAGGCACTGATGATTCAGCTATTACACCAGAATTCTGCTGCTGGAGCTTTGAGCGCggcggctgctgctgctgcctcAGCAAATCCCTTAACCTTAGTCCCGCCGCAACAGTCTCCAGCTGGAAGTGGAGTAACAACAACGTTACCCCCAAATGCATCAACATCGCACCAGCCATCGCTACAACCCTATCAAACCTCACAGCCATCGAAAGTTTTGCCAACAACAGGAGCTACTCCAGTATCAGTACCGTCAACAACGTTATCGGTTCATCAATCCTCGCCACAGCAGCAGTCTGCAGTTGCTACGGCAGCTCTTCTAGCCGAGGCACCCGCGGCAGGAGTAGCTAACACCGATTCACCAACAtgtttacaacaacaacaacaggacTCATCGCTCGATCCGCCCATTGAGCCTGATCCGAAGCCAACAACAGCATTTAGCTGCCTCATATGCGCCAACTACAACACCAACAGCATCGATGAACTGAACAATCACCTTATGATTGATCGATCgcgcaacaccaacaacaactgCAGTGACATCATGATGATAATCAACAATAATTATATATGTCGCCTGTGCAACTACAAGACGAATCTGAAGGCTAACTTCCAATTACATAGCAAAACGGACAAGCATCTGCAAAAGCTTAACTACATAAATCACATCCGTGAGGGTGGTGTTAAGAATGAGTACAAACTGAAGTACAATCAATCGAATACGATTCAGTTGAAGTGCAACTGCTGCGACTTTTACACGAATTCCATACAGAAGCTAAATCTCCATACACAACATATGCGTCATGATACCATGAAGATGATCTTCAACCATCTCCTGTACTTGGTGAATAGTTTTAACGCGTCTATGGGCAATAATAGTGACGTTGGTGCCGTTAGTGAAAACAGTGAATTTCAGCTAGTTAACAAGAACAAATCGCTCATGTGTCAACTGTGTAATTTCAGTGCGGTTAACATAATGCAAATGGTGCAGCACGTAAAAAGTTTGCGTCACATACAAGTAGAACAATTCATATGCCTGCAGCGTAGAAGTGAGAATCTCGAATCGTTAGGTTTGGATGATGTCTTTAAAATCGCTGATAATACTGATTGCGTTAAATCGGAACGTTCAAGTCCGGAACAATGCTTAGAATCCCACCAATTGGTGCTGAAACAAGACATTGATAAAATTGTAACGGTACCAAAGAGTTGTTCTGCTATGAGCGATGGACGTCACGATAGTTACACAGACACCACTCAACACCATCAGAACGTGGCTTTATCGTCAGTACAAATAACAAGTGGTAGCAGTAGTACTAATAATAAGGAACATTCTGACGTTGATCTCTCTACATTGCCATCAATAATATACAAATGCAACAATTGTGACTATTTCGCTCAAATCAAACGTGAAATGGAACATCACATTTCGAACATGCATCCAAATGT
ATCTGAACATGACTACCTTACGATACCGACTAATCCGGCTGCTTTGCATGCTTTTCATGCAGCTGTTGCAGCAGCAGCTGTAGCAGCAAATGCTGCAGCCTCACAATCTCGGAGTAAATCATCTTCGCCAGTGCAAATGCAAAGTAAACGGCAGCACAATGAACAACAGGGATTCGGAACATCTGCATCCGGAAGTAATATGTTGTCTGAAGTAAAAACTGAACGAATGGATGCCATAGAAGATCCTCAGTCGAATGATGAGTGTGATCCGTTTGAAGATCCAACTGAGTCGTCGCACTCGAGAAACGCGGTCAATATACCTCATCCTTCGGCAATTAACTCTGTTATGTGTCCTCTTTGCCAGGATACATTTAGTGAGAAGAAGTCTCTTGAAATGCATCTTATGGGCGTGCATAGTGTGAATAGTGATGGTTTGGCGAGACTACTGCAACTAGTGGACAATACTCAGTGGCTAAATAGCAGCCGACGTAGTAGTACTAGCACTACTCCAGAACCTCGCAGTTCAAGCACACCTCATTCAGAAGTTGGAAATTTATcttcgcagcagcaacaacaccaacaacaacagcaacaacaatcaTGCTCATCTGTTTCATCTGCTCACATAAATGTTCCTAGCGAAGAATACGCCTGCTCGCAGTGTGGTGCTGCATTCAAGTTGCAGCAGCATTTGTTGATGCACGCTCATGACGCACAACACTATCAAATGTCAAACGAGCAATATCAATGTTTGGCAAAGCATTGTCAACAGCTCTTTGGCAGTTTACTGCAAATGTTGACGCACTACAAAGACAGCCACATGAATATAGTGATTTCTGAACGTCATGTGTACAAGTACAGATGCAAGCAATGTTCGCTGGCATTTAAAACGCAAGAAAAACTAAACACTCATTCCTTATATCATACGATGCGTGATGCTACGAAGTGTATGATTTGCAACCGGAATTTTCGCAGCACTCAATCCTTACAGAAACACATGGAACAAGCGCACAATCAATTGCAAGCATGTGGGAGTCCCATCCCATCTCCTAATAGCAGTGGAGAACTGACACAAACTCCTGCATCTGTAACGCCTACTATCAGTCGGGGAGAGGAAGATACTGCTGTAAACACCATGGCAATTGTAGCAGCAGTTACAACAGCGACTACAGGTACGACAAATCAATTCAGCAAAACTTACGTTAACAattacgatgatgatgatgttggtgATGGTCAGTTCGTTGTTTCGATTAAAGATATATCTAATAACTCAACATTAACAACAGCACccctaccaccaccaccacctccaccaccaccaccacgagCTGTAATGGAAATTTTAAGCAGCGAGAGCACCAAACGAATCCCATTTGATGACTACCTGAATTCCCAACAAATATCAGAAGAAAATTACAGCGATGTGGAACGTAAGCTTAAGTGCCACAAGTGCAAAGTTGCCTACACAAATGAAATCTATCTAGCGAAGCACTATAAATCGATTCAACATCGTCGCAATGAAAAGTTGAGTATATACCCAATGGAAAAGTATCTCGATCCCAATCGTCCTTTCAAGTGCGAAGTGTGCCGCGAAAGTTTCACCCAAAAGAACATACTTTTGGTACACTACAACAGTGTATCGCACCTACATAAGGCGAAAAAGCAGCAGAGCGGTGGTATGGTATCACAAGTAATGCCGATCGCATCCCTCGTGCTCCCAACTGATTTTGTGGATGGTACTGTCGACGGTAGGGGACAAGTAATGGAACCAATTGTTGGTGGTAGTGGTAGCACTTGCTGTGGTGGTGCTGGGTGTGTAGCGACAATCAACAAATCGCTGAGCTCGAAACGaaaaatatctctcgaatctgaCTATGAGAGTCCTAAGAAGCGATTTAAATGTGACATCTGCAAAGTGGCCTATGCCCAGGGAAGTACACTTGATATTCATATGAGAAGTGTCTTGCACCAAACACGCGCTTGCCGTTTTCAGGAAC
AGCGGAAACCGATAACACCGCCCTTATCATCGGAGAGCCCAACATCAACAATGTCAACGCCAACATTGAATGACCAAATATATAAGTCCCTTCTGGAAACGTATGGTTTTGATATTGTAAAGCAATTCAATGAGATTAACAGGCTGTGTCCAGCTACTGACTCTGGCAATTATTACTGTCGTTACTGCAGTAAAGTATTCTCATCAGTGTTTGTGTTGAAGACACACTGCGAGGAAATACACGATGAGAAGATTCCCCTAGAGTTATTAGAAAAATtcgcagaaaaattaaaaatcttttatCTCGACCAACAACCTACATCACCATCATCGTTACTGTTTAATATTGCAAATCAGAATGAATCTactatttctttaaataaatctgAAACTAATTTAGGTGCAACAACATCCACCGCATGCAGCGAAAGTAGTTCTCCAGCGCCAACAACAGACTTGGGAGCAGTTGGATGCGAAACCTCCCCCATATTGCCTTCAACCTCGCAACCAGCAGCGGCAACCAGCCCCAACCCCTTAACAGTGTCAGCCATGGCGTCGGCACTATtaaaacagcaacaacaaggCCTAACTCCAGATATTGTGCAAAAAATCAATCTAGACCCGACTATGTTGGCCCAAAAAATTATggaacaaaattttcaaaactttCCACCGAATTTCACAGGTCTGCCGCACAATATTCAAAATTTACAGAACTTGCAAAGCCTGCAAAACTTACAGAACGTGCAACAAAATCTACCGAGTATGAGTAATTTACCAATGAATACACTAGATATGTTAAATATTATGCAGTTTCATCACTTAATGTCgctaaattttatgaatttggcCCCGCCTTTGATTTTTGGGGCAGCAGGAGCCGGTCAAAGTGCTTCGGTTTCAGGAAATGCACCATCAACAGCCACTGAATTACCGACAACACCGGCCACACAGTTGATACAGCAACAGACGGCAGCGGCTTCACAGAATACCAGCAATCAAAAACGAGCTCGCACTCGTATCACTGATGATCAGCTGAAAATCCTTCGTGCCCATTTCGATATAAATAATTCACCAAGTGAAGAGAGCATCATGGAAATGTCCAAGAAGGCTAATCTCCCAATGAAGGTTGTCAAACATTGGTTCCGCAACACGCTGTTCAAGGAAAGGCAACGCAACAAAGACTCGCCATATAACTTTAACAACCCACCATCAACAACGCTGAATCTAGAAGAATATGAACGTACTGGACAAACCAAGGTTACGCCTTTAGCGGAAAGCGGAGGTGGCGGCATTAGCGGCTTCCATATCcttcaacaacagcaacagcagcagcaacaacaacgcgaacatcaacagcagcaacaacaacttcAGCAGCAACAGCTGCAACAGCAGATTCAAGCGCAACATCTGCAACAGCAACGCCCCCCATCATCCCAATCCAGTGATTTGAATTTTCCCCAGCTGAGCTTccaccagcaacaacaacagctcgATCTCTCTcgtcaacagcagcaacaacaacagcaacaggacAATCGTCCTTCGTCTCACCCATCAAGCGTTACCAGCGATCGTGGAGACATCCATATTAAATCAGAACCAACTGATGATATTGGCAGTTCCGATTGTGACCAGCAAATGATGCTGAACAAAGATCACGAAAGCGAACAATCATTAATGCAATCACACCATCAGCAATCAATGTTCTACAACAACTTCGAGACGAAGTCCGAAAGTGGAAGCTCTGAGATTCTATCACGCCCCCAAACCCCGAATAGTACGTCGACACCATACAGCAGTAACATTTCAGACATCCTTGGTCAGCAAATTGACAATCTTCCGCTGAATAATATGGCCAATATTAGCAATCTGAACAACATGGGTCCTCCAAAAAAGTTTCAAATGAGCAAAATGTTTGAGAAGAATAGCAATTTTGACACCAATTCAAACTCATCGAATAGCTCCACATCAAGCGGCAAACGAGCCAATCGCACT
CGCTTCACCGACTATCAAATCAAAGTGTTGCAGGAGTTTTTCGAAAATAACTCTTATCCTAAAGACAGCGACTTGGAGTATTTAAGCAAATTGCTGCTGCTGTCACCACGAGTTATTGTCGTTTGGTTTCAGAACGCACGTCAGAAACAACGTAAGATCTATGAAAATCAACCGAACAATACCTTTTACGAGTCCGAAGAGAAGAAGCAAAACATCAACTACGCCTGCAAGAAGTGTAACCTAGTTTTTCAGCGGTACTATGAACTCATTCGTCACCAAAAAAATCACTGTTTCAAGGAGGAGAACAATAAAAAATCGGCAAAAGCACAAATAGCTGCCGCGCAGATTGCGCAAAGTCTTAGCAGTGAGGACTCCAACTCAAGCATAGATGTCAACAATGCAAACTTGATGTCTTCGAATATAGTTGGACAACAGACAGCAGCAGCTGCAGTGGCTGCGGTGTCAGCAGTTGGTGGGGCTGCTGTTCCAGCAATTCCCGCAGCGATCCCATGTCATCCTACAAGTCCAGGAATGACTTTGCTTTCATCGCCACAGCATAATTTTAAGCAACAGCAATCAGCGACTGCTGCAACTGGAGGTCACATTGATAGCACTTCGCCGCTACAGAAATTTGAATGCGACAAATGCCAGCTAACCTTTACCCGTTATGAACTTTTCAAGGAACATCAACTAATACACCTTATGAATCCAAATCTCTTTATAAATCAGAGTTACAGCGATTCGTCACCTTTCGGAATTTTGCAAAATCTGCAGGGAAATCACAACAGCCTTCAGCAAGATACTTCAATGGATTTGAGCCGGCAAAAGAAACGTAAATATTCCGATACGCAAATCTCACCCGATGATCAACAGCAACAAAATGACTATGAAGCGTTCAATAAGAAATTCAAAAATGATCAGTACGACTTTCTGTACCAGTATTTCATGCAAAATGAGACCAGTGCCGATCTCAAGAAGCAATTTACGCAGCAACAGCAACCAGAACTGGATCTGGAATACTTATCCAACTTTTACCAACAAAATGAACTAAAAAAGCGCAGCAATTACGATTTTTTATTTCAGTACTATCTGCGAAATGAATCAAAACAACCGAGTAGTGCTGCCAGCCTCATGATGCTGAACAACGATGCTAATAAACCAAATATGGAATTTCTCCTACAATATTATCAACTCAGTGAATCGAAAAAGTTTTTTCAGTTAGATGCCTCGCCCCAACGAATACATGATTTTCCACCGTTGCTGAATCTGAGCAGCGCAGCAGCAGCGTCCGTAAACAATGGTGTTGGaatacaacagcaacagcaacaagttTCGGCAACATCCCCAACAGAAACTGTAAAAAGTTCCACTACCAACGCTACAGCAGCAACGCGAACCACTTCACCAGTGTTCCTGAGCAGCAGCAGTGGTGATGGTATTGTGAACAAAGATACCAGCAGCAACAGCGGCAGCAACCAGATCCCGATATCTTTGTTAGCAACTGAAGTCAAACTGCTGTCAGCGACAGCGGCAGCTGGTGATGGGAGCGGGGTACAAACAGTAACATTACCAACTACAATTGTGAACGCAATAACTTCTGGTAGTACTTTCATTGCCAATTCCATAAGTAATAATCTTCATTGCTCGATGCCTCCGCATCATCCAAAACAACAAGAAAATTCTTCCCTTATCTCGAATCGTTTGGATATTTCACCTATCGCTGCTGCTGCCACCGCTTCTGAAGCTGCTGTACAATCACTGAAACTTTCTATTAACAATTTAACAACAAATTCTGCGTCAAAAAGTTCTCCTcttaacaataacaacaatacTGAAATAGATATGGATTCTGGAATCACCCAGCATCAAGAATCCGTCACAAGTGTTGGCCATCAAATCGAAGAGACCATAACAACGACTGAGAAGCAGAACAACAAGAGATTACGCACGACCATATTGCCAGAGCAGTTAAATTTTTTATACGAATGCTACCAGAATGA
GTCGAACCCAAGTCGCAAAATGCTCGAGGAGATTTCCAAGAAGGTCAACCTTAAGAAACGTGTTGTACAGGTCTGGTATCAGAACTCACGAGCACGTGAACGCAAAGGTCAGTTTCGCCAGaatattcaaataataaataaaaaatgtccgcACTGTGCCGCCATATTTAAAATTAAGTCAGCTCTCGAGTGGCATCTGCAGTCCAAGCATGGTGATAAGCAAGCCATCAATGTTGATCAAATTCCTGACTTAAAGTTCTCTGAGAGTTTATTAAACCTGTCGGGagctgcagcttctgcaaactTCGGAATGAAAAATGACGAAGGTGACCATACACAAAAGCAAgagaaaaaggaacaaattcaaCTGACAACAGTACAACCCGCCGTTTCTGTCACATCAGCTGCTACACCATCTTTGGGTTCATCTTCGGTATCGCCTACTGCCGCATGTATTCTTGATCTAGTTAAAGGGGAAGGAACTCCACTTGATCTTAGCAAGGCGCCTCCATTAACAAATCACAATAGCAAGTATGAACAGAGTGAAAGCGAAATAAGCTTCTCTGACTCGAATAATGATCATGATGAGTCCAatgatttttttaatacttCATCGTCAATAAACAATGGTAGGCTAGTCTTGAACCGACTTGCCAATGAAGGTAGTCAACGTCAATACGACTCAAGTGCATTAGACAATGGTGTCAGTTGTGTCTTTAATGAATACAACACCACAACTAATGGTAATAATTCAAATGATTACCTAGGAAATGAGCGTGACAACACCAACTCACCGGTCAGTCAGACTTCAAGTAACCACAGCACGCAACAAAAGAAACGCTTTCGCACACAGATGAGCAACATGCAGGTACGCATACTAAAGACACTATTCTTTGACGTTAAAACACCTTCGATGACTGACTGCTCAAATATTGGCCGGGAAATTGGATTGCAAAAACGTGTCATTCAGGTTTGGTTCCAGAACGCCAGAGCCAAGGAGAAGAAATCCCGTAATCAACGATTTCAAAATGATGAAAATACATTCGAGAACGACACATACCAAAAGTCAAACATTGATTCAACAACAACCGTCATTCCAGAAATACGCGATTGCAATATTTGCCATTTGCAAAGTGTCAACATTCAGGAGCATGCTTTCTCATCCCAACATATCGCGCAGGTGCGAATGTTGCTCGAATCTAATAGAAGTAAAAATGATGATCAGCAACATGCTGACAACAGCATGGAGCACGAATTTAGTAGTATATATTCTCAGCTGTATAATCAACAGCATAAGAACAACCAGTTGGATGTAGAGCGTTCCGACACAAATCACAACGATGAAAATGACGCTGAAATGGATAATCATAGTTTCAATATTACCGATATTGATGGCAACATGGAACAAATTGAAGACGACGACGATAACACAACTCAAAAGCAAAACGAAGATTTGGCTTTAAATGAAGCTAAAAAAGCTGCTTTAGCATTGAGAAACTTCAACAAGTTGCAGCAACATTTCGCTGCAGCTACTGCCTTAGccaagcagcagcaacaacaaaagcaCCCCCAATGTGATGCACTATCATCGGCTTCGCCACCAacaatggaaaacaatttattgCTGAAAATTAACACAGATAACCAGTTAACATCAAATCATTCTGAAATGTTGCAACAGCTCTTTAACTATAGTCAAATGagtgttatcggCGGAAAGTGA
- Protein Sequence
- MSCSDVESFQGKIVYNPDGSAFIIDSENESLSNISDNCMSVGATTTNNPKIHSFRVVTAHDASVNISEPSKIQKPILMCFICKLSFGNTKSFSLHANSEHTLNLQESEKLLLNREYSSAIIQRNVDEKPQISFLEPLDIQKQDQFMKQSSQTPQHQQQLLIGSTSSSGPNNNNLSNNNCSNNNININCNTSTAIPPPTSLSPSSSVSSAAVAAAAAANAAFVAAIAASCSNTNSVNTSASISMKKSHLESDIIMASIGAAAGGGNNNPSYGNTTGTISSNPSSSSNNCSQINHQQRSENFDNLSTLDLSAATVAAQAAAAAVAAAITVDGRHSPPPSSPTSTTSSSPLSSSSSSSTSTSSSSAQQQATTIIVSGIPLATAPMPGGSTQSSHSSIAIDAIPATSSISSTATKSTSSNSSSNFINASSNKISSSAVSSPTPTTMADFLQQQFQQMQNQIRITSPTSASSVVVSNSGSNMEANILSSVTSSGPTLSSLTASLAAAAAAVGSGTPGDLTSNSNSVKLINDFLQHQLQHQQHPSYATCPEHPDVKGIDCKTCEMIEINMKSPITPTRSPNSINLFPSNSTMSPTAAAAPSFTIGACPEHINGRPLGVDCSRCEMILNSARLNSGVQMSTRNSCKTLKCPQCNWHYKYQETLEIHMREKHPDGESACGYCLAGQQHPRLARGESYTCGYKPYRCEICNYSTTTKGNLSIHMQSDKHLNNMQELNSSQSMANTAAEIRESPKIIMPNMQQQASKPKPSFRCDVCSYETSVARNLRIHMTSEKHTHNMAVLQNNIKHIQAFSFLQSQNLGQLSAAQTAAVAASNLPNLPNLPNLQNFLPEAALADIAYNQALMIQLLHQNSAAGALSAAAAAAASANPLTLVPPQQSPAGSGVTTTLPPNASTSHQPSLQPYQTSQPSKVLPTTGATPVSVPSTTLSVHQSSPQQQSAVATAALLAEAPAAGVANTDSPTCLQQQQQDSSLDPPIEPDPKPTTAFSCLICANYNTNSIDELNNHLMIDRSRNTNNNCSDIMMIINNNYICRLCNYKTNLKANFQLHSKTDKHLQKLNYINHIREGGVKNEYKLKYNQSNTIQLKCNCCDFYTNSIQKLNLHTQHMRHDTMKMIFNHLLYLVNSFNASMGNNSDVGAVSENSEFQLVNKNKSLMCQLCNFSAVNIMQMVQHVKSLRHIQVEQFICLQRRSENLESLGLDDVFKIADNTDCVKSERSSPEQCLESHQLVLKQDIDKIVTVPKSCSAMSDGRHDSYTDTTQHHQNVALSSVQITSGSSSTNNKEHSDVDLSTLPSIIYKCNNCDYFAQIKREMEHHISNMHPNVSEHDYLTIPTNPAALHAFHAAVAAAAVAANAAASQSRSKSSSPVQMQSKRQHNEQQGFGTSASGSNMLSEVKTERMDAIEDPQSNDECDPFEDPTESSHSRNAVNIPHPSAINSVMCPLCQDTFSEKKSLEMHLMGVHSVNSDGLARLLQLVDNTQWLNSSRRSSTSTTPEPRSSSTPHSEVGNLSSQQQQHQQQQQQQSCSSVSSAHINVPSEEYACSQCGAAFKLQQHLLMHAHDAQHYQMSNEQYQCLAKHCQQLFGSLLQMLTHYKDSHMNIVISERHVYKYRCKQCSLAFKTQEKLNTHSLYHTMRDATKCMICNRNFRSTQSLQKHMEQAHNQLQACGSPIPSPNSSGELTQTPASVTPTISRGEEDTAVNTMAIVAAVTTATTGTTNQFSKTYVNNYDDDDVGDGQFVVSIKDISNNSTLTTAPLPPPPPPPPPPRAVMEILSSESTKRIPFDDYLNSQQISEENYSDVERKLKCHKCKVAYTNEIYLAKHYKSIQHRRNEKLSIYPMEKYLDPNRPFKCEVCRESFTQKNILLVHYNSVSHLHKAKKQQSGGMVSQVMPIASLVLPTDFVDGTVDGRGQVMEPIVGGSGSTCCGGAGCVATINKSLSSKRKISLESDYESPKKRFKCDICKVAYAQGSTLDIHMRSVLHQTRACRFQ
EQRKPITPPLSSESPTSTMSTPTLNDQIYKSLLETYGFDIVKQFNEINRLCPATDSGNYYCRYCSKVFSSVFVLKTHCEEIHDEKIPLELLEKFAEKLKIFYLDQQPTSPSSLLFNIANQNESTISLNKSETNLGATTSTACSESSSPAPTTDLGAVGCETSPILPSTSQPAAATSPNPLTVSAMASALLKQQQQGLTPDIVQKINLDPTMLAQKIMEQNFQNFPPNFTGLPHNIQNLQNLQSLQNLQNVQQNLPSMSNLPMNTLDMLNIMQFHHLMSLNFMNLAPPLIFGAAGAGQSASVSGNAPSTATELPTTPATQLIQQQTAAASQNTSNQKRARTRITDDQLKILRAHFDINNSPSEESIMEMSKKANLPMKVVKHWFRNTLFKERQRNKDSPYNFNNPPSTTLNLEEYERTGQTKVTPLAESGGGGISGFHILQQQQQQQQQQREHQQQQQQLQQQQLQQQIQAQHLQQQRPPSSQSSDLNFPQLSFHQQQQQLDLSRQQQQQQQQQDNRPSSHPSSVTSDRGDIHIKSEPTDDIGSSDCDQQMMLNKDHESEQSLMQSHHQQSMFYNNFETKSESGSSEILSRPQTPNSTSTPYSSNISDILGQQIDNLPLNNMANISNLNNMGPPKKFQMSKMFEKNSNFDTNSNSSNSSTSSGKRANRTRFTDYQIKVLQEFFENNSYPKDSDLEYLSKLLLLSPRVIVVWFQNARQKQRKIYENQPNNTFYESEEKKQNINYACKKCNLVFQRYYELIRHQKNHCFKEENNKKSAKAQIAAAQIAQSLSSEDSNSSIDVNNANLMSSNIVGQQTAAAAVAAVSAVGGAAVPAIPAAIPCHPTSPGMTLLSSPQHNFKQQQSATAATGGHIDSTSPLQKFECDKCQLTFTRYELFKEHQLIHLMNPNLFINQSYSDSSPFGILQNLQGNHNSLQQDTSMDLSRQKKRKYSDTQISPDDQQQQNDYEAFNKKFKNDQYDFLYQYFMQNETSADLKKQFTQQQQPELDLEYLSNFYQQNELKKRSNYDFLFQYYLRNESKQPSSAASLMMLNNDANKPNMEFLLQYYQLSESKKFFQLDASPQRIHDFPPLLNLSSAAAASVNNGVGIQQQQQQVSATSPTETVKSSTTNATAATRTTSPVFLSSSSGDGIVNKDTSSNSGSNQIPISLLATEVKLLSATAAAGDGSGVQTVTLPTTIVNAITSGSTFIANSISNNLHCSMPPHHPKQQENSSLISNRLDISPIAAAATASEAAVQSLKLSINNLTTNSASKSSPLNNNNNTEIDMDSGITQHQESVTSVGHQIEETITTTEKQNNKRLRTTILPEQLNFLYECYQNESNPSRKMLEEISKKVNLKKRVVQVWYQNSRARERKGQFRQNIQIINKKCPHCAAIFKIKSALEWHLQSKHGDKQAINVDQIPDLKFSESLLNLSGAAASANFGMKNDEGDHTQKQEKKEQIQLTTVQPAVSVTSAATPSLGSSSVSPTAACILDLVKGEGTPLDLSKAPPLTNHNSKYEQSESEISFSDSNNDHDESNDFFNTSSSINNGRLVLNRLANEGSQRQYDSSALDNGVSCVFNEYNTTTNGNNSNDYLGNERDNTNSPVSQTSSNHSTQQKKRFRTQMSNMQVRILKTLFFDVKTPSMTDCSNIGREIGLQKRVIQVWFQNARAKEKKSRNQRFQNDENTFENDTYQKSNIDSTTTVIPEIRDCNICHLQSVNIQEHAFSSQHIAQVRMLLESNRSKNDDQQHADNSMEHEFSSIYSQLYNQQHKNNQLDVERSDTNHNDENDAEMDNHSFNITDIDGNMEQIEDDDDNTTQKQNEDLALNEAKKAALALRNFNKLQQHFAAATALAKQQQQQKHPQCDALSSASPPTMENNLLLKINTDNQLTSNHSEMLQQLFNYSQMSVIGGK
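The C2H2 pattern quoted in the Description, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], can be written as a regular expression and run over the protein sequence. The sketch below does this for a short fragment copied from the protein above. It treats the # positions as any residue, because the Description does not say which residues are allowed there, so it is looser than the Pfam HMM and is not a substitute for hmmscan. The name `find_c2h2` is hypothetical.

```python
import re
from typing import List, Tuple

# Regex rendering of the pattern from the Description.
# Positions marked '#' in the pattern are matched as any residue (an assumption).
C2H2_PATTERN = re.compile(
    r"""
    .        # fold-stabilising position (any residue here)
    .        # X
    C        # first zinc-coordinating cysteine
    .{1,5}   # X(1-5)
    C        # second zinc-coordinating cysteine
    .{3}     # X3
    .        # fold-stabilising position
    .{5}     # X5
    .        # fold-stabilising position
    .{2}     # X2
    H        # first zinc-coordinating histidine
    .{3,6}   # X(3-6)
    [HC]     # final His or Cys
    """,
    re.VERBOSE,
)


def find_c2h2(protein: str) -> List[Tuple[int, str]]:
    """Return (1-based start, matched text) for non-overlapping pattern matches."""
    return [(m.start() + 1, m.group()) for m in C2H2_PATTERN.finditer(protein)]


# Short fragment of the protein sequence listed above.
fragment = "GYKPYRCEICNYSTTTKGNLSIHMQSDKHLNNMQ"
matches = find_c2h2(fragment)
print(matches)
```

Because `finditer` reports non-overlapping matches only, closely spaced or overlapping fingers could be missed; a lookahead-based scan would be needed to list every candidate.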
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity: iTF_00313269
- 90% Identity: iTF_00315474
- 80% Identity: iTF_00314088