Enis022962.1
Basic Information
- Insect
- Epinotia nisella
- Gene Symbol
- -
- Assembly
- GCA_932294385.1
- Location
- CAKOAM010000173.1:47876-74046[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 21 5.3 3.6e+02 2.4 0.3 2 23 6 27 5 27 0.96 2 21 1.4 95 4.2 0.0 2 23 923 945 922 945 0.91 3 21 0.00011 0.0076 17.1 2.0 2 23 3596 3617 3595 3617 0.97 4 21 0.047 3.2 8.9 3.8 1 23 3623 3645 3623 3645 0.99 5 21 0.036 2.4 9.2 0.6 1 19 3651 3669 3651 3670 0.97 6 21 0.036 2.4 9.2 0.6 1 19 3692 3710 3692 3711 0.97 7 21 0.036 2.4 9.2 0.6 1 19 3733 3751 3733 3752 0.97 8 21 0.036 2.4 9.2 0.6 1 19 3774 3792 3774 3793 0.97 9 21 0.036 2.4 9.2 0.6 1 19 3815 3833 3815 3834 0.97 10 21 0.036 2.4 9.2 0.6 1 19 3856 3874 3856 3875 0.97 11 21 0.036 2.4 9.2 0.6 1 19 3897 3915 3897 3916 0.97 12 21 0.036 2.4 9.2 0.6 1 19 3938 3956 3938 3957 0.97 13 21 0.036 2.4 9.2 0.6 1 19 3979 3997 3979 3998 0.97 14 21 0.0081 0.55 11.3 3.7 1 23 4020 4042 4020 4043 0.96 15 21 1.2 80 4.5 0.1 6 23 4057 4074 4056 4074 0.98 16 21 0.08 5.5 8.1 0.8 2 23 4080 4102 4079 4102 0.93 17 21 0.0009 0.062 14.3 0.5 1 20 4166 4185 4166 4188 0.92 18 21 0.0094 0.64 11.1 0.4 1 23 4196 4218 4196 4218 0.98 19 21 0.019 1.3 10.1 7.2 1 23 4224 4246 4224 4246 0.98 20 21 2.4e-05 0.0017 19.2 1.4 1 23 4252 4276 4252 4276 0.98 21 21 0.38 26 6.0 2.1 1 23 4282 4304 4282 4305 0.95
Sequence Information
- Coding Sequence
- ATGGAGCCGCTCGTGCAGTGCCCCGTGTGCACGCTCTTCCTCCACAATGGCATGTCCTTGGAATCCCACCTCGACACCCACCCCAAAGACCAGGTCATCAAAGCGCTGTGCACCCTATCTGCAAACAATGTTGGCTTTGGAAGCCGGACTTCCACACCCCTGCACTCAGAGCGCTCCTACAGGAGCCGGTCAAGGACTCCAGCGGAGGATAGCCCCCGGTGGAGCGGGCGAAGCATTGACAATGAGAAATACTGGAGAAGAACCCCCAGCAGGTGCTCCAAGACTCCTGTGTCCAGTATCACTAGGAATGCAACACCTTCGGATGTGCGAATGGGTGACATCAACTTTGAGAATAATTCTAATAATGCTACATCTAGCCCCTACCCAGGTATGTCATTTATGAAGTCCGACAAGTCCAGTCAGCCCTTCATTCAGAATGTGCCCATTCCAGAGTTTGATCCGCAGTATGTGTACTATCCGGACCAGCAGGAGGACAAGGAGATCAAGTATTCACGCAGCGCTGAGTACAGTGCCATAGACAACACTAACAATGTGTTCACTTACAACATGCCATCAGCCCCGGCGGTCAAGCTGCAAAGTGCCCTTGTACCCACCGTCTCAGGGTCCCGCAAACACACTGGTCTTGTAAAAATACTTCCCAAACCCAATAATATACTTGTAAAGGCTAGTGTGGGAGGAGTGCAATATATTACTCCTGGAGTGAAGCCCATGCACCTGATGGTACCCACAGCACCCACATTTGTACAGAAAAATGTTCAAAACAACATGATTGTGTCTGGCAGCATTCCCACCAGCCAGATCATGGAGCCCAAGTCTGTAGCGTCACCACTAGCCAGCAGCAGTAGTCAATTCAGCCAGTTATCAACTGGCACATTTACCCCAGGGACCACAGTGGTTACACAAAACTCTCAGATTATTTACAGGGAAATGGTGCACAATATTGATGGGAAACCTTTTCTCACTAGCATGCCCGCTGTGCTCAGTGGACACGACAATCTGACGAATGTAGCCACCACTAGCTCCATGTACCAGAATGTCATGGTGGTGGACCAGTTTGGCAACACATCTTGTATGTACACCACTCCTCAGATCTTGGCTAAGCCATGTAGCACGCCAGTATTTAGTGAAGTGGCTGTAAACAACCTGGAGAAAACTCTACCCAATCCTAGTAATGTAGGTAATGAGTCAAATAAAACATTGATTATTGAAGTGGGCCCTATAGTGGGTGCACCAGTGGAAGTCAGTACCTCCACCAGCAACTCACAAGCGTCTCAGACAGTTTCCGAGGTTAAAACCACTCCAGGTCCCGAGAAACAGGTAGAGAGGAATGATACACCCGGTTCTACAAGAGGATTAAAAATTCTTAGCAATATTAAGGTGGAAGTTCCAGTCCAGCATCACAAAAACTTGCTCAACACTGTGGTAGACCTCACCGGGCCGCAGCCTGAGTATTCAGAGAGGGCCGTCTCGCCAGAGAGAATCCTACCCGACTTCGAAGACAACAACCATAGCCTGTCAGGAGACTGCATGCCCTCTGCCACTAGTAGTGATAGTATAGTTTCAAATACGTTCTCTGTGATAAAAAACGTCGGGAATCCCCCCTCGTACAAAGACTCAGTTTCCAAGTCTTGCGTTGTAGATAATAAAATTGAAAACGAATTTTCCGATAGTTGTCCCATGCCCGACCTCATTTGCAACGAGAAACCCTCAATATCGCCGTGCAGCGAGCTCTCGGAGAGCGGGGAGAACTCCACCGACAGAATTTCCATCTCGCCTAAGCCCGAGGCCGCGCTGGCGCCCGGGGAGTCACCCGCCGAAACAAAACCACCACTGAAGCCTTTCAGGAGCGCACCACTAAGACTGAACAATATCTTTGTCAAGAAGCATAAAAAGATTTTAGAGATTAAGAATTCCAAAAGCTTTTGCGTGGCACCCTGCAGCAGTAAAGACCTGCAAGGGCCCTCGTCGCCGCGGCCGCCCGAGAACTTCACCATGAACAAGATTCACAGCATCGACGTCGAGGAGAGGACGGAGACGCTGCAGACGATCACGGTGGAGGTGTGCAACGACAACGAGTTCGACGCGGACACCGAGGAGCAGTCCATGGACCTGGAGCCCGTGGCGCACTCCAGCATCCGGGACTCCAGCCAGCACTCCGTGGTGCACGTCAAGGAGGAGATCAACTCCAGCAACGAGGGCGGCTTCAGCAACGAGCGCGCGCTCGCGCCGCTCGAGGCCATGCGCCCCATCAACGTCATCGCGTACGGCGCCATGGCCGACTTCGACGACGAGTCCAACCACAAGGAGCTGCTCGAGCTCGAGGCGGCCTCCAAGAACAAGCAGTTCGTGAGCATGATGAACGACAACTACTTCGGGGACAACATCTACGCGGACTACTTCACGCCGGAGCGCGAGCCCTTCGAGCGGGACCCCAAGGACGGCTACCTGTGGGGCGAGGGCCCGCGGGACGGCGAGTTCGTGCTGCCCACCTTCGTGCACGAGAGCTACAAGGTGGCGGACAGCGGCGCGCCCGACTTCGTGGAGCCCCGCGACAAGGACGAGCGCCGCGACCGCGCCGCGCCCCTGCCCGCCGACATGGACTCGCCGTGGACCGGGATGTACCGCGAGGTGGCGGCGGGGGGCTACGAGCTGCTGGCGCGCGAGAGCTGGCCCTCGGACGCCGAGGACGAGCCCGCGGAGCCGCCCGACTGTCTACAAGAGGCGAGTCGCGCGCCGCGGACTGTCACGTGCGGCGCGTGCCGCGCGGTGTTCGGGTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGGTACGTGTCACTTGTTCCCGCGCCGAGCTGCGCGCGCACCGCGCGCTCGTGCACGCGCCCGCCGGCAGCTCGCGCGCCGCCGCCCGCCGCGGACCTGCCGCCGCCATCAAGCAGGAGGACAAGCCCGAGCCTGCGCAGGGAGAATCGGACGCGCTACTATTGCCGGACTCGAAGGAGGCGCTGGCGTCCAGCATCCTGTACCCCGGGCTCGAGACCAAGCCCGTCATCGTGAAGCAGGAGCCGCGCCGGCGCCGCCGCGACCACACCTGCCCCACCTGCAAGGAGGACCAGGGCTCGGAGGCGGCCTTCACGGCGCACCTGAAGCTGCACCCGCTGGAGTGCCTCACGTGCGGCAAGTGCTTCTTCCGGCGCGCCAACCTCGCGCTGCACCTCAAGACGCACCAGGGCATCAAGAACTTCAAgtgcgaggtgtgcgagaagcggttcctgacgcggcagaagctgatggagcaccacaacacgcacacgggccgcgcgccctacaagtgcaccgtctgcgacgacaccttccgcagatactccaacatggtgcagcacaGCCCCCTGACCCTGACGCGGCAGAAGCTGATGGAGCACCACAAcacgcacacgggccgcgcgccctacaagtgcaccgtctgcgacgacaccttccgcagatactccaacatggtgcagcacaGCCCCCTGACCCTGACGCGGCAGAAGCTGATGGAGCACCACAAcacgcacacgggccgcgcgccctacaagtgcaccgtctgcgacgacaccttccgcagatactccaacatggtgcagcacaGCCCCCTGACCCTGACGCGGCAGAAGCTGATGGAGCACCACAAcacgcacacgggccgcgcgccctacaagtgcaccgtctgcgacgacaccttccgcagatactccaacatggtgcagcacaGCCCCCTGACCCTGACGCGGCAGAAGCTGATGGAGCACCACAAcacgcacacgggccgcgcgccctacaagtgcaccgtctgcgacgacaccttccgcagatactccaacatggtgcagcacaGCCCCCTGACCCTGACGCGGCAGAAGCTGATGGAGCACCACAAcacgcacacgggccgcgcgccctacaagtgcaccgtctgcgacgacaccttccgcagatactccaacatggtgcagcacaGCCCCCTGACCCTGACGCGGCAGAAGCTGATGGAGCACCACAAcacgcacacgggccgcgcgccctacaagtgcaccgtctgcgacgacaccttccgcagatactccaacatggtgcagcacaGCCCCCTGACCCTGACGCGGCAGAAGCTGATGGAGCACCACAAcacgcacacgggccgcgcgccctacaagtgcaccgtctgcgacgacaccttccgcagatactccaacatggtgcagcacaGCCCCCTGACCCTGACGCGGCAGAAGCTGATGGAGCACCACAAcacgcacacgggccgcgcgccctacaagtgcaccgtctgcgacgacaccttccgcagatactccaacatggtgcagcacaGCCCCCTGACCCTGACGCGGCAGAAGCTGATGGAGCACCACAAcacgcacacgggccgcgcgccctacaagtgcaccgtctgcgacgacaccttccgcagatactccaacatggtgcagcacaGGGACCGGCACCACCTGCACAAGAAGCGCGTGGTGCGCGACTTCGTGTGCGCGTGCGGCGCGGTGTTCCACTCGCGCGCCAAGCTGCGCTGGCACGCCGAGACGCACGCGGCGCGCCCGCACGCGTGCCTGGCGTGCGGCGACAAGTTCGTGCACGCCGCCAGCCTCACGCGCCACGTGCGGCGCGCGCACGACCCCGCCTACACCGACACGCGGCGGCCGCGCCAGCACAACGTGCCCTGCCCCGTCTGCAAGCAGGTATGTCCCGCGTCCGCGCGTGTGCGTGCGTGTGCGTGTGCGCGACCTACAGCCCGCTTGTTGCAGGTGTACCTGCGCGCcaacctgcgcgcgcacctgctgacgcacagcggcaagcggcccttcgtgtgcgtggtgtgcagcaaggccttcaccaccaagtggaacctcaagctgcaccgctggacgcacgcctcgcgctccgccaagccctacaagtgcgcgctctgcaagggcgccttcatccgccactcggagtacgtggcgcacatgaacgcgcacaaggccgtgcgcccctacacctgcaactactgcggctgccagttcatccgcaagtacaactgccagcgccacgtgcgcgagcacgagaccgccaagaagtacgtgtgcaaggtggccgagtgcggcaagtccttccaccgctcctactacctgtccgagcacatgaaggtgcacagcggcgcgcggcccttcgcgtgcggcgtctgcggcaagctgtccggcaacaagtccaaccacaacaagcacgtgcgcatccaccacgcgcgcgagcccgtggccagcgaggcctag
- Protein Sequence
- MEPLVQCPVCTLFLHNGMSLESHLDTHPKDQVIKALCTLSANNVGFGSRTSTPLHSERSYRSRSRTPAEDSPRWSGRSIDNEKYWRRTPSRCSKTPVSSITRNATPSDVRMGDINFENNSNNATSSPYPGMSFMKSDKSSQPFIQNVPIPEFDPQYVYYPDQQEDKEIKYSRSAEYSAIDNTNNVFTYNMPSAPAVKLQSALVPTVSGSRKHTGLVKILPKPNNILVKASVGGVQYITPGVKPMHLMVPTAPTFVQKNVQNNMIVSGSIPTSQIMEPKSVASPLASSSSQFSQLSTGTFTPGTTVVTQNSQIIYREMVHNIDGKPFLTSMPAVLSGHDNLTNVATTSSMYQNVMVVDQFGNTSCMYTTPQILAKPCSTPVFSEVAVNNLEKTLPNPSNVGNESNKTLIIEVGPIVGAPVEVSTSTSNSQASQTVSEVKTTPGPEKQVERNDTPGSTRGLKILSNIKVEVPVQHHKNLLNTVVDLTGPQPEYSERAVSPERILPDFEDNNHSLSGDCMPSATSSDSIVSNTFSVIKNVGNPPSYKDSVSKSCVVDNKIENEFSDSCPMPDLICNEKPSISPCSELSESGENSTDRISISPKPEAALAPGESPAETKPPLKPFRSAPLRLNNIFVKKHKKILEIKNSKSFCVAPCSSKDLQGPSSPRPPENFTMNKIHSIDVEERTETLQTITVEVCNDNEFDADTEEQSMDLEPVAHSSIRDSSQHSVVHVKEEINSSNEGGFSNERALAPLEAMRPINVIAYGAMADFDDESNHKELLELEAASKNKQFVSMMNDNYFGDNIYADYFTPEREPFERDPKDGYLWGEGPRDGEFVLPTFVHESYKVADSGAPDFVEPRDKDERRDRAAPLPADMDSPWTGMYREVAAGGYELLARESWPSDAEDEPAEPPDCLQEASRAPRTVTCGACRAVFGSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGGTCHLFPRRAARAPRARARARRQLARRRPPRTCRRHQAGGQARACAGRYVSLVPAPSCARTARSCTRPPAARAPPPAADLPPPSSRRTSPSLRREVRVTCSRAELRAHRALVHAPAGSSRAAARRGPAAAIKQEDKPEPAQGESDALLLPDSKEALASSILYPGLETKPVIVKQEPRRRRRDHTCPTCKEDQGSEAAFTAHLKLHPLECLTCGKCFFRRANLALHLKTHQGIKNFKCEVCEKRFLTRQKLMEHHNTHTGRAPYKCTVCDDTFRRYSNMVQHSPLTLTRQKLMEHHNTHTGRAPYKCTVCDDTFRRYSNMVQHSPLTLTRQKLMEHHNTHTGRAPYKCTVCDDTFRRYSNMVQHSPLTLTRQKLMEHHNTHTGRAPYKCTVCDDTFRRYSNMVQHSPLTLTRQKLMEHHNTHTGRAPYKCTVCDDTFRRYSNMVQHSPLTLTRQKLMEHHNTHTGRAPYKCTVCDDTFRRYSNMVQHSPLTLTRQKLMEHHNTHTGRAPYKCTVCDDTFRRYSNMVQHSPLTLTRQKLMEHHNTHTGRAPYKCTVCDDTFRRYSNMVQHSPLTLTRQKLMEHHNTHTGRAPYKCTVCDDTFRRYSNMVQHSPLTLTRQKLMEHHNTHTGRAPYKCTVCDDTFRRYSNMVQHRDRHHLHKKRVVRDFVCACGAVFHSRAKLRWHAETHAARPHACLACGDKFVHAASLTRHVRRAHDPAYTDTRRPRQHNVPCPVCKQVCPASARVRACACARPTARLLQVYLRANLRAHLLTHSGKRPFVCVVCSKAFTTKWNLKLHRWTHASRSAKPYKCALCKGAFIRHSEYVAHMNAHKAVRPYTCNYCGCQFIRKYNCQRHVREHETAKKYVCKVAECGKSFHRSYYLSEHMKVHSGARPFACGVCGKLSGNKSNHNKHVRIHHAREPVASEA
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -