Llun000519.1
Basic Information
- Insect
- Lordithon lunulatus
- Gene Symbol
- -
- Assembly
- GCA_963942505.1
- Location
- OZ012666.1:1086126-1121615[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 35 1.1 1.8e+02 4.3 0.1 1 23 708 731 708 731 0.92 2 35 0.51 80 5.4 3.6 1 23 778 801 778 801 0.92 3 35 0.41 66 5.7 2.0 2 22 806 826 805 828 0.84 4 35 0.0014 0.22 13.5 0.7 1 23 1281 1304 1281 1304 0.95 5 35 7.4e-05 0.012 17.5 3.3 1 23 1345 1367 1345 1367 0.98 6 35 0.0053 0.83 11.6 3.8 3 23 1371 1391 1369 1391 0.97 7 35 1.2 2e+02 4.2 1.2 2 19 1397 1414 1396 1417 0.89 8 35 4.2 6.6e+02 2.5 1.3 2 23 1803 1825 1802 1825 0.94 9 35 0.0056 0.9 11.5 1.3 1 23 1870 1893 1870 1893 0.95 10 35 6.8e-05 0.011 17.6 1.3 1 23 1930 1952 1930 1952 0.97 11 35 0.0033 0.53 12.3 4.3 2 23 1955 1976 1954 1976 0.96 12 35 0.57 91 5.2 0.5 2 20 1982 2000 1981 2002 0.88 13 35 0.9 1.4e+02 4.6 0.9 2 23 2011 2032 2010 2032 0.92 14 35 0.95 1.5e+02 4.5 3.0 2 20 2041 2059 2040 2062 0.90 15 35 0.067 11 8.2 0.1 3 23 2072 2092 2071 2092 0.97 16 35 0.061 9.7 8.3 0.9 1 21 2095 2115 2095 2116 0.94 17 35 0.34 54 5.9 1.9 1 23 2893 2916 2893 2916 0.95 18 35 2.6 4.1e+02 3.2 1.6 3 19 2929 2945 2927 2947 0.96 19 35 2.1 3.4e+02 3.4 3.1 1 23 2962 2985 2962 2985 0.92 20 35 0.091 14 7.8 1.2 2 21 2990 3009 2989 3010 0.92 21 35 8 1.3e+03 1.6 0.1 1 12 3011 3022 3011 3032 0.84 22 35 0.018 2.9 10.0 0.4 2 23 3041 3062 3040 3062 0.96 23 35 0.00012 0.019 16.8 0.3 2 23 3068 3089 3067 3089 0.97 24 35 4.1e-05 0.0065 18.3 0.3 1 23 3095 3117 3095 3117 0.99 25 35 0.0012 0.2 13.6 0.5 1 20 3123 3142 3123 3142 0.96 26 35 0.021 3.3 9.8 0.2 1 20 3542 3561 3542 3562 0.95 27 35 0.14 22 7.2 0.2 1 23 3571 3593 3571 3593 0.97 28 35 3.5e-05 0.0055 18.5 1.5 1 23 3599 3622 3599 3622 0.96 29 35 0.00038 0.06 15.2 0.8 2 23 3873 3895 3872 3895 0.93 30 35 0.028 4.4 9.4 1.7 1 23 3901 3924 3901 3924 0.96 31 35 5.8e-06 0.00093 20.9 1.5 1 23 3932 3954 3932 3954 0.98 32 35 0.00074 0.12 14.3 3.1 2 23 3960 3981 3960 3981 0.97 33 35 2e-07 3.2e-05 25.6 0.6 1 23 3988 4010 3988 4010 0.98 34 35 9.4e-06 0.0015 20.3 3.1 1 23 4016 4038 4016 4038 0.98 35 35 0.00077 0.12 14.3 0.1 1 23 4044 4067 4044 4067 0.97
Sequence Information
- Coding Sequence
- atggaTAGAAACGCGGTCAACAAGCGGTGCCGCGTTTGCCTGCAAATGGGCATCGGTACTGTGTCCCTGTACGACGCGATTAAACTGCAAGACGGAGATATGTTGTTGTCAGAGGTGTTGGTCAAGTGCTGTGCAGTAGAGATTGGCAAAGATGATAAGAAACCACAGTCTTTATGTACAAAGTGTGCACAGCAAGCCGAGCAGTGCTACCAATTTAAAACGAGCTGTGAGAAGGCAGATGGTATACTGCAGTCTGAACTGGAGAATACTATTGACCTGGACCTCGATATGGATGTGGAAGTGAAAGCTGAACCTGATGAGCCATCCCCTGGTGCATTAGAGAATGAGcctaaaaaattcaaagatatCGCAGACGTGGCCGTGATGCAAATAAAGAAAGACTATGATTACGATTACGACCGCGACGCTGATAACATAATGCCATCATCGCCCCAATCACCACCGTCTAGTCAGAATGCAATTTACGACGACGATACACATACGGGGCCCGAAGAGAAGGCAGCCGCATGTAACCTTGAAATAGCGAAGAATCTCCAGAAATATGACGATACACATGCGGGGTCCGAAAAGAAGGCAGCCGCACACAATCTTGAAACAGCGAAGAAACGCCAGAAATTCGAAGATACGCTTACGAATCCCGAAAGGCAAGTAGCCGCACGCAACGTTGAAATAGCGAAGAAACTTCATAAACACGAAGATACACTTACGAATCCCAAAAAGCAAGCAACCGTACGCAACCTTGAATTAACGAAAAAACTTAATAAACACGAAGATACACTTACGAATCCCGAAAAGCAAGCAGCCTCACGCAACCTTGAAATAGCGAAGAAACTTTATAAACACGAAGATACACTTACGAATCCCGAAAAGCAAGCAGCCGCACGCAACCTTGAAATCGCGAAGAAACGtcagatatacgaagatatacttACGAATCCAGAAAAACAAGCAGTCTCACGCAACGTTGAAATAGCGAAGAATCTTCATAAACACGTAGATACACTTACGAATCCCAAAAAGCAAGCAGCCGTACGCAACCTTGAATTAACGAAGAAACTTAATAAACACGAAGATACACTTACGAATCCCGAAAAGCAAGCAGCCACACGCAACCTTGAAATCGCGAAGAAACGTCAGTTATACGAAGATACACTTACGAATCCAGAAAAACAAGCAGTCTCACGCAACGTTGAAATAGCGAAGAAACTTCATAAACACGAAGATACACTTACGAATCCCAAAAGGCAAGCAGCCGTACGCAACCTTGAATTAACGAAGAAACTTCATAAACACGAAGATACACTTACGTCTCCCAAAAAGCAAGCAGCCGTACGCAACCTTGAATTAACGAAGAAACTTAATAAACACGAAGATACACTTACGAATCCCGAAAAGCAAGCAGCCTCACGCAACCTTGAAATCGCGAAGAAACGTCAGTTATACGAAGATACACTTACGAATCCAGAAAAACAAGCAGTCTCACGCAACGTTGAAATAGCGAAGAAACTTCATAAACACGAAGATACACTTACGAATCCCAAAAAGCAAGCAACCGTACGCAACCTTGAATTAACGAAGAAACTTCATAAACACGAAGATACACTTACGAATCCCGAAAAGCAAGCAGCCACACGCAACCTTCAAACAGCGAAGAAACTCAAGAAATACGACGATACAAATACGGGGCCCGAAAAGAAGGCAGCCGCACACAACCTTGAAACAGGGAAGAATTTCCAGAAATTCGAAGATACACTTACGAATCCCGAAAAGCAGGCAGCCGCACGCAAACTTGAATTAGTAAAGAAACTTAATAAACACgaagaaacatatataaatccCGAAAAGCATGTAGCCGTACGCAACCTTGAAATAGCGAAGGAACTCCAGAAATACGAAGCCCTAATTAGAGCTAAAGGCATAAGGGATGCGTTAAGCTATAGCACCAATAAAAACGATAACGTTTCTAATAGTCGTGTAGTGAAATGTTTCGGGAATAAATCTTTTAAGTATATGTGCCTACAGTGCGGATTTGAGGATGCTTCGAAACAAAAAACTGTAATCCATTCGCAAGCCCATGTTAAAACTGGTTGTGCAGAAAGTTACATTTGCGAGTATTGCAGTGCAGTTTACGATAAGGGCGCGAGTTTAGTCAAACACAAATCCGAAGAGCACTACCATAACCAAACTATAACTGTAGCAGTAAAACGGTGTAGTGATTGTGGTATAATTAACACAAAATCACACAATAATGACTTGAATGATAAATGCAGCGAGagagtttttattgtacgtttGAAACGGATAACTTTTTTCAGTTGTAAATTGTGCGACGTAATGTGTAGGCGTATAGAGGATTTGCATAGGCATAATCAGGAATATCATCCCAATGCTTGTCAGTGTAGCATGTGTGATAAGGTTTTCACCTGTGCGTCCGATCGTAATGCGCATTGGATTGATGTACACAAGAGGGGTATTGTTAAACCGCCAATCCATCTTACTCCTCCACATGATACTCTGGATGAGTGTATAAATATCGAAGCTGTTGACTCAGTAGAGACGATGAAAACAGATACGAATATTGATTTGGTTGGCATGAATGATGGCTATGAAGAAATAATACAATCCACTATTGAATTTATTGATATGAAAACAGAGATTAAAGTTGAAGAGATAATTCCTGTATCGCATAATAATGATTTATCTTGCATTGATTTCGATATGAATCTTGATAAGGATCATACAATAAAAGCTTCTTCCAATAAACGCAAGTTAACACATGTAATTACTAAGATTGAAGATCCTCTAGCGGATGACAATAGTAATATACAAAGAGAGTCATCCGGTTTTCATTTAATGGATTTGGGGTACAAAACAAGTGTTGATTCTATCACTGAAATCAGTGATacgaaaattgatttagaagaaacgaaaatttatttagaagaaacgaaaattaatttagaagaaacgaaaattgatttagaaGAAACGAAAATTGATTTACAAGAAACgaaaattaataacattgaTAATTCTTTTAAACTAATTGACTCTAATACAGTACAGTTAATTAATACCAAGAAGATATTAAAacgtaatataattaataatgatgTATGCACTTCAGTGAGTGATCTTGAATGCATTGAGCAAGTATTCGATTCGGTTATAACATCTACTAATACATTGCAATCGTCTACAGACGTTACTGATATGGTTGAATGTTCTTTATTAGATGATGCAATGTTGTCCAGTTCAGATTTATTGAAATCTTATGAAGATACAGGTACTGTAGTTCATGGTGATGTAATTTTGGATAATCCAGGCGAGCTTGAGTGgttgaatacaaataaaaaagtacGTCTTGATTCTGAATCAGTGGATGGTAGTAGAAATGAGATACGTCAAAACAGGAGTTACGGTCCGACGatgataaaacaaacaaaaaatactagGAAAACCAAAACAGGACATACACCAACAGGAATGTGTATCGCTAACCAAACAAGTCTATACATGTGCAATATATGTGGTTACGAGGACCcatcaaaattgattatattaaatCACTACAATGCGCATGTAGGAAAACAAGATACGCTTGCACGTACACGATACCGATGTAAGCACTGTGCCTTGTACCACAGCACCGGGGAAGCTCTGGTGGAACATAAAAAGGAAAAGCACTTTGCATCCACCGCCAAATCAGTCCTAATATGCTTTTGCAAACTCTGCTCTATTGTGTACACGAAAAAAGATCACACCCTCCACTCGAGCCACTCCGACACCACTATGATCCGCGCAACAAGAATCTTCATATACTCCTGTCAGCTGTGTGGAGAAACATATAAAAGCCAAGACGAATTTTTAACCCACAAACAACTCGTACATCGCGCCATATGCCTCTGCTCTAATAACCCGGCAATCCATAATCAGCAATCACAACAACTCAAGCGAATTAGAAAACAGCCCTGCAAACCGACAGTAAAGGATTCAGATACAATTGAATTCTTTCAATGTAGCAAATGCCCAAAATCCTTTATGACCCGAAAACACTTAACTCTGCACGTAAATGTGCATGAGATCATATGCTCGGTTTGCGACTGCAAATTTAGAAAAGACAAGAATTTCAAGGAACATATGAAAACCCACTATCAGGCTGTGAAGAAATGCACCGGTTGTGACTTGACGTTTGATACCGCAAACGAAATGGACAGCCATTCATGTAAAAAAGTGTCGTATTTGTGCGAACCGTGCGGATTCCGAACAACGAATAAATTCTTTTACGAAAATCACTTGCCAATCCATGTTACGCCTCCACATGATACTCCAGTTGAGTGTACAAATATCGAAGCTTTTAACCCAGGAGATTCGATGAAAATCAAAAACTCTGATACGATTATTGATTTGGATGAGGTGGATAATGGTTATGAAGAAACAATACCAACATCTATTGAATTCATCGATATAAAATCTGGAATTAAAGTTGAAGATATAAATATACCTGAAGAAACTGTATCGCATAATACTGATTTATCTCCTATAGATTTCGATATGAATCTTAATAAGGATCATACAATACAAGCTTGTTCCAATAAACGCAAGTTAAAAGATGTAATTACTAAGATTGAAGATCCTTTAGCGGATGACAATAGCGACATACAACGACAGTCAGATTTTCATTTAATGGATATGGAGTATACAAAAAGTGTTGATTCTATCACTGAAGTCATTGATACGAAAATTGATGTAGAAGAAACTGATATTAAGAACATTAAACTAATTAACTCGAATACATTACAGTTAATTAATACCAAGGAGTTGttaaaaagtaataatgatGTATGTACTTCTGTGAGTGATCTTGACTACACTGAGCAACCATTTCATTCGGATATAACATCTACTAATACATTGAAATCGTCTACAGAAGTTATAGAAGTTGAAGATTCTGATTTGACTGAATATTCTTTATCCGATGATGCAATGTTGTGGGGTTCTGATTTATTGGAAGATGCAGACATTACAGATACTGTAGTGCATGGTGGTGTAATCCCGGATGACCTTGAATGGttgaatagaaataaaaaggCACGTATGGATTCTGAAGGAATGGAGGGTAGCAGTAATGAGGATATACGTCAAAGCAGGAGCTATGGGCCGACGatgataaaacaaacaaaaaaacgcaagaaaaataaaacattaaatataccTACAGGAGATATATCTACAGAACATATATCAACAGGACTTTGCATAGGTAATCTAACATGTCTATACATGTGCAACATATGTGGTTTCGAAAAACCctcaaaatttattatattaaatcacTACAATTCGCACATAGAAAAGCAAACTATGCATACACGTACTATCCTCCGATGTAAACACTGTTTATCAAACTACAGCAGCGGGGATGCTCTGGTGAAACATAAAAAGGAAGAGCACTTTGCATCCACCACCAAATCAGTCCTAATATGCTATTGCAAACTCTGCGCTGTCCTGTACACGAAAAAAGATCACCCGCTCCACTCGGACCACTCCGAGACCACCATCATCCGCGCAACAAGAATTTCCATATACTCCTGTCGACTGTGCGGAGAAACATGTAAAAGCCAAGACGAATTCGTAGCCCATACACAACTCATGCACAACGACACATGCCTCTGTCCGGCCTTATATGATAAACAATCAGCACAACTCATGGAGAAAAAACTACCCTGCAAACCGGTGGCAACCGATTCAGACTCAATTGAATTCTTTCAATGCTCCGAATGCCCGAAATCGTTTATGTCGCGCAAGTATTTATCGCAGCACATGAGCGCTCACGAATGCACGTGTTCCGTTTGCGACCGCAAGTTTCGGACGGATTTCATGCTCAAAGAGCACATGAAGAACCACTATCAGACCGTGATGAAATGCACCGGTTGTGACTTGACGTTCGATACCGCAGAGGAAATATACTCGCATTCGTGCACAAAAGTGCCCAACAAACCGGCGTATTTGTGCGAACCGTGCGGCTTCAAAACAACgaacaaatatttttacacCAATCACTTGCTGGTGCACAGGCCCAAAGTGTTGCACAAATTGTGTTGCTCGATTTGTTTTGTTAAGTTCAAATCGCAACGGGAATTGAATTTGCACAAGCCGAAACACATACTGCGTGTTACGGATTGTAATGATTGTAAGATATGCGGGATAAGCGTTAATGACGATGAAGATATGCGAACACATATGTTCTCGCATGGCTGTTTTAAATGCGCTAAGTGTGATGTTGTATATCCGGATGAAACTAGTCTTAGCAGGCACAAATGTAGATTACCAGCTCTTGCTGCAGTACCCGAGGCGAATTGGGCTCAAAAAGACACATTAGTTGATCCAATAAACACACCAGTACCTGACCTACTATATTATGGTACACcagataataatttcatttcaaaaattgatataGAGGAAACTCAAATTATAAAGGAGGAGCCTATTAAAATCATTGAGAACGTAAAAAAGGATGTAAGAAAACCGAAACCACAGTTTAGTTACATTAACGAAACTAAAACTGTTCTAAAAGAGTCTCTTAAAATCTTCCAGAAAgctaatattaaaacatatgttGCAACTAAGCTCAAACCAAAACCAATTTCGACATTTGATATGAATGAAACTCGAAGTAAAATGGATATGTCTGCTAAAATTAGTAGGGAAACAACCCAAGTTCCAGTTAAATTAAAAGGAAAAATAATTTCGATAATTGATATTGATCAAATTAGAAATGCAGTGGATGCGTCTGCTAAAATTAGTCATGAAGAAGCCTACAGTCCAGCTAAAATAACACCACAATTAATGACAAAAATTGATATCGAAAAAACTCGAAATTCAGTGGATGTGTCTGCTAAAATTAGTCAGGAAGAAACCTACGTTCCAGCTAAAATAACACCACAATTAATGTCAAAAATTGATATTGATAAAATTCGAAATGCAGTGGATGTGTCTACTAAAATTAGTGAAGAAGAAACCTACGGTTCAGCTAAAATAAAACCACAATTAATGTCAAAAACTGATATTGATAAAATTCGAAATACAGTGGATATGTCTGCTAAAATTAGTCAGGAAGAAACCTACGTTCCAGCTAAAATAAGACCACAATTAATGTCAAAAATTGATATCGATACAACTCGAAATGCAGTGGATACGTCTACTAAAATTAGCCATGAAGAAACCTACGGTCCAGCTAAAATAACACCACAATTAATGTCAACAATTGATATTGATAAAACTCGAAGTATAATGGATGAGTCTACTAAAATTAGTCGGGGAGCAACTTACGGTCCAGTTAAAAAGAAagcaaaattaatttctaaacgTAATATTGATGAAAGTCGAATTATAATGGAACAATCGACTAATAACACCTCTGTTTTAATTCAGCCAAAACCAAAAATATCGTGTTTTCGGTTTGAGAATGAAATGAGAGAGCATGTACACGAAGGCGAACAGAATTATATTACTCAAAATATAAGTAAAGAAGACACATTGGTGGCTATAAATACCTTGCAAAATATAAACATGATAGGTTATAATTACAATACATGTGATTTAATTTCGGATATTGGAGTTGATGATGGTCCCATTGGGCAGAATGTGTTTATCACTAAACCTAAAATGGATACAATTATGAATATACAGTGTGATGACGAAATACGTGAGAGCTCTAGTTACAGTACATATGTACCAGTAAAAATGAATTCAAATGCAAATCCAAATATAGgtcttcaaaaaacatataaaaaggtTACTCCTAAAAATGTTAATCTTAAAAAGGTTAACTATAAAAAAGTTATCCCAACAAAGGTTATACATGAAAAGGCTACTGTTAAAAAGGTTACACATGAGAAGGCTATTTCTGAAAATGTTAGTGATGGTAAGGTTATTCATGATAATGTTACTCATGATAATGTTACTCATGATAATGTTACTCATGAAAAAGGTACTCCTCAAAAGGTCAAACATGAAAAGGCTATTCCTGAGAAGATTCCTTATGTAAACGTTACTCATGAAAATGTTAACGATGATAATGTTATACCTGAAAAGGTTACTGCTGAAAAACATGAGAAGCCTATTCCTGGAAAGGTTACTGATGACCAGGTGTCTTATGAAAATGTTACTCATGGAAAAATTACTTCTATAAAGGTAACTCATAAAAATGTTAGCCATGAAAATGATACACATGAAACGGTTACTCATGAAAAGGTTACGCATAAAAAGGCTAGTACTGAAAAGGTTACACATGAAAAGGCTAATCTTAAAAAGTTTACTGATGAAAAGGTTCCTTATGAAAACATTACACGTCGAAGAAGTACTAGATCCATAACAAATGTTACCCGTAAAAATATTACGCATGAAACTGTTATTCGTGAAACTGTTACTCATGAAAAGACCACACCTGAAAAGGTTACTAACAAAAAGGTTCCCCGTAAAAAGGTtactaaaaaaaagaaattatccCAGATCCGTCATACTGGACAAACATGTCAGGCTAAAAGATCTGTAGATTTCATGTGCATTAAGTGTGGATTTGAAAGTCCGTTAAAAATTACAATGGCACAACATTACATCACGCATTCACAAATAAATAACTCCGAAGCGTACGAATGCCAACACTGCAACCAGCAATATTCAGAAATGGAAGAACTCCTGGACCACAAAACGAAGCAACACTACCCGAAATCCAACCGAAGTATAGCAATATACTGGTGTAAAACATGTAACATAGTATGCCGCTACAAAGAAGACTTCAAAGAACATTCAACGCACCCAGAAGCAGAAACAATATTCGCCTTAAGAATCACCACCTACCGCTGCCCACTGTGCCCAGCAATTTGCGAAACCTTCCACGATTTCACCACACATAAATTAGATAAACACCCCGACACGTGTAACTGCTCTGATTGTTCGGAAGTCTTTAAAACCGAACGGAATACTAAGATTCATCTTAAAGGGTTTGTGTGTGATATATGCGGCTATACAGGCTCCGATCTACGTAAAAAGGTGCATATGCTGCAACACAACAAGCCGAGCCGTGGCAGCTGGCAATGCTCGATATGCCCCAAAATATTACAGACGAAAATGATGCTGAGAAAGCACGAGCTCAGGCACCAGCAGTGTTTTGATGTGTGTAAACTGTGTGGAGCTACATTTACTAGCTACTATGGCTTATCGAATCATATGCAACGGCATGCTGATAAGACGGAGTATAAATGTGATATTTGTGGAAGTGTTTTGAAAACTCGGGAGTCTTTGAGGAACCATATGAACATTCATACTGGTGCGAGGAGTTACGAGTGCCAGACGTGTGGGAAGTCGTTTAATGATggatataattataaagtacACTTGCCAGCTCTTGATGCACACCCCATGGCATATGGGGCCCATGCAAACAGATTACTTGATGCAATAGATAGACCAGTACCGGAACTGGTACACTTTAGTACAACTGATGAAAATGGTATTGATGAAACTTGTATTATAATGAAGAAGCCTGTTACAACTAGTcagaataaaagaaaaaatagcaAGCCCTTTCAAATTAACTCTAAATCAAAGTTAATTGAGAACAGTGATATTGACGAAACTCGAAATCTAATGGTCGAGTCTTATAAAATCAGTCGTAAACCAAGTGTTACAGACAACGAGACGACAGCAAACACTGGTAGCTATAGTTGGTGTCATTCTGATATTGATATAAAACTTGAAATTGAGGATACACAAATTATAACAGAAGATGACCATATGAACACCAATTACGattacaaagaaattaataGCCCAAGTACATCACTAGTGGTGATTAATGAAAGCCAGGACGCAAAGTATCAACTGACTAACGAAGATAATAACATTGAAATCGCAGGCTCCAGCACAACAATTAAAGAAATGAAAGGAATCCTCGAGACCAGATTATTTCTGGGTTCCGATCAAATGTCATCGGATGACGATTTTATGCGAGTCCAGCTTGACTGTTGTGGGATTGTACTGCATTCGCTGCAGGGTGCAAAACACACGCGTACACGAGTTGATTCGTTACACGCGTTGATTTACATAACGACCCATTTGATGACACGAGTTGATTTACAGCACGAgCCATCCAGCGATGAGATCGTAAAAGCAACGAGGGATGAGCATAGAAATAAATCCCCAACAGAGATGCAgagaaatgagaaatattttgGTTTCAGTTCAGATGATACGGATAGTGATGACGATATTCCATTAAAAAATTGTACGTTACAACATTCTGTCTCAAATGAAACTGATATGAATGATATAACAAAGAAAGCCACTACGGATACTGAAGAGCGTATATTGGATCAACACAAACCAATCCAGGATGTACCAGTAAACTCTGACAAACATATTGCAGCACAAACAGATGATGAAATCAATATTGATACAGATATAAAGCTTGATATCGAGGATTATGTTATAACAAAACAAGAACGTGATAATGCAACAAATAATTCCATTTGTAGTGAAGAGCATCCAAACTTATCAGAGTTAGGCAAACATCTGCTGGAAATACACGGCAATACGCACCAGTGCGACCAATGCCCTCTTGCATTTGAAAACGCTACGAGTCTTGAACAGCACACCACTTGTGTGCACAGCCAAATGGCGCTTTATATATGCGAGTTTTGTGGATATAAAACTAATGAATTAAACAGTTTAAACTTACATTTGGATATACATTCGGATGTACGAGCATATAAGTGCCCGGAATGTTCTAAACAGTTTAAATCGCGAAATTCATTAAGCCGACACCGTGAAAGTGCGCACACGAGGCCAGCTGTTGACGGTACACATTTTGCTGCGATGGAAATGTATCAGCCACAAATGGATAACctaaatgtttatatagaaaAGTCTGATGATAAAGACAGTGATGTCGGTTTTCCACTGACATATCTGAAACAAGAATTGTTTGTGGAAAACGTACctttaattgatttaaatgCAATTAAATATGAAGAGACAACAAGTGATTACAGCGACCCTGTTATAACACAACAAGAAGGATTACCAAATATTCCAACCGAATTGGCTAATTGTACAACAAGCAAAGGAATCAAATCTGAACATGTTGGGGAGCTTCAAACAAATAATGAGACATTAATCGAGTCCCAACCTCTAAATTTAAAAGGTTATGGTGTAAAGTTCCGCAAAATCTTTTGTAACCAGTGCGATTTTGTATGCAGAACTAAACTACAAATCATGGAACATTGTAAAATAcattatgaattaaaaaatgaagAAACAGATGAATATGATAGCCCGGATGATGATATATACCCGCCTAAAATCTCAAAAAACACTGCAATGATATACCGCTGTAGAATCTGTAATCTATTTTGGAAGAACCATCACATTTATGTAGAACATAAGTTAATTCATAACGacaaaaaatgtatggaaatcatacattttaaaatgaaaacagCATACTCCTGCGAGATTTGTAATGAAGAACACGACAGTATAACCGATTTGGATAAACACAAGCTACAGAATCACTCAAACAATACCCTACAATGCAAACAATGCCCGCGTACATTCACAAACAACATAGCCTTAGAAAAACATACAAGTCAATCCCATACTAACTCAATCAATTACATATGTGAGTTATGTGCGTACTCTTGTAAAAGCAGCTCCCAGCTATACATCCATAAAAATAGCATCCACAAAGATCCTACAATAACGCCATTCAAATGCCCCAAATGCCCTAAAGCGTTTAAATCCAGGAGGGCATTAGTTAACCACATGCATGTCCATAAACAGTACAACGGCGTATGTGAGTTGTGTCAAGCCGTCTTTACCACACGGTATGGTTTGCACCATCACATGCGAAGACATGAGATGCATGAAAAAAAATTTGTGTGCACCATCTGCCCTAAAGCTTTCACTAATAAGGACTCGCTTAAACGGCATGTTAATACACATACTGGTGAGAAGACGTTTAAATGCGAGACATGCGGGAAATGCTTTAATAGGGGGAGTAATTTGCGGACACATTTGTCGGTGCATACGGGTGTTAAGGGGTATGTTTGTGAGGTTTGTGGCAGTGCGTTTCGGGTGGCTGTTAGTTTGAGAACGCACATGGCTAAATATCATTTGAAGGAAAAGTGA
- Protein Sequence
- MDRNAVNKRCRVCLQMGIGTVSLYDAIKLQDGDMLLSEVLVKCCAVEIGKDDKKPQSLCTKCAQQAEQCYQFKTSCEKADGILQSELENTIDLDLDMDVEVKAEPDEPSPGALENEPKKFKDIADVAVMQIKKDYDYDYDRDADNIMPSSPQSPPSSQNAIYDDDTHTGPEEKAAACNLEIAKNLQKYDDTHAGSEKKAAAHNLETAKKRQKFEDTLTNPERQVAARNVEIAKKLHKHEDTLTNPKKQATVRNLELTKKLNKHEDTLTNPEKQAASRNLEIAKKLYKHEDTLTNPEKQAAARNLEIAKKRQIYEDILTNPEKQAVSRNVEIAKNLHKHVDTLTNPKKQAAVRNLELTKKLNKHEDTLTNPEKQAATRNLEIAKKRQLYEDTLTNPEKQAVSRNVEIAKKLHKHEDTLTNPKRQAAVRNLELTKKLHKHEDTLTSPKKQAAVRNLELTKKLNKHEDTLTNPEKQAASRNLEIAKKRQLYEDTLTNPEKQAVSRNVEIAKKLHKHEDTLTNPKKQATVRNLELTKKLHKHEDTLTNPEKQAATRNLQTAKKLKKYDDTNTGPEKKAAAHNLETGKNFQKFEDTLTNPEKQAAARKLELVKKLNKHEETYINPEKHVAVRNLEIAKELQKYEALIRAKGIRDALSYSTNKNDNVSNSRVVKCFGNKSFKYMCLQCGFEDASKQKTVIHSQAHVKTGCAESYICEYCSAVYDKGASLVKHKSEEHYHNQTITVAVKRCSDCGIINTKSHNNDLNDKCSERVFIVRLKRITFFSCKLCDVMCRRIEDLHRHNQEYHPNACQCSMCDKVFTCASDRNAHWIDVHKRGIVKPPIHLTPPHDTLDECINIEAVDSVETMKTDTNIDLVGMNDGYEEIIQSTIEFIDMKTEIKVEEIIPVSHNNDLSCIDFDMNLDKDHTIKASSNKRKLTHVITKIEDPLADDNSNIQRESSGFHLMDLGYKTSVDSITEISDTKIDLEETKIYLEETKINLEETKIDLEETKIDLQETKINNIDNSFKLIDSNTVQLINTKKILKRNIINNDVCTSVSDLECIEQVFDSVITSTNTLQSSTDVTDMVECSLLDDAMLSSSDLLKSYEDTGTVVHGDVILDNPGELEWLNTNKKVRLDSESVDGSRNEIRQNRSYGPTMIKQTKNTRKTKTGHTPTGMCIANQTSLYMCNICGYEDPSKLIILNHYNAHVGKQDTLARTRYRCKHCALYHSTGEALVEHKKEKHFASTAKSVLICFCKLCSIVYTKKDHTLHSSHSDTTMIRATRIFIYSCQLCGETYKSQDEFLTHKQLVHRAICLCSNNPAIHNQQSQQLKRIRKQPCKPTVKDSDTIEFFQCSKCPKSFMTRKHLTLHVNVHEIICSVCDCKFRKDKNFKEHMKTHYQAVKKCTGCDLTFDTANEMDSHSCKKVSYLCEPCGFRTTNKFFYENHLPIHVTPPHDTPVECTNIEAFNPGDSMKIKNSDTIIDLDEVDNGYEETIPTSIEFIDIKSGIKVEDINIPEETVSHNTDLSPIDFDMNLNKDHTIQACSNKRKLKDVITKIEDPLADDNSDIQRQSDFHLMDMEYTKSVDSITEVIDTKIDVEETDIKNIKLINSNTLQLINTKELLKSNNDVCTSVSDLDYTEQPFHSDITSTNTLKSSTEVIEVEDSDLTEYSLSDDAMLWGSDLLEDADITDTVVHGGVIPDDLEWLNRNKKARMDSEGMEGSSNEDIRQSRSYGPTMIKQTKKRKKNKTLNIPTGDISTEHISTGLCIGNLTCLYMCNICGFEKPSKFIILNHYNSHIEKQTMHTRTILRCKHCLSNYSSGDALVKHKKEEHFASTTKSVLICYCKLCAVLYTKKDHPLHSDHSETTIIRATRISIYSCRLCGETCKSQDEFVAHTQLMHNDTCLCPALYDKQSAQLMEKKLPCKPVATDSDSIEFFQCSECPKSFMSRKYLSQHMSAHECTCSVCDRKFRTDFMLKEHMKNHYQTVMKCTGCDLTFDTAEEIYSHSCTKVPNKPAYLCEPCGFKTTNKYFYTNHLLVHRPKVLHKLCCSICFVKFKSQRELNLHKPKHILRVTDCNDCKICGISVNDDEDMRTHMFSHGCFKCAKCDVVYPDETSLSRHKCRLPALAAVPEANWAQKDTLVDPINTPVPDLLYYGTPDNNFISKIDIEETQIIKEEPIKIIENVKKDVRKPKPQFSYINETKTVLKESLKIFQKANIKTYVATKLKPKPISTFDMNETRSKMDMSAKISRETTQVPVKLKGKIISIIDIDQIRNAVDASAKISHEEAYSPAKITPQLMTKIDIEKTRNSVDVSAKISQEETYVPAKITPQLMSKIDIDKIRNAVDVSTKISEEETYGSAKIKPQLMSKTDIDKIRNTVDMSAKISQEETYVPAKIRPQLMSKIDIDTTRNAVDTSTKISHEETYGPAKITPQLMSTIDIDKTRSIMDESTKISRGATYGPVKKKAKLISKRNIDESRIIMEQSTNNTSVLIQPKPKISCFRFENEMREHVHEGEQNYITQNISKEDTLVAINTLQNINMIGYNYNTCDLISDIGVDDGPIGQNVFITKPKMDTIMNIQCDDEIRESSSYSTYVPVKMNSNANPNIGLQKTYKKVTPKNVNLKKVNYKKVIPTKVIHEKATVKKVTHEKAISENVSDGKVIHDNVTHDNVTHDNVTHEKGTPQKVKHEKAIPEKIPYVNVTHENVNDDNVIPEKVTAEKHEKPIPGKVTDDQVSYENVTHGKITSIKVTHKNVSHENDTHETVTHEKVTHKKASTEKVTHEKANLKKFTDEKVPYENITRRRSTRSITNVTRKNITHETVIRETVTHEKTTPEKVTNKKVPRKKVTKKKKLSQIRHTGQTCQAKRSVDFMCIKCGFESPLKITMAQHYITHSQINNSEAYECQHCNQQYSEMEELLDHKTKQHYPKSNRSIAIYWCKTCNIVCRYKEDFKEHSTHPEAETIFALRITTYRCPLCPAICETFHDFTTHKLDKHPDTCNCSDCSEVFKTERNTKIHLKGFVCDICGYTGSDLRKKVHMLQHNKPSRGSWQCSICPKILQTKMMLRKHELRHQQCFDVCKLCGATFTSYYGLSNHMQRHADKTEYKCDICGSVLKTRESLRNHMNIHTGARSYECQTCGKSFNDGYNYKVHLPALDAHPMAYGAHANRLLDAIDRPVPELVHFSTTDENGIDETCIIMKKPVTTSQNKRKNSKPFQINSKSKLIENSDIDETRNLMVESYKISRKPSVTDNETTANTGSYSWCHSDIDIKLEIEDTQIITEDDHMNTNYDYKEINSPSTSLVVINESQDAKYQLTNEDNNIEIAGSSTTIKEMKGILETRLFLGSDQMSSDDDFMRVQLDCCGIVLHSLQGAKHTRTRVDSLHALIYITTHLMTRVDLQHEPSSDEIVKATRDEHRNKSPTEMQRNEKYFGFSSDDTDSDDDIPLKNCTLQHSVSNETDMNDITKKATTDTEERILDQHKPIQDVPVNSDKHIAAQTDDEINIDTDIKLDIEDYVITKQERDNATNNSICSEEHPNLSELGKHLLEIHGNTHQCDQCPLAFENATSLEQHTTCVHSQMALYICEFCGYKTNELNSLNLHLDIHSDVRAYKCPECSKQFKSRNSLSRHRESAHTRPAVDGTHFAAMEMYQPQMDNLNVYIEKSDDKDSDVGFPLTYLKQELFVENVPLIDLNAIKYEETTSDYSDPVITQQEGLPNIPTELANCTTSKGIKSEHVGELQTNNETLIESQPLNLKGYGVKFRKIFCNQCDFVCRTKLQIMEHCKIHYELKNEETDEYDSPDDDIYPPKISKNTAMIYRCRICNLFWKNHHIYVEHKLIHNDKKCMEIIHFKMKTAYSCEICNEEHDSITDLDKHKLQNHSNNTLQCKQCPRTFTNNIALEKHTSQSHTNSINYICELCAYSCKSSSQLYIHKNSIHKDPTITPFKCPKCPKAFKSRRALVNHMHVHKQYNGVCELCQAVFTTRYGLHHHMRRHEMHEKKFVCTICPKAFTNKDSLKRHVNTHTGEKTFKCETCGKCFNRGSNLRTHLSVHTGVKGYVCEVCGSAFRVAVSLRTHMAKYHLKEK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -