Lyun000448.1
Basic Information
- Insect
- Lamprigera yunnana
- Gene Symbol
- -
- Assembly
- GCA_013368075.1
- Location
- JABVZV010000047.1:1839495-1843076[-]
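The Location field packs the contig accession, coordinates, and strand into one string. A minimal sketch of how it could be split into parts is shown below; the `Location` type and `parse_location` helper are hypothetical names, and treating the coordinates as 1-based and inclusive is an assumption, not something the record states.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed Location field."""
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Split 'contig:start-end[strand]' into its four parts."""
    match = re.fullmatch(r"([^:]+):(\d+)-(\d+)\[([+-])\]", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Genomic span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("JABVZV010000047.1:1839495-1843076[-]")
print(loc, span_length(loc))
```

The `[-]` suffix is read here as the minus strand, so the coding sequence would be the reverse complement of the contig interval.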
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers in brackets indicate the number of residues. The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
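| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 38 | 0.0057 | 0.08 | 13.5 | 6.1 | 2 | 23 | 10 | 31 | 9 | 31 | 0.96 |
| 2 | 38 | 0.0017 | 0.024 | 15.2 | 0.9 | 2 | 23 | 37 | 58 | 36 | 58 | 0.97 |
| 3 | 38 | 0.0037 | 0.052 | 14.1 | 1.1 | 2 | 23 | 64 | 85 | 63 | 85 | 0.97 |
| 4 | 38 | 0.0017 | 0.024 | 15.2 | 0.9 | 2 | 23 | 91 | 112 | 90 | 112 | 0.97 |
| 5 | 38 | 0.0017 | 0.024 | 15.2 | 0.9 | 2 | 23 | 118 | 139 | 117 | 139 | 0.97 |
| 6 | 38 | 0.0037 | 0.052 | 14.1 | 1.1 | 2 | 23 | 145 | 166 | 144 | 166 | 0.97 |
| 7 | 38 | 0.0057 | 0.08 | 13.5 | 6.1 | 2 | 23 | 172 | 193 | 171 | 193 | 0.96 |
| 8 | 38 | 0.0017 | 0.024 | 15.2 | 0.9 | 2 | 23 | 199 | 220 | 198 | 220 | 0.97 |
| 9 | 38 | 0.0057 | 0.08 | 13.5 | 6.1 | 2 | 23 | 226 | 247 | 225 | 247 | 0.96 |
| 10 | 38 | 0.0017 | 0.024 | 15.2 | 0.9 | 2 | 23 | 253 | 274 | 252 | 274 | 0.97 |
| 11 | 38 | 0.0037 | 0.052 | 14.1 | 1.1 | 2 | 23 | 280 | 301 | 279 | 301 | 0.97 |
| 12 | 38 | 0.0057 | 0.08 | 13.5 | 6.1 | 2 | 23 | 307 | 328 | 306 | 328 | 0.96 |
| 13 | 38 | 0.0037 | 0.052 | 14.1 | 1.1 | 2 | 23 | 334 | 355 | 333 | 355 | 0.97 |
| 14 | 38 | 0.0017 | 0.024 | 15.2 | 0.9 | 2 | 23 | 361 | 382 | 360 | 382 | 0.97 |
| 15 | 38 | 0.0057 | 0.08 | 13.5 | 6.1 | 2 | 23 | 388 | 409 | 387 | 409 | 0.96 |
| 16 | 38 | 1 | 15 | 6.4 | 0.7 | 2 | 23 | 415 | 436 | 414 | 436 | 0.96 |
| 17 | 38 | 0.004 | 0.056 | 14.0 | 1.0 | 2 | 23 | 442 | 463 | 441 | 463 | 0.97 |
| 18 | 38 | 0.0037 | 0.052 | 14.1 | 1.1 | 2 | 23 | 469 | 490 | 468 | 490 | 0.97 |
| 19 | 38 | 0.0057 | 0.08 | 13.5 | 6.1 | 2 | 23 | 496 | 517 | 495 | 517 | 0.96 |
| 20 | 38 | 0.0017 | 0.024 | 15.2 | 0.9 | 2 | 23 | 523 | 544 | 522 | 544 | 0.97 |
| 21 | 38 | 0.002 | 0.029 | 14.9 | 6.1 | 2 | 23 | 550 | 571 | 549 | 571 | 0.97 |
| 22 | 38 | 0.0024 | 0.034 | 14.7 | 6.0 | 2 | 23 | 577 | 598 | 576 | 598 | 0.96 |
| 23 | 38 | 0.0063 | 0.089 | 13.4 | 0.8 | 2 | 23 | 604 | 625 | 603 | 625 | 0.97 |
| 24 | 38 | 0.023 | 0.33 | 11.6 | 3.7 | 2 | 23 | 631 | 652 | 630 | 652 | 0.97 |
| 25 | 38 | 0.12 | 1.7 | 9.4 | 2.6 | 2 | 23 | 658 | 679 | 657 | 679 | 0.96 |
| 26 | 38 | 0.0037 | 0.053 | 14.1 | 0.7 | 2 | 23 | 685 | 706 | 684 | 706 | 0.96 |
| 27 | 38 | 0.0092 | 0.13 | 12.9 | 3.4 | 2 | 23 | 712 | 733 | 711 | 733 | 0.97 |
| 28 | 38 | 0.00081 | 0.011 | 16.2 | 0.8 | 2 | 23 | 739 | 760 | 738 | 760 | 0.96 |
| 29 | 38 | 0.12 | 1.7 | 9.4 | 2.6 | 2 | 23 | 766 | 787 | 765 | 787 | 0.96 |
| 30 | 38 | 0.021 | 0.29 | 11.8 | 0.6 | 2 | 23 | 793 | 814 | 792 | 814 | 0.96 |
| 31 | 38 | 0.0025 | 0.036 | 14.6 | 1.8 | 2 | 23 | 820 | 841 | 819 | 841 | 0.96 |
| 32 | 38 | 0.0021 | 0.029 | 14.9 | 0.4 | 2 | 23 | 847 | 868 | 846 | 868 | 0.96 |
| 33 | 38 | 0.0015 | 0.021 | 15.4 | 2.6 | 2 | 23 | 874 | 895 | 873 | 895 | 0.97 |
| 34 | 38 | 0.00026 | 0.0037 | 17.8 | 3.0 | 2 | 23 | 901 | 922 | 900 | 922 | 0.98 |
| 35 | 38 | 0.0037 | 0.053 | 14.1 | 3.9 | 2 | 23 | 928 | 949 | 927 | 949 | 0.96 |
| 36 | 38 | 0.0039 | 0.055 | 14.1 | 0.9 | 2 | 23 | 955 | 976 | 954 | 976 | 0.97 |
| 37 | 38 | 0.00035 | 0.0049 | 17.4 | 3.5 | 2 | 23 | 982 | 1003 | 981 | 1003 | 0.97 |
| 38 | 38 | 0.074 | 1 | 10.0 | 3.8 | 1 | 23 | 1008 | 1030 | 1008 | 1030 | 0.97 |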
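The pattern given in the Description can be approximated with a regular expression to locate candidate fingers in a protein sequence. The sketch below is a simplification and not the method used to produce the hmmscan hits: it ignores the hydrophobic `#` positions (treating them as any residue), so the 12 residues between the second Cys and the first His collapse to a fixed-length gap. The names `C2H2_PATTERN` and `find_c2h2` are hypothetical.

```python
import re

# C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], with '#' relaxed to any residue:
# X3 + # + X5 + # + X2 = 12 residues between the second Cys and the first His.
C2H2_PATTERN = re.compile(r"C.{1,5}C.{12}H.{3,6}[HC]")


def find_c2h2(protein: str):
    """Return (start, end, motif) for non-overlapping matches, 1-based inclusive."""
    hits = []
    for match in C2H2_PATTERN.finditer(protein.upper()):
        hits.append((match.start() + 1, match.end(), match.group()))
    return hits


# First 60 residues of the protein sequence from this record.
fragment = "MRIHTGDLLNCKECDYKTHRKHNLNTHMKIHTGNELKCKECGYKTPWKYLLKKHMRIHTV"
for start, end, motif in find_c2h2(fragment):
    print(start, end, motif)
```

On this fragment the two matches fall inside the first two hmmscan alignment ranges (10-31 and 37-58). A regular expression is far less sensitive than a profile HMM, so it is best treated as a quick sanity check rather than a replacement for the table above.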
Sequence Information
- Coding Sequence
- atgagaattcatacaggtgatttATTGaactgtaaggaatgtgattataaaactcaTAGGAAACATAATCTTAACACGCATATGAAAAtccatacaggtaatgaattgaagtgtaaagaatgtggctACAAAACTCCttggaaatatttattaaaaaaacatatgagaattcatacagttgataaattgaagtgtaaagaatgtggctACAAAACTCCttggaaatatttattaaaaaaacatatgagacttcatacaggtaatgaattgaagtgtaaagaatgtggctACAAAACTCCttggaaatatttattaaaaaaacatatgagaattcatacaggtaatgaattgaagtgtaaagaatgtggctACAAAACTCCttggaaatatttattaaaaaaacatatgagaattcatacagttgataaattgaagtgtaaagaatgtggctACAAAACTCCttggaaatatttattaaaaaaacatatgaGACTTCATACAGGTGATTTATTGaactgtaaggaatgtgattataaaactcaTAGGAAACATAATCTTAACACGCATATGAAAAtccatacaggtaatgaattgaagtgtaaagaatgtggctACAAAACTCCttggaaatatttattaaaaaaacatatgagaattcatacaggtgatttATTGaactgtaaggaatgtgattataaaactcaTAGGAAACATAATCTTAACACGCATATGAAAAtccatacaggtaatgaattgaagtgtaaagaatgtggctACAAAACTCCttggaaatatttattaaaaaaacatatgagaattcatacaggtaatgaattgaagtgtaaagaatgtggctACAAAACTCCttggaaatatttattaaaaaaacatatgaGACTTCATACAGGTGATTTATTGaactgtaaggaatgtgattataaaactcaTAGGAAACATAATCTTAACACGCATATGAAAAtccatacaggtaatgaattgaagtgtaaagaatgtggctACAAAACTCCttggaaatatttattaaaaaaacatatgagacttcatacaggtaatgaattgaagtgtaaagaatgtggctACAAAACTCCttggaaatatttattaaaaaaacatatgagaattcatacaggtgatttATTGaactgtaaggaatgtgattataaaactcaTAGGAAACATAATCTTAACACGCATATGAAAAtccatacaggtaatgaattgaagtgtgaagaatgtgattttaaaactccATGGAAACATGTATTAAAAGAACATATAAGAATTCATACAGTTGAtaaattgaagtgtaaagaatgtggctACAAAACTCCttggaaatatttattaaaaaaacatatgaaaattcatacaggtaacgaattgaagtgtaaagaatgtggctACAAAACTCCttggaaatatttattaaaaaaacatatgaGACTTCATACAGGTGATTTATTGaactgtaaggaatgtgattataaaactcaTAGGAAACATAATCTTAACACGCATATGAAAAtccatacaggtaatgaattgaagtgtaaagaatgtggctACAAAACTCCttggaaatatttattaaaaaaacatatgagaattcatacaggtaatgaattgaagtgtgaagaatgtgactacaaaacttgTAGGAAACATTTGTTAAAACAACATATGAGACTTCATACAGGTGATTTATTGaactgtaaggaatgtgattataaaactcaTAGGAAACATAATCTTAACAcgcatatgagaattcatacaggtaatgaattgaagtgtaaagaatgtggctACAAAACTCCttggaaatatttattaaaaaaacatctgAGAATTCACACAGGTAATgaattgaagtgtaaagaatgtgactacaaaactccttgtaaatatttattaaaaaaacatatgagaattcatacaggtaatgaattgaagtgtaaagaatgtggctACAAAAct
ccttgtaaatatttattaaaaaaacatatattaCTTCATACAGGTGATTTATTGaactgtaaggaatgtgactataaaaccgCCAGAGAACAAGTTCTTAAaagacatatgaaaattcacacaggtaatgaattgaagtgtaaagaatgtggctACAAAActccttgtaaatatttattaaaaaaacatatgagaattcatacaggtgatttATTGaactgtaaggaatgtgattataaaactgtacgagTACAAGATCTTAAaagacatatgaaaattcacacaggtaatgaattgaagtgtaaagaatgtggctACAAAActccttgtaaatatttattaaaaaaacatatattaCTTCATACAGGTGAGTTATTGAattgtaaggaatgtgattataaaaccgcGCGAGGACAAGTTCTTAAaagacatatgaaaattcacacactTAATGAGTtgaagtgtaatgaatgtgactacaaaactcttcggaaacaattattaaaacgaCATATGAGAGTTCATACAGTTGATGTATTGaactgtaaggaatgtgattataaaactgtacgagTACAAGATCTTAAGAGACATATggaaattcacacaggtgataaattgcagtgtaaagaatgtgactacaaaactgcTAAGAAACATCTCTTAaaagaacatatgaaaattcacacaagTGATAAAatgaagtgtaaagaatgtgactacaaaactactaggaaatttaaattaaaagaacatatgagaattcatacaggtgatgaattgaactgtaaggaatgtgactataaaactgtaaggaaatgtgattttaacagacatatgaaaattcacacaggtgttCAActgaagtgtaaagaatgtgactacaaaactgcTAGAAAACAATACCTTGTCGTACATAcaaaaattcacacaggtgctAAATtgaagtgtgaagaatgtgactacaaaactactaagaaacatctattaaaacaacatatgagaattcatacaggtgatgaattcatgtgtaagaaatgtgactacaaaactcttAGCAAACATAATCTTAACATACatctaaaaattcatacaggtgataaattgTAG
- Protein Sequence
- MRIHTGDLLNCKECDYKTHRKHNLNTHMKIHTGNELKCKECGYKTPWKYLLKKHMRIHTVDKLKCKECGYKTPWKYLLKKHMRLHTGNELKCKECGYKTPWKYLLKKHMRIHTGNELKCKECGYKTPWKYLLKKHMRIHTVDKLKCKECGYKTPWKYLLKKHMRLHTGDLLNCKECDYKTHRKHNLNTHMKIHTGNELKCKECGYKTPWKYLLKKHMRIHTGDLLNCKECDYKTHRKHNLNTHMKIHTGNELKCKECGYKTPWKYLLKKHMRIHTGNELKCKECGYKTPWKYLLKKHMRLHTGDLLNCKECDYKTHRKHNLNTHMKIHTGNELKCKECGYKTPWKYLLKKHMRLHTGNELKCKECGYKTPWKYLLKKHMRIHTGDLLNCKECDYKTHRKHNLNTHMKIHTGNELKCEECDFKTPWKHVLKEHIRIHTVDKLKCKECGYKTPWKYLLKKHMKIHTGNELKCKECGYKTPWKYLLKKHMRLHTGDLLNCKECDYKTHRKHNLNTHMKIHTGNELKCKECGYKTPWKYLLKKHMRIHTGNELKCEECDYKTCRKHLLKQHMRLHTGDLLNCKECDYKTHRKHNLNTHMRIHTGNELKCKECGYKTPWKYLLKKHLRIHTGNELKCKECDYKTPCKYLLKKHMRIHTGNELKCKECGYKTPCKYLLKKHILLHTGDLLNCKECDYKTAREQVLKRHMKIHTGNELKCKECGYKTPCKYLLKKHMRIHTGDLLNCKECDYKTVRVQDLKRHMKIHTGNELKCKECGYKTPCKYLLKKHILLHTGELLNCKECDYKTARGQVLKRHMKIHTLNELKCNECDYKTLRKQLLKRHMRVHTVDVLNCKECDYKTVRVQDLKRHMEIHTGDKLQCKECDYKTAKKHLLKEHMKIHTSDKMKCKECDYKTTRKFKLKEHMRIHTGDELNCKECDYKTVRKCDFNRHMKIHTGVQLKCKECDYKTARKQYLVVHTKIHTGAKLKCEECDYKTTKKHLLKQHMRIHTGDEFMCKKCDYKTLSKHNLNIHLKIHTGDKL
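The coding and protein sequences can be checked against each other by translating the former. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and that mixed upper and lower case in the coding sequence carries no meaning for translation; the `translate` helper is a hypothetical name.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop, 'X' an unknown codon."""
    seq = cds.upper().replace("U", "T")
    codons = (seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First seven and last three codons of the coding sequence in this record.
print(translate("atgagaattcatacaggtgat"))
print(translate("aaattgTAG"))
```

Applied to the full coding sequence, the result with the trailing `*` removed would be expected to equal the protein sequence listed above.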
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -