Lyun000125.1
Basic Information
- Insect
- Lamprigera yunnana
- Gene Symbol
- -
- Assembly
- GCA_013368075.1
- Location
- JABVZV010000001.1:5127185-5142415[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 47 0.23 3.2 8.5 1.3 1 23 98 120 98 120 0.98 2 47 0.011 0.16 12.6 1.1 1 23 152 174 152 174 0.97 3 47 0.17 2.4 8.9 0.9 1 23 206 228 206 228 0.97 4 47 0.016 0.22 12.1 1.2 1 23 233 255 233 255 0.98 5 47 0.00077 0.011 16.3 3.6 1 23 260 282 260 282 0.99 6 47 0.59 8.4 7.2 0.3 1 23 287 309 287 309 0.96 7 47 0.0099 0.14 12.8 0.9 1 23 314 336 314 336 0.99 8 47 0.59 8.4 7.2 0.3 1 23 349 371 349 371 0.96 9 47 0.0099 0.14 12.8 0.9 1 23 376 398 376 398 0.99 10 47 0.0045 0.064 13.9 2.4 1 23 403 425 403 425 0.99 11 47 2.8e-05 0.00039 20.8 1.0 1 23 430 452 430 452 0.99 12 47 0.019 0.27 11.9 0.5 1 23 457 479 457 479 0.98 13 47 0.15 2.1 9.1 2.8 1 23 484 506 484 506 0.98 14 47 0.15 2.1 9.1 1.7 1 23 511 533 511 533 0.97 15 47 0.0021 0.03 14.9 1.2 1 23 538 560 538 560 0.98 16 47 0.1 1.4 9.6 2.5 1 23 565 587 565 587 0.98 17 47 1.3 18 6.2 0.7 1 23 592 614 592 614 0.97 18 47 0.052 0.74 10.5 2.1 1 23 620 642 620 642 0.98 19 47 0.0037 0.052 14.1 1.6 1 23 647 669 647 669 0.98 20 47 0.019 0.27 11.9 0.5 1 23 674 696 674 696 0.98 21 47 3.4 48 4.8 0.4 1 23 717 738 717 738 0.96 22 47 0.045 0.64 10.7 1.7 1 23 743 765 743 765 0.98 23 47 3 43 5.0 0.3 1 23 770 792 770 792 0.96 24 47 0.26 3.7 8.3 2.0 1 23 797 819 797 819 0.97 25 47 0.49 6.9 7.5 0.4 1 23 829 851 829 851 0.96 26 47 0.1 1.4 9.6 2.5 1 23 856 878 856 878 0.98 27 47 0.00066 0.0093 16.5 1.1 1 23 883 905 883 905 0.99 28 47 0.00066 0.0093 16.5 1.1 1 23 910 932 910 932 0.99 29 47 3.4 48 4.8 0.4 1 23 937 958 937 958 0.96 30 47 0.052 0.74 10.5 2.1 1 23 964 986 964 986 0.98 31 47 0.73 10 6.9 0.6 1 21 991 1011 991 1012 0.93 32 47 0.019 0.27 11.9 0.5 1 23 1018 1040 1018 1040 0.98 33 47 0.73 10 6.9 3.5 1 23 1061 1083 1061 1083 0.98 34 47 2.2 32 5.4 2.1 1 23 1201 1223 1201 1223 0.97 35 47 0.011 0.15 12.7 0.2 1 23 1255 1277 1255 1277 0.97 36 47 0.00013 0.0019 18.7 4.1 1 23 1309 1331 1309 1331 0.99 37 47 5 71 4.3 3.2 1 23 1336 1358 1336 1358 0.97 38 47 0.00066 0.0094 16.5 2.1 1 23 1363 1385 1363 1385 0.99 39 47 0.0002 0.0028 18.1 3.2 1 23 1390 1412 1390 1412 0.99 40 47 0.025 0.35 11.5 0.5 1 23 1417 1439 1417 1439 0.98 41 47 0.03 0.42 11.3 0.3 1 23 1444 1466 1444 1466 0.98 42 47 0.026 0.37 11.5 0.8 1 23 1471 1493 1471 1493 0.98 43 47 0.00025 0.0035 17.8 3.3 1 23 1498 1520 1498 1520 0.99 44 47 0.074 1 10.0 3.8 1 23 1525 1547 1525 1547 0.97 45 47 4.2 59 4.5 2.0 1 23 1552 1574 1552 1574 0.93 46 47 0.007 0.099 13.3 3.3 1 23 1579 1601 1579 1601 0.98 47 47 0.31 4.3 8.1 0.8 1 23 1606 1628 1606 1628 0.97
Sequence Information
- Coding Sequence
- ATGAATATCAACTCTGTACACTTTATTGAAAGTTGTACTGTAAAATCAGAAGTAATTTTAAcagagacattttctttttgtgggaaacACGAGGATTATGCGAGTAAGGAACTGAAACCGGAGCCAGTAGATTATGAAGAATTGCCTAAACCCAACGAAGAAAATGATCTAGCAGAATCCGTAGCTACGCATGCTGCTCCGATGCGACAAGATTTTTGTAATGACtcgaattttaaaacaattggaGAAAATGGTGTAATAGAAGTTTCCAAACTAACTAAAAGCGCTAAGTACtcttgtaaggaatgtaactttacaaccgagtcagaattttctataaaagaGCATACGAGAATTCACAATGATGgatataatattgaagaatataattttaagacGCCACAGATATTCTCACTATCATCACAGTTGAAAACTTCAAGAAGCGAGAATGAATAtgcttgtaatgaatgtaattacaccacattgaacaaaaataatCTAAGGAAGCATGTGAAGATTCACAAAGGTGTAACATATAGTTGTGAGAAATTTGGTTATAATAGAGGGTGGAAAAGTCACTTCACAGGTCATGAAAAAATTCACTctagtgatgaatataagtgtaaagattgtgattataaaacaaaatggaaaagtgctataatgcaacatgtcaaaatacatacaggcgatgaatataggtgtaaagaatgtgattataaaacaatatggaaaagtgACCTAATGAAACATGTCAAAAGTCATACAGGTGTTatatataagtgcaaagaatgtgattataaaactgtacgaaaaaatcatctaaaggaacatgtcaaaattcatacaggtgatgaatataaatgtaaagaatgtgattataaaacaatatggaaaagtgCTATAATACAACATGTTAAAAtacatacaggcgatgaatataagtgtaaagaatgtgattataaaacaatatggaaaagtgatctaatgaaacatgttaaaattcatacaggttttgaatataagtgcaaagaatgtgatgaatataaatgtaaagaatgtgattataaaacaatatggaaaagtgCTATAATACAACATGTTAAAAtacatacaggcgatgaatataagtgtaaagaatgtgattataaaacaatatggaaaagtgatctaatgaaacatgttaaaattcatacaggttttgaatataagtgcaaagaatgtgattataaaactgtacgaataaatcatctaaaggaacatgtcaaaattcatacaggtgaggaatataagtgtaaagaatgtggttataaaacagttcggaaagataatctaatgcaacatgttaaaattcatacaggagatgaatataagtgtaaagaatgtgattataaaactgtatggaaaggtaatctaatgcaacatgtcaaaattcatacaggtgataaatataagtgtaaagaatgtgattataaaacaatatggaaaaggcatctaatgcaacatgtcaaaatacatacaggctatgaatataagtgtatagaatgtcattataaaacagttttgaAAAGTGACCTAatgcaacatatcaaaattcatacaggtgttgaatataagtgtaaagaatgtgattataaaactgtacgaaaaaatcttctaaagggacatgtcaaaattcatacaggtgatgaatataagtgtaaagaatgtgattataaaactgtatggaaaaatcatttaaaagaacatgtcaaaattcatacaggtgatgaatataagtgtcaagaatgtaaatataaaactgtatggaaaagtgCTATAAtgcaacatctcaaaattcatgcaTGTGAAGAAgaacataagtgtaaagaatgtgattataaaacaatatggaaaagtgatctaatgaaacatgtcaaaattcatatgggtgttgaatataagtgcaaagaatgtgattataaaactgtacgacAAAATCGTCttaaggaacatgtcaaaattcatacaggtgatgaatataagtgtaaggaatgtgattataaaacagtttggaaaggtaatctaatgcaacatgttaaaattcatacaggagatgaatataagtgtaaaggatgtgattataaaactgtatggaaaggtgatgaatataagtgtcaagaatgtgaatataaaactgTATGGTATGCTATAAtgcaacatctcaaaattcatgcaTGTGAAGAACAtcagtgtaaggaatgtgattataaaacaatatggaaaagtgatctaatgaaacatgtcaaaattcatacaggtgatgaatataagtgtcaagaatgtgaatataaaactgtatggaaaagtgctataatgcaacatgtcaaaattcatacaggttttgaatataagtgtatagaatgtcattataaaacagttttgaAAAGTGACCTAAtgaaacatgttaaaattcatacaggtgttgaatataagtgttttgaatataagtgcaaagaatgtgattataaaactgtatggaaaagtgCTATAatgcaacatgttaaaattcatacaggtgttgaatataagtgcaaagaatgtgattataaaactgtatggaaaaatcatttaaaagaacatgtcaaaattcatacaggttttgaatataagtgcaaagaatgtgattataaaacagttttgaAAAGTGACCTAAtgaaacatgttaaaattcatacaggtgataaatataagtgtaaagaatgtgattataaaacagttttgaAAAGTGACCTaatgaaacatgtcaaaattcatatgggtgttgaatataagtgtcaagaatgtgaatataaaactgTATGGTATGCTATAAtgcaacatctcaaaattcatgcaTGTGAAGAAgaacataagtgtaaagaatgtgattataaaacaatatggaaaagtgatctaatgaaacatgtcaaaattcatatgggtgttgaatataagtgtaaagaatgtgattataaaactgtatggaaaattcatctaaaggaacatgtcaaaattcgtacaggtgatgaatataagtgtaaggaatgtgattataaaacagtttggaaaggtaatctaatgcaacatgttaaaattcatacaggagatgaatataagtgtaaaggatgtaattataaaactgtatggaaaggtgatgaatataagtgtaaagaatgtgatcacaaaactgtatggaaaattcatctaaaggaacatgtcaaagttcatacaggtgatgaatataagtgtcaagaatgtgaatataaaactgTATGgtCACCTAATACAATGGATATCGAGCATGTACactttattgaaagttgtgctGTAAAATCAGAAGTCCTTTTAAcagagacattttctttttgtgggaaacACGAAGATTATGCGAGTAAGGAACTAAAACCGGAGCCAGTAGATTACGAAGAATCGCATAGAcccaaagaagaaaatgatctaGTAGCATTCATAGCTACACATGCTGCTCCGATGCAACAAGATTTTTGTAATGActctaattttaaaacaattggaGAGAATGCTGTAACAGAAGTTTTGAAACTTACTAAAAGCGCTAAATACttttgtaaggaatgtaactttacaactgagttaaaattttctataaaagagCATACAAGAATTCACAGTGATGgatataatattgaagaatataattttaagacGCCACAGATTTTCTCACTATCATCACAGTTGAAACCTCCAAGAAGTGGGAATGAATAtgcttgtaatgaatgtaattacaccacaATGAACAAAAATGATCTAGTAATACATGTGAAGATTCACCAAGGTGTAACATATAGTTGTGagaaatttgattataatacaGGGTGGAAAAGTCGCTTCACAGcacatgaaaaaattcattctggtgatgaatataagtgtaaaaactgtgattataaaactgtacggaaaagtcatctaaaggaacatatcaagattcatacaggtgaggaatataagtgtaaagaatgtgagtataaaactgtatggaaaagtaatctaaggtgtcatgtcaaaattcatacaggtaatgaatataagtgtaaagagtgtgattataaaactatacggAAAAGAgacctaatgcaacatgtcaaaagtcatacaggtgatgaatataagtgtaaagaatgtgattataaaaccgtacagaaaagtcatctaaaggaacatgttaaaattcatacaggtgatgaatataagtgtaaagaatgtgattataaaacagtttggaaaagtgctctaatgcaacatgtgaaaattcatacaggtgacgaatataagtgtaaagaatgtgattatgaaactgTATGGAAaggtaatctaatgcaacatgtcaaaattcatacaggtgaagaatataagtgtaaagaatgtgattataaaacagtatggaaaagtcttctaaaggaacacgtcaaaattcataccggtgataaatataagtgtaaagaatgtgattatacaaCAACACGGAAAATGCATCTAATACAACAtgtcaaaatacatacaggtgatgaatataagtgtaaagaatgtgattataaaactgtacggaaaaataatctaatgtgtcatgtaaaaatacatacaggagatgaatataagtgtaaagaatgtgattataaaactgtatggaagGGTAATCTAATGTgtcacgtcaaaattcatacaggtgaggaatataagtgtaaagaatgtgattataaaacggtacatAAAagtcgtctaaaggaacatgccaaaattcatacaggtgatgaatataagtgtaaagaatgtgattataaaacagtatggataAATAATCTAAAGAACCACgaaaaaatacatacaggtgatgaatgtTGTTAA
- Protein Sequence
- MNINSVHFIESCTVKSEVILTETFSFCGKHEDYASKELKPEPVDYEELPKPNEENDLAESVATHAAPMRQDFCNDSNFKTIGENGVIEVSKLTKSAKYSCKECNFTTESEFSIKEHTRIHNDGYNIEEYNFKTPQIFSLSSQLKTSRSENEYACNECNYTTLNKNNLRKHVKIHKGVTYSCEKFGYNRGWKSHFTGHEKIHSSDEYKCKDCDYKTKWKSAIMQHVKIHTGDEYRCKECDYKTIWKSDLMKHVKSHTGVIYKCKECDYKTVRKNHLKEHVKIHTGDEYKCKECDYKTIWKSAIIQHVKIHTGDEYKCKECDYKTIWKSDLMKHVKIHTGFEYKCKECDEYKCKECDYKTIWKSAIIQHVKIHTGDEYKCKECDYKTIWKSDLMKHVKIHTGFEYKCKECDYKTVRINHLKEHVKIHTGEEYKCKECGYKTVRKDNLMQHVKIHTGDEYKCKECDYKTVWKGNLMQHVKIHTGDKYKCKECDYKTIWKRHLMQHVKIHTGYEYKCIECHYKTVLKSDLMQHIKIHTGVEYKCKECDYKTVRKNLLKGHVKIHTGDEYKCKECDYKTVWKNHLKEHVKIHTGDEYKCQECKYKTVWKSAIMQHLKIHACEEEHKCKECDYKTIWKSDLMKHVKIHMGVEYKCKECDYKTVRQNRLKEHVKIHTGDEYKCKECDYKTVWKGNLMQHVKIHTGDEYKCKGCDYKTVWKGDEYKCQECEYKTVWYAIMQHLKIHACEEHQCKECDYKTIWKSDLMKHVKIHTGDEYKCQECEYKTVWKSAIMQHVKIHTGFEYKCIECHYKTVLKSDLMKHVKIHTGVEYKCFEYKCKECDYKTVWKSAIMQHVKIHTGVEYKCKECDYKTVWKNHLKEHVKIHTGFEYKCKECDYKTVLKSDLMKHVKIHTGDKYKCKECDYKTVLKSDLMKHVKIHMGVEYKCQECEYKTVWYAIMQHLKIHACEEEHKCKECDYKTIWKSDLMKHVKIHMGVEYKCKECDYKTVWKIHLKEHVKIRTGDEYKCKECDYKTVWKGNLMQHVKIHTGDEYKCKGCNYKTVWKGDEYKCKECDHKTVWKIHLKEHVKVHTGDEYKCQECEYKTVWSPNTMDIEHVHFIESCAVKSEVLLTETFSFCGKHEDYASKELKPEPVDYEESHRPKEENDLVAFIATHAAPMQQDFCNDSNFKTIGENAVTEVLKLTKSAKYFCKECNFTTELKFSIKEHTRIHSDGYNIEEYNFKTPQIFSLSSQLKPPRSGNEYACNECNYTTMNKNDLVIHVKIHQGVTYSCEKFDYNTGWKSRFTAHEKIHSGDEYKCKNCDYKTVRKSHLKEHIKIHTGEEYKCKECEYKTVWKSNLRCHVKIHTGNEYKCKECDYKTIRKRDLMQHVKSHTGDEYKCKECDYKTVQKSHLKEHVKIHTGDEYKCKECDYKTVWKSALMQHVKIHTGDEYKCKECDYETVWKGNLMQHVKIHTGEEYKCKECDYKTVWKSLLKEHVKIHTGDKYKCKECDYTTTRKMHLIQHVKIHTGDEYKCKECDYKTVRKNNLMCHVKIHTGDEYKCKECDYKTVWKGNLMCHVKIHTGEEYKCKECDYKTVHKSRLKEHAKIHTGDEYKCKECDYKTVWINNLKNHEKIHTGDECC
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -