Ecen021389.1
Basic Information
- Insect
- Eupithecia centaureata
- Gene Symbol
- -
- Assembly
- GCA_944548335.1
- Location
- CALYMU010001134.1:8543-29261[+]
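
The Location value above packs a contig accession, a coordinate range, and a strand into one string. A minimal sketch for splitting it apart follows; the `Location` type and `parse_location` helper are hypothetical names introduced here, and the span calculation assumes 1-based inclusive coordinates, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start-end[strand]' string."""
    contig: str
    start: int
    end: int
    strand: str


# contig name, a colon, start-end, then the strand in square brackets
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into its four parts, or raise ValueError."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("CALYMU010001134.1:8543-29261[+]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc, span)
```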
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines coordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
   #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
   1  30     0.077       6.1    8.3   2.1         1      23       368     390       368     390  0.96
   2  30      0.08       6.3    8.2   1.4         2      23       398     420       397     420  0.94
   3  30     0.012      0.93   10.9   2.0         1      23       451     474       451     474  0.96
   4  30       1.4   1.1e+02    4.4   1.7         2      23       514     536       514     536  0.96
   5  30       3.1   2.4e+02    3.3   1.2         1      21       544     564       544     568  0.91
   6  30      0.12       9.4    7.7   5.5         1      23       643     665       643     665  0.94
   7  30    0.0013       0.1   13.9   0.6         1      23       671     693       671     693  0.96
   8  30   0.00099     0.078   14.2   0.5         1      21       698     718       698     719  0.96
   9  30   0.00099     0.078   14.2   0.5         1      21       747     767       747     768  0.96
  10  30   0.00099     0.078   14.2   0.5         1      21       796     816       796     817  0.96
  11  30   0.00099     0.078   14.2   0.5         1      21       845     865       845     866  0.96
  12  30   0.00099     0.078   14.2   0.5         1      21       894     914       894     915  0.96
  13  30   0.00099     0.078   14.2   0.5         1      21       943     963       943     964  0.96
  14  30   0.00099     0.078   14.2   0.5         1      21       992    1012       992    1013  0.96
  15  30   0.00099     0.078   14.2   0.5         1      21      1041    1061      1041    1062  0.96
  16  30   0.00099     0.078   14.2   0.5         1      21      1090    1110      1090    1111  0.96
  17  30   0.00099     0.078   14.2   0.5         1      21      1139    1159      1139    1160  0.96
  18  30   0.00099     0.078   14.2   0.5         1      21      1188    1208      1188    1209  0.96
  19  30   0.00099     0.078   14.2   0.5         1      21      1237    1257      1237    1258  0.96
  20  30   0.00099     0.078   14.2   0.5         1      21      1286    1306      1286    1307  0.96
  21  30   0.00099     0.078   14.2   0.5         1      21      1335    1355      1335    1356  0.96
  22  30   0.00099     0.078   14.2   0.5         1      21      1384    1404      1384    1405  0.96
  23  30   0.00099     0.078   14.2   0.5         1      21      1433    1453      1433    1454  0.96
  24  30   0.00099     0.078   14.2   0.5         1      21      1482    1502      1482    1503  0.96
  25  30   0.00099     0.078   14.2   0.5         1      21      1531    1551      1531    1552  0.96
  26  30   0.00099     0.078   14.2   0.5         1      21      1580    1600      1580    1601  0.96
  27  30   0.00099     0.078   14.2   0.5         1      21      1629    1649      1629    1650  0.96
  28  30   0.00099     0.078   14.2   0.5         1      21      1678    1698      1678    1699  0.96
  29  30   0.00032     0.025   15.8   1.3         1      23      1727    1749      1727    1749  0.98
  30  30     0.022       1.7   10.0   1.9         1      23      1755    1778      1755    1778  0.95
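
The residue pattern quoted in the Description can be turned into a regular expression for a quick scan of a protein string. The sketch below is only an illustration of that pattern, not the profile HMM search (PF00096) that produced the hits listed under Hmmscan Out. It assumes the positions marked # may be any residue, since the description does not restrict them, and `find_c2h2_candidates` is a hypothetical helper name. The test string is one finger-like stretch copied from the protein sequence of this record.

```python
import re

# Regex rendering of: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# Assumption: '#' positions are left unconstrained (any residue).
C2H2_MOTIF = re.compile(
    r"..C.{1,5}C"   # #-X-C-X(1-5)-C
    r".{3}."        # X3-#
    r".{5}."        # X5-#
    r".{2}H"        # X2-H
    r".{3,6}[HC]"   # X(3-6)-[H/C]
)


def find_c2h2_candidates(protein: str) -> list[tuple[int, int, str]]:
    """Return (start, end, matched text) with 1-based inclusive positions.

    Matches are non-overlapping and purely pattern-based, so they are
    candidates only and may differ from profile-HMM domain calls.
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_MOTIF.finditer(protein.upper())
    ]


# One finger-like stretch taken from the protein sequence of this record.
example = "FKCELCYKGFDAQSHLDYHNKWH"
hits = find_c2h2_candidates(example)
print(hits)
```

A regular expression treats every permitted residue as equally likely, which is why the record relies on HMM scores and E-values rather than on a fixed pattern.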
Sequence Information
- Coding Sequence
- ATGCCAAGAACAGTATGCCGTATCTGTCTCAGGACTGATGTTATTGTGAGAGATATAGATGAAAGACTTGTCGATTTACACGACGAAATTTGTGATTATGATATTTATAAGAACAAACAACTAAAATACTGTGGCATATGTGAACATCTACTGAGGAAGTACCACAACTTCAGAGAGAGGTGCAGTCGGGCGCACAAGCTACTACGCATCTGGCACCACAATGTTGAAGATATTCCTGAAAATCTCCATGAACAATTACCATCGCCTATTTCCATCTGCCATAACAAAGCTCGACACCAGATCTACGAGATCCGATACGACCTTGACCCTGACCGCGACTCAGGCGTTGAGGTCAATAAATATGACCTCTCTGCACTGGAAACCCTCCCTGACCTTGAAGGTCAAATAAAAACAGAGACAGATGACAAAGGCAGTTGTTTATTTGTAACAAAAGAGATTGAATACCCCCAAGAAATAAGGTTCGATTTTGAACTGAAGAAGAATGAGATACAAGACGCGGTAGAACTACAAAACGACGAAGACATCAGCTCTATGGAAGACGATATACCTCTCAGTAATTTAAAGGAGGAGAATATAGACTTCAGTGATATTGAAGAGTTTAAAACAAAATCGAAAAGAGGGAAAGCCACAAAAGAGAAAGCAAAGAGTAAAAAGAAAACAGAAGTTAAATACGAAGACGTAGACGATATTTCACTCGCTGAGTTGGAGAAACTCGGTTCTGATGATAAAATGATAAAAGCGAGGCAGAAGTTCCTTAAAAATAAGGCGAGATTAAAGGATAGGATATTAGAGTTGCTTGATACGGGTCAGTTTAATGATCTGGGTGATGATGTTAAGATGAGCAAAGAAATATTCGTTAAGATATTCAGTAGATATGACAAATGTGATATGAGAATGCTGTCGATGAAGTCAGATTCCCCCGAGCTCAAAGAGAAGCTGCGTCGCTTCCTTCTGTACCTGTTAAGGAAGTACAATCCGGAGTACGAGACCGTTTTCCATGTCGAGTATCTGGACACCGACGAGAAGAAACTACAGGAGTTCAACCGTCGCAAACACGGCGCCGCGTACCTGGAGGCGCCGTTCAAATGCGAGCTCTGCTACAAGGGGTTCGACGCACAGAGCCATCTCGACTACCACAACAAGTGGCACAAGCCGACCCCCGGCGACATAGTGTGCAACATCTGCCCCATGCGGTTTGTCTCCAAGGCAACCTGTAAGGCTCACCGGAAGAACAACCACACAGTCGTGTATTCCTGCCGGCAGTGCGCAGTCATCACTTACCACTTTTCCGTGATGATAAACCATTACAATCAACACAAGGGAGTGGAGTATCATTGCCCGACTTGCCTCAGCGTTTACACCAATAAGACCACCCTAAACAACCATCGGCGCGAGGCGCACGCCACCGCGCACATCTGCACGGCCTGCGCGCGCAGCTTCGTCTCAGAGAAAGGACTGTTCTTCCACAGGAAATACAAGCACAAGGACGAGGCCGAGGCAAACCCGGAGGAGGCAGCCACCTGCCTGCAATGTAACGTCACATTCGACTCGCACAGAGTGTACTTGAAGCACGAACGCAGCGCGCACACTCGCAGGCCTGAGACTGTATACAAGTGCTATTTCTGCAAGGACGGCTTCGACAGCATAGAGGCGCTGCGGAAACACAAACTGGAGTTCAACCACCAGAGGGCGCCCAAACAAGTCATACACAGGACATCTATATCTACGAAGCAAGGCGCTCGCCGCACGCGACGGACCAGGGTCAGTGACCCCGACAAGCCGTACCGGAAGTGTCCTGTTTGTCTGCTGGAGATCCCCAACACTTTCCCTACGAGTCTCTACATGCACTACCAGGAGGCCCATCCAGACGTGCCCTACCACTCCGTGTACACGGACTCCAAGGACTATCTCTGTGAGCAGTGCGGGAAGACTTTCCGGGTGTACTGCCATTACCTGGATCACAAGAACCAGCATTCG
GGTAGCAAACCCTACGCCTGCAAGGAATGCGGCGCGGCCTTCGCGACGCGTCGCCACCTCGTGTACCACTCCGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCACCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCC
GCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGTATGTATCCTTATTGCAGGCGAGTATGTATCCTACACTAGAAGACACCCGGTCTACCACTCGGAGAGACACCGCCCGCCGCGCTACCAGTGCAAGCACTGCGGCAAGCTGTTCCTGAGCGCGGCCAATCTGTTCGCACACAGACGAGGGCACATCGGCGACCGTCGCTTCAAATGCGACATTTGCTTCAAAGGGTTCCTACACTCGTACACTCTGAAGGAACACGTGAAAGGAGTCCACAACCAGATCCGACGGGTGCGCAAGGGACCGCCTAGGGACAAGTTTATACATTAG
- Protein Sequence
- MPRTVCRICLRTDVIVRDIDERLVDLHDEICDYDIYKNKQLKYCGICEHLLRKYHNFRERCSRAHKLLRIWHHNVEDIPENLHEQLPSPISICHNKARHQIYEIRYDLDPDRDSGVEVNKYDLSALETLPDLEGQIKTETDDKGSCLFVTKEIEYPQEIRFDFELKKNEIQDAVELQNDEDISSMEDDIPLSNLKEENIDFSDIEEFKTKSKRGKATKEKAKSKKKTEVKYEDVDDISLAELEKLGSDDKMIKARQKFLKNKARLKDRILELLDTGQFNDLGDDVKMSKEIFVKIFSRYDKCDMRMLSMKSDSPELKEKLRRFLLYLLRKYNPEYETVFHVEYLDTDEKKLQEFNRRKHGAAYLEAPFKCELCYKGFDAQSHLDYHNKWHKPTPGDIVCNICPMRFVSKATCKAHRKNNHTVVYSCRQCAVITYHFSVMINHYNQHKGVEYHCPTCLSVYTNKTTLNNHRREAHATAHICTACARSFVSEKGLFFHRKYKHKDEAEANPEEAATCLQCNVTFDSHRVYLKHERSAHTRRPETVYKCYFCKDGFDSIEALRKHKLEFNHQRAPKQVIHRTSISTKQGARRTRRTRVSDPDKPYRKCPVCLLEIPNTFPTSLYMHYQEAHPDVPYHSVYTDSKDYLCEQCGKTFRVYCHYLDHKNQHSGSKPYACKECGAAFATRRHLVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHHPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRVCILIAGEYVSYTRRHPVYHSERHRPPRYQCKHCGKLFLSAANLFAHRRGHIGDRRFKCDICFKGFLHSYTLKEHVKGVHNQIRRVRKGPPRDKFIH
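
The coding and protein sequences above can be checked against each other by translating the coding sequence codon by codon. The following is a minimal sketch that assumes the standard genetic code, which the record does not state explicitly; `translate` is a hypothetical helper written without external libraries. It is exercised here only on the first nine codons and the last four codons of the coding sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the record's protein was derived with this table.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters other than A/C/G/T become 'X';
    a trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last four codons of the coding sequence above.
print(translate("ATGCCAAGAACAGTATGCCGTATCTGT"))
print(translate("TTTATACATTAG"))
```

To compare the full record, the wrapped lines of the coding sequence would first need to be joined into one string before being passed to `translate`.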
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -