Ecor010282.1
Basic Information
- Insect
- Eupeodes corollae
- Gene Symbol
- -
- Assembly
- GCA_945859775.1
- Location
- CAMAOS010000114.1:2484405-2513898[+]
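The Location value packs a contig accession, a coordinate range, and a strand into a single string. A minimal Python sketch for splitting it apart follows. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers, not part of any database API, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Layout assumed from the record above: contig:start-end[strand]
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'contig:start-end[strand]' string into its fields."""
    m = _LOCATION_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m["contig"], int(m["start"]), int(m["end"]), m["strand"])


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CAMAOS010000114.1:2484405-2513898[+]")
print(loc.contig, loc.start, loc.end, loc.strand)
print(span_length(loc))
```

Under that assumption the gene spans 29494 bp of the contig, which is much longer than the coding sequence and would be consistent with introns in the gene model.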
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and two conserved histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
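| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 30 | 0.21 | 27 | 6.3 | 0.0 | 8 | 23 | 148 | 163 | 148 | 163 | 0.96 |
| 2 | 30 | 4.1e-06 | 0.00053 | 21.1 | 1.3 | 1 | 23 | 169 | 191 | 169 | 191 | 0.97 |
| 3 | 30 | 0.00057 | 0.073 | 14.4 | 0.3 | 1 | 23 | 197 | 219 | 197 | 219 | 0.97 |
| 4 | 30 | 0.00049 | 0.064 | 14.6 | 3.2 | 1 | 23 | 225 | 247 | 225 | 247 | 0.97 |
| 5 | 30 | 0.0017 | 0.22 | 12.9 | 4.1 | 3 | 23 | 255 | 275 | 253 | 275 | 0.98 |
| 6 | 30 | 0.00013 | 0.017 | 16.3 | 3.7 | 1 | 23 | 281 | 303 | 281 | 303 | 0.99 |
| 7 | 30 | 1 | 1.3e+02 | 4.2 | 1.7 | 1 | 21 | 411 | 431 | 411 | 432 | 0.93 |
| 8 | 30 | 0.00014 | 0.019 | 16.2 | 0.2 | 1 | 23 | 440 | 464 | 440 | 464 | 0.98 |
| 9 | 30 | 0.0026 | 0.34 | 12.3 | 0.1 | 2 | 23 | 466 | 488 | 465 | 488 | 0.89 |
| 10 | 30 | 0.00081 | 0.1 | 13.9 | 1.0 | 1 | 23 | 494 | 518 | 494 | 518 | 0.96 |
| 11 | 30 | 1.3e-05 | 0.0017 | 19.5 | 6.3 | 1 | 23 | 524 | 546 | 524 | 546 | 0.99 |
| 12 | 30 | 1.1e-06 | 0.00014 | 22.9 | 1.0 | 1 | 23 | 552 | 574 | 552 | 574 | 0.99 |
| 13 | 30 | 0.00014 | 0.018 | 16.3 | 0.1 | 1 | 23 | 580 | 602 | 580 | 602 | 0.98 |
| 14 | 30 | 0.0098 | 1.3 | 10.5 | 0.2 | 1 | 23 | 608 | 630 | 608 | 630 | 0.97 |
| 15 | 30 | 0.39 | 50 | 5.5 | 0.2 | 8 | 23 | 637 | 652 | 636 | 652 | 0.94 |
| 16 | 30 | 1e-05 | 0.0014 | 19.8 | 1.4 | 1 | 23 | 658 | 680 | 658 | 680 | 0.97 |
| 17 | 30 | 0.00093 | 0.12 | 13.7 | 0.2 | 1 | 23 | 686 | 708 | 686 | 708 | 0.97 |
| 18 | 30 | 0.00049 | 0.064 | 14.6 | 3.2 | 1 | 23 | 714 | 736 | 714 | 736 | 0.97 |
| 19 | 30 | 0.0017 | 0.22 | 12.9 | 4.1 | 3 | 23 | 744 | 764 | 742 | 764 | 0.98 |
| 20 | 30 | 0.00013 | 0.017 | 16.3 | 3.7 | 1 | 23 | 770 | 792 | 770 | 792 | 0.99 |
| 21 | 30 | 1 | 1.3e+02 | 4.2 | 1.7 | 1 | 21 | 900 | 920 | 900 | 921 | 0.93 |
| 22 | 30 | 0.00014 | 0.019 | 16.2 | 0.2 | 1 | 23 | 929 | 953 | 929 | 953 | 0.98 |
| 23 | 30 | 0.0026 | 0.34 | 12.3 | 0.1 | 2 | 23 | 955 | 977 | 954 | 977 | 0.89 |
| 24 | 30 | 0.00081 | 0.1 | 13.9 | 1.0 | 1 | 23 | 983 | 1007 | 983 | 1007 | 0.96 |
| 25 | 30 | 1.3e-05 | 0.0017 | 19.5 | 6.3 | 1 | 23 | 1013 | 1035 | 1013 | 1035 | 0.99 |
| 26 | 30 | 1.1e-06 | 0.00014 | 22.9 | 1.0 | 1 | 23 | 1041 | 1063 | 1041 | 1063 | 0.99 |
| 27 | 30 | 0.00014 | 0.018 | 16.3 | 0.1 | 1 | 23 | 1069 | 1091 | 1069 | 1091 | 0.98 |
| 28 | 30 | 0.0098 | 1.3 | 10.5 | 0.2 | 1 | 23 | 1097 | 1119 | 1097 | 1119 | 0.97 |
| 29 | 30 | 0.00079 | 0.1 | 13.9 | 5.6 | 1 | 23 | 1125 | 1147 | 1125 | 1147 | 0.99 |
| 30 | 30 | 4.9e-05 | 0.0063 | 17.7 | 0.7 | 3 | 23 | 1155 | 1176 | 1153 | 1176 | 0.97 |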
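The residue pattern quoted in the Description can be approximated as a regular expression, which gives a quick way to eyeball candidate fingers in a protein sequence. The sketch below is illustrative only: `find_c2h2` is a hypothetical helper, the positions marked # are treated as "any residue" because the Description does not say which residues are allowed there, and a plain regex is much cruder than the PF00096 profile HMM behind the hmmscan hits, so its coordinates and hit count need not agree with the table above.

```python
import re

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# With every '#' and 'X' treated as a wildcard (an assumption), the stretch
# between the second Cys and the first His is 3 + 1 + 5 + 1 + 2 = 12 residues.
C2H2_RE = re.compile(r"..C.{1,5}C.{12}H.{3,6}[HC]")


def find_c2h2(protein: str):
    """Return (start, end, motif) tuples, 1-based inclusive, for regex matches.

    re.finditer reports non-overlapping matches only, so closely packed
    fingers may be under-counted.
    """
    seq = "".join(protein.split()).upper()
    return [(m.start() + 1, m.end(), m.group()) for m in C2H2_RE.finditer(seq)]


# A 23-residue stretch copied from the protein sequence of this record.
finger = "FNCKLCGKNFRQKAALKTHLLNH"
print(find_c2h2(finger))
```

Running `find_c2h2` on the full protein sequence would give a rough list of candidate fingers to compare against the ali coord columns of the hmmscan output.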
Sequence Information
- Coding Sequence
- ATGGCAATGAAAATTAAAACGGAAAATAATGATGTCCATGAAAACGATTGTGGTATGGAATATAGAACTTTTATGGCACAAAATTGTGTAGAGGAAAACGAGAATGATTTAATCGATAATTCTCAATCTAATGATTATGATAGAAATTATCTGATGCATTCCACTGAAAATAGATCTCAAACAAATGAAGTGCAAAGCGGGCAAATTACTCCTGGACCTGATGACAGTAGGGTGGAAATTTTTTCGATAATGGAAATCGAGGACACAAATATCAAAAAAGAACCTTTAGAAAATCAAGGCTTGAAGAAGAACCAATGTTGGGTTTGCCAGAAAGGTCATAGTGGCATTGATGGAAATGAACGGGCTGATGAATTGGCCGGGCAAGGATTCGCCCTTCATAGTTCACTTGCGGGAAGCGTAAATATTCCTTTTAGAGCTATGAAGGTTTTTCGAGGTAAATGTCTCCTCGAGATACACATGCGCGTACACACTGGTGAAAAACCATTTAACTGCAAGTTATGTGGAAAAAACTTCCGACAGAAAGCTGCTCTCAAAACGCATTTATTAAATCACACGAAGATCAAACCATTTAAATGTGATCAATGTGAACGAGTTTTCGCCGCCCAAAAAGATCTCACATCTCATTCTCTGATTCATTCCTGTGAAAGACGATTCCATTGTGAAATATGCAATAGATCGTTTATATCTGAGAAAACATTAAAATATCACAGAGCTCGACACACCAGAGGATATCCGTTTGGATGTGATATTTGTGAGATGCGCTTTTGTTATATGCATCTTTTACGGCGCCACATGCTCACACACACCGGTGAGAAACCCTACAAATGTAAATATTGTGATAAAGGATTTCGGCAGAGACACGAATCTGTTGTTCATATGCAAACGCACCCCGAAGCCAATTCAGACAAAGATAAGCCATCACTGCAACAAGTACTTGAACTTATAAGCTCCTTGCCGAGAGAAGGAGATATGTGTGAGTCCAAAGATCAGCCACTAGAATCCTCTAATATAATCAATGGCTATCAATTCACTGAAACTGAATCGCCACCGATTTTTTCCATAGAAGAACTTAACCCAAATGATGAAGAAAAAGGACATGCGTCAACTGATAACGAAACAAATTTACACAACGAAGATATTAAAACTAATTTGGAGTTTTCAGAAATAAAAGTTAAACAGGAACCCATGACTATGGAGCTTGTTAAAAAATTCATTTGCAAATGTTGTGGGGCTCGCTTTGCATTGCAAAATACACTAAGTCATCATGTGCGACAAAATAAGTGTTCTCAAGAGCATTTTACATGTCCAAATACTAATTGTAATAGGGTGTTCGCCTCTCAGGATAAACTCGATGCACATGTTCTTACTCACACGTATTGTGATCTATGCGGACAATCGTTTAGTAGTAGCGAAGAAATCGAAGCACATAAAGCAGAACAACATAACTTGAAAGTAAAACATCCCTGTCCGCATCCTAATTGTAATAAAGCTTTTTATAAAGGCGGTCAACTCGAAAAGCACATAGAAACTCATAACATTAAAAAAGAATTCCAGTGCACACTTTGCGATAAGAGCTTTCATCGAAAATTCTATCTCACACAACACATGAACCGTCACACAAAAACGCGCCCTTTCAAATGTGAACAATGTTCAAAAGCGTTTTATAGTTCGGGAGAACTTCAAAGACATATTATGCGGCACACTGGTGATCGTCCGTTTCCATGTGATATATGCGAAAAAACATACCCTTTAGCCAGTGAACTGAGAGTTCACAAGCAATCACATTCGGGCGAAAGACCCTTTGCTTGTGAATTTTGTCCAATGCGATTTGGATTTGCAAACGTTCTAAGAAAACATCTTATAACGCACACTGGTGAAAGGTCATTTAAATTTTTTCGAGGTAAATGTCTCCTCGAGATACACATGCGCGTACACACTGGTGAAAAACCATTTAACTGCAAGTTATGTGGAAAATAC
TTCCGACAGAAAGCTGCTCTCAAAACGCATTTATTAAATCACACGAAGATCAAACCATTTAAATGTGATCAATGTGAACGAGCTTTCGCCGCCCAAAAAGATCTCACATCTCATTCTCTGATTCATTCCTGTGAAAGACGATTCCATTGTGAAATATGCAATAGATCGTTTATATCTGAGAAAACATTAAAATATCACAGAGCTCGACACACCAGAGGATATCCGTTTGGATGTGATATTTGTGAGATGCGCTTTTGTTATATGCATCTTTTACGGCGCCACATGCTCACACACACCGGTGAGAAACCCTACAAATGTAAATATTGTGATAAAGGATTTCGGCAGAGACACGAATCTGTTGTTCATATGCAAACGCACCCCGAAGCCAATTCAGACAAAGATAAGCCATCACTGCAACAAGTACTTGAACTTATAAGCTCCTTGCCGAGAGAAGGAGATATGTGTGAGTCCAAAGATCAGCCACTAGAATCCTCTAATATAATCAATGGCTATCAATTCACTGAAACTGAATCGCCACCGATTTTTTCCATAGAAGAACTTAACCCAAATGATGAAGAAAAAGGACATGCGTCAACTGATAACGAAACAAATTTACACAACGAAGATATTAAAACTAATTTGGAGTTTTCAGAAATAAAAGTTAAACAGGAACCCATGACTATGGAGCTTGTTAAAAAATTCATTTGCAAATGTTGTGGGGCTCGCTTTGCATTGCAAAATACACTAAGTCATCATGTGCGACAAAATAAGTGTTCTCAAGAGCATTTTACATGTCCAAATACTAATTGTAATAGGGTGTTCGCCTCTCAGGATAAACTCGATGCACATGTTCTTACTCACACGTATTGTGATCTATGCGGACAATCGTTTAGTAGTAGCGAAGAAATCGAAGCACATAAAGCAGAACAACATAACTTGAAAGTAAAACATCCCTGTCCGCATCCTAATTGTAATAAAGCTTTTTATAAAGGAGGTCAACTCGAAAAGCACATAGAAACTCATAACATTAAAAAAGAATTCCAGTGCACACTTTGCGATAAGAGCTTTCATCGAAAATTCTATCTCACACAACACATGAACCGTCACACAAAAACGCGCCCTTTCAAATGTGAACAATGTTCAAAAGCGTTTTATAGTTCGGGAGAACTTCAAAGACATATTATGCGGCACACTGGTGATCGTCCGTTTCCATGTGATATATGCGAAAAAACATACCCTTTAGCCAGTGAACTGAGAGTTCACAAGCAATCACATTCGGGCGAAAGACCCTTTGCTTGTGAATTTTGTCCAATGCGATTTGGATTTGCAAACGTTCTAAGAAAACATCTTATAACGCACACTGGTGAAAGGTCATTTAAATGTAACACTTGTGGCCGAGGGTTTGTGCATAAACAAAATTGTGAAGACCACATGAAAACACATTCGGGAGAGAAAGAATATGGCTGTGAAATCTGTGAAGCAAGATACTATACAAAAGATTCCTTAAGAAAACATATGCGAAAAAATCACAAAAATATTACTCTTGTTGAGGATAGTGCAATTGAAACTATGAATTAG
- Protein Sequence
- MAMKIKTENNDVHENDCGMEYRTFMAQNCVEENENDLIDNSQSNDYDRNYLMHSTENRSQTNEVQSGQITPGPDDSRVEIFSIMEIEDTNIKKEPLENQGLKKNQCWVCQKGHSGIDGNERADELAGQGFALHSSLAGSVNIPFRAMKVFRGKCLLEIHMRVHTGEKPFNCKLCGKNFRQKAALKTHLLNHTKIKPFKCDQCERVFAAQKDLTSHSLIHSCERRFHCEICNRSFISEKTLKYHRARHTRGYPFGCDICEMRFCYMHLLRRHMLTHTGEKPYKCKYCDKGFRQRHESVVHMQTHPEANSDKDKPSLQQVLELISSLPREGDMCESKDQPLESSNIINGYQFTETESPPIFSIEELNPNDEEKGHASTDNETNLHNEDIKTNLEFSEIKVKQEPMTMELVKKFICKCCGARFALQNTLSHHVRQNKCSQEHFTCPNTNCNRVFASQDKLDAHVLTHTYCDLCGQSFSSSEEIEAHKAEQHNLKVKHPCPHPNCNKAFYKGGQLEKHIETHNIKKEFQCTLCDKSFHRKFYLTQHMNRHTKTRPFKCEQCSKAFYSSGELQRHIMRHTGDRPFPCDICEKTYPLASELRVHKQSHSGERPFACEFCPMRFGFANVLRKHLITHTGERSFKFFRGKCLLEIHMRVHTGEKPFNCKLCGKYFRQKAALKTHLLNHTKIKPFKCDQCERAFAAQKDLTSHSLIHSCERRFHCEICNRSFISEKTLKYHRARHTRGYPFGCDICEMRFCYMHLLRRHMLTHTGEKPYKCKYCDKGFRQRHESVVHMQTHPEANSDKDKPSLQQVLELISSLPREGDMCESKDQPLESSNIINGYQFTETESPPIFSIEELNPNDEEKGHASTDNETNLHNEDIKTNLEFSEIKVKQEPMTMELVKKFICKCCGARFALQNTLSHHVRQNKCSQEHFTCPNTNCNRVFASQDKLDAHVLTHTYCDLCGQSFSSSEEIEAHKAEQHNLKVKHPCPHPNCNKAFYKGGQLEKHIETHNIKKEFQCTLCDKSFHRKFYLTQHMNRHTKTRPFKCEQCSKAFYSSGELQRHIMRHTGDRPFPCDICEKTYPLASELRVHKQSHSGERPFACEFCPMRFGFANVLRKHLITHTGERSFKCNTCGRGFVHKQNCEDHMKTHSGEKEYGCEICEARYYTKDSLRKHMRKNHKNITLVEDSAIETMN
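The protein sequence should be the translation of the coding sequence, and that relationship is easy to check. The following Python sketch assumes the standard nuclear genetic code (the record does not name a translation table) and uses a hypothetical `translate` helper. Whitespace is stripped first, so a sequence wrapped over several lines can be passed in unchanged.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1.

    Unknown codons (for example ones containing N) become 'X'; a trailing
    partial codon is ignored. With to_stop=True, translation ends at the
    first stop codon and the stop itself is not emitted.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First eight and last four codons of the coding sequence in this record.
print(translate("ATGGCAATGAAAATTAAAACGGAA"))
print(translate("ACTATGAATTAG"))
```

Comparing `translate(full_cds)` with the listed protein sequence is a cheap sanity check that the two fields were copied intact.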
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -