Xsex022080.1
Basic Information
- Insect
- Xestia sexstrigata
- Gene Symbol
- -
- Assembly
- GCA_941918865.2
- Location
- CALNXB020000462.1:635754-680468[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 36 6.5 5.4e+02 1.9 0.1 1 23 508 530 508 530 0.92 2 36 0.011 0.91 10.7 0.7 1 23 546 568 546 568 0.96 3 36 0.00032 0.026 15.5 4.0 1 23 574 596 574 596 0.99 4 36 0.00032 0.026 15.5 4.0 1 23 625 647 625 647 0.99 5 36 0.00032 0.026 15.5 4.0 1 23 676 698 676 698 0.99 6 36 0.00032 0.026 15.5 4.0 1 23 727 749 727 749 0.99 7 36 0.00032 0.026 15.5 4.0 1 23 778 800 778 800 0.99 8 36 0.00032 0.026 15.5 4.0 1 23 829 851 829 851 0.99 9 36 0.00032 0.026 15.5 4.0 1 23 880 902 880 902 0.99 10 36 0.00032 0.026 15.5 4.0 1 23 931 953 931 953 0.99 11 36 0.00032 0.026 15.5 4.0 1 23 982 1004 982 1004 0.99 12 36 0.00032 0.026 15.5 4.0 1 23 1033 1055 1033 1055 0.99 13 36 0.00032 0.026 15.5 4.0 1 23 1084 1106 1084 1106 0.99 14 36 0.00032 0.026 15.5 4.0 1 23 1135 1157 1135 1157 0.99 15 36 0.00032 0.026 15.5 4.0 1 23 1186 1208 1186 1208 0.99 16 36 0.00032 0.026 15.5 4.0 1 23 1237 1259 1237 1259 0.99 17 36 0.00032 0.026 15.5 4.0 1 23 1288 1310 1288 1310 0.99 18 36 0.00032 0.026 15.5 4.0 1 23 1339 1361 1339 1361 0.99 19 36 0.00032 0.026 15.5 4.0 1 23 1390 1412 1390 1412 0.99 20 36 0.00032 0.026 15.5 4.0 1 23 1441 1463 1441 1463 0.99 21 36 0.00032 0.026 15.5 4.0 1 23 1492 1514 1492 1514 0.99 22 36 0.00032 0.026 15.5 4.0 1 23 1543 1565 1543 1565 0.99 23 36 0.00032 0.026 15.5 4.0 1 23 1594 1616 1594 1616 0.99 24 36 0.00032 0.026 15.5 4.0 1 23 1645 1667 1645 1667 0.99 25 36 0.00032 0.026 15.5 4.0 1 23 1696 1718 1696 1718 0.99 26 36 0.00032 0.026 15.5 4.0 1 23 1747 1769 1747 1769 0.99 27 36 0.00032 0.026 15.5 4.0 1 23 1798 1820 1798 1820 0.99 28 36 4.2e-05 0.0034 18.3 2.6 1 23 1827 1849 1827 1849 0.99 29 36 5.5e-06 0.00045 21.0 0.4 1 23 1855 1877 1855 1877 0.93 30 36 1.8e-07 1.5e-05 25.7 0.1 1 23 1883 1905 1883 1905 0.98 31 36 1.3e-05 0.001 19.9 1.9 1 23 1911 1934 1911 1934 0.96 32 36 0.00025 0.02 15.8 2.6 1 23 1940 1963 1940 1963 0.96 33 36 0.00019 0.016 16.2 2.3 2 23 1970 1991 1969 1991 0.97 34 36 2.7e-05 0.0023 18.8 1.5 1 23 1997 2019 1997 2019 0.97 35 36 0.19 16 6.8 0.0 1 22 2025 2046 2025 2046 0.96 36 36 0.07 5.8 8.1 0.1 1 23 2061 2085 2061 2085 0.93
Sequence Information
- Coding Sequence
- ATGAACCAGTGTTGTTTCATCTGTAATAACAGGGCTAACTTTTCACTTAGAAATGCTTCAAGCCTGTTTGGAGAGCAAAATAAATTACAATCTGGTAAGAAAGTATCGGATGTGGTGAGTGAAATCGTAGGCAAACCGGTGGTAGAAAATAAAGTACATACCAGTATATTATGTAAGAAATGCTTCAAAACATGCTCTGAATATGATATCATTCAAGTTAGATTACAAACAATAAAAACGGAACTGTTAGAACAATTCAAACAATCATTACTTAATCATAATTTAGACTATGAAAACTATGAGAAAAAAGTTTGTTCACCGCCAAAACCCAAACCGATACTGGGGAAGAAGCTGGTGTTGCCAGCATCAAAATTACAACCAATACCACCGGATCTATTACTCAAAGTTGGGAAATTAGCATCATTTTCCAAATCTAATGTGATTTTCCCACAAATCAAGCCAGTCACTGCCTCGACATTAAACCTTAAAGTGACTGTCGGATCATCAGTCCTGACACAAACAATAAAAACTTCAACCAGTAAACCAATAGAGAAACCGATATTAGACTCAAATTCGATCCTCACTTCATTGACAGATGCTTTGAAAGATGACTTAGATAGCTTGGTAACGTCACAAAGTAATATGAACTCTATGAGTCCGAAGAATTCAATCCTGAGTTTCAATGTTAACTCTTTGCCTAAAGATTTTCTTTCTGGGGCTATTTTGAGAAAATTGGATGAAAATAATGATGTGTATGAAAAGGCGGATGAGACAAATGAAGATCATTCTATGGAAATTGATGAAGATTGCGTGTCTGTGGTTCCTGTGTCGTCGGCTAATGGAAACATGGTGCTACAAGTTGAAGGACTCCAGTCAAAATCTGACCCTACACCAGAAGTAAGCGAGTACTTAAACCTACCTTCACTTGACGGTGCTGATGAAGTGTCTGACGCACATAAGTATATCCTCGGCAAGCTGGAGATACTTAACGAGGGGGACGAAGATGATGATGACGATGATGATGATGAAGGACACACAATAGTCGTAGACAGCGAGAACGGCTCCATACTCCGCATGGTAGCTGGACAGAAGTTTATCTACGAGGGTGGAGAGATATCACTGATGATGCCTGATGATCAGCAGGACGGTGATGGTGATCAACAGGATAACGGTGATAGTCAAGATTCCAATGATGAATCTCAGATCGAGTTACAAGTTTCGGGCGACGAAGAAACTGCGAACGCCATTATTGCAGCCGCTCAAGAACAAGGAGGTGCCTTCATAAAAGTAGAATCAGGCGAGATGTTCCGCGTGAAGTCCGTGTCCAGCGCCGGTGCGGACAGAGCGGACAGAGCGGACAGAGCGGACAGAGCGGACAGAGCGGACGCGCCGCTACACATCGTGGCGCAGGACGGGGACATGTTCAAGTGTCTGCTGTGCGAGAAGAATGATGAAAACACACCACTTGAAGATGCAGATACAATGATGCGTCACTTAAAATCGCAGCACGACGCGAGGATATATCTGTGTCGCTTCTGTGGAGCCGTCATGAGGAAACGCACGGAATACGCCACGCATATAGCCGCTCACGCCAGCGCCCCGGCCGGCGGGACGGGCGGGGCGGGGCGCGGCTCGCACCGCTGCGGGGTGTGCGCCAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTAACTATACACTTGCACAAGAGCTACAGCTCGCGCACGCTACTGCTGGAGCACGCCAACCTGCACTCCGGCGCCAGGCCCTACGTCTGCACCCACTGCGGGAAGGCCTTCGCCTCCAAGTACACGCATCAGGCTCATCTGAAGACTCACGCGGTTCGCCCGCGGCCCTTCAAATGCACACAATGCGGCAAGTCGTTCCTCACACAACAAAATCTAAATCAGCATGAGAAAACACATTCGGGGGTCAAGGATTTCGTCTGTAATATATGCAACAAAGCCTTCAGCACACAACACAACTTAGAAGTTCACGGTGTGGTGCACTCCGGGAACAAGGCGTTTGTGTGCAGTGTGTGCGGGAAGGCGTTCGCGCGGAGAGCCGAGCTCCGGGACCACATGCGGATACACACCGGAGAGCGTCCATTCTCGTGCGACATCTGCGGCGCTCGCTTCACGCAGCGCTCCAACTTACACTCACATCGCCGCGCGACGCACTTCGACGATAAGCGGTATCACTGCGAACTGTGTCCCAAGAGGTTTAAGAGGAGAAGATTACTAGAATATCACATAAAGGCATCGCACACTGGTGAGCGCCCCCTCAAGTGTGGCATCTGTCATCTGTCGTTCGTGTACCCCGAGCATTACAAGAAGCACGTGCGCATACACAGCGGGGAGAGGCCTTACACTTGTGAGATCTGCGGCAAATCGTTCAACTCGCGCGACAACCGCAACACGCACCGCTTCGTGCACAGCGACAAGAAGCCGTACGAGTGCGTCGCGTGCGGCGCCGGCTACATGCGCAAGCAGCTGCTGTACGCGCACATGAACACTAGCGTACCGTACGTACACATACACATACACGACAAGAAGCCGTACGAGTGCGTCGCGTGCGGCGCCGGCTACATGCGCAAGCAGCTGCTGTACGCGCACATGAACACTAGCGGGCATTTAAACGGATCAATAGTCGTCAATCAACCTAGAGTTATCAAAGTTACTGAAATATTTGAAGCCAATGAATTGGAGGCGAGCGTAAAATCAGTTGATAAAACAACATTGAATCTTATACAAGAGGGCGATGAACCAACGTTACTAACTATACATAACATAGGGGAGAGAGCCGAGGGCACTATCCTAGAAGCTGTAGATGCCGATCAACTTGGAGATCAGACTGAGATCGTAGCAAACGACGAAAATGGCGGTATGGTGCGTCTCATACAAATTAAATTGCCGGACGGGAACAACGGGTGGGTCGCTATTAACCGCTGA
- Protein Sequence
- MNQCCFICNNRANFSLRNASSLFGEQNKLQSGKKVSDVVSEIVGKPVVENKVHTSILCKKCFKTCSEYDIIQVRLQTIKTELLEQFKQSLLNHNLDYENYEKKVCSPPKPKPILGKKLVLPASKLQPIPPDLLLKVGKLASFSKSNVIFPQIKPVTASTLNLKVTVGSSVLTQTIKTSTSKPIEKPILDSNSILTSLTDALKDDLDSLVTSQSNMNSMSPKNSILSFNVNSLPKDFLSGAILRKLDENNDVYEKADETNEDHSMEIDEDCVSVVPVSSANGNMVLQVEGLQSKSDPTPEVSEYLNLPSLDGADEVSDAHKYILGKLEILNEGDEDDDDDDDDEGHTIVVDSENGSILRMVAGQKFIYEGGEISLMMPDDQQDGDGDQQDNGDSQDSNDESQIELQVSGDEETANAIIAAAQEQGGAFIKVESGEMFRVKSVSSAGADRADRADRADRADRADAPLHIVAQDGDMFKCLLCEKNDENTPLEDADTMMRHLKSQHDARIYLCRFCGAVMRKRTEYATHIAAHASAPAGGTGGAGRGSHRCGVCAKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVTIHLHKSYSSRTLLLEHANLHSGARPYVCTHCGKAFASKYTHQAHLKTHAVRPRPFKCTQCGKSFLTQQNLNQHEKTHSGVKDFVCNICNKAFSTQHNLEVHGVVHSGNKAFVCSVCGKAFARRAELRDHMRIHTGERPFSCDICGARFTQRSNLHSHRRATHFDDKRYHCELCPKRFKRRRLLEYHIKASHTGERPLKCGICHLSFVYPEHYKKHVRIHSGERPYTCEICGKSFNSRDNRNTHRFVHSDKKPYECVACGAGYMRKQLLYAHMNTSVPYVHIHIHDKKPYECVACGAGYMRKQLLYAHMNTSGHLNGSIVVNQPRVIKVTEIFEANELEASVKSVDKTTLNLIQEGDEPTLLTIHNIGERAEGTILEAVDADQLGDQTEIVANDENGGMVRLIQIKLPDGNNGWVAINR
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -