Ddia001872.1
Basic Information
- Insect
- Dasypogon diadema
- Gene Symbol
- grau_1
- Assembly
- GCA_006980735.1
- Location
- jcf7180002940047:1337-14617[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 27 0.0032 0.22 11.9 1.2 3 23 264 285 262 285 0.92 2 27 1.2e-05 0.00078 19.5 1.2 1 23 317 340 317 340 0.96 3 27 0.013 0.84 10.0 4.6 1 22 348 369 348 370 0.90 4 27 0.092 6.2 7.3 0.2 3 23 380 401 379 401 0.91 5 27 0.33 22 5.5 0.7 1 23 407 429 407 429 0.91 6 27 0.0026 0.17 12.1 0.3 2 23 442 464 442 464 0.96 7 27 0.00022 0.015 15.5 2.3 1 23 470 492 470 492 0.97 8 27 0.0022 0.15 12.4 6.8 1 23 498 521 498 521 0.94 9 27 0.11 7.6 7.0 0.4 2 23 842 864 841 864 0.96 10 27 0.00027 0.018 15.3 6.6 1 23 896 918 896 918 0.97 11 27 0.0015 0.098 12.9 0.9 1 23 926 948 926 948 0.98 12 27 1.4e-06 9.5e-05 22.4 0.4 1 23 956 979 956 979 0.96 13 27 0.12 7.9 6.9 1.3 2 23 986 1007 985 1007 0.95 14 27 0.036 2.5 8.5 0.8 2 23 1016 1037 1015 1037 0.97 15 27 0.0013 0.088 13.1 0.1 1 23 1044 1067 1044 1067 0.94 16 27 1.6e-06 0.0001 22.3 1.6 1 23 1073 1095 1073 1095 0.98 17 27 2.5e-05 0.0017 18.5 4.5 1 23 1101 1124 1101 1124 0.98 18 27 1.2 83 3.7 0.9 2 23 1426 1448 1425 1448 0.96 19 27 0.5 33 5.0 0.4 6 23 1457 1474 1457 1474 0.99 20 27 4.5e-06 0.0003 20.8 3.3 1 23 1480 1503 1480 1503 0.97 21 27 0.01 0.7 10.3 6.1 1 23 1511 1533 1511 1533 0.97 22 27 1e-05 0.0007 19.7 1.3 1 23 1541 1564 1541 1564 0.97 23 27 0.32 22 5.6 0.5 2 23 1571 1592 1570 1592 0.95 24 27 0.018 1.2 9.5 2.5 2 23 1601 1622 1600 1622 0.97 25 27 0.0006 0.04 14.2 0.1 3 23 1637 1658 1635 1658 0.95 26 27 3.1e-06 0.00021 21.4 2.3 1 23 1664 1686 1664 1686 0.98 27 27 0.00032 0.021 15.0 2.7 1 23 1692 1715 1692 1715 0.97
Sequence Information
- Coding Sequence
- ATGACTTGTGGCGTTTGCTTAAGAGAAGATGTGGAAGTTATAAATATATTTGGTGAAGAAGGAAAGAGATTAGAAATATCAGCAATCATCAGACAGCACTTCTGGTTTGAGCCGATCGAAGATGATACGATATTAAGTCGGATATGCTGTCCATGTTGGGAGCAAATTGCAAATTTTCACGTATTTTACACGAATGTTAAAAACGCTCACAAAAAGTTATCCGAACGTTATTACGCTAACTCCCAGTATACGGAAGGAAATTATCCCGATTATTACAACCCTGAAAGTAATTCGTATGTGAATCAGCCTGAATGCAGTTATGGCTACGAGAATTCCGGAGAAAGTAGTTCGAAAGTGTCAGATGATGCAAATATAACTAACGGGTCTGCGACATATGAAGAAGCCAATGGTGATCAAAATTACACGGGTGAGGAAGTATATAACGAAAGTTACTATCAAGATAATCAAGAGAATAGTCAGATAGCCAATGAGCCAATACAAGAGGATGAATATTTACTAGAAAAAAGTAACTATGTACCTCCAGAAGATGTGGAGGAGAGTAATCCGTATGATGTTGGAGATAGTTGGAACAGTGAAAATACAAAACCGCGTTTAAAAAAGAGTACAAACACCGGTGGAAGTAGTAATGAAGTTGTTGAAAGTAAATTTATTGACAGAGACAGTTTAAGTTTTAAGAAAGATCCGAGGAATAAAGGAGTTAACAGCAGGAAAGATGAAAAAAAGATAGAGGAAGAAGATGCTCTCATTGCAAAATATATTAAACTCGATTGTGAAATTTGTAGTGAGAGTTTTACCAAGTTTTTCGAATTAACAAAACATTATCAATTTACTCATAATATAAGCGGATATGCGAGATGTTGCGATAAAAAGCTTCCAAGTAGAGCATATGCTCTCGATCATATAAATCAACATTTGAATCCAGAAAGTTATAGGTGTAAGGAATGTAACAAGACATTTTCGTCACAATACGCAATTAAAAATCATTTGGCTATTAAACACGCCCCTGAAACCGATAAAATATTCGTGTGTGAATTTTGTCCAGAGAAGTTCACTAATCAACATGCTTTAAATCAACATTGCACTCGTCATATTTCAGAGGAAGAAAAACGGGTTTATTGCATCGATTGCCGAAAAACATTTCCGACTGCCCTTGCTCTGCAAATGCATAAAAATGTTTATCATGGAAAATTACAAGGGCATATATGCCATGTATGCGGTAAAATCGTACGAGGCAAGGCTATATTTGCAAAGCATGTAAATGAACATGCCGCGGAACTGGGTCAGGAGACAATAAATGATCCGGTTTGCCCAATTTGTGGCAAGTTATCAGCATCCATAAAGACATTGCGTCATCACATTAGATATGTTCATTCCTCTCATCGCCCTCATAAATGCACTCTGTGTGGAAAAGCTTTTAAAATTCGTCAAAAATTAAAGGAACACATGGGCTCGCACGCTGTAGGAAATCCGTTTCCATGTCCTTATTGCCATAAACGTTTTAGAAGTAAGCTACATTTGTATTCTCATAAGTACCGCATGCATAAAAGCGAATATGAAGAAGAAAAGCAAATTCGATTTAATATGATGACCGAAACATTCAACAGCATGGAAAATTATGAGAATACGTCGATGAGTGAAAATGATACCCAACAATATGATAGCAGTATTGTATTAAGCGAAGACAAAATGATTTGCCGAATGTGTTTGGTTGATTCCTCAGAGTATATAAACATATTTGACACTGAGGGCATGGAGCTAAACATTGCTCTAATCATTACAAAACATTTTTGGTTCCAGCCAAAAAAGGACGATCCTGTTTCAAATATGGTATGCAATGCTTGCTGGTGGAAAATCGACAGCTTTCATACATATTACACGAGTATTGAAGAAGCTCACAGGAAGTTATCTGAGCGATTTTCGGTAAAACACGAACCCTCTTTTGTTTATTCGTTTGAAGACAGCGATATCAAACGTTTGTATGACGAAAATGTAAAAAAGGAAGAAAAATTTGCTGTTGAGGTCCAAGTCGTTGTTCAAGAAGAATTAAGTGAGGGTTTGGATCAGCGTGCGAGTGAAAATGGGGATGATTTTTCAGTGCATTCAGCAGAAGTTGAATATGTAGATGTAACTGCAGCAACTGATGAAATAGCCTCCTCGGTAGAAGCTGATGAAGTATCTACGAGCGAGATCCAAGCAGATCAAATAGCACTTGAAGACGGCACTGTTGTACAGGAAACACAAGGAGAACAAATTGACCCAACACAAAGTGAAAGGAATAAAAGTAATGAGGAGATCGCGGAAAGGCAATTTGCTGTTTTCGAAAAACGCAAAAAAAGAGGCAGACCTAAAAGGGTGTCCACAGAACCATGCAAAGAAAGTGATAACAATTTATCCCAAGATGAGCTAAGTACGGCAAACAAAAGAGATAAACCAGAAAAAGATTCAGATTACCTTAAGAAGTCAAAAGAAAACGACGAGAAAATAGCGAAACATATGAAGCTGAAATGCGAACTTTGTGTTTCGGAATATAGCACTTTCGCGGAAGTAAAGAAGCATTATCGCACCGTTCACAAACGAAAAGGTTATGTGGTATGCTGCAATAAGAAACTTTACAAACGCGTACTTATTCTAGATCATATAAATAAACATTTAAATCCAGACTACTTTAAGTGTGAAACTTGTAAAAAGAGATTTTCTGACAAGCAGTGCTTAAAAAATCACATGTTTCTACATGAATCGGAAGAAGTGAAGATATTTAAGTGTGATCAATGCCCCCGCAGATATGCTAAGCAATATTTATTAGATAACCACAAATTAATACATGTTCCGCAAGAGCAACGGGAATTCTTTTGTAATGAATGCGGAAAAGCATTTCCAACAAATACTTTATTACAGACGCATATTAGATACGTTCACGAGAATGCTTATGGGAAGATGTGCCATATATGTGCAAAAGTGATTCGAGGCAAAACAATGTTTGAAAAGCATCAGCTAGAGCACGCGGGAATAGTTGAACCCAAAGTCCAATGCGGGAAATGTGGGGCGTGGCTTAAACACGAATACAGTTTAAAGAAACATATGGCGAGACATGCAGAAGAGACGGACTCACACATTTGTAACGTATGTGGTAAAGTGGCTCCATCGAAAGGAGCGTTACGAAGCCACGTCCAATATGTTCATGCTTCAAAACGGACTCATCAGTGCAGTGTATGTGATAAAGCCTTTAAAAAGCCAATTAATCTAAAAGAGCATATGACGACACATACCGGCGAGGTTCTGTATACATGTCCTCATTGCCCAAAAACTTTTAATTCTAGCGCTAATATGCATTCGCATAGGAAGAAAGTGCACCGGAAAGAATGGGAGGAAAGCCGTAGGAATAGAGGTCTACCTCAAACAAGTTACAAAAGTAGTGAAGAGCAAGATGAGGTTGAAGAGCAACAGACTGTGGTATTTACAAATGAACAGGAAGATGTACAAGACGGAATTAGTTCAACGTACAAAAATAAGGAAAATACTGATAACAAGATGATTTGCCGGATGTGTTTGCTACCTGAAAAAGACCAAATAAATATATTTGGGAGTAACGAAAATGAAACCTGCATTGCCACGGTTATTAGGAAACACTTCTGGTTTGAGCCCAAGGTAGACGATCCAGTATCAAATTATATTTGTAATAAATGTTGGAGTCACATTGCATCATTTCATACCTATTATTGTTGCGTTGAAGAAGCACAAAGAGGATTGTCTGAGAGACATATTATCAAGGAAATTATTGCCGATCCACGGGACAATCAAGATGAACCGTTTGAAGAGCAAAAAGATTCCGTTCCCGAAGAAAAGCCTACGAACACTCTGAGGCGGAGTACACGTTTGTCGACTTATCTAGTAAATAAAACAAAAGCCAGCGAAAGGGATGCAATTATTGAAGACGAATCGACAGCTGAAGATACACATTATGTAGGAAGTAGCAAAGAGAATTTGTCGCCATCCACTAGAAAGCCGAAACGAAGAGGAAGACCAAGAAAAACACGAATCATACCACTACAAAATATTGCACGTAAAAGAGGTAGGTCTAGGAAGGCTTTAGCAGCACAACCATATCGGAAAATTATGAAAACTCAAAATTCGGATGGCTCAGAAGGGCGTAAAATCTTAATTAAAAGGGAAAAAGGAGTTAAACCCTCTGGATCTTCAGAATACGTTCGGAAGTCTAAAGAAACTGATGAAGCTATTGCTAAACACATGAAATTAAAATGCGAGCTTTGTCCTTCAGAGTGCGTAACTTTTGCAGAAATAAGAGTCCATTACCGCAGATTTCATAAACGTAAGGGTTACATTGTGTGTTGCAATAAAAAGTTCTACAGGCGAATTCTTATAATGGAACACATAAATAAACATTTGAATCCAGACTATTTCAGATGCGACAAGTGTGACAAGACCTTTTCTGATAGACAATGTTTGAGAAATCATATGATAGTTATGCACGACCCAGAAGAAAGGAAAATTCATAAGTGCAACCAGTGTCGACGGAGTTATTCAAAGCTGCACATGCTGGAGCGCCATAAGTTGATACATGTGCCTCAGGAAGAAAGGGACTTTTTCTGTGAACAATGTGGCAAAGCATTCCCCTCAAAAACTTTATTGCATTCACATATAAGATCCGTACATGATAATGCCTACGGTATAATGTGTCATTTATGCGCAAAAGTTCTTCAAGGGAAAACAATGCTTGCAAAACATCAACTGGAACATGCGGGCATAATTGAACCTAAGGTCCAATGCAAAGAATGCGGTTCGTGGTTGAAACATAAATATAGTTTAAAAAAGCATATGGAAAGGCATGAAGAACGGCGTGAGGAGCTGTCAAATCCACAAGAACAAATTTGCAATATTTGCGGCAAAGTGTCTCAATCCAAGGCAGGACTACGAAGTCATATAAAATATGTACACCAAGCAAAGAGGATTCATCAATGCAATATATGTGAAAAATCATTTAAAACTCCTTTAAGTTTGAAGGAGCATATGACCACACATACGGGAGAAGTATTATACATTTGCCCGCATTGCCCAAAAACTTTTAATTCTGGTGCGAATATGCATTCACACAGAAAAAAGGCTCACAGACAAGAATGGGAGGAAAGCAGAAAAAATCGACAACTAGCTAAAGGCCTTTTCAAGAATAGCATTAACGAGAATAGTGAAGGAACAGCTGCTACATGTGAAAATCAGTTGCAAGGAAGTGATGAGTTAGAAGACAACAAGGACGCAATTAATCTATGTGTAAATCAAGGGGAGGATAAAACAACCATAATTAAGAAAGAAATTCCAGATTTTTAA
- Protein Sequence
- MTCGVCLREDVEVINIFGEEGKRLEISAIIRQHFWFEPIEDDTILSRICCPCWEQIANFHVFYTNVKNAHKKLSERYYANSQYTEGNYPDYYNPESNSYVNQPECSYGYENSGESSSKVSDDANITNGSATYEEANGDQNYTGEEVYNESYYQDNQENSQIANEPIQEDEYLLEKSNYVPPEDVEESNPYDVGDSWNSENTKPRLKKSTNTGGSSNEVVESKFIDRDSLSFKKDPRNKGVNSRKDEKKIEEEDALIAKYIKLDCEICSESFTKFFELTKHYQFTHNISGYARCCDKKLPSRAYALDHINQHLNPESYRCKECNKTFSSQYAIKNHLAIKHAPETDKIFVCEFCPEKFTNQHALNQHCTRHISEEEKRVYCIDCRKTFPTALALQMHKNVYHGKLQGHICHVCGKIVRGKAIFAKHVNEHAAELGQETINDPVCPICGKLSASIKTLRHHIRYVHSSHRPHKCTLCGKAFKIRQKLKEHMGSHAVGNPFPCPYCHKRFRSKLHLYSHKYRMHKSEYEEEKQIRFNMMTETFNSMENYENTSMSENDTQQYDSSIVLSEDKMICRMCLVDSSEYINIFDTEGMELNIALIITKHFWFQPKKDDPVSNMVCNACWWKIDSFHTYYTSIEEAHRKLSERFSVKHEPSFVYSFEDSDIKRLYDENVKKEEKFAVEVQVVVQEELSEGLDQRASENGDDFSVHSAEVEYVDVTAATDEIASSVEADEVSTSEIQADQIALEDGTVVQETQGEQIDPTQSERNKSNEEIAERQFAVFEKRKKRGRPKRVSTEPCKESDNNLSQDELSTANKRDKPEKDSDYLKKSKENDEKIAKHMKLKCELCVSEYSTFAEVKKHYRTVHKRKGYVVCCNKKLYKRVLILDHINKHLNPDYFKCETCKKRFSDKQCLKNHMFLHESEEVKIFKCDQCPRRYAKQYLLDNHKLIHVPQEQREFFCNECGKAFPTNTLLQTHIRYVHENAYGKMCHICAKVIRGKTMFEKHQLEHAGIVEPKVQCGKCGAWLKHEYSLKKHMARHAEETDSHICNVCGKVAPSKGALRSHVQYVHASKRTHQCSVCDKAFKKPINLKEHMTTHTGEVLYTCPHCPKTFNSSANMHSHRKKVHRKEWEESRRNRGLPQTSYKSSEEQDEVEEQQTVVFTNEQEDVQDGISSTYKNKENTDNKMICRMCLLPEKDQINIFGSNENETCIATVIRKHFWFEPKVDDPVSNYICNKCWSHIASFHTYYCCVEEAQRGLSERHIIKEIIADPRDNQDEPFEEQKDSVPEEKPTNTLRRSTRLSTYLVNKTKASERDAIIEDESTAEDTHYVGSSKENLSPSTRKPKRRGRPRKTRIIPLQNIARKRGRSRKALAAQPYRKIMKTQNSDGSEGRKILIKREKGVKPSGSSEYVRKSKETDEAIAKHMKLKCELCPSECVTFAEIRVHYRRFHKRKGYIVCCNKKFYRRILIMEHINKHLNPDYFRCDKCDKTFSDRQCLRNHMIVMHDPEERKIHKCNQCRRSYSKLHMLERHKLIHVPQEERDFFCEQCGKAFPSKTLLHSHIRSVHDNAYGIMCHLCAKVLQGKTMLAKHQLEHAGIIEPKVQCKECGSWLKHKYSLKKHMERHEERREELSNPQEQICNICGKVSQSKAGLRSHIKYVHQAKRIHQCNICEKSFKTPLSLKEHMTTHTGEVLYICPHCPKTFNSGANMHSHRKKAHRQEWEESRKNRQLAKGLFKNSINENSEGTAATCENQLQGSDELEDNKDAINLCVNQGEDKTTIIKKEIPDF
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -