Tbra014808.1
Basic Information
- Insect
- Trichogramma brassicae
- Gene Symbol
- -
- Assembly
- GCA_902806795.1
- Location
- CADCXV010001194.1:327094-351192[-]
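The Location field above appears to pack a contig accession, a coordinate range, and a strand into a single string. A minimal sketch for splitting it into parts follows. The `contig:start-end[strand]` layout is inferred from this one entry, and the `Location` type and `parse_location` helper are hypothetical names, not part of any database API. The coordinates are assumed to be 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for the parts of a Location string."""
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: <contig>:<start>-<end>[<strand>], strand being + or -.
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a Location string into contig, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("CADCXV010001194.1:327094-351192[-]")
# Genomic span, under the assumption of 1-based inclusive coordinates.
span = loc.end - loc.start + 1
```

Under that assumption the gene spans 24099 bases on the minus strand of the contig, which is considerably longer than the coding sequence itself.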
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Its two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers in brackets indicate the number of residues. The positions marked # are those important for the stable fold of the zinc finger, and the final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
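| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 39 | 7.4e-05 | 0.0032 | 17.8 | 3.6 | 1 | 23 | 70 | 93 | 70 | 93 | 0.97 |
| 2 | 39 | 0.00011 | 0.0047 | 17.2 | 0.5 | 1 | 23 | 99 | 122 | 99 | 122 | 0.96 |
| 3 | 39 | 0.12 | 5.3 | 7.6 | 0.6 | 1 | 17 | 128 | 144 | 128 | 145 | 0.92 |
| 4 | 39 | 7 | 3e+02 | 2.1 | 0.1 | 7 | 19 | 163 | 175 | 158 | 178 | 0.90 |
| 5 | 39 | 0.0034 | 0.14 | 12.6 | 5.7 | 1 | 23 | 181 | 204 | 181 | 204 | 0.97 |
| 6 | 39 | 1.3e-05 | 0.00053 | 20.2 | 1.3 | 1 | 23 | 210 | 233 | 210 | 233 | 0.97 |
| 7 | 39 | 0.00045 | 0.019 | 15.3 | 0.7 | 1 | 23 | 239 | 262 | 239 | 262 | 0.98 |
| 8 | 39 | 0.00041 | 0.018 | 15.4 | 1.9 | 1 | 23 | 363 | 386 | 363 | 386 | 0.95 |
| 9 | 39 | 0.0089 | 0.38 | 11.2 | 0.6 | 1 | 23 | 392 | 415 | 392 | 415 | 0.95 |
| 10 | 39 | 2.6e-05 | 0.0011 | 19.2 | 4.3 | 1 | 23 | 421 | 444 | 421 | 444 | 0.97 |
| 11 | 39 | 0.00078 | 0.033 | 14.6 | 0.6 | 1 | 23 | 450 | 473 | 450 | 473 | 0.96 |
| 12 | 39 | 0.0024 | 0.1 | 13.0 | 3.3 | 1 | 23 | 508 | 531 | 508 | 531 | 0.98 |
| 13 | 39 | 0.0038 | 0.16 | 12.4 | 6.3 | 1 | 23 | 537 | 560 | 537 | 560 | 0.97 |
| 14 | 39 | 0.0019 | 0.079 | 13.4 | 6.5 | 1 | 23 | 566 | 589 | 566 | 589 | 0.97 |
| 15 | 39 | 3e-05 | 0.0013 | 19.0 | 2.6 | 1 | 23 | 595 | 618 | 595 | 618 | 0.96 |
| 16 | 39 | 2.1e-05 | 0.00091 | 19.5 | 0.9 | 1 | 23 | 624 | 647 | 624 | 647 | 0.97 |
| 17 | 39 | 0.56 | 24 | 5.6 | 4.7 | 1 | 23 | 653 | 676 | 653 | 676 | 0.96 |
| 18 | 39 | 2.5e-05 | 0.0011 | 19.2 | 0.6 | 1 | 23 | 682 | 705 | 682 | 705 | 0.97 |
| 19 | 39 | 0.00036 | 0.015 | 15.6 | 2.0 | 1 | 23 | 711 | 734 | 711 | 734 | 0.97 |
| 20 | 39 | 0.00029 | 0.012 | 15.9 | 2.3 | 1 | 23 | 740 | 763 | 740 | 763 | 0.97 |
| 21 | 39 | 0.00031 | 0.013 | 15.8 | 0.5 | 1 | 23 | 769 | 792 | 769 | 792 | 0.96 |
| 22 | 39 | 0.034 | 1.4 | 9.4 | 0.6 | 1 | 23 | 798 | 821 | 798 | 821 | 0.97 |
| 23 | 39 | 2.5e-05 | 0.0011 | 19.2 | 2.6 | 1 | 23 | 827 | 850 | 827 | 850 | 0.97 |
| 24 | 39 | 0.00017 | 0.0073 | 16.6 | 0.8 | 1 | 23 | 856 | 879 | 856 | 879 | 0.97 |
| 25 | 39 | 2.3 | 1e+02 | 3.6 | 0.2 | 7 | 23 | 891 | 908 | 885 | 908 | 0.86 |
| 26 | 39 | 0.01 | 0.43 | 11.1 | 0.5 | 1 | 21 | 920 | 940 | 920 | 941 | 0.94 |
| 27 | 39 | 6.7e-05 | 0.0028 | 17.9 | 0.4 | 1 | 23 | 949 | 972 | 949 | 972 | 0.97 |
| 28 | 39 | 1.9e-05 | 0.00083 | 19.6 | 0.6 | 1 | 23 | 978 | 1001 | 978 | 1001 | 0.97 |
| 29 | 39 | 0.00011 | 0.0048 | 17.2 | 2.3 | 1 | 23 | 1007 | 1030 | 1007 | 1030 | 0.96 |
| 30 | 39 | 2.1e-05 | 0.00091 | 19.5 | 0.9 | 1 | 23 | 1036 | 1059 | 1036 | 1059 | 0.97 |
| 31 | 39 | 0.079 | 3.3 | 8.3 | 2.9 | 1 | 23 | 1065 | 1088 | 1065 | 1088 | 0.97 |
| 32 | 39 | 2.1e-05 | 0.00087 | 19.5 | 0.5 | 1 | 23 | 1094 | 1117 | 1094 | 1117 | 0.97 |
| 33 | 39 | 7.9e-05 | 0.0033 | 17.7 | 1.2 | 1 | 23 | 1123 | 1146 | 1123 | 1146 | 0.97 |
| 34 | 39 | 0.0034 | 0.15 | 12.5 | 2.1 | 1 | 23 | 1152 | 1175 | 1152 | 1175 | 0.96 |
| 35 | 39 | 0.00014 | 0.0059 | 16.9 | 1.3 | 1 | 23 | 1181 | 1204 | 1181 | 1204 | 0.97 |
| 36 | 39 | 0.089 | 3.8 | 8.1 | 0.3 | 1 | 23 | 1210 | 1233 | 1210 | 1233 | 0.96 |
| 37 | 39 | 2.5e-05 | 0.0011 | 19.2 | 2.6 | 1 | 23 | 1239 | 1262 | 1239 | 1262 | 0.97 |
| 38 | 39 | 0.0015 | 0.066 | 13.6 | 0.6 | 1 | 23 | 1268 | 1291 | 1268 | 1291 | 0.97 |
| 39 | 39 | 2.3 | 1e+02 | 3.6 | 0.2 | 7 | 23 | 1303 | 1320 | 1297 | 1320 | 0.86 |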
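The pattern given in the Description, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], can be translated fairly directly into a regular expression. The sketch below is one such translation, not the method used to produce the hmmscan output: it treats the # positions as "any residue", because the description does not say which residues are allowed there, and `find_c2h2` is a hypothetical helper name. Start offsets are 0-based.

```python
import re

# Direct transcription of the textual pattern. '#' and 'X' are both
# rendered as '.', which is an assumption: the description only says
# the '#' positions matter for the fold, not which residues occur there.
# The lookahead wrapper lets overlapping candidates be reported.
C2H2_PATTERN = re.compile(
    r"(?=("
    r".{2}C"       # '#', X, first Cys
    r".{1,5}C"     # X(1-5), second Cys
    r".{3}"        # X3
    r".{1}.{5}"    # '#', X5
    r".{1}.{2}H"   # '#', X2, first His
    r".{3,6}[HC]"  # X(3-6), final His or Cys
    r"))"
)


def find_c2h2(protein: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for each candidate finger."""
    return [
        (m.start(), m.group(1))
        for m in C2H2_PATTERN.finditer(protein.upper())
    ]


# A short fragment taken from the protein sequence of this entry.
fragment = "GFTCNKCERKFVNKSTLLKHKKIFHEV"
hits = find_c2h2(fragment)
```

Because a regular expression is much more permissive than a profile HMM, the candidates it reports on the full protein need not coincide with the 39 domains in the hmmscan table, and it provides no score or E-value.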
Sequence Information
- Coding Sequence
- ATGCGATTCTCAGGCACGATACTACACGATATACGTCACGACACTCGACGGctacagcagaagcagcagcagctcgggaGTCGGCTCTCTCGGCTACAAGGCCTAGCTGCTATGGCACAAAGGGAGCCTCGAGCCCGAGCCGAACTCGATGCGCGACACAGACGAATAAGCCTacaccaaatatcagtacacaaaggCCGCGAAGGTTtcacatgcaacaagtgcgagagaaaatttgtaaacaaatcaactttacttaaacataaaaaaatattccatgaagttttcaaaaattacttatgtgacatgtgcgcgaagagatttgatacTAAATATAGTTTGCTagtgcacataaagacagtacacgatggtcggaaagattttacatgcgataagtgtgagaaaaaatttgggataAAATCTAATTTGCTTAACTAcctaaagacagtacatgaaaatCGAAAAGACCTCGTATGCAACAAGTacggaaagaaatttggaaaaaaatcagactTGTTGAagcacgaaggtcgcaaagattacgcatgtggcaagtgcgagaagaaatttggtcaTAAACACCATTTACTCGgtcatcaaaagacagtacacgaaggtagGAAAGactatgcatgtgacaagtgcgagaagaaatttggaaaaaaatcagatttgttgaagcaccaaaaaacagtccatgagggtcgcaaagatttcacatgcgacaagtgcgaacagaaattcggacaaataTCGGttttgctcagacaccaaagaacagttcatgagggtcgcaaagattttCTCACGAACGCCAGTTTCGATAAACAACGCAATTACTCGACGAGCAACGGCTCGTCGAGCATTTGTACGCGAGGTCATCATCGGAGCTCGTCGAAGCAAGGAACTAAAATGGATAAAGGTGAGGagataattgaaaaagttgtctcaaaagataattttagTGAACACCAGATTGCCGTTTCGGACGACAAACAAAACGTTGGTGATATTGGCCAGGAGTCATTTACAgtagaatcaagtatgataaaacaccgagacacaattcaaaatgaaaaaaaagatttcgcatgcgacaaatgcgaagagaaatttgagtttcgaagtcacttgagtagacaccaaatatcagtgcACAAAGgctgcaaagattttgcatgtggcagatgcgaaaataaatttgtgaaaaaatcgagtttagttacgcatcaaaaattagtacacaaagatcacGAAGATTACTCCtgcaacaagtgtgagaggaaatttaaaaataaatcaactttactcaaacacaaaagaatattccacgaagttttcaaagattacttatgtgaaaagtgcgccaagagatttggaaataaacagagtttgcttttgcacataaagacagttcacgatggtcgcaaagattatacACTTGATAAGCGCAAGCGGAAGTTTGAACTGCAATCGAATTCGTTTCGGCACCGAAATGTAGTTCACGAAGTTTCCCAGAATTACGTATGCGAccattgcgagaagaaatttggatatataaatactttgctccatcatcaaaatacaatacacgatggtcgcaaggattacgcatgtgacaaatgccagaagaaatttgtatataaatcgcATTTACGTTGCCACCAAAatacaatacacgaaggtcgcaaagatttcgcttgtGAAAAATGCCAAAAGCAATATGTAcataaatcgcatttgctcaaCCACCATaatacagtacacgaaggtcgcaaagatttcgcttgtgacaaatgccagaagaaatttgttaataaatcgcatttgctttaCCACCAAAtaacaatacacgaaggtcgcaaagattatgcatgtgacaggtgcgacaagaaatttggacttaaaGGCAACTTGActaaacaccaaaagacagtccacgaaaatCGCAGAGATcacgcatgtgataaatgcgagcagaaatttgggcaCAAACAT
ATTTTGCTtgtacaccaaaagacagtacacgacggccgaaaagattacgcctgtgacaagtgtgagaagaaatttggggttcaaaataatttgataaaacaccaaaaaacagtccacgaaaatcgcaaagattacgcatgcaacaagtgcgagaaaaaatttggggttcAATGCAATTTGATcatgcaccaaaaaacagtgcacgaaggtcaGCAGAATTAcgcttgtgacaaatgtgagaagaaatttggacaaaaaacgcatttgctttatcatcaaaaaattgtacacgaaggtcggaaagattactcatgcgaccggtgcgagaaaaaatttggaaacaaAAGGGGTTTACTTAGACATCAAGAAATAGTCCAcaaaggccgcaaagatttcacatgtgacaaatgcgagaaaaaatttggagaaaattgGATTATGATTcgacatcaaaaaacagtacacgaaggtcaaaaagattacgcatgcgccaaatgcgagaaaaaatttacacaaaaacagCATTTACTCAGACACCAAgaaacaatccacgaaggtcgcaaagattttgcgtgcgacaagtgcgagaataaatttggaactaaatctgatttgttgaagcacctaAAGACTGTACATggaggtcgcaaagattacacatgcgacaagtatgagaaaaaatttggacaaaaagggaATTGGCTCtatcaccaaaagacagtccacgaagatcAgacagtacacaaaagtcagaaagattatgcatgtgacagatgcgagcagaaattcggacaaaaatcgaatttgctctaCCACAAAAAAGCAGTatacgaaggtcgcaaagattatgcatgcgactattgtgagaagaaatttggatataaatcgGTTCTGCTTTCACACATAAGAACAGTTCACGGAgatcggaaagatttcgcatgcgactaTTGTGAGAAGATATTTGGAAAGAAATCtaatttgctcttacaccaaaagacggttcatgaaggtcgcaaagatttcgcatgcgacaaatgccagaagaaatttgtacaTAAATCGCTTTTGCTTTACCACCAAAtaacaatacacgaaggtcgcaaagattatgcatgtgaccggtgcgacaagaaatttggacttaaaGGCAACTTGActaaacaccaaaagacagtccacgaaaatcgcaaagattacgcatgtgacaaatgcgagcagaaatttggacacaaacaTATTTTGCTTgtacatcaaaagacagtacacgaccgccgaaaagattacgcctgtgacaagtgtgagaaaaaatttggggaacgcagcaatttgatcaaacaccaaaaaatagtccacgaaaatcgcaaagattacgcatgcaacaagtgcgagaaaaaatttggggttcaattcaatttgattacgcaccaaaaaacaatgcaTGAAGGTCAgcaagattacgcatgtgtcagatgcgagaagaaatttggacgaagaACGCATTTGCTTTACCaccaaaaaattgtacacgaaggtcggaaagattactcatgcgaccggtgcgagaaaaaatttggaaacaaAAGGGATTTACTTAGACATCAAGAAAAAGTCCAcaaaggccgcaaagatttcgcatgtgacaaatgcgagaaaaaatttggagaaaattgGATTATGATTcgacatcaaaaaacagtacacgaaggtcaaaaagattacgcatgcgccaaatgcgagaaaaaatttacacaaaaacagCATTTACTCAGACACCAAgaaacaatccacgaaggtcgcaaagattttgcgtgcgacaagtgcgagaataaatttggaattaaatctaatttgttgaagcacctaAAGACTGTACATggaggtcgcaaagattacacatgcgacaagtatgagaaaaaatttggacaaaaagggaATTGGCTCtatcaccaaaagacagtccacgaagatcAGTAA
- Protein Sequence
- MRFSGTILHDIRHDTRRLQQKQQQLGSRLSRLQGLAAMAQREPRARAELDARHRRISLHQISVHKGREGFTCNKCERKFVNKSTLLKHKKIFHEVFKNYLCDMCAKRFDTKYSLLVHIKTVHDGRKDFTCDKCEKKFGIKSNLLNYLKTVHENRKDLVCNKYGKKFGKKSDLLKHEGRKDYACGKCEKKFGHKHHLLGHQKTVHEGRKDYACDKCEKKFGKKSDLLKHQKTVHEGRKDFTCDKCEQKFGQISVLLRHQRTVHEGRKDFLTNASFDKQRNYSTSNGSSSICTRGHHRSSSKQGTKMDKGEEIIEKVVSKDNFSEHQIAVSDDKQNVGDIGQESFTVESSMIKHRDTIQNEKKDFACDKCEEKFEFRSHLSRHQISVHKGCKDFACGRCENKFVKKSSLVTHQKLVHKDHEDYSCNKCERKFKNKSTLLKHKRIFHEVFKDYLCEKCAKRFGNKQSLLLHIKTVHDGRKDYTLDKRKRKFELQSNSFRHRNVVHEVSQNYVCDHCEKKFGYINTLLHHQNTIHDGRKDYACDKCQKKFVYKSHLRCHQNTIHEGRKDFACEKCQKQYVHKSHLLNHHNTVHEGRKDFACDKCQKKFVNKSHLLYHQITIHEGRKDYACDRCDKKFGLKGNLTKHQKTVHENRRDHACDKCEQKFGHKHILLVHQKTVHDGRKDYACDKCEKKFGVQNNLIKHQKTVHENRKDYACNKCEKKFGVQCNLIMHQKTVHEGQQNYACDKCEKKFGQKTHLLYHQKIVHEGRKDYSCDRCEKKFGNKRGLLRHQEIVHKGRKDFTCDKCEKKFGENWIMIRHQKTVHEGQKDYACAKCEKKFTQKQHLLRHQETIHEGRKDFACDKCENKFGTKSDLLKHLKTVHGGRKDYTCDKYEKKFGQKGNWLYHQKTVHEDQTVHKSQKDYACDRCEQKFGQKSNLLYHKKAVYEGRKDYACDYCEKKFGYKSVLLSHIRTVHGDRKDFACDYCEKIFGKKSNLLLHQKTVHEGRKDFACDKCQKKFVHKSLLLYHQITIHEGRKDYACDRCDKKFGLKGNLTKHQKTVHENRKDYACDKCEQKFGHKHILLVHQKTVHDRRKDYACDKCEKKFGERSNLIKHQKIVHENRKDYACNKCEKKFGVQFNLITHQKTMHEGQQDYACVRCEKKFGRRTHLLYHQKIVHEGRKDYSCDRCEKKFGNKRDLLRHQEKVHKGRKDFACDKCEKKFGENWIMIRHQKTVHEGQKDYACAKCEKKFTQKQHLLRHQETIHEGRKDFACDKCENKFGIKSNLLKHLKTVHGGRKDYTCDKYEKKFGQKGNWLYHQKTVHEDQ
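The coding sequence and the protein sequence above should correspond codon by codon. A minimal sketch for checking this with the standard genetic code follows. It assumes that the mixed upper and lower case in the coding sequence is only masking and carries no meaning for translation, and that the standard code applies to this nuclear gene. The `translate` helper is a hypothetical name, written here without external libraries.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in TCAG order for each of the three positions.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Case is ignored (assumed to be masking only). Codons containing
    characters other than A, C, G, T are rendered as 'X'. A trailing
    partial codon is dropped.
    """
    seq = cds.upper().replace("\n", "").replace(" ", "")
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# The first six codons of the coding sequence of this entry.
start_of_protein = translate("ATGCGATTCTCAGGCACG")
```

Applied to the first 18 bases of the coding sequence, this yields MRFSGT, which agrees with the start of the listed protein sequence. Running it over the full coding sequence and comparing against the full protein is the natural extension.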
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -