Bcha001562.1
Basic Information
- Insect
- Beris chalybata
- Gene Symbol
- ZBTB41
- Assembly
- GCA_949128065.1
- Location
- OX421894.1:25853088-25875142[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 47 0.00011 0.014 17.0 0.9 1 23 82 104 82 104 0.99 2 47 0.00058 0.077 14.7 3.3 1 23 110 133 110 133 0.98 3 47 9.2e-05 0.012 17.2 1.7 1 23 141 163 141 163 0.99 4 47 0.0098 1.3 10.8 2.7 1 23 169 192 169 192 0.96 5 47 0.00041 0.055 15.2 1.3 1 23 201 223 201 223 0.98 6 47 0.0002 0.026 16.2 0.6 1 23 229 252 229 252 0.98 7 47 6.7e-05 0.009 17.6 2.1 1 23 259 281 259 281 0.99 8 47 0.11 15 7.5 1.5 1 23 287 310 287 310 0.93 9 47 0.0012 0.16 13.7 1.5 1 23 320 342 320 342 0.97 10 47 0.0005 0.066 14.9 2.3 1 21 348 368 348 371 0.95 11 47 0.00016 0.021 16.5 2.0 1 23 382 404 382 404 0.99 12 47 0.0039 0.52 12.1 1.0 1 23 410 433 410 433 0.93 13 47 0.0021 0.28 12.9 3.2 1 23 443 465 443 465 0.98 14 47 0.00057 0.077 14.7 0.4 1 23 471 494 471 494 0.97 15 47 0.022 2.9 9.8 0.2 1 23 504 526 504 526 0.95 16 47 0.0001 0.014 17.0 2.4 2 23 771 792 770 792 0.97 17 47 0.71 95 5.0 5.2 1 23 798 821 798 821 0.96 18 47 0.02 2.7 9.8 1.8 1 23 831 853 831 853 0.96 19 47 0.00021 0.027 16.1 2.8 1 23 859 882 859 882 0.97 20 47 0.00015 0.02 16.6 6.8 1 23 900 922 900 922 0.99 21 47 0.00022 0.029 16.0 1.4 1 23 928 951 928 951 0.98 22 47 0.00036 0.048 15.4 3.6 1 23 961 983 961 983 0.99 23 47 0.0074 0.98 11.2 4.3 1 23 989 1012 989 1012 0.97 24 47 0.5 67 5.5 1.1 3 23 1024 1044 1022 1044 0.96 25 47 2.5e-05 0.0033 19.0 1.0 1 23 1050 1073 1050 1073 0.96 26 47 0.0077 1 11.2 5.9 1 23 1080 1102 1080 1102 0.99 27 47 0.002 0.27 13.0 0.3 1 23 1108 1131 1108 1131 0.97 28 47 4.7e-05 0.0063 18.1 0.7 1 23 1147 1169 1147 1169 0.98 29 47 0.011 1.4 10.7 1.0 1 23 1175 1198 1175 1198 0.97 30 47 0.00091 0.12 14.1 0.5 1 23 1208 1230 1208 1230 0.99 31 47 0.0018 0.24 13.1 0.5 1 23 1236 1259 1236 1259 0.97 32 47 0.0015 0.2 13.4 1.4 1 23 1269 1291 1269 1291 0.97 33 47 2.6e-05 0.0035 18.9 1.8 1 23 1297 1320 1297 1320 0.98 34 47 0.27 36 6.3 3.5 1 23 1330 1352 1330 1352 0.98 35 47 0.00051 0.068 14.9 1.4 1 23 1358 1381 1358 1381 0.97 36 47 0.001 0.14 13.9 1.9 1 23 1453 1475 1453 1475 0.98 37 47 0.087 12 7.8 3.6 1 23 1481 1504 1481 1504 0.97 38 47 0.0018 0.25 13.1 0.4 2 23 1518 1539 1517 1539 0.97 39 47 0.27 36 6.3 4.3 1 23 1545 1568 1545 1568 0.96 40 47 0.0007 0.093 14.5 2.2 2 23 1579 1600 1578 1600 0.97 41 47 3.1e-05 0.0042 18.7 0.7 1 23 1606 1629 1606 1629 0.98 42 47 8.6e-05 0.011 17.3 2.3 1 23 1639 1661 1639 1661 0.97 43 47 0.0003 0.04 15.6 3.1 1 23 1667 1690 1667 1690 0.97 44 47 2.7e-06 0.00035 22.1 1.0 1 23 1697 1719 1697 1719 0.99 45 47 0.00057 0.076 14.7 1.7 1 23 1725 1748 1725 1748 0.97 46 47 0.13 17 7.3 2.6 1 23 1758 1780 1758 1780 0.99 47 47 0.0002 0.026 16.2 1.2 1 23 1786 1809 1786 1809 0.98
Sequence Information
- Coding Sequence
- ATGGATAATATCAACGAATATTGGCATTATACTGAAGAGAATTTTGAACCAATTAAAACAGAGAGCGTAGAAAATAGCTTTACATCAAATACTGAACAACAAGAAGAGGATCAAATACCAGGTTCCAATCAAGGCCTTACGAAAAAAGAGATATTTCTCATTCAAGAAGTGAAGCAAGAACCCGAAATCGAAATAAACATGGAAGAAATAGACTTTGATGAAACAATAAAAAACACAAGAGATTTTAAATGTacgatatgttcgaaattattcgaAGAAAACTATAAATTGCTATGGCATATGAGAATTCATACGGGCGAAAAACCATTTCAGTGCGAAAAATGTCCGAAGAAATATAGACTGAAGAACTTTTTGACGACACATATGCAACGGACACATGCTGATATAACTACTCGGAAGTTTAAatgtttaacatgtttaaaGTTTTTCGAATATAAAAGTGAGTTGGATGCGCATATGAGAACTCATACAGGAGAAAAGTTCTTTCAGTGCGAACTATGTTCGAATAAATATCGCTTTAAAACCAGTTTGACGGCACATATGTACTCGAAACATGCCGATAAACAGGCGTGTACGACATTTTCTTGTGAGATATGTTCAAAAGTATGTGGACAAAAATCTAGATTGGATGAGCATATGAGAATCCATACAGGAGACAAACCATTCGAGTGCAAAGAGTGTCCGTTCAAATATCGGCTTAAAGTCTCTTTGACTGAACATATACGAACTGTACATCTCGGTGTAGGAGGAAAGTTTAAATGTGAGacatgttcgaaattattcacACGTAAGAACAGTTTGGATATGCATATGAGAATTCATACAGGTGAAAAACCGTATGAGTGCGAGTTATGTTCGACAAAGTACCGATCTAAGAACATTTTGCAGAAACATATTTTCGAAAAACatggcgatagaacaacgtctACTGCAAAATTTACTTGTACGATATGTTCTAAATTATTCAAAGAGAAAAATCAATTGGATCGTCACATAACATTACATACAGGCGAGAAACCATTCCAGTGCGAATTATGTCCGAAAAAATTCCGCTTAAATGGAAATTTGAAATCACATGTACGATGTGTCCATGACGGTATAAGAAGACCCATCGATAGAAAGTTTAAGTGTTCAACGTGCTCGAAACTATTCGAACGTGAAAATCGTTTAAATGCGCACATGAGAACTCATACCGATGAAAGGCCATTTCAATGCGAAGTATGTTCGAAAAAATATCGGTTTGAGATCAGTTTGACCGAACATATGTACGCGAAACATGCCGATAGAATGACGAATACTACACTTTTTACTTGTGagatatgttcgaaattataCCGGCAGAAATGTAAATTGGACGACCATATGAGAATTCATACTGGTGAAAAGCCGTTCCAATGTGAATTGTGTCCAATAAAATATCGATTAAATGGAAGCTTACAGAAACATATTCTAAGGAGTCATGCCGATATAATACCAAATAGTAAAAAGTTTGAATGTGCACCATGTTCCAAACTATTCGAAAGTAAAGCTCGTTTGAATGATCATATGAGAATTCATAGCGGTGAACTACCATTCCAGTGTAAAGTATCTCCTACTGATAGTCTTCCCGATAAGATTTGTTCCGATTGTctcaaaaaattacaacaatcGTATACTTTCAAGTTAGAATTCGATCAATCACAATCCATTTGGATGCAAATAAAGGACAAGAGAAAAGGTGTGAATTATCTCAAGGAAGATGGGATTTCTACCGAAGAAATTAAATCAGAAAATTGCTTTACATCAAAAATTGAACTCCAAGAGCTGGCTAGTATATCAAATTCCGAACAAGGCTTAACGAAAAAGGAGACATTAATCGTTCAAAAAGTCAAGCAAGAACCCGAAATTCATATAAACATAGAAGAAATATCTCCTGACAATAGTTTGCCCGATAGCATTTGTAGTGACTGtcttaaaaaaatacaacaatctTATGCTTTCAAGTTAAAATTCGATCGAGCACAAAGTAGTTTGAAGCATTTTAAGGACAGTGAAAACGATGAGAGCAATCTTAACGAATATGAGTATATTACTGAAGAGCGTATTGATCTAATCAAAATAGCAAAAATAGAACAATCCTCCTACATATCAAACAGTGAACAAGAAGAAGCGGCTATTATACCAGATTCCGAGAAGGACTTTACAACGCAAGAGATATCTTTTGTTCAACTTGTCAAGCAAGAGCCCGAAGTCGAGATAGACATGGAAGAAAAGGACTTTAACGAAGAAAGAACGAGCACAAACAAGCTTCAATGTGCGACATGTTCGAAATTTTTCCGAAATCGATATAGATTGGTTCAGCATATGAGAACTCATACAGGCGAGAAACCATTTCATTGTGAACTGTGTCCGAACAAATATCAATTCAAGGGAAGCTTGAAGGAACACATGCATAGGAAACATGTCGGTTTAACACCCAATAATAATAAGTTTGCATGTGTGTCATGTTCGGAATTGTTTCGACATAAATATAAATTGGATGAACACATGAGAATTCATACAGGCGAGAAGCCATTCAAGTGTGAACATTGTCCGAAAAAGTATCGACTTAAGGGAAGTTTAAAAGAACATATACAAAATACACATGTAACAAGTTCCAGTGCGGCACATAATGCAACAAGACCCAACACTAGAATGTACAAGTGCACGATATGTTCTAAACTATTCAAGCGTCGAGATCATATGCATCAGCATATGCGAATTCATACAGGTGAAAAACCATTTCAGTGCGAACTATGTCCGAAAAACTTTCGTATCAATCAGAGCTTTATGAAACATTTGAAAACGATACATGGCGATAAAACGCCGAATGCTAAACATTTTCGTTGTTCGACCTGCTTCAAATTATTCCGACAAAAACGTAAGTTGGTTGTACATATGAGGAGTCATGTAGGCGAAAAGCTATTTCATTGTCAAGTATGCATGAAGAAATATCGATTTAAACGAAATTTAACGGATCACATGCAAACGGCACATCCGGATAAAACAAAAAGTTCTACGCGAATTAATTGTGCGATTTGTTCGAAACTATTCCTTCATAAGTATACTTATGATCGGCATATGATAATTCATACGGGCGGAAAGCCATTCCAGTGCGAAGTGTGCCCGAAGAAATATCGATTTCGGAGCAATTTATTGCAACATATACAAGAGACACACACCGACGGatcttcaaaatttaaatgtacGACTTGTTCGAAATTATTTCATCAGAAATATAAATTGGACCGGCATGTGCGAATGCATACAGGCGAAAGGCCATTCCAGTGCGAACTATGCCCGAACAAATATCGACTTAAGGGAAGTTTAACGGCACATATTCAAATCAAACATTTACAAAGGACACAAATGCGACCGAAACATGATGATACTAGAAAATTTGAATGTGAGATATGTTCCAAATCGTTCCCACAAAAATACAGAATCGAGCAGCATATGAGACTTCATACAGGAGAAAAGCCCTTTCACTGCGAACTGTGTCCGAACAAGTTTCGACTTAAGGAAGGACTAACAGTACATTTACAAAAAGCACATGCCGATTCAATACCAAATCCTCGAAAGTTTCAATGTACAACATGTTCGAAAATATTTGAAGGTAAAGGTCGTTTGGATGAACATATGAGAATTCATACAGGAGAAAAACCGTTTCAATGTGAAGCATGtccgaaaaaatttcgaattaatCGCAGTTTAACGGAACATGTACAGAGAATACATACCGATATAACAGACAATAGTGTTCTGTTTAAATGTGATGTATGTTCGAAACTATTCGAACGTCAAAGGCGTTTGAATGAGCACATGACACTTCATACAGGTGAACTACCATTTCAGTGCGAAGTATGTTCGAAAAAATATCGACTACGATCAAGTTTGAGGGCACATATGAACAGGACACATGCCGATATAACAGCTtatactaaaaaatttaaatgtgagACTTGTTCGAAGCTATTTGAATGTCAGACTCGTTTGAGGGAGCATATTAGAATTCATACGGGAGAAAAGCCATATGAGTGTGAAGTTTGCTCGAAAACATTTCGAATTAATCGCAGTTTAACGGAACATAAGCAAAAGATGCATGCCGATAGAGAGTCCAAAGAAGCCACCTTGAAGCATTTAAAGGAGACTGAAAAAAGTGATAATAATCTCAACGAATATGGACATTATACTGAATGCTTTACGAAGCACGAGATACTTGTCGTTCAACAAGTCAAGCAGGAACCCGAAATCGAGATAAATATGGAGGGTGACTTTGAGGAAGACTTGAAGGACGACAAAACGAACACTGGAATGTTTGAATGTGTGACATGTTCACAAATATTCCGATATAAATCTAAATTCGATCAGCATATGAGAATTCATACAGGAGAGAGACCATTCCAATGTAAGATTTGTCTGAATAAATATCGATTTAAATGTGGTTTGAAGGAACATATGCTTAAGGCACATGGTGTTTCAAAACCCCAAAACAGTAAGAGCAATAAATTAACATGTGTGATATGTTCGCAAATATTCCAACAAAAATATAAACTAGACGAACATATGCGCGTTCATACCGGAGAAAACCCATTTCATTGCGAACTGTGTCCGAACAAATACCGATATAGGTGTGGTTTGAGAAAACATGTACAAAGTGCACATGCAGTTATAGCATCAGGTACTGGAACGTTAAAATGTTCGACATGTTCGCAATTATTCCGACATGAATACGAATTAAATGAACATATGAGAATTCATACGGGCAAAAAACCATTTCAGTGTGAAATATGCccgaaaaaatttcgaattaatCGCAGTTTGACTGATCATATCCAACGATTACATGCCGATATAAAATCGAATACTCTTAAGTTTAAATGTACGACATGTTCAAAACAGTTTCAACGTAACTCGCAATTGATTGACCATATGACGCTTCATACAGGAGAGCGTCCATATGAATGCGAAAAATGTCCGAAAAAATTTCGTCTTAAGTTCTGTTTGACTGAGCATATACGAAATATACATCTCGGTTTACGAAGAAAATTCCAATGCGAGATATGTTCGAAAATATTCAAACGTGCAAATCGTTTAAATCAGCATATGCGCGTGCATACAGGAGAAGAACCATTCGAGTGCGAAGTATGTTCGAAAAAATTTCGCCTTAGGGTAAGTTTCATATCACATATGAATAGGAAACATGCCGATTTAACAGGCtatactaaaaaatttaaatgtgagACTTGTTCGAAGCTATTCGAATGTCAAGCTCGTTTGAATGAGCATATCAGAATGCATACGGGAGAAAAACCATTTGAATGCGAAGTGTGTTCGAAAAGATTTCGAATAAAAAGCATTTTAACGCGACATATGCAAACGATGCATGCCGTTGCCGATAGTGGACCCAAGtag
- Protein Sequence
- MDNINEYWHYTEENFEPIKTESVENSFTSNTEQQEEDQIPGSNQGLTKKEIFLIQEVKQEPEIEINMEEIDFDETIKNTRDFKCTICSKLFEENYKLLWHMRIHTGEKPFQCEKCPKKYRLKNFLTTHMQRTHADITTRKFKCLTCLKFFEYKSELDAHMRTHTGEKFFQCELCSNKYRFKTSLTAHMYSKHADKQACTTFSCEICSKVCGQKSRLDEHMRIHTGDKPFECKECPFKYRLKVSLTEHIRTVHLGVGGKFKCETCSKLFTRKNSLDMHMRIHTGEKPYECELCSTKYRSKNILQKHIFEKHGDRTTSTAKFTCTICSKLFKEKNQLDRHITLHTGEKPFQCELCPKKFRLNGNLKSHVRCVHDGIRRPIDRKFKCSTCSKLFERENRLNAHMRTHTDERPFQCEVCSKKYRFEISLTEHMYAKHADRMTNTTLFTCEICSKLYRQKCKLDDHMRIHTGEKPFQCELCPIKYRLNGSLQKHILRSHADIIPNSKKFECAPCSKLFESKARLNDHMRIHSGELPFQCKVSPTDSLPDKICSDCLKKLQQSYTFKLEFDQSQSIWMQIKDKRKGVNYLKEDGISTEEIKSENCFTSKIELQELASISNSEQGLTKKETLIVQKVKQEPEIHINIEEISPDNSLPDSICSDCLKKIQQSYAFKLKFDRAQSSLKHFKDSENDESNLNEYEYITEERIDLIKIAKIEQSSYISNSEQEEAAIIPDSEKDFTTQEISFVQLVKQEPEVEIDMEEKDFNEERTSTNKLQCATCSKFFRNRYRLVQHMRTHTGEKPFHCELCPNKYQFKGSLKEHMHRKHVGLTPNNNKFACVSCSELFRHKYKLDEHMRIHTGEKPFKCEHCPKKYRLKGSLKEHIQNTHVTSSSAAHNATRPNTRMYKCTICSKLFKRRDHMHQHMRIHTGEKPFQCELCPKNFRINQSFMKHLKTIHGDKTPNAKHFRCSTCFKLFRQKRKLVVHMRSHVGEKLFHCQVCMKKYRFKRNLTDHMQTAHPDKTKSSTRINCAICSKLFLHKYTYDRHMIIHTGGKPFQCEVCPKKYRFRSNLLQHIQETHTDGSSKFKCTTCSKLFHQKYKLDRHVRMHTGERPFQCELCPNKYRLKGSLTAHIQIKHLQRTQMRPKHDDTRKFECEICSKSFPQKYRIEQHMRLHTGEKPFHCELCPNKFRLKEGLTVHLQKAHADSIPNPRKFQCTTCSKIFEGKGRLDEHMRIHTGEKPFQCEACPKKFRINRSLTEHVQRIHTDITDNSVLFKCDVCSKLFERQRRLNEHMTLHTGELPFQCEVCSKKYRLRSSLRAHMNRTHADITAYTKKFKCETCSKLFECQTRLREHIRIHTGEKPYECEVCSKTFRINRSLTEHKQKMHADRESKEATLKHLKETEKSDNNLNEYGHYTECFTKHEILVVQQVKQEPEIEINMEGDFEEDLKDDKTNTGMFECVTCSQIFRYKSKFDQHMRIHTGERPFQCKICLNKYRFKCGLKEHMLKAHGVSKPQNSKSNKLTCVICSQIFQQKYKLDEHMRVHTGENPFHCELCPNKYRYRCGLRKHVQSAHAVIASGTGTLKCSTCSQLFRHEYELNEHMRIHTGKKPFQCEICPKKFRINRSLTDHIQRLHADIKSNTLKFKCTTCSKQFQRNSQLIDHMTLHTGERPYECEKCPKKFRLKFCLTEHIRNIHLGLRRKFQCEICSKIFKRANRLNQHMRVHTGEEPFECEVCSKKFRLRVSFISHMNRKHADLTGYTKKFKCETCSKLFECQARLNEHIRMHTGEKPFECEVCSKRFRIKSILTRHMQTMHAVADSGPK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -