Basic Information

Insect
Bombyx mori
Gene Symbol
Zbtb41
Assembly
GCA_027497135.1
Location
CP114963.1:17641171-17646995[-]
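
The Location value packs the sequence accession, start, end, and strand into a single string. A minimal sketch of splitting it into fields is given here. The function name `parse_location` is hypothetical, and the span length assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

# Layout assumed from the Location value above: accession:start-end[strand]
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(location):
    """Split a location string into seqid, start, end, strand and span length."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    return {
        "seqid": match.group("seqid"),
        "start": start,
        "end": end,
        "strand": match.group("strand"),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }


locus = parse_location("CP114963.1:17641171-17646995[-]")
```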

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Its two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the following pattern: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
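
The pattern in the description can be turned into a regular expression for a quick scan of a protein sequence. The sketch below is illustrative only: it treats both the X and # positions as "any residue", which is a simplification, and the function name `find_c2h2_candidates` is hypothetical. It is not a substitute for the profile HMM search (PF00096) used for this record.

```python
import re

# Direct transcription of #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# Assumption: '#' positions are matched as any residue, like 'X'.
C2H2_PARTS = [
    ".",        # #
    ".",        # X
    "C",        # first zinc-coordinating Cys
    ".{1,5}",   # X(1-5)
    "C",        # second zinc-coordinating Cys
    ".{3}",     # X3
    ".",        # #
    ".{5}",     # X5
    ".",        # #
    ".{2}",     # X2
    "H",        # first zinc-coordinating His
    ".{3,6}",   # X(3-6)
    "[HC]",     # final position: His or Cys
]
C2H2_RE = re.compile("".join(C2H2_PARTS))


def find_c2h2_candidates(protein):
    """Return (start, end, matched_text) for non-overlapping pattern matches.

    Coordinates are 1-based and inclusive.
    """
    return [
        (m.start() + 1, m.end(), m.group(0))
        for m in C2H2_RE.finditer(protein)
    ]


# Short stretch taken from the protein sequence of this record.
example_hits = find_c2h2_candidates("YVCKYCDERFSASSRRSEHVRKAH")
```

Because `finditer` returns non-overlapping matches, closely packed fingers may be missed; a profile-based search is the more sensitive choice.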
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 21 3e-05 0.0017 18.7 0.6 2 21 158 177 157 178 0.93
2 21 1.2 69 4.2 2.0 1 23 211 234 211 234 0.92
3 21 0.014 0.76 10.4 1.0 2 23 280 302 279 302 0.94
4 21 0.0063 0.35 11.4 0.6 1 23 335 357 335 357 0.98
5 21 0.0051 0.28 11.7 0.8 1 23 363 386 363 386 0.96
6 21 0.0038 0.21 12.1 1.5 1 23 395 418 395 419 0.94
7 21 5.6e-05 0.0031 17.9 0.4 3 21 535 553 533 554 0.95
8 21 3.2e-05 0.0018 18.7 2.3 1 23 585 607 585 607 0.97
9 21 0.0003 0.017 15.6 5.3 1 23 611 633 611 633 0.96
10 21 1.5 83 4.0 0.5 6 21 640 655 639 656 0.92
11 21 0.056 3.1 8.4 1.1 1 23 666 689 666 689 0.90
12 21 1.9e-06 0.00011 22.5 1.2 1 23 695 717 695 717 0.98
13 21 0.093 5.2 7.8 3.9 1 23 724 746 724 746 0.98
14 21 0.00097 0.054 14.0 1.2 2 21 855 874 854 875 0.93
15 21 0.056 3.1 8.5 1.1 3 23 908 929 907 929 0.95
16 21 3e-05 0.0017 18.7 0.4 1 23 934 956 934 956 0.97
17 21 0.1 5.6 7.7 6.2 1 21 959 979 959 980 0.95
18 21 0.0037 0.21 12.2 0.4 1 23 992 1015 992 1015 0.94
19 21 1.7e-05 0.00097 19.5 1.3 1 23 1021 1043 1021 1043 0.99
20 21 0.2 11 6.7 1.5 1 23 1077 1100 1077 1100 0.94
21 21 0.01 0.57 10.8 0.6 1 23 1108 1131 1108 1131 0.95
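
Each row of the table above has thirteen whitespace-separated fields: domain number, total domains, conditional and independent E-values, score, bias, the HMM, alignment and envelope coordinate pairs, and accuracy. A minimal sketch of loading such rows and keeping only the better-supported domains follows. The names `DomainHit`, `parse_domain_row` and `significant_hits`, and the 0.01 cutoff, are assumptions chosen for illustration and are not thresholds used by this database.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# One converter per column, in the same order as the fields of DomainHit.
_CONVERTERS = (int, int, float, float, float, float,
               int, int, int, int, int, int, float)


def parse_domain_row(line):
    """Parse one whitespace-separated domain row into a DomainHit."""
    fields = line.split()
    if len(fields) != len(_CONVERTERS):
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(*(conv(f) for conv, f in zip(_CONVERTERS, fields)))


def significant_hits(hits, max_i_evalue=0.01):
    """Keep hits whose independent E-value is at or below the cutoff."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


# Three rows copied from the table above (domains 1, 2 and 12).
ROWS = [
    "1 21 3e-05 0.0017 18.7 0.6 2 21 158 177 157 178 0.93",
    "2 21 1.2 69 4.2 2.0 1 23 211 234 211 234 0.92",
    "12 21 1.9e-06 0.00011 22.5 1.2 1 23 695 717 695 717 0.98",
]
hits = [parse_domain_row(row) for row in ROWS]
kept = significant_hits(hits)
```

With these three rows, domains 1 and 12 pass the illustrative cutoff and domain 2 does not.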

Sequence Information

Coding Sequence
ATGACAAGACAAGTGGACGTGAAAGCATTAATTTCCCATCTAGTCAGAGGGGATGGCGCCGAAAAATGTCGTATTTGTATGGGTGAAACAACAGAGGGCCAAGTTTACCTAGGAGATACTGTAATGGTGGACGGAGAGAAGCCCGTGACCTTAGCTGAACTATTAGAACAAATCACGGGAGTCGAGGTAACAGTGGAAGATAACCTGCCCGACGTTCTATGCTCGGCCTGCTCATTCTCAGCATTATCCGCAGCAGAGTTCAGAAACTTCTGCCAGAAAGCCATAGAGCAATGGCATAACACAGTAAAATTACTGGAGGAGGTCACCACCTTGCATCCTGTAAACGCATCCAAAATTTATGCTGTCGTATCTGACAACGAAATATCCATCACCGAAGACTCATTAAGACAAAATAAAACAATCAAACATTCGCCTATTAAAAGTAAAGAATATGGTAAAAGCTTACAATGCTCATGTCCGAACTGTGGGAAGACATTTTCATATGCATCAGATTTGTATGATCATTTGAAAGGATCGACGGACTTGACTCGGGCTTGCTATGTGTGCGCAAGGATAATGTCTGAAGATGATCTGGTGGAGCATCTAAGAGAAAAACACAATAAAAAGCCATTCAACTGTAACAAATGTGCTGTTCTGCTCCGCTCATACAAACATTACAAGAAGCACATGGCAGACGCTCACAGGCCCGGTGTGTGCTGTGAGTTGAGCCGGGAAAGCCGGGAAATACATCAAACGAAGAATAAACTATCGGTTTTGGTTAAGAGTGATCAGAGGTCAGTCAAAGTTGGTCTCCGGGGCAGTAGGAACACTGAATGTATATGTGACTATTGTCAGAAGAGGTTTGCCGGAAAGAAATTTGTAGCGACTCACATTCAGATTGTGCATATGAAATCCACTCACCGCCCCTGCGTGTATTGTGGAAAATACCTCGCTGCTGCTCACATACCAGTCCACTTGAAGAGGCATGAAACATTGGAGACCTTCAAATGTGAACTATGCAATATTGTACTAAAATCGAAACTAGGCTATCAACAACATCTAAGACTACACTCCGGAGAGAAACCTTATGTGTGTAAATACTGCGATGAAAGATTCTCTGCTTCGTCAAGGCGGTCCGAGCATGTCCGCAAGGCACACAGACAGAGTGATACGGTCTTGAGACACTCATGTGCCGTCTGTCCTGCTAAGTTCAGGCTACCCTACCAGCTCAGGAAGCATGTTTCCTCCGTACATCACATAGACAAACCAGGGTCCTTTGATTGTGAAATTTGCAAAGTGGAGAATTCAACTCTATTACCAAACAAAATATGCAAGCACTGTAAAAGTAGTTTACACGATCTTACGCTCTTCTTAAAATTATGCATAGACTCATTCAAACGTTGGAGCACAACGGCCAATTATTTGGCCAACATTGGGCTCGAGAAAAACGCGGCTACTCTCTACGTTGTAGCCGCAAACGATTTTCGAACTTACCGCAGCAAGAGAAAGATTGAAAATCACTCGCAACTTGTAAAAGATTTGGGAGCCAAAATCAGTACGCGGTCGGCACGCATGAAAGGTCCAGAACGTTCCAGAATGGCTTGTCCCGAATGCGGGAAGTGTTTTAAGAACGCGGTGCGGTTTAATGCTCACATTAGGAATTTGAAAATAAAGTATTGCACGCAGTGCGGACGTTTAATGAACTTGAATTCTTACGGAGCTCACGCTGAAAATGATCACAACGCGAGGGTGTTCCGATGTAAGAAATGCCCGGAGGTGTTCGCGAGGTATTCGTATTTGGAAAAACACAAAACCAAGCACATCGGTGTACATTGCTGCGTAGAATGCAAACGTAGCTTTCGCAACGCAACGAGTCTGTGGACGCACTTACGGAAACACCGACCGGCGGTGTGTGCCTGCGGTAAAAAGTTCACGAACGGAATATGTTTTAGGAAACACGGAAAAACGTGCGTAAGTAACAGGGGCGCCGTTCGATACGT
CTGCGACTATTGCTCGAAAGAATACAGCGTCAGGACTACCCTCAAATTTCACATATTCAATGCGCACATGTCGCTAAAGAAATTCCAGTGCGATAAATGCGGAAAGGTTTTCGGTAATCGTTCTCACCTGGAGGAACATGGTAATTCTCACAACAGAGTCGCAGATAGATTCGTGTGCGTCCACTGCGACGCGAAGTTTAGCACGAGACGAGGCCACGAGAGGCACGCGAGGAAACACGATGTGGCGGACGAAGACGCCCTTCCACCCGGGGTCTGCAGGCATTGCTCCGAGGATACCATGGCCGCTTACAACTTCAGACAGCTGTGTGATACATCGAAAAAGCGTTGGTCCAGTGCCGCGGAGCTTCTATCGCGGATTCACGCGAACGCCGACAGTGGAACTCTCTTTTTCCTATACGACGACGCTATCTTGCTCCTGAAGGACAAAATTAGCACCACAGATACAAGAGCGGCCTCAGAATTGCTGAACTCGAAGTTTGACGAGGAAACAGAGCAGAAACCGCGCAAATACCAAAGATCGTATCAGCCGCCTCTAGAATGTTCTTGTCCCGAGTGCGGCAAGACCTTCCAGAACGTTCAGTATTTGAACTATCATTTGAAGAGTTCGTTGAATTGCGCGTGTAGAACCTGCGCTTTGGTCATGCAGAAGAGATTCATTCCGGAGCACATGAGGGTCGAGCATGGCGTCTCCGTGGCTCACTGTCGGATGTGTTATGCGGTTTTCGAAGACCAAGAGTCAGTTAAGCGTCACTTGCTATCGTCTCACGGCCCTAATTCGTTCTCTTGCAATACGTGCGGCACCGGCTTCAGCGGCCAGAGAGCCTTGCGCGCTCACATGTACTCGCACACGTTGTTCGACTGTAAGTCCTGCTCCAGGACATTTGAGAATCGTAAATGCTTCAAGCATCACCAGAGAGGATGCAAACGCGAGGTCTCGCCAATCGAATCCACGTTCATATGCGACTATTGTAAAATAGAATATAACAAAAAACCATCTCTGAAAGTGCACATAATACAGAAACATTTGAACGTGTTGCCGTATGTTTGTCAGAAGTGCGGGAAGCGGGCGTCGACCGTCGCGCATTTGCAGTCGCACCTCAGGACCCATAACCACGTGAGGAAGGTGTTTGAGTGTCACTGCGGTGCTAAAATGACCACGGAATTGGGTTATCGGCTCCACCAGAGGATACACTCTGGCGAGAAGCCCTACGAGTGCAAGAGGTGCGGCGAAAGGTTTTTGTCGTCTTCGAGGCGCTTAGACCACATCAAGAGGCGACACATGGGCACCCAGAACATGCCGCACGCGTGCGACAAGTGCCCCGCGAGATTTCTAAGACCCTGGGAACTGAAAAAGCATTACTCAACGATACATTTCGATTTTATTCAGATACCAGACAAAGCACAGGACGTGCCTATAAAGCGACGTTTCAAAAATAAAATATTGGACTAA
Protein Sequence
MTRQVDVKALISHLVRGDGAEKCRICMGETTEGQVYLGDTVMVDGEKPVTLAELLEQITGVEVTVEDNLPDVLCSACSFSALSAAEFRNFCQKAIEQWHNTVKLLEEVTTLHPVNASKIYAVVSDNEISITEDSLRQNKTIKHSPIKSKEYGKSLQCSCPNCGKTFSYASDLYDHLKGSTDLTRACYVCARIMSEDDLVEHLREKHNKKPFNCNKCAVLLRSYKHYKKHMADAHRPGVCCELSRESREIHQTKNKLSVLVKSDQRSVKVGLRGSRNTECICDYCQKRFAGKKFVATHIQIVHMKSTHRPCVYCGKYLAAAHIPVHLKRHETLETFKCELCNIVLKSKLGYQQHLRLHSGEKPYVCKYCDERFSASSRRSEHVRKAHRQSDTVLRHSCAVCPAKFRLPYQLRKHVSSVHHIDKPGSFDCEICKVENSTLLPNKICKHCKSSLHDLTLFLKLCIDSFKRWSTTANYLANIGLEKNAATLYVVAANDFRTYRSKRKIENHSQLVKDLGAKISTRSARMKGPERSRMACPECGKCFKNAVRFNAHIRNLKIKYCTQCGRLMNLNSYGAHAENDHNARVFRCKKCPEVFARYSYLEKHKTKHIGVHCCVECKRSFRNATSLWTHLRKHRPAVCACGKKFTNGICFRKHGKTCVSNRGAVRYVCDYCSKEYSVRTTLKFHIFNAHMSLKKFQCDKCGKVFGNRSHLEEHGNSHNRVADRFVCVHCDAKFSTRRGHERHARKHDVADEDALPPGVCRHCSEDTMAAYNFRQLCDTSKKRWSSAAELLSRIHANADSGTLFFLYDDAILLLKDKISTTDTRAASELLNSKFDEETEQKPRKYQRSYQPPLECSCPECGKTFQNVQYLNYHLKSSLNCACRTCALVMQKRFIPEHMRVEHGVSVAHCRMCYAVFEDQESVKRHLLSSHGPNSFSCNTCGTGFSGQRALRAHMYSHTLFDCKSCSRTFENRKCFKHHQRGCKREVSPIESTFICDYCKIEYNKKPSLKVHIIQKHLNVLPYVCQKCGKRASTVAHLQSHLRTHNHVRKVFECHCGAKMTTELGYRLHQRIHSGEKPYECKRCGERFLSSSRRLDHIKRRHMGTQNMPHACDKCPARFLRPWELKKHYSTIHFDFIQIPDKAQDVPIKRRFKNKILD
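
As a consistency check, the coding sequence can be translated and compared with the protein sequence. The sketch below assumes the standard genetic code and strips any whitespace or line breaks from the nucleotide string before translating. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence up to the first stop codon.

    Whitespace (including line breaks) is removed first. Codons that
    contain characters other than A, C, G, T are translated as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last four codons of the coding sequence above.
n_terminus = translate("ATGACAAGACAAGTGGACGTGAAAGCA")
c_terminus = translate("ATATTGGACTAA")
```

Translating the full coding sequence the same way and comparing the result with the protein sequence would flag any truncation or frame shift introduced during copying.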

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-