Basic Information

Gene Symbol
-
Assembly
GCA_013368075.1
Location
JABVZV010002131.1:2636187-2651181[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 43 0.043 0.6 10.8 0.2 1 21 60 80 60 81 0.95
2 43 0.00027 0.0038 17.7 1.5 1 23 174 196 174 196 0.99
3 43 0.062 0.87 10.3 0.9 1 23 201 223 201 223 0.98
4 43 0.058 0.82 10.4 3.0 1 23 228 250 228 250 0.98
5 43 0.12 1.8 9.3 1.3 1 23 255 277 255 277 0.98
6 43 0.00073 0.01 16.4 2.7 1 23 282 304 282 304 0.98
7 43 0.0023 0.032 14.8 2.2 1 23 309 331 309 331 0.99
8 43 0.34 4.8 8.0 1.4 1 23 336 358 336 358 0.98
9 43 0.00073 0.01 16.3 2.6 1 23 363 385 363 385 0.99
10 43 0.0014 0.02 15.4 1.4 1 23 390 412 390 412 0.98
11 43 0.0014 0.02 15.4 1.4 1 23 455 477 455 477 0.98
12 43 0.058 0.82 10.4 0.4 1 23 509 531 509 531 0.98
13 43 0.078 1.1 10.0 5.1 1 23 536 558 536 558 0.99
14 43 0.22 3.1 8.5 0.3 1 23 563 585 563 585 0.97
15 43 0.078 1.1 10.0 5.1 1 23 590 612 590 612 0.99
16 43 1.6 23 5.8 0.6 1 23 617 639 617 639 0.96
17 43 0.0015 0.021 15.4 4.6 1 23 644 666 644 666 0.99
18 43 0.03 0.43 11.3 0.7 1 23 671 693 671 693 0.98
19 43 0.41 5.8 7.7 0.1 1 21 725 745 725 746 0.94
20 43 0.0009 0.013 16.1 1.1 1 23 888 910 888 910 0.99
21 43 0.0054 0.077 13.6 0.4 1 23 915 937 915 937 0.98
22 43 0.0023 0.033 14.8 3.8 1 23 942 964 942 964 0.99
23 43 0.56 7.9 7.3 0.8 1 23 967 989 967 989 0.98
24 43 0.033 0.47 11.1 0.7 1 23 994 1016 994 1016 0.98
25 43 0.0058 0.082 13.5 1.5 1 23 1021 1043 1021 1043 0.99
26 43 0.82 12 6.7 1.8 1 23 1048 1070 1048 1070 0.97
27 43 0.00011 0.0016 18.9 0.7 1 23 1075 1097 1075 1097 0.99
28 43 0.2 2.8 8.7 0.5 1 23 1102 1124 1102 1124 0.98
29 43 0.023 0.32 11.7 1.1 1 23 1129 1151 1129 1151 0.99
30 43 6 85 4.0 3.0 1 23 1156 1178 1156 1178 0.93
31 43 0.039 0.56 10.9 0.3 1 23 1183 1205 1183 1205 0.98
32 43 0.019 0.27 11.9 1.1 1 23 1210 1232 1210 1232 0.98
33 43 2.2 31 5.4 4.0 1 23 1237 1259 1237 1259 0.94
34 43 0.0011 0.016 15.8 1.3 1 23 1264 1286 1264 1286 0.98
35 43 0.017 0.25 12.0 0.7 1 23 1318 1340 1318 1340 0.98
36 43 0.00036 0.0051 17.3 2.6 1 23 1345 1367 1345 1367 0.99
37 43 0.025 0.35 11.5 1.3 1 23 1372 1394 1372 1394 0.99
38 43 0.016 0.23 12.1 1.2 1 23 1424 1446 1424 1446 0.98
39 43 0.016 0.23 12.1 1.2 1 23 1451 1473 1451 1473 0.98
40 43 0.022 0.32 11.7 1.5 1 23 1478 1500 1478 1500 0.99
41 43 0.089 1.3 9.8 3.5 1 23 1505 1527 1505 1527 0.99
42 43 0.016 0.23 12.1 1.2 1 23 1532 1554 1532 1554 0.98
43 43 0.11 1.6 9.4 1.5 1 23 1559 1581 1559 1581 0.98

Sequence Information

Coding Sequence
atggataTTGATTCTAATGaaagttttgcaattaaatCGGAAGCGGTTTTAACGGAAACTTTTTCGTTTCGTGAAGAATACAGTGATTATGAGAATGCCCAATGGAAAACTGAATCGGTAAATTATGAAGAATCGTTTAAATGTAAGGTAGAAGAAGATCCTGTTTCATTGCAACAATATTCTTGCAATGAGTGTAATTTTATGACAACGGAAAAAGATTCTCTAATGgaacatttgaaaataactaaaaatgtggaatatttttgtaagaaatgtaACTTTCAAACCTCCATGGAATGTTCTACAAAAGACCATTTGAGGATTCACAATGAAGTATATATTAGATATTACCGATATGTTACTGAGGAATGCAATTTTAATATGCCATGGATTTTTCCATTAATGCCACAGATAAAGATTTCAAGAAATGGCGACCAATATGTGTGCAATGAATGGAATTATAGcacatttgtaaaaaattatctaaaaagtcatgtgaaaattcatgcaggtgatgaatataaatgtaaagaatgtgattataaaacagtgcggaaagatTATCTGagagaacatgtcaaaattcatacagctggtgaatacaaatgtaaagattgtgattataaaacagtgtggaaacgtAGTCTAAacgaacatgtcaaaattcatacaggtggcgaatataaatgtaaagaatgcgattataaaacagtgtggaaacatagtctaaacaaacatgtcaaaattcatacaggtgatgaatataaatgtaaagaatgtgattataaaacagtgtggaaaaataatatgaggaatcatgtcaaaattcatacaggtgatgaatataaatgtaaagaatgtgattataaaacagtgcggaaagatCATCTAATGGAACATGTGAaacttcatacaggtgataaatataaatgtaaagaatgtgattataaaacggtgcggaaatatcgtctaaaggaacacgtcaaaattcatacaggtgatgaatataaatgtaaagaatgtgattataaaacagtgtggaaatatcgtctaaaggaacatgtcaaaattcatacaggtgatgaatataaatgtaaagaatgtgattataaaacggtgcagAAAGtacatctaaaggaacatgttaaaattcattctggtgatgaatataaatgtaaagaatgtgattataaaacagtgcagaaaGGTAGTCTAAatgaacatttcaaaattcatacagatggtGGATATAAATGtatagaatgtgattttaaaacggcGTGGAAAAGTTGTCTGacaaaacatgttaaaattcattcagtacatctaaaggaacatgttaaaattcattctggtgatgaatataaatgtaaagaatgtgattataaaacagtgcagaaaGGTAGTCTAAatgaacatttcaaaattcatacagatggtGGATATAAATGtatagaatgtgattttaaaacggcGTGGAAAAGTTGTCTGacaaaacatgttaaaattcattcaggtgatgaatataaatgtaaagaatgtgattataaaacagtgtggaaaggtAGTCTAAacgaacatgtcaaaattcattcaggtgatgaatataaatgtaaagtatgtgattataaaacagtatggaaacatcatttaaaacaacacatcaaaattcacacaggtgatgaatataattgtaaagaatgtgattataaaacagtgtggaaaggtAGTCTAAacgaacatgtcaaaattcattcaggtgatgaatataaatgtaaagtatgtgattataaaacagtatggaaacatcatttaaaacaacacatcaaaattcacacaggtgatgaatataattgtaaagaatgtgattataaaacagtgtggaaaaatgaTATGAAGGATcatgttaaaattcacacaggtaataaatataagtgtaaagaatgtgattataaaactgtgcggaaatttcatttaaaggaacatgtcaaaatgcatacaggtgatgaatataaatgtaatgaatgtggtTTTAAAACCGTGTGGAAAAATAATCTGaaggatcatgtcaaaattcacatagattatgAGAATGCAGAATGGAAAACTGAACCACTAAATTATGAAGAATTCTTAAAATGCAAGGAAGATGATACTGTTTCATTGCAAGAATATTTCTgtaatgagtgtaattttaTGACGACGGAAGaagattctctaatagaacatttgaaaattattaaaaatgttgaatatttttgtaagaaatgtaactttaaaacctTGATGGAATGTTCGTCAAAAGCACATTTAAGGACTCACAACGAAGtatatattagatatattaCCGAGGAATGTACTTTTAATAGGCCATGGATTTTTTCATTAATGCCACAGTTAAAGACTCCAATAAGTGGTGACCAATATGTTTGCAATGAATGTTCTTATACTacattagtaaaaaattatctaaaatgcCATATGATAATTCATGGAAGTGATAAatataaagaatgtgattataaaccAATACAGAAAGATAGTTTAAAGGAACAGGTCAAAATTCACACTgttgatgaatataaatgtaaagaacgtgattataaaacagtgcggaaaaggCGTCTAAAagtacatgtcaaaattcacacaggcgatgaatataaatgtaaagaatgtgattataaaacagtgcggaaaaggGGTCTAacagaacatgtcaaaattcataaaggtgatacatataaatgtgaagaatgtgattataaagcagTGCGGAAAGATCGACTCAgagaacatgtgaaaattcattcgggtgacgaatataaatgtagggtatgtgattataaaacagtgcggaaagatTGCCTGAAactacatatgaaaattcatgatgaatataaatgtaaagaatgtgattttaaaatagTGGGGTTGAATTATTtcaaggaacatgtcaaaattcatggatgtgttgaatataagtgtgaagaatgcaattataaaacagtgtggaaaaatgaTCTAACtagacatgtcaaaattcatacaggggatgaatataaatgtaaagaatgtgagtaTAAGACAATACGGAAAGAtcgtttaaaggaacatgtcaaaattcacacgggtgatgaatttaaatgtaaagaatgtgattataaaacagtatggaaagaTCGCTTCAAGGAACACCttaaaattcatgcaggtgatgaatataaatgtaaagaatgtgattataaaacagtacggaaagATAGTTTAATggcacatgtgaaaattcacacaggtgatgaatataaatgtaatcaatgtgattataatacagtgtggaaaaatagtctaataaatcatgttaaaattcatacagatgatgaatataaatgtaaagaatgtgattataaaacagtgtggaaaaataatctGGAGAATCATGTTAAAAGTCATACAGgtgataattataaatgtaatgaatgcaaTTATACAACACTGTGGAAAAATAGTCTGAAgcatcatgtcaaaattcatacaaatgatgaatataagtgtgaagaatgtgattataaaacagtgtgggaaAATGATCTAAAGAGACATCTCAACATTCATACAGGAGatagatataaatgtaaagaatgtgattataaaacacggtGGTCAAATAGTCTGACagaacatattaaaattcatacaaatgatgaatataagtgtaaagaatgtgattataaaacagtatggaaatatAATCTGAAGGATCATTgcaaaattcatgcaggtgatgaatataattgtaaagaatgtgattataaaactgtacgggAAGGTAGTTTAAAGAAACATaccaaaattcacacaggtgatgaatataaatgtaatgaatgtaattataaaacactgtggaaaaATAGTCTGAAACatcatgtaaaaattcacacagatgatgagtataagtgtgaagaatgcgactataaaacagtatggaaaaatgaTCTAAagagacatgtcaaaattcatacaggggatgaatataaatgtaaacaatgtgattataaaacagtgcagaatggtcatctaaaggaacatatcaaaattcatacaggtgatgaatacaaatgtacagaatgtgattataaaacagtgtggaaaaataatctgaaggttcattataaaattcatacagggaATGCCATTATAAAACAGTGGAAAAATGATCTAAagagacatgtcaaaattcGAACACcagatgaatataaatgtgatgaatataaatgtaaagaatgtgattataaaacagtgtggaaaaatgaTCTAAagagacatgtcaaaattcatacaggtgatgaatataaatgtaaagaatgtgattataaaacagtgtggaaaaatgaTCTAAagagacatgtcaaaattcatacaggtgatgaatataaatgtaaagaatgtgattataaaacagtgtggaaaaataatctgaaggttcattataaaattcatacaggtgatgaatataaatgtaaagaatgtgattataaaacagtgtggaaaactCATCTGAAGgttcattataaaattcatacaggtgatgaatataaatgtaaagaatgtgattataaaacagtgtggaaaaatgaTCTAAagagacatgtcaaaattcatacaggtgatgaatataaatgtaaagaatgtgattataaaacaggaTGGAAAAAATCTCTAaagaatcatgtcaaaattcatataggtgatgaatataaatgtaaagaatgtgattataaaacagtggaaaaaaataaaactgtatgtgcataa
Protein Sequence
MDIDSNESFAIKSEAVLTETFSFREEYSDYENAQWKTESVNYEESFKCKVEEDPVSLQQYSCNECNFMTTEKDSLMEHLKITKNVEYFCKKCNFQTSMECSTKDHLRIHNEVYIRYYRYVTEECNFNMPWIFPLMPQIKISRNGDQYVCNEWNYSTFVKNYLKSHVKIHAGDEYKCKECDYKTVRKDYLREHVKIHTAGEYKCKDCDYKTVWKRSLNEHVKIHTGGEYKCKECDYKTVWKHSLNKHVKIHTGDEYKCKECDYKTVWKNNMRNHVKIHTGDEYKCKECDYKTVRKDHLMEHVKLHTGDKYKCKECDYKTVRKYRLKEHVKIHTGDEYKCKECDYKTVWKYRLKEHVKIHTGDEYKCKECDYKTVQKVHLKEHVKIHSGDEYKCKECDYKTVQKGSLNEHFKIHTDGGYKCIECDFKTAWKSCLTKHVKIHSVHLKEHVKIHSGDEYKCKECDYKTVQKGSLNEHFKIHTDGGYKCIECDFKTAWKSCLTKHVKIHSGDEYKCKECDYKTVWKGSLNEHVKIHSGDEYKCKVCDYKTVWKHHLKQHIKIHTGDEYNCKECDYKTVWKGSLNEHVKIHSGDEYKCKVCDYKTVWKHHLKQHIKIHTGDEYNCKECDYKTVWKNDMKDHVKIHTGNKYKCKECDYKTVRKFHLKEHVKMHTGDEYKCNECGFKTVWKNNLKDHVKIHIDYENAEWKTEPLNYEEFLKCKEDDTVSLQEYFCNECNFMTTEEDSLIEHLKIIKNVEYFCKKCNFKTLMECSSKAHLRTHNEVYIRYITEECTFNRPWIFSLMPQLKTPISGDQYVCNECSYTTLVKNYLKCHMIIHGSDKYKECDYKPIQKDSLKEQVKIHTVDEYKCKERDYKTVRKRRLKVHVKIHTGDEYKCKECDYKTVRKRGLTEHVKIHKGDTYKCEECDYKAVRKDRLREHVKIHSGDEYKCRVCDYKTVRKDCLKLHMKIHDEYKCKECDFKIVGLNYFKEHVKIHGCVEYKCEECNYKTVWKNDLTRHVKIHTGDEYKCKECEYKTIRKDRLKEHVKIHTGDEFKCKECDYKTVWKDRFKEHLKIHAGDEYKCKECDYKTVRKDSLMAHVKIHTGDEYKCNQCDYNTVWKNSLINHVKIHTDDEYKCKECDYKTVWKNNLENHVKSHTGDNYKCNECNYTTLWKNSLKHHVKIHTNDEYKCEECDYKTVWENDLKRHLNIHTGDRYKCKECDYKTRWSNSLTEHIKIHTNDEYKCKECDYKTVWKYNLKDHCKIHAGDEYNCKECDYKTVREGSLKKHTKIHTGDEYKCNECNYKTLWKNSLKHHVKIHTDDEYKCEECDYKTVWKNDLKRHVKIHTGDEYKCKQCDYKTVQNGHLKEHIKIHTGDEYKCTECDYKTVWKNNLKVHYKIHTGNAIIKQWKNDLKRHVKIRTPDEYKCDEYKCKECDYKTVWKNDLKRHVKIHTGDEYKCKECDYKTVWKNDLKRHVKIHTGDEYKCKECDYKTVWKNNLKVHYKIHTGDEYKCKECDYKTVWKTHLKVHYKIHTGDEYKCKECDYKTVWKNDLKRHVKIHTGDEYKCKECDYKTGWKKSLKNHVKIHIGDEYKCKECDYKTVEKNKTVCA

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-