Basic Information

Gene Symbol
-
Assembly
GCA_907165275.1
Location
OU015659.1:10913326-10940110[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the following pattern: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers indicate the number of residues. The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this definition does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
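The pattern in the description can be approximated with a regular expression. The following is a minimal sketch, not the method used to produce this record (the domain calls below come from hmmscan). It assumes that the positions marked # may be any residue, since the description does not list the residues allowed there; the names `C2H2_PATTERN` and `find_c2h2_candidates` are hypothetical.

```python
import re

# Regex rendering of #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# Assumption: '#' positions are matched as "any residue" because the
# description does not say which residues are permitted there.
C2H2_PATTERN = re.compile(
    r".{1}.{1}C.{1,5}C"        # #-X-C-X(1-5)-C
    r".{3}.{1}.{5}.{1}.{2}H"   # X3-#-X5-#-X2-H
    r".{3,6}[HC]"              # X(3-6)-[H/C]
)


def find_c2h2_candidates(protein: str) -> list[tuple[int, str]]:
    """Return (1-based start, matched text) for non-overlapping pattern matches."""
    return [(m.start() + 1, m.group()) for m in C2H2_PATTERN.finditer(protein)]


if __name__ == "__main__":
    # Fragment taken from the protein sequence of this record.
    fragment = "YSCQFCDKTFVQEAALKIHVKRH"
    for start, text in find_c2h2_candidates(fragment):
        print(start, text)
```

A plain pattern match like this is far less sensitive and less specific than a profile HMM search, so it is only useful as a quick sanity check against the hmmscan coordinates.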
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 41 0.0052 0.39 11.8 2.2 2 23 153 174 152 174 0.96
2 41 0.021 1.5 9.9 0.1 1 23 202 224 202 224 0.96
3 41 0.0013 0.098 13.7 0.2 3 23 307 327 306 327 0.99
4 41 5.6e-06 0.00042 21.2 1.4 1 23 333 355 333 355 0.97
5 41 0.0011 0.084 13.9 0.6 1 23 361 383 361 383 0.99
6 41 6.3e-06 0.00047 21.0 0.7 1 23 389 413 389 413 0.98
7 41 3e-05 0.0022 18.9 1.5 1 23 419 441 419 441 0.98
8 41 0.00013 0.0098 16.9 2.0 1 23 447 470 447 470 0.96
9 41 0.0079 0.59 11.3 1.3 1 23 477 500 477 500 0.96
10 41 0.0038 0.28 12.3 1.8 1 23 509 532 509 532 0.94
11 41 0.0032 0.24 12.5 2.8 1 23 538 561 538 561 0.97
12 41 0.3 22 6.3 5.7 1 20 565 584 565 585 0.95
13 41 0.0027 0.2 12.7 0.8 1 23 593 615 593 615 0.98
14 41 0.0078 0.58 11.3 6.3 2 23 623 645 622 645 0.96
15 41 0.021 1.5 9.9 0.1 1 23 802 824 802 824 0.96
16 41 0.0027 0.2 12.7 0.2 3 23 931 951 930 951 0.99
17 41 2.5e-05 0.0018 19.1 0.5 1 23 957 979 957 979 0.98
18 41 0.00014 0.01 16.8 1.5 1 23 985 1007 985 1007 0.99
19 41 6.3e-06 0.00047 21.0 0.7 1 23 1013 1037 1013 1037 0.98
20 41 3e-05 0.0022 18.9 1.5 1 23 1043 1065 1043 1065 0.98
21 41 0.00047 0.035 15.1 2.0 1 23 1071 1094 1071 1094 0.96
22 41 0.026 1.9 9.6 2.4 1 23 1101 1124 1101 1124 0.94
23 41 0.048 3.6 8.8 3.0 1 23 1130 1153 1130 1153 0.95
24 41 0.54 40 5.5 3.3 1 20 1157 1176 1157 1178 0.94
25 41 0.058 4.3 8.5 0.7 1 23 1185 1207 1185 1207 0.96
26 41 0.023 1.7 9.8 6.1 2 23 1215 1237 1214 1237 0.96
27 41 0.021 1.5 9.9 0.1 1 23 1262 1284 1262 1284 0.96
28 41 0.0013 0.098 13.7 0.2 3 23 1367 1387 1366 1387 0.99
29 41 3.6e-06 0.00027 21.8 0.8 1 23 1393 1415 1393 1415 0.97
30 41 0.0042 0.31 12.1 0.6 1 23 1421 1443 1421 1443 0.99
31 41 6.3e-06 0.00047 21.0 0.7 1 23 1449 1473 1449 1473 0.98
32 41 3e-05 0.0022 18.9 1.5 1 23 1479 1501 1479 1501 0.98
33 41 0.00047 0.035 15.1 2.0 1 23 1507 1530 1507 1530 0.96
34 41 0.0047 0.34 12.0 0.6 1 23 1537 1560 1537 1560 0.94
35 41 0.066 4.9 8.4 3.2 1 23 1566 1589 1566 1589 0.95
36 41 0.054 4 8.6 3.3 1 20 1593 1612 1593 1614 0.95
37 41 0.0014 0.1 13.7 0.5 1 23 1621 1643 1621 1643 0.97
38 41 0.00014 0.01 16.8 3.5 2 23 1651 1673 1650 1673 0.96
39 41 0.079 5.8 8.1 4.4 1 22 1678 1699 1678 1702 0.91
40 41 1.3e-05 0.00098 20.0 1.8 1 23 1703 1725 1703 1725 0.99
41 41 0.012 0.89 10.7 3.8 1 21 1732 1752 1732 1753 0.94
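The rows above are whitespace-separated with 13 columns each, so they can be loaded into typed records and filtered. The following is a minimal sketch under stated assumptions: the field names in `DomainHit` are illustrative labels for the header columns, and the default cutoff of 0.01 on the i-Evalue is a hypothetical choice, not a threshold used by this database.

```python
from typing import Iterable, NamedTuple


class DomainHit(NamedTuple):
    """One row of the per-domain table (field names are illustrative)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# Type of each column, in the order the columns appear in a row.
_CASTS = (int, int, float, float, float, float,
          int, int, int, int, int, int, float)


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated table row into a DomainHit."""
    fields = line.split()
    if len(fields) != len(_CASTS):
        raise ValueError(f"expected {len(_CASTS)} columns, got {len(fields)}")
    return DomainHit(*(cast(value) for cast, value in zip(_CASTS, fields)))


def filter_hits(lines: Iterable[str], max_i_evalue: float = 0.01) -> list[DomainHit]:
    """Keep rows whose i-Evalue is at or below the (hypothetical) cutoff."""
    hits = [parse_hit(line) for line in lines if line.strip()]
    return [hit for hit in hits if hit.i_evalue <= max_i_evalue]


if __name__ == "__main__":
    # Three rows copied from the table above (domains 4, 8 and 12).
    rows = [
        "4 41 5.6e-06 0.00042 21.2 1.4 1 23 333 355 333 355 0.97",
        "8 41 0.00013 0.0098 16.9 2.0 1 23 447 470 447 470 0.96",
        "12 41 0.3 22 6.3 5.7 1 20 565 584 565 585 0.95",
    ]
    for hit in filter_hits(rows):
        print(hit.index, hit.ali_from, hit.ali_to, hit.i_evalue)
```

Filtering on the i-Evalue rather than the c-Evalue is a design choice for this sketch; individually weak domains in a long tandem array may still be genuine, so a strict cutoff can undercount the fingers.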

Sequence Information

Coding Sequence
ATGAGTTACCAAGTTGGAGATTTAGTTTGTGGCTGGTCCAGTGCAAGCAATTTAAAACCTTATCATGAATTTAAAGATGAACTTAAAGAATTCAGCAAGAAGAGCAACTTTAAACTTGCGATTGAAGAAATAGAAAAGTACATTGAAAACGCTGAGAAATTCAAATCATCGAGCAGCATTACAGATAGTCATAATGAAATATATAATGGAACAACAAATGGTGTGATAATTCAAAATGGTATCAATGAAGACTCAAGAGAAAGTTCAGTTATACAAGAGAAATCGATGCAACTTAATGATAAGGGCTTGGATGTTTCGGTGGTATTGCCCCGTGTTGATATATACGACTGCAAAAGATGTTCCCGGGCGTATTTGAGCCTTGTGGAGTTGGAGGACCACGCTATCGATACACATATGGACACCGCGCATGTTGCTTTGGATCGTTTGGGTGATCTCTGTTGTGACAAATGCGGGAGTAGATTCGCTAATTCATTTGACTATACCATGCACATGCTGTCGCATCTCCGGGAGAATATTCACAGTTCACATCGCAAACGGAAGACTTACTACGACAATAAAGAATGTAAAACTTTAGCAGCGAAATACCAATGTGAGAAATGCGACGCGAAGTATGTTCTAGAGAGCACGCTGATGGCCCATTCGGCTGTCCACGAGCCGATGCCACACATATGCGAGTGCGGTGTGGGATACTACGCCAAGAATGATCTCGCAGCCCATAAACTATTGGTTCACGCGCCCCCCATACAAACAGAAACGGGAAAAAATTGCACAAAAATAAAACGTGTAGAAGAACGAGATCTCAAAGTAGCTTGCAGAAAGCGGAGAGCGACAGATAAATTGACCGACGTTGAGCAACAGATTCTCAAGAACCTGGTAAAGAAAAATGATAAATTGTTGTGTCCGATATGTAACAAAGGGATATCTATGAAAAGCAACTTTATAATACACTATCGTATACACACAGGCGACAAACGACATAGCTGTCAATATTGTGGCAAATCTTTTGTGCAAGCGACAGCGTTAAAAATTCATGTCAAAAGGCATACGGGTGTCAAAGATTACGTGTGTAAGTTTTGCGATAAGGGTTTCCACGACGGAGGCGCCTTATCCCTGCACGTGCGAACGCACACCGGAGAGAAGCCCTACGAGTGCAAACACGAGCTTTGCGGAAAGAGATTCGCGGACCCATCTGCGTTCCAAAGACACGTTCGTATACACACTGACGACAAGAGGTACACTTGCGACCGATGCCCTAAAGCGTTCACAGACAGTAGTGCTTTGTTTTATCACAAAAAACGCCACAACAACGACAAGAAACACTTCTGTCCGACTTGCGGCAAAGGATTCTACGTCAAATACTACCTCGAACAACACGTACGGAAGATGCATTTACCCAGACAGAAGACATATGAATGTAAATTTTGTCTGAAGATGTTTTACGAGAGCAGATACTTGGAAACACATATAAGCAATATTCACACGAAAGAACCCAGACAGAAGACATATGAATGTAAATTTTGTCTGAAGATGTTTTATAAGAGCAGAAATTTGGAAACACATGTGACTAATATTCACACGAAAGAAAAAGAGTTCCATTGCGACTACTGTGGTCGAGTATACCACTTCAAACCGACTCTCGCAATCCACATAAGAAATCAGCACACGATTATTTTTTACTGTCACAACTGCAACTCAAAATTTTTCTCGGAGTCACATTTCCAGCGGCATAAAGCCGAAGGGTGTACAAAAAAGGTGTACGTGTGTGAAATATGTGATAAGGAGTATTTGAAAAAAGAATCTTACAATCGTCATATAAATTATCACGCACGTTTGACGCAGCCTGTGGAATGCCATCACTGCGGCAAACGGTGTAACGAGAAGAGAAATCTCCTAGAGCACATATTCACCAGACATTTAGGGTATCCTTACAAATGTTGGGAATGCTCGAAAAAATTTCTCACCGGTTGCGATTATAAAAA
ACACACGTGTATAAAGAGACACTACACTTGTAAAGTTTGCAGTATAGTATATTTCATTATAAGTGCGGAAAGCCTGTCATTGCAAAGAGTTCCGACATATGTCTACCCGAGCCGAGGCGCAGCCGAAGTTTGGATTTGTAAAATAGAGTTAATGGAACAAAATGTTAAAAATCTGTTATGCGATGCACATGTGCCGCAGCGACGATGGATGATGTGGCTCGACCGCCACCCAAGGGGAAGCTTTCCGCCGGGTGAGGTGGTATGCCTGGGGAACCGCGGCTCCGATTGTAGATCGTCGGAGATCGTACAGTATATGCAGCGCAATGTGAAAACTTCTCACAGTTCACACGGCAAACGGAAGACTAATTACGCCAGTAAAGAATGTAAAACGTTAGAAGCGAAATACCAATGTGAGAAATGCGACGCGAAATATGTTCTAGAGAGCACGCTGATGGCCCATTCGGCTGTCCACGAGCCGATGCCACACATATGCGAGTGCGGTGTGGGATACTACGCCAAGAATGATCTCGCAGCCCATAAACTATTGGTTCACGCGCCCACCGTACAAACAGAAACAACTAAAAAATATCAAAAAATAAAACAAGAAACGATTAGCCGAAGCTCGTCTGACTATGATTATGACGACAGTGATGAATCGTATACTAAACCGAAACGGTTAAAAAAAAGAGGTCCCAAATTAGCTCGTAGAAAGCGGAGAGCGACAGAAAAATTGACTGACGTTGAGCAACAGATTCTCAAGAATCTGGTAATGAAAGATGATAAATTGTTGTGTCCGATATGTAACAAAGGGATGGCTACGAAACGCAACCTTATAATGCACTATCGCATACACACGGGTGACAAACGATATAGCTGTCAATTTTGTGACAAAACTTTCGTGCAAGAGGCAGCGTTAAAAATTCACGTAAAAAGGCATACGGGTGTCAAAGATTACGTGTGTAGGTTTTGCGATAAAGGGTTCTGCGACAAAGGTGGCCTAATGCTGCACGAACGTACGCACACCGGAGAGAAGCCCTACGAGTGCAAACACGAGCTTTGCGGCAAGAGATTCGCGGACCCATCTGCGTTCCAAAGACACGTTCGTATACACACTGACGACAAGAGGTACACTTGCGACCGATGCCCTAAAGCGTTCACAGACAGTAGTGCTTTGTTTTATCACAAAAAACGCCACAACAACGACAGGGGACACTTCTGTCCGACTTGCGGCAAAGGATTTTACATCAAATACTACCTCGAACAACACGTACGGAAGATGCATTTACCCAGACAGAAGACATATGAATGTAAATTTTGTTTGAAGATGTTTTACGAGAGCATACATTTGAAAACGCATGTGAGCAATATTCACACGAAAGAAAAAGAGTTTCATTGCGACTACTGTGGTCGAGTATACAACTGCAAACCGACTCTCGCAACCCACATAAGAGATCAGCACACGATTATTTTTTACTGTGACAACTGCAACTCTAAATTTTACTTGGAGTCACATTTCAAGCGGCACAAAGCCCAAGGGTGTACAAAAAAAGTGTTCGTGTGTGAAATATGTGATAAGGAGTATTTGAAAAAACAATCTTACAATAGTCATTTAAATGACCACGCTCGTTTGACGCAGCCTGTGGAATGCCATCACTGCGGCAAACGGTGTAACGAGAAGAAAGGTCTCCTACAGCACATATTCACCAGACATTTAGGTCACAGTTCACACGGCAAACGGAAGACTAACTTCGCCAGTAAAGAATGTAAAACGTTAGAAGCGAATTACCAATGTGAGAAATGCGACGCGAAATATGTTCTAGAGAGCACGCTGATGGCCCATTCGGCTGTCCACGAGCCGATGCCACACATATGCGAGTGCGGTGTGGGATACTACGCCAAGAATGATCTCGCAGCCCATAAACTATTGGTTCACGCGCCCCCCATACAAACAGAAACGGGAAAAAATTGCACAAAAATAAAACGTTTAGAAGAACGAGATC
TCAAAGTAGCTTGCAGAAAGCGGAGAGCGACAGATAAATTGACCGACGTTGAGCAACAGATTGTCAAGAATCTGGTAATGAAAGATGATAAATTGTTGTGTCCGATATGTAACAAAGGGATATCTATGAAAAGCAACTTTATAATACACTATCGTATACACACGGGCGACAAACGACATAGCTGTCAATATTGTGGCAAATCTTTTGTGCAAGCGGCAGCTTTAAAAATTCATGTGAAAAGGCATACGGGTGTCAAAGATTACGTGTGTAGGTTTTGCGATAAGGGTTTCCACGACGGAGGCGCCTTATCCCTGCACGTGCGAACGCACACCGGAGAGAAGCCCTACGAGTGCAAACATGAGCTTTGCGGCAAGAGATTCGCGGACCCATCTGCGTTCCAAAGACACGTTCGTATACACACTGACGACAAGAGGTACACTTGCGACCGATGCCCTAAAGCGTTCACAGACAGTAGTGCTTTGTTTTATCACAAAAAACGCCACAACAACGACAAGAAACACTTCTGTCCGACTTGTGGCAAAGGATTCTACATCAAATACTACCTCGAACAACACGTACGGAAGATGCATTTACCCAGACAGAAGACATATGAATGTAAATTTTGTCTGAAGATGTTTTACGAGAGCATAAATTTGAAAGCGCATGTGAGCAATATTCACACGAAAGAAAAAGAGTTTCATTGCGACTACTGTGGTCGAGTATACAACTGCAAACCGACTCTCGTAACCCACATAAGAGATCAGCACACGATTATTTTTTACTGTGACAACTGCAACTCTAAATTTTACTCGGAGTCACATTTCCAGCGGCACAAAGCCCAAGGGTGTACAAAAAAGGTGTTCGTGTGTGAAATATGTGATAAGGAGTATTTGAGAAAAGAATCTTACAATCGTCATATAAATGACCACGCTCGTTTGACGCAGCCTGTGGAATGCCATCACTGCGGCAAACGATACAACGAGAAGAGAAATCTCCAAGAGCACATACTCTCCAAACATTTGGGGTACCCTTACAAATGTTGGGGATGCTCGAAAAAATTTGTCACCGGTTTCGATTATAAAAAACACACGTGTATAAAGAGACACTACACTTGCAAAGTTTGCGGTAAGGAATTTTACAGGAAGGATCAATATGTGGTTCACAGGAATAGACACAAGGGCTTTTACAGGACCTACCGCTGTTATTGTTGCGATAAGCAGTTCAATACTAAGACTGATTTGCGCCAGCACATGTTGCGTCAGATGCGCAGGGAAATGCCGCGTTGCACCCCTTGTCGTAAACGCGGCCAACGATTAGTTCAGATTTTGACCGCGCGCAGACTTATCTTGCGCGCGCTCTACAACCGATTTAACTCCCCACCGCTACAGTGCGGGCGGGCGGCTTGTGTGCACACGTGCAGGGTGCTTGTGCGGCGCAGGGCTTGCGGCGAGCGGCAGGGGCAGCGGGAGGGGGGAAGGGCACATTCAAGTTTGATACCGTACTGA
Protein Sequence
MSYQVGDLVCGWSSASNLKPYHEFKDELKEFSKKSNFKLAIEEIEKYIENAEKFKSSSSITDSHNEIYNGTTNGVIIQNGINEDSRESSVIQEKSMQLNDKGLDVSVVLPRVDIYDCKRCSRAYLSLVELEDHAIDTHMDTAHVALDRLGDLCCDKCGSRFANSFDYTMHMLSHLRENIHSSHRKRKTYYDNKECKTLAAKYQCEKCDAKYVLESTLMAHSAVHEPMPHICECGVGYYAKNDLAAHKLLVHAPPIQTETGKNCTKIKRVEERDLKVACRKRRATDKLTDVEQQILKNLVKKNDKLLCPICNKGISMKSNFIIHYRIHTGDKRHSCQYCGKSFVQATALKIHVKRHTGVKDYVCKFCDKGFHDGGALSLHVRTHTGEKPYECKHELCGKRFADPSAFQRHVRIHTDDKRYTCDRCPKAFTDSSALFYHKKRHNNDKKHFCPTCGKGFYVKYYLEQHVRKMHLPRQKTYECKFCLKMFYESRYLETHISNIHTKEPRQKTYECKFCLKMFYKSRNLETHVTNIHTKEKEFHCDYCGRVYHFKPTLAIHIRNQHTIIFYCHNCNSKFFSESHFQRHKAEGCTKKVYVCEICDKEYLKKESYNRHINYHARLTQPVECHHCGKRCNEKRNLLEHIFTRHLGYPYKCWECSKKFLTGCDYKKHTCIKRHYTCKVCSIVYFIISAESLSLQRVPTYVYPSRGAAEVWICKIELMEQNVKNLLCDAHVPQRRWMMWLDRHPRGSFPPGEVVCLGNRGSDCRSSEIVQYMQRNVKTSHSSHGKRKTNYASKECKTLEAKYQCEKCDAKYVLESTLMAHSAVHEPMPHICECGVGYYAKNDLAAHKLLVHAPTVQTETTKKYQKIKQETISRSSSDYDYDDSDESYTKPKRLKKRGPKLARRKRRATEKLTDVEQQILKNLVMKDDKLLCPICNKGMATKRNLIMHYRIHTGDKRYSCQFCDKTFVQEAALKIHVKRHTGVKDYVCRFCDKGFCDKGGLMLHERTHTGEKPYECKHELCGKRFADPSAFQRHVRIHTDDKRYTCDRCPKAFTDSSALFYHKKRHNNDRGHFCPTCGKGFYIKYYLEQHVRKMHLPRQKTYECKFCLKMFYESIHLKTHVSNIHTKEKEFHCDYCGRVYNCKPTLATHIRDQHTIIFYCDNCNSKFYLESHFKRHKAQGCTKKVFVCEICDKEYLKKQSYNSHLNDHARLTQPVECHHCGKRCNEKKGLLQHIFTRHLGHSSHGKRKTNFASKECKTLEANYQCEKCDAKYVLESTLMAHSAVHEPMPHICECGVGYYAKNDLAAHKLLVHAPPIQTETGKNCTKIKRLEERDLKVACRKRRATDKLTDVEQQIVKNLVMKDDKLLCPICNKGISMKSNFIIHYRIHTGDKRHSCQYCGKSFVQAAALKIHVKRHTGVKDYVCRFCDKGFHDGGALSLHVRTHTGEKPYECKHELCGKRFADPSAFQRHVRIHTDDKRYTCDRCPKAFTDSSALFYHKKRHNNDKKHFCPTCGKGFYIKYYLEQHVRKMHLPRQKTYECKFCLKMFYESINLKAHVSNIHTKEKEFHCDYCGRVYNCKPTLVTHIRDQHTIIFYCDNCNSKFYSESHFQRHKAQGCTKKVFVCEICDKEYLRKESYNRHINDHARLTQPVECHHCGKRYNEKRNLQEHILSKHLGYPYKCWGCSKKFVTGFDYKKHTCIKRHYTCKVCGKEFYRKDQYVVHRNRHKGFYRTYRCYCCDKQFNTKTDLRQHMLRQMRREMPRCTPCRKRGQRLVQILTARRLILRALYNRFNSPPLQCGRAACVHTCRVLVRRRACGERQGQREGGRAHSSLIPY

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-