Basic Information

Gene Symbol
ZFY
Assembly
GCA_009761765.1
Location
chr3:24112843-24143391[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 44 0.0082 0.55 10.2 2.1 1 21 97 117 97 118 0.96
2 44 0.00012 0.0083 16.0 3.0 1 23 186 209 186 209 0.97
3 44 0.0045 0.3 11.0 8.4 1 23 215 237 215 237 0.97
4 44 4.6e-05 0.0031 17.3 0.3 1 23 243 265 243 265 0.96
5 44 8.6e-06 0.00058 19.6 0.1 3 23 273 293 272 293 0.97
6 44 5.3e-05 0.0035 17.1 2.6 1 23 299 321 299 321 0.94
7 44 0.21 14 5.8 0.9 1 17 327 343 327 344 0.92
8 44 0.24 16 5.6 1.6 1 23 347 370 347 370 0.95
9 44 0.0014 0.096 12.6 0.1 1 23 412 434 412 434 0.97
10 44 0.013 0.85 9.6 1.3 1 23 467 490 467 490 0.97
11 44 0.32 21 5.2 3.7 1 23 538 560 538 560 0.96
12 44 0.0017 0.11 12.4 1.3 1 23 565 588 565 588 0.96
13 44 3e-06 0.0002 21.0 1.2 1 23 633 656 633 656 0.97
14 44 0.00017 0.011 15.5 4.1 1 23 662 684 662 684 0.96
15 44 4.3e-06 0.00029 20.5 0.5 2 23 691 712 690 712 0.94
16 44 0.00025 0.017 15.0 8.7 1 23 718 740 718 740 0.96
17 44 8e-07 5.4e-05 22.8 0.4 1 23 746 768 746 768 0.97
18 44 0.0039 0.26 11.2 2.0 2 20 844 862 843 864 0.90
19 44 0.29 20 5.3 1.4 2 20 872 890 872 892 0.94
20 44 0.00076 0.051 13.5 1.9 2 23 923 945 923 945 0.96
21 44 0.024 1.6 8.8 2.0 2 23 976 997 975 997 0.97
22 44 3e-05 0.002 17.9 1.5 1 23 1003 1026 1003 1026 0.94
23 44 0.00013 0.0088 15.9 1.1 1 23 1032 1054 1032 1054 0.96
24 44 5.5e-06 0.00037 20.2 0.6 1 23 1060 1082 1060 1082 0.97
25 44 1.2e-05 0.00079 19.2 2.4 1 23 1088 1110 1088 1110 0.99
26 44 0.00013 0.0084 15.9 0.1 1 23 1116 1138 1116 1138 0.94
27 44 0.021 1.4 9.0 0.6 1 19 1144 1162 1144 1162 0.97
28 44 0.0095 0.64 10.0 4.4 1 23 1240 1262 1240 1262 0.98
29 44 5.9e-05 0.004 17.0 0.1 1 23 1335 1358 1335 1358 0.96
30 44 2.6 1.8e+02 2.3 0.3 8 23 1372 1387 1365 1387 0.88
31 44 0.15 10 6.3 2.9 1 21 1393 1413 1393 1415 0.94
32 44 9.7e-07 6.5e-05 22.6 1.2 1 23 1421 1443 1421 1443 0.98
33 44 0.02 1.4 9.0 1.3 1 23 1449 1471 1449 1471 0.87
34 44 0.028 1.9 8.6 2.4 1 21 1477 1497 1477 1498 0.92
35 44 8.4 5.6e+02 0.8 2.6 1 23 1565 1587 1565 1587 0.93
36 44 0.024 1.6 8.8 1.0 1 23 1602 1625 1602 1625 0.90
37 44 3.2e-05 0.0022 17.8 2.3 1 23 1702 1724 1702 1724 0.97
38 44 0.00092 0.062 13.2 0.7 1 23 1730 1753 1730 1753 0.93
39 44 7.3e-06 0.00049 19.8 1.4 1 23 1810 1833 1810 1833 0.94
40 44 4.3e-05 0.0029 17.4 3.3 1 23 1839 1861 1839 1861 0.96
41 44 7.3e-06 0.00049 19.8 0.5 1 23 1867 1889 1867 1889 0.96
42 44 5.1e-07 3.4e-05 23.5 3.1 1 23 1895 1917 1895 1917 0.97
43 44 4.6e-05 0.0031 17.3 0.2 1 23 1923 1945 1923 1945 0.97
44 44 0.0035 0.23 11.4 2.6 1 23 1951 1973 1951 1973 0.98

Sequence Information

Coding Sequence
GAAGATGACACATTTGATAATTATGTCGATGATTACCGCGACGATTACGATGATCAGTCGCGATCGAATTTTGATACGCTTCCTCTCAACTCTGCTCCTAGTACCAGTAATTTAGCCGGTGACGATTTGTTAAAAAGTAACTCCATTGAAATTAGCCAGCTAAGTGAATCGAAAGAACCTAATGAGACTGAATTAGATAATACAGATGAAAGGACCGCAGTCGTTCTGACGGCCGACGACGAGAAGCCTAGTAACCATACCGGCGGCGACGGTGACGATTCGATATTCTATAAGTGTGCGAAATGCACCAAAGTGTTTTTGAAAAGACACAATTACCAACGGCACGTTCGCGCTATGTGCGCCGCGAAAAGTAAATTTCGCGCCACGAAATCGCCTTCGGAGGAAGAAGTGGCGGCGAACGAGTTGACTGGGAAGTGCGAGCGCGTCGTGATAAACGGGTTGAATTTTTTTAAATGCGCCGACTGCGCGTTCGTCAGCAAAACGGTGAACGCGAAGATGCTGAGGCATTTGAAGAAGCACGAGATACCGCGTATATTCAGCTGCGAAACGTGCGGTCATCAGTTTCGCGAGAAATACGCGCTGATGAAGCATATCAAGAGGCAGCACGAAAACGATCGTAAGTTCGCGTGCGACACGTGTCATTGTATGTTTTTCGATCGTTTCCATTTGGAAAATCATATGCGTCGGCATACGGGCGAGAGGCCGTTTGTATGTGACGAATGTGGCAAAGCGTTCGCCCAAGGATCGTCGTTATTTGTGCACAAGAAATTCCACGTTGGTGAACGATCGAATATTTGCCAGCTATGCGGTAAAGGTTTCGTCACTAATTCGGGCCTGTTGGTGCATATTCGTCGGCACACCGGCGAACGTAAACACGCGTGTAGTCACTGCGACAAGACGTTCGTCAGTACCAGCCAGCTCAAGTCGCACGTGGTGGTGCATTTCGACGAGCGACCGTTCAGCTGCTCCGAATGCGGAGCGCAATTTAAATTACGGAAACATTTGAAATACGACAGTTTCAAGTGCCATTTCTGCCCGATGGTTTTCTTGAAGGCGGACGTACGGAATAGGCATTTGCGTATATCACACCGTAAAGAGTTGTTGAAACACTCGGTGTCGTATTTCCGCGGCCTTGGCTTGAACGACAAGACGAACGCGGACGTGGTCCGCGAGGCGGCTGCGACGAAGAAAGACGAACCACCGGTGCCGCAATACCATTGCGACAAGTGCGGTAACGCGTTCGGCGATAAGGTGGCGTTGATCGAGCATTTAACGGTGCATCGAGACGCCGTGCAGCGGCCAAAGACCAAGAAGATGGAGAGGAAACGTAAATTCGCGAGACGTTCGCGATTACCCTGTCTGGCGGCTCCGGCTTGGTACACGTGCGCCCATTGTTCGGCCATCGTGCGGACCAAGATGGATATGAAGAGGCATTTGATGTCGTCGCACGAATTCAAGCTCGATAAACGTGGCGGTATTAGGGAACGGCGGAAGCCAGCGGAAGATGAACAAACGGTGATGGATATGGTAAGGAAATCGTTCAAGTCGACCAGCGCCAGCGGTCAGGATGAAGATTTGACGATCGGTTATTATACGTGCGCGGTTTGCGGGCTTATATTTTTGAAGAGGCATTCGTTCACGCGACACGTGCATCAGCATTCGTCGTGTCGATACGAGTGCGATATGTGTAAGAAAACTTTCAACAGCGAAAACGTGTTGCTGGAGCATCGCAAGAATAAGCACAAGGAGATACCGAAAAACGTATTTTTCGTTAATGGTCGCAAGTACCTGAAATGCGAACTGTGCTCGTTCACTTGTCGTAACGAACGTAACAAAATGGAGGAACATTTGCGCGTGCACACCGGCGAGAAACCGTTCACGTGCGAACAGTGCGGCAAACAGTTTCGTACGAGGGCGTTGTTGAGGGTCCATCGAAGGTACGTGCACGACGGCGTTAAGGAACACGCGTGCGATTTATGCGGCCGATGTTTTTCGAACAAACGATACATGGAGGAGCATCGGCGTATACATACGGGCGAAAAACCGTTGATTTGCGATTTATGCGGTAAAACTTTTCGGCAGAATTCGTCGCTGTCGAAACACGTTGAGAATCATATGGGTATTAAGAGGCACGCGTGCCATTTATGCGAGAAACGATTTTCCAACAGTCATCATTTGAGTATACACATTAGACGGCACATGGGCGAAGCGAGCTTCGTGTGCAATAAATGTGGTAAAGGGTTCGTCGATCAGTATCAGTTGAAGAATCATCAGGTGGTGCATTCGGAGGAACGACCATGTAAGCGTTCGGTGGCGAAGAAAAGAACGAGCGTAAAATCGCGACCGTCGCTTCTAGCGAAAGCGGTACGAGTATCCGCGTCGAAAGCTCGCGACGAAGCTACGAAAGAATCGCCGATCGCATCTCTTAAGGTCGAAAACGACGATACGAAAATCGGCGACGACGTCGCAAAGCGCTCGAAACGTAAAGGTTACCCGTGTTCGAATAAAACGTGTCCGACTTGCGGCAAAACGTTCACCCGTAGTAACGGTTGCAAACGTCATGTGGAGAGTAGAATCTGTATTCGGAGTCGCAAGTGCGCGAAATGCGGTCTGCGGTGCGCGAGTTACGCGATGTTATGCAAACACGTGGTGAGCTGCGAGGGTGAGCTTTCGAACATTGTGATTCGGAAATCCGAAACAAGCGCCGGCCACGTTACGAGCTCGTCTGCAGTAAAGCAGTTTCCTCGATGTGTTACGTGCAGTAAAACCTTCAGGACTATGTATAGCGTTAATAGGCACATTAAAAGAGTTCACCGATACGACGAAGCGAACGAAAAGTTGGAGAGGAAGAAGAACTTACACTTCGAACACCTAAATGTAACCGAGGATGGTAAGGAAATATTGAAATGCGACCAATGCGGTTACGCGACACCGTTCAAGCATCGTTTCGAGCAGCACTATCGTATTCATACGGGCGAGAGGCCGTTCTTGTGTTTTCATTGCGGTAAATCGTTCCGTACCAAGGGATATGTTCTGCGGCATATACTGGTCGTGCACGAAGGAGTCAAGAAATTCTCGTGCGATATTTGCGGTCGGTCGTTTTCCGATCGTCGATTCATGGAGAGCCATCGCTACATACATACCGGCGAACGTCCGTACGTTTGCGATATCTGCGGCAAAACGTATCGTCAAAGCGCTACGTTACTTGTACACAAACGATTCCATTTGAACGTGCGTGATTACGAGTGCGAAGTGTGCCAAAAGAAATTCGTTCGTCGAGGACATTTGTTGATGCATTTGAAACGACATAATAACATTAAGGATTTCGCGTGCAGTCAGTGCGGTAAAGCGTTCGTCGATAGGAAGATGTTGAAAGATCACGAAGTGGTGCATTCGAACGAACGTCCGTTTATCTGTGATTTATGCGGTGCCGGCTTCAAGCTGAGGAAGCATTTGAAACAGCACGATCTGGCGACGACGACGATCGCGGGCTTGAAACATGCGGATGCTGCGTCGATCAAACCTCAGTATCACGTTATTTTACCTAAATCCGCTTTGAGCGCAGTCTCACCTTCGAAGGTGGTACCCGAAACGCCTTCGCAGGCTAATGAAACGCGTTGTGCCGCTGCCTCTAGTAATATTCCTAAAGCTTCGGAGAAAAATGCAGCACCAGGATATGGGGTGGTAAATCATCTATACCGGTGTAAGCCATGCGGCAAGGTATTCGCCGATCGTACGAATTGCGTACGTCATTTGTGTACGCACCAAGAAGCCAACAGGCTGCAGACGACGGTGGCTATCGAACGAAATATCGAAGGCTGCCTCGATGGCGAAGTGGTGCACTTGAAAGAGTCCACTTTGCCGAATAATGTAAAGATCGTGGACGGATATGTGATCTGCGACAAGTGCCCCTATCGATGCAAAGCGCTACGAAAACACAGATTTCAAGAGCATTATCGTATACATACGGGAGATCGACCCTATCAGTGTGAGTTGTGCGGTGTAGCGTTTCGTACAAAAGGGGTACTAAAGACTCACGTGAGGGGCGTACACGCGAAACTCGAGGATGAATTCTCGTGCGATGTGTACGACTGTAAGTTCGCGCAGAATACTTCGATGGAAGAACATCGACGTATGCATAATCAAAAACGATTGTTCGTTTGCGATCTGTGCGGCAATCAGTTCGTGCACGCTTCGTCCATGTATGTGCATAAGAGGTGCCATCTGAATGTGCGAGACCATAAGTGCGAGGTTTGCGGAGCCGCGTTCAATACAAGCTCGGTGCTGAGGAGGCACATGCGACGACATACTGGTGATAAGAGGTATAAATGCGGGATGTGCGATAAGGCGTTTATCAATAAGCAAGTTTTGCAATGCCACGAAACAGTGCATTCGAAGGAACGACCGTTTGCCTGCGAACAGTGCGATTCGAAGTTTAAACTGAAGAAACATTTGAAACAGCACGAACAGAAAGCGGATATCGCGCGCCACATGTCCAGTAGTCATTCGGTGACGGCGAAGGCGGTGGTCGACGAAAAAAGTAACGTCGTTAGGAGAACGCCTAAAAGCTCGGCTGGCAGGAAAAGCGTGAAAGCGCGCGTGGACGACGATGCGAACCGGTGGCCGGAAGAGGTGTCGGCGAGTACGTCGGCCAGTTCGTCGTCCACGTTACATCGGTGTAAAGTGTGCCTAAAAATTTGCCTATCTCTTCGCGGTCTGACGCGACATAGCGCTATACACCGCAACATCGGATTGGCGCCGTTGACTAAGACGGATACGACATTTAGCTGTAACTTTTGCGATCAACGGTTCGCCAACGAGGATGATAAACGTAGTCACGTGAGCGATTATCATCGCATCGTAACGCCTATGGAGGGAAACGACGAAAACGAACGAAGATCGTTCGATTCAATTTGCGAGGTTTCAGTCAAGGTGGACGAAGAGAAGGAGAATATAAAGAACGAATTCGGCGATGAAATCGAGATGGTCGACGAGGCGGTAAAAAGTGACCTGGAAGAGGCGAAGAAAAGCACCACTCGCAAACGCAGAAGGAAGAGCGAACCGAGCTCGAGCGGAGATGTATTCGCGTGCGACGAATGCGACAAAACGTTCGACAAGAAGACCAGTTACACTCGTCACAAACTCACGCACAGCACAGGCGACGAATTTCCGTGCGACGTCTGTCAGAAGCGCTTCTCTTCGGAGATGGTGCTGAAACGTCACAAAAGCAACGTACATCGTAGCAAACGAGGCAAGATGTCTGACGAAGAAGACCCATTCAGTAAATGGTACGTTTTCGCCGACGGCCAGAAGCACTTGAAATGTGACCAGTGTAAATATAACAGTCTGTGGTACCATAAAGGCAGGTTCTTGGAGCACTATAGAATACACACGGGCGAGAAACCCTTCACGTGCGACTTATGCGGCAAGCAATTTCGCACGAAGACGTTGCTGCAGAAGCATGTGATGTTCGTGCACGAAGGTATTAAAGAGCACGCGTGCGACATCTGCGGCCGATGTTTTTCGGATAAACGATACATGGAGGATCATCGACGTATACATACCGGAGAGCGACCGTACGTTTGCGATTTATGCGGCAAAACGTTCAAGCAAAGCGCGTCGCTTTTCATACACAAAGAACTGCACAAAGGATTGCGTAAGCATAGCTGCGAAATGTGCGGTAAACGATTCACCACCAGGTCTAGCTTGACCATTCATATTAAAAGACACGTCGGCGAACGTAATTTTATTTGCGCGACTTGCGGCAAAGCGTTCGTTGATGGGAAACACCTGAAAGATCACATGGCCGTGCATTCGGAAGAGCGGCCATTCTCCTGCGAACTATGCGGCGGACGGTTTAAACTGATGAAGCATTTGAAACAACATGGTCGAACTCATAAGCGAGAATATGCGAAAATCGACAAATAA
Protein Sequence
EDDTFDNYVDDYRDDYDDQSRSNFDTLPLNSAPSTSNLAGDDLLKSNSIEISQLSESKEPNETELDNTDERTAVVLTADDEKPSNHTGGDGDDSIFYKCAKCTKVFLKRHNYQRHVRAMCAAKSKFRATKSPSEEEVAANELTGKCERVVINGLNFFKCADCAFVSKTVNAKMLRHLKKHEIPRIFSCETCGHQFREKYALMKHIKRQHENDRKFACDTCHCMFFDRFHLENHMRRHTGERPFVCDECGKAFAQGSSLFVHKKFHVGERSNICQLCGKGFVTNSGLLVHIRRHTGERKHACSHCDKTFVSTSQLKSHVVVHFDERPFSCSECGAQFKLRKHLKYDSFKCHFCPMVFLKADVRNRHLRISHRKELLKHSVSYFRGLGLNDKTNADVVREAAATKKDEPPVPQYHCDKCGNAFGDKVALIEHLTVHRDAVQRPKTKKMERKRKFARRSRLPCLAAPAWYTCAHCSAIVRTKMDMKRHLMSSHEFKLDKRGGIRERRKPAEDEQTVMDMVRKSFKSTSASGQDEDLTIGYYTCAVCGLIFLKRHSFTRHVHQHSSCRYECDMCKKTFNSENVLLEHRKNKHKEIPKNVFFVNGRKYLKCELCSFTCRNERNKMEEHLRVHTGEKPFTCEQCGKQFRTRALLRVHRRYVHDGVKEHACDLCGRCFSNKRYMEEHRRIHTGEKPLICDLCGKTFRQNSSLSKHVENHMGIKRHACHLCEKRFSNSHHLSIHIRRHMGEASFVCNKCGKGFVDQYQLKNHQVVHSEERPCKRSVAKKRTSVKSRPSLLAKAVRVSASKARDEATKESPIASLKVENDDTKIGDDVAKRSKRKGYPCSNKTCPTCGKTFTRSNGCKRHVESRICIRSRKCAKCGLRCASYAMLCKHVVSCEGELSNIVIRKSETSAGHVTSSSAVKQFPRCVTCSKTFRTMYSVNRHIKRVHRYDEANEKLERKKNLHFEHLNVTEDGKEILKCDQCGYATPFKHRFEQHYRIHTGERPFLCFHCGKSFRTKGYVLRHILVVHEGVKKFSCDICGRSFSDRRFMESHRYIHTGERPYVCDICGKTYRQSATLLVHKRFHLNVRDYECEVCQKKFVRRGHLLMHLKRHNNIKDFACSQCGKAFVDRKMLKDHEVVHSNERPFICDLCGAGFKLRKHLKQHDLATTTIAGLKHADAASIKPQYHVILPKSALSAVSPSKVVPETPSQANETRCAAASSNIPKASEKNAAPGYGVVNHLYRCKPCGKVFADRTNCVRHLCTHQEANRLQTTVAIERNIEGCLDGEVVHLKESTLPNNVKIVDGYVICDKCPYRCKALRKHRFQEHYRIHTGDRPYQCELCGVAFRTKGVLKTHVRGVHAKLEDEFSCDVYDCKFAQNTSMEEHRRMHNQKRLFVCDLCGNQFVHASSMYVHKRCHLNVRDHKCEVCGAAFNTSSVLRRHMRRHTGDKRYKCGMCDKAFINKQVLQCHETVHSKERPFACEQCDSKFKLKKHLKQHEQKADIARHMSSSHSVTAKAVVDEKSNVVRRTPKSSAGRKSVKARVDDDANRWPEEVSASTSASSSSTLHRCKVCLKICLSLRGLTRHSAIHRNIGLAPLTKTDTTFSCNFCDQRFANEDDKRSHVSDYHRIVTPMEGNDENERRSFDSICEVSVKVDEEKENIKNEFGDEIEMVDEAVKSDLEEAKKSTTRKRRRKSEPSSSGDVFACDECDKTFDKKTSYTRHKLTHSTGDEFPCDVCQKRFSSEMVLKRHKSNVHRSKRGKMSDEEDPFSKWYVFADGQKHLKCDQCKYNSLWYHKGRFLEHYRIHTGEKPFTCDLCGKQFRTKTLLQKHVMFVHEGIKEHACDICGRCFSDKRYMEDHRRIHTGERPYVCDLCGKTFKQSASLFIHKELHKGLRKHSCEMCGKRFTTRSSLTIHIKRHVGERNFICATCGKAFVDGKHLKDHMAVHSEERPFSCELCGGRFKLMKHLKQHGRTHKREYAKIDK

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-