Basic Information

Gene Symbol
ZFY
Assembly
GCA_963576655.1
Location
OY755085.1:27743362-27748380[-]
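
The Location value packs a sequence accession, a coordinate range, and a strand into one string. A minimal sketch of splitting it apart follows. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers, not part of this record, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'seqid:start-end[strand]' string."""
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings such as 'OY755085.1:27743362-27748380[-]'.
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("OY755085.1:27743362-27748380[-]")
print(loc.seqid, loc.start, loc.end, loc.strand, span_length(loc))
```

The `[-]` suffix is read here as the minus strand.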

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the following pattern:
#-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
Here X can be any amino acid, and the numbers give the number of residues at that point (ranges are shown in brackets). The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
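
The pattern in the description can be approximated with a regular expression. The sketch below is illustrative only: it treats every `#` position as "any residue" (the description does not say which residues are allowed there), starts the match at the first conserved cysteine, and does not report overlapping matches. The function name `find_c2h2_candidates` is hypothetical. A profile HMM search such as the hmmscan output in this record is far more sensitive than this simple scan.

```python
import re

# C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], with '#' relaxed to any residue.
# Between the second Cys and the first His there are 3 + 1 + 5 + 1 + 2 = 12
# residues, so that stretch collapses to '.{12}'.
C2H2_RE = re.compile(r"C.{1,5}C.{12}H.{3,6}[HC]")


def find_c2h2_candidates(protein: str):
    """Return (start, end, matched_text) tuples with 1-based inclusive coordinates."""
    hits = []
    for match in C2H2_RE.finditer(protein.upper()):
        hits.append((match.start() + 1, match.end(), match.group()))
    return hits


# Short fragment taken from the start of the protein sequence in this record.
fragment = "HECEICNRKYRSLKDVKIHKSRVHLK"
for start, end, text in find_c2h2_candidates(fragment):
    print(start, end, text)
```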
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 48 0.0097 0.89 10.7 2.0 1 23 16 39 16 39 0.95
2 48 1.9e-05 0.0017 19.2 0.8 2 23 46 68 45 68 0.96
3 48 0.00053 0.049 14.7 0.7 2 23 74 96 73 96 0.96
4 48 6.4e-05 0.0058 17.6 3.6 1 23 101 123 101 123 0.98
5 48 0.012 1.1 10.4 1.2 1 23 132 154 132 154 0.95
6 48 0.00044 0.04 14.9 0.0 1 23 161 183 161 183 0.98
7 48 0.00069 0.063 14.3 4.6 1 23 189 211 189 211 0.98
8 48 3.9e-06 0.00035 21.4 0.3 1 23 218 240 218 240 0.99
9 48 0.0064 0.58 11.3 2.9 1 23 246 268 246 268 0.98
10 48 3.8e-05 0.0034 18.3 5.6 1 23 274 296 274 296 0.98
11 48 1.8e-06 0.00016 22.4 0.5 1 23 302 324 302 324 0.96
12 48 9.9 9.1e+02 1.2 0.3 1 9 330 338 330 340 0.89
13 48 0.011 1 10.5 2.0 1 23 343 366 343 366 0.96
14 48 0.014 1.3 10.1 0.4 1 23 371 394 371 394 0.95
15 48 4.9e-05 0.0045 17.9 0.7 2 23 401 423 401 423 0.97
16 48 0.00028 0.026 15.5 0.2 2 23 429 451 428 451 0.89
17 48 0.0007 0.064 14.3 3.9 1 23 456 478 456 478 0.98
18 48 0.00011 0.0099 16.8 3.3 1 23 487 509 487 509 0.99
19 48 3.5e-05 0.0032 18.4 0.1 1 23 516 538 516 538 0.96
20 48 1.3e-05 0.0012 19.8 2.8 1 23 544 566 544 566 0.99
21 48 1.7e-05 0.0015 19.4 3.5 1 23 574 596 574 596 0.97
22 48 8.1e-05 0.0074 17.2 1.7 1 23 602 624 602 624 0.99
23 48 1.1e-06 0.0001 23.1 4.6 1 23 630 652 630 652 0.99
24 48 9.8e-05 0.009 17.0 0.3 1 23 658 680 658 680 0.96
25 48 0.075 6.9 7.9 0.5 1 14 686 699 686 700 0.94
26 48 0.85 78 4.6 4.7 1 23 703 726 703 726 0.95
27 48 0.0017 0.16 13.1 0.9 1 23 731 754 731 754 0.95
28 48 1.7e-05 0.0015 19.4 1.1 2 23 761 783 761 783 0.97
29 48 0.0013 0.12 13.5 0.2 2 23 789 811 788 811 0.93
30 48 0.00098 0.09 13.8 3.4 1 23 816 838 816 838 0.98
31 48 5.9e-05 0.0054 17.7 1.9 1 23 847 869 847 869 0.98
32 48 1.5e-05 0.0013 19.6 0.1 1 23 876 898 876 898 0.97
33 48 9.5e-05 0.0087 17.0 5.8 1 23 904 926 904 926 0.99
34 48 4.2e-06 0.00038 21.3 0.1 1 23 933 955 933 955 0.98
35 48 0.0059 0.54 11.4 2.7 1 23 961 983 961 983 0.98
36 48 0.0002 0.018 16.0 3.2 1 23 989 1011 989 1011 0.98
37 48 8.3e-06 0.00076 20.3 3.1 1 23 1017 1039 1017 1039 0.99
38 48 9.5e-06 0.00087 20.2 2.6 1 23 1045 1067 1045 1067 0.96
39 48 0.0074 0.68 11.0 4.3 3 23 1088 1109 1086 1109 0.93
40 48 0.018 1.7 9.8 2.0 1 23 1114 1137 1114 1137 0.96
41 48 3.7e-05 0.0034 18.3 0.9 3 23 1145 1166 1144 1166 0.96
42 48 0.026 2.4 9.4 0.1 3 23 1173 1194 1171 1194 0.94
43 48 5.8e-05 0.0053 17.7 5.0 1 23 1199 1221 1199 1221 0.99
44 48 0.00044 0.041 14.9 1.5 1 23 1230 1252 1230 1252 0.98
45 48 0.0001 0.0093 16.9 1.6 1 23 1259 1281 1259 1281 0.98
46 48 8.9e-05 0.0081 17.1 6.9 1 23 1287 1309 1287 1309 0.99
47 48 2.4e-06 0.00022 22.0 0.6 1 23 1315 1337 1315 1337 0.98
48 48 0.024 2.2 9.4 2.1 1 21 1343 1363 1343 1364 0.94
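
Each row above carries thirteen whitespace-separated fields. A rough sketch of loading the rows and keeping only the better-supported domains follows. The `DomainHit` class, the function names, and the `0.01` i-Evalue cutoff are all assumptions made for illustration; the record itself does not define a threshold.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """One row of the per-domain table (field names are illustrative)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(line: str) -> DomainHit:
    """Parse one whitespace-separated row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        int(f[6]), int(f[7]), int(f[8]), int(f[9]), int(f[10]), int(f[11]),
        float(f[12]),
    )


def confident_hits(rows, max_i_evalue=0.01):
    """Keep rows whose i-Evalue is at or below an ASSUMED cutoff."""
    hits = [parse_domain_row(row) for row in rows]
    return [hit for hit in hits if hit.i_evalue <= max_i_evalue]


rows = [
    "8 48 3.9e-06 0.00035 21.4 0.3 1 23 218 240 218 240 0.99",
    "12 48 9.9 9.1e+02 1.2 0.3 1 9 330 338 330 340 0.89",
]
for hit in confident_hits(rows):
    print(hit.index, hit.ali_from, hit.ali_to, hit.i_evalue)
```

With these two sample rows, only domain 8 passes the assumed cutoff; domain 12 covers just HMM positions 1 to 9 and has a very high i-Evalue.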

Sequence Information

Coding Sequence
ATGACCCTATCCCGCCATAAAAATAACATCCACGGCAAACGCCCACACGAGTGCGAAATTTGCAATAGAAAATATCGCAGCCTCAAAGACGTCAAGATACATAAATCACGAGTACATTTAAAACTCACCACCGTGATGTGCGGTTTCTGCGACAAAACCTTCAAAGACAAACGCTATCTAGTGACGCACGTCAGAACATTGCACTATAACGAGAGGATATCCTGTGACCAATGTCAGGCAACCTTTTCCACATTTAAATTACTTCAAAACCACCAGAATTCAGTGCATTTAAAGATAACATACCCATGCGAGCATTGCTCCAAAGTATTCACGAATCTACAGTACTACAGGAACCACCTGAAATCGCACGGATCCGATTACGCACAGAAAATATTCCTATGCGAGATATGTCCCAAGCGTTATCTTTATCGGCATGGTCTCGCTCGGCACATGAAAGACCACTTGGGCATTACCGACAGCTTCGTTTGCGAAATTTGTGGAGAGAACCTCACCAGCGTCGCAGATCTCAGAGACCACAGGATGACTCATACAGAAGACAAGCCTTTCCACTGCGAAAGCTGCGATAAGTCGTTCAGAAAGAGGCTCAGCTTGGCGCAGCACAAGCGGACTCACAATTCGGACAGAACCCTCTATGTTTGCGATTTTTGTGGCAAAGCTCTCACTTCCAATTATAGTCTGCAAGTGCATCTGAGAATGCACACTGGCGAGGCACCGTTCCAATGTGATATGTGCCCGAAAAGATGCAGGAGCAAGCAGAGAGTGGAAGTACACAAACGTGTGCATACTGATGAGAAGTTTTTCCACTGCGATATCTGTTCGAAGAGCTTCAAGAATAGACAGAACATGAAGAGGCACAGGCTCACTCACAATAATGAGAAACCTCATGTATGCGATGTGTGCGATAAATCTTTCAGCCAAAATTCGAATCTGGTGATTCACAAGGCGATTCACACTGGACTGCGACCGTATCAGTGTGGTATTTGTTCCAAAAAATCTGCTGCTGGTTACCAGTGTGATGAGTGTGACTCTTACTTCAGGAAATACTTGAATTTAGTTCGCCACAAACAGAACACGCACGGCAGCCGACCATACGAGTGCGACATTTGTGAAAGGAAATACCGCAGCATCAGAGATGTGAAAGTGCACAAGATGCAAGTCCATTTCAAACTCAAGTCCGCCAAATGCGGTTTCTGTGACAAGATATTCAACGACAAGCGTTACTTGGTCAGACACATCAACTCCGACCATTTCAATACGAGAATCTCTTGCGAGCAATGCGATGCAACTTTCACGAACAAAACGTCGCTCAAAAATCACGTTGACGCAGTCCATTTGGGCATCACGTATCCCTGCGAGCATTGCTCGAAGATCTACACTAGCTTGCAGTATTACAGGAAGCACCTTAAATCGCACAGCTTCGGCTACGAGAAGAAGATTTTCCAATGCGATATTTGTCCCAAGCACTACCAGTACGAGCACGGCCTTGCACGTCACATGAAAACCCATTTAGGAAACGCTCTCAGCTTCATTTGCGAAATTTGTGGTAAACGTTTAACCAACAGCACCGATCTTAGAGACCACATCTTGGGCCACGGTACCGAGAAACCGTACAAATGCGAGCTGTGTGACAAATCTTTCAAGAGATTGGTGAAATTCAATCTCCACCGTCGAATGCACAGCACCAAAGAGAGGTCGTTCCACGTTTGCTACATCTGCGGAAAATGTTTGAGCTCCAATAGCAGTCTACAAATGCATCTGAGGATGCACAGTGGCGAGGCTCCGTTCCAATGCGATATTTGCCCGAAGAGATGCAGGACGAAGGAGACTGTGGAGGCACACAAGCGAGTACATACTGACGAGAGACCCTTCCAGTGTGATATTTGTCCGAAAAGCTTCAAGAACAGACATAATATGAAGAGGCACAGGCTGACCCATAGTGATGAGAAGCCGCACGTATGTGACGTGTGTAAAAAAGCTTT
CAGTCAAAATGCCAGTCTCGTCATTCACAAGGCGATTCACACTGGGGTGCGACCGTACCAATGTGATGTTTGTTCCAAACAGTTTGTTACAAAatccgCTGCTGGTTTCCATTGTGTACAATGTAATTCTTACTTCAGGACATATTTGAATTTGGTTCGTCACAAAAAGAACACGCACGGAAAGCGTCCTTACGAGTGCGACATTTGTAAAAAGAAATATCGAAGCCTCAGAGATGTAAAAGTGCACAAGATGCAAGTCCACCTCAAACTCAAATCTGCCAAGTGCGGTTTTTGCGATAAGACATTCAACGACAAGCGCTACTTGGTCAGGCACATCAACTCTGACCACTTCAATGCAAAAATCCCTTGCGAGCAATGCGATTCAACTTTCACCAACGAAAAATCACTCAAAAATCACGTTGATTCAGTCCATTTGGGCATCACGTATCCCTGCGAGCATTGCTCGAAGATCTACACTAGCTTGCAGTATTACAGGAATCACCTAAAATCGCACAGCCCTGGCTACGAAAAGAAGGTTTTTCAATGCGATATCTGTCCCAAACGGTACCAATACCGACACGGCCTTGCTCGGCATATGAAAGAACATTGCGGCAATGCTCTGAGCTACGTTTGCGAAATTTGCGGCAAACGTCTAACCAACAGCACCGATCTGAGAGACCATATCTTGGGCCACGGTAGCGAGAGACCGTTCCAGTGCGAGTTTTGCGACAAATCTTTCAAGAAGATGGTGAATTTCAATCACCACCGACGCATGCACACCGAAGAGAGGTCCTTCTACGTTTGCTACATCTGCGGCAAAGGCTTGAGCTCTAATAACAGTCTGCAAGTGCACATGCGCATCCACACTGGCGAGACACCTTTTCAGTGTGACATCTGTCAGAAGAAGTGCAAGAGCAAATTCGGAGTGGAAGTGCACAAACAGGTGCACACCGATGATGCGCCCTTTCAGTGTGAGTTTTGCCCAAAGAGGTGCAAGAGCAAGCAGCGATTAGAGAGACACATGGCGGTGCATTCCGACGAGAGACCCTTCCAGTGCGACATCTGCCAGAAAGACTTcaagaacagacagaatatgaAGAGACACAGGTTAACCCATACTAATGAGAAGCCACACGTATGCGACGTGTGCAATAAGTCTTTCAGTCAACATTCGAATCTCATCACTCATAAGGCTATCCATACTGGAAATCGACCTCACCAGTGTGTTGTTTGCTTGAGGAAATCCGCAGCTGGCCATCTCTGCGTAGACTGTGATTACTACTTTAGGACTCACGTGAACTTAATTCGTCACAAAAAGAACACTCACGGCAAACGACCGCACTCATGCGATATTTGTAACCAGAAATACCGCTCTATTAGCGACGTTAAGATGCATAAGAACCGTGTCCATCTCAAACTCAAGTCAGCAATGTGTGGTTTCTGTGACAAAAAGTTCAATGACAAACGCTATCTGATCAGGCACATCAACTCTGAAcacttcaacaaaaaaattgccTGCGATCAGTGCGAAGCAACTTTTTCCACAGATAAGCTACTGAGCAACCATCTCGATTCAGTACATCTGAAGATCACTTACACTTGCGAGCATTGCTCGAAGATTTTCACCAGTCAGCAGTATTACAAGAACCATATTAAATCCCACAAACCAGACTATAAGAAGCAAATATTCCAGTGCGCTATATGCCCCAGGCGTTATCAATACCGACACGGTCTGGCCCGTCATATGAAAGACCACTTGGGCATGACTGCCATCTTCATTTGCGAAATTTGCGGACGTCACATGACTAGCAGCACTAGCCTCAAAGAACATATGAGGAGTCACACCGGAGAGAAACCGTTCCAGTGCGAGTTTTGTcacaaaactttcaagaagaagatGACTCTCAGACAGCACAGACGAATACATACTAATGAGAAGCCCTTCGTTTGTGACGTGTGCAGCAAGGCTTTCACTCAGCAAAGCAGCTTGATTGTGCACA
AGAGATACCATACCGGGGAGCGACCCTACGAGTGTTCCATTTGTTTGAAGAAATTCGCTACGATAACCAATATAAAGAGACATAAGTGCAAAGTGTGA
Protein Sequence
MTLSRHKNNIHGKRPHECEICNRKYRSLKDVKIHKSRVHLKLTTVMCGFCDKTFKDKRYLVTHVRTLHYNERISCDQCQATFSTFKLLQNHQNSVHLKITYPCEHCSKVFTNLQYYRNHLKSHGSDYAQKIFLCEICPKRYLYRHGLARHMKDHLGITDSFVCEICGENLTSVADLRDHRMTHTEDKPFHCESCDKSFRKRLSLAQHKRTHNSDRTLYVCDFCGKALTSNYSLQVHLRMHTGEAPFQCDMCPKRCRSKQRVEVHKRVHTDEKFFHCDICSKSFKNRQNMKRHRLTHNNEKPHVCDVCDKSFSQNSNLVIHKAIHTGLRPYQCGICSKKSAAGYQCDECDSYFRKYLNLVRHKQNTHGSRPYECDICERKYRSIRDVKVHKMQVHFKLKSAKCGFCDKIFNDKRYLVRHINSDHFNTRISCEQCDATFTNKTSLKNHVDAVHLGITYPCEHCSKIYTSLQYYRKHLKSHSFGYEKKIFQCDICPKHYQYEHGLARHMKTHLGNALSFICEICGKRLTNSTDLRDHILGHGTEKPYKCELCDKSFKRLVKFNLHRRMHSTKERSFHVCYICGKCLSSNSSLQMHLRMHSGEAPFQCDICPKRCRTKETVEAHKRVHTDERPFQCDICPKSFKNRHNMKRHRLTHSDEKPHVCDVCKKAFSQNASLVIHKAIHTGVRPYQCDVCSKQFVTKSAAGFHCVQCNSYFRTYLNLVRHKKNTHGKRPYECDICKKKYRSLRDVKVHKMQVHLKLKSAKCGFCDKTFNDKRYLVRHINSDHFNAKIPCEQCDSTFTNEKSLKNHVDSVHLGITYPCEHCSKIYTSLQYYRNHLKSHSPGYEKKVFQCDICPKRYQYRHGLARHMKEHCGNALSYVCEICGKRLTNSTDLRDHILGHGSERPFQCEFCDKSFKKMVNFNHHRRMHTEERSFYVCYICGKGLSSNNSLQVHMRIHTGETPFQCDICQKKCKSKFGVEVHKQVHTDDAPFQCEFCPKRCKSKQRLERHMAVHSDERPFQCDICQKDFKNRQNMKRHRLTHTNEKPHVCDVCNKSFSQHSNLITHKAIHTGNRPHQCVVCLRKSAAGHLCVDCDYYFRTHVNLIRHKKNTHGKRPHSCDICNQKYRSISDVKMHKNRVHLKLKSAMCGFCDKKFNDKRYLIRHINSEHFNKKIACDQCEATFSTDKLLSNHLDSVHLKITYTCEHCSKIFTSQQYYKNHIKSHKPDYKKQIFQCAICPRRYQYRHGLARHMKDHLGMTAIFICEICGRHMTSSTSLKEHMRSHTGEKPFQCEFCHKTFKKKMTLRQHRRIHTNEKPFVCDVCSKAFTQQSSLIVHKRYHTGERPYECSICLKKFATITNIKRHKCKV
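
The protein sequence can be cross-checked against the coding sequence by translating it. The sketch below assumes the standard genetic code and a sequence that starts in frame. The `translate` function is a hypothetical helper written for this illustration, and it upper-cases the input because the coding sequence contains some lower-case bases. If the coding sequence is wrapped over several lines, join the lines before translating.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; unknown codons become 'X'."""
    seq = "".join(cds.split()).upper()  # drop whitespace, normalise case
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First six codons of the coding sequence in this record.
print(translate("ATGACCCTATCCCGCCAT"))  # MTLSRH
```

The stop codon is rendered as `*`, so a full-length translation ends with one extra character compared with the protein sequence listed above.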

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-