Basic Information

Gene Symbol
zfy1_1
Assembly
GCA_951217065.1
Location
OX578273.1:13419919-13439519[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 30 0.0003 0.063 15.4 1.3 2 23 246 267 245 267 0.97
2 30 9.1e-05 0.019 17.0 1.3 1 23 272 295 272 295 0.95
3 30 1.1e-05 0.0024 19.9 1.9 1 23 301 323 301 323 0.99
4 30 3.3e-07 6.9e-05 24.7 0.3 1 23 330 352 330 352 0.99
5 30 3.1e-06 0.00066 21.6 0.5 1 23 355 377 355 377 0.98
6 30 0.0026 0.56 12.4 0.9 1 23 612 634 612 634 0.98
7 30 8.3e-06 0.0018 20.3 1.8 1 23 640 663 640 663 0.96
8 30 0.00026 0.056 15.6 4.7 1 21 669 689 669 690 0.95
9 30 1.9e-05 0.0039 19.2 0.5 2 23 816 837 815 837 0.97
10 30 0.0012 0.24 13.5 1.1 3 23 845 866 843 866 0.93
11 30 4.3e-05 0.0091 18.0 0.5 1 23 872 894 872 894 0.99
12 30 1.6e-05 0.0035 19.4 2.0 1 23 998 1020 998 1020 0.98
13 30 7.9e-05 0.017 17.2 2.9 1 23 1026 1049 1026 1049 0.96
14 30 9.3e-07 0.0002 23.3 0.8 2 23 1133 1154 1133 1154 0.98
15 30 9.7e-07 0.0002 23.2 1.9 1 23 1160 1183 1160 1183 0.96
16 30 0.12 26 7.2 1.2 1 23 1189 1211 1189 1211 0.98
17 30 6.5e-05 0.014 17.5 2.6 1 23 1288 1312 1288 1312 0.99
18 30 3.3e-05 0.0069 18.4 2.8 1 23 1342 1365 1342 1365 0.97
19 30 0.0071 1.5 11.1 2.0 1 23 1371 1393 1371 1393 0.99
20 30 1.5 3.2e+02 3.7 0.0 2 21 1455 1473 1454 1474 0.91
21 30 0.13 27 7.1 0.2 2 23 1609 1630 1608 1630 0.93
22 30 4.5e-06 0.00095 21.1 1.5 2 23 1676 1697 1675 1697 0.97
23 30 5.6e-06 0.0012 20.8 2.8 1 23 1703 1726 1703 1726 0.97
24 30 1.8e-05 0.0037 19.3 0.5 1 23 1732 1754 1732 1754 0.98
25 30 2.2e-05 0.0046 19.0 2.8 1 23 1802 1824 1802 1824 0.98
26 30 1.7e-05 0.0036 19.3 0.2 1 23 1830 1853 1830 1853 0.96
27 30 0.00078 0.16 14.1 0.4 1 23 1859 1881 1859 1881 0.97
28 30 5.7e-05 0.012 17.7 2.1 2 23 1915 1936 1915 1936 0.98
29 30 1.2e-06 0.00026 22.9 0.5 1 23 1942 1964 1942 1964 0.97
30 30 0.047 9.9 8.5 2.7 1 23 1970 1993 1970 1993 0.98

Sequence Information

Coding Sequence
ATGCCTCTGGGATTCTCTAAGGCATGTCGAACATGCTTAACACTCGATGGGACACTTATTTCAATCTACGATACAGTAAGCTACGACGGAAAGGGATGTGTTACCGATATGATTAAAGACTTCACAAAAATCAAACCCATAAAATCGGACGGACTTCCGGAACTTGTTTGTCAGACGTGTTTCAACGAAATTAATCGATGTTATTCATTTAAGGTAAAATGCCTTAATTCCGATTGTACCTTGAGGAAATTATTTGCTACCGAAGAAGAGCTTGTTGGACAACAACAGGTTGTGGTGCAAGATGAAGAGAAAGAATGTAAGCCTATCATTTCCAATCTGACATTGCTTCAAGTCCCTGTTGAGGTACAGCTAGACCAAGAACCACCAccgataaaaattaaaactgaaaaagacCTTGACGCAGACCCATTTGGTTACTTTGTAATAAGTTCGGTTTGTGGACAAAGTGATCAAGTTAATGAAGCTGAAAGCGATAATACAAATAATGGAACCTTCCATACACCACAAATCAAAGAAGAAGTCGATTATGAAATGGATTCATATAATACGTCGTCTACTAGACGCTTAACTAATAGCGATGTGTGTCCAAGTGCCCAATCACTATACCTTGATAATTTTCCCCTAGAATTAACGTACCCTCACATTAAACTGGAAGCTGAAGAATCAATTGAGTCAGAACCTCAAAAAGTAGATTCAATTATTGAGAGCGATATTCGATGCCCAACATGTGGCAAAGAATTCTTATATACTTCACGTCTTAAAAAACACATGCGTTCTCATGCTGCCAAACCTTTTTTCTGTTCAATCTGTACCAAAAAGTTTAAGCTATCTGGGCAACTTAGGCGCCACATCGATGTAAGCCATCGTAATATAAAGCCCTTCAAGTGTGAAGTGTGTTCTAGAACTTTTTCTTTTCCCTCCAAATTGAAAGAGCACTTAAAATCGCATAAAGGCATATTGCAGCCTTTCAAGTGTGAAGAATGTCCTAAAACGTTTGCTTTCCCTTCAAAACTAGAAGAGCACATTAGAACACACAAGACTTATAATTGTGAAGAGTGTCCTAGAACTTTTGTTTACCCTTCAAAATTAAAAGAGCATCTTAGATCACATAGAAAAGTTATTACAAATAtggaaacaaatttcgatgTGTCAatggaaaagaaagaaaacgacGAGAAACCAATTCTGGGTTTCAAGCCAAGCACTGAATGCTTAGAAGATAGCCATTGTGAACAGAAAAAAGAACCTAAATACGACGCAAATATTGCAGGCAGCGGAAACAATCTAGATGAGAACAACGAAGATAAAACTGATGTTAGCATGAATGATCAGCTCCGTTCAACAAGGCGTTCAGCGCGAaatagaaaagaagaaataaacaaTCCAGACGGTGATACAGCCGAGTTTTCTAGTACTAAAGTGAAAACAAGGCAGTCTGAGAAGTCAACACGCATTGATTATTCGTATGAAAGCGATGACCAACACCCAGATGAGAAAAATAGTCCTGAAGGGTCACGTCAGTTTCATTGTCCTCATTGCAGCAAAGATCCTAATAAGACTAAAATTGACTACACCGACAAAGATAATCAAACAAATGAAATGGGCGATGATCTACAAAGTGAACATatgatagaaaaagaaaatacggATGGTCAGAGTAATGCGATGAGCGATAATTTACCAGAGAACGTTGagacaaatgaaataaatagaaaagttGAAGGCGTTCAATTAAAAACGGAGTTGGTACAACAATCAGTCCTTCCGACGACAAAAGAAACAGGAGGCCAAACTAATAATGAACCACGGATTTATGAGTGTCAAAAATGCGATGAaaCATTTACTGTTCTTTCACGTTTTTATGGTCACATGAAGGGTCATGGTGTTGAAAGACCATTCATCTGTAAACACTGTGGCaaaggttttaaaaaatattggaatcTCAATAGTCACATAATGATAAGTCATAAGAAAATAAAGCCATTCAAGTGCAATGACTGCCATCGGGCATTTTCTTGTAAATCAAATCTCAACAAACATATGTCATCCCGCATATGGACTAATAGCTGTACTGAAGACTTTGCAGGGCGTCGTGGtactaaaaagaaaattaaaaaaaaaattgtcccaCAAGACTTAAATATCGAACATGACAATTTTGacgaaaacaaacaaacactaGAGGATAGTTtcgaagaaataaatgaaaacccCAATAATCCACATGAAAACGTTCAGGATCAAAACGTGGTTGGGGAAAACAAAAACGATAATAACACAGACGAAACAAATTTCGTTGAAATAACCCCGGATGTTACCCCGGATATCACTGAACCAGAACCTGAACCATCACGTCATCCTCTGTTTGATTCAATGTCAGAACAAACTACTTCAATTCCAAAGTCCAGTCCCAAGAACAAACTTGAATGTCCTCAATGCAAAAAAGTATTTGTGTATGTCTCCTATCTTAAAAAACACTTGCAAAGCCACAGCACTGAAAGACCTTACGGTTGTTTAAAATGTGACAAAACTTTCAAATATAAAGTGAGCTTAAATTCTCACAATATGATAACTCATGCAAAAGTAAAGCCATATAAATGTGATCAGTGCCCTGAAGCTTTTGTGAATGAATACAATTATAGAGTGCATAGAAATTCACACAATAAACTCAACCATACAAAAGCCAAGTACAAGCCAAGACGTGTAGTTTCAAAACCTCAATATGATGAATTATTAAACATTGCTAATGTTGATAAAAGCGTCAGCAATCTACAAGAAAATACTACAGAAGAAGTTGCTGTTGGTATGAATAATGAACAGACTACGTGTTCGCTATGTTATGTTTCAATTCAGTCAGAGAAGTTAGCAGAGCACATGCAACAGTACCATAATTCAACCGAATCGACTTCTACTCTTGTTAACTCTGTTAATCACCAGAAGGATATTTCAAGTATCGATTCAGAAAACCGACATGAGTGTCCTGAATGCAACAAAGTGTTTTTGTTTGCTTCGTATCTTAAAAAACACATGCGTAGTCATAACCATGAAAGGCCGTTTATTTGCTCAACTTGTGGAAAGACTTTCAAAGACCATTGGAGACTTAGGACCCACATTAGGGAAAAACATACTCATAAGAAGAAGAGGAAATATAATTATCGTGGTAAACCAGCAAGCACAGTTATGGACAATAAAGATGAGAAGGATCCCAATTCTAAAGTTGTAGAAGAAAGTGAAACGACCAATAAACCACAGGATTGGATTAATAATCAACATAATTCAACCGAATCTAAAATCTCTGAAACAAATTCTTCAAAACTAGAAATAGCTAAGCAATCGAAACAGATCGAAAATCGAATTGTTGAATTGGTGAAACCACGGTGTACGATATGCAACAGAGaatttacgTTTGCTTCACAGCTGGAAGCACACATGCGAACGCACACTGGCGAAAGGCCTTTCAGCTGTCCAAACTGTGAAAAAACTTTTACCCAAAAAGGCCATCTCAAAAATCACATcttaataaatcataaaaatataatgccCTTCCAGTGCAGGTTTTGCTTACGAGGATTTGCTAGTGGTTTTAATATGAGAGTGCATGAAAAAACACACACTAAAGAAAACCGCAAATTAGCAAAACCTAATCGAACAAACAATGATAATCTTATGCAAACTCAGAATGAAGATACTACAGCCGGAACGACCAGAGAAACCATGGAAAATCCCCTAGGTAACATCCCAGATCAAATCATACAAGATTTTGCGAAACTTGAGCTCCATCCACTTTTTGGAACAATGTCGAATCAAGATGAAAACACAACCAACGAACCAGATAATTGCTATCGGTGTCCTCGCCTTCATTGCAATAAAACATACAGACTCCTTTCGCAGCTTAATGAACACATGCAATCTCATAAAACTGATAATATGTTACTTCCGCAGCTTAATGAGCGAATACAAACTAATACTATTACTAATACTAATACCAATACTGATAAACCATTTACTTGCAAAATCTgtaataaaactttcaaaaaggAACCTCATCTGAAAGGGCACATACAGATGAGTCATACAAAAACAAGGAACTACAAGTGTAATGAGTGTGATCGAGATTTCCTTACATTCTATATTCTCAATAAACATATGAGATCACACAAAGGAAAGAAAAAACGCAGTAAAGGCAAAACTAAATCTAAACGTGGTCTTAAGATaggaataaataatttaaaacaaactcaTCATTCCGAACAGCTATGTAGCACAAGTGTAGACGACCATACAACTAAAGATCAAACCACACATAGAGCTGCTGAAACGCCCAACAATAGTGTAAGAATAACCTGTGCACTGTGCTCCGTTTCAGTTGATTCAGACAATTTAACTGATCATATGAAAGCGTGCCTTAAGCCATCCCAATCTACGATTGAACTGAATTTAGATCTCGGCCCACAACATTTAGGCCATCCTCTTTTCGAAACATTGTCAAACCAGGATGAGCAACCAACTGAAACCGTGAAACAAtgttttgaatgcaaaaaagTGCATTCAAAAACTCACACTGCAAGAGCCAATCGTCAAATAAGAAGTTATAAAAAACGTGCTGTTGTCAAAAAGAGTTATTACAAAGATCCGCCTTATCATGCTAAGCCTATTCATACGGAAAATCCAGATAAAACAACTACTGATacacagaaaaaagaaaacgaaaatgaaaacaataaaaatgacgATATATTTGCAAATTCATTGGAACAATACAAAGAATCGCAATATGCTGTTGATCGAATTGAAGAGTATATTAGATGTACGTTATGTTTTGTTTCAGTACAACCAGATAATTTATCAAAACACATGCAAGAGGTTCATAATACACCCGAATCTTTAACGTTCGTGGCAATCAAGAAAGAACCTTGTGAAGAGACAGAACAATTAGAACATCCTCCGTTGGAAATTGAACTTAATCAAACCCCAACAGCAAAGTTTGATTCACCACCCGGTCTTCAATGTCCGACGTGCGATAAAGAATTTGTGCACCTTTCGCGTCTTAATGAACACATGCGAACCCACAGTGATGAAAGACCATATATTTGCTTAGTGTGTGGTAAGACCTTCAAACGTCATGGTCACCTAACGACCCACATTGCGACAAATCACAAAAAGGAAAAGCCATTTAAATGCAAGGAGTGCCCACTAACGTTCTCAAGTAGCTTAAACCTGAATGTTCATTTGCAAGAACACATTAGCGAAGAGCTTCCTGTGCGTCTTCCCGTTGAAACAAGTTTATTAAACGAAGACAATTTACAGCCAGCTCAGAATGATGGGCCTTTATGTTATAATATGGATATCCAACAAACAGAATGTAGTACAACTGATGAGGCCCATCAGTGTACCCTCTGTCCTACAGtatttaaaacaatttcacaTCTCATGGAACACATGAAGGTCCACACCAGTGATCAACCATTTATTTGTTCTCTTTGCGGCAAGGGTTTCAGTAGTAATGATGACCTCTCCACTCATCTTCTCATAAGGCATAACTATTCCAAGTCTTTCAAATGTGACCAGTGCGCACGAAAATTTATAACTATCTCAGGATTGAAGAAACATCTAGTGATTCATACTGGTTACCGTCAACAGATGCTAAAAGCAATTCGCcgtgaaaaactaaataaaaaacctGATAACCAAGAATTATTAAGCTATTCGGAATACGTGCCTCAGTGTCATATCTGTAAAAAAGaATTTACCAACAATTGGGTTCTTAAAGAACACATGAAAATTCACACCAATGAAAGACCATTTATCTGTCCAATTTGTGGACATGCTTTCTCCAAGAAATATAATCTCAAAATTCATATAATggcacataaaaaaataaaacccttcAAGTGTGAAAACTGTTCGCTATCGTTTCTCACAAATGCTGGACGCAATCACCATATGCggacacaacacaaaaaaacagaattggatTTATCGAACTCTGAAAATTTGATACTCTAA
Protein Sequence
MPLGFSKACRTCLTLDGTLISIYDTVSYDGKGCVTDMIKDFTKIKPIKSDGLPELVCQTCFNEINRCYSFKVKCLNSDCTLRKLFATEEELVGQQQVVVQDEEKECKPIISNLTLLQVPVEVQLDQEPPPIKIKTEKDLDADPFGYFVISSVCGQSDQVNEAESDNTNNGTFHTPQIKEEVDYEMDSYNTSSTRRLTNSDVCPSAQSLYLDNFPLELTYPHIKLEAEESIESEPQKVDSIIESDIRCPTCGKEFLYTSRLKKHMRSHAAKPFFCSICTKKFKLSGQLRRHIDVSHRNIKPFKCEVCSRTFSFPSKLKEHLKSHKGILQPFKCEECPKTFAFPSKLEEHIRTHKTYNCEECPRTFVYPSKLKEHLRSHRKVITNMETNFDVSMEKKENDEKPILGFKPSTECLEDSHCEQKKEPKYDANIAGSGNNLDENNEDKTDVSMNDQLRSTRRSARNRKEEINNPDGDTAEFSSTKVKTRQSEKSTRIDYSYESDDQHPDEKNSPEGSRQFHCPHCSKDPNKTKIDYTDKDNQTNEMGDDLQSEHMIEKENTDGQSNAMSDNLPENVETNEINRKVEGVQLKTELVQQSVLPTTKETGGQTNNEPRIYECQKCDETFTVLSRFYGHMKGHGVERPFICKHCGKGFKKYWNLNSHIMISHKKIKPFKCNDCHRAFSCKSNLNKHMSSRIWTNSCTEDFAGRRGTKKKIKKKIVPQDLNIEHDNFDENKQTLEDSFEEINENPNNPHENVQDQNVVGENKNDNNTDETNFVEITPDVTPDITEPEPEPSRHPLFDSMSEQTTSIPKSSPKNKLECPQCKKVFVYVSYLKKHLQSHSTERPYGCLKCDKTFKYKVSLNSHNMITHAKVKPYKCDQCPEAFVNEYNYRVHRNSHNKLNHTKAKYKPRRVVSKPQYDELLNIANVDKSVSNLQENTTEEVAVGMNNEQTTCSLCYVSIQSEKLAEHMQQYHNSTESTSTLVNSVNHQKDISSIDSENRHECPECNKVFLFASYLKKHMRSHNHERPFICSTCGKTFKDHWRLRTHIREKHTHKKKRKYNYRGKPASTVMDNKDEKDPNSKVVEESETTNKPQDWINNQHNSTESKISETNSSKLEIAKQSKQIENRIVELVKPRCTICNREFTFASQLEAHMRTHTGERPFSCPNCEKTFTQKGHLKNHILINHKNIMPFQCRFCLRGFASGFNMRVHEKTHTKENRKLAKPNRTNNDNLMQTQNEDTTAGTTRETMENPLGNIPDQIIQDFAKLELHPLFGTMSNQDENTTNEPDNCYRCPRLHCNKTYRLLSQLNEHMQSHKTDNMLLPQLNERIQTNTITNTNTNTDKPFTCKICNKTFKKEPHLKGHIQMSHTKTRNYKCNECDRDFLTFYILNKHMRSHKGKKKRSKGKTKSKRGLKIGINNLKQTHHSEQLCSTSVDDHTTKDQTTHRAAETPNNSVRITCALCSVSVDSDNLTDHMKACLKPSQSTIELNLDLGPQHLGHPLFETLSNQDEQPTETVKQCFECKKVHSKTHTARANRQIRSYKKRAVVKKSYYKDPPYHAKPIHTENPDKTTTDTQKKENENENNKNDDIFANSLEQYKESQYAVDRIEEYIRCTLCFVSVQPDNLSKHMQEVHNTPESLTFVAIKKEPCEETEQLEHPPLEIELNQTPTAKFDSPPGLQCPTCDKEFVHLSRLNEHMRTHSDERPYICLVCGKTFKRHGHLTTHIATNHKKEKPFKCKECPLTFSSSLNLNVHLQEHISEELPVRLPVETSLLNEDNLQPAQNDGPLCYNMDIQQTECSTTDEAHQCTLCPTVFKTISHLMEHMKVHTSDQPFICSLCGKGFSSNDDLSTHLLIRHNYSKSFKCDQCARKFITISGLKKHLVIHTGYRQQMLKAIRREKLNKKPDNQELLSYSEYVPQCHICKKEFTNNWVLKEHMKIHTNERPFICPICGHAFSKKYNLKIHIMAHKKIKPFKCENCSLSFLTNAGRNHHMRTQHKKTELDLSNSENLIL

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-