Basic Information

Gene Symbol
-
Assembly
GCA_963576895.1
Location
OY756375.1:15584956-15596924[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 32 1.3 61 4.8 4.3 3 23 169 190 169 190 0.98
2 32 0.00089 0.04 14.8 1.1 1 23 199 222 199 222 0.98
3 32 1.8 82 4.3 5.5 3 23 277 298 277 298 0.98
4 32 0.00014 0.0062 17.3 0.7 1 23 307 330 307 330 0.98
5 32 0.23 11 7.2 1.7 3 23 360 381 360 381 0.98
6 32 0.0002 0.0089 16.8 2.9 1 23 391 414 391 414 0.96
7 32 0.027 1.2 10.1 0.7 3 23 453 474 452 474 0.97
8 32 0.042 1.9 9.5 0.4 1 23 483 506 483 506 0.97
9 32 0.097 4.4 8.4 1.2 2 23 540 562 539 562 0.96
10 32 0.24 11 7.1 1.0 1 23 571 594 571 594 0.96
11 32 0.15 6.9 7.7 0.7 2 23 628 650 627 650 0.96
12 32 0.76 34 5.5 1.9 1 23 659 682 659 682 0.94
13 32 3 1.3e+02 3.7 0.6 3 23 775 796 775 796 0.95
14 32 0.00012 0.0052 17.6 0.8 1 23 805 828 805 828 0.98
15 32 0.00032 0.014 16.2 0.2 1 23 899 922 899 922 0.98
16 32 0.0023 0.1 13.5 0.2 2 23 941 963 940 963 0.94
17 32 0.0053 0.24 12.3 0.1 1 23 972 995 972 995 0.95
18 32 0.041 1.9 9.5 0.9 1 23 1054 1077 1054 1077 0.96
19 32 0.012 0.52 11.3 0.2 2 23 1098 1120 1097 1120 0.96
20 32 0.37 17 6.5 0.1 2 19 1165 1182 1164 1184 0.92
21 32 0.23 10 7.2 0.8 1 23 1198 1221 1198 1221 0.91
22 32 9.6 4.3e+02 2.1 0.1 2 23 1256 1277 1255 1277 0.86
23 32 0.0013 0.058 14.3 0.5 1 23 1289 1312 1289 1312 0.93
24 32 0.28 13 6.9 0.0 1 23 1341 1364 1341 1364 0.91
25 32 5.4e-05 0.0024 18.6 1.3 2 23 1376 1398 1376 1398 0.96
26 32 0.3 13 6.8 0.4 3 23 1414 1435 1413 1435 0.94
27 32 0.048 2.2 9.3 0.6 2 23 1498 1520 1498 1520 0.95
28 32 0.0028 0.13 13.2 3.1 3 23 1533 1553 1531 1553 0.96
29 32 1.1 48 5.1 5.7 1 23 1586 1610 1586 1610 0.97
30 32 0.26 12 7.0 0.4 2 23 1618 1639 1617 1639 0.96
31 32 0.67 30 5.7 5.1 1 23 1646 1668 1646 1668 0.95
32 32 0.38 17 6.5 4.2 2 23 1673 1695 1672 1695 0.96

Sequence Information

Coding Sequence
ATGGAGGAGTGGGATAACATGATGAGTTATGCGGACGAGGGTGACAGATCCAACAGTGTGTTGAACATCTTGCAAGATCCCCTGTTGGATGCTAAACTAatagatacaaataatattaaaatagaaGTCCTAGATGAGgcTGATGCTATGGTGGATGGTGATGGAGCCCCCTTAGTATTTAACGATAAAAGAATAGAAGTAAAGAAAGAAGGAAGGAGCTCAGTCACTAAGAAACATAAGAAGAAGAAAGCTAGTTATAAATGGGAAGACGATGACGAGGAAGACAATGAAGTGTTAGCTGTAATAAAagtaaagaaagaaagaaaggatGCAGAGAATAGAACCAATAAGAAGAAAGGCGGAAGTCATAAGTCAAAGGAACATTCCATTAAGAAAGATAAGTTAAATGCAAAGATTAAGAGTCTAAAACACAAAGTGATGGAGGAGCAGGAGGAACTAAAGTTGAAGGAGTGCGGGAAGCAGGACGAGGAGGGCAGCTGGTGGTGCTCAAAATGCGGCACGCGGTGCGCGCAGAGAGTCAAGTTCCTCTACCACCTGAAGAGGTTCCACAATATAGAGATTGGCAACTTTGAGTACAAGTGTCCGCACTGTGACATGGAGTTTATGAATGATATGGCTCTGAGAGCTCACTACAAGCGACTGCACGCCGGCGAAGAACTCAAGGTTGAATCGGATGTCCTGATGGATGGTGATACATCACACACTTCTACTAAATATCAACACAGAATAGAAAAgaATCCTGATAAACTAGAGGAAGAGCAACTCTTGAAAGAGTGCGGGAAGCAAGATGAGGAAGGTAGCTGGTGGTGCTCAAAGTGCGGCACGAGATGCGCTCAGAAATTCAAATTCCTCTATCACCTCAAGagatttcataatattaagATAGGCAACTTTGAATATAAGTGTCCACATTGTGATATGGAGTTTAGGAGTGATATGGCCTTGAGGGCCCATGTGAAACGTGTGCACGCCGATGAGGACCCTAAGTTTGAACATGACCCCGACAGAGTTCTGATGATTGAGAATGGCCACCAGGACGATGAAGGCATATGGTGGTGCACCAAGTGTGGCCTGCGTTACGCCAAGGAGCATCTGTTTGTCATTCACCTGAAGAAGTTACACCAAgtggggaggcctatatccactAACCACGTGTGCAAGTTCTGTAATTATCAATTCACTCAGAAGGGAGCTCTCAATAGACATGTGAGGAATATGCATCCAGATAAACAGGCAGCGAGTTCAGAGgagTCCAGTGAGTCATCGAATCTCTTAGTCGAGGAATACAATAAACTAAACAAACACGGAAAGAAAAACGAACACGGGACGTGGTTATGTAACAACTGCAGTGCAATGTACGGCCGTAAGAGTGCGTTCATTAAGCATCTGAAGAGCTCTCACAATATAGATGTTACGAGCGTGAAGTTTGACTGTCCGTCGTGTGATGAATACTTCCTCACACAGAGGGCGTTAAACGATCACCTGAACACTAAACATCTTTCTGCTGATGCTgagACCAGCGACGGCGGGGAGACTAACAAAGAAGATTACGAGAAACTTTTGCAATACGGCAAGAAGAACGAGACGGGTGCATGGGTTTGTAATGAGTGCAGCTCCAAGTTCACTACGCAAGTCCGCTTCATATATCATCTCAAGAAGAACCACAATATCATCGCTAAAGACTTCAACTTCCCCTGTGTACAATGCACCATGAGTTTCCTGTCGTTGAAAGCTCTCCGGGCGCACATCAACAATTGTCATCTCGAAGTCACGgATACCCAAGAAGGCTCAATTACAGATTTCTTAAAGGACAAAGACAAGATCAGTTCGGAGGGCACGCGGTGCGAGGACGGCAGTTGGGAGTGCAACCGCTGCGGCACCAAGTACGCCCGGGACAGCGTGTTCATGTACCACCTCAAGCGCTACCACGGGGTGGACCTGGGGAGCTTCCAGCACAAGTGTCCCTACTGTAAGCTGGATTTCATGGCCTTCACGGGACTCCGCTATCACATACTCAACAAACATCCTAAGCAGTACGATAAATTTATAGCTAAAATGCCACTATCAAAGCAGTCTGAAGTAGAATCAGAGGAATACGAAACAGACGACGATGATAATATTGCACTGTCTGAATTAAAGAAAGGGGAGGGCCCCGGGTCCAGTAAGCCGAGTACGTCCACGAAACCTCGAAAGAGGAAACTAGACGCGGACAGAGAGAGAGACGCAAAGGGGAAACAACACACGCTCGAGCAGGAGATCATGAACAGAGAGGGAAAGAAGGACTTGGAAGGGTCGTGGTGGTGCAGCAAGTGCGGACTGCGTTACTCCAAAGACGGCGTCTTTCTCTACCACTTGAAGCGCGCCCACAACGTCACCATCGGAAGCTTTGAATACAAATGTTCGTATTGCAAGATTGAGTTTATCAGTAAGCCAGCGCTGAGGACGCACATAAACAGGACGCACCCGGAAGCTAACGAAAACTTGTTTAGgAACTCGTCTGATTTCTTGCAAGCCAGTATACTGGAGAACCACGCCAAGATCATAGAAGCCAACTGCCAAAAGGACACCAACAACCATTGGGCGTGCAAAATGTGCGGATCCAAATTCGTCAAAGACACAATGTGTTTGTATCATCTCAAACGTGACCACAACATCTTCATACAAGGATTCGATTACAAGTGCAATCTGTGCGATATGGATTACAGTAATCCGCAAGCGTTGAGGATCCATGTTAAACGAGTCCACGGTGATGACGAAGACAAAATATTAGACAAATGTATCGTTAAAGACAGCAGATGGGAGTGCAAGGAATGTCCGAAGGTTTTCATTAAGCAAGCTAGTTTCGCTTATCACTACGCTAAATACCACGCAACGGCAGACGCGAGATTCAAATACCAATGCAAAATCTGTCAAGCTAAATATATTTCGCCCAAAGGTTTGCGATTGCATTTGGTTTCTATTCACAACGAAACCGACAATACAGGAGATAACGTTCAGCTTATTAAACACGAACGAAAACCACAAGTCGGAGGCGTGAAATTAAATATATCCGTCAATAGGAAAAAGACTGAAGATGTGGGATTGAAAAAAGATGATTCGCTCGCCGGCGACATGATGGGAGGGAATCAAGACGGTATACACCAATGTCAAGCCTGCGGTTATTTATTATCGAGTAGGAAGAATTTAATGGAACATATTAAACAAGTGCATCCCATCGAGCACGAAGCGATACAGACCATATTGAACGCAAATGAATTGGACGAAGAGTCGTACTGTAAAGTCTGCGGACAGATATTTTTTAGCGTCGTCGAATTCGAACGACACGTGAAATCTGAACACAAACGCGAGGAATGGACGGATAAATTGTTAGAGTGTAAACACTGCGCGTTCTTCACAATGTCTCAGTTCGTTTTCAAAATGCATTTGAAGATTATACACGGATTGAGCGACGACACGACGATACCTCAGTGGCCTTGTAAAAAGTGTAATTTGACATTCGACGACAGAGGACGTTTGTTAGAACATAGTGTCAGTGGTGcttgttatattatttgtaacaaTATAAACGATTACGAATGTGCCGAGTGCAGACTGCGGTATTATAAGAAGAGTGATATAAATAAACACGTCACGGATCAGCACATTAGTATGAGTATTAAACCTAATGACGTAGAATTAAGTAATAACATCGTTAGGGATTATTCTAGTTCCAAtactgtcatacaaaatttgtcccAATTTACGAAATGTCCGTATTGTAGTATAATTATATCGTACGCCGAACTGTACGCACATTTAAGTAAGAATCACTTCCCGGCGGAGAAAGATCCCTCGTCTAAACATCACGAATGTCTCATCTGTAAAAAAAGTTATCCGGAATTGTCGTTACTTAAGAAACACATCGAGGAAGAGCATAAATATCGCGACGATACTACGTTGATTTTAAACGAATATCCGAACgtgaaagaaaataaagataaaaatgaaGATTCCTCAGAGTACAAATGTCTTCAGTGCGGTTTAAATATGCCGGATAATTCTTCTCTGTTAGCTCATGTTGCCGGCGAACACGGCGTaaagaaatattcaaattcaaataatccGACGTGTTCGTTATGTTCTCAACAGTTTACAAACAATTCGTCGCTCAGACGACATATTAAAGTGATACACGTTTCTAAaagtaaaatatcaaattgGGAGAAAAAACCGGCGGCTTGCGTAGTTTGTTCCGAAGTGTTTCGATCGCATTTCGGTATGATTAAACATTTTAGAGCGGTTCACGTGGATGATCCTGTAGTGAGTGGTAACGATTGTTTCGAAGTAGTGAAACGTAATAATATCAAAGTCGGATCTAAGATGAGCGTAGTGATATATAAATGTAAACTTTGTACTTTGTTTACGGCAAAAACGAAGCTGAGTATGGATATTCATCTCATCAACAAACATTGGTCGGGTGTTAAGCCGGCCTCCTGTTCTTACTGTGATTTTAAAACGGCAACCGTGAGACAGTTGGAGAGACATGTTTACACAATGCATCCCGATTCGAAACAGGCGAAAGAGACTGTGTGTTTGTGCGACGAATGCGGAAAAACGTTCAACAATAAAATCACATTTTATCAACATCGAGAGTCGCACAATCCGCCGAAATataagtgtgaattttgtaattattcGACGCATTTGAGGACGTCGATAGATAATCATAAACGGATTCACACGCAAGAAAAACCTTACAAATGTAACTTTGAAGGTTGTAGTTATTCTTGCACGAGTCACACGGGTCTAAAGGTGCACAAGTTTTCGCATTTCCCCGAAAAATACATGCAGACTTGCGAGACGTGCGGCTTTAGGACCGTGAATAAATACGTGATGAACGCACACGTTAAAGAGCACAATAAAGGTATTTTGAATTTCTCCTGTGAACATTGTTCGTATTCGTCAAACGATAAGGCTCAGTTGTCCGTACATTGTAAGAGGCACATTACGGCAAAGTCGTGTCAGTTGTGTGAATTTTCTACTAAATATAAACAGACTATGAAAAAGCATTTGAGGAAACGACACGGTATTATCGTAAATAAAGAAGATAATGTTAAATGGTTTTAA
Protein Sequence
MEEWDNMMSYADEGDRSNSVLNILQDPLLDAKLIDTNNIKIEVLDEADAMVDGDGAPLVFNDKRIEVKKEGRSSVTKKHKKKKASYKWEDDDEEDNEVLAVIKVKKERKDAENRTNKKKGGSHKSKEHSIKKDKLNAKIKSLKHKVMEEQEELKLKECGKQDEEGSWWCSKCGTRCAQRVKFLYHLKRFHNIEIGNFEYKCPHCDMEFMNDMALRAHYKRLHAGEELKVESDVLMDGDTSHTSTKYQHRIEKNPDKLEEEQLLKECGKQDEEGSWWCSKCGTRCAQKFKFLYHLKRFHNIKIGNFEYKCPHCDMEFRSDMALRAHVKRVHADEDPKFEHDPDRVLMIENGHQDDEGIWWCTKCGLRYAKEHLFVIHLKKLHQVGRPISTNHVCKFCNYQFTQKGALNRHVRNMHPDKQAASSEESSESSNLLVEEYNKLNKHGKKNEHGTWLCNNCSAMYGRKSAFIKHLKSSHNIDVTSVKFDCPSCDEYFLTQRALNDHLNTKHLSADAETSDGGETNKEDYEKLLQYGKKNETGAWVCNECSSKFTTQVRFIYHLKKNHNIIAKDFNFPCVQCTMSFLSLKALRAHINNCHLEVTDTQEGSITDFLKDKDKISSEGTRCEDGSWECNRCGTKYARDSVFMYHLKRYHGVDLGSFQHKCPYCKLDFMAFTGLRYHILNKHPKQYDKFIAKMPLSKQSEVESEEYETDDDDNIALSELKKGEGPGSSKPSTSTKPRKRKLDADRERDAKGKQHTLEQEIMNREGKKDLEGSWWCSKCGLRYSKDGVFLYHLKRAHNVTIGSFEYKCSYCKIEFISKPALRTHINRTHPEANENLFRNSSDFLQASILENHAKIIEANCQKDTNNHWACKMCGSKFVKDTMCLYHLKRDHNIFIQGFDYKCNLCDMDYSNPQALRIHVKRVHGDDEDKILDKCIVKDSRWECKECPKVFIKQASFAYHYAKYHATADARFKYQCKICQAKYISPKGLRLHLVSIHNETDNTGDNVQLIKHERKPQVGGVKLNISVNRKKTEDVGLKKDDSLAGDMMGGNQDGIHQCQACGYLLSSRKNLMEHIKQVHPIEHEAIQTILNANELDEESYCKVCGQIFFSVVEFERHVKSEHKREEWTDKLLECKHCAFFTMSQFVFKMHLKIIHGLSDDTTIPQWPCKKCNLTFDDRGRLLEHSVSGACYIICNNINDYECAECRLRYYKKSDINKHVTDQHISMSIKPNDVELSNNIVRDYSSSNTVIQNLSQFTKCPYCSIIISYAELYAHLSKNHFPAEKDPSSKHHECLICKKSYPELSLLKKHIEEEHKYRDDTTLILNEYPNVKENKDKNEDSSEYKCLQCGLNMPDNSSLLAHVAGEHGVKKYSNSNNPTCSLCSQQFTNNSSLRRHIKVIHVSKSKISNWEKKPAACVVCSEVFRSHFGMIKHFRAVHVDDPVVSGNDCFEVVKRNNIKVGSKMSVVIYKCKLCTLFTAKTKLSMDIHLINKHWSGVKPASCSYCDFKTATVRQLERHVYTMHPDSKQAKETVCLCDECGKTFNNKITFYQHRESHNPPKYKCEFCNYSTHLRTSIDNHKRIHTQEKPYKCNFEGCSYSCTSHTGLKVHKFSHFPEKYMQTCETCGFRTVNKYVMNAHVKEHNKGILNFSCEHCSYSSNDKAQLSVHCKRHITAKSCQLCEFSTKYKQTMKKHLRKRHGIIVNKEDNVKWF

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-