Basic Information

Gene Symbol
-
Assembly
GCA_004195275.1
Location
SGAY01000725.1:18252-28343[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 32 7.5e-06 0.00096 20.2 1.0 1 23 207 229 207 229 0.98
2 32 0.011 1.4 10.3 3.7 1 23 252 275 252 275 0.96
3 32 0.00073 0.094 13.9 0.5 2 23 282 304 281 304 0.96
4 32 0.011 1.4 10.3 0.1 1 21 324 344 324 345 0.95
5 32 0.001 0.13 13.5 0.2 2 23 444 465 443 465 0.94
6 32 0.00034 0.044 15.0 0.3 1 23 471 494 471 494 0.96
7 32 0.0001 0.013 16.6 0.1 1 23 564 587 564 587 0.96
8 32 0.0096 1.2 10.4 1.1 1 23 593 616 593 616 0.95
9 32 0.074 9.5 7.6 0.8 1 23 629 651 629 651 0.97
10 32 4.3e-05 0.0055 17.8 0.9 3 23 696 717 695 717 0.96
11 32 1.8 2.4e+02 3.2 0.4 1 23 750 773 750 773 0.92
12 32 0.0058 0.74 11.1 0.8 2 23 780 803 779 803 0.94
13 32 1.1 1.4e+02 4.0 0.0 2 23 812 834 812 834 0.90
14 32 0.031 4 8.8 0.0 2 23 841 863 840 863 0.93
15 32 1.6e-05 0.002 19.2 1.2 1 23 871 893 871 893 0.98
16 32 1.3 1.7e+02 3.7 1.2 2 23 939 962 938 962 0.95
17 32 0.005 0.64 11.3 1.0 1 23 965 988 965 988 0.97
18 32 0.00082 0.11 13.8 0.1 2 23 995 1017 994 1017 0.96
19 32 0.0011 0.14 13.4 3.9 1 21 1045 1065 1045 1066 0.95
20 32 0.0032 0.41 11.9 1.8 2 23 1080 1101 1079 1102 0.93
21 32 0.43 55 5.2 1.1 1 23 1105 1128 1105 1128 0.94
22 32 0.012 1.6 10.1 0.9 2 23 1135 1158 1134 1158 0.90
23 32 3.7 4.7e+02 2.3 0.2 2 23 1166 1188 1166 1188 0.94
24 32 0.77 99 4.4 0.0 2 23 1195 1217 1194 1217 0.93
25 32 0.27 35 5.9 2.7 2 23 1226 1248 1226 1248 0.96
26 32 1.4 1.8e+02 3.6 0.4 2 23 1254 1276 1253 1276 0.91
27 32 0.02 2.6 9.4 0.1 1 23 1338 1361 1338 1361 0.95
28 32 0.00019 0.024 15.8 0.9 1 21 1389 1409 1389 1410 0.96
29 32 0.38 49 5.4 1.9 2 23 1511 1533 1510 1533 0.94
30 32 1.1 1.4e+02 3.9 0.4 2 23 1540 1562 1539 1562 0.94
31 32 0.83 1.1e+02 4.3 2.5 1 23 1573 1595 1573 1595 0.97
32 32 0.16 20 6.6 0.1 1 20 1599 1618 1599 1620 0.92

Sequence Information

Coding Sequence
ATGCACATATTCGGCCAGGAAGGCAGAAACCGGCAGCTCGTTGACAAAATTCAAACGTGCCTGCCGTTCAAGATAGAGGAAGATGATCGTTTGCCAAAAGTCCTGTGTTATCGATGCATgtataatttggaaaatttttacgaCTTCAGAACCGCATGCGTGAATGCATCTGCTTGGTTGGAAAGAAATAGACCCAAGGAAGGAGCGAACGATGATGGTGTAAATGATAGCGCACAGTGCAGCGACATGCACGCGGAGCTTCTCaagggaaagaaaaatatgccaaTACTTATCCCGGAAGCGCCCGTGGTCAATCCGAATGCCGCATTAGGTACACCACCGAGATTGAATTCCGATGGAGAGGCCGATCCCGAGATCGAAGAGATTCTCGATGCGAGCGAAGGCACCGACGAAGTGCTCGATGATTCGACGGAGGATCGACGGTCGGAATATGAATACGAGATGGACATGGAGACGAATCCTAGCGACTTTTTAGAAATGACGCCGATGGTAACCGAAGAAAATGAGGAGGAATGCGGCGCAAATAACACAAGTGCTACGGCCGGTCAACAAGAGGCCGCGACGGTCTTTCCACCTACTTCGCAGCAGCACGAGGTCTATGTATGTTCTCTGTGCAACAAGGCATTTAGCTCCAAGGGTCATCTATCGTTGCACGCGAGGATTCACGTGGGTGAAGGTGATGTGATCGGTGAAAGGGTTATTACTGATGACCATACTTCGTATCAACGACCGTATCAATGCGATCTCTGTCATAAGTCGTATTCTACCGCGAAGCATCGCTGGGGACACGTTTCTACGACACATCGGGGACATCCTGCTGTAACGTGCGCATATTGCTCGCGAATATACTCGACACGGTACAACCTCGACGAGCATATAAAATCGCGACATGCCGGTCTACCGCCACCCCCAGAATTATCAGTTTCTCTTTCCCGCGCAGAGACTCGTTACCAATGCCAAACGTGTCCGatgatttataaagatttggCAGATTTCAACGCGCATCGTCAGATATGCATTGAGGAACAACGCACTGATCTATTAGGACAAACCGACACGCAAaacaataagaatttaatcgaTATGTCCGATATGTCGAGTATTATAGACTCGGATGACGAGAACAAAGACTTTAGGAGTGCCGAGGCTAAACTGGCCAAGAATCCGCAATTGACTATATTGAAACAAGCTTTGACTAAAGGAGACAGTTTAAAAAGGATTTATGATGATGATGGCTCAACGTCGAGTTGCAAGGCGAGAAAAATGGTCAAGAtagAAGGCGAAACAAATCCTGAGATAAAAAGGTGGTATTGCGAATATTGTCCGCAAAGTTTTACATCAGTGGACAGTTTAAAGTTGCACGAAGTCAGGCATGATGCCGAGAAGCCGTTTATCTGCGTACTATGCAAGAAggattttgttttgaaatctTCATTGATAAGGCACATTACGACGATACACGGCGTTGATCCTACTCCTATCATTGACAGCGACAAGTGTCTAAAAACATCTGTGATGCCTCAGAATTGGAATCGAATGGATGTCAGCGTTTACGAGCAGAACGATATAAAGGAACCACCAGAATTTTCGTCGTCACCCGagACAAATTTGGACAATGATGAGAAAGATTTGAGAAACAACCACGAGAATATAGAAATCGAAACAGTATTTATATGTGAGATTTGTACGAGGGACTTTAACGATCGAGCGTCGTTATGGTTGCACATGCGTGCAACACATAAGGAACTTGCTGCATATGCCTGTGGAGTGTGCTTAAAGATTTGTTCCGACAATACACAACTCCAGAGTCATCTTTACATGTATCACGGAAAATCCAAGCTTTTAATATCGGAACAAAGAAGGTACAGTTGCACGATATGCGGCAGACAGCATGATTCAAGAAAGAAGCTAATAGCTCATGTCTCGATACACAATATCGATTCTGGGTTTGATCCTGCAATTTTTGTACAGTTAAACagtaattattacaatgaGAACTTAAACGGTACCGAAGGAAATGAACAAGTAATGGATTTCGATGGAGAAGACGGCGAGAAAGTCGATTGTTACATTTGTTACAAATCTTTTCCAAACGAGGATCACCTTATACGACATCAGAGAAATGCGCATAAATCCGAACAAATAATTCCGTTAGGAGATGCCGCGGCAAGTGGAAACGCTCCGAATATCAATGGCAGCGGTAATAGGGGACAGTAtcatttgttttttgtttgtgAAATTTGTGGTAGTTCTCATTCAAGTAAATGGGAACGCTGGTTGCATATCAACAACATGCATAATAACGAATCTTCCATCAAGTGCGAATGGGAAAACTGTGGGAAAATATTCGCGACGAAATCACTGCGTAATGACCATCTCCAGCATCATTTGATCCAAGGCCCGTCACCAAACACCTGTGAGATATGCGGTAAATTGTGGCCTACTCGTGTCGATTACTGGAAACACGTGATGGGTGTTCACGCGGACACGGTACCCCTGATCTGCGGCGTTTGTCTGAAAGTATTTTCCGATGTGATGCAGTTAAGTGCTCACGTCAAGGCAAAACATTGGCCACTCACCAATGGTGATTTTAGCTGTGATATTTGCGGTAGACCATACTCCAATAAATCCAAGATGTCCCGACATAGAAAGATCCACGGTTTGGAAATGGCAGCAGCGGCAGCGGCAGCGATGATTGCGGATGTCGCGTGTGATAATAGCAATCTCAATGAGACAATCAACGAATCTATAAAACTCGAACACGGCAACAACAGGGCCGTAGATTTCAAATTGAAATGCGAACAATGCCCCGAGCACAAGTTTACGACTCTGGACATTTTACGCAATCATCGTCGGGTAGTGCACAATCTCTTCCCGTGTGATCTATGCGTTAAGTACTATGGTAGGACATCCCATTTGTGGAAACACGTGAACAGGGTACATAAGGGTCACGCGGACGTGACTTGTCCATACTGCGCGAAAACGAGCGCGTCGAGAGATCATCTTGCGGCGCACATCGCGAAGATTCACAGATATGTGCCCACGATGGGTGGCAAAGATAATCAGAACTGCGTCGTTTCTAAGTCCTTGAATATGGAGGATGGTGTCCTGCATTACTGCGAGAAATGTAACAAAGGATTTCACAAACGCTACCTGCTCCGACGTCACATGAAGGGCTGTCAAAACTACCGTAAGGATCCTGGAGCACTATTGACCCGCTGCCGAGCGTGCGAGAGGATATTCAAGGATCGTGCAAGTCTACAGAAGCACATCGAGAATCACCACAGTACATATACCTGCCATTTATGTAATGAGACTATTACGTCCAAACTGGGCATTATGACGCACAATCGCGTCAATCATATGGATCACCCGGATCTGACGTGCGATTATCCGAGCTGCAAGAAGCTTTTCCGCACTAAGGAGGATCTCGAGTCTCATCGAAAGGATCACAAATATCACAGTAACCCGAATGTCTGCGATTTTTGCGGCGACACCGTGGAgaacaaattaaagttaaagatGCACGTGCTATCATTACACCGGAACGAGATCGGTGTATCCTGCGGCGTCTGTCTCATTCCTATGAAGGATCCGAAAGATTTGAAGAAACACGTCGAGACGGAGCACAGTAGCGTTCTTTCCAATCCGAATACATGTCAAGTATGTGGCAAGCAATATGCATCCAAGTGGAAGGCGTTTGATCACACGAAAAAGTGTCACGGAAAAGTTTTTCTCACGTGCAAACAATGTTTAGCAGTTTTCACAGATGAGAACGCTATACGCGATCATTACGAACATGTACATAACGTTCCAAAGGATCAGTTAGCCGTTTTCGAATATAGAATGGACATCGGTGCGAAGAGGGAAGATTACGAGACACCTGACATCATCGTGAAGGAAGAACCGGATGATCTTGAGTTCGATGAAGAGATGTGCGATGAAAGTTCGAGTGATTCCCGTAAACGCAGACGATCGCCGAACGATACGTATGATTGTGAAATGTGCCCCGAGATCTTTCTCAATTCGGATACACTCGCCAAGCATTATCAGAATGTTCACAACACCGATCCCATTCGTATGTTCAAAAAGTTTAAGAAGAACAGTGGCGACGGCAAACGTAGAATGAGAAATAGAAACAATTACGAATGCAAGAATTGCAAGAAGCAGTTCTCTACCAAGACTTTATTCTGGAATCACATAAACGCGTGCACGCGACGAAACTCGGTATGCAAATTCGACGTTCCGAATAATATCTCGACATCGATTCTGGAGTCGCATTTGAAGAATAATAATCAGATTCAGCGAGAAGAACCGTTATCGCTGATGAACGAATCTAATTTGAACATTCCTGATTTTAACCTATTCGAGGACATCAACATGCAACTGTCAGCCCAGAAACCGGTGCCGAATCTCATGCCGTTGTCGCAGGTGAAGTCGGCAGGTAATGGCAAGTGCTCACGCAAAGACTCGCGCAAGGTATATGACGAATCGACCAATACTGAGTGCACATGCGAGGTCTGCGGCAAACAGTGGCCCGCTAAGAAGCACTTGTGGCAACACTTGATTCGTTTCCATCGTGCCGAAGCGGCCGTTACCTGCGGCGTATGTTTGAAGTTATGCAAATCCTATCAAGATTTAGCCGATCACCTGAAGGCGGAGCACGCCCCTGTTTTGTCACCGGAGGGCAACAATTTCACGTGCAAGACATGCGGCAGATATCACAATGCGAGAAGTAAACTGCTGCTACACATGAGCATCCATATCGGATACTTCCGATGCGAGAAGTGTCAGCAAGGTTTTGCGAGTGAGGAAAAACTCGGCGAGCACGTGACAAACTGCAATGGCAAGTCGGAATTTGAGGATCATGCAGCAACTGCGGATATTGAGGATAACGCGAAAAATGACAATGATGAAAAGGGCAGTTTAATCGCTGACGAAACGTCGGTCATCGAAGAGGAAGTTGAAGAAGCGGATTTTGAATCGGAGGGCGAGGGTAGTAGAGGCTTGCAAAATGAAGAAAACAATAGCGAAGAAGACAATTCTGATAGCGATGACTCGGATAGCGGTAGCAATAGTAGTTCGAGCGAGAATGAAggcgaagaggaagaagaagaggaggaagaggaagaagagaacgAAAACGAAAATGAGTCTGATACAAGGACTGAGCCGGACACGAGAACTCCGAGCAGAGCGAGCGGTGACAGTGAGTCGTGTAATGAACGTTCCGAAAGTGACGAATCGAATATGGACGAAGCAGAAGTGAGCGCGAAGAAAGCACCGCGATTGAACGATAACGATAGTTTCATGATACAAGGCGAAGAGAACTCGCCGGCGATGGAAAAATACGTCGAAGATCAAAGCACCGTTTCCGTTATCGCAACGATTGGTGGAGACCGAATTAAGCAGAGTACCTTGAATAATCTATTGATTTCCGGTGCATCTGCGAACGTAGATAAATTTAAGGCATTACGTCTCCAGGAATCTATTACAGCTACGGCGAGTGATGTAGACTTCTctaatgataatgaaaatgataatgagGAAGATgacgaaaataatgaaaatgaagaaaaagagggagatgAAGGCGAAGATGAAGGTGAAGGTGAAGGTGAAGGTGAAGCTGAAGCTGAAGGTGAAGGTGAAGCTGAAGCTGAAGGTGAAGCTGAAGGTGAAAGTGGAGATGAAGATGAAGATGAAGATGAAGATGAAGATGAAGATGAAAGTGAAGGTGAGGGTACAAGCGAGGCCGAAGCTGAGAGTGGACAGGATGAGGGTGAAGACGATGCTGATGGTGAAggggaagaagaggaagaggaggaaaatgatgaggacgacgacgatggaCCGCCTGTATTGAGTCCAATAATGCCTTTGCTGACTGAAAATGAATCCGAGGAGCACAGCAATACGACGGATCGTACAAGGCATAAGCTTAGTCCGATGGTATCGCTGAGTATGGACAAATTAATAGAGGGATGCCAGATAACGGAGATCAAAACTAATACGGAGAATACGGCGGCACTCTCGAATGCCTCCAACTTCTTTGCAGCTAATAACAACGATGTAGCCGTGACGTGGGACGAGGATGAGGAACGCGATTGTAATTCCGATGTCGAGGACAGAGATGTGATGGTAATGAAGAACGAGGAATTTGATAAGGAGTATGCTAAGAGGAATATCAATGATTTGGAAGGGGACGACGACTATGAGGAAGATTCCGCGGACGAAAATGTGATGGACGATAGAGATGGTGATGGTGGAGGTCAAGTGCACGAAATACATAGTCTAGACGGGACAGTGTTAATGATGACTAATGATGCGGAAGGTAATCCGATTTTGATAGGACATAATGTGTTAGATATCGATAACGAAGACTCTAACGCCGAGGTGACGCAGTATATTTATCCAGAGAATGCTTATGAGATTGAGGAAGAGGATGAGGAAGATTTTGCGAGTCGAAATGAAGCCGACGCCATGCAGACGGGTGAGATGCAAGGTATGTCCTACGTTCAGGATATGTCGGAGAACGAGGATAGCACGGGGGATGATGTAGAGGAGAATAGTAATGATGCCCAGAAATAG
Protein Sequence
MHIFGQEGRNRQLVDKIQTCLPFKIEEDDRLPKVLCYRCMYNLENFYDFRTACVNASAWLERNRPKEGANDDGVNDSAQCSDMHAELLKGKKNMPILIPEAPVVNPNAALGTPPRLNSDGEADPEIEEILDASEGTDEVLDDSTEDRRSEYEYEMDMETNPSDFLEMTPMVTEENEEECGANNTSATAGQQEAATVFPPTSQQHEVYVCSLCNKAFSSKGHLSLHARIHVGEGDVIGERVITDDHTSYQRPYQCDLCHKSYSTAKHRWGHVSTTHRGHPAVTCAYCSRIYSTRYNLDEHIKSRHAGLPPPPELSVSLSRAETRYQCQTCPMIYKDLADFNAHRQICIEEQRTDLLGQTDTQNNKNLIDMSDMSSIIDSDDENKDFRSAEAKLAKNPQLTILKQALTKGDSLKRIYDDDGSTSSCKARKMVKIEGETNPEIKRWYCEYCPQSFTSVDSLKLHEVRHDAEKPFICVLCKKDFVLKSSLIRHITTIHGVDPTPIIDSDKCLKTSVMPQNWNRMDVSVYEQNDIKEPPEFSSSPETNLDNDEKDLRNNHENIEIETVFICEICTRDFNDRASLWLHMRATHKELAAYACGVCLKICSDNTQLQSHLYMYHGKSKLLISEQRRYSCTICGRQHDSRKKLIAHVSIHNIDSGFDPAIFVQLNSNYYNENLNGTEGNEQVMDFDGEDGEKVDCYICYKSFPNEDHLIRHQRNAHKSEQIIPLGDAAASGNAPNINGSGNRGQYHLFFVCEICGSSHSSKWERWLHINNMHNNESSIKCEWENCGKIFATKSLRNDHLQHHLIQGPSPNTCEICGKLWPTRVDYWKHVMGVHADTVPLICGVCLKVFSDVMQLSAHVKAKHWPLTNGDFSCDICGRPYSNKSKMSRHRKIHGLEMAAAAAAAMIADVACDNSNLNETINESIKLEHGNNRAVDFKLKCEQCPEHKFTTLDILRNHRRVVHNLFPCDLCVKYYGRTSHLWKHVNRVHKGHADVTCPYCAKTSASRDHLAAHIAKIHRYVPTMGGKDNQNCVVSKSLNMEDGVLHYCEKCNKGFHKRYLLRRHMKGCQNYRKDPGALLTRCRACERIFKDRASLQKHIENHHSTYTCHLCNETITSKLGIMTHNRVNHMDHPDLTCDYPSCKKLFRTKEDLESHRKDHKYHSNPNVCDFCGDTVENKLKLKMHVLSLHRNEIGVSCGVCLIPMKDPKDLKKHVETEHSSVLSNPNTCQVCGKQYASKWKAFDHTKKCHGKVFLTCKQCLAVFTDENAIRDHYEHVHNVPKDQLAVFEYRMDIGAKREDYETPDIIVKEEPDDLEFDEEMCDESSSDSRKRRRSPNDTYDCEMCPEIFLNSDTLAKHYQNVHNTDPIRMFKKFKKNSGDGKRRMRNRNNYECKNCKKQFSTKTLFWNHINACTRRNSVCKFDVPNNISTSILESHLKNNNQIQREEPLSLMNESNLNIPDFNLFEDINMQLSAQKPVPNLMPLSQVKSAGNGKCSRKDSRKVYDESTNTECTCEVCGKQWPAKKHLWQHLIRFHRAEAAVTCGVCLKLCKSYQDLADHLKAEHAPVLSPEGNNFTCKTCGRYHNARSKLLLHMSIHIGYFRCEKCQQGFASEEKLGEHVTNCNGKSEFEDHAATADIEDNAKNDNDEKGSLIADETSVIEEEVEEADFESEGEGSRGLQNEENNSEEDNSDSDDSDSGSNSSSSENEGEEEEEEEEEEEENENENESDTRTEPDTRTPSRASGDSESCNERSESDESNMDEAEVSAKKAPRLNDNDSFMIQGEENSPAMEKYVEDQSTVSVIATIGGDRIKQSTLNNLLISGASANVDKFKALRLQESITATASDVDFSNDNENDNEEDDENNENEEKEGDEGEDEGEGEGEGEAEAEGEGEAEAEGEAEGESGDEDEDEDEDEDEDESEGEGTSEAEAESGQDEGEDDADGEGEEEEEEENDEDDDDGPPVLSPIMPLLTENESEEHSNTTDRTRHKLSPMVSLSMDKLIEGCQITEIKTNTENTAALSNASNFFAANNNDVAVTWDEDEERDCNSDVEDRDVMVMKNEEFDKEYAKRNINDLEGDDDYEEDSADENVMDDRDGDGGGQVHEIHSLDGTVLMMTNDAEGNPILIGHNVLDIDNEDSNAEVTQYIYPENAYEIEEEDEEDFASRNEADAMQTGEMQGMSYVQDMSENEDSTGDDVEENSNDAQK

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00869056;
90% Identity
iTF_00280700;
80% Identity
-