Basic Information

Gene Symbol
-
Assembly
GCA_016802725.1
Location
NC:18283336-18296494[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 32 5.1e-06 0.00045 20.2 1.0 1 23 265 287 265 287 0.98
2 32 0.0074 0.65 10.2 3.7 1 23 310 333 310 333 0.96
3 32 0.0005 0.044 13.9 0.5 2 23 340 362 339 362 0.96
4 32 0.0024 0.21 11.8 0.1 1 21 382 402 382 403 0.95
5 32 0.0052 0.45 10.7 0.1 2 23 501 522 500 522 0.95
6 32 0.0088 0.78 10.0 0.3 2 23 529 551 528 551 0.90
7 32 0.00015 0.013 15.5 0.2 1 23 621 644 621 644 0.96
8 32 0.055 4.9 7.5 3.1 1 23 650 673 650 673 0.95
9 32 0.099 8.7 6.7 1.5 1 23 686 708 686 708 0.97
10 32 2.8e-05 0.0025 17.9 0.9 3 23 753 774 752 774 0.96
11 32 3.8 3.3e+02 1.7 0.3 1 23 807 830 807 830 0.88
12 32 0.013 1.1 9.5 0.8 2 23 837 860 836 860 0.93
13 32 0.74 65 4.0 0.0 2 23 869 891 869 891 0.90
14 32 0.021 1.9 8.8 0.0 2 23 898 920 897 920 0.93
15 32 7.3e-05 0.0065 16.5 3.2 1 23 928 950 928 950 0.97
16 32 0.02 1.7 8.9 0.8 3 23 992 1014 990 1014 0.93
17 32 0.00017 0.015 15.4 3.3 1 23 1017 1040 1017 1040 0.97
18 32 0.00013 0.011 15.8 0.2 2 23 1047 1069 1046 1069 0.96
19 32 0.00075 0.066 13.4 3.9 1 21 1096 1116 1096 1117 0.95
20 32 0.0022 0.19 11.9 1.8 2 23 1131 1152 1130 1153 0.93
21 32 0.29 26 5.2 1.1 1 23 1156 1179 1156 1179 0.94
22 32 0.0072 0.64 10.3 1.6 2 23 1186 1209 1185 1209 0.91
23 32 2.5 2.2e+02 2.3 0.2 2 23 1217 1239 1217 1239 0.94
24 32 2.7 2.4e+02 2.2 0.0 2 23 1246 1268 1245 1268 0.88
25 32 0.19 16 5.8 2.7 2 23 1277 1299 1277 1299 0.96
26 32 0.5 44 4.5 0.5 2 23 1305 1327 1304 1327 0.92
27 32 0.014 1.2 9.4 0.1 1 23 1389 1412 1389 1412 0.95
28 32 0.00018 0.016 15.3 1.0 1 21 1440 1460 1440 1461 0.96
29 32 0.11 9.6 6.6 1.2 2 23 1562 1584 1561 1584 0.94
30 32 0.75 66 3.9 0.4 2 23 1591 1613 1590 1613 0.94
31 32 0.57 50 4.3 2.5 1 23 1624 1646 1624 1646 0.97
32 32 0.012 1 9.6 0.4 1 20 1650 1669 1650 1671 0.94

Sequence Information

Coding Sequence
ATGGCGACGATATCGACGGCCACAGCGGCGGCggggacgacgacgacaacgtcaACGTCAACGTCAacgtcgacagcggcgacgGCGTCGAGCGCGACAGCGATGGCGACGGAGCCGAGAAAATCTTATCACGATCTGTGTCGCCTGTGCGCCTCCTTCGACGCCATTAGGATGCAGATATTCGGCCAGGAAGGCAGAAATCGTCAACTCGTCGACAAGATCCAGTTGTTCCTGCCGTTCAAGGTGATGGAAGATGATTGTTTGCCAAAAGTCCTGTGTTATCGATGTATGTTCAACTTGGAAAATTTCTACGACTTCAGGACTGCGTGTATTAACGCAGCTGCTTGGTTGGAAAGAAACAGGCCGAAGGAAGGCGCGAGCGATGACGATGCAAATGATAACGCGCAAATCAGTGAAGGACACACGGAGCTTCTTAATGGAAAGGAAAATATGCCAGTACTTATTCCAGAGGCACCCGTAGTCAATCCTAATGCAGCATTGGGTACACCGCCGAGATTAAATTCGGATGGTGAAGCGGATCCCGAGATCGAGGAGATTCTCGACACAAGCGAAGGTACGGATGAAGCTACAGTGATAGACGATTCGGAAGATCGACGATCGGAATATGAGATGGCATACGAGATGGACATGGAGACGAATCCTAGCGATTTCTTGGAAATGACCCCAATGGTAACCGAGGAGAATGAAGAAGAGTGCGATACGAACAACGCGAGTGCTGCGACTACTCAAGATACCACGGTCTTTCCACATACATCGCAGCAACACGAAGTCTACGTCTGTTCTCTATGTAACAAGGCATTTAGTTCCAAGGGTCATTTGTCACTGCACGCGAGGATTCACGTGGGCGAGGGTGATGTGATCGGCGAGAGAGTGATCACAGACGATCATACTTCGTACCAACGACCGTATCAATGTGATCTTTGTCATAAGTCGTATTCTACCGCCAAACATCGCTGGGGACATGTTTCTACGACACATCGGGGACATCCTGCAGTAACGTGTGCATACTGTTCTCGTATATACTCGACGCGATATAATCTTGATGAGCATATAAAATCGCGACACGCTGGTCTACCGCCACCCCCAGAGTTATCTGTTCCTCTTTCCCGCACGGAAACTCGTTATCAGTGCCAAACGTGTCCAATGGTGTACACGGATCTGGCGGATTTCAATGCGCATCGTCAGATATGCGTCCAGGAACAACGTACAGACTTGTTGGGACAAACTGAGGCAcagaacaataaaatttttgccgACACGTCTGACATCTCGAGCGTAGATTCGGATGACGAGAACAAAGACTTTAGGAGCGCTGAGGCTAAATTGGCGAAAAATCCACAGTTAACTATATTGAAACAAGCGCTAACTAAAGGAGACAATTTAAAACGGAATTTCGACGATGATGGTTCAACTTCCAGCGGCAAGccgaaaaaaatagttaagtcagAAGAGGATGAGACGAATCCTCAGAAAAGGTGGTATTGTGAAGCTTGTCCACAAAGTTTTACATCGGTGGACAGTTTGAAGGAACACGAGCTCAGGCACGACGCCGAAAAGCCGTTTATTTGCATACTCTGCAACAAAGATTTCGCTTTGAAATCTTCATTAGTTAGACACATTACAATGTCACATGGCGTTGATCCTACTCCTATCATTGACAGTGATAAGTGTCTAAAAACGACAGTAATGTCTCAGAATTGGAACGATCGAGTGGACGTCAGTGTCTATGAACAAAGTGAAATAAAAGAACCACCAGAGCTCTCGTCATCATCtgagATAAATTTGGAGAATGATGACAAAGATTATAAGAACCACGAGAATATAGAAATTGAAACGGTATTCGTATGTGAGACTTGTAAGAGAGACTTTAATGACCGAGCATCATTATGGCTACACATACGAGCGATGCATAAAGAATATGCTGCATTTACCTGTGGAGTATGTTTAAAGATGTGCTTTAATAATACGCAACTTCAAAATCATGTCTTCATGTATCACGGAAGGTCTAAACTCTTAATATCGGAACAAAGAAGGTACAGTTGTACGATATGCGGTAGACAACATGACTCgagaaaaaaactaattactCATGTCTCGATACATAATATCGACCCTGCCTATGATCCTGCAAgttttgtacaattaaatagtaattattataatgaaaactTGAATGGCAACGAGGGAAATGAACAAGTATTAGATTTTGATGGGGAAGATGGTGAGAAGGTCGAttgttatatttgttacaaatcTTTCCCGACTGAGGATCATCTAATACGGCATCAGAGAAATGCTCATAAGTCTGATCAGATAGTTTCATTAGGAGACGGCACGGGTAATGGAAATACTCTGAGCGTTAATGGTAATGGTAATAGGGCTcagtatcatttattttttgtttgcgaAGTATGTGGCAGTTCACACTCGAGCAAATGGGAGCGCTGGTTGCATATCAACGATACGCATAACAACGAATCTTCTATCAAATGCGATTGGGAGAGTTGTGGAAAGATATTTGCGACAAAGTCGCTACGTAACGAACATCTCCAGCATCATGCGGTGCAAGGACCCTCGCCAAATACCTGCGAGATATGCGGAAAATTATGGCCTACCCGCGTCGATTATTGGAAACACGTGATGGGCGTGCACGCGGATACAGTGCCTCTCATTTGTGGCGTTTGTCTGAAAGTATTTTCCGATGTTATGCAATTAAGCGCACACGTGAAGGCGAAGCATTGGCCGTTAACTAGTGGGGACTTTTGTTGTGATATTTGCGGTAGACCATACTCCAATAAATCAAAAATGTCTCGGCATAGAAAGATCCACGGCTTGgaagcggcgacggcggcggccaTGGACGTTACATGCGATAACAGTAGTTTCAATGAAACGACCAATGAATCGGTGAAGCCTGAACACAGCAATGGCGCTTCggaattagaattaaattgCGAGCAGTGCCCCGAGCTTAGCTTCACAACACTAGACAGTTTATGTAATCATCGACGGATAGCGCACAATCTTTTCCCATGCGACTTGTGTAACAAGTGCTACGGAAGAACGTCACACTTATGGAAGCATGTGAATAGGGTTCACAAAGGACACGCGGACGTGACTTGTCCTTACTGCTCAAAGACGAGCGCGTCGAGGGATCATCTGGCAGCGCATATCGCCAAAATTCATAGGTTTATGCCCGCTATGGGTAAGGACACTCAAAACTGCGTCACTTCCAAGTCTTTGAGTGTAGAGGACGGTGTCTTGCATTACTGTGAGAAATGTAACAAGGGATTCCATAAGCGCTACCTGCTCCGTCGACACATGAAAGGCTGTCAAAACTATCGTAAGGATCCGGGTGCGTTGTTGACTCGTTGCCGAGCCTGCGAGAGGATATTCAAGGATCGTGCGAGTTTGCAGAAGCACATTGAGAATCATCACAGCACGTATACTTGCCATTTGTGTAACGAGACGATTACTTCCAAGCTGGGCATTATGACGCACAATCGCGTCAATCATATGGATCATCCAGATCTGACGTGCGATCATCCGAGCTGTAAGAAACTCTTCCGCACCAAAGAAGATCTGGAAGCTCATCGCAAGGAACACAAGTATCATAGCAACCCGAACGTTTGTGACTTTTGCGGTGACACCGTGGAAAACAAACTGAAACTGAAGATGCATGTACTGTCATTGCATCGAAACGAGATCGGTGTATCCTGCGGCGTCTGTCTCATTCCTATGAAGGACCCCAAAGATTTGAAGAAACATGTTGAAGCGGAACATAGCAGCGTTCTTTCCAATCCAAACACATGTCAGGTGTGTGGCAAACAATACGCGTCCAAGTGGAAGGCATTCGATCACACGAAAAAGTGTCACGGCAAAGTTTTTATCACGTGCAAACAGTGCTTAGCGGTTTTTACCAACGAGAACGATATACGCGATCACTATGAGCATGTCCATAACGTTCCAAAGGACCAATTGGCCATTTTCGAATATAGAATGGATATTGGTGTGAAGAGGGAGGGTTACGAGACTCCCGATATTATTGTGAAGGAGGAACCGGATGATTTGGAGTTTGACGAGGAACTGTGCGATGAGAGTTCGAACGACTCTCGCAAACGTAGAAGATCGCCGAATGATACGTACGACTGCGAGATGTGTCCCGAGATCTTCCTCAATTCGGACACTCTCGCCAAGCATTATCAGAACGTCCATAACACTGATCCCGTCCGTATGTTCAAGAAATTCAAAAAGGATAATGGCGACAGCAAGCGCAAAATGAGAAACAGAAACAATTTTGAGTGCAAGAATTGCAAAAAGCAATTCTCTACCAAGACCCTATTCTGGAATCATATCAATGCATGTTCGCGGCGAAATTCGATAGGTAGATTTGACATGCCGAATAATGTTCCAACGTCAATTCTGGAGTCTCATCTAAAGAACAATAATCAAATACCACGAGAAGAATCGGTACCGCTGACGAACGAATCCAATTTGAACATTCCCGATTTCAATTTATTCGAAGACATTAATTTACAGTTGTCAGCTCAAAAACCTGTGCCGAATCTTATGCCATTGTCGCAAATGAAGACGACGGGTAATAGCAAGTGTTCGAGAAAAGACTCGCGCAAGGTTTACGATGAATCGACCAATACCGAATGCACGTGCGAGGTCTGCGGTAAACCGTGGCCTGCCAAGAAACATTTGTGGCAACACCTAATTCGATTTCATCGCGCCGAAGCCGCCGTTACGTGTGGTGTATGCTTAAAGCTGTGTAAATCTTATCAAGATCTGGCCGATCACCTGAAGGCCGAGCACGCCCCTGTTTTGTCGCCAGAGGGTAACAACTTCACTTGCAAGACATGCGGTAGATATCATAACGCGAGAAGCAAATTGCTGTTGCATATGAGCATCCACATCGGGAATTTCCGGTGTCAGAAATGCCAGCAAGGTTTTGCGAGTGAGGAGAAACTCACCGAGCACACGACAAGCTGCAATGGCAAATCAGAGTTTGAGGATAACGCGGTGGCGGACGAAGATAACGCAAAGAACGACAACGACGAAAAGGGCAGTTTAATTGCCGACGAGACGTCGGTCATCGAGGAAGCAGAGGAAGCGGATTTCGATTCGGAAGGCGAAAGGAGTAGGGATATTCACGATGAAGACAATTCGGAAAATAGCGAAGAAGACAATTCGGATAATAGCGATGATTCAGACAGCGATAGCAACAGTAGTTCGAGTGAAGATGAAAACGAGAACGAAGagggagaagaagaagaggaggaggaagatgAAAATGAAAACGAAAATGAGTCTGATACGAGAACCACTAGTAGAGCGAGCGGCGACAGTGTATCGTGTAATTCCGAGAGCGATGATGAATCGGATATGGATGAAACGGAAGTGAATGCAACGGAAAAGAAAACGATACAATTGACCGATATTGGTAGATTCAGGATATGCAGTGATGGTATTCAAGAGAACATGTCGATGGAGAATGCCAAGGACCAGAAAACCGAGCTTACCGTAGCGACGGCTGCAGAACAGactaaacaaaataattttaatgattttatgaCTCCCAGCGCACCTGTAAACAAATTTGAGACGTTTCATCTCGAGGAATCTGCTAAAGCTGCGACGAGCGATGTAGACTTATCTGACGATAATGAAGATGATGATAACGAGGACGATGATGAGGAAAATAATGAGAatgaagaagaggaggaagatgGTGAAGAAGGCGGTGAGGATGAGGATGAAGGTGAGGCTGAGTCTGAAAGTGAGAATGAGGACGAGGATGAGAATGAGGGTGAAGCTGAGGCTGAGGCTGAGGCTGAGagtgagggtgagggtgaggcCGAGGCCGAGGCCGAGACCGAGGCTGCAGCTGAAGGAGAGGgtgaagaggaagaagaggaggaggaggaagaggaggaggacgaTGAAGACGACGATGACGGACCACCTGTGTTGAGTCCAATAATGCCTTTGTTACCAGAAAACGAATCTGAGGAGCACAGCGGTACGGATCGCACGAGGCACAAGCTCAGCCCAATGGTTTCGCTGAGTATGGCTAAAGATCTGGAGGAGTGTGAAATAACCGAGATACCAAATGATGTGGAGAACATGAATGCAGCAAACTTCTTTGCAGCTAATAACAATGATCTGCCCGTAACATGGGACGAGAACTTGGACGATAACGGAATCGACAATGGTGATTGCAATTCGGACATTGGAGACAGAGATGTAGAGAAAAATGAGGACTACAGTAAGGAGTATGACAAAATGGAAGTGAATGAGGGCGAAGATTTTGAGGATTCCGCGGATGAGAACATAGTGGACAATTGTAGGGAAGGTGATGGAGATAATCAAGTGCATGAGATGCATAATCTGGATGGAACTGTGTTAATGGTGACTAATGACGCGGATGGTAATCAGATTTTGATAGAACAAAACGTGCTAGATATCGATAACGAGGATTCTAATGTCGAAGCGACGCAATATATTTACCCAGAGAACACTTACGAAATCACGGAGGAAGATTACGCGACGCGAAACGAAACCGATGTCATGCAGACGGATGAGATGCAAGTTGGTATGTCGTACGTTCAAGATGCATCGGAGAACGAGGATAATAGTATGGAAGGTGTTGTCGAAAGAGAACAGCAGTGA
Protein Sequence
MATISTATAAAGTTTTTSTSTSTSTAATASSATAMATEPRKSYHDLCRLCASFDAIRMQIFGQEGRNRQLVDKIQLFLPFKVMEDDCLPKVLCYRCMFNLENFYDFRTACINAAAWLERNRPKEGASDDDANDNAQISEGHTELLNGKENMPVLIPEAPVVNPNAALGTPPRLNSDGEADPEIEEILDTSEGTDEATVIDDSEDRRSEYEMAYEMDMETNPSDFLEMTPMVTEENEEECDTNNASAATTQDTTVFPHTSQQHEVYVCSLCNKAFSSKGHLSLHARIHVGEGDVIGERVITDDHTSYQRPYQCDLCHKSYSTAKHRWGHVSTTHRGHPAVTCAYCSRIYSTRYNLDEHIKSRHAGLPPPPELSVPLSRTETRYQCQTCPMVYTDLADFNAHRQICVQEQRTDLLGQTEAQNNKIFADTSDISSVDSDDENKDFRSAEAKLAKNPQLTILKQALTKGDNLKRNFDDDGSTSSGKPKKIVKSEEDETNPQKRWYCEACPQSFTSVDSLKEHELRHDAEKPFICILCNKDFALKSSLVRHITMSHGVDPTPIIDSDKCLKTTVMSQNWNDRVDVSVYEQSEIKEPPELSSSSEINLENDDKDYKNHENIEIETVFVCETCKRDFNDRASLWLHIRAMHKEYAAFTCGVCLKMCFNNTQLQNHVFMYHGRSKLLISEQRRYSCTICGRQHDSRKKLITHVSIHNIDPAYDPASFVQLNSNYYNENLNGNEGNEQVLDFDGEDGEKVDCYICYKSFPTEDHLIRHQRNAHKSDQIVSLGDGTGNGNTLSVNGNGNRAQYHLFFVCEVCGSSHSSKWERWLHINDTHNNESSIKCDWESCGKIFATKSLRNEHLQHHAVQGPSPNTCEICGKLWPTRVDYWKHVMGVHADTVPLICGVCLKVFSDVMQLSAHVKAKHWPLTSGDFCCDICGRPYSNKSKMSRHRKIHGLEAATAAAMDVTCDNSSFNETTNESVKPEHSNGASELELNCEQCPELSFTTLDSLCNHRRIAHNLFPCDLCNKCYGRTSHLWKHVNRVHKGHADVTCPYCSKTSASRDHLAAHIAKIHRFMPAMGKDTQNCVTSKSLSVEDGVLHYCEKCNKGFHKRYLLRRHMKGCQNYRKDPGALLTRCRACERIFKDRASLQKHIENHHSTYTCHLCNETITSKLGIMTHNRVNHMDHPDLTCDHPSCKKLFRTKEDLEAHRKEHKYHSNPNVCDFCGDTVENKLKLKMHVLSLHRNEIGVSCGVCLIPMKDPKDLKKHVEAEHSSVLSNPNTCQVCGKQYASKWKAFDHTKKCHGKVFITCKQCLAVFTNENDIRDHYEHVHNVPKDQLAIFEYRMDIGVKREGYETPDIIVKEEPDDLEFDEELCDESSNDSRKRRRSPNDTYDCEMCPEIFLNSDTLAKHYQNVHNTDPVRMFKKFKKDNGDSKRKMRNRNNFECKNCKKQFSTKTLFWNHINACSRRNSIGRFDMPNNVPTSILESHLKNNNQIPREESVPLTNESNLNIPDFNLFEDINLQLSAQKPVPNLMPLSQMKTTGNSKCSRKDSRKVYDESTNTECTCEVCGKPWPAKKHLWQHLIRFHRAEAAVTCGVCLKLCKSYQDLADHLKAEHAPVLSPEGNNFTCKTCGRYHNARSKLLLHMSIHIGNFRCQKCQQGFASEEKLTEHTTSCNGKSEFEDNAVADEDNAKNDNDEKGSLIADETSVIEEAEEADFDSEGERSRDIHDEDNSENSEEDNSDNSDDSDSDSNSSSSEDENENEEGEEEEEEEDENENENESDTRTTSRASGDSVSCNSESDDESDMDETEVNATEKKTIQLTDIGRFRICSDGIQENMSMENAKDQKTELTVATAAEQTKQNNFNDFMTPSAPVNKFETFHLEESAKAATSDVDLSDDNEDDDNEDDDEENNENEEEEEDGEEGGEDEDEGEAESESENEDEDENEGEAEAEAEAESEGEGEAEAEAETEAAAEGEGEEEEEEEEEEEEDDEDDDDGPPVLSPIMPLLPENESEEHSGTDRTRHKLSPMVSLSMAKDLEECEITEIPNDVENMNAANFFAANNNDLPVTWDENLDDNGIDNGDCNSDIGDRDVEKNEDYSKEYDKMEVNEGEDFEDSADENIVDNCREGDGDNQVHEMHNLDGTVLMVTNDADGNQILIEQNVLDIDNEDSNVEATQYIYPENTYEITEEDYATRNETDVMQTDEMQVGMSYVQDASENEDNSMEGVVEREQQ