Basic Information

Gene Symbol
-
Assembly
GCA_963575645.1
Location
OY754475.1:21461407-21473217[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 26 6 1.2e+02 3.9 0.6 7 20 267 280 261 282 0.83
2 26 8.5e-07 1.7e-05 25.5 1.8 1 23 289 311 289 311 0.99
3 26 0.00016 0.0033 18.3 4.3 1 23 317 339 317 339 0.99
4 26 3.3e-05 0.00068 20.5 4.0 1 23 345 367 345 367 0.99
5 26 3.3e-05 0.00069 20.5 1.5 1 23 373 395 373 395 0.99
6 26 2.4e-05 0.00048 20.9 1.8 1 23 401 423 401 423 0.99
7 26 2.3e-05 0.00046 21.0 4.1 1 23 429 451 429 451 0.98
8 26 3.5e-05 0.00071 20.4 2.2 1 23 457 479 457 479 0.99
9 26 1.1e-05 0.00022 22.0 1.1 1 23 485 507 485 507 0.99
10 26 6.4e-07 1.3e-05 25.9 2.2 1 23 513 535 513 535 0.98
11 26 4.2e-06 8.6e-05 23.3 3.3 1 23 541 563 541 563 0.99
12 26 0.006 0.12 13.4 0.9 1 23 569 591 569 591 0.96
13 26 0.00014 0.0029 18.5 4.1 1 23 597 619 597 619 0.99
14 26 0.0027 0.056 14.4 3.3 1 23 625 647 625 647 0.98
15 26 0.00017 0.0036 18.2 1.6 1 23 653 675 653 675 0.99
16 26 2.2e-05 0.00045 21.0 3.0 1 23 681 703 681 703 0.99
17 26 3.9e-06 8e-05 23.4 0.7 1 23 709 731 709 731 0.98
18 26 0.0019 0.039 14.9 1.6 1 23 1019 1041 1019 1041 0.98
19 26 3.3e-06 6.7e-05 23.6 2.4 1 23 1047 1069 1047 1069 0.99
20 26 2.2e-05 0.00046 21.0 4.6 1 23 1075 1097 1075 1097 0.99
21 26 5.3e-06 0.00011 23.0 1.9 1 23 1103 1125 1103 1125 0.99
22 26 0.16 3.3 8.9 2.7 1 23 1245 1267 1245 1267 0.90
23 26 0.00017 0.0036 18.2 1.3 1 23 1273 1295 1273 1295 0.98
24 26 0.00054 0.011 16.7 1.0 1 23 1612 1634 1612 1634 0.97
25 26 8e-05 0.0016 19.3 1.2 1 23 1640 1662 1640 1662 0.99
26 26 0.46 9.4 7.4 1.8 1 13 1668 1680 1668 1684 0.89

Sequence Information

Coding Sequence
ATGGAAGAAGTATCGGCAAATATGTTCGATTTAAATACAACGTGTAGACTGTGTTTGTCAAAAGGCGTTGCGACGCTAGATATATTTGCCTTTTCAACGGCAATGATGGACAAAGTAGCCGATGCCATTatgaagtgtgctccgattcagATAACGAGATGCGATGAGATGCCAAATTTAATCTGCAATTCATGCAACACTCAACTGGAGAAAAGCCACCAGTTCCAGCAGCAAATAGTGCAGTCTGACCAGACACTGCAACGCTATCTACAGCAACTACGAACTACTAACAAACAAATCAAGGTTGAATGTTCCATTCAGGGGCCCTCCGAGGATTCTTTAATTCATAACACTGATACAGATATCACGATGAAATGTGAATCTCCGGAATTTCAAGACGCGTCTGATATAAATATCAACGTAAAAGAAGAGGTGCCTTCAAATCATAGTTCTGCAAACGCACTACACgaggaattatttttaaaatctgaaatcaaagaaGAACCGATAGATAATTATTCATCAGTTCCAGTCGAATGCACTGATAACTCAATAGATATTTTATCTGGGAGTGATGATTGTAAAACCAGCCGTGAAATTAAAAGAGAACCCTTAGATGTTAATTCATCAATTACAACGGAATGCACTGCTAATTTAATAGACACTTCATCTGAGGGTGGCCAAGGAATTGAAGTCGAGTCAAAAAATTCAGAAATCGAACAACAATCGCATAACTTCACGTCCCGTTCTGAATTAACagtgcatactggtgagaaaccatatcaatgtgacgttaCTAATCAGTGTTTTAGCCAATCACAAGATTTGAAAAATCACCAACACTTAAATACTGATGAGAAGCCatttcagtgtgacgtttgtaaaaagggTTTCACTCGATCATatgatttgcaaaaacatcagcgcatacatactggtgagaagccattcaagtgtgacgtttgtaatatcTGTTTCAGAGAATCgcgttatttgaaaaaacatcaacgtgtacatactggtgagatgCCCTTTCAGTGTAATgtatgtaataagagtttcagcgAATCAACACATATGAAAAATCACCAACGcgtacacactggtgagaaaccataccagtgtgacgtttgtaataaatttttcagCGAATCATCACGTTTAAAAGACCACCaacgcgtacatactggtgagaaaccataccagtgtgacgtttgtaataagcgttttAGAGAATCAGGTCATTTGACAGGTCATCaacgcgtacatactggtgagaagccatataagtgtgaagtttgtaataaatgtttcaacaAATCTGATACTTTGAGAAAACATCAActcatacatactggtgagaagccgtatcaatgtgacgtttgtaataagcgttATAGAGAATCAGCACATTTGAAAGATCACCaacgcgtacatactggtgagaggccatatcagtgtgacgtttgtaaaaagggTTTCACTCGTTCAACTGATTTGCGAACACATGAacgcatacatactggtgagaagtcGCATAAGTGTGACgtatgtaataagagtttcagcaCATCAGGTAATTTGAGAATACAtcagcgtatacatactggtgagaagtcgtataagtgtgacgtttgtaataagtgcttCACGACAAATGGTTCTATGGAAAGACATCAGCGTATACACATTGGTGAGAAGCCACATCAATGTGACGTATGTAATAAAAGTTTCAGCGAATTAGGTATTTTAAAAAGACATCAACTTggacatactggtgagaagccgtatcgatgtgacgtttgtaataagtgttttggATTATTAGCACATTTGAAAAGACATCAACGcgtacataccggtgagaagccATTTCAATGTgaagtttgtaataagtgtttcagtaAATCAGACATTTTGAGATTGCATCAGCTCaagcatactggtgagaagccgtaccggtgtgacgtttgtaatatgtGTTTCAGTGCATTAAACAATTTGAGAATACATCAgcgcatacatactggtgagaagccatatcagtgtgacgtttgtaataagtgtttcagtaTTTTAGATACTTTGAAAAAACACCAACGTGTACATACTGGGGAGAAGCCATATCAAtgcgacgtttgtaataaaagttTCAGCGAATTAGGTAGTTTGAAAAGACATCAACGTggacataccgTAAATTCTGACTCCAAGTGGAAGCCTAAAATTCACACAATGGAAGAAGTATCGGCAAATACGTTCGATTTAAGTACAACGTGTAGACTGTGTTTGTCAAAAGGCGTTGCGACGCTAGATATATTTTCCTTTTCAATAACAATGACGGACAAAGTAGCCGATGTCATTATGAAGTGTGCTCCGGTTCAgATAAGGAGATGCGATGAGATGCCAAATTTAATCTGCAATTCATGCAACACTCAACTGGAGAAATGCCACCAGTTCCAGCAGCAAATAGTGCAGTCTGACCAGACACTGCAACGCTATCTACAGCAACTACGAACTACTAACAAACAAATCAAGGTTGAATGTTCCATTCAGGGGACCTCCGAGGATTCTTTAATTCATAACACTAGTACAGATACCACGATGAAATGTGAATCTCCGAAATTTCAAGACGCGTCTGATATAAATATCAACGTAAATGAAGAGCTGCCTTCAAATTATAGTTCTGCAAACGCACTACCCgaggaattatttttaaaatctgaaattaaagaagaaccgatagataattattattcatcagTTCCAGTCGAATGCACTGATAACTCAATAGATATTTTGTCTGGGAGTGATGTCTGTAAAACCAGCCGTGAAATCAAAGAAGAACCCTTAGATGTTAATTCATCAATTACAATGGAATGCACTGCTAATTTAATAAACACTTCATCTGAGGATGGCCAAGGAATTGAAATCGAGTCAAAAAAGTCAGAAATCGAACAACAATCGCATAACTTCCCGCTCCGTTCTGAACTAATAGAAAATTTTACTAAACGTGAAGATGGAACCAAAAATACTTCGaaaatacttcaaaaatataagtgTGATTTCTGTAATAAAGATTTtgcttttttaagtaaattaaatattcacaaGCGTGCCCATACTGGAGAGAAGTTATACCAGTGTGACGTATGTAATAAGAATTTTAGCCAAATAgCTCATTTGAAAAGACATCaacgcgtacatactggtgagaagccatatcagtgtgacgtttgtaataagtgttataaaGACTCAGTACATTTGAAAGAACACCAAcgagtacatactggtgagagaccATATCAGTGTAACGTTTGTAAAAAGGATTTCACAAGATCAACTGATTTGCGAAGACATGAacgcatacatactggtgagaagtcGCATAAGTGTGACTACAAACTTGTACTTGACATAGAAATGTCATCTAAGGATTTTGACCTTGAATCATCAACTAAACAAATCACAGTCAAGAGTGAAATCATCGACATCAATTCAATCAAGGAAGAGGTAGTTGAAGATATCACGTGTGTAGAATACAAgcctttaaaaaacattgaaactGTTTGGCGACAACTGAACGAACAAATATCAGTTAAAAATGAACCTCCAGACGATTCTGACGCGCTACAAGCCGATATAAAACAAGAACTCTTAGATGCCAGTTTATTACCTCAAACAAACGACACTTCGTGCAATATCAAACAGGAAAAGGGAGTGAAACTGCACCATTGCGAAGTTTGTAAAAGATATTTCGCATTTTCTAGTAGTTTGGAAGCTCACAAAGTAGACCATATTGGTGAGAAATCGTTTAAGTGCACCTCGTGCGACAAGATTTTCGACAGCTCAGATTTATTAAAACGCCACGAGCTACGACACACCGtGAATTCCAAAGTTTACAAGTGGTGTGGGGTCCCTCTATGTAAAAACACGTCGATAACAACACCTaacaaattgtttgtttatgtTCCAAACAAGCAAACGGTGAGAGATGAATGGCTAAAACTCGCAAAGCGAAACCCAGCAGATGTACTTCTAAAttcgaatatttatttttgtgaagaTCACTTTGATTTGCCTAATGATATGGAAAATTATATGCAGTATCATTTGATGGGATGGGTATCACAAGTCCGTATGAATCCGGGATGTATTCCTAAAAATTTTGCATGCCAAACAGAGAAACAAACACGAACATCTGACACTATAAAACGATCGTATATACGTAAGAAGCGTAAGAAGCGAAAGATAATTGCTCTTGAAAAATGTAATATGGTTACAGAACGTTCTGATTTTGAAGAAATTGCGTCCGGAAGTTCAGTTGAGCAACAGCCTGTGGTCACTCAAGTACAACATTTTGGAAATTCTACTCACAAATATGTTGCTCCAATGGTTTTTAATGCCACCATAAAACCTGAGCCAACAATGACGGCTATGTCATCAAATAACGTTGAGACAGTTCACATAAAAACAGAAGTTCCTGAGAATTTTGATacagaaattaatataaaacaagaGCCAATCGATGACAATTTAATCTATGAAACAAATACGCATCGAAGCAACGAATTAAAAACGAATAGTCACTATATTAAACAAGAAAACGCCGTCGTGAAATTAGAAACGGAAGTATTTCTTGATGATTTTTCGACAGGCGATAATCAGTTTAACGAACAATTCGAATATGTCAACACTGACGTGAAGTTAAAAACGGAATATAATTCTTCTGTTGATAATTTTGAAAGTCAAGCCGGACAGTTCGTTGATTCTAACGATTCAAAAACAAGTGATAAAATATATGAATGTGAGGACACTGGTAAGAAACCGTACCCCTGTGACGGTTGCAGCAAGTATTTCACAAATACGAGTGATTTGAATAGTCATAAGCAACTTCATGTTAGTGCAAAATCGTATCAGTGTGACGAGTGTGGAAAGACATTTGCACATTTAAGCATTCTGAAGGAGCATAGACGtattcatactggtgagaaaccttatcagtgtgacgtttgtgaaaAATGTTTCTCAAGGTTCGtacaaATTAATACTTCTTCGTCGAGTAATTCGGCGAGTAACGCCGAACGTTCGGGtggtttaaaaagaaaacaaattgctGAATATCATCGGGACTATAGAGCTCGTAAAAAAGCcgagtttgaaaatattttttgtgacgaTAGAACTTCCTGTAATTCTAGACGAAGAAAAACAGCCGCCGAATACCAACGAGAGTATCGAGCACGTAAAAAAGCTAGGCGTCAAAGTATGTTGCTTAATTCTACTAATATTGATTCATCTACAAATTCCGATGCATCGATGAGTGTTAGTTCATCTATGCCTGCAAATCTATCAATGAGTGCTGGTTCTACTTTAAGTAATGTCGGAGATGTAACTAATTCAAGAAGAAGGAAAACAGCCGCACAATATCAGCGAGAGTATAGAGAACGTTTAAAATCGAAACGTCAAAGAATGTTACTTAATTACGCAAATGTTGAACCATCTACGATTGCTGATACATCTACCGATAATAGTACAATTAACCATGAATGTACAAATATTGCCAATACAGgtgaaaaacGGCCGGTGGTCAGTCCAGTACAATATTTCGAAAATTCTGGTCAAAGCAATGTTGATCCAACAGTATTAAATATCACAGTTATGTGTGAACCAACATTGGCACAAATAGCAGAAGATACTAATGATGACAAGTCTTTGAATAACGCTGAGACACTTTGCATAAAAACCGAAGTTCCTGATCATTTTGATACACaagtttatataaaacaagagCCAATACACGACAATTTAATAATTGGAACAAATACGTATCAATGTagcgaattaaaaaataatagccAATATATTAAACAAGAACACATCGATGTAAAATTAGAACCGAAGGTATCTTTTGACGATATTTCGATAGGTGGCCATGATGGTCAATTTGAATATGTCAACACCGATGTGAAGTTAGAAACAGAATATGATTCTTCCGTTGATAACCCTCTGACAGATGGTCAAGACGGacagttaaaaaattcgaataattaa
Protein Sequence
MEEVSANMFDLNTTCRLCLSKGVATLDIFAFSTAMMDKVADAIMKCAPIQITRCDEMPNLICNSCNTQLEKSHQFQQQIVQSDQTLQRYLQQLRTTNKQIKVECSIQGPSEDSLIHNTDTDITMKCESPEFQDASDININVKEEVPSNHSSANALHEELFLKSEIKEEPIDNYSSVPVECTDNSIDILSGSDDCKTSREIKREPLDVNSSITTECTANLIDTSSEGGQGIEVESKNSEIEQQSHNFTSRSELTVHTGEKPYQCDVTNQCFSQSQDLKNHQHLNTDEKPFQCDVCKKGFTRSYDLQKHQRIHTGEKPFKCDVCNICFRESRYLKKHQRVHTGEMPFQCNVCNKSFSESTHMKNHQRVHTGEKPYQCDVCNKFFSESSRLKDHQRVHTGEKPYQCDVCNKRFRESGHLTGHQRVHTGEKPYKCEVCNKCFNKSDTLRKHQLIHTGEKPYQCDVCNKRYRESAHLKDHQRVHTGERPYQCDVCKKGFTRSTDLRTHERIHTGEKSHKCDVCNKSFSTSGNLRIHQRIHTGEKSYKCDVCNKCFTTNGSMERHQRIHIGEKPHQCDVCNKSFSELGILKRHQLGHTGEKPYRCDVCNKCFGLLAHLKRHQRVHTGEKPFQCEVCNKCFSKSDILRLHQLKHTGEKPYRCDVCNMCFSALNNLRIHQRIHTGEKPYQCDVCNKCFSILDTLKKHQRVHTGEKPYQCDVCNKSFSELGSLKRHQRGHTVNSDSKWKPKIHTMEEVSANTFDLSTTCRLCLSKGVATLDIFSFSITMTDKVADVIMKCAPVQIRRCDEMPNLICNSCNTQLEKCHQFQQQIVQSDQTLQRYLQQLRTTNKQIKVECSIQGTSEDSLIHNTSTDTTMKCESPKFQDASDININVNEELPSNYSSANALPEELFLKSEIKEEPIDNYYSSVPVECTDNSIDILSGSDVCKTSREIKEEPLDVNSSITMECTANLINTSSEDGQGIEIESKKSEIEQQSHNFPLRSELIENFTKREDGTKNTSKILQKYKCDFCNKDFAFLSKLNIHKRAHTGEKLYQCDVCNKNFSQIAHLKRHQRVHTGEKPYQCDVCNKCYKDSVHLKEHQRVHTGERPYQCNVCKKDFTRSTDLRRHERIHTGEKSHKCDYKLVLDIEMSSKDFDLESSTKQITVKSEIIDINSIKEEVVEDITCVEYKPLKNIETVWRQLNEQISVKNEPPDDSDALQADIKQELLDASLLPQTNDTSCNIKQEKGVKLHHCEVCKRYFAFSSSLEAHKVDHIGEKSFKCTSCDKIFDSSDLLKRHELRHTVNSKVYKWCGVPLCKNTSITTPNKLFVYVPNKQTVRDEWLKLAKRNPADVLLNSNIYFCEDHFDLPNDMENYMQYHLMGWVSQVRMNPGCIPKNFACQTEKQTRTSDTIKRSYIRKKRKKRKIIALEKCNMVTERSDFEEIASGSSVEQQPVVTQVQHFGNSTHKYVAPMVFNATIKPEPTMTAMSSNNVETVHIKTEVPENFDTEINIKQEPIDDNLIYETNTHRSNELKTNSHYIKQENAVVKLETEVFLDDFSTGDNQFNEQFEYVNTDVKLKTEYNSSVDNFESQAGQFVDSNDSKTSDKIYECEDTGKKPYPCDGCSKYFTNTSDLNSHKQLHVSAKSYQCDECGKTFAHLSILKEHRRIHTGEKPYQCDVCEKCFSRFVQINTSSSSNSASNAERSGGLKRKQIAEYHRDYRARKKAEFENIFCDDRTSCNSRRRKTAAEYQREYRARKKARRQSMLLNSTNIDSSTNSDASMSVSSSMPANLSMSAGSTLSNVGDVTNSRRRKTAAQYQREYRERLKSKRQRMLLNYANVEPSTIADTSTDNSTINHECTNIANTGEKRPVVSPVQYFENSGQSNVDPTVLNITVMCEPTLAQIAEDTNDDKSLNNAETLCIKTEVPDHFDTQVYIKQEPIHDNLIIGTNTYQCSELKNNSQYIKQEHIDVKLEPKVSFDDISIGGHDGQFEYVNTDVKLETEYDSSVDNPLTDGQDGQLKNSNN

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-