Basic Information

Gene Symbol
-
Assembly
GCA_958510825.1
Location
OY294045.1:1949367-1973738[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the following pattern: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the number or range after an X gives the number of residues. The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
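The pattern in the description can be turned into a regular expression to get a rough first look at candidate fingers in a protein sequence. The sketch below is illustrative only and is not how the domains in this record were called (those come from the Pfam HMM via hmmscan). It assumes that both the # and X positions accept any residue, and the names `C2H2_PATTERN` and `find_c2h2` are made up for this example.

```python
import re

# Assumption: '#' (fold-stabilising) and 'X' positions are both treated as
# "any residue", so this is a loose filter, not a replacement for the HMM.
C2H2_PATTERN = re.compile(
    r"(?=("          # lookahead, so overlapping candidates are all reported
    r"..C"           # #-X-C
    r".{1,5}C"       # X(1-5)-C
    r".{3}."         # X3-#
    r".{5}."         # X5-#
    r".{2}H"         # X2-H
    r".{3,6}[HC]"    # X(3-6)-[H/C]
    r"))"
)


def find_c2h2(protein):
    """Return (1-based start, matched text) for each candidate finger."""
    return [
        (m.start() + 1, m.group(1))
        for m in C2H2_PATTERN.finditer(protein.upper())
    ]


# Fragment taken from the protein sequence of this record,
# padded with two alanines on each side for the demonstration.
fragment = "AAYSCSYCGKSFIKREYMVMHERVHAA"
for start, finger in find_c2h2(fragment):
    print(start, finger)
```

A regular expression like this reports at most one candidate per start position and ignores the residue preferences that the profile HMM captures, so its hits will not necessarily line up with the hmmscan domain table.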
Hmmscan Output
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 34 0.19 11 7.6 0.9 3 21 213 231 212 232 0.94
2 34 1.4 78 4.9 0.1 1 23 256 278 256 278 0.97
3 34 0.016 0.89 11.1 0.2 1 23 310 333 310 333 0.96
4 34 0.0032 0.18 13.2 1.9 1 23 339 361 339 361 0.97
5 34 0.16 9 7.9 0.7 1 23 367 390 367 390 0.92
6 34 0.00061 0.035 15.5 1.1 2 23 402 423 401 423 0.97
7 34 4.8e-05 0.0028 19.0 2.2 3 23 437 457 435 457 0.96
8 34 0.00012 0.0068 17.7 0.8 1 23 463 485 463 485 0.98
9 34 1.9e-05 0.0011 20.3 2.0 1 23 491 513 491 513 0.98
10 34 0.047 2.7 9.6 0.6 2 23 603 624 602 624 0.97
11 34 0.14 8 8.1 0.1 1 23 644 667 644 667 0.96
12 34 0.016 0.89 11.1 1.4 1 23 671 693 671 693 0.98
13 34 0.0014 0.081 14.3 0.4 1 23 699 722 699 722 0.98
14 34 0.0049 0.28 12.7 0.3 1 23 728 750 728 750 0.96
15 34 0.061 3.5 9.2 0.8 1 23 756 778 756 779 0.90
16 34 0.013 0.75 11.3 1.1 2 23 791 813 790 813 0.96
17 34 2.7e-05 0.0015 19.8 0.7 3 23 827 847 825 847 0.96
18 34 1.8e-05 0.001 20.3 1.5 1 23 853 875 853 875 0.98
19 34 3.1e-06 0.00017 22.7 1.7 1 23 881 903 881 903 0.98
20 34 0.05 2.8 9.5 0.3 1 22 909 930 909 930 0.96
21 34 0.0004 0.023 16.1 0.6 3 23 1051 1071 1050 1071 0.97
22 34 9.1e-05 0.0052 18.1 2.6 1 23 1077 1099 1077 1099 0.98
23 34 1e-05 0.00059 21.1 1.8 1 23 1105 1127 1105 1127 0.99
24 34 0.00011 0.0064 17.8 2.5 2 23 1202 1223 1201 1223 0.97
25 34 0.098 5.6 8.6 0.2 1 23 1243 1265 1243 1265 0.98
26 34 0.0028 0.16 13.4 2.5 1 23 1269 1291 1269 1291 0.98
27 34 0.042 2.4 9.7 0.4 1 23 1297 1320 1297 1320 0.97
28 34 0.0061 0.35 12.3 0.2 1 23 1326 1348 1326 1348 0.96
29 34 0.11 6.1 8.4 1.1 1 23 1354 1377 1354 1377 0.90
30 34 0.011 0.62 11.5 1.5 1 23 1388 1411 1388 1411 0.96
31 34 0.00011 0.0063 17.8 2.0 3 23 1423 1443 1421 1443 0.96
32 34 6e-05 0.0034 18.7 0.9 1 23 1449 1471 1449 1471 0.97
33 34 3e-05 0.0017 19.6 1.0 1 23 1477 1499 1477 1499 0.98
34 34 0.12 6.8 8.3 0.5 1 19 1505 1523 1505 1526 0.96
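Each row of the domain table has 13 whitespace-separated fields: domain number, total domain count, conditional E-value, independent E-value, bit score, bias, then the start and end coordinates on the HMM, on the alignment, and on the envelope, and finally the accuracy. The sketch below shows one way to parse such rows and keep only the better-supported domains. The 0.01 cutoff on the independent E-value is an arbitrary assumption for illustration, and `DomainHit`, `parse_domain_row`, and `confident_hits` are hypothetical names.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int        # domain number within the protein
    total: int        # total number of domains reported
    c_evalue: float   # conditional E-value
    i_evalue: float   # independent E-value
    score: float      # bit score
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float        # mean posterior accuracy of the alignment


def parse_domain_row(line):
    """Parse one whitespace-separated row of the domain table."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


def confident_hits(rows, max_i_evalue=0.01):
    """Keep hits at or below the cutoff (0.01 is an assumed threshold)."""
    hits = (parse_domain_row(r) for r in rows)
    return [h for h in hits if h.i_evalue <= max_i_evalue]


# Three rows copied from the table above (domains 1, 7 and 19).
rows = [
    "1 34 0.19 11 7.6 0.9 3 21 213 231 212 232 0.94",
    "7 34 4.8e-05 0.0028 19.0 2.2 3 23 437 457 435 457 0.96",
    "19 34 3.1e-06 0.00017 22.7 1.7 1 23 881 903 881 903 0.98",
]
for hit in confident_hits(rows):
    print(hit.index, hit.ali_from, hit.ali_to, hit.i_evalue)
```

With these three rows, only domains 7 and 19 pass the assumed cutoff. Weak individual hits such as domain 1 may still be genuine fingers in a tandem array, so a per-domain threshold is best treated as a convenience for sorting rather than a verdict.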

Sequence Information

Coding Sequence
ATGCAACCCTTTCAACAACAAAAACATGAAGccatttatataaaagaaaatgatgatGAAAATAACGGtgataaaaatgataataatgaCAATTATATTGATGAATATTATTACGATGAAAATGATTATGATGGAAATGATAATGAAGAAAACGACAAGGGAGAAGAAAATCAGAACCATATACTACTTCGTGTAGCAACTAATTATACATCAATTCAAGAGCATATGTCAATTACTGATGGTGATTCATTACAAGAAAGAAGTCCAATAGAAGAATACATAATAGAAAAGAATAAAGTAAATAGGAAAAAGAGTAAAAAGCAAAGTCAGTCCCTCAAGAGAAACGCAAAAGTAAAGGtattaaaaatagaagtagTCGAAGACaacgatgatgatgatgattataTTCCAGAAAGCAGAATTGATATAAAAGCAGATTCATTACTGGACCTAAAACTGGACTTAGACCTTCTGGAAAAACCGAAACAAAAACCGaaacaaaaaccaaaaccaaaaaaggaagcagtcctcaaagaaagaaaaaaaaccaTAAAGGAAGCAGTCCTCAAAGAAAGCAAAAAAACCATAAAGGAAAGCAACAAAGTAACAAAAGAAAGCAGAATGAAATTCTGGCTGTGCAAGGGATGTCACTCCGATTTCACCACCAAGCGAGATCTGGTAGACCACAAAAAGATATGTGTAGCAAGCAAATTAGAAGAAAACTGTGAAGAGGAGAAATACGTTTACGATGTAGACTCAGAAGTATATGTTTGTAACAATTGTTCTGCTGAATTCCAGGAAGAAACAGAAGTAATAGAACACTACGAAACACACATAGAAAAATATGTCTGCGAAATTTGCAATGTTACAGTAACTGGAGCGTTTTTATTCGCCTGTCATAGCCAAAAACACAGAGTGGATCAGGTGTTTGCTTGTCCTGTTTGCAGTTACACTACAAACAGGAAAACCGGAATGCTTGTCCATATCAACAGGATGCATTTAAATAAACATGACTTCCAATGTGATACTTGCGGAAAGTGTTTCAACGATGGTTTATCTTTTAAAGAACACGAAAACATTCACCTAGGTCTTAAaccttttgtttgtattgtttgTAGTAAAGAGTTTCCTTACTCCAGATACCTAGCAACACACCAGTTCCGTTCTCATACGGTTAGAGTTATAGACGAAAACTCAAAAACTCAATGCAAGTATTGTAACAAGATCTTCGCTAAGGAAATTACGTTAAAAAAGCACGTTGCAAAACACGAGATCATCCATCAAGGTCCTTATGATAAGAAACACCTATGTGATGTTTGTGGGCAAGGCTTCTCTAGGAAAGACAAATTAAAAATCCACTACAGAAAGCATAGCGGAGAGAAGCCGTACTCTTGTTCATATTGCGGAAAGAGTTTTATAAAAAGAGAATACATGGTAATGCATGAAAGGGTACACAGTGGCGAAAAACCATACAGCTGTCATATTTGCGGAAAATGCTTCAATCAAGGAGCTCCTTTAAGGATTCATATTCGCAGCCATACGGGAGAAAGGCCatacatTAATAATTTAGAGATTGAGTTAAAAGAAGAAGATGGAGAAGTCCATTTTCCAATGAGAACAAGACCCGCAAGAAAACGAACTATATCATACGAATCAAtaaaagtagaagaagaagatttagAAAGCGATGTAGAATACATTCCCAGCAGCGTTAAAAAAGCTATTAGCAAAGGAAAACAAAAAATGCCAACCAAAAAgcgagaaaataagaaaaaacccGAAAAGAAAAGGGTTGACAAAAAAGTAACATGTAGGCAATGTAACGAGGAATTTAGTAgcagaaaaatattatcagtacacataaaaatgcataaaaaggtCAAAGAAGAGGAAAAACATACTTTTAAGTACGATGAAGAACAAGACCTTTACATATGCAACACTTGTTTGGCAGAATTCCCTTCAAAAGATCAAGTAGAGTTGCACATTGCCAAAACGCA
CGTAGAATCATTCGTTTGCGAAGTTtgcaataaattatgcaacagtGTTTACAACTTTGCTATTCACCTTAAATCTCATTCACAAAGTGAAGGCTACAAGTGTCCCTTGTGCACTTATATTTGTGCAAGAAGGGCTGGACTAGCAACCCATATTAACAGAGTCCATTAtcataaatattactattactgCGAAAAATGCGGAAAAGGGTTTAACGATGGTGTTTTGTTTAAAGAACATGAGAACGAGCACCTTGGAATTAAACCTTTTATATGTATTGTTTGTGGTAAGGATTTCGCGTTCTCTAGATACCTTTTGGTTCATCAAGTCAGACACCACACTGTGCCAATAGATGGTCTTCTACATAAAACCCAATGCAGTATTTGTATGAAAGTATTTAGTAAAGTCCAAACTTTGCTTAGGCATATCACCACAAAGCATAACCCCATAAAAGGCTTATCCGAAAAAGTACCTCACTTATGTGACATTTGTGGTAAAGCATTTGGTTCTTCAGATAAACTAAAAATCCACTACAGGGTTCACACAGGTCATAAACCTTTCTCCTGCTCGTTTTGCGGAAAAAGTTTCACCAAAAAAGACTATCTGGTGATGCATGAAAGAGTACACACAGGTGAAAAACCTTACAGTTGCGAGTTTTGCGGGAAATGCTTTAATCAGGCATCATCCCTGAGGATCCATGTACGTGGACATACTGGAGAACGTCCGTATATTTGTCAGTTTTGTAATAATGGTTACATTTCTAGAGGTACTCTTAATCTGCACATGAAAACATATCCAATAAATAATCGAGAATCAATAAATAATCCAGATccaataaataatacagatccaataaatgaagaaaaagaagaagaaaaagaaattaaGATTGATATTGATGAATTGAATGAAGTAGAAATGTTTAATAATGAGCAAAATGAAAAGGAAAAAGCCAATCAAACTAACGGAAGTGGAATAATAGAGGAATCTTCCGTTGACTCATTTGATTCAAGCGAAGATGAAGAATTAAGAAGAAACAAGGCACAggcaaaaagaagaagaagaaaaagcgAAGAAGATTATTCGGCGTCAGATTCTCCCCCAGATGAGGGttattatagaaaaaaacagaaaaaaccaaaaagaCGTATGTGCGAATATTGTGCCAAAGTGTTCAGAGGTACATACAACCTTCGATTGCACCAGCGTCTACATACAGGGACGAAAAACTACTCGTGCACTTTTTGTTCGAAACCTTTTGCCCAAAAGGAACACTTAAAAATGCACGAAAGAATCCATACTGGTGAAAAGCCCTATGAATGCCATATTTGTGATTCACGGTTTAACCAGAGAACAGCCTTGAGAGTTCATATTAGGACTCATACGGGAGAACGGCCATATTCTTGTCACATATGCCGTAACAGGAAATTAAGTAAGAAGTATTCTCtaagaagaaataaaaatgtcagCCAGACTAAAGTAGATAAGGATCAGGAGTGTGTTCCAGAAAATATGAAAGACAGGTCTGCCATAGAATATAAactcattaaaacaaaaacagctTTAAAACTGAACATAATAAAGATCAAAAAGGGCAAAACTTGGTCGTGTAAAAAATGTGGTGAAAAGttcacaacaaaaaaaacattagtagaacatagaaaaaaacattatatagaTGAAAGCAGTGAGAAACACACATACAAATTTGACCCTGTCCAGGATCTCTATATTTGCAACACGTGTTCCGCAGAGTACCAAGTCAAAGAGGAAATCGAAAAACACATGAAAATTCACGATGAAAACTACGAATGCGAAGTATGCcatgaaaaattcaaaaaagccTATACTTTTGCCACTCACGTTTATACTCATAGTTCGGATAAAATATACAGATGTCCTTTGTGTAGCTACAATTCCATTAAAAGAACATCACTTCTTGTGCACATAAACTACGTCCAcctaaagaatttttattacatttgcaCCACCTGTGGTAAAG
GTTTTAATGATGTAGTCATGTTCAAAGAACATGAAAATGAACATTCCGGAGTCCGCCCATTCATTTGTATAGTTTGCAACAAGGATTTTTCTTATTCAAGATATTTATACACTCACCAAGTAAGATCTCACAAAGTCGGAATAGAAGGAGAACTTTTACCTCACCAATGCACAGTTTGTAACATAATCTACGCAAAAGATTCAACGTTGGAAAAACATTACAGagaaaagcatttaaaaagcGGTCCACACGAAAAAAAGCATTTATGCGACACTTGCGGAAAGGGCTTTGCTCAGAAAAACAAGCTGCGGCTGCATGTGAGAACGCATACAGGATACAAACCGTTTGCTTGCAGCTACTGCGAAAAGTCGTTCACCAAAAAGGAATACCTGGTGATGCACGAAAGGGTACACAGTGGTGAGAAACCATACAGCTGTGAATATTGTGGAAGGTGCTTTAGCCAAGGAGCTCCTCTGAGGATTCATCTTCGCAGTCATACAGGGGAGAGGCCCTATATTTGTCAGATATGTAATTCTGGTTTTACATCTAGGGGTGCTCTAAATATGCACTGTAAAAGTTGTCCTGGTGGAAGAACTTTACAACTGCTCtga
Protein Sequence
MQPFQQQKHEAIYIKENDDENNGDKNDNNDNYIDEYYYDENDYDGNDNEENDKGEENQNHILLRVATNYTSIQEHMSITDGDSLQERSPIEEYIIEKNKVNRKKSKKQSQSLKRNAKVKVLKIEVVEDNDDDDDYIPESRIDIKADSLLDLKLDLDLLEKPKQKPKQKPKPKKEAVLKERKKTIKEAVLKESKKTIKESNKVTKESRMKFWLCKGCHSDFTTKRDLVDHKKICVASKLEENCEEEKYVYDVDSEVYVCNNCSAEFQEETEVIEHYETHIEKYVCEICNVTVTGAFLFACHSQKHRVDQVFACPVCSYTTNRKTGMLVHINRMHLNKHDFQCDTCGKCFNDGLSFKEHENIHLGLKPFVCIVCSKEFPYSRYLATHQFRSHTVRVIDENSKTQCKYCNKIFAKEITLKKHVAKHEIIHQGPYDKKHLCDVCGQGFSRKDKLKIHYRKHSGEKPYSCSYCGKSFIKREYMVMHERVHSGEKPYSCHICGKCFNQGAPLRIHIRSHTGERPYINNLEIELKEEDGEVHFPMRTRPARKRTISYESIKVEEEDLESDVEYIPSSVKKAISKGKQKMPTKKRENKKKPEKKRVDKKVTCRQCNEEFSSRKILSVHIKMHKKVKEEEKHTFKYDEEQDLYICNTCLAEFPSKDQVELHIAKTHVESFVCEVCNKLCNSVYNFAIHLKSHSQSEGYKCPLCTYICARRAGLATHINRVHYHKYYYYCEKCGKGFNDGVLFKEHENEHLGIKPFICIVCGKDFAFSRYLLVHQVRHHTVPIDGLLHKTQCSICMKVFSKVQTLLRHITTKHNPIKGLSEKVPHLCDICGKAFGSSDKLKIHYRVHTGHKPFSCSFCGKSFTKKDYLVMHERVHTGEKPYSCEFCGKCFNQASSLRIHVRGHTGERPYICQFCNNGYISRGTLNLHMKTYPINNRESINNPDPINNTDPINEEKEEEKEIKIDIDELNEVEMFNNEQNEKEKANQTNGSGIIEESSVDSFDSSEDEELRRNKAQAKRRRRKSEEDYSASDSPPDEGYYRKKQKKPKRRMCEYCAKVFRGTYNLRLHQRLHTGTKNYSCTFCSKPFAQKEHLKMHERIHTGEKPYECHICDSRFNQRTALRVHIRTHTGERPYSCHICRNRKLSKKYSLRRNKNVSQTKVDKDQECVPENMKDRSAIEYKLIKTKTALKLNIIKIKKGKTWSCKKCGEKFTTKKTLVEHRKKHYIDESSEKHTYKFDPVQDLYICNTCSAEYQVKEEIEKHMKIHDENYECEVCHEKFKKAYTFATHVYTHSSDKIYRCPLCSYNSIKRTSLLVHINYVHLKNFYYICTTCGKGFNDVVMFKEHENEHSGVRPFICIVCNKDFSYSRYLYTHQVRSHKVGIEGELLPHQCTVCNIIYAKDSTLEKHYREKHLKSGPHEKKHLCDTCGKGFAQKNKLRLHVRTHTGYKPFACSYCEKSFTKKEYLVMHERVHSGEKPYSCEYCGRCFSQGAPLRIHLRSHTGERPYICQICNSGFTSRGALNMHCKSCPGGRTLQLL

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-