Basic Information

Gene Symbol
-
Assembly
GCA_018231745.1
Location
DVQP01004495.1:1-7945[+]
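The Location value above can be split into its parts programmatically. The following is a minimal sketch, assuming the string follows a `contig:start-end[strand]` layout with 1-based inclusive coordinates; that reading, and the names `Location` and `parse_location`, are illustrative assumptions rather than anything the record documents.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed Location field."""
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: <contig>:<start>-<end>[<strand>]
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a Location string into contig, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("DVQP01004495.1:1-7945[+]")
# Span length under the 1-based inclusive assumption.
span = loc.end - loc.start + 1
```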

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers give the number of residues (a range where shown in brackets). The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
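The pattern in the description can be written as a regular expression. The sketch below is a loose translation, not the profile HMM search used for this record: it treats both X and the # positions as "any residue", so it will accept sequences that a Pfam model would reject and may miss divergent fingers. The names `C2H2_RE` and `find_c2h2_candidates` are hypothetical, and the example fragment is copied from the protein sequence of this entry.

```python
import re

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# X and # are both written as "." (any residue) in this simplified form.
C2H2_RE = re.compile(
    r"."        # '#' position
    r"."        # X
    r"C"        # first zinc-coordinating Cys
    r".{1,5}"   # X(1-5)
    r"C"        # second zinc-coordinating Cys
    r".{3}"     # X3
    r"."        # '#' position
    r".{5}"     # X5
    r"."        # '#' position
    r".{2}"     # X2
    r"H"        # first zinc-coordinating His
    r".{3,6}"   # X(3-6)
    r"[HC]"     # final His or Cys
)


def find_c2h2_candidates(protein: str) -> list[tuple[int, int, str]]:
    """Return (start, end, text) for non-overlapping candidate matches.

    Coordinates are 1-based and inclusive.
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_RE.finditer(protein.upper())
    ]


# Fragment taken from the protein sequence in this record.
fragment = "FKCDVCGKAYATQNSLRKHMKKNH"
candidates = find_c2h2_candidates("MMM" + fragment + "GVE")
```

Because `finditer` reports non-overlapping matches only, closely packed fingers may be merged or skipped; the hmmscan table remains the more reliable source for domain boundaries.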
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 29 0.074 6.1 8.2 0.1 1 20 112 131 112 133 0.94
2 29 0.049 4.1 8.7 0.3 2 22 161 181 160 183 0.89
3 29 2.8 2.3e+02 3.2 3.3 1 23 206 228 206 228 0.97
4 29 0.002 0.16 13.2 0.8 1 23 232 254 232 254 0.98
5 29 0.02 1.6 10.0 1.2 1 23 259 282 259 282 0.92
6 29 0.043 3.6 8.9 0.4 2 23 290 312 289 312 0.93
7 29 0.00075 0.062 14.5 6.1 1 23 319 341 319 342 0.95
8 29 0.007 0.58 11.4 1.2 1 23 348 370 348 370 0.95
9 29 2e-06 0.00017 22.5 2.0 2 23 377 398 376 398 0.96
10 29 6.8e-06 0.00057 20.9 2.2 1 23 404 426 404 426 0.98
11 29 0.013 1.1 10.6 2.8 1 23 520 543 520 543 0.94
12 29 1.3 1.1e+02 4.2 0.1 2 23 569 591 568 591 0.89
13 29 0.043 3.6 8.9 5.1 3 23 616 636 614 636 0.96
14 29 0.0012 0.1 13.8 0.6 1 23 640 662 640 662 0.97
15 29 0.11 9.1 7.7 0.2 1 23 669 692 669 692 0.91
16 29 0.0026 0.21 12.8 1.3 2 23 700 722 699 722 0.95
17 29 1.7e-06 0.00014 22.8 1.2 2 23 728 750 727 750 0.97
18 29 2.1 1.8e+02 3.6 1.0 1 23 756 779 756 779 0.89
19 29 3.2e-07 2.6e-05 25.1 1.2 1 23 785 808 785 808 0.98
20 29 0.0069 0.57 11.4 0.0 1 23 900 923 900 923 0.94
21 29 9.4 7.8e+02 1.6 0.4 2 23 950 972 949 972 0.93
22 29 0.0033 0.28 12.4 0.8 1 23 994 1016 994 1016 0.96
23 29 0.0003 0.025 15.7 0.3 1 23 1020 1042 1020 1042 0.98
24 29 0.081 6.7 8.1 1.1 1 23 1047 1070 1047 1070 0.90
25 29 0.0058 0.48 11.7 1.8 2 23 1078 1100 1077 1100 0.95
26 29 0.00011 0.0093 17.1 0.8 2 23 1108 1130 1107 1130 0.97
27 29 0.0022 0.18 13.0 3.8 1 23 1135 1157 1135 1157 0.96
28 29 2.1e-06 0.00017 22.5 2.3 1 23 1163 1185 1163 1185 0.99
29 29 0.0028 0.23 12.7 5.7 1 23 1191 1214 1191 1214 0.98
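The domain table above can be parsed into records and filtered by significance. The following is a minimal sketch, assuming each row carries exactly the 13 whitespace-separated columns named in the header. The `DomainHit` class, the `parse_domain_row` helper and the 0.01 independent E-value cutoff are illustrative assumptions, not thresholds taken from the database. The four sample rows are copied from the table.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """One row of the per-domain table (hypothetical field names)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(line: str) -> DomainHit:
    """Convert one whitespace-separated table row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 columns, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# Rows 3, 4, 9 and 19 from the table in this record.
ROWS = """\
3 29 2.8 2.3e+02 3.2 3.3 1 23 206 228 206 228 0.97
4 29 0.002 0.16 13.2 0.8 1 23 232 254 232 254 0.98
9 29 2e-06 0.00017 22.5 2.0 2 23 377 398 376 398 0.96
19 29 3.2e-07 2.6e-05 25.1 1.2 1 23 785 808 785 808 0.98
"""

MAX_I_EVALUE = 0.01  # assumed cutoff, chosen only for illustration

hits = [parse_domain_row(row) for row in ROWS.splitlines() if row.strip()]
confident = [h for h in hits if h.i_evalue <= MAX_I_EVALUE]
```

Filtering on the independent E-value rather than the score is a design choice made for this sketch; a different cutoff will keep a different subset of the 29 domains.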

Sequence Information

Coding Sequence
AATACGATAAAACTTGAGGTGTCGAGGGGAAATGAGGATTCCGATAATGACGAATCAGATAAACTGTCAGATTACGAGGTAACTATCAAACAGGAGAAGGAAGATGAAAAGCCGAAGAAGCGTCAGCCCAAAGCCACGACCTCCAAGATAAAGAAAGCCAAAACGGAAGGGGAACCCTCGAAACGAACGGACGGCGAACCGCCCAAACgcaagaaaaagaaaaagaaatcgGAAGTAAGCGCGACACCCGAGCGCATTGAACACAGGGTCAACCTGACAGCCATATTGCAGTTCTCCAACGCGAGTCCGTTCCGGGATAAGACGATGAAAGGCTTCTCCTGCCTCTACTGCGCCGAGAGTTTCCAAGATATTGACGAACTCAGAGCGCATACGTCCCAGCAGAATGAGAAAGACAAAATTAACACAATGCTTGATTACAAACTTAGCTATAATCCGATAAAACTGGACATAACTAATCTACAGTGCACCGTCTGCCATAGAGATTTGAAGGATCTGAACGAGCTCAAAGATCATTTAGTAGCTGTTCACAATAAGACCATACACAAGGACATCAAAGACACGATACTGCCATTCAGATTGGAGAACGGCCATAACTTCACATGCGTCATATGTTCCGTCGTGCACATatctttcaaaaatttatatcatcacATGAGTAGTCACTATCGTAACTATTGCTGTAAGAAATGCGGTGCTGGTTACATCACCATAGCTGCCTTGAGGAAGCACGGAAAAACTCACTATCAAGGTCATTTTCCTTGCGACTTCTGTGATAAATCGTATACGTCGCTAACAAAGAAACGGAATCACGAGAAGGGTGTCCACACAGGCGGCTGGCTGAGAAACAAATGTCCACACTGTCCCGAAATCTTCGTCAGCTATTACGACCGCAGCGAGCACTTGGTCAAGGTTCACAACGAGGCACCAGTTGTATATCCTTGCAACGCGTGCAATAAGacatacaaaaagaaattcgAATTAAACAGACACATAAAGCACCACCACTTACAGCAGAGGAGCTTCCTGTGTGACAAATGCAATGCTAAGTTCTTTTCGAAGCGGGGTCTCGTTGATCACATGACACGGCATACGGGTTCAGAGATGTGTTCCTGTGACGTATGCGGTAAAGCCTTCTCCAGGATAAGGACCCTGAGAGAGCACATGCGGACACACGAGGACGACAAACGCTTCCAATGCGAGGTCTGCAAGAAGACGTTCATGCAGAAGAGCAGTCTCAAGAGTCACGTGAAGCTGCATCAGGACGACTTGGACATATTCAAAGAATTTGATGACGTCAAGCATTTGATAGACGACAGGGAGATGACTCTGAAACAAATAGCAGCCGAGAACAAGATGAGGTCTATAGgAACCCGAAAAAAAAGGATAACGCCCGTAAAGGAGGAAAGCGGGCAGACGTTGAATAGTGTTGTGTTCCGTGAAAAGCATCTACGTGAGATGCAGAAGCAGTGGCACAATCTAACTACCCTCCTTAAATATTCTAACGTTACCCCGTTCAAGGACCGAAACGACGCGGGTTATATATGCGCTTATTGCTTTAAGACCTTCCCAGATCCCAATGTCCTAAGACATCACACCCACTATGACCATGTCAAAGAGAAACCAACTTACAAAGCCGGTTCTGGCATTAGcagtttcgtagtttttttGGACATAGTCGATTTAAAATGCACAATATGTGGTTTGTCCATGGAgagtattaattttcttaccgAGCATCTGGTCAGAGAACacgataagaaatattatttaggtgTGACGGACTACTTCCAGCCGTTCATTCTAACTAGCGAGCAGCAAATACACTGCTGTCTGTGTGACGAAGTTTTccataatatgaaattattgatgCAACACATGAACATGCACTACCGTAATTTCATCTGCACCACATGCGGTGCCGGCTTCGTGAACAGTTTCCGCCTGAAGAGGCACGAGACGACAcatttgaaaaagaaaac
cGGCTTCGCTTGCCGGCATTGCGGGCTGGTGTTCGCGGCTGAATCTAAAAAGAAAGCTCACGTGAACGCTGAGCATAAGGGAATAGAAGGTCACAGCGTGTGTCAAATTTGCAAGGCCcgatttaagaattattatcaGAAAACCAGGCATATGATGCAAGTTCATAACGTAGAAGGTATTAAATGTGATAAGTGTGATAAGCGTTTTAATCTCAAATCTAATCTGATGCTTCATATGCGGAGCGTGCATTTGAAAGAGCGGCCATACGAATGTTCGGTATGCAGCatgggattttttattaagcgtCACATGCTAGGTCATTATATGGCCACGCATACCAAtgagagaaaatttaaatgtgacGTATGCGGCAAGGCGTATGCTACACAAAATAGCTTGAGGAAACACATGAAAAAGAATCACGGTGTGGAGAATCAAACAACATTAATCGAAATTGTTATCAAACAAGAACCGATGTCAGACTCGGAACTGCCAGAAGACGCTAAACCGGATacagttttcaatataaaaccaGAGAACGATAGAAGTGAGGAGGTTAAAGATGTCAAGAAAACTGCATCTGAAAAGAGATCCAAAAAAGATCAGATTATCGAAATGgaaaagcatttaaaaaatataagcacaATACTACTTAACACAAACGCTACGCCCATAAGATATCACGACGGTGCCAATTATGTTTGTGCGCTCTGCCCCGAAACATATCCTTTGCCATCAGATTTGAAAGTTCACGTACTGGAAGAACACGATGAAATAGATAAGTCGTCTTTCATGGAAGGCCATAGATTAACTTCCTACATGGTCAAGTTAGATATAACAAATTTGCGTTGTTTGATTTGCCACAACGATGTAGAAAGCTTCGATCCCCTTTTCGACCACCTCAAATCGGTGCACGGAAAAGAAATGCACACTGATATACCAAACCATATCCTGCCCTTTAGATTCGTTGGGAACGGTTTCGATTGCGTCGTGTGTCCCAAGTCCTTTGAGCATTTCAAGCTAGTCCAGGAGCACATGACTGTGCACTACAGGAACTACATCTGCGAAACTTGTAATGCACCTTTTGTTAATAAGCGAACTCTACAGAATCATGCTAACAGGCATAAGAAAGGCGATTTCCCGTGCAGCCAATGCCCAAAAATATTCGACACCAATCGTAAGAAACTGAATCACGAGAAATTTGTCCACGACGGTGATTACAAGAGAAAGAAATGCCCGTATTGTCAGGAGAAGTTCACAAATTACGCAAAGAAGAGATCTCATATGGTTAAAGAGCATGGTGCTGAGCCGTTGTCAGTGAAGTGTGATATTTGCAACAAAATATTCAGCACGCGAGCGAGGTTAAGAGGGCACACTCGGAGAGATCATATGGAGTGTCAGCACGCGTGCCACAATTGTGATATGAGATTCTACACGAAGTTGGAACTCGTAAAACATATGGTTAAACACTCGCCACTGAAGGAATACCAGTGCGACATTTGTAAAAAGGCTTACGCGAGAAAACACACGCTCCGAGAACACATGAAGATACATTCCAATATAAGAAACTTTAAGTGCGACTTGTGctcttttacatttatacagAAATGCAGTTGGAAGTCTCACATGCGCAAAAAGCACAAAATTCAGGTCTAA
Protein Sequence
NTIKLEVSRGNEDSDNDESDKLSDYEVTIKQEKEDEKPKKRQPKATTSKIKKAKTEGEPSKRTDGEPPKRKKKKKKSEVSATPERIEHRVNLTAILQFSNASPFRDKTMKGFSCLYCAESFQDIDELRAHTSQQNEKDKINTMLDYKLSYNPIKLDITNLQCTVCHRDLKDLNELKDHLVAVHNKTIHKDIKDTILPFRLENGHNFTCVICSVVHISFKNLYHHMSSHYRNYCCKKCGAGYITIAALRKHGKTHYQGHFPCDFCDKSYTSLTKKRNHEKGVHTGGWLRNKCPHCPEIFVSYYDRSEHLVKVHNEAPVVYPCNACNKTYKKKFELNRHIKHHHLQQRSFLCDKCNAKFFSKRGLVDHMTRHTGSEMCSCDVCGKAFSRIRTLREHMRTHEDDKRFQCEVCKKTFMQKSSLKSHVKLHQDDLDIFKEFDDVKHLIDDREMTLKQIAAENKMRSIGTRKKRITPVKEESGQTLNSVVFREKHLREMQKQWHNLTTLLKYSNVTPFKDRNDAGYICAYCFKTFPDPNVLRHHTHYDHVKEKPTYKAGSGISSFVVFLDIVDLKCTICGLSMESINFLTEHLVREHDKKYYLGVTDYFQPFILTSEQQIHCCLCDEVFHNMKLLMQHMNMHYRNFICTTCGAGFVNSFRLKRHETTHLKKKTGFACRHCGLVFAAESKKKAHVNAEHKGIEGHSVCQICKARFKNYYQKTRHMMQVHNVEGIKCDKCDKRFNLKSNLMLHMRSVHLKERPYECSVCSMGFFIKRHMLGHYMATHTNERKFKCDVCGKAYATQNSLRKHMKKNHGVENQTTLIEIVIKQEPMSDSELPEDAKPDTVFNIKPENDRSEEVKDVKKTASEKRSKKDQIIEMEKHLKNISTILLNTNATPIRYHDGANYVCALCPETYPLPSDLKVHVLEEHDEIDKSSFMEGHRLTSYMVKLDITNLRCLICHNDVESFDPLFDHLKSVHGKEMHTDIPNHILPFRFVGNGFDCVVCPKSFEHFKLVQEHMTVHYRNYICETCNAPFVNKRTLQNHANRHKKGDFPCSQCPKIFDTNRKKLNHEKFVHDGDYKRKKCPYCQEKFTNYAKKRSHMVKEHGAEPLSVKCDICNKIFSTRARLRGHTRRDHMECQHACHNCDMRFYTKLELVKHMVKHSPLKEYQCDICKKAYARKHTLREHMKIHSNIRNFKCDLCSFTFIQKCSWKSHMRKKHKIQV
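The coding and protein sequences above can be checked against each other by translating the nucleotides. This is a minimal sketch, assuming the standard genetic code and a reading frame that starts at the first base; the `translate` function is hypothetical. The coding sequence in this record mixes upper and lower case and is wrapped across lines, so the sketch uppercases its input and strips whitespace first. The two test inputs are the first 21 and the last 12 nucleotides of the coding sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Case and whitespace are ignored; codons containing characters other
    than A, C, G, T are reported as 'X'. A trailing partial codon is dropped.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = translate("AATACGATAAAACTTGAGGTG")  # first 21 nt of the coding sequence
tail = translate("ATTCAGGTCTAA")           # last 12 nt, ending in a stop codon
```

Running `translate` on the full coding sequence and comparing the result with the Protein Sequence field would confirm that the two fields agree.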

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00958812;
90% Identity
iTF_00420709;
80% Identity
-