Basic Information

Gene Symbol
-
Assembly
GCA_014356515.1
Location
chr7:2090526-2107152[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 50 0.0002 0.01 16.5 1.8 1 23 181 203 181 203 0.98
2 50 1.3e-05 0.00066 20.2 1.4 1 23 208 230 208 230 0.98
3 50 2e-05 0.0011 19.6 0.6 1 23 235 257 235 257 0.97
4 50 2.3e-05 0.0012 19.4 0.5 1 23 262 284 262 284 0.98
5 50 0.00092 0.047 14.4 4.9 1 21 289 309 289 311 0.96
6 50 3.3e-06 0.00017 22.1 0.9 1 23 316 338 316 338 0.98
7 50 0.00012 0.006 17.2 3.2 1 21 343 363 343 365 0.96
8 50 7e-06 0.00036 21.0 1.3 1 20 370 389 370 391 0.93
9 50 0.066 3.4 8.5 3.3 2 23 398 419 397 419 0.96
10 50 4e-06 0.00021 21.8 0.6 2 23 425 446 424 446 0.96
11 50 0.00087 0.045 14.4 0.3 1 23 451 473 451 473 0.97
12 50 1.5e-06 7.9e-05 23.1 0.7 1 23 478 500 478 500 0.98
13 50 2.7e-05 0.0014 19.2 1.4 3 23 507 527 506 527 0.97
14 50 3.2e-05 0.0016 19.0 0.7 2 23 533 554 532 554 0.95
15 50 1.5e-05 0.00079 20.0 0.4 1 23 559 581 559 581 0.98
16 50 0.0019 0.097 13.4 0.3 1 20 586 605 586 607 0.95
17 50 0.52 27 5.7 5.5 1 23 613 635 613 635 0.97
18 50 4.7e-05 0.0024 18.4 0.3 2 23 641 662 640 662 0.94
19 50 2e-06 0.0001 22.8 0.6 1 23 667 689 667 689 0.98
20 50 1.3e-05 0.00068 20.2 1.7 2 23 695 716 695 716 0.97
21 50 3.2e-06 0.00017 22.1 1.0 1 23 721 743 721 743 0.98
22 50 4.8e-06 0.00025 21.6 0.3 1 23 748 770 748 770 0.97
23 50 0.00012 0.006 17.2 1.7 1 20 775 794 775 796 0.95
24 50 0.00073 0.038 14.7 1.4 1 23 803 825 803 825 0.98
25 50 4.6e-05 0.0024 18.4 0.5 2 23 831 852 830 852 0.95
26 50 5.3e-06 0.00027 21.4 1.6 1 23 857 879 857 879 0.98
27 50 8.2e-05 0.0043 17.7 0.0 3 23 886 906 884 906 0.96
28 50 0.0021 0.11 13.2 0.2 3 20 913 930 912 932 0.94
29 50 0.62 32 5.5 1.4 2 23 939 960 938 960 0.95
30 50 7.6e-06 0.00039 20.9 1.5 2 23 967 988 966 988 0.96
31 50 0.00032 0.017 15.8 5.5 1 23 993 1015 993 1015 0.97
32 50 0.00017 0.0088 16.7 0.7 1 23 1026 1048 1026 1048 0.96
33 50 0.002 0.11 13.3 0.4 1 23 1058 1080 1058 1080 0.94
34 50 0.00016 0.0085 16.7 0.2 2 20 1090 1108 1090 1110 0.94
35 50 0.038 2 9.3 2.0 1 22 1117 1138 1117 1140 0.90
36 50 0.00077 0.04 14.6 1.3 2 23 1146 1167 1145 1167 0.96
37 50 6.2e-06 0.00032 21.2 0.1 2 23 1173 1194 1172 1194 0.96
38 50 0.016 0.81 10.5 1.2 2 20 1200 1218 1199 1220 0.92
39 50 0.041 2.1 9.2 3.7 3 23 1228 1249 1226 1249 0.95
40 50 5.7e-05 0.003 18.2 2.7 2 23 1255 1276 1254 1276 0.95
41 50 1.2e-05 0.00063 20.3 0.4 1 23 1281 1303 1281 1303 0.95
42 50 3.9e-05 0.002 18.7 1.5 1 20 1308 1327 1308 1329 0.95
43 50 0.0023 0.12 13.1 1.9 2 23 1337 1358 1336 1358 0.96
44 50 3.5e-05 0.0018 18.8 1.1 2 23 1364 1385 1363 1385 0.96
45 50 0.016 0.83 10.5 0.4 1 23 1390 1412 1390 1412 0.95
46 50 4.9e-05 0.0025 18.4 0.9 1 23 1417 1439 1417 1439 0.97
47 50 0.00045 0.023 15.3 0.4 2 23 1446 1467 1446 1467 0.97
48 50 0.051 2.7 8.9 0.7 3 23 1566 1586 1565 1586 0.93
49 50 3.3e-05 0.0017 18.9 2.8 2 23 1591 1612 1591 1612 0.97
50 50 0.00036 0.019 15.6 2.3 1 23 1617 1639 1617 1639 0.97

Sequence Information

Coding Sequence
ATGGATTTCCATTTTGTCGATTGTGAACAGACTTTGAGGGTAGGCAAAGCAGCCGACCTGAACAGCCATTTTCAGGATCCTCTCTACATGCCAACAGTGAAGACTGAACCGGGCTTAGAACATATGGATCCAAACAATCACTACAATCATAACCCCGCATACATGGAGTATGGTCCAAAAAGAATCAAAATGGAGGATGAAGGAGAGCCTGGAGAAATGATGATCAAGACTGAGCCAGATGCCTATGATGATTATTATGGGAACGGCGAATACCATGGAATACCCTGTGAAACTCAGATCAGAGAGGTGGAGTGCCAATCATACCCGGACTACAAGCGAGACATGCAAAAGCAGCCGGACAACAGTCACAGCATGAACTCAACCGAAATCGTAGTCTCTCTGAATCCAGCCGAACACTATCCACCCGGCTTCTATCATCAGGGCGGCAGCAACATGATGAACGGAGGCTGGAACGACACGCCGCAGGAGCGCAACAAGTTCACCGCCAACCAGCCACCCCGCATGGGCGCCAGGAAACGCTACAAGTGTCGACTCTGCTTCAAAAGTTTCACCACAATCTGGTACCTCAAAGTCCACATGGACAGTCACAAGGGCAAGGAGTACTCGTGCAATATCTGCAACAAAACATTCACCTCCCAACTGTCCTACACTGTCCATCAAAACATTCATCAAGGTAAGGAATTCTCCTGCGACAAATGCAACAAGACCTTCACCACAAGAGCCGGCTGGAAAATCCATATGGACATTCACGAAGGTAAGACCTATCCATGTACTGTTTGTAATAAAACGTTGACCACCAAAAACGGTTTGAAACTACATATGAACATCCATATGGGCAAACAGTTTACCTGTCAACTATGTAACAAGAGTTTTGCTACCAAGTGGTACTTCAGAGACCATATGCGATGTCATCAAGGCATAATGTACACTTGTACTCAATGCGGTAAAGGATTCACCACGCGTATGAGTTTGAAACTGCACATGGACATTCATGAAGGCAAACAGTATACGTGTGAGGTATGTAACAAATCATTTTCAGCGAAGAGTACACTGAAAATGCATATGAACTGTCACAGCGGTGAAATGCATGCTTGCACTTACTGTAACAAAAGTTTCACCACCAAATCCTATCTGAAAGTACATCAGGAAGGCAGCGAAGAAACCAACATCACCTGTTACTTTTGTAGTGAATCGTTCACCAGGAAATGTGTGTTCAAAGCGCATCTAGAGAGACACATGGGCGAACAGCTGACTTGTAAGATATGCAATAAAAGTTTCACCACAAAGGCTGGTTTGAAACTGCATATTGACATGCATGAAGGCAAGAACTACCCGTGTGAACTCTGTGAACGCAGTTTCTCGTCGAAAGCCGGTTTAAAGTTTCACATGGACATGCACGAAGGAAAGGAGTACCCGTGTGAACTGTGCAATAAAACATTCACTACAAAATGGTACCTGACTGCTCATCTTAACAGTCATGATGGAAATCAACCGATTTGTACGTTGTGCAATAAAACGTTTACAACCACCTACAGTCTCAAACTACACATGGAAATGCATGCTGGAAAGGTGTTGAGATGTGAACAGTGTGATAAGAATTTCACCACCAAAACAGGACTGAAACTGCATATGGAAGCGCATGAAGGCAAACGTTATCCTTGTGTTGTTTGTAATAAAACGTTCACGGCTAGAAGTTCGCTGAAAATTCATTTGAACATGCACACCGGAGAATCGTTTGCTTGTGAGTTTTGTTTGAAACGGTTTTCGACAAAAGCATATTTGAAAATTCATCAGGATGCAAGCGAGCAGAAGTTTTTCTCGTGCTACTTTTGTAATGATACATTCAACAGGAAGTGTTCGTTCAGAGTGCACCTGGAAAGTCACGGTGGAGATCAGCTGACTTGCAAGATTTGCGAAAAAACGTTCACCACCAAGGTCGGCCTCAAGTTGCACATCGACGCCCATGAAGGCAAGGAGTACTCGTGCACTATTTGCAACAAGAGTTTCGCGTCCAAATGGTATCTGAACGTTCACATGGACAGTCACGACGGCAAGCAGCCGACCTGCGAGATCTGCAAACGCACATTCACCACCAACTATAGTCTCAAAATGCACATGGACAGTCACGAAGGCAAGGGCTACTACTGCGAAATCTGCGACAAAAATTTCACCACCAAAACCGGACTTAAACTGCATATGGACAGTCATGAAGGCAAACGGTATCCGTGCGAGTATTGTGAAAAGTCTTTCACAGCTAAAAGCTCTTTGAAAATTCACATCAACGGACATTCAGGCGAGCAGTATACTTGTGATTTTTGCCTGAAAAGTTTCACTACTAAATCCTATCTGAAAATACACCAAAGCAACAATCAGGAGGAGAAGCATTTCACGTGTGAAATGTGTGATGCGACTTTCACGCGTAAATGCGCGATGCGCGTGCATGCCGAGACCCACGATGGAGATCAGCTGAGTTGTAAGGTGTGCGATAAACGGTTCACAACCAAAGTCGGTCTCAAACTCCATCTCGACATTCACGAGGGAAAACAGTACACGTGCGACCTGTGCGACAAGTCGTTCTCAACCAAGTGGTACTTCAAGGCTCATCTGAGTAGTCACGACGGCAATCAGATAGCCTGCGAAGTGTGCAATGAAAATTTCACTAATAAAGCCGATTTGAAAGCGCATCTCGAAATTCACGGCGGTGATAAGAATATTTGCGAGTTTTGTAGCAGAAGTTTCTCGTCGAAAGGCTATCTGCGAATGCATCAAGACGTCAATGAAGAGAAACCCATCTCCTGCTCTATCTGTGAGGACAAGTTCACCTCTCGATGTGCTTACAAAATGCATCAAGAAGCTCATGCCGAGAAAAACGAAATTAAATGCGACATTTGTCAAAAGTCGTTCAATACTGAATCGTATCTGAAACTACACATGAAGTTTCACAAACGTAGACGTCATTCTTGTAGTGTGTGCAACAAGAGTTTCATCAAAAAACGTCATCTAGATTCGCACATGAACTCACACAACAAAGACGTTGTTGTAGAGGAGAAAACCTATCCGTGCTCCACTTGTGATAAATCGTTTTCGGACGAGAACAAATTGCGTCTACATGCTTACAGTCACCAAAAAGAAACACAAGAGAAGAAGATTTACAAGTGTGAGCTGTGTGATGAAAATTTTACTTCTCGAGAAGGTTTCAATCTGCATGTGTATGGTCACGGCGACGAGTATGTGATGGAGAAGAATACTTGTGAACATTGCGGTCGTGGATTCAATTCAAGTTCAGCTTTAAAGATTCATCTGGATGTTAAAGACGAATCGGCCACATTCACGTGCGAATTTTGTGAACAAACGTTTGAGAAGAAATGTGGTTTAAGGGCGCATGTGGTCGAAAAACACCAAGAAGAAAAGCTTACTTGTAAAGTTTGTAATAAGGTTTGTAATTCTATTTCGGCGCTCAAGCTACACATGGAGTACCATGAGAACAAACAGTTGCCTTGTGAGTTGTGTGACAAGGTGTTTCCAACTAACAACTCGTTGAAAATTCACATGGCAACGCACGATGCCGAAGAGAAGACTTGCGAGTTTTGTAATCGAACATTTACCACTAAATCGTACAAAGCAACTCATCTGGAGAACAATGAGGAGAGTAATATAGATTGTGAGTACTGTTCGGAGAAGTTTACAAGGCGGTGCGCGTACAAAGCACACGTGGAACGTTGTCATGATGACGAACAGCTGATTTGTAAACAGTGTAACAAAACGTTTGTAACAAAGCACGGTTTAAGACAGCATATGGACAATCACGAGGGTAAGCGCTACCCGTGTACAGTTTGTGACAAAACGTTCTCCGCGAAAAGTTCTTTGAGAACACATTTGGACGGACACACGGGCGAACAGCACACCTGCGATTTTTGTCAGAAAAGTTTCACAACCAAATCCTACCTGAAAGTGCATCTAGAAGGTAATGAGGAGGAAAAGGAGTTGAGTTGTGATTATTGCGATGAAACGTTTACTAAAAAATGTGTGTACCGTGTGCATATTGAAACGCACAACGGCGATGATCTGAATTGTAAACAGTGCAACAAAACGTTCACCTCGAAACCGGGCCTCAAGTTGCACATGGACATGCACAAAGGCAAGCAGTATCCGTGCGATGCATGCGACAAAGTGTTCAACTCGAAAAATGGTCTCAACTTTCACAAGGAGACTCACTCGGGCAATGAGTATAACTGCGAGATTTGCAGCAAAACATTTCCAACCAATCGCCGGCTCAAGATTCACATGGATTCGCACATCACAAAATTGTCGAATTCGTGCAATGTGTGCAATCAGCTGTTTCCGTCGAAAGCCAGCTTGAAACTGCATATGGAAACGCATGAGGGAATTGAGAACAACGATGACGATTCGAAAGACGAGGAAGCTTTTATTATGAATGAGGAGGACATCAAGGAAGAGTTGATTATTAAAGAAGAGGACATCTCAATGGATGCGTCGGAATTGATTGAAGAGGAAACTGCTGAAGATATTGATATCAAGATTGAGGAGGGAGGAGGAGGAGATTTGGAGGAGTCTACTGATGTGAAGAAGGAAAGTAACATAGAAACACCGTCGAAATCTGACAACGAAGATCATGTTGTGAGTTCTACTGATGACAATGATAATCCAAACGCCTGCGATTGTTGCGACAAAATCTTCGAATCGGCCGAAGCGTTGAAACTGCACAAGTCGACTCACACCTCGGAAAACGTGTGTCACATTTGTCAGAAGACGTTCTCGTCCAAGTTCTATTTGAAGGTGCATGTGAACAGTCACGAGAGCAACTTGTTCTCATGCCACATCTGCAAGAAAGAGTTCATCACAAAAGAATCGTTGGATGAGCACACAAAAGAGCATGCCATATAA
Protein Sequence
MDFHFVDCEQTLRVGKAADLNSHFQDPLYMPTVKTEPGLEHMDPNNHYNHNPAYMEYGPKRIKMEDEGEPGEMMIKTEPDAYDDYYGNGEYHGIPCETQIREVECQSYPDYKRDMQKQPDNSHSMNSTEIVVSLNPAEHYPPGFYHQGGSNMMNGGWNDTPQERNKFTANQPPRMGARKRYKCRLCFKSFTTIWYLKVHMDSHKGKEYSCNICNKTFTSQLSYTVHQNIHQGKEFSCDKCNKTFTTRAGWKIHMDIHEGKTYPCTVCNKTLTTKNGLKLHMNIHMGKQFTCQLCNKSFATKWYFRDHMRCHQGIMYTCTQCGKGFTTRMSLKLHMDIHEGKQYTCEVCNKSFSAKSTLKMHMNCHSGEMHACTYCNKSFTTKSYLKVHQEGSEETNITCYFCSESFTRKCVFKAHLERHMGEQLTCKICNKSFTTKAGLKLHIDMHEGKNYPCELCERSFSSKAGLKFHMDMHEGKEYPCELCNKTFTTKWYLTAHLNSHDGNQPICTLCNKTFTTTYSLKLHMEMHAGKVLRCEQCDKNFTTKTGLKLHMEAHEGKRYPCVVCNKTFTARSSLKIHLNMHTGESFACEFCLKRFSTKAYLKIHQDASEQKFFSCYFCNDTFNRKCSFRVHLESHGGDQLTCKICEKTFTTKVGLKLHIDAHEGKEYSCTICNKSFASKWYLNVHMDSHDGKQPTCEICKRTFTTNYSLKMHMDSHEGKGYYCEICDKNFTTKTGLKLHMDSHEGKRYPCEYCEKSFTAKSSLKIHINGHSGEQYTCDFCLKSFTTKSYLKIHQSNNQEEKHFTCEMCDATFTRKCAMRVHAETHDGDQLSCKVCDKRFTTKVGLKLHLDIHEGKQYTCDLCDKSFSTKWYFKAHLSSHDGNQIACEVCNENFTNKADLKAHLEIHGGDKNICEFCSRSFSSKGYLRMHQDVNEEKPISCSICEDKFTSRCAYKMHQEAHAEKNEIKCDICQKSFNTESYLKLHMKFHKRRRHSCSVCNKSFIKKRHLDSHMNSHNKDVVVEEKTYPCSTCDKSFSDENKLRLHAYSHQKETQEKKIYKCELCDENFTSREGFNLHVYGHGDEYVMEKNTCEHCGRGFNSSSALKIHLDVKDESATFTCEFCEQTFEKKCGLRAHVVEKHQEEKLTCKVCNKVCNSISALKLHMEYHENKQLPCELCDKVFPTNNSLKIHMATHDAEEKTCEFCNRTFTTKSYKATHLENNEESNIDCEYCSEKFTRRCAYKAHVERCHDDEQLICKQCNKTFVTKHGLRQHMDNHEGKRYPCTVCDKTFSAKSSLRTHLDGHTGEQHTCDFCQKSFTTKSYLKVHLEGNEEEKELSCDYCDETFTKKCVYRVHIETHNGDDLNCKQCNKTFTSKPGLKLHMDMHKGKQYPCDACDKVFNSKNGLNFHKETHSGNEYNCEICSKTFPTNRRLKIHMDSHITKLSNSCNVCNQLFPSKASLKLHMETHEGIENNDDDSKDEEAFIMNEEDIKEELIIKEEDISMDASELIEEETAEDIDIKIEEGGGGDLEESTDVKKESNIETPSKSDNEDHVVSSTDDNDNPNACDCCDKIFESAEALKLHKSTHTSENVCHICQKTFSSKFYLKVHVNSHESNLFSCHICKKEFITKESLDEHTKEHAI

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00857849;
90% Identity
iTF_00857849;
80% Identity
-