Basic Information

Gene Symbol
-
Assembly
GCA_014465815.1
Location
CM025327.1:15686719-15698305[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 47 0.00017 0.021 16.6 1.8 1 23 190 212 190 212 0.98
2 47 1.1e-05 0.0013 20.3 1.4 1 23 217 239 217 239 0.98
3 47 1.8e-05 0.0021 19.7 0.6 1 23 244 266 244 266 0.97
4 47 2.1e-05 0.0024 19.5 0.5 1 23 271 293 271 293 0.98
5 47 0.0008 0.095 14.5 4.9 1 21 298 318 298 320 0.96
6 47 2.7e-06 0.00032 22.3 0.6 1 23 325 347 325 347 0.98
7 47 0.0001 0.012 17.3 3.2 1 21 352 372 352 374 0.96
8 47 3.6e-06 0.00043 21.9 2.0 1 20 379 398 379 400 0.95
9 47 0.058 6.9 8.6 3.3 2 23 407 428 406 428 0.96
10 47 3.5e-06 0.00042 21.9 0.6 2 23 434 455 433 455 0.96
11 47 0.00076 0.09 14.6 0.3 1 23 460 482 460 482 0.97
12 47 8.1e-06 0.00096 20.8 2.0 1 23 487 509 487 509 0.98
13 47 5.8e-06 0.00068 21.2 1.3 3 23 516 536 515 536 0.97
14 47 2.8e-05 0.0033 19.1 0.7 2 23 542 563 541 563 0.95
15 47 1.3e-05 0.0016 20.1 0.4 1 23 568 590 568 590 0.98
16 47 0.0083 0.99 11.3 0.3 1 20 595 614 595 616 0.94
17 47 0.012 1.4 10.8 5.4 1 23 622 644 622 644 0.97
18 47 5.1e-05 0.006 18.3 0.2 2 23 650 671 649 671 0.94
19 47 1.7e-06 0.00021 22.9 0.6 1 23 676 698 676 698 0.98
20 47 1.2e-05 0.0014 20.3 1.7 2 23 704 725 704 725 0.97
21 47 2.8e-06 0.00033 22.2 1.0 1 23 730 752 730 752 0.98
22 47 1.9e-05 0.0023 19.6 0.3 1 23 757 779 757 779 0.97
23 47 0.0001 0.012 17.3 1.7 1 20 784 803 784 805 0.95
24 47 0.0024 0.29 13.0 2.0 1 23 812 834 812 834 0.97
25 47 4.1e-05 0.0048 18.6 0.5 2 23 840 861 839 861 0.95
26 47 4.6e-06 0.00055 21.6 1.6 1 23 866 888 866 888 0.98
27 47 7.2e-05 0.0086 17.8 0.0 3 23 895 915 893 915 0.96
28 47 0.0007 0.084 14.7 0.2 3 20 922 939 921 941 0.94
29 47 3.4 4.1e+02 3.1 3.6 3 23 949 969 947 969 0.94
30 47 6.7e-06 0.00079 21.0 1.5 2 23 976 997 975 997 0.96
31 47 6.6e-05 0.0078 17.9 5.7 1 23 1002 1024 1002 1024 0.98
32 47 0.02 2.3 10.1 3.5 1 23 1034 1056 1034 1057 0.93
33 47 0.0018 0.21 13.4 0.4 1 23 1063 1085 1063 1085 0.94
34 47 0.00082 0.098 14.5 0.7 2 20 1095 1113 1095 1114 0.95
35 47 0.0015 0.18 13.6 2.2 1 23 1122 1145 1122 1145 0.91
36 47 6.2e-05 0.0074 18.0 1.2 2 23 1151 1172 1150 1172 0.96
37 47 6e-06 0.00072 21.2 0.2 2 23 1178 1199 1177 1199 0.96
38 47 0.014 1.6 10.6 1.2 2 20 1205 1223 1204 1225 0.92
39 47 0.059 7.1 8.6 3.8 3 23 1233 1254 1231 1254 0.94
40 47 5e-05 0.006 18.3 2.7 2 23 1260 1281 1259 1281 0.95
41 47 0.00014 0.017 16.8 0.1 1 23 1286 1308 1286 1308 0.95
42 47 3.4e-05 0.0041 18.8 1.5 1 20 1313 1332 1313 1334 0.95
43 47 0.047 5.5 8.9 1.8 2 23 1342 1363 1341 1363 0.95
44 47 0.00097 0.12 14.2 1.4 2 23 1369 1390 1368 1390 0.95
45 47 0.0016 0.19 13.5 0.7 1 23 1395 1417 1395 1417 0.95
46 47 1.6e-05 0.0019 19.9 0.9 1 23 1422 1444 1422 1444 0.98
47 47 5.5e-05 0.0065 18.2 0.3 2 23 1451 1472 1451 1472 0.97

Sequence Information

Coding Sequence
ATGGATTTCCATTTTGTCGATTGTGAACAGACTTTGAGAGTGGGCAAAGCAGCCGACCTCAACAGCCATTACCAGGATCCATTGTATACCATTCCAACAGTTAAGACTGAGCCGGGCTTAGAACATATGGATCCAAACAACCACTACAACCATAACCCCGCTTACATGGATTACGGCCCAAAAAGAATTAAAATAGAGGATGAAGGAGAGGCTGAAGAAATGATAATCAAGACTGAGCCAGATGCCTATGATGATGATTATGGGAACGGCGAATACCATGGAATTCCCTGTGAAACTCAAATCAGAGAGGTGGAATGCCAGTCATTCCCAGACTATAAGCGAGACATGCAAAAGCAGCCGGACAACAGTCACAACATGAACTCCACCGAAATAGTAGTATCTCTGAATCCAGCTGAACACTACCCATCAGGGCTCTATCATCAGGGCCAGGGCGGCAGTAAGCACGAAGCCAATCCGTTCATGAATGGTGGCTGGAATGATGGCTTTCATGAGCGGAGCAAATTCACTGCAAACCAGCCCCGCCTTGGCAACGCTACCAGGAAACGCTACAAATGTCGACTTTGCTTCAAAAGCTTCACTACAATCTGGTACCTCAAAGTGCACATGGACAGTCACAAAGGCAAGGAGTACTCGTGCAACATTTGCAACAAAACCTTCACTTCCCAACTGTCCTACACAGTGCATCAGAACATTCATCAAGGTAAGGAGTTCTCCTGCGACAAATGCAACAAAACCTTCACCACTCGAGCCGGCTGGAAAATTCATATGGATATTCACGAAGGTAAGACCTATCCATGTACTGTCTGTAATAAAACCTTGACTACTAAGAATGGCTTGAAACTACATATGAATATCCATTTGGGCAAACAGTTCACCTGTCAACTGTGTAACAAGAGTTTTGCCACCAAGTGGTATTTCAGAGATCACATGCGGTGTCATCAAGGTATAATGTACACCTGTAATCTATGTGGCAAAGGGTTCACGACACGTATGAGTTTGAAATTGCACATGGACATTCACGAAGGCAAACAATACACATGTGAAGTGTGCAACAAGTCGTTCTCAGCAAAGAGCACCCTGAAAATGCATATGAACTGTCACAGCGGCGAAATGCATACCTGCACTTACTGCAACAAGAGCTTCACTACCAAATCCTACCTGAAAGTGCATCAAGAAGGCAGCGAAGAGACAAACATCACTTGCTACTTTTGTAGTGAGTCGTTCACAAGGAAATGCGTGTTCAAAGCCCATCTCGAGAGACATATGGGTGAGCAGTTGACTTGCAAAATATGCAATAAAAGCTTCACTACAAAAGCTGGTCTAAAATTGCATATTGACATGCATGAGGGCAAGAACTACCCCTGCGAGCTTTGTGAACGCAGTTTCTCGTCAAAGGCCGGTTTAAAGTTCCATATGGACATGCATGAAGGCAAAGAGTACCCATGTGAGCTATGCAACAAAACCTTCACCACCAAGTGGTACCTGACTGCTCACCTTCACAGTCATGAGGGCAATCAGCCAATCTGCACCTTGTGCAATAAAACATTCACCACCAACTACAGCCTGAAACTTCACATGGAAATGCATGCAGGAAAGGTGTTGAGGTGTGAACAGTGTGACAAGAATTTCACCACTAAGACAGGCTTGAAATTGCATATGGAAGCACATGAAGGTAAACGATATCCGTGTGTCGTTTGCAATAAAACGTTCACGGCTAGAAGTTCTCTGAAAATTCATTTGAACATGCATACAGGTGAGTCGTTTGCATGTGAATTTTGTATGAAACGTTTCTCCACAAAAGCCTATTTGAAAATCCACCAGGATGCCAGTGAGCAGAAGTTTTTCTCGTGCTACTTTTGCAACGAGACATTCAACAGAAAGTGTTCGTTCCGAGTGCACCTGGAGAGCCACGGTGGCGATCAGCTGACCTGCAAGATTTGCGAAAAGGTGTTCACCACCAAGGTCGGTCTGAAACTGCACATCGATGCTCACGAAGGCAAGGAGTACTCGTGCACCATTTGCAACAAGAGTTTCGCCTCGAAATGGTACCTGAATGTGCACATGGATAGTCACGACGGAAAGCAGCCGACGTGCGAGATCTGCAAGCGCACATTCACCACCAACTACAGTCTCAAAATGCACATGGACAGTCACGAGGGCAAAGGCTACTACTGTGAAATCTGCGACAAGAATTTCACCACCAAAACCGGGCTCAAGCTGCACATGGACAGTCATGAAGGCAAGCGCTACCCTTGTGAGTTCTGTGAAAAGTCGTTCACTGCAAAAAGCTCTTTGAAAATTCACATCAATGGCCATTCAGGCGAACAGTACACTTGTGATTTTTGTCTGAAGAGTTTCACAACCAAATCCTACCTGAAAATACATCAGAGCAACAATCAGGAGGAGAAGCATTTCACGTGTGAACTGTGCGATGCGACATTCACGCGTAAATGCGCGATGCGTGTGCATTCCGAAACGCACGATGGCGATCAGCTGAGTTGCAAGGTGTGTGACAAACGCTTCACCACCAAAGTCGGTCTCAAACTGCATCTCGACATACACGAGGGCAAACAGTACACCTGCGACCTTTGCGACAAATCCTTCTCAACCAAGTGGTACTTCAAGGCCCACTTAAGTAGTCATGATGGCAACCAAATAGCCTGCGAAGTTTGCAATGAGAATTTCACCAATAAAGCCGATTTGAAAGCTCATTTAGAAATTCACGGCGGCGATAAGAATATTTGCGAATTTTGTAATCGAAGTTTCTCATCGAAAGGCTACTTGCGAATGCACCAAGATGTCAATGAAGAGAAGCCCATCTGCTGTTCCATTTGCGAGGACAAGTTCACTTCTCGGTGTGCTTACAAAATGCATCAAGAAGGTCATGCAGAGAAGAACGAGATTAAATGTGACATCTGCCAAAAATCGTTCAATACTGAATCGTATCTGAAGTTGCACATGAAGTTCCACAAGCGTAGGCGGCACACTTGCAGTGTGTGCCAGAAGAGTTTCATCAAGAAGCGTCATCTAGAGTCACACATGAACTCTCACAATAAGGTTGTTGAAGAAGAGAAAACCCATCCTTGCACTTCTTGTGATAAATCATTTGCAGATGAGAACAAATTGCGATTACATTCTTACAGTCACCATGATGAGAAGAAGATATACAAGTGCGAATTGTGTGATGAAAACTTTACTTCTCGCGAAGGCTTCAATCTGCATGTCTATGGACACGGTGATGATTATGTGTTTGAAAAGAATACCTGCGAGCATTGTAGTCGTAGTTTTACATCAACTTCAGCTCTGAAAATTCACCTGGACGCCAGTGACGATTCATCAACATTCACGTGCGAATTTTGTGATAAGGTATTTGAGAAGAAGTGTGCCTACAGAGCCCACATGGTCGACAACCATCAAGAAGAGAAGCTGACGTGCAAAATTTGTAACAAAGTTTGCAACTCAGCTTCGGCGCTCAAATTGCACATGGAGTATCACGAGAACAAAGAGCTGCCCTGTGAGTTGTGTGACAAGGTATTTCCAACTAACAATTCGTTGAAAATGCACATGTCAACGCACGACGCAGAAGAGAAGACGTGCGAGTTTTGCAATCGAACATTCACCACCAAGTCCTACAAAGCAACCCATCTGGAGAACAACGAAGAGAGCAATATTGATTGCGAGTACTGTTCGGAAAAGTTCACAAGGCGATGCGCTTACAAAGTGCATGTGGAAAGATGTCATGATGACGAGCAGCTCATTTGTAAGCAATGTAATAAAACTTTTGTAACTAAACACGGCCTGAGGCAGCATATGGACAACCACGAGGGTAAGCGATACCCGTGTGCTCTCTGCGATAAGACGTTCTCGGCGAAAAGCTCCCTGAGGATGCATGTGGAAGGGCACACGGGCGAACAGCATACCTGCGATTTCTGCCAGAAAAGCTTCACCACAAAGTCGTACCTGAAAGTGCATCTGGAGGGGAATGAAGAGGAGAAGGAGCTGAGCTGCGACTATTGCGAGGAAACGTTCACAAAGAAGTGCGTGTATCGAGTGCACGTGGAGTCGCACAACGGAGACGACTTGAACTGCAAGCAGTGCAGCAAAACGTTCACTTCGAAGCTCGGTCTCAAACTGCACCTGGACATGCACAAAGGCAAACAGTATCCGTGCGACATCTGCAACAAGGTGTTCAACTCGAAGAACGGACTCAACTTCCACAAGGACACTCATTCGGGCAAGCAGTATGTCTGCGAGATTTGCAGCAAAACATTCCCCACCAATCGGCGTCTCAAGCAACACATGGATTCGCACATCACCAAACTGTCCAACTCGTGCGACGTATGCAATCAGATGTTTCCATCCAAGGCTAGTTTGAAGCTGCACATGGAAACTCATGAGGGAATTGAGAACAATTGTGACGAATCCAAGGACGAGGACATTGAAGAAGAAAGCATAAAAGAAGAGGATATTGAAGAAGAGTGCATAAAAGATGAAGATATTGAAGAGTAG
Protein Sequence
MDFHFVDCEQTLRVGKAADLNSHYQDPLYTIPTVKTEPGLEHMDPNNHYNHNPAYMDYGPKRIKIEDEGEAEEMIIKTEPDAYDDDYGNGEYHGIPCETQIREVECQSFPDYKRDMQKQPDNSHNMNSTEIVVSLNPAEHYPSGLYHQGQGGSKHEANPFMNGGWNDGFHERSKFTANQPRLGNATRKRYKCRLCFKSFTTIWYLKVHMDSHKGKEYSCNICNKTFTSQLSYTVHQNIHQGKEFSCDKCNKTFTTRAGWKIHMDIHEGKTYPCTVCNKTLTTKNGLKLHMNIHLGKQFTCQLCNKSFATKWYFRDHMRCHQGIMYTCNLCGKGFTTRMSLKLHMDIHEGKQYTCEVCNKSFSAKSTLKMHMNCHSGEMHTCTYCNKSFTTKSYLKVHQEGSEETNITCYFCSESFTRKCVFKAHLERHMGEQLTCKICNKSFTTKAGLKLHIDMHEGKNYPCELCERSFSSKAGLKFHMDMHEGKEYPCELCNKTFTTKWYLTAHLHSHEGNQPICTLCNKTFTTNYSLKLHMEMHAGKVLRCEQCDKNFTTKTGLKLHMEAHEGKRYPCVVCNKTFTARSSLKIHLNMHTGESFACEFCMKRFSTKAYLKIHQDASEQKFFSCYFCNETFNRKCSFRVHLESHGGDQLTCKICEKVFTTKVGLKLHIDAHEGKEYSCTICNKSFASKWYLNVHMDSHDGKQPTCEICKRTFTTNYSLKMHMDSHEGKGYYCEICDKNFTTKTGLKLHMDSHEGKRYPCEFCEKSFTAKSSLKIHINGHSGEQYTCDFCLKSFTTKSYLKIHQSNNQEEKHFTCELCDATFTRKCAMRVHSETHDGDQLSCKVCDKRFTTKVGLKLHLDIHEGKQYTCDLCDKSFSTKWYFKAHLSSHDGNQIACEVCNENFTNKADLKAHLEIHGGDKNICEFCNRSFSSKGYLRMHQDVNEEKPICCSICEDKFTSRCAYKMHQEGHAEKNEIKCDICQKSFNTESYLKLHMKFHKRRRHTCSVCQKSFIKKRHLESHMNSHNKVVEEEKTHPCTSCDKSFADENKLRLHSYSHHDEKKIYKCELCDENFTSREGFNLHVYGHGDDYVFEKNTCEHCSRSFTSTSALKIHLDASDDSSTFTCEFCDKVFEKKCAYRAHMVDNHQEEKLTCKICNKVCNSASALKLHMEYHENKELPCELCDKVFPTNNSLKMHMSTHDAEEKTCEFCNRTFTTKSYKATHLENNEESNIDCEYCSEKFTRRCAYKVHVERCHDDEQLICKQCNKTFVTKHGLRQHMDNHEGKRYPCALCDKTFSAKSSLRMHVEGHTGEQHTCDFCQKSFTTKSYLKVHLEGNEEEKELSCDYCEETFTKKCVYRVHVESHNGDDLNCKQCSKTFTSKLGLKLHLDMHKGKQYPCDICNKVFNSKNGLNFHKDTHSGKQYVCEICSKTFPTNRRLKQHMDSHITKLSNSCDVCNQMFPSKASLKLHMETHEGIENNCDESKDEDIEEESIKEEDIEEECIKDEDIEE

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00365978; iTF_01353947;
90% Identity
iTF_01353947; iTF_00365978;
80% Identity
-