Basic Information

Gene Symbol
-
Assembly
GCA_030068095.1
Location
CM057955.1:9296365-9304906[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the following pattern: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers in brackets give the number of residues. The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this definition excludes other sites, among them the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
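The pattern in the description can be written as a regular expression. The following is a minimal sketch, not the method used to produce this record (the hits listed under Hmmscan Out come from a profile HMM, which is far more sensitive than a fixed pattern). It assumes that both # and X accept any residue; the names C2H2_REGEX and find_c2h2_candidates are hypothetical, and the test fragment is taken from the start of the protein sequence of this entry.

```python
import re

# Pattern from the description: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# Assumption: '#' (fold-stabilising position) and 'X' both match any residue.
C2H2_REGEX = re.compile(
    r"."        # '#'
    r"."        # X
    r"C"        # first zinc-coordinating Cys
    r".{1,5}"   # X(1-5)
    r"C"        # second zinc-coordinating Cys
    r".{3}"     # X3
    r"."        # '#'
    r".{5}"     # X5
    r"."        # '#'
    r".{2}"     # X2
    r"H"        # first zinc-coordinating His
    r".{3,6}"   # X(3-6)
    r"[HC]"     # final position: His or Cys
)


def find_c2h2_candidates(protein):
    """Return (start, end, matched_text) with 1-based inclusive coordinates.

    finditer reports non-overlapping matches only, so closely packed
    fingers may be under-counted by this sketch.
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_REGEX.finditer(protein.upper())
    ]


# Fragment copied from the N-terminal region of the protein sequence.
fragment = "PVECDVCNETFKNNVAFALHSIRHSTD"
hits = find_c2h2_candidates(fragment)
print(hits)
```

A match only marks a candidate finger; it says nothing about statistical significance.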
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 50 0.0078 2.1 11.2 0.1 2 23 60 81 59 81 0.95
2 50 0.3 78 6.2 3.2 1 23 87 109 87 109 0.97
3 50 0.035 9.1 9.1 1.6 1 23 115 137 115 137 0.89
4 50 0.00022 0.058 16.1 3.4 1 23 143 166 143 166 0.98
5 50 0.00022 0.057 16.1 3.1 1 23 178 201 178 201 0.94
6 50 0.0074 1.9 11.3 1.2 3 23 212 232 211 232 0.98
7 50 1e-05 0.0027 20.3 0.1 1 23 238 260 238 260 0.97
8 50 2.8e-07 7.5e-05 25.2 1.1 1 23 266 288 266 288 0.98
9 50 6.3e-05 0.017 17.8 0.1 1 23 294 317 294 317 0.97
10 50 0.0047 1.2 11.9 0.0 1 23 330 352 330 352 0.98
11 50 5.9 1.6e+03 2.1 0.1 2 22 461 481 460 482 0.77
12 50 0.74 1.9e+02 5.0 5.3 1 23 488 510 488 510 0.98
13 50 0.037 9.7 9.1 1.9 1 23 516 538 516 538 0.89
14 50 0.0011 0.29 13.8 1.8 1 23 544 567 544 567 0.96
15 50 0.0018 0.48 13.2 1.4 1 23 579 602 579 602 0.93
16 50 0.082 22 8.0 0.7 3 23 613 633 612 633 0.97
17 50 4.7e-05 0.012 18.2 1.3 1 23 639 661 639 661 0.97
18 50 0.00034 0.088 15.5 0.9 1 23 667 689 667 689 0.98
19 50 0.88 2.3e+02 4.7 0.0 6 21 700 715 700 716 0.96
20 50 0.097 25 7.7 1.0 1 21 825 845 825 847 0.85
21 50 0.0023 0.59 12.9 1.9 1 23 853 876 853 876 0.96
22 50 0.0036 0.94 12.2 0.6 1 23 888 911 888 911 0.94
23 50 0.0017 0.45 13.2 0.5 3 23 922 942 921 943 0.95
24 50 0.12 31 7.5 0.5 3 23 954 974 953 974 0.97
25 50 2.5e-06 0.00067 22.2 0.4 1 23 980 1002 980 1002 0.97
26 50 3.7e-05 0.0097 18.5 1.0 1 23 1008 1030 1008 1030 0.98
27 50 0.0066 1.7 11.4 0.2 1 21 1036 1056 1036 1057 0.94
28 50 0.15 38 7.2 0.3 2 23 1111 1132 1110 1132 0.92
29 50 0.056 15 8.5 0.9 1 22 1166 1187 1166 1188 0.86
30 50 0.0011 0.29 13.8 1.8 1 23 1194 1217 1194 1217 0.96
31 50 0.0031 0.81 12.5 0.8 1 23 1229 1252 1229 1252 0.94
32 50 0.0011 0.28 13.9 0.5 3 23 1263 1283 1262 1284 0.95
33 50 0.00065 0.17 14.6 0.3 3 23 1295 1315 1294 1315 0.98
34 50 0.00032 0.085 15.5 1.0 1 23 1321 1343 1321 1343 0.97
35 50 0.0003 0.078 15.7 1.2 1 23 1349 1371 1349 1371 0.98
36 50 0.0066 1.7 11.4 0.2 1 21 1377 1397 1377 1398 0.94
37 50 0.056 15 8.5 0.9 1 22 1486 1507 1486 1508 0.86
38 50 0.0011 0.29 13.8 1.8 1 23 1514 1537 1514 1537 0.96
39 50 0.0019 0.49 13.1 0.8 1 23 1549 1572 1549 1572 0.94
40 50 0.00058 0.15 14.7 1.5 3 23 1583 1603 1582 1603 0.99
41 50 0.15 38 7.2 0.3 2 23 1654 1675 1653 1675 0.92
42 50 3 7.8e+02 3.1 3.7 1 23 1681 1703 1681 1703 0.97
43 50 0.038 10 9.0 0.7 1 23 1709 1731 1709 1731 0.87
44 50 0.0011 0.29 13.8 1.8 1 23 1737 1760 1737 1760 0.96
45 50 0.0043 1.1 12.0 0.9 1 23 1772 1795 1772 1795 0.94
46 50 0.0017 0.45 13.2 0.5 3 23 1806 1826 1805 1827 0.95
47 50 0.00087 0.23 14.2 0.3 3 23 1838 1858 1837 1858 0.98
48 50 0.00032 0.085 15.5 1.0 1 23 1864 1886 1864 1886 0.97
49 50 3.2e-05 0.0084 18.7 2.1 1 23 1892 1914 1892 1914 0.98
50 50 0.00078 0.21 14.3 0.3 1 21 1920 1940 1920 1941 0.95
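The domain rows above can be loaded into typed records for filtering. This is a minimal sketch under stated assumptions: the field names in DomainHit are hypothetical labels for the 13 whitespace-separated columns, the four sample rows are copied from the table, and the 0.01 independent E-value cutoff is an arbitrary illustration, not a threshold used by this database.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int        # '#': domain number
    total: int        # 'of': number of domains reported
    c_evalue: float   # conditional E-value
    i_evalue: float   # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line):
    """Parse one whitespace-separated domain row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 columns, got {len(f)}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# Sample rows copied from the table above.
rows = [
    "1 50 0.0078 2.1 11.2 0.1 2 23 60 81 59 81 0.95",
    "7 50 1e-05 0.0027 20.3 0.1 1 23 238 260 238 260 0.97",
    "8 50 2.8e-07 7.5e-05 25.2 1.1 1 23 266 288 266 288 0.98",
    "11 50 5.9 1.6e+03 2.1 0.1 2 22 461 481 460 482 0.77",
]
hits = [parse_row(r) for r in rows]

# Hypothetical cutoff: keep domains with an independent E-value <= 0.01.
strong = [h for h in hits if h.i_evalue <= 0.01]
print([h.index for h in strong])
```

The aligned length of a hit is ali_to - ali_from + 1, for example 22 residues for domain 1.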

Sequence Information

Coding Sequence
ATGGCAAAAGATTCAGGTGCACAGCTTATACcacttttaagaaaaagcaaacGGAATAAGATAAAACTGGAGGTCAACGACGAAAACGAACTGATCAACATCCCTAAAAATAAACCCAAAAGGGTTAAAAAACAGCGCATCAAAAAACCTCCTCCCATAGACCTGGACGATCCAGTGGAGTGCGATGTGTGCAACGAAACCTTTAAGAACAACGTGGCCTTCGCCCTCCACTCGATCCGCCATAGTACCGACCAAAAGTACAACTGTCACCTGTGCGAGTACAAGGGCACCTCCAAGTACCACATCGAGATGCACGTGAGGGCCCACGAGGGCACCACCAAATTCAAATGCGAGATCTGCCATAAGGCGTTCATCGTCAGCACCAACGCCTTGGAGCACAAGTACTTCCACACGGGGGAGAAGCCGTTCCAATGCGAGATATGCGGCAAACACTTTATGTACTCCCGGTTCCTCGCCTCTCATAGGAGAACCCAACATTGGGAGATAATGACCGGCACCCCGTTGGTAAAATACGATTGCACGATATGCAACAAACATTACACCTCCTCGAGCGGCCTGAAGAGGCACAATTTACGAAACCATAACACCGAAGGGATCGACACGAGCGTCTTATGCGAGAACTGCGGCAAACGCATATCGACGAAGGAAAAGTTAAAGTTTCACATGCGCATCCACACCGGGTACAAACCGTTCGCGTGCGAGGTGTGCGGCAAAGCGTTTACGGAAAAAGAGCGACTCAAGGAGCACATCCGAGTACACACCAAAGAGAAACCGTTCGTGTGCACGTACTGTGGCAAAAGTTTCACCCAACGTTCCCCATTGAAGATCCACGAGAGAACCCACACGGGGGAGCAGCCCTATCCGTGTCGTTTGTGTGGCAAAGGGTTCGTGTCCAAAAGTGCCATGGACGCACACATGAAGTCTGGACATTTTTCCACCGGGCGCACCTCCAGACACCATCATCACTTCACCTGCAAAATATGCGGGAGCATCTTCGGCTCTGGGGAGTCTTTGGAGGCGCATCTGGTGACTCACACTCTGTCACAGCCCCCCAGAGGGGGGAAGGAGGAGGAAGGGAAACATTTGCAAGAGCTGTTGGAGAGGCGCCAGTTCACCGCCTCCTTAAACGAGCTGCTCATGGCGAGGAGCAGGGAGAGAGTGAGTTTTGAGGCTCACGAGTTGATTCAGAAGCGTGAGGAGGAGGTGAACGTCGGGGAGCCTCACATCGAAgtaaaagtagaaataaaagaGGAAGATCTTGCAGATAATTTCAACCCCCATGAATACATTGCTAGAAAAAGCACGAAAAAGAAACCCAAGGAGGAGCATCCTCCACCACTGGAACTGAAGGAGCCGATCGAGTGCGAGCTTTGCGATAACGCCTTCAAAAACAACATCATCTTTGCCTTGCACAGCCTTCGTCATAGCCAAGATGGCAAATACTCCTGCCACTTGTGCTCTTATAGGAGTTGCTCCCAGTATAGCATTAAAATGCACGTAAGGGCCCACGAGGGTACCACCAGGCACAAGTGCGAAGTGTGCCAGAAGGCTTTTACCAACAGCGTATACGCCTTGGAGCACAGGTATTTCCATACAGGAGAGACTCCTTTTCACTGCGAGATCTGCGGAAAACAGTTCATGTATTCCAGATTTTTGGCTGCGCATAGACGCAACATTCACTACGAACTGTTGACCGGGCAGCCGTTGGTGAAGCACGATTGCAACATTTGCGGCATTCATTATGCTTCTGCTTCCGGTCTTCGCCGACATAATCTGAAGAACCATAGCGAACAACCAGTAGATTTATCGGTACTGTGTGATATTTGCGGCAAAAGGCTGACCAATAAGGAGAGGCTCGTTTTTCACAGACGTATCCATACTGGTCATAAGCCCTTTGCTTGTCATGTTTGTCCGAAAAAATTCACCACGAGGGACCGACTCCTTGAGCATGTGAGGGTGCATACTGGAGAGAGGCCTTT
TATTTGTAAGTTCTGTCAAAGAGGGTTTACCCAGAGGGCTCCTCTAAAAATTCATGAGAGGACACATACCGGGGAGAGGCCTTATGTTTCCAAGTGTTGCGGTAAGGGGTTCATTTCTAAGGGTCTGATGAATGGCCATATGAAGAATTGTAAAGTTAAAAAGTCTTCTGAAGGGGaaataaaagtagaaataaaagaGGAAGACCTCCTAGAGAGTTACAATCCCCATGACTTCATGGTTTTACAAAATGCGAAGAAGAAGCCTAAGGTGGACCATCCCCCACCACTGGAACTGAAGGTGCCAATCAAGTGCGAATTCTGCGACAACCTCTTCAAAAACAACATCGTATTTGCTCTACACAGCCTTCTGCATAGCGAAGATGGCAAATACTCGTGCCACTTGTGCCCTTACAGGAACGCCTCCAAATATATCATTAAAATGCACGTGAGGGCCCACGAGGGTACCACCAGGTACAAGTGCGAAGTGTGCCAGAAGGCTTTCGTCAAAAGTGTCTACGCGTTGGAGCACAAGTACTTCCACACCGGAGAGACTCCTTTCCACTGCGAGATCTGTGGAAAACAGTTCATGTATTCCAGATTCTTGGCCGCACATAAACGCAACATTCACTACGAACTGTTGACCGGTCAACCGCCGGTGAAGTACGATTGCAACATTTGCGGCATTAATTATGCTTCTGCCTGTGCTCTTCGCCGACACAACCTGAAGAACCATAGCGAACAACCGGTGGATTTGTCGGTACTGTGTGACATTTGCGGTATTCCTTATGCTTCTGTCTACAATCTTCGCCGACATAATCAAAAACACCATAGTGACCAACGGGTGGACATGTCGGTACTCTGTGACATTTGCGGCAAGAGGTTGACCAGTAAGGCGAGGCTCGTTTTTCACAGACGTATCCATACTGGTCAGAAACCGTTCGCTTGTGATATATGCCCTAAAAAATTCACGACAAGGAACCAGCTTCAGGAACATGTGAGAGTCCATACCGGCGAGAGGCCCTTTATTTGTAAATTCTGTGGAAGAGGATTCACTCAGAAGACCCCCCTAAAAGTTCACGAGAGGACACATACTGGAGAGAGGCCTTATGTGTGCAAATATTGCAGTAAAGGGTTCATTACTAAAGGCTTAATGGATGGTCATATGAAGAATTGTAAAGTTAAAAAGTCTTCTGAAGGGGaaataaaagtagaaataaaagaGGAAGACCTCTCAGAGAGTTTCAACCCGCATGACTTCATGGTTATAAAAAACTCTAAGAAAAAACCTAAGGAGGACCATCCTCCACCACTGGAATTAAAAGAGCCAATCAAGTGCGAATTCTGCGACAACGTCTTCAAAAACAACATCCTATTTGCTCTACACAGCCTCCTGCATAGCGAAGATGGCAAATACTCGTGCCACTTGTGCCCTTACAGGAATGCCTCCCAATATATCATTAAAATGCACGTAAGGGCCCATGAGGGTACTACCAAGTACAAGTGCCAAGTGTGCCAAAAGGCTTTTATCAACAGTGTCTACGCGTTGGAGCACAAGTATTTCCACACCGGAGAGACTCCTTTCCATTGCGAGATCTGTGGAAAACAGTTCATGTATTCCAGATTCTTGGCTGCACATAGACGCAACATTCACTACGAACTGTTGACCGGGCAACCGTCGGTAAAGTACGATTGCAACATTTGCGGCGTTCATTATGCTTCTGCCTCCGGTTTTCGCAGACATAACCTGAAAAATCATAGCGATCAACCGGTGGATTTGTCGGTGCTGTGTGACATTTGCGGTACTCCTTATGCTACTGCCTACGGTCTTCGTCGACACAACCAAAAACACCATAGCGACCAACGGATGGATTTGTCGGTACTGTGTGATATTTGCGGCAAGAGGCTTACCAATAAGGAGAGACTTGCCGTTCACAGACGCATCCATACGAATTATAAACCGTTTGCTTGTCATATATGTCCCAAAAAGTTCACGCTGAGGG
ACCGTCTCCTGGAACATGTAAGGGTCCATACGGGGGAGAGGCCCTTTATTTGCAAGTTCTGTGAAAGAGGATTCACCCAGAAGACCCCCCTGAAAGTTCACGAGAGAACACATACTGGAGAGAGACCTTACGTGTGCAAATATTGCAGTAAAGGGTTTATTACTAAAGGCCTGATGGATGGCCATATGAAGAATTGTAAAGTTAAAAAGTCTTCTGAAGaggaaataaaattggaaataaaagAGGAAGACCTCACAGAAAGTTTCAACCCGCATGACTTCATGGCTATAGAAAACTCGAAGAAAAAACCTAAGGAGGACCATCCTCCACCACTGGAATTAAAAGAGCGAATCAAGTGCGAAGATGGCAAATACTCGTGCCACTTGTGCCCTTACAGGCATGCCTCCCAATATATCATTAAAATGCACGTAAGGGCCCATGAGGGTACCACCAAGTACAAGTGCCAAGTGTGCCAAAAGGCTTTTATCAACAGTGTCTACGCGTTGGAGCACAAGTATTTCCACACCGGAGAGACTCCTTTCCATTGCGAGATCTGTGGAAAACAGTTCATGTATTCCAGATTCTTGGCTGCACATAGACGCAACATTCACTACGAACTGTTAATCGGGCAATCGTCGGTGAAGTACGATTGCAACATTTGCGGCATTCATTATGCTTCTGCCTCCGGTTTTCGCAGACATAACCTGAAAAATCATAGCGATCAACCGGTGGATTTGTCGGTACTGTGTCATATTTGCGGCAAGAGGCTTACCAATAAGGAGAGACTTGCCGTTCACAGACGTATCCATACGAATTATAAACCAGgggaaataaaattagaaataaaagagGAAGACCTCACAGAAAGTTTCAACCCGCATGACTTCGTGGCTATAGAAAACTCGAAGAAAAAACCTAAGGAGGACCATCCTCCACCACTGGAATTAAAAGAGCCAATCAAGTGCGAATTCTGCGACAACGTCTTTAAAAACAACATCCTATTTGCTCTACACAGCCTCCTCCATAGCGAAGATGGCAAATACTCGTGCCACTTGTGCTCTTATAAAAACGTCTCCAAGTACCACATAGAAATGCACATAAGGGCCCATGAGGGTACTACCAAGTACAAGTGCCAAGTGTGCCAAAAGGCTTTTATCAACAGTGTCTCCGCGTTGGAGCACAAGTATTTCCACACCGGAGAGACTCCTTTCCACTGTGAGATCTGTGGAAAACAGTTCATGTATTCCAGATTTTTGGCTGCACATAGACGCAACATTCACTACGAACTGTTGACCGGGCAACCGTCGGTGAAGTACGATTGCAATATTTGCGGCGTTCATTATGCTTCTGCCTCCGGTTTTCGCAAACATAACCTGAAAAATCATAGCGACCAACCGGTGAATTTGTCGGTACTGTGCGACATTTGCGGTATTCCTTATGCTTCTGTCTACAATCTTCGCCGACACAACCAAAAACACCATAGCGACGAACGGGTGGATTTGTCGGTACTGTGTGATATTTGCGGCAAGAGGCTTACCAGTAAGGAGAGACTTGCCGTTCACAGACGTATCCATACGGGTTATAAACCGTTTGCTTGTCATATATGTCCCAAAAAGTTCACGCTGAGGGACCGTCTCCTGGAACATGTGAGGGTCCATACGGGGGAGAGGCCCTTTATTTGCAAGTTCTGTGGAAGAGGATTCTCCCGGAAGACCACCCTGACACTTCACGAGAGGAGACATACTGGAGACAGGCCTTACGTGTGCAAATATTGCAGTAAAGGGTTCGTTACCAAGAGCCTGATGGATGTTCATATAAAGAATTGTAAAGTTAAAAAGTCTGCTGAGTGA
Protein Sequence
MAKDSGAQLIPLLRKSKRNKIKLEVNDENELINIPKNKPKRVKKQRIKKPPPIDLDDPVECDVCNETFKNNVAFALHSIRHSTDQKYNCHLCEYKGTSKYHIEMHVRAHEGTTKFKCEICHKAFIVSTNALEHKYFHTGEKPFQCEICGKHFMYSRFLASHRRTQHWEIMTGTPLVKYDCTICNKHYTSSSGLKRHNLRNHNTEGIDTSVLCENCGKRISTKEKLKFHMRIHTGYKPFACEVCGKAFTEKERLKEHIRVHTKEKPFVCTYCGKSFTQRSPLKIHERTHTGEQPYPCRLCGKGFVSKSAMDAHMKSGHFSTGRTSRHHHHFTCKICGSIFGSGESLEAHLVTHTLSQPPRGGKEEEGKHLQELLERRQFTASLNELLMARSRERVSFEAHELIQKREEEVNVGEPHIEVKVEIKEEDLADNFNPHEYIARKSTKKKPKEEHPPPLELKEPIECELCDNAFKNNIIFALHSLRHSQDGKYSCHLCSYRSCSQYSIKMHVRAHEGTTRHKCEVCQKAFTNSVYALEHRYFHTGETPFHCEICGKQFMYSRFLAAHRRNIHYELLTGQPLVKHDCNICGIHYASASGLRRHNLKNHSEQPVDLSVLCDICGKRLTNKERLVFHRRIHTGHKPFACHVCPKKFTTRDRLLEHVRVHTGERPFICKFCQRGFTQRAPLKIHERTHTGERPYVSKCCGKGFISKGLMNGHMKNCKVKKSSEGEIKVEIKEEDLLESYNPHDFMVLQNAKKKPKVDHPPPLELKVPIKCEFCDNLFKNNIVFALHSLLHSEDGKYSCHLCPYRNASKYIIKMHVRAHEGTTRYKCEVCQKAFVKSVYALEHKYFHTGETPFHCEICGKQFMYSRFLAAHKRNIHYELLTGQPPVKYDCNICGINYASACALRRHNLKNHSEQPVDLSVLCDICGIPYASVYNLRRHNQKHHSDQRVDMSVLCDICGKRLTSKARLVFHRRIHTGQKPFACDICPKKFTTRNQLQEHVRVHTGERPFICKFCGRGFTQKTPLKVHERTHTGERPYVCKYCSKGFITKGLMDGHMKNCKVKKSSEGEIKVEIKEEDLSESFNPHDFMVIKNSKKKPKEDHPPPLELKEPIKCEFCDNVFKNNILFALHSLLHSEDGKYSCHLCPYRNASQYIIKMHVRAHEGTTKYKCQVCQKAFINSVYALEHKYFHTGETPFHCEICGKQFMYSRFLAAHRRNIHYELLTGQPSVKYDCNICGVHYASASGFRRHNLKNHSDQPVDLSVLCDICGTPYATAYGLRRHNQKHHSDQRMDLSVLCDICGKRLTNKERLAVHRRIHTNYKPFACHICPKKFTLRDRLLEHVRVHTGERPFICKFCERGFTQKTPLKVHERTHTGERPYVCKYCSKGFITKGLMDGHMKNCKVKKSSEEEIKLEIKEEDLTESFNPHDFMAIENSKKKPKEDHPPPLELKERIKCEDGKYSCHLCPYRHASQYIIKMHVRAHEGTTKYKCQVCQKAFINSVYALEHKYFHTGETPFHCEICGKQFMYSRFLAAHRRNIHYELLIGQSSVKYDCNICGIHYASASGFRRHNLKNHSDQPVDLSVLCHICGKRLTNKERLAVHRRIHTNYKPGEIKLEIKEEDLTESFNPHDFVAIENSKKKPKEDHPPPLELKEPIKCEFCDNVFKNNILFALHSLLHSEDGKYSCHLCSYKNVSKYHIEMHIRAHEGTTKYKCQVCQKAFINSVSALEHKYFHTGETPFHCEICGKQFMYSRFLAAHRRNIHYELLTGQPSVKYDCNICGVHYASASGFRKHNLKNHSDQPVNLSVLCDICGIPYASVYNLRRHNQKHHSDERVDLSVLCDICGKRLTSKERLAVHRRIHTGYKPFACHICPKKFTLRDRLLEHVRVHTGERPFICKFCGRGFSRKTTLTLHERRHTGDRPYVCKYCSKGFVTKSLMDVHIKNCKVKKSAE
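The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. The sketch below assumes the standard code (NCBI translation table 1) applies to this entry; CODON_TABLE and translate are hypothetical names, and the test strings are the first 30 and last 9 nucleotides of the coding sequence above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Lower-case (soft-masked) bases are upper-cased first; codons that
    contain characters other than A, C, G, T are translated as 'X'.
    """
    seq = cds.upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the coding sequence in this record.
start = translate("ATGGCAAAAGATTCAGGTGCACAGCTTATA")
# Last 9 nucleotides, ending in the TGA stop codon.
end = translate("GCTGAGTGA")
print(start, end)
```

When translating the full entry, join the wrapped lines of the coding sequence into a single string first, since a line break inside a codon would shift the reading frame.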

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-