Basic Information

Gene Symbol
-
Assembly
GCA_034766995.1
Location
CM068313.1:21797267-21842383[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 48 4.2e-06 0.00059 21.3 1.2 3 23 240 260 240 260 0.99
2 48 0.12 17 7.3 0.1 2 14 299 311 298 315 0.88
3 48 9e-07 0.00013 23.4 5.3 1 23 324 346 324 346 0.99
4 48 0.047 6.7 8.6 1.7 1 23 352 372 352 372 0.97
5 48 0.0016 0.23 13.2 0.2 1 23 383 406 383 406 0.93
6 48 1e-06 0.00014 23.2 1.0 2 23 412 433 411 433 0.97
7 48 2.1e-05 0.003 19.1 3.5 1 23 520 543 520 543 0.98
8 48 0.004 0.56 11.9 4.8 1 23 551 573 551 573 0.97
9 48 0.0052 0.73 11.6 0.9 1 23 579 601 579 601 0.97
10 48 0.00011 0.015 16.8 3.8 1 23 607 629 607 629 0.98
11 48 2.6e-07 3.7e-05 25.1 1.0 1 23 635 657 635 657 0.98
12 48 2.9e-05 0.0041 18.7 2.1 1 23 686 708 686 708 0.98
13 48 7.6e-07 0.00011 23.6 1.4 1 23 718 740 718 740 0.98
14 48 1.1e-05 0.0016 20.0 0.1 1 23 746 768 746 768 0.98
15 48 2.4e-06 0.00034 22.1 1.3 2 23 775 796 774 796 0.97
16 48 3.4e-07 4.8e-05 24.7 1.4 1 23 802 824 802 824 0.99
17 48 0.026 3.7 9.4 1.9 3 21 835 855 830 858 0.64
18 48 2.5e-05 0.0035 18.9 6.3 1 23 863 885 863 885 0.99
19 48 3.3e-05 0.0047 18.5 1.6 2 23 895 916 894 916 0.96
20 48 2.1e-05 0.0029 19.1 0.3 1 23 919 941 919 941 0.98
21 48 5.5e-05 0.0078 17.8 2.2 2 23 948 969 947 969 0.97
22 48 7.6e-07 0.00011 23.6 0.4 1 23 975 997 975 997 0.98
23 48 0.0084 1.2 10.9 1.1 1 23 1056 1078 1056 1078 0.93
24 48 8.6e-05 0.012 17.2 1.1 1 23 1084 1106 1084 1106 0.98
25 48 8 1.1e+03 1.5 0.4 1 8 1112 1119 1112 1120 0.94
26 48 2.6 3.7e+02 3.1 7.9 3 23 1180 1200 1178 1200 0.93
27 48 0.0015 0.21 13.3 0.6 3 23 1255 1276 1253 1276 0.95
28 48 6.8e-06 0.00096 20.6 3.0 1 23 1282 1304 1282 1304 0.98
29 48 1.6 2.2e+02 3.8 0.3 1 14 1311 1324 1311 1327 0.85
30 48 0.00048 0.067 14.8 2.2 2 23 1340 1362 1339 1362 0.96
31 48 1.3e-05 0.0018 19.8 0.6 2 23 1368 1389 1367 1389 0.97
32 48 0.0031 0.44 12.3 8.4 1 23 1398 1420 1398 1420 0.98
33 48 0.00014 0.019 16.5 0.2 1 23 1423 1445 1423 1445 0.97
34 48 0.0011 0.15 13.7 1.7 2 23 1452 1473 1451 1473 0.96
35 48 1.8e-07 2.6e-05 25.6 0.5 1 23 1479 1501 1479 1501 0.99
36 48 2.2 3.1e+02 3.3 1.7 1 23 1554 1576 1554 1576 0.97
37 48 1.4e-06 0.0002 22.8 2.0 1 23 1592 1614 1592 1614 0.98
38 48 0.0009 0.13 14.0 0.0 1 23 1617 1639 1617 1639 0.96
39 48 1.3e-06 0.00018 22.9 1.0 2 23 1646 1667 1645 1667 0.97
40 48 0.001 0.15 13.8 0.7 1 16 1673 1688 1673 1690 0.93
41 48 2.1e-05 0.0029 19.1 0.4 2 23 1721 1742 1720 1742 0.97
42 48 2.2 3.1e+02 3.3 1.7 1 23 1784 1806 1784 1806 0.97
43 48 0.005 0.7 11.6 1.8 1 23 1827 1849 1827 1849 0.98
44 48 0.00098 0.14 13.8 5.3 1 23 1859 1881 1859 1881 0.98
45 48 2.4e-06 0.00034 22.0 0.4 1 23 1887 1909 1887 1909 0.98
46 48 0.00011 0.015 16.9 5.1 2 23 1916 1937 1915 1937 0.96
47 48 2.6e-07 3.7e-05 25.1 2.7 1 23 1943 1965 1943 1965 0.98
48 48 1.3 1.8e+02 4.1 1.3 1 21 1971 1991 1971 1992 0.91

Sequence Information

Coding Sequence
atgtcgCAAATAGTTGATTGTTCGGAGTTTTGTAGACTTTGTTTAGTGAAGGAACAAGTAAATACTCCAATTTTTGAAGAAGATGGTGGTTATCAtcaattatccacaaaaatttCAACTTGTTTGCCAGTCAAGgtATCAAGAGAAGATAGTCTTCCTAAAAGAATTTGTGATGGGTGTTGTTACAAAGTTGAGCAACAATATCAATTTTGGAATATTAGTTTAAAtgctgaaaaacaatttaaacaatgGTTAGGAATTTCAGAAGAAACAGAGACTGTTAAACAAAGTgttgttaacaaaataaaaaaagaatctacGTTTGAAGGTCTTTCAGGAAACAAAAGACCGAAACGAAATGCAAGACGAAAAGCTGTACAAAACATTCGACAAGCAACTGCAGCACAACAGTCTGATAGTGATAGTGATATTCCTTTGGCAGCTAGAACCGCTGCACCACCTGTTGAAGAGTCAGTTGCAAagaatgaaaatgaTTCAGACCATCATGATTTCGCTGAGGATAATGATACAATGGAAGAACctaattttgttaatatgaaTTTGGAAGATGGCCAAGACCAAGCAGGACCTTCAGGATTACAAAAAGCTGTATCAACAGAAACTCTTGAACCAACACCAGCTGCATTAAATTTGGAAACCGAAGCTGGAGAGGGCTTAAATAATCATCAGTATAAACATAGATCCGATAGAACTGGTTGTCACATTTGTGGTAAAACATATAGTTATGAAGGTGATCTTAAAAGACATATTCTTACCCACGATCCGAACTATATATCAAAAACAACATTCCCGTGTgatatgtgtaaaaaaaagttCTTCTGTAAATCGGCTTTAGATTTTCACGTACAGGTACATCATAGAGGGGAATTGCTTGTTTGTGATGTTTGTGGAAAAGGATTTAAGTCAAAAGAAGGGGTACATGCCCGAATCCATACAGGAGAAAAACCTTTTAAATGTTCAGATTGTGATAAATGTTTTACGCAACGTAGTTCTTTAACTGTTCATAAAAGGAGTCATACGGGGGAACGACCGTATCCATGTCATTTATGTCCAAAAGCTTACGAAAACGATTTCAAACGTCACGTTCGGACACACGATCCGGATTATgtgatgaaaattaataattatccgTGTGAGTTGTGCGATAAGAAGTTTCCACGTCAATCAGGGCTAGATTTTCATGTTAGAGTCGAGCATAAAGGCCAGTTATTAGTGTGTGATGTCTGCGGAAAAGATTTTCGTACAAAAGATGGATTAAGGCACCATATTCGGTCACATACTGGAGAAAAACCATACAAATATTATCAAACAATTGATACATCAAAATTGAGAAGTTCATGTATAACATTTGTTAATTGTAATAAGAATGAAGAAAACAAGTCTACAATATTACCGAAGTATAAAAAGCGAGGTTTTGTCGAACGACAGTTTAAAGCATCGTATCTCAATAATTCTGTAGACAAGTGCGCTAGAACTGACAGAGGTTATCAGTGTACATTACAAACACATGTTGAAATTGAACATGTTGGCAAACGATACACTTGCGAGACGTGTCAGAGTACTTTCAGATCACAGTCATATCTAAATACTCATATGAGAAGAAAGCACGATCCTAATTATGTATCCACATATAAATGTCATATGTGTAATAAAGTGGTGTCTACCAAGTCGTCTTTAATGTGTCATTTAAAAGCACACGAAGGAATCAAAAACTACACATGCGATATGTGTGGCAAGAATTTTTTAACACTTCAAAGTTTGCGATTGCACAAAAGTATACATACGCAAGAAAAGCCGTTTGCTTGTGATATCTGCCataataaattcagaaaaaaatcagatGTACGAACGCACATGCGTACTCATACGGGGGAGAAACCGTACAAATGTGAAGAATGTGGGAAAAAGTTTACTCAACGCAGTAGTTTGGTAGTTCATGTGCGATATCATAGTGGCGATAGACCTTATAAATGTCATTTATGTCCGGGAAATTTTTTACAAGAGCATATAGATATTGAACACAAAGGAAAAAGATACAATTGTGATATTTGTAATAAAGGGTTCAAATCCCAGATTTACTTGAATACACATCGCAAGAAGCATGATCCAAACTACGTATCAGAAAAATCGTTCGAATGTGATATGTGCGGGAAAGTGTACTCGTCtcatacaaatttgaaaattcatattaatacacATAAAGGATTGAAAAATTATACGTGTGATATTTGTGGACGAGGATTTGTGTCGGTGACAAATTTGAATACGCATATGGTTGTTCATACTGGAATAAGGCTGATAGAATGTGAAGTTTGCAATAAACGATTTCGTAAAAAGCaagaattgaaaatacatatgCGTAGTCATACTGGGGAACGTCCCTATCAATGCAATGAATGTGGAAAAAGTTACACTCAGCGTAGCAGTTTAGTAGTCCATAGGAGATATCATACCGGAGAGCGACCTTATCTTTGTCATTTATGTGATAAAGTATTTTGCAATAAAGGATTTTATCAAAAAGCagaattaaaaatgcatatagaTGCTGATCATAATGCAATTCGGTACCAGTGTCATATATGTAAGAAAACGTTCTGCAGTAAAGGATATCTTTCGATACATTACAGAGTGCACGATCCAAATTATGTGCCGCCAGAGTTGAAATGTGAAATTTGTggaaaaacatatcattcacaAGATTTATATAGACGACATATCAAAGCACATGAAGGATTCACATGCGATGTTTGCGGAAAAACATTAACGTCACAGATCAGTTTAAAAAACCATCTACTTATACATCGTGGAGAGAAACCGTTATCGTGTAACGTGTGCGGTAAATGTTTTAACAAACGAGCTATTCTGAAAGTGCATATGCGTACACACACAAACGAAAAACCGTACGAATGTAAGGAATGCGGAAAAAGATTTGCTCAACGTAGTCCTCTTGTTATACATATGCGATATCACACAGGCGAAAGACCATATCCATGTAAACAGTGTTGCGAAGCATTTGTTAGTAACACAGCAGTTTGGATCTGTAAACAAGAGCCTTTATATGAAGAAGTAGAAGTAAAATCAGAGTACGAttataattgtacaaaaaattctgAGGAAAGCGGATATGTAAAAATTCACGATAAATTTTTGTGTAgtgtttgtaataaaagtttCAGTAAAAAGATCCGTTGGAAAGCACATTACGCACAACATTTCGAAGATCGTCCTTTCAAATGTGATGTTTGCGATAAAAGTTACAAAACGAAAGACGGTCTCTATGGCCATACGAAATTACATTCAAGAGAAAAGAAATATAGTTGTCCTACATGTGGAAAATTGCATTATTCAACGACGGCCGGAAAATTTTGCGAAAAAACTCATTCGAATGATTTTGCACACAAGGATTTGACAtggaaaataaaacaagaaccATTATATGAAGAACAACAAGTAGCTGCTACATCCAACGTCGATAAAGAAATGCAAGGATATACGATacgtaaaaatatgatttgttgtaaaatatgcaACAAAACATTCCGTAAATTCACTAGATGGAAAATACATTCCTGTAAACATGAAACTGTAGAAGAATTAAACTATCACATAGAAGATGTCAAAGTTGAAATAAAAGATGATCCTAATTTGGATCAAAATATCCTGTTGCAAACATTAAATagcaatttaaatgaaacattgaaCGAAGAAGAAAACTATTATCGAGAAGATGAcaaaattttctgtaaattatgTACACGGACGTTCAAAAGGCCTGCAAAATGGAGAATTCACTATCGTGGAGTACATTGTAATATACGTTTATTCAGTTGTGATCAGTGTAATAAacgatttaaagaaaaatcgcAATTGAAAGTTCATCGAATGACGCATGAAACCGATAGGCCAAAGTTCGCGTGTAAAATTTGTAGTAAAGAATTTCATTACGCTAGTTCGGCACGCGTTtgcgaaaaaaatcattcgcagttatacaaaatgaaatgtcCTCATTGTGATAAAGGATTCCATTATCAGAGTTTATTAGATACGCATATCGAAGTGAAACACAATGCAGTGAGGCAAGAATGTCCTATATGCAATAAAACTTTTGCCAATAAAGTATGTTTGGCAGCTCACGGCCGTACTCACGAACCTGATTATGTGTCCACTCAGTATAGCTGTGAAAGATGCGGCAAAACGTACAATTCGCGCAAATGCTATCATCGACACATGAAGTCGCATGAAGGTTTCGCGTGCGATATATGTGGTAGAATTTTTGGATACGAACATGGATTGAAAAGTCACATGTTGATTCATAGAGGAGAGAGACCTTGGAATTGTGAAGTGTGcggtaaaagttttaataagaaACATATAATGACAGTGCATTTAAGAACGCATACGAATGAAAAACCATACAAATGTGATGTTTGTGGGAAACGGTTCACGCAACGAAGTCCACTTGTCATTCATATGCGTTCACATACCGGCGAACGCCCTTATCCTTTTACAATCTCCAATTCCAGGACAATTGCTGAAGACGTTAAACACGAATATACTGATAACCAGTCGTCAATTCCTGATGAGAATCAGGTCTCAGAATGGATGGAAGGTGAAGGTTACACTCGTATGCACGACGGATACCACTGCAACGGTTGTAAAGATGTATTCTTATATAAGGATGATTGGCTAAATCATTATCGATTACACACACATTATAAAGTGTTCCACGATGCGAATTATGTCGCGCCAGTGTTTACTTGTGAAGTGTGCGATAAAACTTACAATACACgtggttatttaaaaaaacatttgcgCAGCCATGAGGGGTTTCCTTGTGACATTTGCGGAAAGATACTGACTACGAAACCTAGTTTAGTGGATCATTTAGTTGTACATAGTGGAATGAGACCGATTAGTTGTGATATATGCGGGAAaagttttagtaaaaaaagaacTCTGAGCGTACATATGCGACAGCACACTGGTGAAAAACCCTACCAGTGTAGCGAATGCGGAAAGAGGTTTACACAGAGGACGAGTTTGTTATATAAAGTGAAATGCCCTCATTGGGACAAAACGTTCCATTATCAAAGTATATTAGATAGGCATATTGATGTGAAGCGGAATGCAGTGAGACAGGAATGTCCTATTTGCAATAAAACATTTGGCAATAAAGTTCGTTGGGAAAAACATATCCGAACTCATAAATCCTCGACAATTGCTGAAGACGTTAAACACGAATATACTGATAACCAGTCGTCAATTCCTGATGAGAATCAGGTTTCAGAATGGATGGAAGGTGAAGGTTACACTCGTATGCACGACGGATACCACTGCAACGGTTGTAAAGATGTATTCTTATATAAGGATGATTGGCTAAATCATTATCGATTACACTATTGTAGTTCAATCGAGGAATATGTTTCGCATAAACGCACGGATCATCAACGTATTGATTTCTATTGCGATGTGTGTGCAAGGAAATGCCGTGATCAGATTGATCTCGATAAACATCTTCAGACACACAAGGATAGTTACAAAAAACAGCGACCATTTCAGTGTTCGCATTGTGAACATAGATTTGTTTCCAATAAGAGTTTACGAGCGCACGAAAAATCGCACAATCCCAATAACAGATTCGTTTGTCCGGAATGTGGAAAAAGCGTTTCTACAAAGCATACATTGGTCGAGCATATCAGATTGCACACGGGTGAACGGCCGTGCGTATGTCATATTTGCGGACAGAATTTCACGAAGAAGTCGACGTTACGTACACATGTACAAAGTCATTTTGGACAAAAATTACATCAGTGTCCTGTTTGCAAGAAAAGTTTCACTCAACGCAGCACCTTGATTGTACACATGCGGTATCATTCTGATGACAGACCTCACAAATGTATTCATTGCGACCGAGGATATGTAAGCCGTTCGTTACGGAACACACATCAAGCAACATGCATTTTTCAGcgaaattttaatgtaaaatcatTGAATCAGAGTCCAGACAATAACGATGAAAGTAATGTTTAA
Protein Sequence
MSQIVDCSEFCRLCLVKEQVNTPIFEEDGGYHQLSTKISTCLPVKVSREDSLPKRICDGCCYKVEQQYQFWNISLNAEKQFKQWLGISEETETVKQSVVNKIKKESTFEGLSGNKRPKRNARRKAVQNIRQATAAQQSDSDSDIPLAARTAAPPVEESVAKNENDSDHHDFAEDNDTMEEPNFVNMNLEDGQDQAGPSGLQKAVSTETLEPTPAALNLETEAGEGLNNHQYKHRSDRTGCHICGKTYSYEGDLKRHILTHDPNYISKTTFPCDMCKKKFFCKSALDFHVQVHHRGELLVCDVCGKGFKSKEGVHARIHTGEKPFKCSDCDKCFTQRSSLTVHKRSHTGERPYPCHLCPKAYENDFKRHVRTHDPDYVMKINNYPCELCDKKFPRQSGLDFHVRVEHKGQLLVCDVCGKDFRTKDGLRHHIRSHTGEKPYKYYQTIDTSKLRSSCITFVNCNKNEENKSTILPKYKKRGFVERQFKASYLNNSVDKCARTDRGYQCTLQTHVEIEHVGKRYTCETCQSTFRSQSYLNTHMRRKHDPNYVSTYKCHMCNKVVSTKSSLMCHLKAHEGIKNYTCDMCGKNFLTLQSLRLHKSIHTQEKPFACDICHNKFRKKSDVRTHMRTHTGEKPYKCEECGKKFTQRSSLVVHVRYHSGDRPYKCHLCPGNFLQEHIDIEHKGKRYNCDICNKGFKSQIYLNTHRKKHDPNYVSEKSFECDMCGKVYSSHTNLKIHINTHKGLKNYTCDICGRGFVSVTNLNTHMVVHTGIRLIECEVCNKRFRKKQELKIHMRSHTGERPYQCNECGKSYTQRSSLVVHRRYHTGERPYLCHLCDKVFCNKGFYQKAELKMHIDADHNAIRYQCHICKKTFCSKGYLSIHYRVHDPNYVPPELKCEICGKTYHSQDLYRRHIKAHEGFTCDVCGKTLTSQISLKNHLLIHRGEKPLSCNVCGKCFNKRAILKVHMRTHTNEKPYECKECGKRFAQRSPLVIHMRYHTGERPYPCKQCCEAFVSNTAVWICKQEPLYEEVEVKSEYDYNCTKNSEESGYVKIHDKFLCSVCNKSFSKKIRWKAHYAQHFEDRPFKCDVCDKSYKTKDGLYGHTKLHSREKKYSCPTCGKLHYSTTAGKFCEKTHSNDFAHKDLTWKIKQEPLYEEQQVAATSNVDKEMQGYTIRKNMICCKICNKTFRKFTRWKIHSCKHETVEELNYHIEDVKVEIKDDPNLDQNILLQTLNSNLNETLNEEENYYREDDKIFCKLCTRTFKRPAKWRIHYRGVHCNIRLFSCDQCNKRFKEKSQLKVHRMTHETDRPKFACKICSKEFHYASSARVCEKNHSQLYKMKCPHCDKGFHYQSLLDTHIEVKHNAVRQECPICNKTFANKVCLAAHGRTHEPDYVSTQYSCERCGKTYNSRKCYHRHMKSHEGFACDICGRIFGYEHGLKSHMLIHRGERPWNCEVCGKSFNKKHIMTVHLRTHTNEKPYKCDVCGKRFTQRSPLVIHMRSHTGERPYPFTISNSRTIAEDVKHEYTDNQSSIPDENQVSEWMEGEGYTRMHDGYHCNGCKDVFLYKDDWLNHYRLHTHYKVFHDANYVAPVFTCEVCDKTYNTRGYLKKHLRSHEGFPCDICGKILTTKPSLVDHLVVHSGMRPISCDICGKSFSKKRTLSVHMRQHTGEKPYQCSECGKRFTQRTSLLYKVKCPHWDKTFHYQSILDRHIDVKRNAVRQECPICNKTFGNKVRWEKHIRTHKSSTIAEDVKHEYTDNQSSIPDENQVSEWMEGEGYTRMHDGYHCNGCKDVFLYKDDWLNHYRLHYCSSIEEYVSHKRTDHQRIDFYCDVCARKCRDQIDLDKHLQTHKDSYKKQRPFQCSHCEHRFVSNKSLRAHEKSHNPNNRFVCPECGKSVSTKHTLVEHIRLHTGERPCVCHICGQNFTKKSTLRTHVQSHFGQKLHQCPVCKKSFTQRSTLIVHMRYHSDDRPHKCIHCDRGYVSRSLRNTHQATCIFQRNFNVKSLNQSPDNNDESNV

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-