Basic Information

Gene Symbol
-
Assembly
GCA_963575645.1
Location
OY754475.1:21674988-21683605[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 32 0.0034 0.07 14.1 6.7 1 23 244 266 244 266 0.98
2 32 3.9e-05 0.00079 20.3 0.6 1 23 272 294 272 294 0.98
3 32 0.00033 0.0068 17.3 2.6 1 23 300 322 300 322 0.97
4 32 2.1e-05 0.00043 21.1 1.9 1 23 328 350 328 350 0.97
5 32 0.0026 0.054 14.5 3.7 1 23 371 393 371 393 0.97
6 32 0.00011 0.0022 18.9 0.7 1 23 399 421 399 421 0.96
7 32 0.0035 0.072 14.1 0.7 1 23 427 449 427 449 0.95
8 32 5.9e-05 0.0012 19.7 0.5 2 23 456 477 455 477 0.94
9 32 2.8 57 5.0 0.3 1 12 483 494 483 496 0.91
10 32 10 2e+02 3.2 0.5 1 18 504 521 504 521 0.94
11 32 0.0058 0.12 13.4 0.3 1 23 707 729 707 729 0.97
12 32 0.0016 0.033 15.2 2.3 1 21 737 757 737 762 0.94
13 32 1.2e-05 0.00025 21.8 3.9 1 23 772 794 772 794 0.97
14 32 1.2e-05 0.00024 21.9 0.8 1 23 812 834 812 834 0.96
15 32 0.0023 0.048 14.7 0.6 1 21 889 909 889 910 0.94
16 32 0.0033 0.067 14.2 1.2 2 23 923 944 922 944 0.92
17 32 0.38 7.8 7.7 4.2 1 23 949 971 949 971 0.97
18 32 0.00097 0.02 15.9 4.5 1 23 975 997 975 997 0.97
19 32 0.00019 0.0039 18.1 2.2 1 23 1003 1025 1003 1025 0.99
20 32 0.023 0.47 11.5 5.0 1 23 1034 1056 1034 1056 0.98
21 32 0.0043 0.089 13.8 3.0 1 23 1067 1089 1067 1089 0.98
22 32 0.0059 0.12 13.4 0.6 1 23 1098 1120 1098 1120 0.98
23 32 0.00027 0.0056 17.6 2.2 1 23 1454 1476 1454 1476 0.97
24 32 0.0013 0.027 15.5 2.6 1 23 1487 1509 1487 1509 0.98
25 32 5.9e-05 0.0012 19.7 6.0 3 23 1833 1853 1831 1853 0.95
26 32 1.4e-05 0.00028 21.7 1.0 1 23 1859 1881 1859 1881 0.98
27 32 0.00086 0.018 16.0 0.4 1 23 1887 1909 1887 1909 0.98
28 32 0.00017 0.0034 18.3 2.6 1 23 1915 1937 1915 1937 0.92
29 32 0.0078 0.16 13.0 3.0 1 23 1943 1965 1943 1965 0.96
30 32 1.6e-05 0.00033 21.5 1.1 1 23 1971 1993 1971 1993 0.97
31 32 0.021 0.43 11.7 5.2 1 19 1999 2017 1999 2021 0.90
32 32 0.0003 0.0062 17.4 1.0 1 23 2027 2049 2027 2049 0.98

Sequence Information

Coding Sequence
atgattacgaTGTGTACTTCCGTTAAGgtcTCAAGATACGAGGGAATACCAAATTTCATCTGTCGGGCGTGTTacgttcaattaattaaaacataccGCTTTCGAAGACAAGCTCAGAAATCTGACAAAACATTACGCGGCTATCTGCAAAGACGTGTCGGTGACAAGAAAACCGAACCTGAAAggtcttttcaacaattttctGAGAACGCTGGTGATCAGGACATCAAGAAAAATAATGGTGTTttcgaattaaatattaaagaagAAATAGCATCGATAGATGCAGCACCAGAAGACTTTGCTGGAATTCTAGACAGTATAAAACAAGAATATTATCCTGAACAAAATTCTTCACGTGTCGAAACAGTTTCTTTGAATTCAGTACAATCTGAAAACAACGAAACATTCAGAATCGATATTAAGGAAGAATCGATTTTTTCGTGTCGGGAATCTGAAGACTTGGAGATATTTCGAGATAATTCGAAACAAACATCTGCTGCAACTTTGAAAAATGGCTTCTTGCAACAAGTTGACAATGACGTGAAGCTAGAAGCACCACACAGTGTATTCTTAAGTagtgataataattttaagtcGGAAAAAACAGATGGCGACGGAATATTGAATGAAACGAATTTGTTTGGTAGCTACAAAGTACAGTTAAATACTTCCAAAATAAACGTAGAGAATTTTGAAGACTCAGACACTTTGACTGTAGAGCAACCACATACTTGCGAATTTTGCAAGAGATCATTTTCGCGCCTCTTTgatttaaaaacacacaaaagaaaacatacgggtgagaaaccatacaagtgtgacatttgtaataagcgttTCCCATACTTAGTAAGCTTGAGAAAACATATTCTGCtgcataccggtgagaaaccacaccagtgtgacgtttgcaataaaCGATTCAGTGAAACaggcaatttaaaaaaacataaactaatacatacaggtgagaaaccgtaccagtgcgaTGTGTGTGATAAGACGTACTCGCGATCGAATGATCTGAAAAGACACAAATTTGTGCATACCGGTTGTCAATTgagtgatttaaaaaattccacaatAACACATCCTCGCGAGAAACCGTATCGGTGTGATGTTTGTTATAAGTGCTTTTCACAAATGGGAactttaaataaacataaactgCTACATACTCGCGGTAAACCGTTTCCATGTGAAGTTTGCGAGAAATGTTTCCCAAATAGGTATGCTTTAAAAAGGCATGGCTTGATACATGCTGAtaagaaaccttatcagtgcgATATATGTGATAAGGATTTTGTTAATTCGAATAATTTAGCATTCCATAAGTTaaaacatactggtgagaaacccttGCCATGTAACGTTTGCAGTAAGCGTTTCACACGCCCGTACGATTTGAAAAGACATGAGCTGTTGCATACAAACGAGAAACCctttcagtgtgacgtttgcaataaaGGGTTTATCCAGgttACACTACACGACGAGATGCCGCATTACATTTGCGATTTGTGCTACTACCGACTGAGAAATTCACAAAACCTTCGGAAGCAAGCCTTGAAATCAGACTTAATACTACGCGGTTATTTGCGACAGCTACGTATTTCCAACAAACAAACCGAGTCAGAAGGTTCTAGTCAACAACATTCGATcaaccaaaattcaaaagtgaTCAACGACAATGAAGTACTCCAGTTAAATGTTAGCGTATCGCCTGGGGATTTATTAAACTATTCAAACCAAAAAAGTGATACCGAGCAAAGTTTTTGTAATGAAATCGATGACTCGGATGCCGAACAAGATCCTATTTTTATCGAACATAGTGATTTTAACGATACCGGCTTGCAGCAAGCTGAGGCAAGTTTGAAGCTAGCGACAGGTCGAGATCATTTTAGTGATGCCTGTTGTATCAGCGAAACGGAAAATGAAGTTGGCTTCGACTTTGATGGAAATCATTTTTTGAATTGTTCTAAGGATCCGGAAGCTATTTTGATCGACTCCGATGATAATAGTTGTTTCGAACAGAGCGATTTGTTAAAGAATTCCACCGCTGAGAGTTGGATCGGACACATGACCATAGACGACTTTCAGCGCAATGAAACACGGTTTATGTGCAACATTTGCAAGAAATATTTCGCTGATCGAACTCTTCTAGCGGGCCATGAATTAACACATAAAAACGAAACTGAATATCGGTTccaatgtgacgtttgcaaaAGATGTTTCAAACAATTGCGGTCGTTGAAGATCCACGAGAGAAATCCAAACGTTCACAAAGATCTAAACACAAGTATATACGCCCACCGGTGTGAAGTTTGCGGCAAAAGTTTTACGAACAAAACATATCTGAAAAGACACGGCCGCCAACATTTAGATACCGACGAATTAGCGGAACATAAGAAACGTTACCAACAAAAATTGTTCCGATGTAATGTTTGCGATAAAAGTTTTGCAAATCGGAATAATTTGGAGAAGCATAATTTAATTCATACACGTCAGAATACGTTTCAATGTGACCTTTGCCAGAGGATTTTTTTGGCGCAACTCGGCTTAGACCAGCATAAAAACGTAACACATGCACTTTGTGATATTTGTGATTCGACTCCATGTGATTTAGAGAAGCACAAAGCCGTTTATAATAGCGATCATCCGTATCAATGCGATGTATGTAAAATGCGTTTCAGAACGAAGGTGGGTTTGAAAAACCACGAGAAAAATCCCGACAGTCACAAAGTTGCGGACGCGCCGCAAGTTTATTGTGGCGTTTGCAATAAGGGTTTTAGAAGCAATATTCATTTGCGTCGGCATTCGGCTCAACATCGCAGGAAGCAACACAAGTGCGATATGTGCGACGTGCGTTTTGCGAAAATAAAACACTTACTAAGCCATAAAACAAGTCATCAAGAAGATCACAAATGTGACATTTGCGGTAAAGGTTTTATAAATACGCATTTTTTGCGCAGACACAAACGTGGACACGTTTTGGATTCACGGTATAAGTGCGATGTTTGCGACGCGTGTTTCAAAGGTCCGAGAGCGCTAAGGAAACATAAGAAAACTCACTtagaaattaaaagaaaatcggTATAtaaatgtgacatttgtcataCAATTTTTAGATACGAAAGAAGTTTTGAAAAACATAATAAGCGACATTCTGATAAATCGTTAGCAAAAGAAGGGGAATATAAATGTAGCGTTTGTAACCTGACTTATAAATAccaaaagaattttgaaactcacagtaaaatacattttgatcAATGGAAAGAAGGAGAATATAGTTGTGACGTATGTCAGGTGATTTTTAgataccaaaacagttttgaaaAGCACGCCAAGATACATTCTGATCAAGGGAAAGATAAAAAACGTAAGATGCATTCTGACCAGTCGTTGGTAAGAGAAGGAAAAGATAAAAAACGTATTAAGAAAAATTCTTATCAATCATTAGTAAAAGAGGGGGAAGCTAAAAAACATAATAAGATACATTCTGATCAATCAGTAAAAAAAGAGGGAGATAAAAAACGTAAGATGCATTCTGATCAGTCGTTGGTAAGAGAAGGAAAAGCTAAAAAACGTGTTAAGAAAAAATCTTATCAATCATTAGTAAAAATGGGAGAAGCTAAAAAACGTAATAAGATATATTCTGATCAATCAGTAAAAAAAGGAGGAGATAAAAAACGTAAGATGCATTCTGATCAGTCGTTGGTAAGAGAAGGAAGAGATGAAAAACGTATTAAGAAAAATTCTTATCAATCATTAGTAAAAGAGAGGGAAGCTAAAAAACATAATAAGATACATTCTGATCAATCAGTAAAAAAAGGGGGAGATAAAAAACGTAGGATGCATTCTGATCAGTCGTTGGTAAGAGAAGGAAAAGATATAAAACGTGTTAAGAAAAATTCTTATCAATCATTAGTAAAAGAGAGGGAAGCTAAAAAACATAATAAGATACATTCTGATCAATCAGTAAAAAAAGGGGGAGATAAAAAACGTAGGATGCATTCTGATCAGTCGTTGGTAAGAGAAGGAAGATATGAAAAACGTATTAAGAAAAGTTCTTATCAAACATTAGTAAAAGAGAGGGAAGCTAAAAAACATAATAAGATACATTGtaatgaatcatttaaaaaaagggAAGATAAAAAACGTAAGATGCATTCTGATCAGTCGTTGGTAAGAGAAGGAAAAGATAAAAAACGTATTAAGAAAAATTCTGATCAATCAGTAAAAAAAGGGGAAGatagaaaacataataaaacacGTTCTGATCAACCATTAGTAAAAGAAGGGGAAGATAGAAAACATGATAAAAAAACTTCGGATCAAATGAAAAAAGGCGAACATGAATGCAACGATTGTGAACTGGTATTTAAATACCGAAAGAATTTGGAAAACCACACTAAGATCCATTCTGAGGAAACGATAGTTATAGAAGGGGCACACACTTGCGACATTTGTAAGCGCATATTTGAATACCGAAAGAATTATGAAAATCACATCAAGATCCATTCAAAACGATCACCAAGAGACGGGGAAGATGAAAAACTTgataaaatagtttccgatcaAATGAAAAAAGCCACAATTACGCGGAATAATATTATCATGCAATCGAACGCCATggatgcatttagtaaaaagcgtGTAATTATAATTGGACGTGGTCTTACTAAAATGTGTAGATTGTGTTTGAAAAGTGGTGCAGGTATATTTTcgtttaaaagaaaaaagaaacttACAGGAATGATCACAAAATGTACTTCCGTTATGATTACACATTACGACGAGCTTCCGAAATATATTTGTCGGTTATGTTACAATCGGCTATTTCAAACACACCGTTTCCAAAAGCAGGCCTTAAATTCCGATAAAATCCTGCGCGACTATCTGCAGCAAATACGCTTTGCTGACAATCAAACTGGACTTGAAAGTCAGCAGTCTTCTACCGAACACAACTCTGAATCGTCTACTAAGCAAAATGCAGTCAACGCGGGACCAGGAGATTTCAGTCAAATTCTGGGCAATATAAAACAAGAAATTGGCAGTGAAACAGTTTGGTTAAACGATGAACAACAAAGGCATTATGAAGGGCATGAAATAATCAAAGAAGAAAAGACCGAAACTAATCGAAACAATATCAAGCAAGAGTCAACTACAACGCAGGGTGTACTTGCTGAAACGAGTGATAGTTACTTGAAAAAAACTAGACACGTCTCAAAAATCAGGATAGAAACTTCTTACAGCTTACCTTCCGGTTGTAACTCCAAATACGAAGATCTGGATGGTGATGTTGATGTTAATCATCAAAATTGTGATTATTTAGGCTACAGTGATGTAGATGAGGAAACAACAAATGTTTTCACGGAAGAGTCTcaactaaatttaaataatcgaGGAGAACGGATATATATACCGGAGGATTCCGAAAATGCATCTGCACCTCAAGAAGTGTGGAACAATGATGTTAAATCACATctgtgtgacgtttgcaataagtgtttctccAATAAAAGTAACTTGAAAAGACACAAACTTGTACATACTgacgagaaaccttatcagtgtgaatTTTGTGACAAATCATTCATCCAATCGGATGATTTAAAAAACCACGAACTAAAACATACCGGGGAAACGCCTTATCAATGTGACGTATGTAATAAAGGTTTCATTCGATTGGGTGATATGAAATCGCATTATACTatccatactggtgagaaaccgtatcagtgtggcgtttgcGAGAAGCATTTTAAGCGAAGAAGTGACTTAAAAAGACACAGCGTATTGCATACTGGTGATAAACCGTATCGATGTCACGTTTGCAGTAAATCGTTCTCGTTCTTACCTGTTCTGAAGAGACATTTGTTtttacatacaggagagaaaccctttcagtgtgacgtttgtaataagcgtttcGCTGATTGGAGTAATATGAAAAAACATAAACTGTTGCATACAGGCGAGAGACCGTTTCTGTGCGATGTCTGCAACAAAAGTTTTACGCGGCATTTTGATTTGAAAAGACATTGTATATTACATTCGAACGAAAAACCGTTTCAGTGTGATGTTTGCGTTAAGGGGTTTACGACGTTTGCGTCTTTGAAAAGGCACAGGTTGTCACATGTTATTTAG
Protein Sequence
MITMCTSVKVSRYEGIPNFICRACYVQLIKTYRFRRQAQKSDKTLRGYLQRRVGDKKTEPERSFQQFSENAGDQDIKKNNGVFELNIKEEIASIDAAPEDFAGILDSIKQEYYPEQNSSRVETVSLNSVQSENNETFRIDIKEESIFSCRESEDLEIFRDNSKQTSAATLKNGFLQQVDNDVKLEAPHSVFLSSDNNFKSEKTDGDGILNETNLFGSYKVQLNTSKINVENFEDSDTLTVEQPHTCEFCKRSFSRLFDLKTHKRKHTGEKPYKCDICNKRFPYLVSLRKHILLHTGEKPHQCDVCNKRFSETGNLKKHKLIHTGEKPYQCDVCDKTYSRSNDLKRHKFVHTGCQLSDLKNSTITHPREKPYRCDVCYKCFSQMGTLNKHKLLHTRGKPFPCEVCEKCFPNRYALKRHGLIHADKKPYQCDICDKDFVNSNNLAFHKLKHTGEKPLPCNVCSKRFTRPYDLKRHELLHTNEKPFQCDVCNKGFIQVTLHDEMPHYICDLCYYRLRNSQNLRKQALKSDLILRGYLRQLRISNKQTESEGSSQQHSINQNSKVINDNEVLQLNVSVSPGDLLNYSNQKSDTEQSFCNEIDDSDAEQDPIFIEHSDFNDTGLQQAEASLKLATGRDHFSDACCISETENEVGFDFDGNHFLNCSKDPEAILIDSDDNSCFEQSDLLKNSTAESWIGHMTIDDFQRNETRFMCNICKKYFADRTLLAGHELTHKNETEYRFQCDVCKRCFKQLRSLKIHERNPNVHKDLNTSIYAHRCEVCGKSFTNKTYLKRHGRQHLDTDELAEHKKRYQQKLFRCNVCDKSFANRNNLEKHNLIHTRQNTFQCDLCQRIFLAQLGLDQHKNVTHALCDICDSTPCDLEKHKAVYNSDHPYQCDVCKMRFRTKVGLKNHEKNPDSHKVADAPQVYCGVCNKGFRSNIHLRRHSAQHRRKQHKCDMCDVRFAKIKHLLSHKTSHQEDHKCDICGKGFINTHFLRRHKRGHVLDSRYKCDVCDACFKGPRALRKHKKTHLEIKRKSVYKCDICHTIFRYERSFEKHNKRHSDKSLAKEGEYKCSVCNLTYKYQKNFETHSKIHFDQWKEGEYSCDVCQVIFRYQNSFEKHAKIHSDQGKDKKRKMHSDQSLVREGKDKKRIKKNSYQSLVKEGEAKKHNKIHSDQSVKKEGDKKRKMHSDQSLVREGKAKKRVKKKSYQSLVKMGEAKKRNKIYSDQSVKKGGDKKRKMHSDQSLVREGRDEKRIKKNSYQSLVKEREAKKHNKIHSDQSVKKGGDKKRRMHSDQSLVREGKDIKRVKKNSYQSLVKEREAKKHNKIHSDQSVKKGGDKKRRMHSDQSLVREGRYEKRIKKSSYQTLVKEREAKKHNKIHCNESFKKREDKKRKMHSDQSLVREGKDKKRIKKNSDQSVKKGEDRKHNKTRSDQPLVKEGEDRKHDKKTSDQMKKGEHECNDCELVFKYRKNLENHTKIHSEETIVIEGAHTCDICKRIFEYRKNYENHIKIHSKRSPRDGEDEKLDKIVSDQMKKATITRNNIIMQSNAMDAFSKKRVIIIGRGLTKMCRLCLKSGAGIFSFKRKKKLTGMITKCTSVMITHYDELPKYICRLCYNRLFQTHRFQKQALNSDKILRDYLQQIRFADNQTGLESQQSSTEHNSESSTKQNAVNAGPGDFSQILGNIKQEIGSETVWLNDEQQRHYEGHEIIKEEKTETNRNNIKQESTTTQGVLAETSDSYLKKTRHVSKIRIETSYSLPSGCNSKYEDLDGDVDVNHQNCDYLGYSDVDEETTNVFTEESQLNLNNRGERIYIPEDSENASAPQEVWNNDVKSHLCDVCNKCFSNKSNLKRHKLVHTDEKPYQCEFCDKSFIQSDDLKNHELKHTGETPYQCDVCNKGFIRLGDMKSHYTIHTGEKPYQCGVCEKHFKRRSDLKRHSVLHTGDKPYRCHVCSKSFSFLPVLKRHLFLHTGEKPFQCDVCNKRFADWSNMKKHKLLHTGERPFLCDVCNKSFTRHFDLKRHCILHSNEKPFQCDVCVKGFTTFASLKRHRLSHVI

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-