Basic Information

Gene Symbol
-
Assembly
GCA_905147135.1
Location
LR990066.1:6229613-6248174[+]
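The Location string above can be split into its parts with a short parser. This is a minimal sketch: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Matches strings shaped like "SEQID:START-END[STRAND]".
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)

def parse_location(text):
    """Split a location string into sequence ID, coordinates and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    return {
        "seqid": match.group("seqid"),
        "start": start,
        "end": end,
        "strand": match.group("strand"),
        # Assumption: 1-based inclusive coordinates.
        "length": end - start + 1,
    }

loc = parse_location("LR990066.1:6229613-6248174[+]")
print(loc["seqid"], loc["strand"], loc["length"])
```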

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers (or ranges in parentheses) give the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
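The pattern in the description can be written as a regular expression for a rough scan of a protein sequence. This is only a sketch: it treats both X and # as "any residue" (an assumption, since the description does not list which residues are allowed at # positions), the function name `find_c2h2_candidates` is hypothetical, and a regex match is a much weaker test than the profile HMM search reported under Hmmscan Out.

```python
import re

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# Assumption: '#' and 'X' are both treated as any residue ('.').
C2H2_PATTERN = re.compile(
    r"."        # '#'
    r"."        # X
    r"C"        # first cysteine
    r".{1,5}"   # X(1-5)
    r"C"        # second cysteine
    r".{3}"     # X3
    r"."        # '#'
    r".{5}"     # X5
    r"."        # '#'
    r".{2}"     # X2
    r"H"        # first histidine
    r".{3,6}"   # X(3-6)
    r"[HC]"     # final His or Cys
)

def find_c2h2_candidates(protein):
    """Return (0-based start, matched text) for non-overlapping matches."""
    sequence = protein.rstrip("*")  # drop a trailing stop symbol if present
    return [(m.start(), m.group()) for m in C2H2_PATTERN.finditer(sequence)]

# A short fragment taken from the Protein Sequence of this record.
fragment = "GHKCTYCGKLFTRNSFMRDHIRRTHLKEKN"
print(find_c2h2_candidates(fragment))
# prints [(1, 'HKCTYCGKLFTRNSFMRDHIRRTH')]
```

Because `finditer` reports non-overlapping matches only, closely spaced fingers could be under-counted; the hmmscan coordinates remain the better guide to domain boundaries.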
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 40 0.23 21 6.9 0.5 1 23 238 260 238 260 0.95
2 40 0.0083 0.76 11.4 0.2 2 23 289 311 288 311 0.93
3 40 0.0053 0.48 12.0 0.3 2 23 332 354 331 354 0.96
4 40 0.0018 0.16 13.5 2.0 2 23 360 382 359 382 0.95
5 40 0.043 3.9 9.1 1.1 1 23 387 410 387 410 0.96
6 40 4.1 3.7e+02 2.9 0.1 2 23 413 435 412 435 0.93
7 40 7.3 6.6e+02 2.1 3.6 2 21 442 461 441 462 0.93
8 40 0.0023 0.21 13.2 0.4 1 23 521 544 521 544 0.93
9 40 3.2 2.9e+02 3.2 0.1 3 23 573 594 571 594 0.94
10 40 0.09 8.2 8.1 1.0 2 23 617 640 616 640 0.95
11 40 0.0015 0.14 13.7 0.6 3 20 647 664 645 668 0.90
12 40 0.087 7.9 8.2 1.0 1 23 673 696 673 696 0.97
13 40 0.0017 0.15 13.6 0.3 2 23 702 724 702 724 0.92
14 40 0.00012 0.011 17.2 4.9 1 23 728 751 728 751 0.97
15 40 0.00069 0.063 14.8 0.6 2 23 758 779 757 779 0.96
16 40 0.49 45 5.8 1.6 1 23 785 807 785 807 0.95
17 40 0.1 9.5 7.9 5.9 1 23 844 866 844 866 0.98
18 40 0.00044 0.04 15.4 0.4 1 21 958 978 958 979 0.95
19 40 0.013 1.2 10.7 0.5 1 23 1024 1047 1024 1047 0.91
20 40 0.65 59 5.4 0.2 2 23 1075 1097 1074 1097 0.96
21 40 0.031 2.8 9.6 1.8 3 23 1150 1171 1148 1171 0.96
22 40 0.054 4.9 8.8 0.2 2 23 1177 1199 1176 1199 0.95
23 40 0.024 2.2 9.9 1.6 1 23 1204 1227 1204 1227 0.93
24 40 0.00012 0.011 17.2 3.3 1 23 1231 1254 1231 1254 0.97
25 40 0.0054 0.49 12.0 0.5 2 23 1261 1282 1261 1282 0.96
26 40 0.034 3.1 9.5 2.0 1 23 1288 1310 1288 1310 0.96
27 40 0.016 1.4 10.5 2.6 1 23 1351 1373 1351 1373 0.97
28 40 0.06 5.5 8.7 0.1 2 23 1401 1423 1400 1423 0.95
29 40 0.008 0.73 11.4 0.1 2 23 1444 1466 1443 1466 0.95
30 40 0.001 0.093 14.2 2.8 1 23 1471 1493 1471 1494 0.93
31 40 8.5e-05 0.0077 17.7 0.7 1 23 1498 1521 1498 1521 0.91
32 40 0.0088 0.81 11.3 2.3 2 23 1524 1546 1523 1546 0.96
33 40 0.017 1.5 10.4 2.5 1 23 1622 1645 1622 1645 0.94
34 40 8.5e-05 0.0078 17.6 0.3 1 23 1717 1740 1717 1740 0.96
35 40 2.5e-05 0.0023 19.3 0.8 3 23 1747 1767 1745 1768 0.94
36 40 0.065 5.9 8.6 2.1 1 23 1773 1796 1773 1796 0.96
37 40 9.9e-05 0.009 17.4 0.2 2 23 1802 1824 1802 1824 0.96
38 40 0.15 14 7.4 6.7 1 23 1828 1851 1828 1851 0.97
39 40 0.16 15 7.3 0.2 2 23 1858 1879 1857 1879 0.96
40 40 0.25 23 6.7 2.6 1 23 1885 1907 1885 1907 0.96
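The rows above are whitespace-delimited with thirteen values each, so they can be parsed into records and filtered. The sketch below uses three rows copied from the table; the field names and the i-Evalue cutoff of 0.05 are illustrative assumptions, not settings taken from this record.

```python
# Field names are assumptions chosen to mirror the thirteen table columns.
FIELDS = [
    "index", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_FIELDS = {
    "index", "of", "hmm_from", "hmm_to",
    "ali_from", "ali_to", "env_from", "env_to",
}

def parse_domain_row(line):
    """Turn one whitespace-delimited table row into a dict of typed values."""
    values = line.split()
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} columns, got {len(values)}")
    return {
        name: int(value) if name in INT_FIELDS else float(value)
        for name, value in zip(FIELDS, values)
    }

# Three rows copied verbatim from the table (domains 1, 14 and 35).
rows = [
    "1 40 0.23 21 6.9 0.5 1 23 238 260 238 260 0.95",
    "14 40 0.00012 0.011 17.2 4.9 1 23 728 751 728 751 0.97",
    "35 40 2.5e-05 0.0023 19.3 0.8 3 23 1747 1767 1745 1768 0.94",
]
domains = [parse_domain_row(row) for row in rows]

# Hypothetical cutoff: keep domains with an i-Evalue of at most 0.05.
confident = [d for d in domains if d["i_evalue"] <= 0.05]
for d in confident:
    print(d["index"], d["ali_from"], d["ali_to"], d["i_evalue"])
```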

Sequence Information

Coding Sequence
ATGGAATTCGACGATTTAATTGTGAAGGAGAGCCCCGGCGTTTGTCGGTGCTGTCTGTCCGAGGGATGTTACAAGGACTTGGCGACAGAATACACTTGGATGAATGATACCGAAGTTTACGCCGATATGCTATTGGAATGCTTTGATATTAgtATATCCCAACACATGGACGGCCCAAACGGTCCGAATAGGCTAATATGCGAAGTGTGTATAACAAGGCTACGAGATGCTTGCAACTTCAAGAAACAAGTTGTGCAGACGGAGCAACGGTTTGTGGACATGCTTGGCCGCGGAGAGTTTGGACAGAAAGTGGTGATATACCAGCCTTTAAAGGCGGAGGCGCCGCCGGAACCCGCGCCCGACGACTGCGAGATCGAATACCTGGACGAGGATATGCAGTTTGATGATGCTGATGACGAACCGCTCAGGCCACCAACGGAATCCGTGCTGGTGGAGCCATCTTCGTCGGAGGACATAACCGTATCAGCACTCAAGGTGAAAGGCAAgcgaggccggcccaagaaaactgataaacctgAGAAGAAACCGAGGACTCTAGTGGACGAAAAGCCGAAATCCTCCAAGACTGCCGCCAAAGGTGTTAAAAAGCGGAAATTGAAGGGCGCAGAGTCGTCCTCTATGCGCCGCAGGAGGAACCTTGAGATAATCTTCAATAACACCTCGATTATTCCTTTCAAATGGCGGGGGAAGTACCTCTGCTTCTACTGTAGCAAAAATATAGCGGAGTACGCCGAGCTGAAGAGGCATACGAAAGCCCACGGCTCTTGCTCCACCAAGGATCACTCGTTGAAGGTCCTGAAGGGTGGTCAGAACATGGAGATTAAAATAGACGTATCGGATATTGTGTGCGAGGATTGTGGTGAAACGTTTCCGAGCTTCGATGAGGTCGTCAACCATCTGTGCGGGAAACACAAAGTAGGGTATGATAAGGGCGTCGAAATGGCTATAGAGGAGTATCGGCTCGCTGATCTGAGCTGCTTGGTTTGCGGCGAGAAATTCACGTACTTCGGCTATCTAGTCTCTCATGTGAACACGAAACATCCGAAGAAGTGTCTGATTTGCGAAAAATGTGAACAGAAGTTCAACAAGAAGCGTGATCTGTTTTCTCACATGAAAAACTATCACAGAGAAGGAGGGTATCAGTGCGAGTTGTGTGCACAAAGTTTCAGTTCCCTGAACATACTTAGGAAGCATCGGAACAATAGGCATTTGACCAGGTGCGGTATATGCCAGCTGCAATTGCCGTCAGCTGCATTGAAACAGAAGCACATGGAGTTAGAACACCCTGACGATGGCTCTTTGCGGTGTGACAACTGTTGCAAGGAGTTCCATACGAAGCAGGGCTTAAAAATGCACGCTCGAAAGTGCAAAGGCGAGGATTTGCTCGAAATCTTCAAAAAGGAAGACGTGGACGCCATGGATTTGGACCAGAACTACGAGGATACGGTTAAGAGGCCGAGCGTCAAGCAGATTAGGCAAAACATCGTCATTGTGATCAATATGTCCACAGCGATACCGTTCAACTTCTACAAGAACAAATTCAATTGCTTCTATTGCTCCAAAGATTTCCCGGATTCCGATTTGCTTCGGGAGCACACGGTGATGGAGCACCCGGTTTGTGATGTCAAAGAGAAATGTATTCGAAAATGCAGGGAGTCAGTTGCGTGCGTCAAAATCGATGTCTCCTCCTTAGCTTGCAAGTTGTGCTTCGAGTCCATGCCGGACTTGGACAGTTTAATAGACCATTTGATATCAAAACATGAGGCGGACTATGACAAATCTATTACGACTTGCTTGCAACCCTACAGGTTGATAAAAGACCACATGGTGTGTCCACACTGTCCGAGTGAGGTCTTCAGATTCTTTGGCACCCTTTTGAAGCATATGAATAATAAACATACGGATAATCATATAATTTGCGTGTATTGCGGGCAGTCTTTCAGACGCGATCAGAACCTGAGGGTCCACATATGGCGGCA
TCACAGAGATGGAAGATTCAAGTGTAATATATGTGACGCCGAGTGCAACATCCCTTCCCGTCTATATATGCATATGGCGAAGTTCCACGGTGTGAAAGCAGCCAAGTGTCCCAAATGTCCGGAGACGTTTGCGACGCAGTATTTGCGGCAAAAGCATTTGATCGACGTCCACGAGTCCGGTCACAAGTGTACGTACTGTGGGAAGCTGTTCACGAGGAACTCTTTTATGCGTGACCATATCAGACGCACGCATTTGAAGGAGAAGAATGTGGAGTGTTCCATATGTAACTTGAAGTTCTTCAACAATATACTTCTTAGGAGGCATATGGTGAAGCATAGTGGTGAAAAAAACTTTCACTGTGACGTGTGTGGGGAAAGGTTCTTGTGGAGGAAGAGCCTCAGGAGCCATATGGTCAGGCACGGGAATGAAAAACAGCACATTGTCGATGTGGAAAGCGCGCGCCGTCGGCGCACTCTCCGAACCCTATTTGACAATACCACTGTGCTCCCGTTCGTCTGGGACGACGGGTTTAAGTGTGCGTATTGCGAGAGACGCTTGCACAGCCACAAGGACTTGCGGAAGCACACGAGGGCACACGGGAAGTGCCCGGACCTCAAAACGTGCGTGAAAAAACTAGACCCTCGCAAGAAACTAAAGATCGACGTCTCGGAGCTCATTTGCGAACTGTGCGACGAATCGAGGATCAACAGTTTCGCGGAGCTGACCACCCACCTCGCTACGCACGGACTCGAATGCGACGCGACCATCGCAGAGGCGGTAGAACAGTACATTCTGATAGATAACAGTTGCGCGGTGTGCTACGACATATTCCCGGATTTCGCGACGCTGCTAGAACACTCCAGGGGTCATTTCGAATGCCAGGGTTGCTCTAAAACATTCAGCGACTTGACTTTCTTGGAGAGACACGTCAAGTTGTGTTCTGTAGTCGGTACAGATAGGAAGAGTGATGAGTTGAAGAAGAAAGCCGACTTGCGCATGCTACGGGAGAACATAGCTGCGTTTCTGAATAGTAGTACCGCTATGCCGTTCAAATTCTTCATGCAGAGGTTTCGCTGCTTCTATTGCAGCAAGGACTTCAGCGATTTCGAACCGCTGAGGGTGCACACTGCCGCGGAGCACACCTACTGCGACCCAAAGGGGAAACTCATGAAATCCCTCAAGGGAAACAACATCAACATCAAAATTGATATCTATTCCATGACTTGTAAGATTTGTACGCAACCCCTCAGTAGTTTCCAAGCCATGATTAGTCACCTGCAGTCGGAGCACAAAGCTAAATACATCCAGGCGTTCTGTGACGTCGTCCAACCGTTTAAGCTGGTTAAAGACAAGATGCCTTGTCTATTCTGTACAGAAGTTAGTTATAGGTACTTCTCTAAACTCCTAGAGCACATGAACGAAATGCACGCGGTGAAGAAGATCATATGCTGCACTTGTGGCTCTTCCTTCGCCAAGGACGCGAACTACAGGTCTCATATGAAGCGGTATCACAATCCGACCGGTTCCAAATGCCGAGACTGCGATGTGGACTTTGGCAATGCTGCCAAATTGTCGAACCATCGCGCCAAGGTCCACGGAGAGAAGATGTTCAAATGCCTCAAATGTCAAGAGACTTTCGGGACCCAGTACAAGAGGCAGAAGCATTTGATCGACGTGCATGGAACGGGGCACCCTTGCTCATACTGCGGGAAGCTGTTCACGAGGAACTCCTTCATGAAAGACCATATACGGAGGACGCATTTGAAAGAGAAGAACGCAGAGTGTTCGGTCTGCAAAGAGAAGTTCTTCGACAATATAATGCTGAAGGTGCATATGGTGAAACACGTTGGCGAGAGGAATTTCCACTGTGACGTCTGTGGGAAGACCTTCCTGTGGAAGAAGAATCTTAGAGGGCACATGTCCTCGCATAAAAGACTAAATTTCTTTTCAGATGATAACCCGGGAAAGGGAGCCAATAAGCTCAGAAGGCGGAACCTGC
AAATTTTGTTCAACAACACCACGGTGATACCGTTTAAGTGGCGAGGGAAATATATGTGCTTTTACTGCGGACGGCATGTTGAGAAATACGAGGAGCTCCGGAAACACACGAAGGCGCACGGAAGTTGCGGGGACAAGGACCGAGCTCTCAGGCTCGTGAAGGCCGGCGACGCGGAGGTGAAGATCGACGTCTCGGACATGACTTGCGAGCTCTGCAACGAGTCCTTTCCGAATTTCGAGGAGATAATCTCCCATTTATCGGTGAAGCACGAGCTTCCGTACAGCAAAGAAGTGGAGCTGACCATCATGACGTATCGGCTCGTCGACCTGAAATGCTTGCAATGCGGGGAGGCGTTCGAATTCTTGGGGAAACTCATCATGCACATGAACAACGTACACCCAGACAACTGCTTCCTATGCGACAGTTGCGACCAAAAGTTCAACAAGAAACGCGATCTCCAAGCTCACGTGAGATCGCACCACAAGAATGAGTACAGTTGCCTGAAGTGCCCGAAGACTTTTAGTTCGTACTCGGCCCTTCAAGCCCATAGAGGGAATTCGCACAGCTCCATGTGTAACTTGTGCTTTAAGACTTTCTCATCAGCGACGAAGAGGTTAAAACATATGAATAAAGAGCACGAATTCGAGGGAATACTCCAATGTGCTCTCTGCCTCAGCATCCTCAACACTAAGCAATCTTTCCTCCGGCATGCGGCGAAGTGTAACCTGACCAGTGAAGAGGCCAAGCCATTCGACACTCAAGAGGAGAAAACCAACAGGGCCAGCGTGAAACAGCTCAGGAGCAACATAGCCTGTATTCTGAACATGTCGACAGCGATCCCGTTCAAACACTTCATGAACAGGTTCCGATGCTTCTATTGCCCAAAGGATTTCACGGAATGCGACGATATGAGACAGCACACAGTGATGGAGCATCCCCTCTGTGATGTCAAGTTGAAGTGCATGAGGTTGAGGAACCGGAGGGAGGAATGTATCAAGATCGATACGGCTTCGCTGTCGTGCAAGATGTGTTTCGAATCGATGCCCGATCTGGACACTCTGGTCGGGCATCTGATATCCGAGCACAAGGCTCGGTACGACAAGTCCATAGCGCACGTGTTCCAGCCGTTCAAACTCGCGAAGAACAGCTTTCCGTGCCCCTATTGCGGGGAAGTGTATAGTTACTTCAGCACTCTGCTCAAACATGTCAGCAAGGATCACACGGACAATCGCATCATATGCTTGTATTGCGGCAAGTCGTTCAGGACTGACCCAAATCTGAGGGCGCATATCACACGTCACCACAAGGCTGCCAAATACAAATGCACGAGTTGCGAGTCAGAGTTTACGTCCAGCAACGACTTGCATCTCCATCTGGGGACGAAACACGGCGTCAAGTTCGCTGAGTGTCCGCAGTGCCCTGAGAAATTCACCTCCCAATACTCTGTCCAGAGACATCTAATCACTGTCCACGCGACCGGGCACAAGTGCTCCTTTTGCGAGAAACTGTTTACAAGGCATTCCTTCATGATGAGCCATGTGAGACGGTTGCACCTCAAAGAGAAGAATGTGGAGTGTGCAGTCTGCTTTGAGAGGTTCTTCGACGCTCAGCGCCTCAAGATGCATATGGTGAAGCATATTGGGGAGAGGAATTTTCATTGCGACTTTTGCGGCAAGAAGTTCCTCTGGAAGAAGAACTTGAGGGGGCACATGGCCTCGCACATCAAGCACGGGCAAGCGCAACTGCAGGATTGA
Protein Sequence
MEFDDLIVKESPGVCRCCLSEGCYKDLATEYTWMNDTEVYADMLLECFDISISQHMDGPNGPNRLICEVCITRLRDACNFKKQVVQTEQRFVDMLGRGEFGQKVVIYQPLKAEAPPEPAPDDCEIEYLDEDMQFDDADDEPLRPPTESVLVEPSSSEDITVSALKVKGKRGRPKKTDKPEKKPRTLVDEKPKSSKTAAKGVKKRKLKGAESSSMRRRRNLEIIFNNTSIIPFKWRGKYLCFYCSKNIAEYAELKRHTKAHGSCSTKDHSLKVLKGGQNMEIKIDVSDIVCEDCGETFPSFDEVVNHLCGKHKVGYDKGVEMAIEEYRLADLSCLVCGEKFTYFGYLVSHVNTKHPKKCLICEKCEQKFNKKRDLFSHMKNYHREGGYQCELCAQSFSSLNILRKHRNNRHLTRCGICQLQLPSAALKQKHMELEHPDDGSLRCDNCCKEFHTKQGLKMHARKCKGEDLLEIFKKEDVDAMDLDQNYEDTVKRPSVKQIRQNIVIVINMSTAIPFNFYKNKFNCFYCSKDFPDSDLLREHTVMEHPVCDVKEKCIRKCRESVACVKIDVSSLACKLCFESMPDLDSLIDHLISKHEADYDKSITTCLQPYRLIKDHMVCPHCPSEVFRFFGTLLKHMNNKHTDNHIICVYCGQSFRRDQNLRVHIWRHHRDGRFKCNICDAECNIPSRLYMHMAKFHGVKAAKCPKCPETFATQYLRQKHLIDVHESGHKCTYCGKLFTRNSFMRDHIRRTHLKEKNVECSICNLKFFNNILLRRHMVKHSGEKNFHCDVCGERFLWRKSLRSHMVRHGNEKQHIVDVESARRRRTLRTLFDNTTVLPFVWDDGFKCAYCERRLHSHKDLRKHTRAHGKCPDLKTCVKKLDPRKKLKIDVSELICELCDESRINSFAELTTHLATHGLECDATIAEAVEQYILIDNSCAVCYDIFPDFATLLEHSRGHFECQGCSKTFSDLTFLERHVKLCSVVGTDRKSDELKKKADLRMLRENIAAFLNSSTAMPFKFFMQRFRCFYCSKDFSDFEPLRVHTAAEHTYCDPKGKLMKSLKGNNINIKIDIYSMTCKICTQPLSSFQAMISHLQSEHKAKYIQAFCDVVQPFKLVKDKMPCLFCTEVSYRYFSKLLEHMNEMHAVKKIICCTCGSSFAKDANYRSHMKRYHNPTGSKCRDCDVDFGNAAKLSNHRAKVHGEKMFKCLKCQETFGTQYKRQKHLIDVHGTGHPCSYCGKLFTRNSFMKDHIRRTHLKEKNAECSVCKEKFFDNIMLKVHMVKHVGERNFHCDVCGKTFLWKKNLRGHMSSHKRLNFFSDDNPGKGANKLRRRNLQILFNNTTVIPFKWRGKYMCFYCGRHVEKYEELRKHTKAHGSCGDKDRALRLVKAGDAEVKIDVSDMTCELCNESFPNFEEIISHLSVKHELPYSKEVELTIMTYRLVDLKCLQCGEAFEFLGKLIMHMNNVHPDNCFLCDSCDQKFNKKRDLQAHVRSHHKNEYSCLKCPKTFSSYSALQAHRGNSHSSMCNLCFKTFSSATKRLKHMNKEHEFEGILQCALCLSILNTKQSFLRHAAKCNLTSEEAKPFDTQEEKTNRASVKQLRSNIACILNMSTAIPFKHFMNRFRCFYCPKDFTECDDMRQHTVMEHPLCDVKLKCMRLRNRREECIKIDTASLSCKMCFESMPDLDTLVGHLISEHKARYDKSIAHVFQPFKLAKNSFPCPYCGEVYSYFSTLLKHVSKDHTDNRIICLYCGKSFRTDPNLRAHITRHHKAAKYKCTSCESEFTSSNDLHLHLGTKHGVKFAECPQCPEKFTSQYSVQRHLITVHATGHKCSFCEKLFTRHSFMMSHVRRLHLKEKNVECAVCFERFFDAQRLKMHMVKHIGERNFHCDFCGKKFLWKKNLRGHMASHIKHGQAQLQD*
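The relationship between the two sequences above can be spot-checked by translating the coding sequence. The sketch below assumes the standard genetic code (the record does not name a translation table) and uppercases the input, since the coding sequence contains lower-case stretches; the function name `translate` is hypothetical. It is applied only to the first and last few codons of this record's coding sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the record does not say which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()  # lower-case (possibly soft-masked) bases are accepted
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    # Codons with ambiguous bases are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )

print(translate("ATGGAATTCGACGATTTAATTGTG"))  # first 8 codons; prints MEFDDLIV
print(translate("CAGGATTGA"))                 # last 3 codons; prints QD*
```

Both results agree with the start (MEFDDLIV) and end (QD*) of the protein sequence shown above.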

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-