Basic Information

Gene Symbol
Zfa_1
Assembly
GCA_949318235.1
Location
OX439127.1:40864790-40881616[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 41 0.63 44 5.1 1.2 1 23 357 380 357 380 0.95
2 41 0.14 9.6 7.2 7.4 2 23 391 412 390 412 0.96
3 41 3.7 2.6e+02 2.7 1.6 1 23 416 438 416 438 0.96
4 41 0.18 13 6.8 0.5 1 19 447 465 447 469 0.91
5 41 1.6 1.1e+02 3.9 0.2 2 23 479 500 478 500 0.95
6 41 0.0033 0.23 12.3 0.1 1 23 508 531 508 531 0.95
7 41 0.032 2.2 9.2 0.5 1 23 576 599 576 599 0.94
8 41 0.11 7.7 7.5 1.7 1 23 638 661 638 661 0.92
9 41 0.0013 0.095 13.5 0.6 2 23 681 702 680 702 0.96
10 41 0.015 1.1 10.2 0.7 1 23 708 731 708 731 0.97
11 41 3.6e-05 0.0026 18.5 1.4 1 23 751 773 751 773 0.98
12 41 0.0051 0.36 11.7 1.0 1 23 779 802 779 802 0.96
13 41 0.0051 0.36 11.7 1.6 1 23 1039 1061 1039 1061 0.98
14 41 0.011 0.78 10.7 1.3 1 23 1067 1090 1067 1090 0.95
15 41 0.029 2 9.4 1.7 1 23 1161 1184 1161 1184 0.93
16 41 0.011 0.79 10.6 1.1 2 23 1200 1221 1199 1221 0.96
17 41 0.011 0.8 10.6 1.2 1 23 1227 1250 1227 1250 0.98
18 41 2.2 1.5e+02 3.5 6.7 1 23 1317 1339 1317 1340 0.93
19 41 0.0017 0.12 13.3 0.4 1 23 1358 1380 1358 1380 0.98
20 41 0.77 54 4.9 0.8 1 23 1386 1409 1386 1409 0.87
21 41 0.1 7.2 7.6 0.9 1 23 1432 1454 1432 1454 0.97
22 41 0.00085 0.06 14.2 2.7 1 23 1458 1480 1458 1481 0.96
23 41 5.8 4.1e+02 2.1 3.3 1 23 1532 1555 1532 1555 0.93
24 41 0.0039 0.28 12.1 0.8 2 23 1573 1594 1572 1594 0.96
25 41 0.02 1.4 9.9 1.0 1 23 1600 1623 1600 1623 0.97
26 41 0.065 4.6 8.2 0.3 2 23 1653 1674 1652 1674 0.96
27 41 0.0093 0.66 10.9 0.3 1 23 1678 1701 1678 1701 0.97
28 41 0.076 5.4 8.0 3.4 1 23 1736 1758 1736 1758 0.98
29 41 0.00072 0.051 14.4 2.8 1 23 1764 1787 1764 1787 0.98
30 41 0.096 6.8 7.7 0.4 2 23 1816 1838 1815 1838 0.94
31 41 0.0016 0.11 13.3 0.4 3 23 1850 1871 1850 1871 0.98
32 41 0.011 0.75 10.7 0.1 2 20 1877 1895 1876 1899 0.92
33 41 0.13 9 7.3 0.7 2 23 1994 2016 1993 2016 0.96
34 41 6e-06 0.00043 20.9 0.8 1 23 2021 2043 2021 2043 0.97
35 41 0.048 3.4 8.7 8.7 1 21 2048 2068 2048 2069 0.96
36 41 0.17 12 6.9 2.0 1 23 2076 2099 2076 2099 0.96
37 41 0.1 7.3 7.6 4.9 2 23 2107 2129 2106 2129 0.96
38 41 7.4e-05 0.0053 17.5 0.9 1 23 2134 2156 2134 2156 0.96
39 41 0.02 1.4 9.8 2.0 2 23 2162 2184 2161 2184 0.96
40 41 3.4 2.4e+02 2.8 1.7 1 23 2189 2211 2189 2211 0.87
41 41 0.00097 0.069 14.0 0.4 1 23 2218 2241 2218 2241 0.97

Sequence Information

Coding Sequence
ATGGAATGCGCCTGGGCAAGCCCACACAATGAACAAGATTGTAAGTTTGAGAGTGTGATGCTCCAAGAGCAGGCCAGCGAGGAAATTGATCCAGAACCCATGCAGACCTTGACTGAGGAGGAGTATGCAATAATATTAGAAGGTACTGAGATCAAAGAAGAGAGAATCCCAGAAGATAGCAGTGATGAGGAACCATTCAACTTAGAGGAGATTGCTGGTGAGAAGCTGATCCCTCAAGTTGCAGCCAGCTTTGGCCGTGACCCTCTCAAGTCAGGTAAAAACTTACCTTTACTTTCTGCGTACTCCCTCGCAACACCATGGTTCTCCCTCTACCTCGCCTCTTCTTTGTGTGGGGATGAATTGAAGGCAGACGACAACTGGAACCCACAGGATGCAGTCAACCTTGCAGCTCCTCAAGTACAAGAGGCAGACGACAACTGGAACCCACAGGATGCAGTCAACCTTGCAGCTCCTCAAGTACAAGAGGCAGACGACAACTGGAACCCACAGGATGCAGTCAACCTTGCAGCTCCTCAAGTACAAGAGGCAGACGACAACTGGAACCCACAGGATGCAGTCAACCTTGCAGCTCCTCAAGTACAAGAGGCAGACGACAACTGGAACCCACAGGATGCAGTCAACCTTGCAGCTCCTCAAGTACAAGAGGCAGACGACAACTGGAACCCACAGGATGCAGTCAACCTTGCAGCTCCTCAAGTACAAGGTTATTCAGTGTACACATGTGAACATTGTTCTACATGTTTCCTCAACCAGGAGACCCACCAAAGCCATGTGTGCACAAACACATACACTGTAACCAACGTACAACCAAGTACATCGTCATTACTATTTACAAGGCTATTTATTAATCCTTTCTATTTGAAAAATCACATCAAAGTATGTCAAATGCAGCCAAATGTTAAAAGATTGGTTGTTCGAAGGCAAGGAATAGGTTTTCAACAAACGAGTCAAGCCCCAGCAATACAAAGGCAAGGAATACGTTTTCAACAACCAAAGAGTCAAGCCCCAGCAATACAACACccaagagaaaagaaaaaagaaaaaaagtacacaTGTAAGATCTGTAACATAAAGTTGAGCTGCAAAACAGAAGCAGAGAAACACATGATTTCTGCTCATTGTCAGTCATCAGACAGTGACGAGGAAATATCATGCGACAGGTGTCACTTTACTTGTTCTAGTCAACGTCACTTGCaagaacatttaaaattacacaaaactAAATTTTCCTGCACTAGGTgtgattttatttgtttctcGATTGCCGACTTAAGTGATCACAATAAAATACACGAGTTGGCATGCAGTGAACCCAAATTTTCTTGTGACATCTGTGACTTTATCTGCGACTCCAGGGATGAACTTGTCTCTCATGATGCAACACATATCAACTCTCCTCAAGATTCCACATTGATGTGCAACACTTGTGACTTCGTTGCCGACTCGATGATTGAGCTACAACAACATAAGAAACTGCATCTTGCTGAGAGCTTGTTTCAATACCCTTGTCAATTTTGTGGTGTCAGCTTCATCAATGAAAATGAATTGAAAGGACACGTGAAAGAAAATCACTCTAATGACAACCAGCTTGGTCTACATTACTCAAGCACTAGGGACTCTGATGTTGATGACACTGATTCTGATAATTCAGATGACTCGGTTGAAGAATATCAACCTGATTCGACCGAGAGTGATGATAATTCTTTCGCGTGTTACTTGTGTGACAGAGACTTTTCTTCAATATCCTTCTTACGTCTACATGAAAGTAAAGTCCACTCCACGACTACCAGAGCTAAACTACACAAACCACAATTTGGAAAAGAAGATCCCGATTTCTCTGAATCAATCCAAAAATGCTTGACGGAACGACAGAGATTGCATAATGATATATTTCTCTGTAGTCACTGTAACATTGAATTCATTTCTAGCAACCTCTTGTTCTATCATGCAGAAAAGTTCCATGCTGATATCAGAAtgaagttacaaaataaaagaaagctGGGTAGATCTGGTCCCAATAAATGTGATGAATGCCCAAGTAGCTTCAAATCTGCCTCGTATCTCAGACTTCATAAGTTGTTGCATAGAGGTATCAAAGCTTTTCCATGCCGTTACTGTGGTAAACAGTATGCATTTAATAATGCCGTGTTAAACCATTACAAGAAAGTGCATCCTGAGAGCGGAGGAGTAAAGAAttctaaaagaaagaaaGCCAGGTCACGTGGGTCGTTTCCATGTGAGGAATGTGGTGCAACTTGCGAAAGTAAGAAACAATTACGACAGCACATGATGCGTCACAGAGGAATTAAGGCCTATCACTGCAATTACTGCCCCTGGAAAACTACTTCCATATCGGGACTCAAAACACATGTCATACGGCTTCATGTTCAAGAATATAAACAATTTCAACAAGATAAGGCAAAAAAGAAGATGGAACAAATTTCTTTGTCTCGTCATGAAGAATCAAGTGATGATGATTCTGATGATATAGAAAGTAGTGAGTCTAATGATGACAAGAAATCTTGCGCTGAAaagaatgaaacaaaaaatgcaGGATATCGTTCTGAAAGCAATTTGCAAACAGAGCCACTTTCTGGACAGCAGAATTTGCAGAAACGAAAACATGAAGAGGAAAATGAAGATGTGctgaaagaaaaacaaagaaaattaaaagcaagTACTCTAGATTTCTATTCAGGAGGACTGGAATTACAGAATTGTGTCCCTGTAAGAGATTCAAATGAGTCTGCAGTAAGAAGTGGATCGCCAAGACCTTTGATGATTGACGAGAGCATTGAAGAAGTGCTTACAAATGACAAGGAAAATTCAAGCGGTGATGAAAGGGATGTTTACGTACACAATGACAAAGTGAGTGatttaaataatgaagaaaCTAAATCatctgaagaagaaaaagacagTGAAAGAAATTCTCAAACATTAACGCAAGCCCTTAAGGGATCACAGTGGTTAAAATTCTTACAATGCAATCATTGCCTGGTAAAGTTGCCCAGCATTGAAAAGGTTTTTGCTCATGGTCAGCTAAATCATAAAGACATATTTGTCAAATTCAAAGCTACATCAACTGGCTCGTTCCAATGTGAAGAGTGTAGCGCCAGTTGTGACACCATGAAACAATTACAAGAACATTTGATGCGGCACAGAGGCATCAAGCCGTATCTGTGCCGTTACTGCACCTGGAGAACTACAAGTGCAAATGGACTCAAAAGACATGTCAAACGTACACATGAACTAGAGCACTACAAttttttagaagcaatagagCCGGGCAGAACAAAGCCAACAATTAGTGTTGAATATAGCAATGTTGAAAACGTCAAACAACAGACGATTGGCACATtagaaaaaatacaagaaaaaaagcaAGATTCACAGCAAACAGAAGTTCAGTTTCAGGACTATGAGATTGATCATGTACAATCTGTGTGGTATGATTATTATGAATGTAATCATTGCCTTGAGAGGTTCCCCtctaatacaaaattatttattcatggCCAATTAAAACATAAAGACATCTTCATCAATTTCAAAAGCAGAGCAACGGGACCACTTAAATGTGATGAATGTCCTGCCTCTTGTGAAGACAGGGCTCAATTAAGAGACCATGTAATGCATCATCGTGGTATTAAAGCATACTACTGTAATTACTGCACCAAAAAAGGTTACACTTCATCTACTATAAAAATACATGTCATGACTGTGCATAAAGAAGAGTATGTGAAATTTCAGAAATCGAAAGAGGAAAAGATAAAGTTGAAAAAACTTTCTGAAATTAATACTAAAGATGTACAGTTAAAGCAAAACATTATAGCAAGTTTGACAAATGATACGAAGAAAATGGCTTTGAAAGAAGGTGCAGAAGTTAccccaaaatattttaaaagctcGAATGATATTGATGAATATCTCTGCAGTCATTGTAATATGAAATTCCAATCACATATCATGTTGTTTGAACATGCACAAAAACATCATAGTGATGTGCAGTGTAAAGTAGTAACAAGTAGTGATGCCATGGGAGGAGATTTTCAGTGCAATGAGTGCTCTGCTACCTTTGCTAAAATTAGTTTTCTTCGCTCTCATATTTTGCTTCATCGTGGAGTAAAAGCATATGTATGTCGATATTGTGGCCAGACGTACGCTTGTCGTTTGGCCTTGGAAATGCATTACCAAGCTGACCATCCTAATGGAGAAATGATTGATCACACTAAGTTACTAGTGAGAACTGCAAATGAAGAACGTGGCAATTTTAAGTGTGATGTTTGCAAACTGGGCTTTAAAACGATTAAAATTTTGGCTCGTCATAAGTTAAGACACATAGAAAATTACAGGTGTAAATATTGCAACCAAGGATTTGCTCGTAGTTCATCTCGTTGGGCCCATTATATAAGACATCATCAAAAGCAGTGggaagcagaaaaaaataaaagtgctgTCGTAGAACATGGGAAAGGAAAGAAGGAGTGTACTGAAAGTGAAGAGAAATTGGATGCGGAGGATATATACATGAAGTATTGCTCGAAAACGGGACGTCGAGGTGGTGACGTGTTTTTGTGTAATTACTGTAGTATTGGCTTTAAGTGCAGTAACCAACTGTTTTATCATGCACAGAAAATGCATGGCAACATTCAAGTTGTGCTTGGAGGACGTTTGTCTACAGGAGGAAGCAAAAAGTGTGATGAGTGCCCAAGTACATTCAAAACTGTCTCCGATTTAAGATTGCATAAGTTACTACACAGAGGTATCACTCCTTACCCCTGCAAGTATTGTGGCATGGGGTTCAATTGTAGATCATCTGTACTCAGTCATATTAAGGTTGAACATAAAGGGGGCAAAAATGTTATTCAGAATTACACATACAGCACAAGCCAAGTACTGCGTAGAAAGAATTATGAGGAGTATGGGGATCTGATATGTGATGTTTGTAAGGTGGGATACAAGAATGAAAAAATTCTCAAACGACATATGTTACGACACACTAAACCTTTTATATGTAATGTGTGTGGGATTGGCTTCGATAACAGAAATGCGCGTTGGTATCATCAAAGGAAAATGCATTCCGACAAGGGTGCTGAAGATAAAAAAGATAAAGGCAGTGTTGCTGGACAGGAAATTCATAAAGATAAATGTAAACGTgccaaaaaagaaaacaatggaGAGCACAAGTGTGGTGAATGCAGCAGTTCCTTCATCGATCTTTGCCAGCTTCATGAACACATTCTTGTGCATCGTGGTGTGGTAGCATATAAATGTCGTCATTGCCCGTATCGTGTTTCTACCCTGTCAGGACTTAAGCTGCATCTCAGAAAACAACATTCACAGAAATATATCCAACTTGAGAAAGATAGCGTAGTTCAAACCAGAGTGACACAGAAGAAAATTGTGAAGGAGAAGTCACAAAGTTGCAAACTCTGCAAACTTTGGTTTGCAACTCAGTCCTTGCTTAAAATGCATTTGGCTTCAGAACATTATGACAGAAAAACAATTAGCAGATATGCATGGTGCAAGTTGTGTCCAAGGTATTTCACCTCACCGATTTTACTGAAGCTGCACATGATGATAAGACATACAAAGACCTCAGTCATGTGTAGGGTGTGCGACACGCCATTCACGTCACAGGCGAGCTTGGCTTATCATATGTGGAAACTGCACAACTTGAGAAAGCAGACAGAGATGAGCCCTAACAAACAGTACTCCAAGTCACAGAATGTACCAGTTGTAAAGCCAATCTACATTGGCAACACACCTAGACGTTCGTCAAGGCTGCAACAAATTACTGTATCAAATGAAAAGGAGGCAAAGAACAACAAGAGTTCTCATACAAAACAGCCAGATCTAATTGTAACAAGAAAACTAAGCACTAAAGCCAATTGTAAACCTAAACAGAAGACTGCAGCAATACTAATGCCAATCAAAAAGAAgactaagaaaataaaatgcaaggTTTGTGCAGGCCAATACCCCAGCAAAGTACAGTTGCATAGCCACATGATGTCATCTCACAACATCATGTTATTTGATTGTGACTTGTGTGGGAAGATGTTCACAAAAAAGATTTCGTTACTTCGTCATATGAAATGGCATTTCCCACCCACTTATAAATGCTCTCATTGCACAAAATGTTACAACAAGAAGTACCTCTTCACGACACATATTTGTGTTAATGCAGAGGGACGAACATTTGCGTGTAAAGTCTGTAGCAGTACATTCAACACTCGTGAGCTGTTATTGTTCCATCGCCGGACAAAGCACATTTTTGTAGACACCCAGTGTAAATGTGACACTTGTGGAGCATAtctaaaaaacaaagtttgtttgAAAGATCACATCAGGCGGTTCCATCTGGCACATCCTTACTTATGCTCTATTTGTGGTGCCACATTTTCTCACAatgcttatttaaaaatacacgcAAAACGTCATGAAGAATTGAAATGGCAATGTCCACATTGTCCTCGCAAATTTCTCACGAGCCGTGACTTCTATCAACATAGGAACATGATTCACCTCAAACGCTGCCATGTGTGCAACATTTGTGGTCGTGCTATCAAAGCCAAACAACATTATGATTTCCATATGGCAAGACACAAGAACAAGGTAAAGCCTTTCCCATGTTCTCGCTGTGAGAAAAAATATGCCACTTTAACTGCACTCAAGTACCATCTCAGAGTTGCCCATTTAATTCAGTAG
Protein Sequence
MECAWASPHNEQDCKFESVMLQEQASEEIDPEPMQTLTEEEYAIILEGTEIKEERIPEDSSDEEPFNLEEIAGEKLIPQVAASFGRDPLKSGKNLPLLSAYSLATPWFSLYLASSLCGDELKADDNWNPQDAVNLAAPQVQEADDNWNPQDAVNLAAPQVQEADDNWNPQDAVNLAAPQVQEADDNWNPQDAVNLAAPQVQEADDNWNPQDAVNLAAPQVQEADDNWNPQDAVNLAAPQVQGYSVYTCEHCSTCFLNQETHQSHVCTNTYTVTNVQPSTSSLLFTRLFINPFYLKNHIKVCQMQPNVKRLVVRRQGIGFQQTSQAPAIQRQGIRFQQPKSQAPAIQHPREKKKEKKYTCKICNIKLSCKTEAEKHMISAHCQSSDSDEEISCDRCHFTCSSQRHLQEHLKLHKTKFSCTRCDFICFSIADLSDHNKIHELACSEPKFSCDICDFICDSRDELVSHDATHINSPQDSTLMCNTCDFVADSMIELQQHKKLHLAESLFQYPCQFCGVSFINENELKGHVKENHSNDNQLGLHYSSTRDSDVDDTDSDNSDDSVEEYQPDSTESDDNSFACYLCDRDFSSISFLRLHESKVHSTTTRAKLHKPQFGKEDPDFSESIQKCLTERQRLHNDIFLCSHCNIEFISSNLLFYHAEKFHADIRMKLQNKRKLGRSGPNKCDECPSSFKSASYLRLHKLLHRGIKAFPCRYCGKQYAFNNAVLNHYKKVHPESGGVKNSKRKKARSRGSFPCEECGATCESKKQLRQHMMRHRGIKAYHCNYCPWKTTSISGLKTHVIRLHVQEYKQFQQDKAKKKMEQISLSRHEESSDDDSDDIESSESNDDKKSCAEKNETKNAGYRSESNLQTEPLSGQQNLQKRKHEEENEDVLKEKQRKLKASTLDFYSGGLELQNCVPVRDSNESAVRSGSPRPLMIDESIEEVLTNDKENSSGDERDVYVHNDKVSDLNNEETKSSEEEKDSERNSQTLTQALKGSQWLKFLQCNHCLVKLPSIEKVFAHGQLNHKDIFVKFKATSTGSFQCEECSASCDTMKQLQEHLMRHRGIKPYLCRYCTWRTTSANGLKRHVKRTHELEHYNFLEAIEPGRTKPTISVEYSNVENVKQQTIGTLEKIQEKKQDSQQTEVQFQDYEIDHVQSVWYDYYECNHCLERFPSNTKLFIHGQLKHKDIFINFKSRATGPLKCDECPASCEDRAQLRDHVMHHRGIKAYYCNYCTKKGYTSSTIKIHVMTVHKEEYVKFQKSKEEKIKLKKLSEINTKDVQLKQNIIASLTNDTKKMALKEGAEVTPKYFKSSNDIDEYLCSHCNMKFQSHIMLFEHAQKHHSDVQCKVVTSSDAMGGDFQCNECSATFAKISFLRSHILLHRGVKAYVCRYCGQTYACRLALEMHYQADHPNGEMIDHTKLLVRTANEERGNFKCDVCKLGFKTIKILARHKLRHIENYRCKYCNQGFARSSSRWAHYIRHHQKQWEAEKNKSAVVEHGKGKKECTESEEKLDAEDIYMKYCSKTGRRGGDVFLCNYCSIGFKCSNQLFYHAQKMHGNIQVVLGGRLSTGGSKKCDECPSTFKTVSDLRLHKLLHRGITPYPCKYCGMGFNCRSSVLSHIKVEHKGGKNVIQNYTYSTSQVLRRKNYEEYGDLICDVCKVGYKNEKILKRHMLRHTKPFICNVCGIGFDNRNARWYHQRKMHSDKGAEDKKDKGSVAGQEIHKDKCKRAKKENNGEHKCGECSSSFIDLCQLHEHILVHRGVVAYKCRHCPYRVSTLSGLKLHLRKQHSQKYIQLEKDSVVQTRVTQKKIVKEKSQSCKLCKLWFATQSLLKMHLASEHYDRKTISRYAWCKLCPRYFTSPILLKLHMMIRHTKTSVMCRVCDTPFTSQASLAYHMWKLHNLRKQTEMSPNKQYSKSQNVPVVKPIYIGNTPRRSSRLQQITVSNEKEAKNNKSSHTKQPDLIVTRKLSTKANCKPKQKTAAILMPIKKKTKKIKCKVCAGQYPSKVQLHSHMMSSHNIMLFDCDLCGKMFTKKISLLRHMKWHFPPTYKCSHCTKCYNKKYLFTTHICVNAEGRTFACKVCSSTFNTRELLLFHRRTKHIFVDTQCKCDTCGAYLKNKVCLKDHIRRFHLAHPYLCSICGATFSHNAYLKIHAKRHEELKWQCPHCPRKFLTSRDFYQHRNMIHLKRCHVCNICGRAIKAKQHYDFHMARHKNKVKPFPCSRCEKKYATLTALKYHLRVAHLIQ

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-