Basic Information

Gene Symbol
-
Assembly
GCA_001188975.4
Location
LGAM02013578.1:148298-192636[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 40 7.4e-16 1.2e-12 48.6 4.8 1 86 847 919 847 920 0.86
2 40 2.4e-14 3.8e-11 43.8 6.5 1 87 947 1016 947 1016 0.82
3 40 2.1e-16 3.3e-13 50.4 0.3 1 87 1038 1110 1038 1110 0.85
4 40 2.1e-13 3.4e-10 40.7 1.7 1 87 1203 1272 1203 1272 0.81
5 40 1.7e-13 2.7e-10 41.1 5.3 1 86 1296 1367 1296 1368 0.80
6 40 2e-13 3.2e-10 40.8 2.2 1 87 1404 1474 1404 1474 0.78
7 40 1.5e-10 2.4e-07 31.6 2.7 1 85 1520 1588 1520 1590 0.74
8 40 2.4e-15 3.9e-12 47.0 2.1 1 87 1615 1681 1615 1681 0.82
9 40 4.7e-12 7.5e-09 36.4 0.4 1 86 1702 1771 1702 1772 0.79
10 40 2.5e-13 4e-10 40.5 1.9 1 87 1799 1871 1799 1871 0.86
11 40 0.0016 2.5 9.1 0.0 1 58 1922 1972 1922 1997 0.80
12 40 7e-13 1.1e-09 39.1 0.1 1 86 2018 2096 2018 2097 0.83
13 40 7.6e-13 1.2e-09 39.0 1.4 1 86 2123 2192 2123 2193 0.80
14 40 5.5e-12 8.8e-09 36.2 6.6 1 85 2268 2337 2268 2339 0.81
15 40 8.9e-15 1.4e-11 45.2 1.3 1 87 2361 2430 2361 2430 0.83
16 40 6.9e-11 1.1e-07 32.7 0.5 1 86 2809 2879 2809 2880 0.78
17 40 1e-15 1.7e-12 48.1 0.3 1 86 2929 3004 2929 3005 0.81
18 40 2.2e-05 0.035 15.1 0.1 1 62 3042 3106 3042 3122 0.73
19 40 7.9e-13 1.3e-09 38.9 4.4 1 87 3141 3211 3141 3211 0.81
20 40 2.4e-14 3.8e-11 43.8 5.0 1 87 3238 3325 3238 3325 0.79
21 40 1e-11 1.6e-08 35.4 1.7 1 86 3355 3423 3355 3424 0.79
22 40 6.8e-14 1.1e-10 42.3 5.9 1 87 3447 3516 3447 3516 0.82
23 40 1.8e-11 2.9e-08 34.5 0.7 1 86 3567 3633 3567 3634 0.80
24 40 4.1e-13 6.5e-10 39.8 1.3 1 87 3675 3747 3675 3747 0.80
25 40 1.1e-11 1.7e-08 35.3 0.1 1 86 3776 3847 3776 3848 0.78
26 40 1.6e-14 2.5e-11 44.3 1.4 1 86 3872 3942 3872 3943 0.80
27 40 0.00081 1.3 10.0 0.1 1 43 3983 4022 3983 4031 0.83
28 40 4 6.4e+03 -1.8 0.0 50 84 4061 4089 4046 4090 0.68
29 40 7.5e-16 1.2e-12 48.6 3.2 1 86 4116 4192 4116 4193 0.84
30 40 1.9e-13 3e-10 40.9 0.1 1 86 4230 4305 4230 4306 0.80
31 40 3.2e-12 5e-09 37.0 1.1 1 85 4506 4575 4506 4577 0.78
32 40 4.7e-13 7.5e-10 39.6 0.9 1 87 4667 4738 4667 4738 0.82
33 40 1.9e-13 3.1e-10 40.9 0.8 1 86 4817 4893 4817 4894 0.82
34 40 8.4e-13 1.3e-09 38.8 0.4 1 86 4942 5011 4942 5012 0.80
35 40 1e-13 1.6e-10 41.8 4.3 1 86 5075 5149 5075 5150 0.82
36 40 7.6e-14 1.2e-10 42.2 0.4 1 87 5312 5381 5312 5381 0.81
37 40 3.9e-10 6.2e-07 30.3 2.7 1 87 5428 5497 5428 5497 0.82
38 40 4.9e-13 7.8e-10 39.6 0.8 1 87 5540 5613 5540 5613 0.79
39 40 3.2e-12 5.1e-09 37.0 0.3 1 86 5634 5704 5634 5705 0.76
40 40 1.7e-10 2.7e-07 31.5 3.1 1 86 5728 5796 5728 5797 0.83

Sequence Information

Coding Sequence
ATGTcgcaaaatcaaaacaaacaacatttacaacaacagcaacaacttgAGCATcaccaactacaacaacagcaacaccaccACCATCAACAACAGCTCTTAcaccatcatcatcaacatcagcagcagcaacaacagcaacgacaaTGGCATCAACAAtatcaacaccaacaacaacagctacatcATCACCatcagcaccaacaacaacaacaggcacaatatgctgctgctgccgcaGCACATGCACATGTTGCTGCGGCCACCGGTATGAGGAGTGGTAATGGTATGCCTGGCATGTTTGGAAATACACTAGGTAGGGGTGGTTGTTTATATGACACGAGTGGTGGTAGTCTCGTTGGTGGCGGCAACGGGGGCGGCGGCGGAGTCGGCGGCTTTGACCTTGATATGGCTGGCCATCATATAGGTAGTGGAAATGCCTCGGCAATTACGGCCACTTCTACAGCACCAGCTGTTGGTGTTAGCATTGCGGGTGGCTCAGTGGGTGGTTATATGCCTTCACATGTCAATGTTTCTTCTGTCGCCGGTGGCGCGTACAATGCCGGTATGTCTGGCATGTCGACTGCTAGTGAAAGTGGAAGGAGAGCAAGTGTGGGCGGTTGTGCATACACACAGGCATCGCAGTCACAACCACAGCCGCCACCACAGCAaccgcaacagcaacaacaacacacaatttTGCCACCGGAAAGAATAAAAATGGAGCCACTGGAGCAAATTCTCACACCAACTATCGAAATGGAAGAATTGATAATAAAAACCGAACCCACAGATGATACGTACAATAAAGTTAGTACGGTCGATGAAGCCATTGCTACAGGAAGCACAAATATGAATCCTAAcgctggttatatgaatttttctaagCATTTACAATCATATCCTCcgcatcagcagcagcaacatcaacaacagcagctgctggaacaacaaaaacagcgccAGAAATTGCAACATCAACATgcattgcaacaacagcaacatctacagcaacagcaacaacagcatctgcagcaacaacagcaattgcagcagcaacaacaacagcgtgagcaacaattgcaacagcaaatgTTATCACACGATGCCATGGCCGAAGCCGGCTTGTTAACAGTAAAACTTGAACCGCAAATTCATATTAAAGAGGAAATGCCGGAAGCGTCACAGAATAAGCCGCTCAACTTTCCACGCCGTAAAGTTCAAACGGAACGTTCCGACACATTGCCAATTTGCCAGCGCTGCAAACAGGTATTCTTCAAAAAGCAAAGCTACAGCAAACACGTTTCTCAAAGTCTCTGCGAAATTGTCGAATATGACTTCAAGTGTTCTATATGCCCCATGTCGTTCACATCCAGCGAGGAGCTACAAACGCACGAACAACTACATCGCGAGAATATGTATTTCTGTCACAAATACTGTGGCAAATATTTTGACACTATCGAACTGTGCGAGGTGCACGAGTACATGCAGCATGAATATGTCAATTATATCTGTAATGTCTGTTCGGCGGGCTTTGCTAGTCGCGATTTGTTATTCGCTCACATGCCACAACATCGCAACCAGCCGCGCTATGATTGCCCGGTATGCCGTTTGTGGTTTCATACAGGCATACAGTTGCATCAGCACCGCATACAAGCGCCATATTTTTGTGGGAAATTTTACCGTCGCGCTGGTGGTGCAGCACCTACCATGCCTGCTGGTGGCGCACCATTTGGGCAGGTGCCGCCAACTCATTCGACTAATTATAATCTACAAGACTGCTCCATGGGCATTATAGAAATGCCCAATAGCAATAGCTTCTCTTCATATCTACAAAACCGCGCCTTTCATCATCAACATCCAGCTCCGCCACATCCGCCGCTGCCACCGCAGCAAATGCATCCACATctcgcgcaacaacaacaacatttgcaacGTCAACAACACCAATTACATATGCAACAACAGCATCCACAATTTCCAACACAACCGTTACATGCAGCACCGCAACAACAGCCAACACCTGCTCAGCCAGATTTTATGCGCCGTAACAGCATTACTGGTGCCACCGGCAGCGCCACAACAGAATTTCAAATGCCACAAATTAAAACCGAAATAAAAGTGGAACCCGATCTTTATGGCACACCAGATTATCCGCTGCAGACACCATTGCCACCACCACCTTTACCGCCACCGCCGTTAAGCTCCGCACAACAGTCTGCGTTAGTCTCACCATCACGTCAGCGTAGTCGCTTTGGTGATTTCACAAATGAGTCGTTCGGCGCTGGTGTTACGAGCAGTGACACTTCGCACGCATTTAGCTCGCACAATAACAGCGGCGGCAGCAATGGCAACAGTAATGATTTTTCCAACAGCAACACTTCGAGTTTGCAGCGCAACATGAgcgccaacaacagcaatgagAACAATAGCAGCAATCATAAGCCGTCTGCATTTCCTGTTGGCAATTTTCATTTCCCCACCACACCAGTGGTGTCGCGCACACGCGTGGTGGATACCGGTGAGGATGGCGCCGTTTGTTGTGTGCCCCATTGCGGCGTCACTAAACAGTCCAGTCCCACGCTGCAGTTCTTTACATTCCCGAAAGATGAAAAGTATTTGCATCAATGGCTACATAATCTCAAAATGTTTCCTGAACCCGATTCAACGTACAGTCAATACCGCATTTGTAGTTTACATTTTCCCAAGCGTTGTATTAATCGTTATTCGTTGTGCTATTGGGCAGTGCCTACATTCAATTTGGGTCACGACGATGTCGCAAATTTATATCAGAATCGTGAAATCACCAACACATTTACGATCGGTGAACGCGCTCAGTGCAGCATGCCTGGCTGTCCGAGTCAGCGTGGTGAGAGTAATTGCAAATTCTACAATTTTCCCAATGACATGAAGACGCGCATCAAATGGTGTCAGAATGCGCGTTTACCCGTGAATAGTCGGGAGCCGCGTCATTTTTGTAGTCGTCACTTTGAAGATCGCTGCTTTGGCAAGTTTCGTCTCAAGCCATGGGCTGTGCCCACACTACATTTGGGCACACCATATGGCAGAATACACGATAATCCGGGCGTATTTTATTTGGAGGAGAAAAAATGCTGTCTGCCACATTGCAAGCGCACGCGCTCATCCGATTTTAATCTATCGCTATATCGTTTTCCACGCGATGAAGTGTTATTGCGACGCTGGTGTTATAACTTGCGTCTGGATCCGGCAATATATCGTGGCAAGAATCACAAAATTTGTAGCGCGCACTTTATTAAAGAGGCGCTGGGCTTGCGCAAACTTTCACCAGGTGCTGTGCCGACCTTGAATTTGGGCCACAACGACCGCTTCAATATCTATGAGAATGAGTTGCCAACACCACCGGTAGTACCGCCgtctaaaatgtttaatttccaTAATGTCTCCGCGCCACCACCGGTCTACAGTAGTCATCGGCACAGCATTGCGTCGAGCAGCAGTCACAGCGAACGCTTGTATCCAAATCCTGTATCGAAGCTCAAATTTAGCATATCGAATATAACGGCGGGTGATGTGAGCGCCATGAGCGCCAGCGCCACACAAAATATCTTAGATTCATTGGAAGTTTGCTTTGTGCCAAGCTGTAAGCGTAATCGAAATATAGATAACGTGACATTACACACCATACCGCGTCGTCCCGAACAGATGCGCAAGTGGTGTCACAATCTGAAAATCGGCATCGAAACGCTGCATAAGGGTGTACGTATATGCAGCGCGCACTTTGAGCCATATTGCATTGGCGGCTGTATGCGTCCATTTGCAGTGCCCACCTTGAATTTGGGCCACGACGATCCGAATATCTATCGCAATCCGGATGTTATAAAGAAACTGAATATCCGTGAAACATGTTGTGTGCAAGATTGCAAACGTAATCGTGATCGCGATCATGCTAATCTGCATCGTTTCCCATCGAATTTCGAAATGCTCACCAAATGGTGTGAGAATCTGCTGAAGCCAGTACCCGATGGCACTAAACTCTTCAACGATGCCATATGTGAGGTGCATTTCGAAGAGCGTTGCATACGCAACAAACGTTTGGAGAAGTGGGCTGTGCCCACGGTGAAACTTGGTCATACCGAGGAACCGAAACATCGTTTGCCAACGGATGAAGAGATTGCTGAGCAATGGCCGAAGCCGGTAATGCCAAATAAGGGTATAGAAGAAGGTGAATGTTGTGTGTCCACATGCCGACGTGATCCGAAGATCGACGATGTTAAACTCTATCGCACACCCGAAGATCCCGAGTTGTTGGCGAAGTGGGCGCATAATTTGCAAACGGAAACGGAAGATCTGACCACATTACGTATTTGTAATCTACACTTTGAAGAGCATTGCTTTAGCAAAAAGCGTCGCCTACACTATTGGTCATTACCCACACTCAATTTGGCTACGAATGTAGATCAATTATATGAGAACCCCGAACCGCTTCTACCACAGGTGATCATAAAGTCGGAGACGAAACGCGAACGTCGGCGCATGCGCGATTCCAAGGAGCCATTGAAGCCGTGGACACCACGTTGCTGCCTGCCGCACTGCCGCAAACGACGTGATACGGATCACGTGCAGCTTTTCCGTTTCCCAGTGCGCAATCGACCCATGTTGGCTAAGTGGTGCCACAATTTGCTGCTACCGTTGGTCGGTCATGGCCATCGTCGTTTGTGTTCTACTCATTTCGAGCCGCGCGTGCTGTCGAAACGTTGTCCCATACCGATGTCGGTGCCCACGTTGGATCTACATACCCCACCAGGTTATAAAATCTATATGAACCCGGCGCGTCTAAAAGCCGTTAAGTTGCAACAAGTCTGTTGCATATCATCCTGCAGTCGTACACGTGCCGATGGCGTACAGCTCTTCCGATTTCCGCACAGTCGCAGTGTGTTGCGCAAGTGGTGTCACAATACGCGGCAGCACGCTCGCGGCGAATATCGCGTGTGCTCGCGTCACTTTGAACCGCACGCGTTCGGTGCTAAACGTCTAATGCCTGGCGCAATACCGACATTAAATTTGGATCCAGAAGTTGAGGATGTTTATGCGAATGAAGCACAAAGCTTCGCTGAAACGCAGTGCGTAGTAAGCGGTTGCGTCGCCAGCAAGGAGTTGGATGGCGTACGGCTCTTCAAATTCCCTAGCGATGATGTCGATTTACTATGGAAGTGGtgtaataatttgaaaatgaaccCGGTAGACTGCCGTGGCGTGCGTATCTGTAATCGACACTTTGAATCCGAATGCGTCGGTCCAAAGATGCTCTACAAGTGGGCTGTACCGACCCTAGCTCTCGGTCATAACGATGCCGAAATCAAGTTGGTGTCTGTGCCGCCGCCAGAGGCGCGTTACAGCGATGTAGTAACGAAGTGTTGTGTGCCCACATGCGGCAAGTCACGTAAATTTGATGACGCTCAAATGAACAGCTTCCCGAAGAATTTGCGTCTATTCCGCTGTTGGAAACATAATCTAAAATTGGACTTTTTAGACTTTAAGGATCgtgagaaatataaaatttgtagtgATCATTTCGAACCGGTTTGTCTGGGTAAACTGCGTCTCAATTATGGCGCTATACCTACACTCAATCTGGGCCACGATGACACAGACGATTTGTATCCCGTTGATCCTGCGCAGATACAAGTGTCGCTTTTCGGCAAAATGCGCCCACGCATTGCTAGCGATAAGGACGCAACTGAATTTGAACTAGCTAGTTGTGTTGGCAAAGATGACATAAAATGTTCCTATCCCAATTGTACTGCAACGAAAATGCTGTTAAGCGAACCTTACGACATGCCAACCGCTTTGCCGCTACATGCGCTTTGGTGCGCACAAATGAAAGTCGACACAGAGGCGCTCAGTGAGATGCCAAAACTGTGCGGCCTGCATTTCTTAAAGCTCTACAAAGCTACATTAGACGCGGCTAACGCGCTATCCGAGGGTGACAACGACCTGAGTGAGGCCATGAAATCTCTGAACGCAACTTATGAAAAGTGTTGCGCATCACCAATTGTTTGTAGTGCGCAGTGTTGCGTTGTCGGTTGCGGTAGTAATCAGGTTAGTGGCAGCCGTACAGCCCAACGCCTTTACTTATTTCCAACTGCGGGCGGTAGTGGCAtagaattattagaaaaatggTGTCACAATGTAGATGTTAGTGCATCGGACCTCACTCGATTTACGCACAGAGTATGCGCGCGCCATTTCGACGCGCAATGTATTGGTCCGACACAACGCCTGCGCTCATGGGCCATACCCACTTTGCAGCTCAAACAGAAACCGGAACATAAAATACACGCCATACCAGAATTGGGTAGTGCGTCGCGTCCGCATGAAGCGCGTTGCTGTGTAGCCACGTGCGCACGCGAACGCGGTGATTTCGAAACAATGCGTCTTTTTAGTTTTCCCATTATCGATGAAGTGCTCGAAAAGTGGTTGCACAACTTGCAATTGACACGCAATCAATGCGCACGCTTACGTATTTGTGAACAACATTTCGAGCCGCGTTGTATTGTCAAGACGCGGTTGCTGCGTTGGGCAGTGCCAACATTGCTGCTGGAGCGCAATGCAGAAGAGCTGCTACAGAACGAAccgcctgtgcaagctgcaGCGCCTGTTGCACCAACCGTTGATGAAGTCGATACAATTGACGCTGTAAGTAATAACGAAGGCGACGAAGATCTGCGCGATGTTGATAATGAAGAGGATGACAATGATAGCGGTGAATTGGCTGCCGTGCCTAACGTTAAGATGAAAAAATCACTTGGTATTTTTAAGTGTTGTGTACCGTGCTGCAGACGTACGCGCCTGCAACACGGCGTGCGCCTCTTCCAGTTTCCCACAAGTCATCAAATCTTACGCAAATGGTGTCATAATCTAAAAATTTCGCTCGAAAAGGCTTCAGATCCCGCCTTACGTATTTGCAGTGTACACTTTCACAAACGTTGCATTGACGGCAAAGAGTTACGTCCTTGGGCCAAGCCGACCAATGCGCTCGGTCATACCGGTCCCATCTATGAGAATCCTAAAAATCTACCCGGCGTCTTTTTACCCAAATGCTGTTTGGCACATTGCCGTCAGCGACGCACACTCGAAAATGACCTACGGACATTTGGCTTCCCTAAAGATGAGGCTCTTATGCGCAAGTGGTGTGCAAATTTACGTATGGAACCACGTGCTAATCACGCCCGCATCTGCATGTCACACTTCCCGCCCGAGGCGATGGGTAATAAGAAGTTGCGCGCCAATGCGGTGCCTACGCTAAATTTGGGCCACAACGAACCGTTGGAGTATGACAACGCTCAGTTGATCGCGACAGCGATGAAGGCAAGAAATCCATTAgaggaaaaaaaagttttaaattcgaATGCTAGTGATGCTGAATTAGGAAACAACGGCGAATACGATGATGAAATGGACGACGAAGACAATGATGATAATGACGATGATGACTATCACATCGATGAAGAAGATGACGAAAACGACTTTTACGCTTCCGCCGGCGTTGGTAATAGCGATGAAGAGGAAGAGGAAACGGTTGAGGACTTTACGCCGAGCATTAGTGCCGCAACTGGTGACGACGATGAGTTGTATAGTAACACCGAAGCTCGTGATGCCGACAATTTCATAGTCGATTCGGCAGATGAAGATGAGGAGGGCGCCGAGGTCGAAGATGAGGATGAAGAAGAAATCTACGGTAACgaggatgatgatgatgaggaCGATAGTGAGGAGAGCGATGAAAGTGTTGCCGAAAATGTATACAACAGTCCTGCAGAGGAGGATGACAGCGACAGTGAAAGCGAAGTTGACGCAGACGACAACAACGCTGACAGCAAAGATGCAGAAGTCGACGATGATGATGTGGAAATGCTCATCGATGCCGATGATGATGAAGATGATAAAAGCAGTTACAATGCTTGCGATCCACTCGATTTTGTGGAATGTGTTAACCACAGTGATGATACGCAACAACATAAAGCGGCGGCGACCTCAATAAAAATCAACAATCTACGCAgagcagaaaaaaatttgcGTAAGCGTCCAATATCGTCGACACCGACAAATTATGAAGATGACTCCAATACAAACTATAGTAATGTCGATGAAACGACACGTTTCACTGCAACCACTTCGAATTCCGCATCGGCAACCACATCGTCGGCATCAGCTTCAGCTGCGGCCACGACCAACGCACCGAATACACATCATTGTGGTGCTGAAAAAATAAGCCGTTCGGTTTTTCGGCTTTGTTGCCTCAGACATCGTAGGAAGAAGAAAGCACCACCAGATCCACCACCTGATATGCGATGCGTGCGCGAAAAAGTGCTAACAGCACATCAGACATTGATGAATAGTACTACACGCGTTAGCTTGCGGCAGCGTCAACAGCGCGCTTACTGTGCTGTACGCCGTTGTGGACGCACTGCTGGTGCTGCTGTCATTCTATATCGTTTTCCATTAAGCGGCAGTCGTTACTGTCGCCGTTGGTGCGCACagttaaaagtaaaaatgtcgCACTCATCACGACTGCGCATTTGTCAACGACACTTTCCATATAGTTTAGTGGATCGGCGTCGTAGGCGCTTGCGTTTCGGCGCAATACCAACACGCAATTTACATAACACGTCAGGACAATTTAAGCGTAATCCACTTTTtggcttatataaaaatacacaattgACAGCAGACCCATCAGAGCGCGAGGCGTTCGCGGATACAACTCAAACAACCGCCGCGCAATTGAACGTTTATAATCGTTGTTGCGTGCCACATTGTGGCAAATCGCATCAAGTTGATGGCGTAACACTATTCCGCTTTCCAAAACTACGCTCACTCTATCTCCAATGGGCTACGAATTTGCGCTTAATGCCAACCATGCGTTTGGTACAAGTCTACAAAGTTTGTAGCGACCACTTTGAGCCCAGCTGTCTCAGCTATCAGCGCAATGATCGTGCAAAGCTGAAATACGGCTCTGTGCCTACACTGAAGTTGGGTCATAGCGACACTTTAAAAATCTATAAGCAAAACACGCTGCCGCCACAAAAGCGTCGCGGCCTTGGCAGGCGCAAATGGCATCCACAAAGAGGTGTCAACGAATGCGCTGTGCATGATTGTAAGGTAGCGCAGTTTCTACAAATGCAACTGTTTATGCTGCCTGCTTCGCTTAAACTGCAAGAACGCTGGTGTAACTACTTCAAATTAAACTTCACCGGCGCATCGAAAGACGCCACCGAATTCTTTCAAAATGTGCGTTTATGCGCTTTGCACTACATGGAGGGCTATCAAATGGCGACGAATAGCGATGGTTCGCGCAAAAATTCATCTGCGGCGTTGGACGAACTCGAGGTGAATTATGCGCGCATAACTAGTTCGACGCGCATACAAATGCTGAAGTGTTGTGTGCCCAATTGTTCGACGAAATTCACGGACAACTTGCGTTTGACCGCGTTTCCCAGTGCGGAAGAGCTGCGCGCTAAATGGCAACACAATACACAAGTGTCATTTAGCTCATCGCATCGTTATTTGTATAAAGTATGCGCATTGCATTTCGAAGAGCGCTGCTTCGCCAAAAAACGCCTCTTTATGTGGGCTATACCGACATTGCATTTACCGCGACCGCAAAATCAAGATCCAGGACATAAGCTTTTTGAAAATCCCAACTTTGACATCGTGAATGGCGCCCAATGTTGTATTGAAGATTGCACAACAAATGAAGTCAAACAGGAACCTTCAGCGAATGATGACGGGGAAGTGTGTAAGGCGGCAGTTCGCTTTTGGCATTTTCCACAAGACGACGCATTGCGCGATAAATGGTGCCATAATTTAGGACTCAGCGCGCAAACTAACGAAATCAGTCACACCAGTCGGCGCTGGCGCATTTGCAGTCGCCACTTCGAGTCATTTTGCATTGGTAAAACGTTGCACAGCTGGGCTGTACCCACACTGAATTTACCTAAACCGGTCAAACATGCTAAGAGCGGCAGGCGTTCCACATATATTTACCAAAATCCCGATAGCGCTGCAGTCTACTATCGCTGCTGCATTAAAACATGCCATCAGCTGCGCGATCTCGATGCTGGCATACGTCTATACGCTTTTCCCAAAAAGGAGACAATGCTACAAAAATGGGCGCACAACATACGCATGCCCGCGGTGAAGTGTCGATACGCGCGCATCTGCACGTTGCACTTCGAAGCGCAATGTTTGCGTCCGCAAATGCAATCGTGGGCGCTACCGACAATCGATTTGGGACATGACGAAGCGGATATTTTCCGAGTGCCGAAGGTGAAGTTAATGGTTACGAACGAACGCTGCTGCCTACCACATTGCAGCAAGCGACGCAGTCGGGACAATGTGCACCTTTTCACTTTTCCGCGCGACAAACATGTGTTAGACAAATGGTGGCACAATTTAGCAATTGGCGTACAGGACGTAAAACGACGTCTCATATGTGAATCACATTTTGAACCACGTTGCATAAGCAAGCGCCGTTTGAAACGTTGGGCCATACCAACACTCCATTTGGGACACACAAACGAGACGCTTAAAAATCCAACACCAGATGAAGTGGCGacttatgaaaataatacaacaacGCGACGGTCACAAACACCCTCGAAAATGACGCGCGCTAAGTCAACTCAAACAACGCCTACAAGCACTTTACAGAAGTGTTCAATCGCACAGTGCGCAGGTGGCACCGATAGCAGCGCGCTGTATCGCTTTCCCAAGCCGGACTGGTTACGCAAGAAATGGTGTGACAATACGCGTATAGACGAAGAGACTGCCAAACAGGCGAAAATTTGCGCACGCCATTTCGAACCACATGTTATGGGCAATCGCAAACCGCGTCCATGGGCGTTGCCAACTATGGAATTGGGCGCTGACGAAAACGGCACGCCATTGCCTGCCGTACATGCAAATCCTAAACAGCTGTCTCGTTTTCATCCCGAAGAGCATGAATGGGGCGAGTTGCGTTATGTGCGCGCTAACCATTGTTCTATAATATCTTGCTTGAAATCTAAAAAAGATGGCGTAACACTTTTCAACTATCCCACAAATCGTCACATGTTGCAGAAATGGGCTGAGAATTGTCGCCATTATCCGTATCAGGCGAAACGTTATCGCTTCCAGCTGTGCGGCGCACATTTCACTACCGATTGTTTCAAACGTGAAGGCACGCGTTTACGCAAAGGCTCCGTGCCTACCTTAAATTTGGGACATGACGATGCGCAGATACACCAGAGTGAATTCGAAAGCGTTAGcgccataaaaattgaaatgaaaatactgAAAAGTTGCAGCGTACCACAATGTGGGCGCACAAATCTTCACGATGGCGTACGGCTCTTCAAGTTTCCCTATGAACGCTCCGAAACTCTAGAAAAGTGGTGCTATAATTTACGCATGAACGCGGCGGACTGTCGCAACGCGTTAATCTGTAATATGCATTTTGAGCCGCGTTGCGTTGGTGGTGGTCATCGCGGTTTGCTTTTGCGCGCCATACCCACATTACTGCTGGGACACAACGACACCGATATTTTAAACAATCCAGAAACATTTGAGCGCCCTGAGAAGCTAATCAGCTGCTGCGTACCCGGTTGTATTAACACCAAACAAACCGAAGGCATAACGTTGAGCGCTTTTCCAAAGCTGCGTAGCCATTTTGAAAAGTGGTCACATAACCTACAGCTGCCGGTCAGCACTACCGTTTGGCATACATACAAGGTATGTAGCGCACACTTTGAGAACTACTGCTATGAGCACGGACGTATTAAGGTGGGCGCAATGCCGACACTAAAGTTGGGACACAACAATACAATTGATTTGTACACCGTCAGCGAAGAGACAATGAGTCAAGCATTTAAGCGCAAACGTGTCGCGCCTAAAACTGAAGCGCCGAAAGCGCCACACGGATCGTGTTGTTATCCGGAATGTAGAGAAATGGAATTGCGTTTGACAAATCAACTCTTTGAGTTTCCGAAACTGGTTGCTATAAGACGCGCTTGGTACGAAAGTGTTGGGTTGAGAGCTGAAGAACAGGTCGAAGCCGAGTGTGCGGCAATTGAAAATGAAACACAAGAAACAGCGCCCGACACCAGCATCCCACAAGAGGATGCTACTGCTGCTGCTCCTGTAAAAGTTACGAAAATAACTCCCAAATTATGTCCAAtgcattttaaattactttacatTGAGCATGCCACACTGGTGAATACACTCAAATTGGACAGCACACCAGACACGCTGCGAATATTACAAAAGTTGGAAGAGACTCACATAAAAGTGTGCGATTTATCTTGTGTGCGTCGCATTAGCTGCGCCGTGCCGGGTTGTAATTCGAATTATCTCACCACTAAGTCGCTCAAATTCTTCAAATTTCCCGACAATGCAGAGATGCGCGCCAAATGGTGTCACAACACGCAAGTCACAGTCGATCCAGATCGCTTGTATTGCtacaaaatatgtgaatttcaTTTTGAAGCAAGCTGTGCCTCGCAATTGGCCAAAAAAATGCAACGTTTGAAATTCTGGGCGCTGCCCACGCTACAATTGCCGCCACGCACTGAAGGCGCGCCAGAAATATATACACTACCAGCACCTGATACGCTAGCCGAAACGAAACGCGCCTCATTGCTACTGAACACGCCACTCAATAAATGTTGTATACCCAGTTGTGTGTATGCTAAAGCACCGGCGGAGCGCGCGGTTGGCGCTGATAGTGAAATacagtttttcaattttcccaATGATGCGGAATTACTCTACAAATGGGTATATAATACACAAATCAGTATGGTCGCGGCTACTAATGCGCGCATCTGCTCGCTACACTTTGAGAAACACTGTATTTATAAACGTTTGCGTATGTATGCGGTGCCGACGCTGCTGTTGGGCCACCAGAAAACCGACATCTATAAGAATCCCTCGGATAAAAGAGTACAAGCGGAGGCAATGGAGAGCAAACCACAGAAGATGCCGAAGTATTACGAAGATAATATTGCTGATGGCGTTGAAGTTGAAGAAAATGCGGAGGATGATGATGAGTTGTTAGCTGAAGAAAAGACGCaccaaatgaaattgaaaaaatcagacataactaaatataaaactgAGTCGTTTGAAGAAAGTGATCAGCGCACAGCGCGAAAATCTTCCGCGTACGTTAAGAAGAAGTTAGACACAGAGCTGCTTGAGCCAATGCTAGTAGTGAAAACCGAATTGAAGGAGCGCAAAGAAAccgcaacgacaacaacaaataaaccgGCAACGCAATCCAATCCATATCACCAGCctttcattataaatattaaacaagaaaaggATATCGAAGAGAGCTACACCGCCGAAGGCATTGAAAATCAAATGAAGCAAGGCTTACTCGATATGTTTAATAGTTTTGGCGATACTGCCACTGAGCAGGAGCCTGATGAAATTAACGAGCTAGGCGACGAAGTTGCACTAATGGAACGTGATACCGCGCGTCATTGTCGCATTGTTGGTTGTAATAGCTACGCGCGCAATGCGGGCGTCACGCTGTTTAAATTTCCCTTTCCACTCGATCAGTTCCGCAAATGGCTGCATAACACACAGTTAGAGGTGGACTATACACGCCGTTGGCGCTATCGCATTTGTCATCGTCACTTCGAACCGATTTGCATGCAATTCCGCAAGCTACCGCCGGGCACCATGCCCACGAGGAATTTGGGTCCTAAACGCCCCGcgcatatatatgaaaatgaattCGATGTAAATAgcttaagtaaatataaaacaaaaacacaacaaaacctATCAACAACCACAACTAACAATGTGTTGAGCGAATTAAATGACGAAGCTGACACGAGCAGCTCTTTCGCCGCGCCCACACACATCAACAATGATACTGTCGATGCAAGCGCTGACTATAACGATGAGTATGCTGATTTTGTAGACAATACGCCAATAGACCGCGATACATCACGCCATTGTCGCATACCCGATTGCAATAGTCACGCTAAAGATCCCGGCGTTACGCTCTTCAAATTCCCAATGTCCGAGTATCTCTTTCATAAATGGCTCTACAATACACAACTGAAGGTGGATTATACGCGGCGTTGGCGCTATCGCATCTGTCAGCGCCACTTCGAACCGATTTGCATGCAATTTCGTAAGCTACCGCCCGGCACAATGCCAACGCTGAATTTAGGTCCTTCGCGCCCGGCGCGCATCTACGAGAATAGTTTCGATATAAATCATCTGAAGAAATTCAaaactaaattgaaaaataacacaacaacaagtaCAGCTATTGCTCCAACAGCCACTTTAGCCATAATGCACTCCAATCACGCCGATTATACCGATGACTTAGAAACGAGCAGCTCTTATGCCGTCGATGCAAATACCAGTAATACCGCTCAGCTACCGCTACTCTCATGTACTGTACGCCATTGCACGAGTCAATATCACGCGCTACACGAGGGCttacatttgcataaattgccgACGAATATTATGTTACGCGAGAAATGGATTTATAATTGTCGCTTCTCGGAGGAGACGCTCATCAACATGAGTTCACGCATACGCATCTGTTCACTCCACTTTACTCCAAACTGTTATTATGGTGTTAAGCGTCAATTGAAATTCGGTTCAGTGCCCACTTTGCGTTTAGGCCATACAGATCCAAATATATATCCGCATGGTTTTAGTAATGAAAGCGAGGCACAATTGGCGGGTAAACACCACACAGCACAGCATAATTGGCGCATCAGCCGCTTGCAGTCGTCCACTGATGCCAATCAAGATATTTGCTGCCTCATCAATTGTCGCCATAGCAGACGGGAGTACACGCGTCATTTCGCCTTTCCCACTGAGCGGGCCCTACTCGATCAATGGCTCAATGCGCTCGGCATGGAGTTTAATAGTTCGCGTCCAGATGATTATAAAATCTGTGAATGGCATTTCAAAGCGTCCGATTTCGATGGAGAAGTACTGCGTGCCGATGCGGTGCCCACGCGTAATTTGAAGATAGACGATGCCAATCAAACGGACGATGAGGATGATAATGAGGACGAAGATGATGAGTTGGCTTGGAATGCAAATGAAGAGCCCCTAGATGAACGTCCATCAACGTCAGCAGCAGCCGCTGCCGCAGCGTCAACTTCCACTGCCGATGGCTACAACAAATTAATCCCGGGCTCGCGACGATGTTGCTTGGCGCACTGCCGCAAACAACTATTCCAAGATAATGTGCGCACCTTCAAATTCCCCACAATGCATGAACAATTCGAGAAATGGGTGCACAATTTAGGCATCAAATACGACGGCGACGCGCCTTGGCGCTATCAAATTTGCAGCGAACACTTTGAAAACCAATGCATTATACATTACGAGAACAAAGCCAAGTTGTTTAGATGGGCTGTGCCCACCTTGAAATTGGGTAAACATGCGCCAGCCATACTCTTCACGAACGAGAATCCCAAAAAAATACAGCAAAGAGATGCAGACTATGAGCGCGTCTATAGAAATTCGAATGCAGGCGGCTATGCGGACAACATAGAGAATGAAGAGACTATGGATACGACAAatgacgagttgaattccacTGCGCAGCCCGCAGATGTGCCAAAGCATATGGCTTATCAGCAAGAAGATATGGATTTGCTGGCGCCAATAGAGCGACCGCCACACAAAATGAGTGGCGCAACGAAACGCAGCAACTATGCGTATACCGCGAGCAATGAGGACGGCGATGATTActatgatggtgatgatgatgctTACGATATGGGTGATGGTGAAAATTCACTCTTGAATATCATACGCGAGGATAAACCGAGTGCTGTGAAAGAAGGTACCCCAGCCTCGTCGTTCTTCTCATTGCAACTGGTGCGCGGCGGTTCTGCGAAAGTGCGTGCCTGCTGTCTGCCGCATTGTGGACGTACGCGTGAGTCCGGTGTGCGTTTATTCCGTTTCCCCACAGAACCGGTCTTTCTTAAACGCTGGGAATATAATTTACGCGTACTCTTCAATGAGTCACAGCGCAATACACACTTAATATGTAGCGCGCATTTCGAGCGCGGACAATATAATAAACGTTTAGTCGTAGATGCCATACCGACATTGAATTTGGGGCACAACAGCACCGATATCTATCGTAATGGACAATATGAGCCAACCAGAATGCATAAGCGCGCACTGACAGCATCACCACCGCGCATACCATCGTCAAACAGCATGCCGCTGAGTAGCCACAAGCCACTGCATTGCAATGTGCCCGCTTGCGCTGATACGCAAAGTGAGCGTCGTCTATTCCCCTTCCCCAGCAATCATGTGTTCGTGAAAATTTGGGCCGAGCGCACACAGATCGCTTACGATGCGCGTCAACATGCAGAATTGCGTGTTTGTGAGCTGCACTTTGAAACCGATTGCTTTAGCGCGCATTGCCTAAATAATAACGCTGTTCCCACATTGTTTTTGCCTGCACCAAATGCATTGCCAACCCCTGCTTCGATTGCGCCAGTCGCACCAACAAAAATTCCACCTCCCATTGGCCGCTCACTGGCAGCGTCCGCTACAATGCCGAAGCTGAGTGCCATTACTTGTAGCGTTAACAATTGCGGCAATAGTACCGCTACGCGTACggacttgaaaatattctcgaaATTCCCGGAGGATTTCGAATTATTTACCAAATGGTGTTTCAATTTGAAAATCGATCCACGCACCTATGTGGATGGCAGTTATAATGTATGCAGCGAACATTTCGAACCATTCTGCATTGGCGGCCATAGTTTACGTGTCTGGGCTGTGCCCACGTTGCATTTAGGTCACAACAGCAAACTAATACATAGTGTCGAACGTCCTGCGGAGATGGAAACGAAATGTTGTCTGCCCCATTGTGGACGCAAGAAGAGCAAAGATGGCGTGGAGTTCTATAGCTTCCCTAAAGGTGATATCTACCGACAGTGGTGTCAAATATTGAAAATCGACGAAGGCCTCTACCGCAACAGTGATAAGAAGATCTGTAGTGCACATTTCCGTGCAGACTGTTTCAACAGTAACGGTACGCTGCGAATGGGCGCACGCCCAACGTTGCTACTACGAAATCGCACCGCCACAGCAGCAGCACATATGCTCAAACCGCCCGCGCCATACCGCAGCAAGTGCATAGTGCGCATCTGTCATGAAATGCAACAGCTCTACAGCTTCCCGGTGCAACGCAATCTCTGCACGAAATGGTGTCACAATCTGAAAATTGACTACTACCCGAAACTGCATGAGAATATGAACTTCAAAATCTGTCGGCGTCATTTCGAGCCGAACTGTCTGCTGAGTGGCGGCAAATTGCATACCGAGGCGGTGCCGACGGTGCAGTTGGGGCACAGCGATGTCAATATCTATCAAAATTTGGTCGGCATGAAGCAGAGCGCCAGCACGACCAGCTATGATGATAACAGCAGTTTGCGCACAAGCGTTAGCACTGTCCACACATGGCTGATGGATGTGGATGCGGAAACGAATGCTGCTAACAATATGCCGGCTGCCGGTAGTGGTGATGCTATCACTGGTAGTCGTGCCGGTGTTGCCGCTGCTGCTATAGACGCTGATGATGACATGGTGCGCATGGAATACGAGCCACCTGTCGATTTGGAGCCAACTGTTGTCACTGAGAATATTGCCGATGATAATTTGGACTTGACCGACAACGCCTATATGCAAATGGAGGATGACACCTACTATGCGGACTTTGAGGAGCAACGTTTGTTGCCGCAGAGCAGCACTTTTGTAGCGGCTGAGAGCGCAGAGGTGATTGACCTGGACGCTGTGGATGCTGTGCAAGAACAATTTCCCAATTGGTCGCAAGACGATGCAGTGCTGGtggatgatgatgatgaggaCGAAGATGATGATGCGCTGTTGTGGccattgaattaa
Protein Sequence
MSQNQNKQHLQQQQQLEHHQLQQQQHHHHQQQLLHHHHQHQQQQQQQRQWHQQYQHQQQQLHHHHQHQQQQQAQYAAAAAAHAHVAAATGMRSGNGMPGMFGNTLGRGGCLYDTSGGSLVGGGNGGGGGVGGFDLDMAGHHIGSGNASAITATSTAPAVGVSIAGGSVGGYMPSHVNVSSVAGGAYNAGMSGMSTASESGRRASVGGCAYTQASQSQPQPPPQQPQQQQQHTILPPERIKMEPLEQILTPTIEMEELIIKTEPTDDTYNKVSTVDEAIATGSTNMNPNAGYMNFSKHLQSYPPHQQQQHQQQQLLEQQKQRQKLQHQHALQQQQHLQQQQQQHLQQQQQLQQQQQQREQQLQQQMLSHDAMAEAGLLTVKLEPQIHIKEEMPEASQNKPLNFPRRKVQTERSDTLPICQRCKQVFFKKQSYSKHVSQSLCEIVEYDFKCSICPMSFTSSEELQTHEQLHRENMYFCHKYCGKYFDTIELCEVHEYMQHEYVNYICNVCSAGFASRDLLFAHMPQHRNQPRYDCPVCRLWFHTGIQLHQHRIQAPYFCGKFYRRAGGAAPTMPAGGAPFGQVPPTHSTNYNLQDCSMGIIEMPNSNSFSSYLQNRAFHHQHPAPPHPPLPPQQMHPHLAQQQQHLQRQQHQLHMQQQHPQFPTQPLHAAPQQQPTPAQPDFMRRNSITGATGSATTEFQMPQIKTEIKVEPDLYGTPDYPLQTPLPPPPLPPPPLSSAQQSALVSPSRQRSRFGDFTNESFGAGVTSSDTSHAFSSHNNSGGSNGNSNDFSNSNTSSLQRNMSANNSNENNSSNHKPSAFPVGNFHFPTTPVVSRTRVVDTGEDGAVCCVPHCGVTKQSSPTLQFFTFPKDEKYLHQWLHNLKMFPEPDSTYSQYRICSLHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNREITNTFTIGERAQCSMPGCPSQRGESNCKFYNFPNDMKTRIKWCQNARLPVNSREPRHFCSRHFEDRCFGKFRLKPWAVPTLHLGTPYGRIHDNPGVFYLEEKKCCLPHCKRTRSSDFNLSLYRFPRDEVLLRRWCYNLRLDPAIYRGKNHKICSAHFIKEALGLRKLSPGAVPTLNLGHNDRFNIYENELPTPPVVPPSKMFNFHNVSAPPPVYSSHRHSIASSSSHSERLYPNPVSKLKFSISNITAGDVSAMSASATQNILDSLEVCFVPSCKRNRNIDNVTLHTIPRRPEQMRKWCHNLKIGIETLHKGVRICSAHFEPYCIGGCMRPFAVPTLNLGHDDPNIYRNPDVIKKLNIRETCCVQDCKRNRDRDHANLHRFPSNFEMLTKWCENLLKPVPDGTKLFNDAICEVHFEERCIRNKRLEKWAVPTVKLGHTEEPKHRLPTDEEIAEQWPKPVMPNKGIEEGECCVSTCRRDPKIDDVKLYRTPEDPELLAKWAHNLQTETEDLTTLRICNLHFEEHCFSKKRRLHYWSLPTLNLATNVDQLYENPEPLLPQVIIKSETKRERRRMRDSKEPLKPWTPRCCLPHCRKRRDTDHVQLFRFPVRNRPMLAKWCHNLLLPLVGHGHRRLCSTHFEPRVLSKRCPIPMSVPTLDLHTPPGYKIYMNPARLKAVKLQQVCCISSCSRTRADGVQLFRFPHSRSVLRKWCHNTRQHARGEYRVCSRHFEPHAFGAKRLMPGAIPTLNLDPEVEDVYANEAQSFAETQCVVSGCVASKELDGVRLFKFPSDDVDLLWKWCNNLKMNPVDCRGVRICNRHFESECVGPKMLYKWAVPTLALGHNDAEIKLVSVPPPEARYSDVVTKCCVPTCGKSRKFDDAQMNSFPKNLRLFRCWKHNLKLDFLDFKDREKYKICSDHFEPVCLGKLRLNYGAIPTLNLGHDDTDDLYPVDPAQIQVSLFGKMRPRIASDKDATEFELASCVGKDDIKCSYPNCTATKMLLSEPYDMPTALPLHALWCAQMKVDTEALSEMPKLCGLHFLKLYKATLDAANALSEGDNDLSEAMKSLNATYEKCCASPIVCSAQCCVVGCGSNQVSGSRTAQRLYLFPTAGGSGIELLEKWCHNVDVSASDLTRFTHRVCARHFDAQCIGPTQRLRSWAIPTLQLKQKPEHKIHAIPELGSASRPHEARCCVATCARERGDFETMRLFSFPIIDEVLEKWLHNLQLTRNQCARLRICEQHFEPRCIVKTRLLRWAVPTLLLERNAEELLQNEPPVQAAAPVAPTVDEVDTIDAVSNNEGDEDLRDVDNEEDDNDSGELAAVPNVKMKKSLGIFKCCVPCCRRTRLQHGVRLFQFPTSHQILRKWCHNLKISLEKASDPALRICSVHFHKRCIDGKELRPWAKPTNALGHTGPIYENPKNLPGVFLPKCCLAHCRQRRTLENDLRTFGFPKDEALMRKWCANLRMEPRANHARICMSHFPPEAMGNKKLRANAVPTLNLGHNEPLEYDNAQLIATAMKARNPLEEKKVLNSNASDAELGNNGEYDDEMDDEDNDDNDDDDYHIDEEDDENDFYASAGVGNSDEEEEETVEDFTPSISAATGDDDELYSNTEARDADNFIVDSADEDEEGAEVEDEDEEEIYGNEDDDDEDDSEESDESVAENVYNSPAEEDDSDSESEVDADDNNADSKDAEVDDDDVEMLIDADDDEDDKSSYNACDPLDFVECVNHSDDTQQHKAAATSIKINNLRRAEKNLRKRPISSTPTNYEDDSNTNYSNVDETTRFTATTSNSASATTSSASASAAATTNAPNTHHCGAEKISRSVFRLCCLRHRRKKKAPPDPPPDMRCVREKVLTAHQTLMNSTTRVSLRQRQQRAYCAVRRCGRTAGAAVILYRFPLSGSRYCRRWCAQLKVKMSHSSRLRICQRHFPYSLVDRRRRRLRFGAIPTRNLHNTSGQFKRNPLFGLYKNTQLTADPSEREAFADTTQTTAAQLNVYNRCCVPHCGKSHQVDGVTLFRFPKLRSLYLQWATNLRLMPTMRLVQVYKVCSDHFEPSCLSYQRNDRAKLKYGSVPTLKLGHSDTLKIYKQNTLPPQKRRGLGRRKWHPQRGVNECAVHDCKVAQFLQMQLFMLPASLKLQERWCNYFKLNFTGASKDATEFFQNVRLCALHYMEGYQMATNSDGSRKNSSAALDELEVNYARITSSTRIQMLKCCVPNCSTKFTDNLRLTAFPSAEELRAKWQHNTQVSFSSSHRYLYKVCALHFEERCFAKKRLFMWAIPTLHLPRPQNQDPGHKLFENPNFDIVNGAQCCIEDCTTNEVKQEPSANDDGEVCKAAVRFWHFPQDDALRDKWCHNLGLSAQTNEISHTSRRWRICSRHFESFCIGKTLHSWAVPTLNLPKPVKHAKSGRRSTYIYQNPDSAAVYYRCCIKTCHQLRDLDAGIRLYAFPKKETMLQKWAHNIRMPAVKCRYARICTLHFEAQCLRPQMQSWALPTIDLGHDEADIFRVPKVKLMVTNERCCLPHCSKRRSRDNVHLFTFPRDKHVLDKWWHNLAIGVQDVKRRLICESHFEPRCISKRRLKRWAIPTLHLGHTNETLKNPTPDEVATYENNTTTRRSQTPSKMTRAKSTQTTPTSTLQKCSIAQCAGGTDSSALYRFPKPDWLRKKWCDNTRIDEETAKQAKICARHFEPHVMGNRKPRPWALPTMELGADENGTPLPAVHANPKQLSRFHPEEHEWGELRYVRANHCSIISCLKSKKDGVTLFNYPTNRHMLQKWAENCRHYPYQAKRYRFQLCGAHFTTDCFKREGTRLRKGSVPTLNLGHDDAQIHQSEFESVSAIKIEMKILKSCSVPQCGRTNLHDGVRLFKFPYERSETLEKWCYNLRMNAADCRNALICNMHFEPRCVGGGHRGLLLRAIPTLLLGHNDTDILNNPETFERPEKLISCCVPGCINTKQTEGITLSAFPKLRSHFEKWSHNLQLPVSTTVWHTYKVCSAHFENYCYEHGRIKVGAMPTLKLGHNNTIDLYTVSEETMSQAFKRKRVAPKTEAPKAPHGSCCYPECREMELRLTNQLFEFPKLVAIRRAWYESVGLRAEEQVEAECAAIENETQETAPDTSIPQEDATAAAPVKVTKITPKLCPMHFKLLYIEHATLVNTLKLDSTPDTLRILQKLEETHIKVCDLSCVRRISCAVPGCNSNYLTTKSLKFFKFPDNAEMRAKWCHNTQVTVDPDRLYCYKICEFHFEASCASQLAKKMQRLKFWALPTLQLPPRTEGAPEIYTLPAPDTLAETKRASLLLNTPLNKCCIPSCVYAKAPAERAVGADSEIQFFNFPNDAELLYKWVYNTQISMVAATNARICSLHFEKHCIYKRLRMYAVPTLLLGHQKTDIYKNPSDKRVQAEAMESKPQKMPKYYEDNIADGVEVEENAEDDDELLAEEKTHQMKLKKSDITKYKTESFEESDQRTARKSSAYVKKKLDTELLEPMLVVKTELKERKETATTTTNKPATQSNPYHQPFIINIKQEKDIEESYTAEGIENQMKQGLLDMFNSFGDTATEQEPDEINELGDEVALMERDTARHCRIVGCNSYARNAGVTLFKFPFPLDQFRKWLHNTQLEVDYTRRWRYRICHRHFEPICMQFRKLPPGTMPTRNLGPKRPAHIYENEFDVNSLSKYKTKTQQNLSTTTTNNVLSELNDEADTSSSFAAPTHINNDTVDASADYNDEYADFVDNTPIDRDTSRHCRIPDCNSHAKDPGVTLFKFPMSEYLFHKWLYNTQLKVDYTRRWRYRICQRHFEPICMQFRKLPPGTMPTLNLGPSRPARIYENSFDINHLKKFKTKLKNNTTTSTAIAPTATLAIMHSNHADYTDDLETSSSYAVDANTSNTAQLPLLSCTVRHCTSQYHALHEGLHLHKLPTNIMLREKWIYNCRFSEETLINMSSRIRICSLHFTPNCYYGVKRQLKFGSVPTLRLGHTDPNIYPHGFSNESEAQLAGKHHTAQHNWRISRLQSSTDANQDICCLINCRHSRREYTRHFAFPTERALLDQWLNALGMEFNSSRPDDYKICEWHFKASDFDGEVLRADAVPTRNLKIDDANQTDDEDDNEDEDDELAWNANEEPLDERPSTSAAAAAAASTSTADGYNKLIPGSRRCCLAHCRKQLFQDNVRTFKFPTMHEQFEKWVHNLGIKYDGDAPWRYQICSEHFENQCIIHYENKAKLFRWAVPTLKLGKHAPAILFTNENPKKIQQRDADYERVYRNSNAGGYADNIENEETMDTTNDELNSTAQPADVPKHMAYQQEDMDLLAPIERPPHKMSGATKRSNYAYTASNEDGDDYYDGDDDAYDMGDGENSLLNIIREDKPSAVKEGTPASSFFSLQLVRGGSAKVRACCLPHCGRTRESGVRLFRFPTEPVFLKRWEYNLRVLFNESQRNTHLICSAHFERGQYNKRLVVDAIPTLNLGHNSTDIYRNGQYEPTRMHKRALTASPPRIPSSNSMPLSSHKPLHCNVPACADTQSERRLFPFPSNHVFVKIWAERTQIAYDARQHAELRVCELHFETDCFSAHCLNNNAVPTLFLPAPNALPTPASIAPVAPTKIPPPIGRSLAASATMPKLSAITCSVNNCGNSTATRTDLKIFSKFPEDFELFTKWCFNLKIDPRTYVDGSYNVCSEHFEPFCIGGHSLRVWAVPTLHLGHNSKLIHSVERPAEMETKCCLPHCGRKKSKDGVEFYSFPKGDIYRQWCQILKIDEGLYRNSDKKICSAHFRADCFNSNGTLRMGARPTLLLRNRTATAAAHMLKPPAPYRSKCIVRICHEMQQLYSFPVQRNLCTKWCHNLKIDYYPKLHENMNFKICRRHFEPNCLLSGGKLHTEAVPTVQLGHSDVNIYQNLVGMKQSASTTSYDDNSSLRTSVSTVHTWLMDVDAETNAANNMPAAGSGDAITGSRAGVAAAAIDADDDMVRMEYEPPVDLEPTVVTENIADDNLDLTDNAYMQMEDDTYYADFEEQRLLPQSSTFVAAESAEVIDLDAVDAVQEQFPNWSQDDAVLVDDDDEDEDDDALLWPLN

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00192124;
90% Identity
iTF_00191395;
80% Identity
-