Basic Information

Gene Symbol
-
Assembly
GCA_963457695.1
Location
OY735246.1:47812886-47838344[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
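The C2CH consensus quoted above can be written as a regular expression. This is a minimal sketch under stated assumptions, not the method used to annotate this entry (the domains listed here come from the Pfam HMM PF05485). The names `C2CH` and `find_c2ch` are hypothetical, and a plain pattern match is far less sensitive and less specific than a profile HMM.

```python
import re

# Consensus from the description:
# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His
# (hypothetical helper names; this is an illustration, not the annotation pipeline)
C2CH = re.compile(r"C.{2,4}C.{35,50}C.{2}H")


def find_c2ch(protein: str):
    """Return 1-based inclusive (start, end) spans of non-overlapping C2CH matches."""
    return [(m.start() + 1, m.end()) for m in C2CH.finditer(protein.upper())]


# Synthetic sequences for illustration only (not taken from this entry).
hit = "MM" + "C" + "A" * 3 + "C" + "A" * 40 + "C" + "AA" + "H"
miss = "C" + "A" * 3 + "C" + "A" * 10 + "C" + "AA" + "H"  # spacer too short

print(find_c2ch(hit))
print(find_c2ch(miss))
```

A match only shows that the spacing of the four zinc-coordinating residues is compatible with the consensus. It says nothing about the AVPTIF box or the other conserved positions.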
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 42 2.4e-15 2.4e-12 47.9 3.2 1 86 572 644 572 645 0.85
2 42 1.6e-13 1.6e-10 42.1 6.3 1 87 672 741 672 741 0.83
3 42 2.9e-15 2.9e-12 47.6 0.3 1 87 763 835 763 835 0.84
4 42 3e-14 2.9e-11 44.4 1.4 1 87 924 994 924 994 0.78
5 42 1.4e-13 1.4e-10 42.3 6.4 1 87 1018 1090 1018 1090 0.80
6 42 8.1e-15 8e-12 46.2 1.0 1 87 1125 1194 1125 1194 0.82
7 42 1.1e-10 1.1e-07 33.0 4.5 1 86 1231 1300 1231 1301 0.79
8 42 1.2e-14 1.2e-11 45.7 1.6 1 86 1327 1396 1327 1397 0.82
9 42 7.4e-13 7.4e-10 39.9 2.6 1 86 1418 1487 1418 1488 0.83
10 42 1.2e-11 1.2e-08 36.0 3.9 1 86 1515 1586 1515 1587 0.86
11 42 0.0011 1.1 10.5 0.1 1 58 1667 1716 1667 1734 0.77
12 42 5.8e-13 5.7e-10 40.3 1.5 1 86 1762 1832 1762 1833 0.79
13 42 4.4e-14 4.4e-11 43.8 0.2 1 86 1863 1935 1863 1936 0.78
14 42 1.4e-09 1.3e-06 29.5 2.8 1 85 1980 2048 1980 2050 0.81
15 42 1.7e-14 1.7e-11 45.2 1.8 1 87 2072 2141 2072 2141 0.84
16 42 1.1e-13 1e-10 42.6 0.2 1 87 2270 2339 2270 2339 0.80
17 42 1e-12 1e-09 39.5 0.7 1 86 2401 2477 2401 2478 0.80
18 42 2.8e-15 2.8e-12 47.7 0.1 1 87 2505 2577 2505 2577 0.85
19 42 1.6e-10 1.6e-07 32.4 2.2 1 86 2601 2672 2601 2673 0.75
20 42 1.1e-11 1.1e-08 36.2 0.1 1 86 2696 2764 2696 2765 0.79
21 42 3.8e-11 3.7e-08 34.5 2.5 1 86 2789 2858 2789 2859 0.80
22 42 5.3e-13 5.2e-10 40.4 0.9 1 86 2889 2959 2889 2960 0.80
23 42 1.3e-07 0.00013 23.1 0.2 1 71 2994 3054 2994 3067 0.70
24 42 1.6e-12 1.6e-09 38.8 0.4 1 86 3090 3163 3090 3164 0.80
25 42 1.5e-12 1.5e-09 38.9 2.5 1 86 3197 3266 3197 3267 0.79
26 42 1e-10 1e-07 33.1 0.7 1 86 3289 3357 3289 3358 0.77
27 42 4.9e-10 4.9e-07 30.9 0.4 1 86 3427 3495 3427 3496 0.77
28 42 2e-12 1.9e-09 38.6 1.3 1 86 3523 3592 3523 3593 0.81
29 42 0.0035 3.5 8.9 0.0 1 57 3623 3671 3623 3693 0.78
30 42 1.3 1.3e+03 0.7 0.0 49 60 3856 3868 3841 3884 0.66
31 42 8.1e-15 8e-12 46.2 1.4 1 86 3915 3989 3915 3990 0.80
32 42 8.2e-14 8.1e-11 43.0 3.0 1 86 4020 4090 4020 4091 0.82
33 42 8.4e-14 8.4e-11 42.9 0.2 1 86 4198 4268 4198 4269 0.81
34 42 3.2e-14 3.1e-11 44.3 0.8 1 87 4372 4443 4372 4443 0.81
35 42 2.4e-11 2.4e-08 35.1 0.9 1 86 4562 4632 4562 4633 0.76
36 42 1.5e-13 1.5e-10 42.1 0.8 1 87 4762 4834 4762 4834 0.83
37 42 1.3e-12 1.3e-09 39.1 1.9 1 87 4867 4934 4867 4934 0.83
38 42 1.7e-15 1.7e-12 48.4 3.0 1 87 4976 5052 4976 5052 0.83
39 42 1.3e-12 1.3e-09 39.1 1.4 1 86 5146 5216 5142 5217 0.80
40 42 6.2e-10 6.1e-07 30.6 0.1 1 87 5249 5321 5249 5321 0.81
41 42 6.7e-11 6.6e-08 33.7 0.0 1 86 5330 5397 5330 5398 0.82
42 42 1.8e-12 1.7e-09 38.7 5.5 1 87 5427 5501 5427 5501 0.84
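The domain rows above are whitespace-delimited with 13 fields each, so they can be read into typed records and filtered. The sketch below assumes that layout. The field names, the `DomainHit` type and the `0.01` i-Evalue cutoff are illustrative choices, not values defined by this entry. The three sample rows are copied from the table.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    # Field names are illustrative labels for the 13 columns of the table.
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-delimited domain row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        int(f[6]), int(f[7]), int(f[8]), int(f[9]), int(f[10]), int(f[11]),
        float(f[12]),
    )


def significant(hits: List[DomainHit], max_i_evalue: float = 0.01) -> List[DomainHit]:
    """Keep hits at or below an i-Evalue cutoff (the default is an assumed value)."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


rows = [
    "1 42 2.4e-15 2.4e-12 47.9 3.2 1 86 572 644 572 645 0.85",
    "11 42 0.0011 1.1 10.5 0.1 1 58 1667 1716 1667 1734 0.77",
    "30 42 1.3 1.3e+03 0.7 0.0 49 60 3856 3868 3841 3884 0.66",
]
hits = [parse_row(r) for r in rows]
for h in significant(hits):
    print(h.index, h.ali_from, h.ali_to, h.i_evalue)
```

With this cutoff, rows such as 11 and 30, which have high i-Evalues and cover only part of the HMM, are dropped. Whether such partial hits should count as THAP domains is a judgment the table itself does not settle.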

Sequence Information

Coding Sequence
ATGtcacaacaaaatcaaaacaaacaacaacagctagcacaacagcagcagcagcaacagcagcgacaacaacagcaatggcaTGCACATGTCGGTCCCGCCATCTCCGTGGCTGGCGCCAATTTGCATAGTATGTTTACGGGTGGCGTTGGCTGCGGCGGCAGTACCAACAACATTGGTGGGCCTTTCAGTGGGGGTGCCGCCTTGGGTTGTGATGGTAGCTTTGACCTTGAGATGCGCGCTCATAGTGTTGGTCTGGGCTCTCGTGCAGGTGGCGTAGGTGTGGTTTCTACCCCTACGTTCGATGCATTCTTTccacaaaatcaacaacaacagcagcaacaacgacgtCAACAACAGTTGCAACAAGTTTTGTACGCTGCACATTCGCAGCCACAACTACAGTTACAGCCGCCTATACCAAACATCAAAATGGAGCCATTGgaaccacaacagcagcaagaagAACCAGAACACGATATGGACATACTAACGCCAACAATTGAAATGGAGGAACTGATTATAAAAACTGAGCCAAATACGGAGGACTCCATgtataaaatgaattttccaCATGATGATAATTTTGGTTTTAGCAATTATACGCGACACAACCAACAAACTATGACGCTGAGCATGCCCCATACCAAAGTGGAGCACATCAAGGAGGAACCATTGGCGACATTGGCTCTCCAAAAGCCGCTGAATTTTCCACGTCGCAAAATGCAAACGGAACGCGGTGATACGCTGCCGATTTGTCAGCGTTGCAAACAGGTCTTCCTCAAGAAATACAGCTACACCAAACATGTCGCCCAGAGCAGCTGCGACATTGTCGAATATGACTTCAAATGCTCGATTTGCCCTATGTCCTTCATGTCCAGCGAGGAGCTGCACGTGCACGAGCAAGTGCATCAGGCACAGAAGTATTTCTGTCACAAGTACTGCGGCAAATACTTTGACACTATCGAATTGTGCGAGGTGCATGAGTATATGCAACATGACTATCCCGCCTATGTGTGTAATTTCTGTGCCGCTACCTATTCCACGCGTGAGCAGCTATTTTCGCATATGCCGCTGCATAAGCTACAGCAGCGCTACGACTGCCCCGTTTGTCGTGTGTGGTTTCAAAATGCGCACGAGCTGCATCAGCATCGCTTGGCCGCGCCGTACTACTGTGGCAGATACTATGCGTCCAAAGAACGAGCTGCTTCAGCGCCACACACGCGTACCACAAACACATTGCCAACCTCGCATCTACCAACTACACACAATCAACAGAACTACAATCTGCAGGAGTGTCGTATGGGCGTCATGGAAGTGCCCTCTTCCAGTAATCGCGATGCCGCCTCCTTTCTAAATCCTCAATGCTCGACGGCGAGTTTAAACAATAACTGCTTCCAGCCACCCACACAACTGAAGAGCGAAATCAAAATGGAAACCAACGACTACTTCTATCCGCACGAATTCGGACAGCACAATTTCAGCGATTTCGTCAATGATTCCTACGCCAATTCAATGCCGTTCGGCAATAGCCAccacagccacaacaacagcaatagtagcagtagcagcagtcGGGAATTCAAGGATCCCACGCTAAGCTTTAATTTCGCCGCACATCTGCCAGCCGGTTCCGCCGTGCACACTGCTGCTACAACCACTGCCCAGCCCATCATAGCATGTCGCAGCACATCCAGCGACGCGAACGACATTTGTTGCGTGCCTAAGTGTGGCGTGCGCAAGAGCACAAGCCCATCGCTGCAGTTCTTTTCGTTTCCAAAGGACGAGAAGTATCTACTGCAGTGGCTGCACAATCTAAAGATGTTCCAAACGCCACCGCTGACCTATGCCAGCTATCGCATTTGCAGTTTGCATTTTCCCAAGCGTTGCATCAATCGCTACTCACTATGCTACTGGGCCGTACCGACGTTCAATTTGGGCCACGACGATGTGGCGAACTTATATCAGAATCGCGAGCTCACCAACACCTTCACAACGGG
CGATCTGGCGCGCTGTAGCATGCCCGGCTGCACGAGTCAACGCGGCGAAAGCAATTTCAAGTTCTACAATTTCCCCAACGACACAAAATCGCTGATCAAATGGTGTCAGAATGCGCGCCTGCCCGTCTGCGCCAAGGAACCGCGCCACTTTTGCAGTCGGCACTTTGAAGAGCGCTGCTTCGGCAAGTTTCGCCTGAAGCCTTGGGCTGTGCCGACGTTGCATCTGGGCACGCCCTATGGCAGAATACATGAAAATCCCGGCCTATTTTATTTGGAGGAAAAGAAATGCTGTTTGACGCACTGTCGCCGCGCGCGCTCCTCCGACTTTAATCTGTCGCTGTATCGTTTTCCGCGTGACGAGATGTTGTTGCGTCGCTGGTGCTACAATCTGCGCCTGGATCCCTCTGTGTATCGtggcaaaaatcataaaatctgCAGTGCGCACTTTGTGAAGGAGGCTTTGGGTTTGCGCAAATTGTCACCAGGTGCTGTGCCTACGTTGAATCTGGGTCACAATGATACCTTCAACATATACGAAAACGAGCTATATCAACCGCCACCACCGCCACCGCCAGCACCACCGATCACACACACGCTCTCTCAGGCAGCCTCAAGCTCTGCCAAACTGTACAAATTCCATAATGCGGGCATGTCCGCGCTGAACTATGCAGCCATGTCGCTGTCGCCCACGGACTTCGCCAGCAACCTGTCTGCCGCTTCCACTTCCAATTCAGCTTCCACTTCGAACGCATACGACTCCATTGATGTGTGCTGCGTGCCGAAGTGCGGACGCAATCGGCATACGAACGGTGTCACGCTGCATACCATTCCGCGGCGCCTCGAGCAACTGCAAAAGTGGGCGCACAATCTGAAGATGGACTTGGAGCGGCTGCAGCACAAGGCGCTGCGCATTTGCAGCGCACACTTCGAGACCTACTGCATCGGCGGCTGCATGCGTCCTTTCGCAGTGCCCACGCTCAATTTGGGCCACGACGACACCAATCTCCATCGCAATCCGGACGTCATCAAGAAGCTGAACATTCGCGAAACCTGCTGCGTGCCCTGTTGCAAGCGCAATCGCGATCGTGACCATGCTAATCTGCATCGCTTTCCCTCCAACGCGGCGATGCTGCAGAAGTGGTGCGATAACCTGCATAAGCCCGTGCCTGACGGCAGCAAACTCTTCAACGACGCCATCTGCGAGGTGCACTTTGAGGACCGTTGCTTACGTAACAAGCGTCTGGAGAAATGGGCGCTGCCTACGCTCAATCTGGGCCTCGTGGAGCTGATGCACAAACTGCCCAGCGAGGCGGAGGTGGCCGAGCTGTGGTCGAAGCCCAGCGCGCCCAACACCGGCGAGAACGAGGGTGAGTGCTGCGTGGAGACGTGCAAGCGCAATCCGCAGGTGGACGACATCAAGCTCTACCGCCCGCCGGAGGACGGCGATGTGCTGGCCAAGTGGGCGCATAATCTGCAGCGAGAAGCGGCCGAGCTAACTAATTTACGCGTATGCAGCCTGCACTTCGAGTCGCACTGCAGCGGCAAGCGGCGCCTACACTCGTGGGCCATACCAACACTGAATCTGGGAAAAACGCTCGAGCAGCTGTATGAGAACCCCGAGCACATGCTCGTCATAAAGAAAGAGAAGCTGCACAAGCTGTACGACCCGATGAAGTCGTGGGCACCGCGATGCTGCCTGTCGCACTGCCGCAAGATGCGCGGAATCGACAACGTACAGCTGTTCCGCTTCCCTCACCGCCATAGCAAGACGCTGGCCAAGTGGTGCCACAATCTGCAAATGCCTATGGTGGGCAACATGCATCGCCGCGTCTGCTCCGCGCACTTTGACCCGCAGGTGCTGACGAAGCGCTGTCCCGTACCACAGGCGGTGCCCACGCTGGAGCTGAACACGCCGCCCGGCTACAAGCTCTACCAAAATCCGGCGCGCCTCAAGGCCAAAAAGTTGCGGCAGGACGTGTGCATCATACCTAGCTGCCGTA
AAGCGCGCGCTGACGGTGTACAACTCTTCCGCTTTCCACACAACAACAGCCTGCGCCGCAAGTGGGGCCATAACACGCACACGCGCGCTAACGAGGCCATGCGTGCCAACTGGCGAATCTGCTCCACGCATTTCGAGCCGCACTCTTTCGGCGTGAAGCGCCTGTGTCCCGGAGCCATACCCACGCTGCAGCTCGGCCACGACGACGAGAATATCTATCCGAACGAGGCGCAAACCTTGGCCGAGCAGCAGTGCATTGTGAATGGCTGCGAGGCTAACAAAGAGGTGCAGCAGGTGCGCCTATTCAAATTTCCCTGCGATGACGAGGATCAACTGTGGAAGTGGTGCAAAAATCTGAAGATGAACCCCATCGACTGCCAAGGCGTGCGCATCTGCTATCGCCACTTCGAGCCGGAATGCATGGGCCCAAAGATGCTCTACAAATGGGCCATACCGACACTGCAGCTGGGTCATGACGACGCCGAGATCGAGCTGGTGCCCATACCTAAGCCGGAGGAGCGCTACACTGAGCTGATATTCAAGTGTTGCGTGCCCAGCTGCGGCAAGACGCGAAAATTCGACGACGCGCAATTGAACAGCTTCCCGAAACACATTAAACTTTTCCGCCGCTGGAAACATAACCTCAGATTGGTGCACTTGAATTTCCGCGAAcgtgaaaaatacaaaatctgCAATGAACACTTCGAGCCGATATGCTTGGGAAAGAATCGACTGAACTTCGGGGCCATACCGACGCGAAATTTGGGTCACGGGCAGACTATTGGAATGTATAAGGTGAATCCGGCGCAGATACAaacgaaattattttgcaCGCCCAAACTAACTGTGGGCAGCGATGACGATGACGATGAAGCGAGTGAGGATGAGGAGGAAGAGGAGGAGGAGCAGCAGGAACAGGAGGAACAGGAGCAGGAGAGTAACACTGAAATGACAGAGGAAAATGAGGAGATTAATGGTAATGCAGACGCTCAAGCCAAATGTTGCCATGCCGCTTGCACGGCGCCCAAAACGCTGCTTCGTGAACCTTACGACATGCCGCAGCAGGCAGAGCTGCTAGCGCTCTGGCAAAAACACATACAAGTTGCTGAAGAGGAGCAGGAGCCGCGCCAACTGTGCGGATTACACTTTATGGCTGTCTATCAGGCCACTTTGACGGCAGCAAATGCGCTGCTGTCCTGCAATTCAGCGCTGGAGCCCGAGCTGCAAGCGCTTCAGGCCGCCTACGAGCGTTGTGCCAACTCGCTGCTCATATGCAGTGCGCAGTGTTGCGTGGCCGGCTGCACCAGCAATGAGCTCGGTCCGCACAAGCTCTATCAATTCCCACGAAACGCGGAGACGGCACAGCAATGGCGCTTCAACACGGGCGCGCAAGTGGAGCCGACTAATCCCAGTTTGCATAAAGTGTGCGCTCTGCACTTTGAGCCTCATTGCTTTACGGACATGCAACGTCTGCGCCCTTGGGCGCTACCCACGCTCGAGCTGCAGCACGACACGCCCTCGGAAATGTATCGCAATccagatttaaataaaattgacgTGAGCAGCTTAGGTCCAGCCACGCAGCAGTGCAGCGTGCGCGCTTGTGGCGCCACAAATACAGTCGCCAGTCCGTTGCGTCTTTACCGTTTTCCGCGTGACGAGGTGCTGCTGGAAAAGTGGCTGCACAATCTGCAGTTGGAGCGTGAGCAGGCGCCACTGTATCGCATCTGCAAGGCGCACTTCCAGCCACAGTGCTTCGTCGACCATGACAGCTGCAACTTGCGCGCCGGGGCACTGCCCACGCTGCAGCTGGGCCACAGCGACACAACGCACATACACGAAACTCTTGTGGAGGACGTtgacgagcagcagcagtcggaGCCGCTTGAGGATCCACCAACGCAAGTCAAAATGAAGACTTCACTCGATACGCTTAAATGCGTCGTGACCACTTGCCGTAAGAGCCGATTGCAGCATGGTGTGCGCCTCTTTCCGTTCCCC
GCCAGCGGCACCATGCTGCGCAAGTGGTGCTACAATCTGATGCTGCCGTTGAACATCGCCGACAAGCAGCCGCACATTTGTAACATGCACTTTCACAAGCGCTGCATCGAGGGCAAGCAGCTGCGCCAGTGGGCCAAGCCTACCAAGCATCTCGGCCATAGCAACCCCATATATGACAACCCGAAGAATTTGCCGGGCATCTTTCTGCCCATCTGCTGTCTGCCGCATTGCCGCAAGCGGCGCACTTTGGACAATGATCTGCGCACCTTCGGTTTCCCGAAAAATCGTACCATGCAAGAGAAGTGGTGCGAAAATCTGCGCGTGCGGCCGACGGCCAGCCAGGCACGCCTCTGCGCGGAGCACTTCGAGCCGCAAGTGCTGGGGCATCGCAAGCTGCGCACCGGCGCCGTGCCTACATTGAATTTGGGGCACAGCGAGCCGCTGCAGCATGACAATCGCGTTATTATAGAGAGCAGGGATACGATGTGGCAGCAGAGCGACACGgcggcgcagcagcagcaggaggaGGAGCAGTCGCAGCTGCATGAGTTGAACGACAGCTCGGCGGATTTCGGTGACTACGACAAAGAGGTGTACTATAATGAGGTATATGGCGAGGAGTTGGCTCACCAGGCGGCGCTCGATTTGACAGCCGACAGCGAGGACGATGAGCAGCCGTCTAACGCAGGCACAGCAAATGCTGCAGAATTCTGCGATCCCATGAGCTATTTGGAGTGCGTTGTGGAGGAGCAGGCTGGCACAATGCCCATCAAAAAGGAGCGCGTCGTCAACAACATCTCCCCGATTTGCTGTCTGCCGCATTGCGGCAAGCAGAAGACGCCCGAGCAGCATCTCAGCACTTTCGGTTTTCCGAAGGACCCTACAGTGCTGGCGAAGTGGGCGGCCAATCTGCACCTGCGCCTGGAGGACTGCATCGGGCGCGTGTGCATCGACCATTTCGAGCTGCGCGTGGTCGGCAATCGGCGCTTGAAGACGGGCGCAGTACCCACCATAAATCTAGGCCACAATGACCACTTGCCGTACATAAACACAGTGGATGCAGACCCCAAGAAGCAGCAGTCAAAACAAACTCCTTTTATAAAGCTTGAGCCGCAGTATCTTAGCGACGGCTGCACCACACCCACCTCGGCAACATCTACGCCCTATGCTGAGAAGCTTAACCATTCGGTTTTTCGGCTTTGTTGCCTCAAACATTGTCGTAGAAAGAAAGCCGCGGCGGCGGCGATGGGCACGCCGGAGCAACACATACGCACTTTCGGATTTCCCAAAGCTGAGCAGCTCTACGTCAAGTGGTGCGAGAACTTGCGCCTCGATCGTGAGTACTGCCGCGGTCGTCGCGTCTGCATCGACCACTTCGAGCCGGCGGTCGTGGGCAAGCAGAAGCTGCATCCGGGCGCCGTGCCAACACTCGACTTGGGCCCGCAGCAACCAGAGCCACTGCATCGCAACTCCGAGCTGACGCTGCACTACGTGCCGAGTAACAACAACATCTGCAATGTGCCCGGTTGCGGACGCGCCGCCAATGCAGACGCCGACGACGGCGTGCGACTCTTTCGCTTTCCGCACGACGCGCAGCTGCTGGCGCAGTGGTGCGATAATCTGCAGCTGCGGCATGGCAATTGCGAGAACTACAAAATCTGCGAGCGGCATTTTGAGCCGCAGTGCCTCGGCAGCACGCGCCTCATGGTCGGCGCTATACCAACGCTGCATCTGCCGCATGGCGGCAGAATGGCACCGAAGCACGTCACCAATCCGCTGAGCATCATGCGCAGCAGAGTGTGCTGCATAGCGGCCTGCCGAAAGGCGTCGAAACAGCCGCCGACATCGCTCTTCCTCTTTCCGAAGCCGAGCCAGCCGCTGGCGCGCAAATGGTACCACAACACGCAGCAGAAGCTGCAGCGGCACGCACTACGGCCGCGCATCTGCGTCGCGCACTTTGAGCCCCACGCGATGCTGGCCAACGGCCGGCCGCGACCTTG
GGCGGTTCCCACGCTCAAGCTGGGCCATGCGGACGCCATCTTTCAGAATCCACACAAGCTGCTGAAGGAGCTGCAGCGTGAAAAGTGCTGCCTGCCGAACTGCCAGGCCAGTGAGCTCGTGCCGCTGTACAGCTATCCGCAACGCGGCGCGCTCCTGCGCCAGTGGGCGCACAACACCCAGCAGGACGCGTACCTGGCGCAGCGCCGCAACTACAAGGTATGTCGACGGCACTTCGACAGTGGCTGCTTTGCCGACGACGGGAGTCTCAAGAGTGGCGCCGTTCCCACACTGGAGCTGGGCCGCGACGTAGGCGATATCTACACTCCCGAGCCGGAGATAACGGAGCAGACGCACAGCCAGTGCTGCTCCATACGCAAGTGCGGGCGCTCGGTGCGCATCGATGGCGTAAAACTGTTCCCCTTTCCGCTGCAAAACCCGGAGCTGCTGCATCGCTGGTGCCACAATCTACATCTTTCGGCGGCCGACTGCGCGCAGCATCAAATCTGTAATCTGCACTTCGAGGCTGGCTGCCTGCATAAGCGACAGCTGCATGAATGGGCCGTGCCGACGTTGCTGCTGGGCCGCGGCGCCGCCGAGCAGCCGATGCAGCTGTATCGTAATCCCGAGCTGGAGCAGTTGAAGCCGCTGCCCAGTAGCGCCTACTGCTGCGTGTCCAGCTGCGGCAAGTGCCGCCAAATGGATGGCGTGCGCATGTACAGCTTTCCGAAGCAGCGACCGCTGTATCTACGTTGGACACACAACCTCAAGCTGCAGCCGACGGCGCGCCTGCTGAGCGCCTACAAGGTGTGCCACGAGCACTTCGAGGAGTACTGCAATGGGCCGATGCACCTGCGGCAGGGCGCCGTGCCGACGCTGCGTCTGGGCCACGCCGACGACAACATCTACCGTAACAACCGCGCGCAGCTGACGGAGAGTGGCAAGCTGGAGGTGGAGGTGCAGCCGCCGCCAACGCTGCAGTGCGTCGTGCCGCAGTGCCATCTGGCGCAACTGCTACGCATGCCACTCTACAACCTGCCCGCGGAAGCGTCGCTGCGTCGGCGTTGGTGCGAGCAACTGAAATTGGCCGATCCGCCGACGACGGCCAAGCTGTGCGCGCTGCACTTCCGACAGACCTACAACGAGTGCAATGCGCGCGACAACGACGAGAATCAGAATGTGCAGACGCCCGCCGCGGTCGCGCCGGACAAGAGCCTCTGCAGTGACTACGCGGCCATTTGCCGCACAATGCGCCTACTGGCGTACCAGTGTAGCGTGCCCGGCTGCGACACGAACGCCACCAATGCGGAGCTGCGTCTGTACAAGTTTCCCGCCAGCGCCGCACTTTTCGACAAGTGGCAGCAGAACACGCACGTGCAGCTGGATGCCAAGTTTCGGCGCCGCGGCTGGTACAAGGTGTGCGCGCTGCACTTCGAGTCGCGCTGCTTCGGCAAAGCGCAGCGGCTTTACACTTGGGCGCTGCCcacgctgcagctgcagcacaCAGCGGAGCATGCCGTACACATGCTGCCACTCGATgtcgagcagcagcagcaagacgACTCCGCGGCAAAGATCAACGCGCGCGAGTGCTGCATCGCTTCGTGCCGACAGCAGCGAGATCGAGACGCTGGCATACGGCTGTACGCCTTTCCCACCAGCAACGTTTCACTGTTCCAGCAGTGGCTGCACAATGTGCGCCTGGAGGCAGCGCACTGTCGACGGGCGCGCATCTGCACGCTACACTTTGAGCAGCGGTGCGTCGGTAAGCGGCTGCACAAGCGCGCTGTTCCTACGCTTGATCTGGGCCACAGTGAGGAGCACATATACCGCAATAAACGACTGAAGAAGAAGCGCGTGGCACGTTGTCTGCTGGCGCATTGCCGCAAGACGCCACGTTTGCAACGCGTGCAGCTGCACGCCTTGCCGAAGTTGGaggcgctgctgcagcaatgGCAATATAACCTACATTTGCCGGCGACtgcaaaaattggcaaaattt
gCAGCGCACACTTTGAGGCGTGTTGCTACGACGCCGCTCACCGCTTGAAAAAGTGGGCTCTGCCTACTTTGGATTTGGGACATGACGGCGCTGTGCTTGAAAATCGTACGGCAGCaatgaaagaagaagaaacagcagcagaagtggGCAACTCAATCAAAGTAAAAGCAGAAACCCAAACAGCTATTGAAGCCGTTaagaaagcagcagcagtggaagCAGCCGAAGTAAAAGATACATTGAAAGCAATTACGGCTAGTGGTGCCGCTATTGAGCATTGCGCCATACCTGGTTGCTCCCGGACGTCAGCCACTGCCGACTGCCAGCTGTATGCCTTTCCCAAAGCCGCTTGGTTGCGGCAGAGTTGGTGTGATAATACGCACCTACCCTTCGCGGAGGCGGCGCAGCTGAAAATCTGTGATGCGCACTTTGAGCCCAATGTCAAGGGCAGCATGAACCTGCGTCTCTGGTCGCGGCCCACATTGCTGCTGGGTCACAACGACGCCATACACGATAACCCCAAACTGCCATGCACGGTAACCTCGAAGTACGTGCGGCGCAACAGCTGCGCCATCGTGTCGTGCGGGAAGTCCATGACGGACGGCGTCCAGCTGTTCCAATACCCAAAGAAGAGTGCGCTGCTGCGGAAATGGGCCAAGAACTGCAAGCACAGCGTTTCTCAAGCGATTCGCGACAAATTTCGCATCTGCAACGAGCACTTCGAGGCGTACTGCTATGAGCACGGGCAGCTGAAGTGCGGCTCCATACCGACGCTGCAGCTGGGCCATGACGACGGTGATATCTTTACTATGCAAAGGGAGCTTCTGTCGGATGTGTTGGGGGACGAGGAGAGCAGCGAGCTGAAGTGCTGTCAGCCGCAGTGCGCGGAAGTGAGCAATGTCAAGCAGCTATACGAGCTGCCAAAGGCAACTGTTCTGAGAGCTGCTTGGCTGGCGAGCATGGGCTTACAGGGCACTGTGGAAAAGAAAGCGGCAGTTGTAGAGCAGCAGCATGCCCTGAAAGACAACGGTACTGCTGCAGCAGAGGAGGCAAAAAATAtggacagcaacaaaagcgcaGAAATATGCTACATCGAAACAGCAACAGAGATAAAAATTGCACGGCAAATCAAAATGGAGGACCACGGCAGTGAGGCAGAAAATGTCCAGCTTGCTCGGGAGGATGAAGTGGACgatgcaaaaacaacagcagatATAACTCTTGTAAAGCAGACGAAATTGCCGGAGtacaaaacagcagcagagtTCAGTATTTCCCTAGAATTCGACGCCGAACTAGCAGAAGACGTAAAGCTTTTAAGACATGGGGAATCGCAGGTCCCGAGCGCCGCAGAAGAAAGCAAAACTTTGCAAGAGGATGGtgccaaaattgcaaaatttttggaggAGGTCGAAATAAGCGATTTTAGAACGGCAGAAAAGAGTCAGCGAGAGCGTGTAGCTAACGATGACTTTAAAGCAACAGAAAATTCCATGGATGAGGAAGCGCTCGAACGCAGCGAAGCAGCAGAGACCGAACTTGAAGTAGACGGCAAGCAAAAAGAGGACGCTGCACTCCGACTCTGTGCAGTACATTTTAAAATCAGCTACGACAAAAATGCACCGCTTCTAGAGCAGTTGCTGGCATCCGCCGAGACCGACACAGATTTGCGGGACAGCCTGCAGAAGTTGCAAAGTCAACATGCGGCCGTCTGCAACACAACACGCGTGCGCAGCCTCTCATGCGCCGTCGTCGGCTGCCGCACGCGCGTACTCCAACACGAGCCCATAAAACTGTATTCGCTGCCGTGCAACCGCGAGCTGCTGCAGAAGTGGCTACACAACACGCAAGCGGCGGTTTGCGAGGAGCGATCCTCGTATATGAAGGTGTGCGAGCTGCACTTCGAGAGCAACTGCTTTGGTGAGTCGATCTCCAAGCGGCTGAAGTACTGGGCGGTACCCACGCTGCAGCTGCCGCCGGCACGTGACGACGACACACAG
GCGCCTTTCAGCAATCCCGCCATAGAGCAGCAGGAACGACATATGCTAGAGCGCAAGTGCTGTATAGCCGCGTGTAAGCACGCGCGGACGCAGGCGGAGCCGCACGATGTTCAGCTGTACAGCTTCCCCAAGAACACGGAGCTACTGCAGAAGTGGCTGCACAACACACAGCTGAGCCGGCGCGAAGCGTTCAGCGCGCGCATCTGTGCGCTGCACTTTGAGAAGCATTGCATTAACAAGCGCATGCGCTACTGGGCGCTGCCCACTTTGTTGCTGGGCCATGACAACCCGAATATATATCTAAATCCGGGTACGACTACGAGTGCAGTAGGCACAGACACAGGCGCTGCCGACAACGTGGCGGCGAGCAGCACTAACAATGCGCCGTTGGAAGAGGAATCCAATGACATGGAGCTGCTTGATCCTCCTATGAGCATCAAAATAAAGCTGGAGATTGAAAGTGACGATGAAGTGCacgagaagcagcagcaggaagaCGAGCTGGAGGAGGAAGAGGAGCAGCAGGACGAGATGGAGGAAGAGGAGCAGTATGACTCGAGGCCTTCGCGACAGCCGCACACCTCGCGGCACTGCCAGATCGTTGGCTGCCGAGGACATGCTGACCAGCCGGGCATCACGCTGCACAAATTTCCCATACCGGAAGAGCTGTTCCGCAAGTGGCTGCACAACACACAGATTAACGTGGTTCGCGAGATCCGCTGGAAGTATCGTATTTGTAGTCGGCACTTTGAGGCGGCGTGCTTTCGCGGCAGACGTCTGCAGCCGGGCACCATGCCCACGCTGCTGTTGGGCCCGAAGCAGCCGACCACAATCTATGAGAATGAATTCGCCAGCGAGGGCAGCAGTCGCGCCACTGAAAACGCCTCTCCAAATGGCGACGGCGACAGCGCCAGCTACGACGAATTCGAGCCTGCGCGTCTGGTCATGGCCAGCTACGACAGCAACTCAAAGGAGCCACTGGAGCAAGACAAACCACTCGAGATTAAGCTGGAGGTACAGCACGTGGAAGAGTTCAAGGAACACTTCGAGGCAGCGGTTGATTACGCCCACTTGCAGCTGCATGCACCGCTGCGCGTCAACAAGAAGACCAGCTGCGCCATTGTGGGGTGCACCAGTGAAGCCGGGCACGTAGGCATCACGTTGCACAAGTTCCCCATGTCCGAGGAACAGTGCCGCAAGTGGGAGTACAACACGCAGATCGACGTGGACCGCAACAGCCGCTGGAAGTATAAGATCTGCAGTCGGCACTTTCAGGCGAACTGCTTTCGCGGCAAGCGGCTGCTGACGGGCACCATGCCCACGCTGCATTTGGGCGCCAGCCGGCCGGCGGAGATATATGAGAATGAGTTCTATCGAAATGACGGTCAGTCGGAGCCGGTGGATGAGACGTGCCAGGAACCACCGATTGACGCGCTGCAAGTGGTCAAAAACGAAGTGCTGGATGACGAAACGAGCGACCCGGACGATATGACGCCGCTGAGCAGCTACCTGGATGTGACGCTGGCGGAGCAGCAGCTACAGCCGCAGACGCAGTTGGAGGCCGCACTCAAAAAGAAATACATGCGAGAGTTCGATCAGAATATGTCGGCCGATGCGGACCTAAGCTACGACGCCGCTCCCGAGGAGAGCCTCTCGCTCAGTCTGCAGAGTAACACGCGCCACTGCCGCGTCGTAGGTTGCAACAGCCACCTCGGCCAACTGGGTGTCAAGATGCACAAGTTCCCGCTGCCCGAGGATCTCTTCCAGAAGTGGATGCACAATACGCAGGTGCAGGTGGATCGCACCTGCCGCTGGAAGTATAAGATTTGCGGGCGCCACTTCGAGCCGGGCTGCTTAAGGGGCAGACGTTTTTTCACCGGCACCATGCCGACACTGCATCTTGGTCCCAACCGCCCGGCGAAGATCTACAAAAACGAGTTCGTGCTGTTCAAAGCGACAAGCGGCAACGCGCTGGAGCACAGCCATGAGTTGTCTGCGGA
GCTGGATGTGAGCGTGCTGTCAGCCGGCGATGATGACGATGATAGTTATTACGCGCCCGAGGCTGAGGTAGTATACGAGGACTCTGTGCCGGCGCTAACGCATTTCATGGATCCCAGCGGTAGCGCCGTACTGGCGGAAGAGAGCAACAGTCACCTGGACTCCGTTTCGGACTTGTATCAGCCAGATTTGGACTTGGAGTTGCAGCTGACGGCGGTGCCGGACTTTGAGGAGCAACTGGACGTGACGCAGCGCCAGCACTTGTCGACCAATAAGAAGCGCATCTGCAGCCTGGTCGGCTGCAATGTACACGTGGATCAGGAACCGGGCATACGGTTGCACAAGTTCCCCGTAGCGCCGGAGCACTTCTCCAAGTGGGTGCACAATACTCAGCTGGACGTGGACCGCGAGATGCGCTGGAAGTACAAGCTGTGCAATCGACACTTCGAGCCGGAGTGCTTCCGCGGCATACGTTTGCAGCCGGGCAGCATGCCCACTCTGCACTTGGGCCCCAACAGCCCGGCGTTCATCTACGAGAATAGCTTCACGCGGCGCCAAGTACAATTGCCCGGCTGGCAAATGCCTTCGGTGGAGCGCTCTTGCTGTATCCCGAATTGTGAGGGCTCATCGGCCACGCTATATGATTTTCCCAAGTGCAAGCAGTTGCTGGCTAAGTGGCTGCCACACCTCAATATAGAATACGATGCGGCGCGCCACTGGGCGTATCAGATTTGTGCGCGGCACTTCAAGGCGCACTGCTTCGAGGGCGACCAATTGCGGGAAAATGCCGTACCTACTTTGAATTGCGGTGCGACGACTAAGGCGGAGAAGTCGTTACGCAGTGCAAATGCAGGCTTGCCCCTGTCACCTTCCAAAGCTGCTAGCGAGGACTTTGTCTACAATGAAATCAAATCGGGCTACCAGAAATGCAGTCTCATACACTGTCAGAAGCAAGTGGCCAAGGACAATGTGAAGACCTATAAATTCCCAAAATCGGCGGAGCAACAGCGGCAATGGTCTCACAATCTGCGTATACAGTACGATCCTCAACGACCCTGGAAGTATCTGATTTGCAGCGCGCACTTCGAGGCGCAATGCGTGGACGAGCAGCAGGAGGGCAAGACGCAGTTGCAGCCCTGGGCTGTGCCTACGCTGAATCTGGGCGACAATGTGCCTGCCATACGCTACACAAATGAGAAAACGCGAGCACTGTACCTGGCTCAAACGGAAGCTGGCAGTGATAAAGAGGAGGCGGCAGAAATGGAGCTGGAGGCAGAGACGGAGCCGTCACAGGAGGAGCCGGCTGCTAAGCATATGCGTATGGACTTCGGCGAAGGCACTCCCGATTTTCACTGGCTGGAGGACAGCCTCTCTCACGATGCCGTGGACAAGCCTACTAAGCAGAAGTCATTTGGCGATGTGAAAAGCGTGAAGAAATGCTCGCTGGCGCATTGCCAGCTGGAGCGCGGTGCGGGCGGCTGCAAGTTGTTTCGCTTTCCCACAGACGCTGAGACGCTCAAGAAATGGAAGCACAATCTGCGCAAGAAATTTAACGCGCATCAGCGCAATGTGCAGCGCGTCTGTAGCATGCACTTCGAACCGCACTTGATAGGCGTGAAGCGGCTGCTGAAGGGCTCCGTGCCGACGCGCAATCTGGGCCACAACGACAAGAACATCTACGACAATCCCAAAAGCATAATACAAACTGCCAGCGGTGTCGCCGCTTCGAGTGTGGACCAGCTGGTCTGCGACGTGCCCGGCTGCGGCCGCTCCGAGGCCAGCGAGGGCGGCGCGCGCTTCTTTCAGTTTCCCGAGGCGCAGCGGTACACATGCATCTGGCAGCGGCGCCTGCAAATCGACTATGAGCCCGCCAGGCGCGACGACATTCACGTGTGCGATGCACACTTCAATCCGCAATTCATATCGCACAACGGCCTGGACACGAATGCTGTGCCAACGCTGAATCTGCCGCCGGAGCGTTTCCAAGTGTGCGCGGTGCCCG
GCTGCGGTAACTCGGGGCTGGaaatgttttcgctgccaaagcaTGATGACTTCCGCCGAACTTGGCTGAAACGGCTGCAAATCGCCGCCTGCGACGCGCAGGAACTGCGCAACATGCAAGTGTGTGCGGCGCACTTCGAGCAGAGCTACTTCCGCGGTCTGAAGCTCAACAGCAAAGCGCTGCCAACGCTGCGCCTGCCACAGGAACAGTTGATCGCCGACCCGCCAACTGTTCACGTTgtggagcagcaacagcagccggaCGTTACGTCCGCGTACTGCAGCGCGCCGCATTGCATCAACAACGCTGCGACGATGTGCAGCAGCAAACTCTACAGATTCCCGGAGAAACGCAACCTGTACGTGAAATGGTGTCACAATCTGCAGCTGGACTACGAGGCAATGACGCACCGCAATTCGCAGTACGCCATCTGCCCGAAGCACTTTGAACCGCACTGTCAGCGTCACGGCAAGCTGCACCCAGAGGCCGTGCCCACTTTGCATTTGGGCCATGCCGATCCGAACATCTTTCAGAGTACGGCCAGCTTTGTCACGCAAACCACTACTGACCGACACACCACTGGATCGCTCGGCTACGACGAGCACAGCATACTGCGTAGCAGCCTGAGCTCGCACAACACAAGCGGCACGCTGCAGGCGCACAGAGAGCGCCAGTTGGAAAATGATGAAGGCAATGATGCCAGCTACGCCGACTTCGCGCCGCAGAGCAGCACCTTCATGGCCAGTGATTTGGATGAGAGCATGCAGTTGTCGTCCACtgcgccgcagcagcagcgccgttTGCCGCCTTGGCAAGCGCTCAAGCAGCTGCCGAAATGA
Protein Sequence
MSQQNQNKQQQLAQQQQQQQQRQQQQWHAHVGPAISVAGANLHSMFTGGVGCGGSTNNIGGPFSGGAALGCDGSFDLEMRAHSVGLGSRAGGVGVVSTPTFDAFFPQNQQQQQQQRRQQQLQQVLYAAHSQPQLQLQPPIPNIKMEPLEPQQQQEEPEHDMDILTPTIEMEELIIKTEPNTEDSMYKMNFPHDDNFGFSNYTRHNQQTMTLSMPHTKVEHIKEEPLATLALQKPLNFPRRKMQTERGDTLPICQRCKQVFLKKYSYTKHVAQSSCDIVEYDFKCSICPMSFMSSEELHVHEQVHQAQKYFCHKYCGKYFDTIELCEVHEYMQHDYPAYVCNFCAATYSTREQLFSHMPLHKLQQRYDCPVCRVWFQNAHELHQHRLAAPYYCGRYYASKERAASAPHTRTTNTLPTSHLPTTHNQQNYNLQECRMGVMEVPSSSNRDAASFLNPQCSTASLNNNCFQPPTQLKSEIKMETNDYFYPHEFGQHNFSDFVNDSYANSMPFGNSHHSHNNSNSSSSSSREFKDPTLSFNFAAHLPAGSAVHTAATTTAQPIIACRSTSSDANDICCVPKCGVRKSTSPSLQFFSFPKDEKYLLQWLHNLKMFQTPPLTYASYRICSLHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGDLARCSMPGCTSQRGESNFKFYNFPNDTKSLIKWCQNARLPVCAKEPRHFCSRHFEERCFGKFRLKPWAVPTLHLGTPYGRIHENPGLFYLEEKKCCLTHCRRARSSDFNLSLYRFPRDEMLLRRWCYNLRLDPSVYRGKNHKICSAHFVKEALGLRKLSPGAVPTLNLGHNDTFNIYENELYQPPPPPPPAPPITHTLSQAASSSAKLYKFHNAGMSALNYAAMSLSPTDFASNLSAASTSNSASTSNAYDSIDVCCVPKCGRNRHTNGVTLHTIPRRLEQLQKWAHNLKMDLERLQHKALRICSAHFETYCIGGCMRPFAVPTLNLGHDDTNLHRNPDVIKKLNIRETCCVPCCKRNRDRDHANLHRFPSNAAMLQKWCDNLHKPVPDGSKLFNDAICEVHFEDRCLRNKRLEKWALPTLNLGLVELMHKLPSEAEVAELWSKPSAPNTGENEGECCVETCKRNPQVDDIKLYRPPEDGDVLAKWAHNLQREAAELTNLRVCSLHFESHCSGKRRLHSWAIPTLNLGKTLEQLYENPEHMLVIKKEKLHKLYDPMKSWAPRCCLSHCRKMRGIDNVQLFRFPHRHSKTLAKWCHNLQMPMVGNMHRRVCSAHFDPQVLTKRCPVPQAVPTLELNTPPGYKLYQNPARLKAKKLRQDVCIIPSCRKARADGVQLFRFPHNNSLRRKWGHNTHTRANEAMRANWRICSTHFEPHSFGVKRLCPGAIPTLQLGHDDENIYPNEAQTLAEQQCIVNGCEANKEVQQVRLFKFPCDDEDQLWKWCKNLKMNPIDCQGVRICYRHFEPECMGPKMLYKWAIPTLQLGHDDAEIELVPIPKPEERYTELIFKCCVPSCGKTRKFDDAQLNSFPKHIKLFRRWKHNLRLVHLNFREREKYKICNEHFEPICLGKNRLNFGAIPTRNLGHGQTIGMYKVNPAQIQTKLFCTPKLTVGSDDDDDEASEDEEEEEEEQQEQEEQEQESNTEMTEENEEINGNADAQAKCCHAACTAPKTLLREPYDMPQQAELLALWQKHIQVAEEEQEPRQLCGLHFMAVYQATLTAANALLSCNSALEPELQALQAAYERCANSLLICSAQCCVAGCTSNELGPHKLYQFPRNAETAQQWRFNTGAQVEPTNPSLHKVCALHFEPHCFTDMQRLRPWALPTLELQHDTPSEMYRNPDLNKIDVSSLGPATQQCSVRACGATNTVASPLRLYRFPRDEVLLEKWLHNLQLEREQAPLYRICKAHFQPQCFVDHDSCNLRAGALPTLQLGHSDTTHIHETLVEDVDEQQQSEPLEDPPTQVKMKTSLDTLKCVVTTCRKSRLQHGVRLFPFP
ASGTMLRKWCYNLMLPLNIADKQPHICNMHFHKRCIEGKQLRQWAKPTKHLGHSNPIYDNPKNLPGIFLPICCLPHCRKRRTLDNDLRTFGFPKNRTMQEKWCENLRVRPTASQARLCAEHFEPQVLGHRKLRTGAVPTLNLGHSEPLQHDNRVIIESRDTMWQQSDTAAQQQQEEEQSQLHELNDSSADFGDYDKEVYYNEVYGEELAHQAALDLTADSEDDEQPSNAGTANAAEFCDPMSYLECVVEEQAGTMPIKKERVVNNISPICCLPHCGKQKTPEQHLSTFGFPKDPTVLAKWAANLHLRLEDCIGRVCIDHFELRVVGNRRLKTGAVPTINLGHNDHLPYINTVDADPKKQQSKQTPFIKLEPQYLSDGCTTPTSATSTPYAEKLNHSVFRLCCLKHCRRKKAAAAAMGTPEQHIRTFGFPKAEQLYVKWCENLRLDREYCRGRRVCIDHFEPAVVGKQKLHPGAVPTLDLGPQQPEPLHRNSELTLHYVPSNNNICNVPGCGRAANADADDGVRLFRFPHDAQLLAQWCDNLQLRHGNCENYKICERHFEPQCLGSTRLMVGAIPTLHLPHGGRMAPKHVTNPLSIMRSRVCCIAACRKASKQPPTSLFLFPKPSQPLARKWYHNTQQKLQRHALRPRICVAHFEPHAMLANGRPRPWAVPTLKLGHADAIFQNPHKLLKELQREKCCLPNCQASELVPLYSYPQRGALLRQWAHNTQQDAYLAQRRNYKVCRRHFDSGCFADDGSLKSGAVPTLELGRDVGDIYTPEPEITEQTHSQCCSIRKCGRSVRIDGVKLFPFPLQNPELLHRWCHNLHLSAADCAQHQICNLHFEAGCLHKRQLHEWAVPTLLLGRGAAEQPMQLYRNPELEQLKPLPSSAYCCVSSCGKCRQMDGVRMYSFPKQRPLYLRWTHNLKLQPTARLLSAYKVCHEHFEEYCNGPMHLRQGAVPTLRLGHADDNIYRNNRAQLTESGKLEVEVQPPPTLQCVVPQCHLAQLLRMPLYNLPAEASLRRRWCEQLKLADPPTTAKLCALHFRQTYNECNARDNDENQNVQTPAAVAPDKSLCSDYAAICRTMRLLAYQCSVPGCDTNATNAELRLYKFPASAALFDKWQQNTHVQLDAKFRRRGWYKVCALHFESRCFGKAQRLYTWALPTLQLQHTAEHAVHMLPLDVEQQQQDDSAAKINARECCIASCRQQRDRDAGIRLYAFPTSNVSLFQQWLHNVRLEAAHCRRARICTLHFEQRCVGKRLHKRAVPTLDLGHSEEHIYRNKRLKKKRVARCLLAHCRKTPRLQRVQLHALPKLEALLQQWQYNLHLPATAKIGKICSAHFEACCYDAAHRLKKWALPTLDLGHDGAVLENRTAAMKEEETAAEVGNSIKVKAETQTAIEAVKKAAAVEAAEVKDTLKAITASGAAIEHCAIPGCSRTSATADCQLYAFPKAAWLRQSWCDNTHLPFAEAAQLKICDAHFEPNVKGSMNLRLWSRPTLLLGHNDAIHDNPKLPCTVTSKYVRRNSCAIVSCGKSMTDGVQLFQYPKKSALLRKWAKNCKHSVSQAIRDKFRICNEHFEAYCYEHGQLKCGSIPTLQLGHDDGDIFTMQRELLSDVLGDEESSELKCCQPQCAEVSNVKQLYELPKATVLRAAWLASMGLQGTVEKKAAVVEQQHALKDNGTAAAEEAKNMDSNKSAEICYIETATEIKIARQIKMEDHGSEAENVQLAREDEVDDAKTTADITLVKQTKLPEYKTAAEFSISLEFDAELAEDVKLLRHGESQVPSAAEESKTLQEDGAKIAKFLEEVEISDFRTAEKSQRERVANDDFKATENSMDEEALERSEAAETELEVDGKQKEDAALRLCAVHFKISYDKNAPLLEQLLASAETDTDLRDSLQKLQSQHAAVCNTTRVRSLSCAVVGCRTRVLQHEPIKLYSLPCNRELLQKWLHNTQAAVCEERSSYMKVCELHFESNCFGESISKRLKYWAVPTLQLPPARDDDTQ
APFSNPAIEQQERHMLERKCCIAACKHARTQAEPHDVQLYSFPKNTELLQKWLHNTQLSRREAFSARICALHFEKHCINKRMRYWALPTLLLGHDNPNIYLNPGTTTSAVGTDTGAADNVAASSTNNAPLEEESNDMELLDPPMSIKIKLEIESDDEVHEKQQQEDELEEEEEQQDEMEEEEQYDSRPSRQPHTSRHCQIVGCRGHADQPGITLHKFPIPEELFRKWLHNTQINVVREIRWKYRICSRHFEAACFRGRRLQPGTMPTLLLGPKQPTTIYENEFASEGSSRATENASPNGDGDSASYDEFEPARLVMASYDSNSKEPLEQDKPLEIKLEVQHVEEFKEHFEAAVDYAHLQLHAPLRVNKKTSCAIVGCTSEAGHVGITLHKFPMSEEQCRKWEYNTQIDVDRNSRWKYKICSRHFQANCFRGKRLLTGTMPTLHLGASRPAEIYENEFYRNDGQSEPVDETCQEPPIDALQVVKNEVLDDETSDPDDMTPLSSYLDVTLAEQQLQPQTQLEAALKKKYMREFDQNMSADADLSYDAAPEESLSLSLQSNTRHCRVVGCNSHLGQLGVKMHKFPLPEDLFQKWMHNTQVQVDRTCRWKYKICGRHFEPGCLRGRRFFTGTMPTLHLGPNRPAKIYKNEFVLFKATSGNALEHSHELSAELDVSVLSAGDDDDDSYYAPEAEVVYEDSVPALTHFMDPSGSAVLAEESNSHLDSVSDLYQPDLDLELQLTAVPDFEEQLDVTQRQHLSTNKKRICSLVGCNVHVDQEPGIRLHKFPVAPEHFSKWVHNTQLDVDREMRWKYKLCNRHFEPECFRGIRLQPGSMPTLHLGPNSPAFIYENSFTRRQVQLPGWQMPSVERSCCIPNCEGSSATLYDFPKCKQLLAKWLPHLNIEYDAARHWAYQICARHFKAHCFEGDQLRENAVPTLNCGATTKAEKSLRSANAGLPLSPSKAASEDFVYNEIKSGYQKCSLIHCQKQVAKDNVKTYKFPKSAEQQRQWSHNLRIQYDPQRPWKYLICSAHFEAQCVDEQQEGKTQLQPWAVPTLNLGDNVPAIRYTNEKTRALYLAQTEAGSDKEEAAEMELEAETEPSQEEPAAKHMRMDFGEGTPDFHWLEDSLSHDAVDKPTKQKSFGDVKSVKKCSLAHCQLERGAGGCKLFRFPTDAETLKKWKHNLRKKFNAHQRNVQRVCSMHFEPHLIGVKRLLKGSVPTRNLGHNDKNIYDNPKSIIQTASGVAASSVDQLVCDVPGCGRSEASEGGARFFQFPEAQRYTCIWQRRLQIDYEPARRDDIHVCDAHFNPQFISHNGLDTNAVPTLNLPPERFQVCAVPGCGNSGLEMFSLPKHDDFRRTWLKRLQIAACDAQELRNMQVCAAHFEQSYFRGLKLNSKALPTLRLPQEQLIADPPTVHVVEQQQQPDVTSAYCSAPHCINNAATMCSSKLYRFPEKRNLYVKWCHNLQLDYEAMTHRNSQYAICPKHFEPHCQRHGKLHPEAVPTLHLGHADPNIFQSTASFVTQTTTDRHTTGSLGYDEHSILRSSLSSHNTSGTLQAHRERQLENDEGNDASYADFAPQSSTFMASDLDESMQLSSTAPQQQRRLPPWQALKQLPK

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-