Basic Information

Gene Symbol
-
Assembly
GCA_960531455.1
Location
OY482674.1:108937870-108976665[+]
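The location string above packs the sequence accession, coordinate range and strand into a single token. A minimal Python sketch for unpacking it follows. The names `Location`, `parse_location` and `span_length` are hypothetical, and the reading of the coordinates as 1-based and inclusive is an assumption that this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "ACCESSION:START-END[STRAND]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("OY482674.1:108937870-108976665[+]")
print(loc.seqid, loc.strand, span_length(loc))
```

Under that coordinate assumption the span covers the whole locus on the plus strand, so it can be longer than the coding sequence listed further down.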

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably in worms and flies, but not in other eukaryotes or in any prokaryotes [1].
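The C2CH consensus quoted above can be written as a loose regular expression. The sketch below is only an illustration: the pattern and the helper name `find_c2ch` are assumptions, the pattern ignores the AVPTIF box and the other conserved residues, and a regex match is not evidence of a functional THAP domain (the hmmscan profile hits are the stronger signal).

```python
import re

# Loose regex rendering of the C2CH consensus:
# Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His.
C2CH_PATTERN = re.compile(r"C.{2,4}C.{35,50}C.{2}H")


def find_c2ch(protein: str):
    """Return (start, end, matched_text) for the first C2CH-like match, or None."""
    match = C2CH_PATTERN.search(protein.upper())
    if match is None:
        return None
    return match.start(), match.end(), match.group()


# Fragment copied from the protein sequence in this record.
fragment = "CVPHCGVNQMSSPTLQFFTFPKDEKYLQQWLHNLKMAPEPGSDYSQYRICSLHFPK"
print(find_c2ch(fragment))
```

Offsets returned by `find_c2ch` are 0-based Python string indices, so they need shifting by one before comparing them with the 1-based alignment coordinates that hmmscan reports.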
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 38 3.8e-16 2.4e-13 52.0 2.7 1 86 919 991 919 992 0.86
2 38 1.3e-14 8.3e-12 47.0 5.8 1 87 1019 1088 1019 1088 0.82
3 38 1.5e-15 9.4e-13 50.1 0.3 1 87 1110 1182 1110 1182 0.85
4 38 1e-10 6.3e-08 34.6 4.3 1 87 1285 1352 1281 1352 0.76
5 38 2.1e-13 1.3e-10 43.2 4.2 1 86 1376 1447 1376 1448 0.79
6 38 9.1e-13 5.8e-10 41.1 1.2 1 87 1484 1554 1484 1554 0.79
7 38 4.1e-10 2.6e-07 32.6 3.3 1 86 1603 1672 1603 1673 0.76
8 38 2.6e-15 1.7e-12 49.3 1.1 1 87 1700 1770 1700 1770 0.83
9 38 2.4e-12 1.5e-09 39.8 0.8 1 86 1791 1860 1791 1861 0.80
10 38 2.4e-12 1.5e-09 39.8 0.9 1 87 1888 1960 1888 1960 0.87
11 38 0.0022 1.4 11.1 0.0 1 58 2012 2062 2012 2080 0.78
12 38 9.2e-12 5.9e-09 37.9 0.1 1 86 2108 2186 2108 2187 0.78
13 38 9.2e-13 5.8e-10 41.1 1.7 1 86 2213 2282 2213 2283 0.79
14 38 1.7e-11 1.1e-08 37.0 2.6 1 86 2356 2426 2356 2427 0.80
15 38 2.3e-14 1.4e-11 46.3 0.9 1 87 2449 2518 2449 2518 0.82
16 38 8.1e-09 5.1e-06 28.5 0.3 1 86 2928 2994 2928 2995 0.80
17 38 2.5e-13 1.6e-10 42.9 0.1 1 86 3040 3115 3040 3116 0.79
18 38 0.0003 0.19 13.9 0.1 1 61 3156 3218 3156 3239 0.75
19 38 1.5e-11 9.8e-09 37.2 3.1 1 87 3271 3341 3271 3341 0.80
20 38 1e-12 6.5e-10 41.0 4.4 1 87 3369 3475 3369 3475 0.83
21 38 8.2e-12 5.2e-09 38.1 1.6 1 86 3507 3575 3507 3576 0.79
22 38 2.8e-14 1.8e-11 46.0 4.4 1 87 3599 3668 3599 3668 0.83
23 38 3.3e-13 2.1e-10 42.6 0.4 1 86 3728 3796 3728 3797 0.80
24 38 5.7e-13 3.6e-10 41.8 0.7 1 87 3841 3913 3841 3913 0.81
25 38 1.1e-10 6.8e-08 34.5 0.1 1 86 3942 4012 3942 4013 0.77
26 38 7.3e-14 4.6e-11 44.7 1.3 1 86 4038 4108 4038 4109 0.79
27 38 0.0026 1.6 10.8 0.3 1 47 4149 4192 4149 4201 0.78
28 38 1.7e-16 1.1e-13 53.1 2.7 1 86 4321 4396 4321 4397 0.79
29 38 5.8e-13 3.7e-10 41.8 0.2 1 86 4434 4512 4434 4513 0.80
30 38 1.8e-13 1.2e-10 43.4 3.2 1 87 4773 4844 4773 4844 0.83
31 38 2.3e-14 1.5e-11 46.2 0.2 1 86 4937 5013 4937 5014 0.83
32 38 4.3e-10 2.7e-07 32.6 0.4 1 84 5054 5121 5054 5124 0.76
33 38 2.5e-12 1.6e-09 39.7 2.9 1 86 5191 5265 5191 5266 0.80
34 38 2.4e-13 1.5e-10 43.0 0.6 1 87 5464 5533 5464 5533 0.83
35 38 9e-10 5.7e-07 31.5 0.4 1 86 5588 5656 5588 5657 0.79
36 38 2.9e-12 1.8e-09 39.5 0.7 1 87 5741 5814 5741 5814 0.78
37 38 2.7e-11 1.7e-08 36.4 0.1 1 86 5835 5905 5835 5906 0.76
38 38 4.1e-10 2.6e-07 32.6 2.6 1 86 5929 5997 5929 5998 0.83
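Each row of the table above carries thirteen whitespace-separated values, so it can be parsed without special tooling. The following Python sketch shows one way to do that. The class `DomainHit`, the function `parse_row` and the 0.01 i-Evalue cutoff are hypothetical choices made for illustration, not settings taken from this record.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int       # "#": rank of this domain hit
    total: int       # "of": total number of domain hits
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float

    @property
    def ali_length(self) -> int:
        """Number of protein residues covered by the alignment (inclusive)."""
        return self.ali_to - self.ali_from + 1


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated domain row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


# Two rows copied from the table: hit 1 (strong) and hit 11 (weak).
rows = [
    "1 38 3.8e-16 2.4e-13 52.0 2.7 1 86 919 991 919 992 0.86",
    "11 38 0.0022 1.4 11.1 0.0 1 58 2012 2062 2012 2080 0.78",
]
hits = [parse_row(row) for row in rows]

# ASSUMED cutoff: keep hits whose i-Evalue is at most 0.01.
confident = [hit for hit in hits if hit.i_evalue <= 0.01]
print([(hit.index, hit.ali_from, hit.ali_to, hit.ali_length) for hit in confident])
```

With this cutoff the weak hit 11 is dropped while hit 1 is kept; a different threshold may suit other analyses better.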

Sequence Information

Coding Sequence
atgcccagccagcaagCTGAACCTACAGATGATGCTTACAGTAAAGTTAGTACGGCCGATGAGGCCTTAGCAGCTGGACACAGCGCAAATCCAATGCATCCCAATGCTGCTGGTTATATGAATTATCCAAAGCATTTGCAATCATATCCCCATCTCCAACACcaccaacaacagcagcaacaacaccaGCATCAATTGCAGCgtcagcagcaacaacagcagttgcgacaacaacagcaacagcaacaaatgcgacaacagcaacatcaacaacaacatcaacaactgCGACAACAACAAGAGCAGCAACGTTTGCGACAACAACGGCAACAAGAACTACACAATCAAGAACAGTTGCGACAGCAGCacgaacaaaaacaacagcaacaacaattgcgattgcagcagcaacaacacgAAATACAAGAACAACAATTGCGactgcagcagcagcagctagaacaacaacaacaacagcaccaATTGCGATTGCAACAgctgcagcagcagcaacaacatcagcaCGAACAACATGAAATGCAATTGCGACTGCAGCAACAGAGAAAAGAAACAGAAGAGCTAttgcgacaacaacaacaagagcagcAGCAACAGGAGGAATTGAGATTACAGCAGCAGAAAAAAGAAACAGAGGAGCTATtgcgacaacaacaacagcaacaagagcagcagcagcagcaagaGGAATTGAGATTGCAGCAGCAGAAGAAAGAAACGCAAGAGCTATATAGacagcaacaacatcagcaacaagAGCAGCAGCAACAGGAATTGAGactgcagcagcaacaacaacagcaacaacaacaagagaaACAGAagcaacaacaagagcaacagaagcaacaacaagagcaacagaAGCAGCAGGAGCAGGAGCAGCACAAACTGCTAGAGGAGCAACATCAACAGAAGCAACGGCAGCAAGAACAACAGGAACAACAAGAGAAACAAGAAACACATACACAAaagcaacaagaacaacaaacaCAAAAAGAACAGGAAGAGCTGCGACAACAGCCATCACAGGAACCACAGCTGCAAGAAGAAtatcagcagcagcaacagcatcaccaaaaacaacagcaacaacaatatgagcaacaacaaaaatgcgACGCAAGAGAAAATGAACTAAAGGAAAAGGAACAGCAGCACGAAGAACTTAAAGGAAAACAGCAGCAAGAAGAACTTAAAGAAAAACAGCAACAAGAAGAACTTAAAGAAAAGCAACAACAAGAAGAACGACATGAACTACAACAAGAGCGTCCAGAACAGGAACAGCAAcgacaacaaaataaaacaccaGATCAACAAGACGACCAAGACCAAGAGCTAATGTTAACACATGATGAGATGGCCGAAGCCGGCTTGCTAACGGTAAAACTGGAACCGCAAATACATATTAAAGAGGAGATGCCCGAACCATGTCAGAATAAGCCCCTCAACTTTCCACGTCGCAAAGTACAGACCGAACGTTCGGACACTCTACCAATTTGTCAACGTTGCAAGCAGGTGTTCTTCAAAAAGCAAATCTACACCAGACATGTCTCACAAAGTCTCTGTGAAATTGTTGAATTCGATTTTAAATGTTGCATCTGTCCAATGTCTTTCCATTCCGGTGAAGAGCTGCAAGAACACGAGGACTTACATCGAGAGAATATGTTCTTTTGTCATAAATATTGTGGCAAATATTTTGAAACGATTGAATTGTGCGAAGTGCATGAGTATATGCTTCATGAGCATAGCAATTTCATTTGTAATGTTTGCTCGGCCGGTTTCACTAGTCGCGATCAGCTCTTCCAACATATGCCGACACATCGCAATCAAACGCGCTTTGACTGTCCGATATGTCGCTTGTGGTTCCATACATCCGCACAATTGCATCAGCATCGCTTAGCGGCACCACATTTTTGTGGCAAATTTTATAATGCGGATGGAGAAGGCGCTGCAGCTTCAGTTGACAATACATATGGTGC
CACTCGGCCATCACATGCCACCAATTATAATCTGCAAGACTGTTCAATGGGCGTAATAGAAATGCCCGGTAGCAATAGTTTCTCCTCAGCTTTGCAAAATCGCGCCGCCTTCCATCACCAGCAGCACCAACGCCACGATCCACATTATTCACATTCGCACGCgcaacaacaatatcaacaaCATCTTCAACgtcaacaacagcaacaacaacgctATCAAGCACAGCAACACCAACTACACCACCGTTCGCCAGATTATATGCGTCGCAGCAGTTACGGCAGCGTGAGTGGCGCGGCAGTTGCTGGCGCGGGTGCGGGTTTGGGTGCTGCGAATGACTTCCACATGCCACAaatcaaaactgaaatcaaaattgaacCAGACACCTATGACGCGGCCGATTATGCCAAGCAAACACCGTTGCGGCCACCGCCCGTACCACAATCGCCCTCTGcgctaatggcatcaccatcATCACGTCAACGCCGCTTCACCGATTACTCAAATGAAACGTTTGGTGCTGGTGCAGTGGATAGTATGCTTTCGCTTAGCGTGAATAGCAGCGTGGGCAGCAATAGCAATAGCAATGACTTTTTAAATGGTGATGGTATGCAACGTACATTGAGTTCGAGCAGTCTCAGCCGCCATCCTTCCTCCTCGCAACAATATTCGCCAAATTTACGTTTTCCAACCACGCCCATGGCCAGTGCGTCATCGCGTAGTCGTGTTGTGGACACAGGCGATGAAGCAGCTATATGTTGTGTGCCGCATTGTGGCGTCAATCAAATGTCCAGTCCAACACTGCAATTCTTTACATTCCCGAAAGATGAGAAATATTTACAGCAATGGTTGCATAATTTGAAAATGGCACCCGAACCCGGCTCCGATTATAGCCAATATCGTATATGCAGTTTACATTTCCCGAAACGTTGCATGAATCGTTATTCGCTTTGCTATTGGGCTGTGCCCACCTTCAATTTGGGTCACGACGATGTAGCCAATTTATATCAGAATCGTGAGATTACCAACACGTTTACGGTGGGCGATCGTGCGCAATGCAGCATGCCCGGCTGTTCGAGTCAGCGTGGCGAGACCAATTGTAAATTCTATAATTTTCCAAACGATATGAAGACGCGCATCAAATGGTGCCAAAATGCACGTCTACCTATACATAGCAAGGAACCGCGGCATTTGTGCAGTCGTCACTTTGAGGATCGTTGCTTTGGCAAATTTCGCTTGAAGCCATGGGCGGTGCCAACATTGAATTTGGGTACACCCTATGGCCGTATTCACGATAATCCAGGCATATTCTATTTGGAAGAGAAGCGTTGCTGTTTGCCGCATTGTAAGCGTACACGCTCATCTGATTTTAATCTGTCGCTCTATCGTTTTCCGCGCGATGAAATGTTGCTGCGTCGCTGGTGTTACAATTTGCGTTTGGATCCGGCAATTTATCGTggtaaaaatcataaaatatgtAGTGCACACTTTATAAAGGAGGCGTTGGGTTTGAGAAAGCTGTCGCCAGGCGCCGTGCCAACCTTAAATCTCGGTCACAATGATCGCTTCAATATCTACGAGAATGAGCTACCTACACCACCACAACCGCCAGCACCCACCCCAGTAGCGGCACCCAAAATGTTTAGCTTCCATAATTTGTCCGCACCACCCGAACACTACAACAAAAGCCATAACCGCGAAAGCATCGCCGCTGCGAGCAGCCAACATAGCCGCCACATGAGCCGCATGTTCCAGAATCCTGTATCAGAGCTTAGATTCAGCATATCCAATATTACAGCCGGTGATATGAGTTCTATGGGTTCAGCACAAAACCTACTCGATTGCATTGACTTTTGCTTTGTGTGTAAGCGTAATCGTAACACAGATAATGTCACCCTACACACCATACCGCGCCGGCCCGAGCAAAGGCGCAAATGGTGTCACAATCTAAAGATTAGTGTTCACTCGCTGCACAAGGGTGTGCGCATTTGTAGCGCCCACT
TCGAACCCTACTGCATTGGTGGCTGTATGCGACCGTTTGCAGTGCCCACACTGAATTTGGGACACAACGATCCGAATATTTATCGTAATCCAGATGTAATTAAAAAGTTGAATATACGTGAGACATGTTGTGTACAAGACTGTAGACGTAATCGTGATCGCGATCGTGCCAACTTGCATCGTTTCCCATCGAATTACGATATGTTAACGAAATGGTGTGAGAATTTGTTGAAGCCGGTGCCGGATGGCAGTAAACTCTTCAACGATGCCATATGTGAGCGACATTTCGAAGATCGTTGCTTGCGCAATAAACGTTTGGAGAAATGGGCGGTGCCCACCGTCAAACTGGGACACAACGAAGAACTCAAGCATCAGCTGCCAACCGACGAAGAGATTGCTGAATTGTGGCCGAAGCCGCTATTGCCCAACAAGGGCATTGATGATGGTGAATGTTGCGTTGCGACATGTCGTCGCGATCCCAAAGTCGACGATGTTAAACTCTATCGCTCACCCGAAGACGCCGAAGTGTTGGCCAAATGGGCGCACAATCTGCAAGTGGAAACCGAAGATCTAACCACGTTGGTCATTTGCAATTTACACTTTGAAGAGCGTTGCTTTAGCAAGAAACGACGTCTACACGATTGGGCTTTGCCAACACTCAATCTCGCCAACAACGTGGAACAATTGTATGAGAATCCTGAACCGGTGGTGCCATTGGCCGTTATGAAGGCGGAGGAGCGTCGTGAACGTCGCGAACGTCGCCGCATGCGCGACCCCAACGAACCGGCCAAACCGTGGACGCCACGCTGTTGCCTGCCGCATTGTCACAAACGTCGCGATACCGATCGTGTGCAACTCTTCCGCTTCCCCATACTCAATCGTCCGATCTTGGTGAAATGGTGTCACAATTTACAGCTACCGCTGGTGGGCAATGCGCATCGTCGCTTGTGCTCCGCACATTTTGAATCACGCGTGCTATCGAAACGCTGCCCCATACCGACCTCAGTGCCAACGCTTGACTTGAATACACCGCCCGGCTATAAAATCTACACGAATCCGGCGCGCTTGAAGGCGGTGAAATTGCGTATGCAACAAACTTGCTGCATAGCCTCGTGCGCACGCACGCGCGCCGATGGCGTACAGCTATTCCGTTTCCCGCACAGTCGCTCAATGGTGCGCAAATGGTGTCACAATACGCGTCAGCAAGCGCGCGAAGCGATACGCGGCCAATATCGCGTTTGTTCGCGTCACTTTGAACCGCACGCATTCGGTGCGAGACGTCTGATGCCGGGCGCTATTCCAACGCTGAATTTAGATCCGGAAGTGGAGGATGCTTTTGCGAATGAAGCGCAAGGCTTTGCCGAGACGCAATGTGTTGTTAATGGTTGTGTGGCCAGCAAGGAGCTGGATGGCGTGCGTCTATTCAAATTTCCGCACGAAGATGAGGATTTACTATGGAAATGGTGTAACAATTTGAAGATGAATCCAAGCGAATGCAATGGCGTGCGTATATGCAATCGGCATTTCGAAGATGAATGTGTCGGACCGAAAATGCTGTACAAATGGGCCATACCCACACTGGCGCTGGGTCATGAAGACcccgatatcaaattggtgccAGTGCCGCCGCCTGAAGAGCGTTACAACGATGTCGTCACCAAGTGTTGTGTACCCACATGCGGCAAATCACGTAAATTCGATGACGCACAAATGAATAGCTTCCCCAAGGAGATACGTTTCTTTCGCTATTGGAAGCACAATCTCAAATTGGATTATTTAGATTTCAAGGATCGtgataaatataaaatatgcaGTGATCACTTCGAGCCGGTGTGTTTGGGTAAGACGCGTTTGAATTATGGCGCCATACCCACGCTTAATTTGGGTCACGACGACACCGAAGACCTGTACAAAGTCGATCCGACACAACTGCAAGTCTCGCTGTTCGGCAAAATGCGTCCATGTGCAGCGGGTAGCGGCGAGAAACCCGCCGAAGAGCAC
GAGGCCACGGCTGAAAAGGATGATGACGTAAAATGTTCCTATCCAAATTGCACAGCGACCAAGACGCTGCTACGTGAACCCTACGATATGCCGGCCGCACTGCCGCTGCATGCGCTTTGGTGCGCACAAATGAAAGTGGATACGGAATCGCTAAGCGAAACACCAAAACTTTGTGGTTTACATttcataaaactatacaaatcAACGCTGGATGCGGCTAATGCTCTGGCGGAAGGCGATGAAGCGCTGAGTGAGGCCATGCAGCAGCTGACTGCCACCTATGAAAAATGTCGCGCCTCACCAATCGTTTGCGCTGCGCAATGCTGCGTCACTGGCTGCAGCAGCAGTCAAGTTAGCGGCAGTCGTACCGCCCAGCGCCTCTATCTCTTTCCCTCGCCCACAAGCAGTGGAGTGGAGTTGATTGAAAAATGGTGTGAAAATGTGGGAGTTGCTGCGGCCGATTTGAATCTTTATCCGCACAAGGTGTGTGCACGTCACTTTGAAGCACAATGCATTGGTCCGACGCAGCGTTTGCGCTCGTGGGCCGTACCAACGTTGTATTTGAAGAAGAAAAGCGGTGCGGAAATACATCGCATACCCGAATTGGGCAGCGCGGTGCGGCCGCATGAGGCGCGCTGTTGTGTGCCGTCTTGCGCCCGCGAACGCGGTGATTTTGAAACAATGCGTCTCTTCAGCTTCCCCATCATAGAGGAAGTGCTCGAGAAGTGGCTGCACAATTTGCAGCTGACGCGCAGTGAATGTGCGCGCTTACGCATTTGCGATCAACATTTCGAATCGCGTTGCATTACAAAGGCACGTCTACAACGTTGGGCTGTGCCCACGCTGCTGCTCGAACGCAAACATGAAGAGCTGTTACAAAATAAGCCACCCGTACACGCAGCAACTGGCGCGTCGGCTGGAAGACCTCCGAAGGTGGCTAGGGCTGCTGTTGCCGGCAaagacgatgatgatgatgatgatgaactCGGTGAGGATACCGCTGGTAGCGCCGAGTTGGCTGTTGTGCCTAAtgtaaaaatgaaaaaatctCTAGACACTGTAAAATGTTGCGTTGCGAGTTGTCGACGTAGTCGCCTTAAACACGGCGTACGCCTATTTCCATTACCCACGACACCATCGATACTGCGCAAATGGTGTCACAATCTAAGAATGCCCGTTGAAAAAGCTGCAGACACATCTTTACGCATTTGCAGTTTACACTTCCACAAACGTTGCATTGATGGCAAGGAGTTGCGTGTATGGGCTAAACCAACCATTGCACTTGGACATACCGGTCCCATCGATGAGATTCCGAAGAATCTGCCCGGTGTCTTTTTGCCCAAATGCTGTTTGCCGCATTGTCGTCAGCGGCGTACACTGGAAAATGATTTGCGCACATTTGGTTTCCCCAAAGATGCGGTACTGTTGCAGAAGTGGTATGCCAATTTGCGTATGCAGAAACGTACTACGCATGCGCGCATTTGCATGTCACATTTTCCACCCGAAGCCATGGGCAATAAGAAGTTGCGCAACAATGCGGTGCCGACGTTGAATCTGGGACACAATGAGCCACTCAAATATGATAATGAACTGTTGATAGCTTCGGCGCCATCAGCAAAGCTAGAGAAGAAGAATcgtctgcaaaaaaacaaaactacTGCTACTGCAGCCAGCGATGCTGGCCAAGAAGCGAAGTCAGCCGAGGCCGAcgatgatgatgctgatgatgcTTATGGCGACGATAGAGAGGAGGAAGAGGATGATGAAAATGGTTTATACGCCGGCACTGGTGTCGCTAACAGCGATGAAGAGGAGGAGGAAGAAACCTTTGAAGACTTTACGCCAAGCCTTAGTGCAGCCACTGGTGATATCGATGATGATGAAACACGAAGTTATGCTGGCAGGCGTGCAGCTGATAAGCTCAATGATAATTCCGATGAAGACGACGACGATAATGATGATGAGGATGGCGAAGATGAAGATGATGAGGATGAGGATGA
AGatgaagaggaagaagatgaagaTGACGAGGAGGGGGAGGAGGATATTTATGGCAGTGGCGCTGATTATATTGACAATTCCGCAGTTGCCGAGCGTTTGTACAACGAAAATGACAACGATGACGacgatgaagatgaagatgaggATGAGAGCGATATAGATAATGATGGCAATGCCTTAATGAATGATGAAAGCAAAGATGTAGATGACGAAgatgatgaggatgatgatgatgacgacgacgacgacgatgaTGAGGATAACGATGATgatgacgacgacgacgacgatgatgatgatgaggatgTTGAAATGCTGATCGATGACGAAGGTGTTAAAAATCGTAATACTGATTGTGACCCATTAAATTTTATGGCAGACGATGCAATGACGACGCCGCTAAGGCCAACTGCAGCAGCAGGCGCTGCAAAGAAGCGCACTGCTGCAAATAAAAATGCAcagaaaaaaccaaaaactcTTTTGCCGGCACACTTTGACTCCGAATCGAATACGAATTTCAGTAATGCCGATGACACGACACGTTTCACCACAACAACAATGACATCCAATTCAGCAGCAAATATGTCCTCGGCAAAAGCAGCCGCCGTTGGCGCAACAAAAGTGCGTAATTGCAGCGCAACCGAAAAGATAAATCGTTCGGTTTTTCGGCTTTCTTGCCTCAAACATCGTAGGAGGAAGAAGAAAGCACCACCAGATCTAACACCACCACCATTTGATAGCATGCAGCAGAAGTTGAATAATTTAATTACGCACACACGCTGTGCCGTGCCACGCTGTGGTGCTGGCGTCGCCCTATATCGCTTTCCTCCTATCGGCAGTCGCTTCTGTCGCGATTGGTGTGCGCAATTGAATGTGAACCAGTCCGAAGCAGCACGCTTGCGTATATGCCAACGACACTTTGCCTATAGCTTAGTGGATCGACGGCGCAGGCGCTTACGATTTGGTGCCATACCGACACGCAACTTATACAACACAATGTTTGCGCGTAAACCTACAACATTACAACAAGCACCAGTAAAACAAatgccaacaaaaattaaagTAGACACCAACACAAGCGCAAACTCAACGGCGTACGTGAACGCCTACAATCGCTGTTGCTTGCCGAGTTGCGGCAAAACGCATCGTGGCGATGGTGTCGCGCTCTTTCGTTTTCCCAAACTACGTTCGCTTGCCATACAGTGGGCGCACAATTTGCATCTCTTGCCCACGTCGCGTTTACTGCAAGTCTATAAAGTGTGCAGCGAACATTTCGAGAGCCAATGCCTGAGTTATCAAATTAAGGGACGTACTACACTGAAATATGGTTCGGTGCCAACGCTCAAATTGGGCCATAACGATGATTTAACAAATACCAACAACAGCAGTAATAGCTTAGCGTTAACACAGAAGCGACGCAAGCCGGGCAAACGCAAAAGTCTGCCAGCGAAGGGACTCAACGAATGCGCTGTGCACGATTGCCGTGTAGCGCAGTTCCTGCAAATGCAACTCTTTCCACTGCCGAATGCGCAAAAGCTGCAGGAGCGCTGGTGTAACTATTTCAAATTGACCTTCACAACCACCGCGACCATTGGCAATGTCACGGAGTTCTTTGAAAATATACGCCTCTGTGCGTTACACTACATGGAGGGCTATCAGATGGCGACGCACAGTAGTGATGGCGTGCGCAAAGGTAAGGACAAGGGCAGCAGCAGTGGCAATTCGGCTGCTGCTATTGCTGCTGCCTTGGAGGAGGTGGAGGCGAACTATGCGCGCATCACCAATTGCACACGCATACAAATGATGAAGTGTTGCGTGCCAAATTGTTCGACGAAATTCACCGACAACTTGCGTCTAACCGCATTTCCCGGGCCGGAAGAGTTGCGCGCCAAATGGCAGCACAATACGCAAGTCACGTTCAGTGCAACGCACCGTTATTTGTATAAGGTATGCGCGCTGCATTTCGAGGAACGTTGCTTCGCCAAGAAACGTCTTTTCA
TGTGGGCCATACCCACATTGAATCTGCCAGCGGCACCTTTAAGCAATCCAGCGCATAAAATCTATGAAAATCCCGCTGTACAAGTTGTTGGGCCGAGTACACGCTGTTGCGTTGAAGGTTGCGAAACAAATGAGAAAAGGCAGCAGGAGCAGACGACGACAGCGCTGGCTGCTACTGATGTAGTGCAGCATGCCGCCGTTGGTGGTGACGAAAAAGCAGCCAACTCTAACGCTGCGCCGCCGCGCTTATGGCATTTCCCGCAAGATGAGAAACTCTGCGAGAAATGGTGTCATAATTTGGCTCTGACCGGTCAAACGCAACAACAAATTAGTCACACAAGTCGCCGTTGGCGTATTTGTAGTCGTCACTTTGAAGACTATTGCTTTGGCAAAGCATTGCGCCCTTGGGCCGTACCCACGTTGCACTTACCCAAACCGGCCAAACAGAGCAAAGCGGGCGGCGGCAAACGTTCCACATTCATTTATCAAAATCCCGATAGCGCAGCGCTCTTCTATCGCTGTTGCATTAAAACTTGTCGCCAACAGCCTGATGTAGATGCTGGCATACGCCTCTACGGCTTCCCCAAAAACGACACAATGCTGCAGAAGTGGGCGCACAATGTACGCATGCCGGCGGTGAAGTGCAGCAAGGCGCGCATCTGCAACTTACACTTTGAAGCGCAATGTATGCGCCCACAAATGCAAGCTTGGGCCTTGCCCACCATCGATTTGGGACACGACGAAGCGGATATCTTCCGCAATCCCAAAGTCAAGTTGAAGGTAACAAATGGCCGCTGCTGTTTGCCACATTGCAATAAACGACGCCAAAATGACAATGTACACCTTTTCGCCTTTCCGCGTGACGAGCAAGTGCTGGCCAAGTGGTGGCACAACTTGGGCATAGCCGCACAAGACGCCAAACATCGTATGATTTGTGATACGCATTTTGAGCCGCGTTGCATTAAGTTGCGCCGCTTGAAGCGTTGGGCCATACCAACACTCAATTTGGGACACAGCAATGAGCTGTTGAAGAATCCAACGCCCGAAGAGGTATTGGcgtttgaaaataaaaatgcatCAATGCGACGCTCGCAAACGCCATCGAAAATGGCGCGCACAAAAGCGACGACGGGAACTGGAACGACGGCGCAGCAGTCAACGGCCAGCAGCTCAACACAAAAATGCGCTATTGCCGGTTGTGAGCGCGGTCCGGATGCTGATGCAAGCACGCTCTATCGCTTTCCCAAACCCGATTGGCTGCGCAAAAAATGGTGTGAGAATACGCGTCTGAGCGAAGAGGCGGCCAAGCAGGCAAAAATCTGTGCGCGCCATTTCGAGACGCAAGTGATGGGCAATCGTAAGCCGCGTCCGTGGGCCATACCCACCATCGAGCTGGGTGTGGATGCGGAAAGTGGCGCGCCCTTGGCCGCCGTGCATGCAAATCCCACACAACTATCGCTATCGCGCTTCCATCCCGAGGAGCATGAATATGGCGAGCTGCGTTATGTGCGCGCCAATCACTGTTCCATCATATCGTGCATGAAATTTAAAACGGATGGCGTCACCATGTTCAATTATCCCACCAAAAAGGAGATGCTGCAGAAGTGGGCTGAAAATTGCCGTCATTATCCCTATCAGGCGAAACGCTATCGTTTCATATTGTGTGGCGCACATTTCACGCCCGATTGCTTTAAACGCGAAGGCACACGTTTGCGCAAAGGCGCTGTACCAACACTCAATTTGGGTCACGATGATCCGCATATACACCAAAGCGTATTCGAAAATGCTGTGACAGCGTCTGCAGATGGGCTGAATAAAAAATATTGCAGCGTGCCAGAATGTGGACGCAATTCGTTGGATGATGGTGTACGCCTGTACAAATTCCCGCTTGAACGCgctgaaattttggaaaagtgGTGCCATAATCTGCGCATCAGTGCGACCGATTGTCGTCACGCGCTCGTCTGTAATATGCATTTTGAGTCGCGTTGCATTGGCGCT
GGTTTGCGTTTGTTACATCGCGCCATACCCACGCTACTGCTGGGACACAACGATCGTGAGGATATCTACGAAAATCCAGAGTCTTTTGAGCGACCAGAGAAAGTGGTGTGTTGTTGTGTGCCTGGCTGCAGCAATACCAAACTCACCGAGGGTGTACAGCTGAGCGCGTTTCCCAAATTGCGTTCGCAATTCGAAAAGTGGGCGCACAATCTACGCTTGCCCTGCACTTCCACCGTTTGGCATACGTATAAAGTGTGCAGTGAGCATTTCGAAAGCTATTGCTATGAATATGGACGAATACGTGTTGGCGCGATGCCGACGCTGAAATTGGGACACAATGACACACACGACCTATTTACCGTTAGCGAAGATTCAATGTGCTATTCGTTGAAGCGTAAGCGCGTACCGCATAAAAATGAAAGCTTAAAAGCATCACAGGAAGCATGCTGCTATCCGGGTTGCAAAGAAATGGAATTGCGTGTAGGGCATCACCTCTACGAATTGCCCGAATGGAAGCCAATAAGACGTGAATGGCTGAAGAGCATGGGTTTGTGTGATGCTGCTGATGATGAAGACAACAAGAAAGAAAAGAAGCAGAGCGTAGATGCTGTAAAAGATGAAGTAATTGCTGGAGCAGAAGGAAAAAGTGTTGGTGAAAAGACAGACACACTGGAAAGCGATTTAAAAATTGCGAAAGTAGAAGAAGTGGATAAAGCACAGGCGTTGAAATCGGAGAATGCAGTAGATGCCCAAGGCCTGCAGATTAAATCTACTGAAATTACACGCAAACTGTGTCCACTACATTTCAAATTACTCTACATACAACATGAATCCATGATTCAGACAGCGCTACAGTCCACAACTGCTGCGAGTCGAACGGCAGCAGAACGACTTTGTCTGCAACAACTCAAAGACGATATGGACATCATAAACGATTTGTCGTGCATACAACGTTCATCATGCGCTGTACCCGGCTGCAATTCGTACAGTTTGATGCCCAATAAATCTGTGAAATTCTTCAAGTTTCCCGATAATGAAACATTACGCGCCAAATGGTGTCATAACATACAGGCAAAACTCGACGTGGATCGCTTGTATTGCTACAAAATATGTGAGGAACATTTTGAGTCCGTCTGCATGTCGTTGGCGACTAATCGACGTTTGAAATCTTGGTCGATACCAACATTAAAATTGCCCGCGCGCAACGATGACATGCCCGACATCTATCCGCTGCCAGCGCCCGAAGCGATGCAGGAAACAAGACGCACCGCGTTAATCGCGCAAACATCCGTTAATAAATGTTGTATACACAATTGCTTGCATGCCAGGGCGCCCGAGGAGCGTGGAGCACCAAATGGAGCTGAAGGAGGTGAAGTgcaatttttcaattttcccAATGATGCCGAGCTACTATACAAATGGGTTTATAACACGGGTGTGAGTATGGTGCAGTCAACGCATGCGCGAATTTGTTCGTTACATTTTGAAAAGCACTGTATTAATAAGCGTTTACGCATGTATGCCGTGCCGACGCTGCTGTTGGGTCATAACCGCACTGATATTTATCTGAATCCAGCCGATAAAAGAGACGAGGTGACGACGGGAGTGGCAGAAGAGAGCAAAAGGGAAGAAGCCGAGGAAGAGCTGAAAGATGAAGATGAAAACGAAGATGAGGAGGAAACGGAAGTGGAAGAAGAAATGGCGGAGGATGCGGAGCAATTCGCTAAAACTAAAAAGGGCACCGCAACGGCAGTAGAAAAACAGAAACACAAactaaaagcaattaaaaaagcAGGCAAGAAATTAAAGTCGTTAAAGCGCGCAGCTGAAGTGAAAGACGTGCCGCAAAAAGCGCCGAAAACAAGCAAAAAAGAGTCAAATAAACCAACACCAACCACACAAACAAAACCCTATCACAAACCATTCATATTAAATGTAAAAGAAGAAAAGGATTCGGACGACGAGGCAGCTAGTGCCGCTGCTAACAGCATTGAAAACCA
AATGAGACAAGGTTTGCTAGATATGTTCCATGGTTTTGGCGGGGACGGTGATAGCGCGGACGAGGATATGGTGGATGACGATGAGGAGGAGGAGGAAGAAGCAGAGCAGCACGACGATcgttttaaaaaatataaaacaaagcttgaaaataatcaacagacaaaacaaaaagaagaacaagaaaaagaaaaacaacgAGCACGACAAGCTCAAACTGAGAATGATGGTCGTCCCGACGACGATGACGCCGATGGCTCTGATTATGGCAATTTGTTGGATACCATGACGTCAGTAGAACGCGACACCTCACGTCACTGTCGCATACCCGATTGTACAAGTCATGCCAAGGATCCCAATGTTACACTTTTCAAATTTCCACGCTCCGAACATCTATTCCGCAAATGGCTGCATAACACACAACTGCGAGTGGACTATACGCGTCGTTGGCGTTATCGCATTTGTCAGCGTCATTTCGAACCGATTTGTATGAAATTCCGCAAGCTACCGCCCGGCACAATGCCCACACTCAATTTGGGACCATCACGCCCGGCACGCATACACGAGAATAgttttgatgtgaaaaatttgaaaaagttCAAACGCAAAAAACCAACAATGGCTGAGAAGGCGAGAGCAGCAGCgtcagcagcagcagcgcaATCGGCGGCAGTAGCAGCAGCAGCCGCCGCAACAGAATATGACGGCGATACGAATAGTTCGTTTGCCACTGCGCCGAAAGGCGACATCCACTCGTTGGCGACGGCGCAGCAACGTGCAACACAGGTGTCGCCATCGCCACTACTCTGCTGTGCCATACGTAACTGTACCAGTAAATATCATCAACTTGGCGAAGGCGTACATTTGCATAAATTACCAAAGAATTCTCTGTTGCGTCAAAGGTGGATATACAATTGTCGCTTGAGCGACGAGAAAGTGCTCAGTCTTGGCAGCCGCCTGCGTATTTGTACCCTCCACTTTGCCAAAAATTGTTTCTACGGCATCAAGGAGCAACTGAAATTTGGCGCTGTTCCAACGTTGCGTTTAGGTCACACCGATTGCGATATATTTCAGGATGGTTTTAGCAATGCCGGCGAGAGTGGCTTCCGTCAAACACCATTTATTGCATCTGCCGGCAAATCGGGTGTAATGCATGATGTGTGTTGTTTGGCGAATTGTACACGTAGCGCACGCGAGTACACACGCCATTTTGCATTTCCAACGGATAAGAATATACTTGATAAATGGTTGGATGCTTTGTGTATGGATTTGAGTGGCGCACAAGCAAAGGATTATAAAATATGCGAATGGCATTTTAAATCGTCCGATTTTGATGGAGAAGTTTTGCGTTTTGATGCAGTGCCGGCGCGTTATTTGCGAATGACCGGCGGCGCCGATGCCAACGATGAAACGGATGAGGAAGATGgttatgaaaatgaaaacggtGGCTTGAATTGGCATGACAATGAAAAGGCAGCGCTGGATGAGCGTCCATCAACTTCCGCTGCAGCAGCCATCCAAAACACCACCACACCGGATGGCTATAATCGTTTAATTCCCGGCTCCCGCAAATGTTGTTTGGCTCATTGTGGCAAACAAATATTTCCAGATATGGTACGCACCTTTAAATTCCCCACGGTGCGCGAACAATTCGACAAGTGGACACATAATTTAGGTGTGAAATATGATGGCGAAACACCATGGCGTTATCAAGTGTGTAGCGAACATTTCGAAGAAAGATGCCTTATACACTATGAGAACAAATCGAAACTGTTTCGCTGGGCGGTGCCCACATTAAAGTTGGGTAAAAATGCACCAGCTGAACTCTTCACAAATGATAAGCCCCACAAGGTTTTGCAAGCAATGAAAGGTAATTTTGGAAATGCTGGAGCGCGCGGAGCGGGCTATGGCGTTAATGATGATGACGAGACTATGGACACGACAAACGATGATGAGAATTCCAATGCATTGCCACCCATAGATATTGTGCCGCAAGTGGTTT
ATGATGAcgaggatgatgatgataatgagcAGGAGGAGGAGAATGATATGGATTTACTAGCACCGAGTAGCACAACAAAACGTGCTGATGCATACACAGCGAAGAATAACAAAAATACTAATAAAAATGACAATTACGATGGCAATGCTGATGACAATGATTAcaatgatgatgacgatgattatgatgatgccGCCGTTGTCACTGCCGCTGACCGCAAAGCCTCCGCCGCCTATGATATGGCTGAGGAGAATTCACTTTTGGATGTGATACGCGAAGAGCGACCGAGCCCCGTGAAAGAGGGTTTTGTTGGTGccccatcatcatcatcatccacATTCTTTTCACAACAATTTTTACGCAGCAGCAGCGCATCCAAAGAGCGCGCTTGTTGCTTAATGCATTGCGGTCGTACGCGCTCCACTGGCGTACGTTTGTTCCGTTTCCCCACAGAACCAATGTTCCTACAACGTTGGGAATATAATTTGCGTGTACGCTTTAGCGAATCACAACGAAATACCCACTTGATTTGTAGTTTGCACTTCGAACGCTCACAATATAATAAACGTTTAGTGGCCGATGCAATACCCACACTCAATTTGGGCCACAACAGCCCGGACATTTATAGAATTGGTGAATTTGAGCTGAATAGAATAAATAAACCGCTGCCACCACTACCACCACTACAACCACCACCATCAGCACTACTAAAACCGATTGTAAGCAGCAAGACATCGGCAACTATGGCTGCCAAATCCTTGTGTAACGTACCCGCCTGCAATGAAACGCAAGCAACGCGCAGTCTCTTCCCCTTCCCCACAAACAGCACGTTTGTTAAGATTTGGGCTGAGCGTACGCAAATCGCTTACGATCAGCGTTATCATGCCGAATTGCGTGTGTGTGAGCTGCACTTTGATACGGATTGCTTTACCGGTTTGAACTTGAATAGCAATGCTGTGCCAACGTTGCGTCTTCCAGCGCCCATGTCAATGCCAATGCTCGCTGCTGCGTTAGCACCAAGTGCTGCAACAGCAACTGCCACAGCGGCAGCAGCAGTAGCAGTAGCAGCTACGACAACAAAAACCACTGCAGCGCCACTTAAACAACGCCAAATAGCAcatgcagcagcagcagcatcagCAACGGCAACAGCACAACCGACCGCAGTTGCACTCAAATCAGCGGCGCCACCGCCGCCACCAAAATTAAACGCCATCACTTGCAGTGTTACGAATTGTGGCAATAGCACCGCCAAACGTCCGGATTTGAAAATGTTTTCGAAATTCCCCGACGACTTTGAATTATTCACGAAATGGTgtttcaatttgaaaatcgaTCCGCGCACCTACGTCAATGGCAGTTATAATGTGTGCAGCGATCACTTTGAGCCATTCTGTATTGGCGGTCATAGTTTGCGCGTGTGGGCCGTGCCAACTTTGAATTTGGGGCACAACAGCAAGCTGATACATAGTGTTGAGCGGCCAGCGGAAATGGAGATGAAATGTTGTCTGCCGCATTGTGGTCGTAGTAGAAGTAAAGATGGCGTGGAGCTGTTCAATTTTCCTAAAGGCGAACTCTATCGTCAATGGTGTCACATACTGCGCATCGAGGAGGGTCTCTACCGCAACACAGATAAGAAAATCTGCAGCGCGCACTTCCGTGCCGATTGCTTTGCCAACAATGGCGCTCTCCGTCTCGGCGCTCTGCCCTCTCTACTGCTGCGCAATCGCACACCTACTGCGGCTGCGCACATACTGAAACCGCCGGCGCCCTATCgcagcaagtgtattgtgcgcaCGTGCCACGAAATGCAACAACTCTACAGCTTCCCAGCGCAACGTATATTATGCACGAAATggtgtcataatttaaaaatcGATTACTATCCAAAATTGCATGAGAATATGAATTTCAAAATTTGTCGCCGCCACTTCGAACCGAATTGCCTGTTGAGTGGCGGCAAATTGCATGCCGATGCAGTGCCGACGGTGCAACTGGGA
CACAACGATATGAACATCTATCAGAATGTGATTGGCATGATGATGAAGAGTCGCGGCAATACGCCCAGTTATGATGATAATAGCAGTTTGCGTACAAGTGTTAGCACCGTACACAATTGGCTGATGGATGTTGATGCGGAGACAAATGCTGGCCAACAAATGTTGGCCGGCGGTGGCGGTGGCTGTGGCAATGACAATGATTTGCTTGATGCGAGTGGTTGCGATGGTCCAGCTGCAGGTCCGTATGAGCCGCGCGTTGCACTTGAACCCATCACGGTGGAGCTTgacgacgatgatgatgatgatgacgttGCTGGCGTAGTTGGTGTTGGTATGGGTGCAGGAGTTGGTGGCGGTGGCGGTATGAGAGCACGTCGACCATATATGCCACCAATGGCCGCCGATGATACGTATTGTGCGGATTTCAGTGAACAACGTTTACTGCCGCAAAGTAGCACATTTCTAGCAGCCGAAAGCGCGGAAGTAATTGATTTGGATGATGTGGATGCGGTGCAAGAGCAATACCCCAGTtgggcgcaaccacaacgcacCGTTGATGCTGTATTGGTGGACGATGATGAGGATGAAGCTAATGTTGCTAATGATGTTAATCTTGCAAGTGTTGATAGACACAGAGGTGTTGGTGCACGTGCTGCTGCTGGTGCTGGTGATGATGATGCGTTTATGTGGCCGTGTTAA
Protein Sequence
MPSQQAEPTDDAYSKVSTADEALAAGHSANPMHPNAAGYMNYPKHLQSYPHLQHHQQQQQQHQHQLQRQQQQQQLRQQQQQQQMRQQQHQQQHQQLRQQQEQQRLRQQRQQELHNQEQLRQQHEQKQQQQQLRLQQQQHEIQEQQLRLQQQQLEQQQQQHQLRLQQLQQQQQHQHEQHEMQLRLQQQRKETEELLRQQQQEQQQQEELRLQQQKKETEELLRQQQQQQEQQQQQEELRLQQQKKETQELYRQQQHQQQEQQQQELRLQQQQQQQQQQEKQKQQQEQQKQQQEQQKQQEQEQHKLLEEQHQQKQRQQEQQEQQEKQETHTQKQQEQQTQKEQEELRQQPSQEPQLQEEYQQQQQHHQKQQQQQYEQQQKCDARENELKEKEQQHEELKGKQQQEELKEKQQQEELKEKQQQEERHELQQERPEQEQQRQQNKTPDQQDDQDQELMLTHDEMAEAGLLTVKLEPQIHIKEEMPEPCQNKPLNFPRRKVQTERSDTLPICQRCKQVFFKKQIYTRHVSQSLCEIVEFDFKCCICPMSFHSGEELQEHEDLHRENMFFCHKYCGKYFETIELCEVHEYMLHEHSNFICNVCSAGFTSRDQLFQHMPTHRNQTRFDCPICRLWFHTSAQLHQHRLAAPHFCGKFYNADGEGAAASVDNTYGATRPSHATNYNLQDCSMGVIEMPGSNSFSSALQNRAAFHHQQHQRHDPHYSHSHAQQQYQQHLQRQQQQQQRYQAQQHQLHHRSPDYMRRSSYGSVSGAAVAGAGAGLGAANDFHMPQIKTEIKIEPDTYDAADYAKQTPLRPPPVPQSPSALMASPSSRQRRFTDYSNETFGAGAVDSMLSLSVNSSVGSNSNSNDFLNGDGMQRTLSSSSLSRHPSSSQQYSPNLRFPTTPMASASSRSRVVDTGDEAAICCVPHCGVNQMSSPTLQFFTFPKDEKYLQQWLHNLKMAPEPGSDYSQYRICSLHFPKRCMNRYSLCYWAVPTFNLGHDDVANLYQNREITNTFTVGDRAQCSMPGCSSQRGETNCKFYNFPNDMKTRIKWCQNARLPIHSKEPRHLCSRHFEDRCFGKFRLKPWAVPTLNLGTPYGRIHDNPGIFYLEEKRCCLPHCKRTRSSDFNLSLYRFPRDEMLLRRWCYNLRLDPAIYRGKNHKICSAHFIKEALGLRKLSPGAVPTLNLGHNDRFNIYENELPTPPQPPAPTPVAAPKMFSFHNLSAPPEHYNKSHNRESIAAASSQHSRHMSRMFQNPVSELRFSISNITAGDMSSMGSAQNLLDCIDFCFVCKRNRNTDNVTLHTIPRRPEQRRKWCHNLKISVHSLHKGVRICSAHFEPYCIGGCMRPFAVPTLNLGHNDPNIYRNPDVIKKLNIRETCCVQDCRRNRDRDRANLHRFPSNYDMLTKWCENLLKPVPDGSKLFNDAICERHFEDRCLRNKRLEKWAVPTVKLGHNEELKHQLPTDEEIAELWPKPLLPNKGIDDGECCVATCRRDPKVDDVKLYRSPEDAEVLAKWAHNLQVETEDLTTLVICNLHFEERCFSKKRRLHDWALPTLNLANNVEQLYENPEPVVPLAVMKAEERRERRERRRMRDPNEPAKPWTPRCCLPHCHKRRDTDRVQLFRFPILNRPILVKWCHNLQLPLVGNAHRRLCSAHFESRVLSKRCPIPTSVPTLDLNTPPGYKIYTNPARLKAVKLRMQQTCCIASCARTRADGVQLFRFPHSRSMVRKWCHNTRQQAREAIRGQYRVCSRHFEPHAFGARRLMPGAIPTLNLDPEVEDAFANEAQGFAETQCVVNGCVASKELDGVRLFKFPHEDEDLLWKWCNNLKMNPSECNGVRICNRHFEDECVGPKMLYKWAIPTLALGHEDPDIKLVPVPPPEERYNDVVTKCCVPTCGKSRKFDDAQMNSFPKEIRFFRYWKHNLKLDYLDFKDRDKYKICSDHFEPVCLGKTRLNYGAIPTLNLGHDDTEDLYKVDPTQLQVSLFGKMRPCAAGSGEKPAEEH
EATAEKDDDVKCSYPNCTATKTLLREPYDMPAALPLHALWCAQMKVDTESLSETPKLCGLHFIKLYKSTLDAANALAEGDEALSEAMQQLTATYEKCRASPIVCAAQCCVTGCSSSQVSGSRTAQRLYLFPSPTSSGVELIEKWCENVGVAAADLNLYPHKVCARHFEAQCIGPTQRLRSWAVPTLYLKKKSGAEIHRIPELGSAVRPHEARCCVPSCARERGDFETMRLFSFPIIEEVLEKWLHNLQLTRSECARLRICDQHFESRCITKARLQRWAVPTLLLERKHEELLQNKPPVHAATGASAGRPPKVARAAVAGKDDDDDDDELGEDTAGSAELAVVPNVKMKKSLDTVKCCVASCRRSRLKHGVRLFPLPTTPSILRKWCHNLRMPVEKAADTSLRICSLHFHKRCIDGKELRVWAKPTIALGHTGPIDEIPKNLPGVFLPKCCLPHCRQRRTLENDLRTFGFPKDAVLLQKWYANLRMQKRTTHARICMSHFPPEAMGNKKLRNNAVPTLNLGHNEPLKYDNELLIASAPSAKLEKKNRLQKNKTTATAASDAGQEAKSAEADDDDADDAYGDDREEEEDDENGLYAGTGVANSDEEEEEETFEDFTPSLSAATGDIDDDETRSYAGRRAADKLNDNSDEDDDDNDDEDGEDEDDEDEDEDEEEEDEDDEEGEEDIYGSGADYIDNSAVAERLYNENDNDDDDEDEDEDESDIDNDGNALMNDESKDVDDEDDEDDDDDDDDDDDEDNDDDDDDDDDDDDEDVEMLIDDEGVKNRNTDCDPLNFMADDAMTTPLRPTAAAGAAKKRTAANKNAQKKPKTLLPAHFDSESNTNFSNADDTTRFTTTTMTSNSAANMSSAKAAAVGATKVRNCSATEKINRSVFRLSCLKHRRRKKKAPPDLTPPPFDSMQQKLNNLITHTRCAVPRCGAGVALYRFPPIGSRFCRDWCAQLNVNQSEAARLRICQRHFAYSLVDRRRRRLRFGAIPTRNLYNTMFARKPTTLQQAPVKQMPTKIKVDTNTSANSTAYVNAYNRCCLPSCGKTHRGDGVALFRFPKLRSLAIQWAHNLHLLPTSRLLQVYKVCSEHFESQCLSYQIKGRTTLKYGSVPTLKLGHNDDLTNTNNSSNSLALTQKRRKPGKRKSLPAKGLNECAVHDCRVAQFLQMQLFPLPNAQKLQERWCNYFKLTFTTTATIGNVTEFFENIRLCALHYMEGYQMATHSSDGVRKGKDKGSSSGNSAAAIAAALEEVEANYARITNCTRIQMMKCCVPNCSTKFTDNLRLTAFPGPEELRAKWQHNTQVTFSATHRYLYKVCALHFEERCFAKKRLFMWAIPTLNLPAAPLSNPAHKIYENPAVQVVGPSTRCCVEGCETNEKRQQEQTTTALAATDVVQHAAVGGDEKAANSNAAPPRLWHFPQDEKLCEKWCHNLALTGQTQQQISHTSRRWRICSRHFEDYCFGKALRPWAVPTLHLPKPAKQSKAGGGKRSTFIYQNPDSAALFYRCCIKTCRQQPDVDAGIRLYGFPKNDTMLQKWAHNVRMPAVKCSKARICNLHFEAQCMRPQMQAWALPTIDLGHDEADIFRNPKVKLKVTNGRCCLPHCNKRRQNDNVHLFAFPRDEQVLAKWWHNLGIAAQDAKHRMICDTHFEPRCIKLRRLKRWAIPTLNLGHSNELLKNPTPEEVLAFENKNASMRRSQTPSKMARTKATTGTGTTAQQSTASSSTQKCAIAGCERGPDADASTLYRFPKPDWLRKKWCENTRLSEEAAKQAKICARHFETQVMGNRKPRPWAIPTIELGVDAESGAPLAAVHANPTQLSLSRFHPEEHEYGELRYVRANHCSIISCMKFKTDGVTMFNYPTKKEMLQKWAENCRHYPYQAKRYRFILCGAHFTPDCFKREGTRLRKGAVPTLNLGHDDPHIHQSVFENAVTASADGLNKKYCSVPECGRNSLDDGVRLYKFPLERAEILEKWCHNLRISATDCRHALVCNMHFESRCIGA
GLRLLHRAIPTLLLGHNDREDIYENPESFERPEKVVCCCVPGCSNTKLTEGVQLSAFPKLRSQFEKWAHNLRLPCTSTVWHTYKVCSEHFESYCYEYGRIRVGAMPTLKLGHNDTHDLFTVSEDSMCYSLKRKRVPHKNESLKASQEACCYPGCKEMELRVGHHLYELPEWKPIRREWLKSMGLCDAADDEDNKKEKKQSVDAVKDEVIAGAEGKSVGEKTDTLESDLKIAKVEEVDKAQALKSENAVDAQGLQIKSTEITRKLCPLHFKLLYIQHESMIQTALQSTTAASRTAAERLCLQQLKDDMDIINDLSCIQRSSCAVPGCNSYSLMPNKSVKFFKFPDNETLRAKWCHNIQAKLDVDRLYCYKICEEHFESVCMSLATNRRLKSWSIPTLKLPARNDDMPDIYPLPAPEAMQETRRTALIAQTSVNKCCIHNCLHARAPEERGAPNGAEGGEVQFFNFPNDAELLYKWVYNTGVSMVQSTHARICSLHFEKHCINKRLRMYAVPTLLLGHNRTDIYLNPADKRDEVTTGVAEESKREEAEEELKDEDENEDEEETEVEEEMAEDAEQFAKTKKGTATAVEKQKHKLKAIKKAGKKLKSLKRAAEVKDVPQKAPKTSKKESNKPTPTTQTKPYHKPFILNVKEEKDSDDEAASAAANSIENQMRQGLLDMFHGFGGDGDSADEDMVDDDEEEEEEAEQHDDRFKKYKTKLENNQQTKQKEEQEKEKQRARQAQTENDGRPDDDDADGSDYGNLLDTMTSVERDTSRHCRIPDCTSHAKDPNVTLFKFPRSEHLFRKWLHNTQLRVDYTRRWRYRICQRHFEPICMKFRKLPPGTMPTLNLGPSRPARIHENSFDVKNLKKFKRKKPTMAEKARAAASAAAAQSAAVAAAAAATEYDGDTNSSFATAPKGDIHSLATAQQRATQVSPSPLLCCAIRNCTSKYHQLGEGVHLHKLPKNSLLRQRWIYNCRLSDEKVLSLGSRLRICTLHFAKNCFYGIKEQLKFGAVPTLRLGHTDCDIFQDGFSNAGESGFRQTPFIASAGKSGVMHDVCCLANCTRSAREYTRHFAFPTDKNILDKWLDALCMDLSGAQAKDYKICEWHFKSSDFDGEVLRFDAVPARYLRMTGGADANDETDEEDGYENENGGLNWHDNEKAALDERPSTSAAAAIQNTTTPDGYNRLIPGSRKCCLAHCGKQIFPDMVRTFKFPTVREQFDKWTHNLGVKYDGETPWRYQVCSEHFEERCLIHYENKSKLFRWAVPTLKLGKNAPAELFTNDKPHKVLQAMKGNFGNAGARGAGYGVNDDDETMDTTNDDENSNALPPIDIVPQVVYDDEDDDDNEQEEENDMDLLAPSSTTKRADAYTAKNNKNTNKNDNYDGNADDNDYNDDDDDYDDAAVVTAADRKASAAYDMAEENSLLDVIREERPSPVKEGFVGAPSSSSSTFFSQQFLRSSSASKERACCLMHCGRTRSTGVRLFRFPTEPMFLQRWEYNLRVRFSESQRNTHLICSLHFERSQYNKRLVADAIPTLNLGHNSPDIYRIGEFELNRINKPLPPLPPLQPPPSALLKPIVSSKTSATMAAKSLCNVPACNETQATRSLFPFPTNSTFVKIWAERTQIAYDQRYHAELRVCELHFDTDCFTGLNLNSNAVPTLRLPAPMSMPMLAAALAPSAATATATAAAAVAVAATTTKTTAAPLKQRQIAHAAAAASATATAQPTAVALKSAAPPPPPKLNAITCSVTNCGNSTAKRPDLKMFSKFPDDFELFTKWCFNLKIDPRTYVNGSYNVCSDHFEPFCIGGHSLRVWAVPTLNLGHNSKLIHSVERPAEMEMKCCLPHCGRSRSKDGVELFNFPKGELYRQWCHILRIEEGLYRNTDKKICSAHFRADCFANNGALRLGALPSLLLRNRTPTAAAHILKPPAPYRSKCIVRTCHEMQQLYSFPAQRILCTKWCHNLKIDYYPKLHENMNFKICRRHFEPNCLLSGGKLHADAVPTVQLG
HNDMNIYQNVIGMMMKSRGNTPSYDDNSSLRTSVSTVHNWLMDVDAETNAGQQMLAGGGGGCGNDNDLLDASGCDGPAAGPYEPRVALEPITVELDDDDDDDDVAGVVGVGMGAGVGGGGGMRARRPYMPPMAADDTYCADFSEQRLLPQSSTFLAAESAEVIDLDDVDAVQEQYPSWAQPQRTVDAVLVDDDEDEANVANDVNLASVDRHRGVGARAAAGAGDDDAFMWPC

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-