Basic Information

Gene Symbol
-
Assembly
GCA_949127995.1
Location
OX421844.1:46202169-46222688[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 43 8.6e-15 2.1e-11 44.8 5.8 1 86 569 641 569 642 0.83
2 43 1.4e-12 3.5e-09 37.7 6.5 1 87 670 739 670 739 0.81
3 43 2e-16 5e-13 50.1 0.5 1 87 761 833 761 833 0.85
4 43 2.4e-13 5.9e-10 40.2 3.2 1 87 898 967 898 967 0.80
5 43 8.2e-14 2e-10 41.7 4.7 1 87 991 1063 991 1063 0.80
6 43 3.1e-12 7.7e-09 36.6 1.6 1 87 1098 1166 1098 1166 0.79
7 43 6.5e-11 1.6e-07 32.4 4.0 1 85 1212 1280 1212 1285 0.73
8 43 1.6e-13 4e-10 40.7 0.5 1 87 1309 1379 1309 1379 0.80
9 43 3.4e-12 8.3e-09 36.5 3.5 1 86 1400 1469 1400 1470 0.80
10 43 5.1e-15 1.2e-11 45.6 2.6 1 87 1498 1570 1498 1570 0.87
11 43 8.7e-07 0.0021 19.2 0.1 1 59 1639 1697 1639 1713 0.82
12 43 9.2e-13 2.2e-09 38.3 0.8 1 87 1742 1813 1742 1813 0.81
13 43 5.3e-15 1.3e-11 45.5 1.1 1 87 1843 1914 1843 1914 0.81
14 43 1.6e-11 3.9e-08 34.4 5.7 1 86 1951 2020 1951 2021 0.82
15 43 1.6e-14 3.9e-11 44.0 0.3 1 87 2053 2122 2053 2122 0.79
16 43 3e-13 7.2e-10 39.9 2.1 1 87 2250 2320 2250 2320 0.78
17 43 3.3e-14 8e-11 43.0 1.5 1 87 2403 2475 2403 2475 0.81
18 43 5.7e-15 1.4e-11 45.4 1.2 1 87 2502 2571 2502 2571 0.81
19 43 7.5e-12 1.8e-08 35.4 3.1 1 87 2607 2678 2607 2678 0.80
20 43 1.5e-12 3.8e-09 37.6 1.7 1 86 2704 2772 2704 2773 0.79
21 43 1.1e-15 2.8e-12 47.7 1.4 1 86 2801 2871 2801 2872 0.81
22 43 1.4e-08 3.5e-05 24.9 0.1 1 62 2909 2971 2909 2991 0.74
23 43 1.6e-15 3.9e-12 47.2 1.0 1 86 3007 3076 3007 3077 0.84
24 43 1.7e-10 4.1e-07 31.1 0.0 17 86 3095 3149 3086 3150 0.76
25 43 3.8e-12 9.3e-09 36.4 0.8 1 87 3174 3243 3174 3243 0.79
26 43 9e-15 2.2e-11 44.8 1.2 1 86 3288 3360 3288 3361 0.82
27 43 3.4e-11 8.2e-08 33.3 2.4 1 86 3396 3465 3396 3466 0.80
28 43 4.7e-14 1.1e-10 42.5 1.8 1 86 3500 3568 3500 3569 0.79
29 43 3.4e-15 8.3e-12 46.1 1.7 1 86 3593 3663 3593 3664 0.81
30 43 3.1e-07 0.00075 20.6 0.4 1 61 3706 3758 3706 3781 0.76
31 43 1.4e-15 3.4e-12 47.4 3.1 1 86 3809 3881 3809 3882 0.80
32 43 3.9e-15 9.5e-12 45.9 1.1 1 86 3903 3975 3903 3976 0.79
33 43 3.9e-12 9.6e-09 36.3 5.6 1 86 4066 4139 4066 4140 0.77
34 43 1.7e-12 4e-09 37.5 1.5 1 87 4248 4322 4248 4322 0.79
35 43 1.5e-13 3.6e-10 40.9 3.6 1 87 4425 4499 4425 4499 0.83
36 43 1.7e-12 4e-09 37.5 0.2 1 87 4581 4656 4581 4656 0.79
37 43 1e-14 2.5e-11 44.6 1.5 1 86 4698 4771 4698 4772 0.82
38 43 4.5e-13 1.1e-09 39.3 3.9 1 87 4822 4898 4822 4898 0.84
39 43 4.5e-14 1.1e-10 42.6 0.4 1 87 5043 5115 5043 5115 0.81
40 43 5.1e-10 1.2e-06 29.5 0.3 1 87 5164 5235 5164 5235 0.80
41 43 4.5e-13 1.1e-09 39.3 1.6 1 87 5281 5353 5281 5353 0.82
42 43 6.7e-10 1.6e-06 29.2 0.0 1 84 5374 5441 5374 5444 0.72
43 43 1.9e-11 4.7e-08 34.1 6.2 1 86 5473 5540 5473 5541 0.87

Sequence Information

Coding Sequence
ATGtcgcaaaatcaaaacaaacaacaacagcagcaccacggcctacaacaacaacaccaccaccaacaacaacaacaatggcacgCAGCACATGTTGCTGAGGCCGTAGCAGCAAACGCCGCTGCTGCCGCGGGCTTGAATATGTTTACAAACAATTCATATAGCGGCGGGGGTGGAACGAGAAATGCGTGTAATATGAGCGGCGGCGGTAGTAGTGGTGGTGGAGGATTTGACCTTGACATTGTTGGTCGCGCCTCTGCTACGCCTGTGTATGATGCATTTTCGCAGTCTTCGCCGCAGTTTCAACAAGTGCTggcgcagcaacaacatcaacagcaacaacagttgATGATGAACATTGCTCCAGCGGTGCACATGCTCACGCCCACTATCGAAATGGatgaattaattattaaaacTGAACCGCTCGATGAACCTTGCAAAATCAATTTAGTCGATGACGATAGCGTTTCCTATGTTGATtataccaaacaacaacagcagcagcagcaactcaCCCAATACCCAACAATGTCAATATCTAAAATAGAACACATTAAGGAGGAACTTTTACCATCAGCGTCGCAACAAAAGTCACTCAACTTTCCGCGTCGCAAAATGCAAACCGAACGTTCTGAGACGCTGCCAATTTGTCAACGTTGCAAGCAGGTCTTTTTCAAGAAATACAGTTACACCCGCCACGTTGCACAGTCAGCGTGCGATATACTCGAATACGATTTCAAATGTGTCATTTGTCCGATGTCCTTTATGTCATGCGAGGAGCTGCAAATGCACGAACAACTACATCGCGCGAACAAGTTCTTTTGTCACAAATactgtggcaaatttttcgaTACGATCGACTTGTGCGAGGTGCATGAGTTCATGCATCACGATTACGCTACTTTTGTGTGTAATATTTGCACTGCTACATTCCCGAAACGTGAATTCCTGTTCGCTCATATGCCGCTGCATGAACACCAACAGCGCTACGATTGTCCTGTGTGTCGTTTGTGGTTCCAATTGCCGCATGAACTGCATCAGCACCGTTTGGCCGCGCCCTACTACTGTGGAAAATATTATATCGATAgcgTTCCTGTACTACAGGCACCACGTGGAGCGTCATTTTCTGCTGGGGCCGGTAATGTGCTACAACAGAGTCAAAAATCTTATAATCTACAAGATTGTCGCATGGGCATCATGGAAGTGCCCtcgataaacaattttgcgcCCTTCGCTAACGATCGACTCTACCAGCCGCAAAATAACTATTCAACTGGTGGCGGTGGTGCCCCCATTAATATACACCACAACAGCAACTTCCAAATACCGCATCCAATTAAAACCGAGTTTAAAACTGAACCGGATTTTTATCCAAGTGATTATGAGCACAATTTTAATGACTTTACAAATGATTCCTACACCAGCAGCGGTGGCGGCAGTAGTGTGCATAACGCTCTGTCGCTGTCGCAGCCGCCTCCACTGCACTACAATACCGTCGAGTAtcaacgacagcagcagcaacaacaattacaacaacaacaacatcaacgccaacataatagcaacaacaacaacatatttaaCATATCCAATTTGTCCGCCAgtagcaatagcagcagcatcgTTACCCCACAACTCGTCCCAATCGAGCCATCACGCATTATCGCTGCGATGGATAATGTTTCCGATGAAGACGCACAGTGCTGTGTACCCAAATGTGGCGTTTGCAAAAGTACCAGTCCCACActacaattttttgaatttcccAAAGACGAGAAATATCTGCACCAGTGGCTGCACAATCTCAAAATGTCGACTAATCCGCCAGGGAATTTGGAAAACTTTCGCATTTGCAGTCTGCACTTTCCGAAGCGGTGCATCAATCGTTATTCGCTTTGCTATTGGGCTGTGCCAACTTTTAATTTGGGTCACGATGATGTCGCCAATTTGTATCAGAATCGCGAACATACGAACGCATTTGCAGGTGGCAGTGAATTGGCGCGCTGCAGTATGCCCGGTTGCAGCAGTGAACGTGCTGCATGCAACAttaaattctataattttccgaaagaaacgaaaactttGATTAAATGGTGCCAAAACGCTCGTCTTCCGGTTTATTGTCGCGAACCACGTCACTTTTGCGGCCGCCACTTTGAGGAGCGTTGCTTTGGCAAGTTTCGTCTCAAGCCATGGGCAGTGCCAACGCTTCACTTAGGTACGCCCTTTGGCAAAATACACGACAATCCTGGCGTTCTCTATTTGGAGGAAAAGAAATGCTGCCTGTCGCATTGTCGGCGGACGCGTTCttctgattttaatttatcgcTTTATCGTTTTCCACGCGACGAGACCATGTTACAACGCTGGTGCTATAATCTACGCCTTGAACCGGAGGTTTATCGTGGCAAGAACCATAAGATTTGTAGTGCCCATTTTGTTAAGGAGGCTTTGGGTTTGCGCAAGCTGTCGccagGTGCTGTACCTACACTTAATCTTGGTCACAACGACACCTTTAACATTTACGAGAATGAGCTGTACACTCCACCACCGCCTCCACCACCATCTACCGCAAAGATGTACAACTTCCATAATACACCGCTAAATTATCCTCCAGTTTCGCCAACTAATTACACTACTTCTAGTGCTTCGACGTCAACCCCAAACCTATACGATTCCATTGATGTGTGTTTTGTGCCGAGCTGCAAGCGTAATCGCAATATCGACAATGTAACGTTACACACCCTACCCAGACGACCCGAACATTTGGAAAAATGGTGCCACAATCTTAAGTTGGATGCCGCCTCTCTGCACAAAGCTGTACGCATTTGCAGTGCACATTTTGAGTCTTTTTGTATCGGCGGTTGCATGCGTCCTTTTGCTGTGCCTACGCTCAATTTGGGACATGATGATCCGGACATTTATCGGAATCCCGAAGTCatcaaaaaactaaacattcGCGAAACCTGTTGTGTTCAAATATGTAGACGCAATCGTGATCGTGATCATGCAAACTTGCATCGTTTCCCATCAAAACCCATTCCCCTGGAGAAATGGTGTCAGAATCTACGAAAACCGGTGCCGGATGGATCGAGGCTTTTTAATGATGCCATCTGTGAGATGCATTTCGAAGACCGATGCTTGCGCAACAAACGTTTAGAAAGATGGGCCGTACCAACGCTTAATCTTGGCTTTCCTGATGTCGTACACGAACTGCCAACAGAAGCCGAAGTCGCTGAGCTTTATGCAAAACCCAGCGCTCCTAATATCGGTGACGAAGAAGGTGAATGTTGCGTTCATACGTGTAAACGTAATCCTCAAGTGGATGACATAAAGCTGTATCGCATGCCAGAAGATGCTGAAGTACTCGCAAAGTGGTCGCACAATCTGCAATTAGATGTTGATGTATTGCCCGTGTTGCGCATATGTAACCTACATTTTGAGTCGCATTGCATTACTAAGAAATTACATACGTGGGCCATACCCACATTGAATCTTGCGAACAATGTCGAAAATCTCTTTGAAAATCCAGAGCCTGTACTGGTCATtaataaaaagattaaaatcaaaaaagagcGAGTGTACAAGTACCATCTTAATCTCAATATGGTTAAAACGTGGTTGCCGCGATGCTGCTTGTCACATTGTGGTAAGACACGTAGCCAAGACAACGTGCAGCTCTTTAGGCTACCCACTCTTCATCGAAATATGTTGGCGAAATGGTGTCACAATATCCAACTCCCGATGGTAGGAAGTTCGCACCGTCGAATTTGTTCGGCGCACTTTGAGCCGCATGTTTTAACGAAAAAATGTCCCATGCATAAATCGGTGCCAACGCTAAATTTAAACACACCACCCGGCTATAAGATTTATCAAAATCCACCGCAACTCAAGCTCGCAAAGATGCGTCTTCAACGCGTTTGTATCATAGCGTCGTGTCGAAAAACGCGCGTTGATGGGGTACAGCTCTTTAGGTATCCTCATAATCGCGGCTTTCTTCGAAAATGGGGCCACAATACTCGGCAGTCCACTTCTCAAGCGGTACGTGCACAATATCGTGTCTGTTCCAGTCATTTTGAACCACACTCCTTTGGTTCGCGTCGTCTATGTCCCGGAGCCATTCCTACGCTTAATTTAGATCCCGAAGTTAAAGATCCATTTCCGAACGAAGCACAAAGCTTTGCGGAAATGCAGTGTTGCGTCAACGGTTGTGATGCCATCAAAGAAACCGAAAATGTGcgtctttttaaatttccgtGCGACGATGAAGATCTGCTCTGGAAATGGTGTAACAATCTTAAAATGAATCCGGTCGATTGCCAGGGTGTGCGTATTTGTAATCGTCACTTTGAAGCGGACTGTATTGGTTCGAAGCTACTCTTTAAATGGTCTATACCCACATTGGCTTTGGGACACGAAGATGCTGATATTGAACTAATTGCAAACCCGAAACCAGATGAACGTTACGTCTCGGAGGTACTATTTAAGTGCTGCGTGCCAACCTGCGGCAAAACGCGAAAGTACGATGAAGCACAAATGAATAGTTTTCCAAAGAATAGCAAGAGTTTTCGACGTTGGAAGCACAATCTTAAATTGGACTATTTGGATTTCAAAAAGCGAGAGAATTACAAGATCTGTAATGATCATTTCGAAGATATTTGCGTGGGAAAGACACGTTTGAATAATGGTGCTATTCCAACACTTAACTTGGGTCATGACGACACTAGTGATCTATATCAAGTCAAACCTGAACAGATTCAAAGTTCACTGTTTGGCAGACAAAAGTCTGCCGTCGTTGCTGGAGATCCTTCTCTTGCAAATGATGAAGGTAATGGTGATGAAGAAGTCGATGAAGATGACCACAACTCTGAGACAATTTTACCCGATTCGTTTACTGAAGTCGATGCAAAGTGTTCCTATCCATCGTGTACTGCTTCAAAAACGGTGCTCAAAGAAACGTATGACTTACCATCTGTCAAACATAAGCAACTATTTGAACTCTGGTGTCAAAATATGCAGGTTGACAAAAATGAGCTACTAGCTGAATTCGAGAAATCCTCGCCAAAAGTTTGTGGTCTGCATTTCATGCAAATTTACGATGCTACATTAGAAGCTACTAGCATTTTGGTAAAAGACTATCCCGAACTGGACACAGCACTGCAAGCACTCAACCTTGCCAACGAACGCTGTTGTAATTCTCTAATACTACGCAGCGTACAATGCAGTGTCCGCGGTTGCGTAACATCCCAGTTGAGTGACCACAAACTCTTTCAATTTCCATACAGTCGTGAGCTTATCGAAAAGTGGAGTTTTAACACGGGCATAGAAGTGGATGAACATCGTCGTTACTTGAATAAAGTATGTTCGTTGCATTTCGAGGATAATTGTCTTACTGAAACACAACGTCTGCGTTCGTGGTCCATACCAACGCTGCACTTAAATCATGATGAACCAGATAAGATCCATCAAAACCCTAATTTACTATCTTTGGAACGTCGATTGCTGGGACCTGTGGCATTGAAGTGTTGCGTCGCCAACTGTGGTCGTGAGCGAACAGACGATGATCCGTCAATTAAACTGTTCAACTTTCCCACGGATGAGGCGCTATTGGCTAAATGGGCCTATAATCTGCGTTTGGAGCGTGAACAGTGTCCACTGTTTAAGCTGTGTGAGCAACATTTTGAACGCAATTGCTATGGCAATACACGTTTGCGATCTTGGGCATTACCTACCCTTAATTTGGGGCATAACGAAGCTGATATACATCTAAATGCTGAAGTAAGAGCCGAAGAAGATGAACCTAAGGTGgagaaagttaaaataaaaaaatcgttggaTTCAATTAAGTGTTGTGTTCCCAGTTGTCAGAAGAGCCGTTTGTCGCATGGTGTGCGTTTGTTAGACTTTCCAACGGGACCGGTGATGGTTAAGAAATGGTGTCACAACTTACAGTTGCCTTTAGTTGTTACTAAGAAGCATCCTCGAATTTGTAATACTCATTTTCACAAACGCTGCTTTAAtgacaaacaattattttcttgGGCCATACCTACCATGCAGTTAGGACACTCCGATGTTATCTACGATAATCCAAAACTTGTGCAACCCAAACTTGTAGAATCAAAATTATCATCTAATTCTTTTACTTCGATTTGCGCTCTATCGCACTGTGGAAATAAACGAACTGCAGAGAACAATCTGCGAACTTTTGGTTTTCCGAAAAGTGAAGAATTATTAGAAAAGTGGTGTGACAATTTGAAGTTAACACCTGAAGACTGTAAGGGACGAGTATGCGTAGAGCATTTTGATGTCGAAGTAATGGGCAATCGCAAGCTTAAATTTGGCGCTGTACCGACACTTAATTTAGGACACGAAGCGGAGCTTAAGTACAcgaatgattttttaatacaaaacataAAGCAGACTCTAAATAGTTCCACGTCTACAGAGCAAGACAATAGTTTTGAAGAGTctcaaacatttgaaaaaagttttgatctATGTGCTACCGATGATGAGGAAGAAACAGAAGAAGTAGAAACCGACGtagaattaaaaaccaaattggTGGCCACTGACAATGAAGAGTCTGAAGTGAAAAGCAATGAAACAGTTGCTTCATCGAAATTCGACAAGGAAACAACGTTTGATAAAGTAAAGGAGctgacaaagaaaaacaaagacaTTTCGAAGGTAATAACACATAGCTTGAGAGATAAGCCTGTAAACAACTGTGCGCCCATTTGTTGTCTTAAACATTGTCGCAAAGAAAAGACACCAGGTCATCATCTGACCACATTTGGGTTTCCAAAAGACAAAGAACTACTTCGACAATGGTGTGATAATTTGAAACTACGTCTTGAAGACTGCGTTGGACGCGTGTGTAtcgaacattttaatttaaacatgatAACCAAAAATCATCGTTTATTGCCAGGAACAGTACCGACACTTAATCTAGGACACAATAATATATTGCAGCATGCAAACAGCAACGACACAGATTTCAAAGATGAAGCCCAGTCAACAACTTTAGttaataaagataaaaataaaacaccacagtcaacaaataaagaaaaaaatgtgacTCAACAATCGACAGCAAACCTGAATGAAGATGACAATCCAGAAATCATCAATAGTGATACCAATTCCTCGTCATATTACGCTGAAAAGTTGAGCCATTCGGTTTTTCGGATTTGTTGCCTCAAACATTGCCGTAGAAAGAAACCGACGGATCAAAATTGGCACAGCTTCGGTTTTCCAAAAAATGAGGACTTACTTAAGAAATGGTGCGATAATCTGAGGTTACCCATTGAGTACACAACTATTACTGGACGGCGAGTTTGTGACGCACACTTTGAACCATCTACTATGGGAAAGACAAAGTTAGCTCCTGATGCATTACCTACATTGAATTTGACGCCAGATGATGATAAGCCAATACACAACAACTCCAAATTTACAACAGACTATGTGATACAGGGAAATAAGTCTTGCAGCGTAGCTGACTGCGAAAGCACCACCAACTATGAATATCCTGCTCTTTTTGGATTCCCAAAACATTCTGAACTTAGTAAAAAGTGGTGTGATAACCTTAAACTAGAACTGAAATGGTACACTCGTTATCGTATatgcaaaaaacattttgaggCTCGTAGTATAGCTTCTAAACGCCTGCGTCCATGGGCTATACCAACACTAAACTTAGGGCACTCCGAAGCACCGCTGCACCGTAATCCCACGAagcttaaaaagaaattaggTGTTAAATCAAATGCTTCTCAGCAAAGAAAAAACCCGATGTCGCGTTGTTGTATAAAAACTTGTGGCTGTGATGCAGTTGAATTGCATGATCATTTCAACTTTCCAAAACGTGGCGCAATTTTGCGAAAATGGGCTCAGAACACCAAAATAAATGCGTACATAGCAACGCGACGAGAATATCGTGTGTGCCGGCGACACTTTGAGGCAAACTGTTTTACTGAGGATGATagacttaaaataaatgcagtaCCTACTTTACATTTGGGGCTTGATAGTTCTGACTCTGCTGCTTTAAAAGTTGATGTGGACGATGAGTCGACGACCAGCCTGCTGTGGTGTTCTCTGCCAACGTGTAAGCGTTCCGTGACAATTGATCGTGTGAGAATTTATCCATTCCCAAAAGATCACGAACTACTTAAGAAATGGTGTAATAACCTGCAGTTGTCAGCTGAAACTTGTTTGgactataaaatttgtaatcttCATTTTGAAGCAAATTGTATTACCAAAATGAAACTACATGAGTTGGCTGTACCTACTTTAATGCTAGGCCATTCTAAAGAAAGTGTGGCACTATATCAGAATACTGTATTAAGTGATAAACCATTATTGACGTTCAATGTAACTTGCTGTGTACCCGGTTGCGGCAAATCAAAGCAAGAAGATGGCATTCGCATGAATAGTTTCCCAAAAAATCGTACTCTATATATGACCTGGATGCATAATCTCAAACAAAAGTCGTGCAGTAAAGTTATAAACACGTACAAAGTGTGTGACGACCATTTCGAAGATGCATGCAAAGGCAAGATGCATTTAAAATCTGGAGCCATTCCTACGCTAAAGTTGGGTCATAGTGATGAAAATATCTTCCGCAACAATCTTAAGCTGCTTAACAATAGCGGCAAGCgcattaaaaaacaagtagATGAGGCTACACCGAAAGCCCTTAAATGTGTTGTGACCGACTGCAATGTTGACACcgttttaaatatgaaacttTTTGCATTACCCGTAAATGAGGAATTGCGACGCAGCTGGTGTGCGTTTTTAAAAGTAGACTTGGTGCCACAGACGGTGACTAATAAGAGCAAATTATGCGCGCTACACTTTATGCAGGCTTATCATGGCGTTGCTAACAACATagaacaacaagaaacatCTTCAGAGACTACTGCTCCACCCGATgataaactaaataataattatgaaaAACTATGTAATACATCATTTATTAAACGATTTAACTGTTGTGTAGACGGTTGTAATACAAATTACACCCACAACATCAAACTCTACCAGTTTCCAACGAATCAAGAGCTTTGCgaaaaatggcaacaaaacACTAAAACCATTGTCGATATAAATCGTCGTCATTTATATCGCGTTTGTGGTTTACATTTTGAGGATGCCTGTTCAGGCAATGCACGTCTTCGTGCGGGAGCCGTGCCGACTCTTCGACTTAATCATAATGATGATATTTTCCCGCATTTGACTCGCGACGTTGATGCCGGTATTAAGCTGTATACGTTCCCGAAATTAGAGGATCTATATCAGAAGTGGGTGCACAATATTCGCATGGATGCAGATACATGCCGTAATGCTCGCATTTGTACATTACATTTCGAAAAACGTGTCGTTggaaaaaaactgtttaaatttgCCATTCCAACGCTGGATCTTGGCCACAATGAAACGGACATCTTTCCAAATCCAAGgatgaagaaaaagaaaatgcaatcTGAGCAATGCAGCTTGCCAAATTGTGGCAAAATTCGCCGTTTACATCATGTGCGTCTATTTTCTTTTCCAAGTGATGATAGTGATATGTTGGAAAAGTGGAGACACAATTTGAACAACGTGAACCAGTTGGAAAATGGTAAAATCTGCAACGAACATTTTGAAGCTGGCTGTATTCGTTTAAAGAAACTACAACCCTGGGCGTTACCCACTTTAAACTTAGGTCATTCAGATATGATTTATGAAAATCCACAATTTAAACCAACGGAGAAACCTAAACTTAAGTTGTCGAGACATGATGAATCTCAATCTACATCATCAGTTCATCATAGTCGTGATATGTTACACAGCTGTTGCATACCAACTTGCCACCATAAATCTTTAACCGAAGATACAGCTGAAGCTGTACAACTGTTtggttttccaaaaatttcatgGCTTCGTGAAAAATGGATCCAAAATACGCGTACATCTATTGCAGAAgccttaaattcaaaaatttgcagTGAACACTTTGAGCCACACACACTAGGCAAGACAAAACCGCGACCATGGGCTATACCAACTCTGGAACTTGGTCACAccgataaaatatttgaaaatccaAAACAACTACATAAATATCATCCAGAGGAAAAGGAATGCggtgaaattaaatatgtgCGCAGTAATTATTGCGCCATAATATCCTGTTTGAAATCGAAAAAGGATGGCGTGCATCTTTACAAATACCCCAGAAAGGGATTGCTCCTGCAAAAATGGGCggaaaattgtaaacattATAAACACCAAGCGGCCCgatattgttttcaaatttgtagcgAGCATTTTGAAGAGAATTGTTTCAATGGATTTCGCTTAAATTTCGGATCGATACCTACACTGAAGTTGGGTCATAATGACGAAGACATTCATCGTAGTGAAATGCAACGAAATGACGTCGAAAAGAATGAATCAACACAGAGCCTTCAAGTTGAAGCTCAGCACTGTGGTGTACCGAATTGTAAGCACACAAAGTCGGCagataatgtaaaattttttaaatttccagaAAATGCTGAGGTACTCACTAAATGGTGTCACAATCTCAAATTGTCGTCTGAAGACTGTCATCAACTACGCATATGTAATATGCATTTTGAATCACGTTGTTTTGGCGGTGGACGACTTCTTCTACGTGCGATACCCACTCTACTCCTGGGTCATATGGATAttgatatatttgaaaatcccGATTCGTTTGAGAAGCCGGAAGTAATTGTTCGCTGTTGTGTAGCAAAATGTGGAAAATCGAAATCGGAAAATCAAGTGCAGTTGAGTCAATTCCCTAAATTACGTTCGCTTATCGATAAGTGGCTTCACAATCTAAAGTTGCCGGCAAAGTCGGAAGTCATGCACAATTATAAAGTCTGTTCCGAACATTTTGAGAGTTATTGCTACGAACATGGACGCCTGAAATTCGGTTCTATACCCACATTAAAACTGGGTCATGAAGATGATGAGATCTACCATATGAATCAGGAAATGCTGTTGTGTAAATCAAAGCGTAAGGCAGCAGCTTCTACTAATGCAGATACGTCTTTAACAGAAACCAACGACACTAAATGTTGTTACTCCGAGTGCATCGAACTTGAAAAGACAAAGGTCAAAGGCTTTGCATTACCACAACtggaaactttaaaaagaGAATGGCTGAGAAGCATCGGTTTATCAAACAATAAATCTGAGGAGTTAAGACTTTGCGCTATACACTTTAAACTTACCTATGACAAAAacttaatgcatttaaaaaatgtacaagaGTCATTCGAAAACGTGGACGATGCTCATAAATCGAGTATAACCGCATGTCTCGATAAATTAAGTTCTGACTACACGTTTGTCTCAAATATGTCTCGCATCCGAAGCTTAACTTGCTCTGTACCAACCTGCAATAACAGCCAGATAATAAGTTCAATTAAACTCTTTCAATTTCCCTACAATCGTGAATTAATGGAAAAATGGTGTCATAATACGCGAATTACTTTTGATGATGATCGTCGCTACATTTTTAAGGTGTGCGCGGATCATTTCGAAAGTGATTGTTATGCGGAATCATCCAAGCGTATTAAAAATTGGGCTATACCAACGCTAAAATTACCACACGACGATAGCGTTGAAATCTTTAACAACCCTGACTACTCCATCAGTATGAAATGCTGTATATCCAGTTGCGAAAATGCCAAACTAGAGGAGAAATTACCATCTACCACGATGCTACTTTTTCAATTCCCACGCGATACGGAAATGCTTAATAAATGGCTTTATAATACCCAGCTCACATCACGCAAAGCCTTTAATGCACGTGTGTGTGCCTTACATTTtgagaaatattgtataaacaaGCGGCTGCGCAGTTGGGCGGTGCCAACTCTCTTGCTAGGACATGAAACACCGGATTTATTTCAGAATCCCACAAGTAAAATTGACAGTTTAACCGAGGAAGTACCAAATGAATGGAAAACCCAAGATGATATCGAAGCTATAGGCGATGAAACCTTAAATGACACAACTCAATCCATAGAGTATCCAGTATCCAGCAGAAAGTCAAAGAGAAATGCTACATCAATGCTAAAAGATATGATGGAAGTAGAAAGCATTGAACGAGCCATATCACCAGCAGCTCAAAAAGTCAGCAGCAATCATTGCCGTATTATTGGCTGTAAAAGTCATACGGCCCAAAAAGGCATAACGATGCATAAATTTCCCTTGCATGACAATGTGTTTCAAATGTGGGCGCATAACACTCAGTGTAAAGCTGAAAAGAAATCACTTTGGAAATATCGAATTTGCAATCGTCATTTTGCGTCGGAATGCTGGTTAAAACATCAACCGCCACGTTTTCGTCATGGCACAATGCCAACCTTATATCTAGGTCCAAATAGGCCAACAATGATTtacgaaaacaattttgaacttGATGGAGGCATTAAGAAACGTTTGAAAAAATCGGCTAGGACAGATGAAGAAAACAATCCTGCTGATACATCCTCGCTAAATGAGCAAAGTTTGCTTCCGCTTGAAAACGCCGAAGGAGAAGTTTTTACAGACGACTGCTCAGATATGAACCTGGAGACAAACGAAGTCGTCGAACTTGGCGACGAATCAGTTGACAGCATCAAAAAAGTTGTAATGGAtcgaaaattaacaacaacaagcccGCTAcctaaaatatcaaaaacaacacgTCATTGTCGAGTGGTTGGATGCTGGAGTTTCGTAGGCGCAGAAGGCATTACAATGCACAAGTTTCCAAAGATTGAAAAAGATTTTCAGAAATGGGTTCACAATACACAATTAAAAGCTGATATAAACTTTCGCTGGAAATTTCGCATTTGCGGTCGTCATTTTGAACCGGATTGTTTTTTCCCTAGTGCGAAACCCCGCTTCCGCATGGGAACAATGCCAACTTTAAATCTGGGCCCAAATAGACcaccaaaaatttatgaaaatgtatttatcgCCAGTAAAAcgaagacaacaaaaactaaagaacTGAACTTAAGAACGCTTCAGGCAGATGCGGAAGATGACAATGAACATTGGGAAATGAACGAAGATGAAGTCGAAGACGATTTAACTCACAATGAAACATTAAACGAAAGTGAAATTGAAAACTCAATTTCTTTGAAGCCAGCAGAGCAGGTACTAAACGTCACATTGGAAGTAAATCGCATAGAAGCAGTGACCACACCGATGACCACTTTGGAAAATAGTAACTGCTGTCGAATTATTGGCTGTTCAAGTCATAAAGGGCAAAAAGACATTATGATgcataaatttccaattaataTGGATGCCTTTCAGAAATGGGCACATAATACACAACTTCGATTCGATAAGAATTTGCGTTGGAAATTCCGGATATGTAGTCGACACTTTGAGCCCAATTGTTGGCTTCCAAGTCGACGAACACGTTTTCGAGTTGGTACCATGCCAACTTTACATCTGGGTCCAAATCGACCAGCAACAATACACGAAAACGATTTTATAGCCACCGGCAATGAACTGTCAATGGAAGCAGTTGATACCACCACCAACGATGGTGTTTCTACACAAAATGAACTGAGTTTGCTGTCGCTGCAGGATGATGACAATGACAATGTCACAGACAACTTGGACCTTGAATTGACGGCGCAATCTTCAACCAACGACAAAAAGTCAGTGCGCTCTTCTAGTTCTCATCTTATCTGTGCTATAGCTACGTGCGGGCGTCGCCTTGATCCTGAAAATATGCGTCTGCATAAGCTACCGCTTAACGGTATATTACAGCGCAAGTGGATGTACAATTGTCGCCTAGAACCGAATGTCACTGGTGATAGTAACTTTCACACACGTATCTGCAGTCGACACTTCGAGCAGAAGTGCTACTACGGTACAAAGTTACAACTACGATTCGGCTCACTGCCGACCCTACACTTGGGCCATGACgatgtaaatatttatcagaACATTCACACGAACCGTGGCGGTGCACAtatcgatgatgatgatgatgaggtAGAGGATGATCGCGATACGGAGACAACAGCAACCGTTACTTGTTGTGTGCGAAATTGTGTACGTAGTCATTATCGCGATGATGAAACTCGATTCTTCAGATTTCCGCAAAGTAAGAAGGTTCTTTTCAAGTGGCTGTTCAATCTTAAAATGGAATACGATGCTGCGCGTCCATGGGATTACAATATTTGTGAATTGCATTTCGTTGAGGATGCTTTTGACAGCGCCAATAATCGACTGCACGAGTACGCTGTGCCAACTTTACGTTTAGGTCGCAAGGTTAAGACAAATGAGCTGCGATCGATGAGCGTACCTCCTGTAGCAAATATTGCAACGATGCAGCTTGGAATCGAAAGTTCTACTAGCGCCAACACCGTTCACAACtacaatgaaattaaatctGGATATAAGAAATGCTCGCTTATCCACTGTCAAAAGAGTCAAAAGAAGgaccaaataaaaacatacgGATTCCCGAAAACTATCGACCAGCAAGAGAAATGGGAACACAATTTACGGATTAAATACGATGCCGTACGTCCGTGGAAATACCTTATTTGTAGTGATCATTTCGAGAAGCAGTGCTTCCAGGAAAACCGAAAGTATATAAACCAGATTTTCAAATGGGCGTTGCCAACGTTAAATTTGGGTGATAATACGCCCACTGCACTGTTTACCAATGAAAATGCACGCGAACAATACGAAGCGCAATGTGCGCCTAAAGGAAAATCGAAATCAGTCAAAAtggcaaaaccaacaacaacaacagtgaaaGACGCACCTGGCTTTAgtgaaattgaagaaaatgacgacaaagacgacgacgacgacgttggGGTCTTAGCGGCTGAAGAGGAGGACGAGgaggaagaaaaagaagaggaGGTGGAGGCAGAACAGGAAggagatgaagaagaagaggaactGATTGTACCAATGGTAAAACGCATGCGTCTGGACCTCAACTATGACAATTCTAACGCTTCTGAATCCGTTGGATCGCTGCTCCAGTTAAGCGATGGCGGCGGATCATCGTCCAACTTTATGCCCGGCACCTACAAATATGAAGCAACACACTGCTGCCTGCCGCATTGTAAACGCGCGCGTACTTCTGATATACGTTTGTATCGGCCGCCCACCGCACCTGATTTGCTCTCCAAATGGGAACACAATCTACGTATGAGCTATACTGGACGTAGTGTCAATCGGACGCTCATTTGCAGCGAACACTTTGAACCGCATTTGGTAAACAAACGTATCAAGCAGTTGACGCGTGGTGCAATACCAACCCTGAATTTGGGTCACTCGCATACATCAGatctttacaaaaatagcTTTGAAAACTTGGTGCCGAAGCAGAGCACAACTCTTTCTCCGACTGTGCAAAAAATTAGAGAATCTCCACAGGCGCCGCGGCCACCGCCACGAACTGCGATACGTTGTCATGTTGCTGGCTGCACGTTGCGTCAagcatttgacatttttccatttcccgataatcaaaatttcataaaaatttgggtggAACAGACACGCATCGCTTACGTGCCTACAAAACATGACGAAATCGGCGTCTGTGGTGCACATTTCGAGCCAGCTTGCTTTAACAAGGTGGTAAAGGGCAAACTTCGCAATACTGCTATACCGACGCTGCATTTGCCCGAACAACCTAAGCCCATTTCAATGATTGGGACCGCTCCACCGAAACCACTACCATCGCGAATTCTGCAGCAAGTTGTCTCGTCGAGTGTGTCGGCGGTGTCGTCATCTTCCGCCGCCACTGCAATCGCTTGTAGCGTTCCGCAGTGCCACAACACGAGCGCCATGCATTCCATTAAACTGTTCTCCAAGTTCCCCGAAGACTTTGAGCTCTTCACCAAGTGGTGTTTTAATCTCAAAATCGATCCGCGCAAATATGTCGATGGCGCTTATCAAGTGTGCAGTTCTCATTTTGAGCCATCCTGCATTGGCGGTCACAGTTTGCGCGTTTGGGCGGTGCCAACACTACATTTGGGCCATTCCAGTGGAGAAATTCACAAAGTAGAGCGTCCAGCCGAAGCGGAATTGAAGTGCTGTCTGCCGCATTGTGGGCGCAAGAAGAGTTTGGACGGTGtgaatttttacagttttccAAAGGGCGACACTTACAAAGCGTggtgtcaaattttaaacatcgATGAGGGTCTCTATCGCATTAAGGATAAAAAAATCTGCAGCGCCCACTTTACTGGCGATAGTTTTATGGGCGCCCTCCTAAAACCCGGTGTTAAACCTATGCTGCTGCTGGGACCCAAAGCAACAACCGCTGCCGCTGTTGCCGCCGTTCCAAAAGAAAAGCGTTTGGGCGCAGCGATCTTTGTGAACAAATGCATCGTACGCACCTGTCATGCGACGCAGCACCTATATAAGTTTCCCGACAATCGAAACTTGTGCGTAAAATGGTGTCACAATCTCAAAATCGACTACGACAAAAAGTTGTCGAAAAATCCTAACTTTAAGTTGTGCCCCAAACACTTCGAACCGTCCTGCATGCGCCAGGGGCGACTACATACGGAAGCAGTACCGACTCTACAATTGGCTCACGCCGATCCGAATATATTCCAGAATACGGCCTCGTTcagcacacagacacagactaCGGCGATCACAAGCGCCGCTAAGGGAGCGCCCAGCACGCCCAGCTATGACGATAACAGCAGTTTGCGCACCAGCGTCAGTACCATTAATACATTGACCATGGACGCTGATATTAGCGCCGATATGGACATTGAGGACGACACATATTATGCCGAGTTTGAGGCGCGACAGATGCTACCGCAAAGTACATTTGTTGTTGGGGATAATGTTGTCGCTGCCGGTGGTGGCAGCAGCGGTGTCATTGCGGATGTTATTGATTtggatgatgatgacgacgaggACGATGACGATGAAATGGCCGCTTGGCATTCTGACGGTGCTGGCGTTGATGACGAAGATGAAGACGATATGCTGTTGCAGCTGGATTGA
Protein Sequence
MSQNQNKQQQQHHGLQQQHHHQQQQQWHAAHVAEAVAANAAAAAGLNMFTNNSYSGGGGTRNACNMSGGGSSGGGGFDLDIVGRASATPVYDAFSQSSPQFQQVLAQQQHQQQQQLMMNIAPAVHMLTPTIEMDELIIKTEPLDEPCKINLVDDDSVSYVDYTKQQQQQQQLTQYPTMSISKIEHIKEELLPSASQQKSLNFPRRKMQTERSETLPICQRCKQVFFKKYSYTRHVAQSACDILEYDFKCVICPMSFMSCEELQMHEQLHRANKFFCHKYCGKFFDTIDLCEVHEFMHHDYATFVCNICTATFPKREFLFAHMPLHEHQQRYDCPVCRLWFQLPHELHQHRLAAPYYCGKYYIDSVPVLQAPRGASFSAGAGNVLQQSQKSYNLQDCRMGIMEVPSINNFAPFANDRLYQPQNNYSTGGGGAPINIHHNSNFQIPHPIKTEFKTEPDFYPSDYEHNFNDFTNDSYTSSGGGSSVHNALSLSQPPPLHYNTVEYQRQQQQQQLQQQQHQRQHNSNNNNIFNISNLSASSNSSSIVTPQLVPIEPSRIIAAMDNVSDEDAQCCVPKCGVCKSTSPTLQFFEFPKDEKYLHQWLHNLKMSTNPPGNLENFRICSLHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNREHTNAFAGGSELARCSMPGCSSERAACNIKFYNFPKETKTLIKWCQNARLPVYCREPRHFCGRHFEERCFGKFRLKPWAVPTLHLGTPFGKIHDNPGVLYLEEKKCCLSHCRRTRSSDFNLSLYRFPRDETMLQRWCYNLRLEPEVYRGKNHKICSAHFVKEALGLRKLSPGAVPTLNLGHNDTFNIYENELYTPPPPPPPSTAKMYNFHNTPLNYPPVSPTNYTTSSASTSTPNLYDSIDVCFVPSCKRNRNIDNVTLHTLPRRPEHLEKWCHNLKLDAASLHKAVRICSAHFESFCIGGCMRPFAVPTLNLGHDDPDIYRNPEVIKKLNIRETCCVQICRRNRDRDHANLHRFPSKPIPLEKWCQNLRKPVPDGSRLFNDAICEMHFEDRCLRNKRLERWAVPTLNLGFPDVVHELPTEAEVAELYAKPSAPNIGDEEGECCVHTCKRNPQVDDIKLYRMPEDAEVLAKWSHNLQLDVDVLPVLRICNLHFESHCITKKLHTWAIPTLNLANNVENLFENPEPVLVINKKIKIKKERVYKYHLNLNMVKTWLPRCCLSHCGKTRSQDNVQLFRLPTLHRNMLAKWCHNIQLPMVGSSHRRICSAHFEPHVLTKKCPMHKSVPTLNLNTPPGYKIYQNPPQLKLAKMRLQRVCIIASCRKTRVDGVQLFRYPHNRGFLRKWGHNTRQSTSQAVRAQYRVCSSHFEPHSFGSRRLCPGAIPTLNLDPEVKDPFPNEAQSFAEMQCCVNGCDAIKETENVRLFKFPCDDEDLLWKWCNNLKMNPVDCQGVRICNRHFEADCIGSKLLFKWSIPTLALGHEDADIELIANPKPDERYVSEVLFKCCVPTCGKTRKYDEAQMNSFPKNSKSFRRWKHNLKLDYLDFKKRENYKICNDHFEDICVGKTRLNNGAIPTLNLGHDDTSDLYQVKPEQIQSSLFGRQKSAVVAGDPSLANDEGNGDEEVDEDDHNSETILPDSFTEVDAKCSYPSCTASKTVLKETYDLPSVKHKQLFELWCQNMQVDKNELLAEFEKSSPKVCGLHFMQIYDATLEATSILVKDYPELDTALQALNLANERCCNSLILRSVQCSVRGCVTSQLSDHKLFQFPYSRELIEKWSFNTGIEVDEHRRYLNKVCSLHFEDNCLTETQRLRSWSIPTLHLNHDEPDKIHQNPNLLSLERRLLGPVALKCCVANCGRERTDDDPSIKLFNFPTDEALLAKWAYNLRLEREQCPLFKLCEQHFERNCYGNTRLRSWALPTLNLGHNEADIHLNAEVRAEEDEPKVEKVKIKKSLDSIKCCVPSCQKSRLSHGVRLLDFPTGPVMVKKWCHNLQLPLVVTKKHPRICNTHFHKRCFNDKQLFSWAIPTMQLGHSDVIYDNPKLVQPKLVESKLSSNSFTSICALSHCGNKRTAENNLRTFGFPKSEELLEKWCDNLKLTPEDCKGRVCVEHFDVEVMGNRKLKFGAVPTLNLGHEAELKYTNDFLIQNIKQTLNSSTSTEQDNSFEESQTFEKSFDLCATDDEEETEEVETDVELKTKLVATDNEESEVKSNETVASSKFDKETTFDKVKELTKKNKDISKVITHSLRDKPVNNCAPICCLKHCRKEKTPGHHLTTFGFPKDKELLRQWCDNLKLRLEDCVGRVCIEHFNLNMITKNHRLLPGTVPTLNLGHNNILQHANSNDTDFKDEAQSTTLVNKDKNKTPQSTNKEKNVTQQSTANLNEDDNPEIINSDTNSSSYYAEKLSHSVFRICCLKHCRRKKPTDQNWHSFGFPKNEDLLKKWCDNLRLPIEYTTITGRRVCDAHFEPSTMGKTKLAPDALPTLNLTPDDDKPIHNNSKFTTDYVIQGNKSCSVADCESTTNYEYPALFGFPKHSELSKKWCDNLKLELKWYTRYRICKKHFEARSIASKRLRPWAIPTLNLGHSEAPLHRNPTKLKKKLGVKSNASQQRKNPMSRCCIKTCGCDAVELHDHFNFPKRGAILRKWAQNTKINAYIATRREYRVCRRHFEANCFTEDDRLKINAVPTLHLGLDSSDSAALKVDVDDESTTSLLWCSLPTCKRSVTIDRVRIYPFPKDHELLKKWCNNLQLSAETCLDYKICNLHFEANCITKMKLHELAVPTLMLGHSKESVALYQNTVLSDKPLLTFNVTCCVPGCGKSKQEDGIRMNSFPKNRTLYMTWMHNLKQKSCSKVINTYKVCDDHFEDACKGKMHLKSGAIPTLKLGHSDENIFRNNLKLLNNSGKRIKKQVDEATPKALKCVVTDCNVDTVLNMKLFALPVNEELRRSWCAFLKVDLVPQTVTNKSKLCALHFMQAYHGVANNIEQQETSSETTAPPDDKLNNNYEKLCNTSFIKRFNCCVDGCNTNYTHNIKLYQFPTNQELCEKWQQNTKTIVDINRRHLYRVCGLHFEDACSGNARLRAGAVPTLRLNHNDDIFPHLTRDVDAGIKLYTFPKLEDLYQKWVHNIRMDADTCRNARICTLHFEKRVVGKKLFKFAIPTLDLGHNETDIFPNPRMKKKKMQSEQCSLPNCGKIRRLHHVRLFSFPSDDSDMLEKWRHNLNNVNQLENGKICNEHFEAGCIRLKKLQPWALPTLNLGHSDMIYENPQFKPTEKPKLKLSRHDESQSTSSVHHSRDMLHSCCIPTCHHKSLTEDTAEAVQLFGFPKISWLREKWIQNTRTSIAEALNSKICSEHFEPHTLGKTKPRPWAIPTLELGHTDKIFENPKQLHKYHPEEKECGEIKYVRSNYCAIISCLKSKKDGVHLYKYPRKGLLLQKWAENCKHYKHQAARYCFQICSEHFEENCFNGFRLNFGSIPTLKLGHNDEDIHRSEMQRNDVEKNESTQSLQVEAQHCGVPNCKHTKSADNVKFFKFPENAEVLTKWCHNLKLSSEDCHQLRICNMHFESRCFGGGRLLLRAIPTLLLGHMDIDIFENPDSFEKPEVIVRCCVAKCGKSKSENQVQLSQFPKLRSLIDKWLHNLKLPAKSEVMHNYKVCSEHFESYCYEHGRLKFGSIPTLKLGHEDDEIYHMNQEMLLCKSKRKAAASTNADTSLTETNDTKCCYSECIELEKTKVKGFALPQLETLKREWLRSIGLSNNKSEELRLCAIHFKLTYDKNLMHLKNVQESFENVDDAHKSSITACLDKLSSDYTFVSNMSRIRSLTCSVPTCNNSQIISSIKLFQFPYNRELMEKWCHNTRITFDDDRRYIFKVCADHFESDCYAESSKRIKNWAIPTLKLPHDDSVEIFNNPDYSISMKCCISSCENAKLEEKLPSTTMLLFQFPRDTEMLNKWLYNTQLTSRKAFNARVCALHFEKYCINKRLRSWAVPTLLLGHETPDLFQNPTSKIDSLTEEVPNEWKTQDDIEAIGDETLNDTTQSIEYPVSSRKSKRNATSMLKDMMEVESIERAISPAAQKVSSNHCRIIGCKSHTAQKGITMHKFPLHDNVFQMWAHNTQCKAEKKSLWKYRICNRHFASECWLKHQPPRFRHGTMPTLYLGPNRPTMIYENNFELDGGIKKRLKKSARTDEENNPADTSSLNEQSLLPLENAEGEVFTDDCSDMNLETNEVVELGDESVDSIKKVVMDRKLTTTSPLPKISKTTRHCRVVGCWSFVGAEGITMHKFPKIEKDFQKWVHNTQLKADINFRWKFRICGRHFEPDCFFPSAKPRFRMGTMPTLNLGPNRPPKIYENVFIASKTKTTKTKELNLRTLQADAEDDNEHWEMNEDEVEDDLTHNETLNESEIENSISLKPAEQVLNVTLEVNRIEAVTTPMTTLENSNCCRIIGCSSHKGQKDIMMHKFPINMDAFQKWAHNTQLRFDKNLRWKFRICSRHFEPNCWLPSRRTRFRVGTMPTLHLGPNRPATIHENDFIATGNELSMEAVDTTTNDGVSTQNELSLLSLQDDDNDNVTDNLDLELTAQSSTNDKKSVRSSSSHLICAIATCGRRLDPENMRLHKLPLNGILQRKWMYNCRLEPNVTGDSNFHTRICSRHFEQKCYYGTKLQLRFGSLPTLHLGHDDVNIYQNIHTNRGGAHIDDDDDEVEDDRDTETTATVTCCVRNCVRSHYRDDETRFFRFPQSKKVLFKWLFNLKMEYDAARPWDYNICELHFVEDAFDSANNRLHEYAVPTLRLGRKVKTNELRSMSVPPVANIATMQLGIESSTSANTVHNYNEIKSGYKKCSLIHCQKSQKKDQIKTYGFPKTIDQQEKWEHNLRIKYDAVRPWKYLICSDHFEKQCFQENRKYINQIFKWALPTLNLGDNTPTALFTNENAREQYEAQCAPKGKSKSVKMAKPTTTTVKDAPGFSEIEENDDKDDDDDVGVLAAEEEDEEEEKEEEVEAEQEGDEEEEELIVPMVKRMRLDLNYDNSNASESVGSLLQLSDGGGSSSNFMPGTYKYEATHCCLPHCKRARTSDIRLYRPPTAPDLLSKWEHNLRMSYTGRSVNRTLICSEHFEPHLVNKRIKQLTRGAIPTLNLGHSHTSDLYKNSFENLVPKQSTTLSPTVQKIRESPQAPRPPPRTAIRCHVAGCTLRQAFDIFPFPDNQNFIKIWVEQTRIAYVPTKHDEIGVCGAHFEPACFNKVVKGKLRNTAIPTLHLPEQPKPISMIGTAPPKPLPSRILQQVVSSSVSAVSSSSAATAIACSVPQCHNTSAMHSIKLFSKFPEDFELFTKWCFNLKIDPRKYVDGAYQVCSSHFEPSCIGGHSLRVWAVPTLHLGHSSGEIHKVERPAEAELKCCLPHCGRKKSLDGVNFYSFPKGDTYKAWCQILNIDEGLYRIKDKKICSAHFTGDSFMGALLKPGVKPMLLLGPKATTAAAVAAVPKEKRLGAAIFVNKCIVRTCHATQHLYKFPDNRNLCVKWCHNLKIDYDKKLSKNPNFKLCPKHFEPSCMRQGRLHTEAVPTLQLAHADPNIFQNTASFSTQTQTTAITSAAKGAPSTPSYDDNSSLRTSVSTINTLTMDADISADMDIEDDTYYAEFEARQMLPQSTFVVGDNVVAAGGGSSGVIADVIDLDDDDDEDDDDEMAAWHSDGAGVDDEDEDDMLLQLD

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-