Basic Information

Gene Symbol
-
Assembly
None
Location
JAPVRH010000132.1:1-19847[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 40 6.8e-16 1.2e-12 48.7 4.8 1 86 446 518 446 519 0.86
2 40 3.8e-14 6.5e-11 43.1 7.0 1 87 546 615 546 615 0.82
3 40 1.9e-16 3.3e-13 50.5 0.3 1 87 637 709 637 709 0.85
4 40 2e-13 3.3e-10 40.8 1.7 1 87 802 871 802 871 0.81
5 40 1.5e-13 2.6e-10 41.2 5.3 1 86 895 966 895 967 0.80
6 40 9.6e-14 1.6e-10 41.8 1.9 1 87 1003 1073 1003 1073 0.78
7 40 1e-10 1.7e-07 32.2 2.4 1 85 1119 1187 1119 1189 0.73
8 40 2.3e-15 3.8e-12 47.1 2.1 1 87 1214 1280 1214 1280 0.82
9 40 2.4e-12 4.1e-09 37.4 0.3 1 86 1301 1370 1301 1371 0.81
10 40 9.6e-14 1.6e-10 41.8 1.5 1 87 1398 1470 1398 1470 0.86
11 40 0.00091 1.5 9.9 0.0 1 58 1521 1571 1521 1589 0.80
12 40 2.5e-13 4.3e-10 40.5 0.3 1 86 1617 1695 1617 1696 0.83
13 40 8.8e-13 1.5e-09 38.8 1.2 1 86 1722 1791 1722 1792 0.80
14 40 2.4e-12 4e-09 37.4 4.9 1 85 1867 1936 1867 1938 0.81
15 40 8.8e-15 1.5e-11 45.2 1.5 1 87 1960 2029 1960 2029 0.83
16 40 1.6e-10 2.6e-07 31.6 0.4 1 86 2398 2468 2398 2469 0.77
17 40 1.2e-15 2e-12 47.9 0.4 1 86 2516 2591 2516 2592 0.80
18 40 6.9e-05 0.12 13.5 0.0 1 60 2629 2688 2629 2708 0.76
19 40 1.3e-12 2.2e-09 38.2 4.5 1 87 2728 2798 2728 2798 0.82
20 40 5e-13 8.5e-10 39.5 3.2 1 86 2825 2911 2825 2912 0.80
21 40 1.1e-10 1.8e-07 32.1 2.5 1 86 2942 3010 2942 3011 0.78
22 40 1.8e-13 3.1e-10 40.9 5.1 1 87 3034 3103 3034 3103 0.82
23 40 1.8e-11 3e-08 34.6 0.3 1 86 3154 3220 3154 3221 0.79
24 40 3.1e-14 5.2e-11 43.4 1.4 1 87 3262 3334 3262 3334 0.81
25 40 1.7e-12 2.8e-09 37.9 0.2 1 86 3363 3434 3363 3435 0.78
26 40 4.7e-14 7.8e-11 42.9 2.7 1 86 3459 3529 3459 3530 0.80
27 40 0.0016 2.7 9.1 0.0 1 45 3570 3611 3570 3620 0.81
28 40 2.3 3.9e+03 -1.0 0.0 46 79 3646 3671 3633 3677 0.60
29 40 9.6e-16 1.6e-12 48.3 2.5 1 86 3703 3779 3703 3780 0.84
30 40 9.5e-14 1.6e-10 41.9 0.4 1 86 3817 3892 3817 3893 0.79
31 40 7.1e-13 1.2e-09 39.1 1.2 1 87 4093 4164 4093 4164 0.80
32 40 4.4e-13 7.4e-10 39.7 0.9 1 87 4251 4322 4251 4322 0.82
33 40 4.6e-14 7.8e-11 42.9 0.6 1 86 4403 4479 4403 4480 0.83
34 40 4e-13 6.8e-10 39.9 0.4 1 86 4528 4597 4528 4598 0.80
35 40 1.2e-13 2.1e-10 41.5 4.0 1 86 4661 4735 4661 4736 0.82
36 40 7.7e-14 1.3e-10 42.2 0.4 1 87 4899 4968 4899 4968 0.81
37 40 6.9e-11 1.2e-07 32.7 1.1 1 87 5015 5084 5015 5084 0.82
38 40 1.5e-12 2.5e-09 38.0 0.9 1 86 5127 5199 5127 5200 0.79
39 40 7.3e-12 1.2e-08 35.8 0.4 1 86 5221 5291 5221 5292 0.76
40 40 6.6e-11 1.1e-07 32.8 2.9 1 86 5315 5383 5315 5384 0.83

Sequence Information

Coding Sequence
CCGCTTAACTTTCCGCGACGCAAAGTTCAAACAGAACGTTCCGACACATTGCCGATTTGCCAACGCTGCAAACAGGTCTTCTTCAAAAAGCAAAGCTACAGCAAACACGTTTCTCAAAGTCTCTGCGAAATTGTCGAATATGACTTCAAGTGTTCTATATGCCCCATGTCGTTCACATCCAGCGAGGAGATGCAAACACACGAACAACTACATCGCGAGAATATGTATTTCTGTCACAAATACtgtggcaaatattttgacactATCGAACTGTGCGAGGTGCACGAGTATATGCAGCATGAATATGTCAATTATATCTGTAATGTTTGTTCGGCGGGCTTTGCCAGCCGCGATTTACTATTCGCACATATGCCGCAACACCGTAATCAACCGCGCTATGATTGCCCGGTATGCCGTTTGTGGTTTCACACAGCCATACAATTGCATCAACACCGTTTACAAGCGCCATATTTTTGCGGGAAGTTTTACCGACGCGCCGGTGGAGCAGcatCTACCATGCCTGTTGGTGGCGCACCATTTGGACCGGTGCCGCCAACACATTCTACTAATTATAATCTACAAGACTGTTCGATGGGCATTATAGAAAtgccTAACAGCAATAGCTTCTCTTCATATCTACAAAATCGCGCCTTTCACCATCAACATCCAGCTCCGCCGTTGCCACCGCAGCAATTGCATCCACATCTCgcgcgacaacaacaacaacaacatctgcaacgtcaacagcagcaattacatctgcaacaacaacatccacAATTTCCAACACAACCGTTACACGGAGcgccacaacaacagccaACACCCGCTCAGCCAGATTTTATGCGCCGCAGCAGCATTACTGGTGCCACCGGCAGCGCCACAACAGAATTTCAAATGCCACAAATTAAAACCGAAATAAAAGTGGAACCAGACCTTTATGGCACATCAGATTATCCGCTGCAGACCCCACTGCCGCCACCACCTTTACCGCCACCGCCGTTAAGCGCACAACAACAGTCTGCGTTGGTCTCACCATCACGTCAGCGTCGCTTTGGTGATTTCGCAAGCGAGTCCTTTGGCGCTGTTGGTGCAAGCGGCGACACTTCCCTTGCATTTGGCACCCACAATAACAGCGGTGGAAGCAATAACCACAGTaatgatttttcaaatagcAACGCTTCGAATCTGCAACGCAACATGAgcgccaacaataacaatgacaaCGGCGGCAGTCATAAGCAGACATCATTTCCTGTTGGCAATTTTCATTTCCCCACCACACCATCGGTTTCGCGTACGCGCGTGGTCGATACCGGTGAGGATGGTGCAGTTTGTTGTGTGCCACATTGCGGCGTCACTAAACAGTCCAGTCCCACGCTGCAATTCTTTACATTCCCTAAAGACGAAAAGTATTTGCACCAATGGTTACATAATCTCAAAATGTTTCCTGAACCCGATTCCACGTACAGCCAATATCGCATTTGTAGTCTACATTTTCCCAAGCGTTGTATTAATCGCTATTCGTTGTGCTATTGGGCAGTGCCCACATTCAATTTGGGTCACGACGATGTCGCGAATTTATATCAGAATCGTGAAATCACCAACACATTTACGATTGGTGATCGAGCACAGTGCAGCATGCCCGGCTGTCCAAGTCAGCGTGGCGAgagtaattgtaaattttacaatttcccCAATGACATGAAGACGCGCATCAAGTGGTGTCAAAATGCGCGCCTGCCAGTGCACAGTCGTGAGCCGCGTCATTTTTGTAGTCGTCACTTTGAGGATCGCTGCTTTGGCAAATTTAGGCTGAAGCCTTGGGCTGTGCCCACACTACATTTGGGCACACCGTATGGTAGAATACACGATAATCCGGGAGTGTTCTATTTGGAGGAGAAAAAATGCTGTCTGCCACATTGTAAGCGCACGCGTTCGTCCGATTTTAATCTATCGCTATATCGTTTTCCACGCGATGAAGTGTTATTGCGACGCTGGTGTTACAACTTGCGCCTTGATCCGGCAATATATCGTGGTAAGAATCACAAAATTTGCAGCGCACACTTTATTAAGGAGGCGCTGGGTTTGCGCAAACTGTCACcagGCGCCGTGCCAACTCTGAATTTGGGTCACAACGACCGCTTCAATATCTATGAGAATGAGTTGCCAACACCACCGGTAGTACCGCCGTCTAAAATGTTCAATTTCCATAACGTTTCCGCTCCACCGTCGGTCTACAATAGTCATCGGCACAGCGTTGCGTCTAGCAGCAGCCACAGCGAACGCTTGTACCCTAATCCAGTATCGAAACTCAAATTTAGCATATCGAATATCACAGCGGGTGATGTAAGCGCTATGAGCGGCAGCGCCACACAAAATATGTTAGATTCATTGGAAGTCTGCTTTGTGCCAAGTTGTAAGCGTAATCGCAATATAGATAACGTCACACTGCATACCATACCGCGTCGTCCGGAGCAGATGCGCAAGTGGTGTCACAATCTGAAAATCGGCATTGAAACACTGCATAAGGGTGTGCGTATTTGCAGCGCGCATTTCGAGCCATATTGCATTGGCGGCTGTATGCGTCCATTTGCAGTGCCCACATTGAATTTGGGTCATGACGATCCGAATATCTATCGCAATCCGgatgttattaaaaaactgAATATACGCGAAACATGTTGCGTGCAAGACTGCAAGCGTAATCGTGATCGTGATCATGCCAATCTACATCGTTTTCCATCGAATTTCGAAATGCTCACGAAATGGTGTGAAAATTTGCTGAAACCGGTGCCCGATGGCACTAAACTCTTCAACGATGCCATATGTGAGGTACATTTCGAAGAGCGTTGCATACGCAATAAACGTTTGGAGAAGTGGGCCGTGCCCACGGTGAAACTAGGTCACAGCGAGGAGCCGAAGCATCGTTTGCCAACGGATGAAGAGATTGCTGAGCAGTGGCCAAAACCGGTAATGCCCAATAAGGGTATAGAAGAGGGTGAATGCTGTGTATCGACATGTCGACGTGATCCGAAAATCGACGATGTTAAAATCTATCGCACACCTGAAGATCCTGATTTGTTGGCGAAGTGGGCGCATAACTTACAAACGGAAACGGAAGATCTTACCAGCTTGCGCATCTGTAATCTACATTTTGAAGAGCATTGCTTTAGCAAGAAGCGTCGTCTACACTATTGGGCCTTACCTACACTCAACTTGGCTAATAATGTAGAGCAATTATATGAGAATCCCGAACCGCTTGTGCCACAGGTGATCATAAAATCGGAGACGAAGCGCGAACGTCGGCGTATGCGTGATTCCAACGAGCCACTGAAGCCGTGGACGCCACGCTGCTGCCTGTCGCACTGCCGCAAAAGACGTGATACGGATCACGTACAGCTTTTCCGTTTCCCAGTACTCAATCGGCCCATGTTGGCTAAGTGGTGCCACAATTTGCAGCTACCGCTGGTCGGTAATGGACATCGTCGTTTGTGCTCCACACATTTCGAGCCGCGTGTGCTCTCGAAACGTTGTCCCATACCGATGTCAGTGCCCACGTTGGATCTGAATACACCATCAGGTTATAAAATCTATATGAATCCGGCGCGTCTAAAAGCCGTCAAGTTGCAGCAAGTCTGCTGCATTTCATCCTGTAGTCGCACACGTGCCGACGGCGTACAACTCTTCCGCTTTCCGCACAGTCGCAGCGTGTTGCGCAAGTGGTGTCACAATACGCGTCAACACGCACGGGGTGAATATCGCGTGTGTTCGCGTCACTTTGAACCACATGCATTCGGCGCTAAACGTCTAATGCCCGGCGCAATACCCACTTTGAATTTGGATCCGGAAGTTGAGGATGTGTATGCGAATGAAGCGCAGAGCTTCGCTGAAACGCAATGCGTGGTAAGCGGATGCGTCGCCAGCAAGGAGTTAGATGGGGTACGACTCTTCAAGTTCCCAAGCGATGATGTCGATTTACTATGGAAGTGGtgtaacaatttgaaaatgaatccGCTAGACTGTCGCGGCGTGCGTATCTGTAATCGACACTTCGAACCCGAATGCGTCGGTCCAAAGATGCTCTACAAGTGGGCTGTACCGACATTGGCTTTGGGTCATAACGATGCTGACATCAAGTTGGTGTCTGTGCCGCCGCCAGAGGCGCGTTACAGCGATGTGGTGACGAAGTGTTGTGTGCCCACCTGCGGCAAGTCACGTAAATATGATGACGCACAAATGAACAGCTTCCCGAAGAATTTGCGTCTATTCCGCTGCTGGAAGCACAATCTGAAATTGGACTTTTTAGACTTTAAAGATcgcgaaaaatataaaatatgcagtGATCACTTTGAACCAGTTTGTCTGGGTAAACTGCGTCTAAATTATGGCGCTATACCCACACTCAATCTGGGGCATGAATACACTGATGATTTGTATCCCGTAGATCCCGCGCAGATACAAGTGTCGCTCTTCGGCAAAATGCGTCCACGCATTGTGAGCGAAAAAGACGCATCTGAATTTGAACTGACTGGTTGTGTTAGTAAAGATGACATAAAATGCTCTTATCCCAATTGTACTGCAACGAAAATGCTGCTAAGCGAACCTTACGACATGCCAGCTGCCTTGCCGTTACATGCGCTTTGGTGCGCACAAATGAAAGTTGACGCGGAGGCGCTCAGCGAAACGCCAAAGCTATGCGGCCTGCATTTCATAAAACTCTATAAATCTAGTTTAGATGCGGCTAACGCACTGGCGGAGGGTGACAACGACTTGAGTGACGCCATAAAAACTCTAACCACGacttatgaaaaatgttgcgtatcACCAATTGTTTGCAGCGCGCAGTGTTGCGTTGCCGGTTGTAGCAGTAATCAGATTAGTGGCAGTCGTACAGCACAACGCCTCTACTTATTCCCTACTGCAGGCGGTAGTAGCATggaattacttgaaaaatggTGTCACAATGTGGATGTCGCTGCATCGGACCTCAGTCGATTTACGCACAAAGTGTGCGCGCGTCATTTCGAAGCACAATGTATTGGGCCAACGCAACGCTTGCGCTCATGGGCAATACCGACCTTGCAACTAAAACTCAAACCAGAACATAAAATACATGCCATACCCGAATTGGGCAGCGCAGCACGTCCGCATGAAGCGCGTTGTTGTGTAGCCACGTGCGCGCGTGAGCGCGGTGATTTCGAAACAATGCGTCTCTTTAGTTTTCCAATTATCGATGAAGTGCTCGAAAAGTGGTTGCACAACCTGCAATTGACACGCAATCAGTGCGCACGCTTACGCATTTGTGAACAACATTTCGAGCCGCGTTGTATTGTGAAGGCGCGCTTGCTGCGTTGGGCAGTACCAACATTGTTGCTCGAGCGCAATGCAGAGGAGTTGCTGCAGAACGAACCTCCTATGCAAGCCGCAGTGCCTGCCGCACCAAACGTTGATGAAGTCGATGCAATTTATGCAGCAAGTAACAACGAAGGGGATGAAGATCTGCGCgatgctgatgatgatgaagatgACAATGGTAGTGGCGAATTGGTTGCAGTGCCTAACGTTAAGATGAAAAAATCACTTGGTATTGTTAAGTGTTGTGTATCGAGTTGCCGACGCACTCGCCTACAACACGGCGTGCGACTCTTCCAGTTTCCCACAAGTCATCAAATCTTACGCAAATGGTGTCACAATCTAAAAATATCGCTCGAAAAGGCCTCAGATCCAGTCTTACGTATTTGTAGTCTACACTTTCATAAACGTTGCATTGATGGCAAAGAGTTACGTCCCTGGGCCAAGCCGACCAATGCGCTCGGCCATAGCGGTCCCATCTATGAAAATCCCAAAAATCTACCAGGTGTCTTTCTACCCAAATGCTGTTTGGCACATTGCCGTCAGCGGCGCACGCTCGAAAATGACTTACGTACATTTGGCTTCCCCAAAGATGAGACGCTTATGCGTAAGTGGTGTGCCAATTTACGTATGGAGCCACGCGCTAATCACGCACGCATCTGCATGTCACACTTTCCGCCCGAAGCTATGGGCAATAAGAAATTGCGTGCCAATGCAGTGCCTACGCTAAATTTGGGGCACAACGAACCGCTAGAATACGACAACGCACTATTGATCGCGACAGCGATGAAGGCAAGAAATcCGTTGgaggaaaagaaaattttgacttCAGATGCCAGTGATGTTGAATTGGGAAATAGCGCCGAATACGATGATGAAATGGATGACGATgacaatgatgatgatgactaTCGCATCGATGATGAAGAAGAGGAAAACGACTTCTACGCCTCCGCCGGCGTTGGCAATAGTGATGAAGAGGAGGAGGAAACTGTTGAGGACTTTACACCAAGCATTAGTGCGGCAACCGGTGACGACGATGAGCTATATAGTAACGCGGAAGCGCGTGACGTCGACAATTTCATAGTCGATTCGGCAGATGAGGATGAGGAGGGTGCCGAAATAGAAGATGAGGATGAAGAAGATGTCTATGGTAATgaggatgatgatgatgaggaCGACAGTGATGAGAGCGATGAAAGTGTTACCGAAAATGTATACAACAATGCTGCAGAGGAAGATGACAGCGACAGTGAAAGCGAAATTAACGACAACGCTGACAGCAAAGAGGCCGAGGtcgatgatgatgatgtggAAATGCTTATTGATGCCGATGATGATGAAGATGATAAAAGCAGTTACAATGTTTGCGATCCACTCGACTTTGTTGAATGTGTTAACCAGAGCGATGATACCCAACAACATGAGGCAGCAGCCACCTCAACAAAAATCAGCAATATACGCTTTGAAGGAAAGAATGTGCCCAAACGTCCAATATCGTCGACACCGACAAATTATGAAGACGACTCCAATACAAACTACAGTAATGCTGATGAAACGACACGTTTCACAGCAACCACATCGAATTCCGCATCAGCAGTCACATCGTCGGCATCGGCCACTGCATCGAATGCACATCATTGTGGCGCTGAGAAAATAAGTCGTTCGGTTTTTCGGCTTTGTTGCCTCAGACATCGTAGGAAGAAGAAAGCGCCACCAGATCCACCACCTGATATGCGGAACGCGCGCATCAGCGACGAAGCCCCAGCAGCACATCGGACGTTGATGAATAGTATTACACGCGCTAGTGCTCGGCGTCATCAGCAGCGCGCCTACTGTGCTGTGCGTCGTTGCGGACGCGCCGCTGGTGCAGCTGTAACTTTATATCGTTTTCCATTAGTCGGTAGTCGCTATTGTCGCCGTTGGTGTGCAAAGCTAAAAGTTAAAATGTCGCACGCATCGCGCCTGCGCATTTGTCAGCGACATTTTTCGTATAGTTTAGTAGATCGTCGTCGTAGGAGCTTGCGTTTCGGCGCAATACCAACACGCAATTTGCATAACACGACAGGACAATTCAAGCGCAATCCACTTTATGgcttacataaaaatacaaaacaattgACAGCAACGTCAGAGCACGACACGCTTTCAACTCAAGCAACCGCCGCGCAATTGAACGTTTATAATCGTTGTTGCGTGCCACATTGCGGTAAATCACATCAAGTTGATGGCGTAACACTCTTCCGTTTTCCAAAACTACGCTCGCTCTATCTCCAATGGGCTACGAATTTGCGCTTAATGCCAACTATGCGTTTAGTGCAAGTCTACAAAGTTTGCAGCGATCACTTTGAGCGCAATTGCCTCAGTTATCAGCGTAATGATCGCGCAAAACTGAAATATGGCTCTGTGCCAACACTGAAATTGGGTCATAATGATACGTTGAAAATCTGTAAGCAGAACACATTGTCGCCACAAAAGCGTCGCGGAGTTGGCAGGCGCAAATGGCGCCCACAAAAGGGTGTCAACGAATGCGCTGTGCATGATTGTAAAGTAGCGCAATTTCTACAAATGCAACTATTCGCGTTGCCTATGGCACTGAAGCTGCAAGAGCGCTGGTGTaattacttcaaattaaaCTTCACTGGTGTGTCGAAAGATGCCACCGAATTCTTTCAGAATGTGCGTCTATGCGCTTTGCACTACATGGAAGGCTATCAAATGGCAACGTACAGCGACGGTGCACGTAAAGGCTCATCTGCTGCATTGGACGAGTTGGAGGCGAACTATGCGCGCATAACTAGTTCCACGCGCATACAAATGCTAAAATGTTGTGTGCCCAATTGTTCGACGAAATTCACGGACAACTTGCGTTTAACCGCGTTTCCCAGTGCGGAAGAGGTGCGCGCTAAATGGCAACACAATACACAGGTGTCATTCAGTCCATCGCATcgttatttgtataaagtatGCGCATTGCATTTTGAAGAGCGTTGCTTCGCCAGAAAACGCCTCTTTATGTGGGCTATACCGACATTGCATCTTCCGAAACCACAAAGTCAAGATCCAGCACATAAGCTATTTGAAAATCCGAACGTTGATGTCGTCGGCAGCGTCCAATGTTGTATTGAAGATTGTGCCACAAATAAAGTCAAAGAGGAGCCGGCAGCAATTGATGACGAGAAAGTGTGTAAAGCTGCAGTACGCTTTTGGCATTTTCCACAAGACGAAGCATTGCGCGATAAATGGTGTCATAATTTGGGACTCGGCGCACACACTAACGAAATCAGCCATACCAGTCGGCGCTGGCGCCTATGCGGTCGTCATTTCGAGCCGTTTTGCATCGGTAAAACGTTGCGCAGCTGGGCTGTGCCCACACTGTATTTACCCAAACCGGTTAAACATGCGAAGACCGGCAAGCgctctacatatatttaccaaaatcCGGACAGCGCGGCGCTCTACTATCGCTGCTGCATTAAGACGTGCCATCAGCTATGCGATCTCGATGCTGGCATACGCTTATATGCTTTCCCCAAAAAGGATACCATGCTACAAAAATGGGCACATAACATACGCATGCCCGCGGTGAAGTGTCGCTACGCGCGCATCTGTACGTTGCACTTCGAAGCGCAATGCTTGCGTCCGCAAATGCAATCGTGGGCGCTACCAACAATCGATTTAGGACATGATGAGGCAGATATTTTCCGCGTGCCGAAGGTAAAGTTAATGGTTACAAGTGAACGCTGTTGCCTACCGCTTTGCAGCAAGCGGCGCAGTCGTGACAATGTGcatcttttcacttttccgCGTGACAAATGTGTACTGGACAAATGGTGGCACAATTTGGCGATTGGTGTGCAGGACGTAAAACGGCGTCTCATATGTGAATCGCATTTTGAGCCGCGTTGCATTAGCAAACGCCGTCTTAAACGTTGGGCCATACCAACACTTAATTTGGGACATACAAACGAGACACTCGAAAATCCAACACCAGCTGAAGTGTTGGCATATGAAAATAACACATCAATGCGACGCTCACAAACACCCTCGAAAATAGCGCGCTCCAAATCGACGCAACCAACGCCTACAAGCACTTTACAGAAATGTTCAATCGGCGGTTGTGCGCGTGGTGCCGAAAGCAGCGAGCTTTATCGTTTTCCAAAACCGGATTGGTTACGGAAGAAGTGGTGTGATAACACGCGTATAAGCGAAGAGGCTGCCAAGCTGGCGAAAATTTGCACGCGCCATTTCGAATCACACGTTATGGGTAATCGCAAACCGCGTCCATGGGCGTTACCAACTTTGGAATTGGGCACTTTTGAAAACGGCGGGCCATTGCCAGCTGTGCACGCAAACCCCAAACAGCTGTCGCGGATTCATCCCGAAGAGCATGAATGGGGCGAGTTGCGCTATGTGCGCGCTAATCATTGCTCCATAATCTCTTGcatgaaatcgaaaaaagaTGGCGTAACACTTTTTAACTATCCCACGAATCGTCAAATGTTGCAGAAATGGGCTGAGAATTGTCGCCATTATCCATATCAGGCGAAACGGTATCGTTTCCAACTGTGCGGCGCACATTTCACTGCTGATTGTTTCAAACGCGAAAGCACGCGTTTACGCAAAGGCGCAGTGCCTACCTTAAATTTGGGACATGATGATGCGCAGATACACCAGAGTGAATTCGAAAGCATTAGCGCCATAAaaattgaaactaaaatactaaaaagttGTAGCGTACCACAGTGTGGGCGCACGAATTTGCACGATGGCGTGCGGCTCTTCAAGTTTCCTTACGAACGCGCTGAGACGCTAGAAAAGTGGTGCCACAATTTACGCATGAACGCATCGGATTGCCACAATGCGCTAATCTGTAATATGCATTTTGAACCGCGTTGCGTTGGTGGCGGTCAGCGCGGTTTGCTTTTACGCGCCATACCCACATTGCTGCTGGGACACAACGACACCGATATTCTACACAATCCAGAAACATTTGAACGCCCTGAGAAGGTAATTAGTTGCTGCGTGCCTGACTGTATTAACACTAAACAAACGGCTGGCATAACCCTGAGTGCTTTTCCAAAGCTACGTAGCCACTTTGAGAAGTGGGCGCACAATCTTCAGCTGCCTGTCAGCACCACCGTCTGGCATACATTCAAAGTATGCAGCGCACACTTTGAAAGTTACTGCTATGAGCATAGACGCATCAAGGTGGGCGCAATGCCGACACTAAAGTTGGGACACAGCAATGCTATTGACTTGTACACCGTGAGTGAAGAGGCAATGAGTCATGCATTTAAGCGCAAACGCGTCGCGCCCAAAAATGACGCATCGAAAGAGCCACACGAAGCGTGTTGTTATCCGGAATGTAGAGAAATGGAATTGCGTTTGACAAATCAAGTCTTCGAGTTTCCAGAATTGGATGCCATACGACGCGCTTGGCACGAGAGTATTGGTTTGGGCGCTGAAGAACAGGACGAAGAAGCTCGCACGCCAATAGAAGTTGAAGCACAAAATACAGGGCGTGTCACTGAAATCCCACAAGAGGacgctgttgctgctgcactCGAAAAAGTTACGAAAAAATCACCTAAATTATGTCCAATGCATTTCAAATTACTCTACATAGAGCATGCCGCACTGCTGGATACACTAAAGTCAGACAGCCCACCGGAGACCCTGCATGTTTTACAAAAGCTTGATGAGACCTATACAAGAGTTTGCGATTTATCTTGCGTGCGTCGCATTAGTTGCGCTGTGCAAGGTTGCAATTCAAATTATCTTACCACTAAGTCGctcaaattctttaaatttcccGATAATGCAGAGATGCGTGCCAAATGGTGTCACAATACACAAGCCACCGTCGATCCGGATCGCTTGTATTGCtacaaaatatgtgaattaCATTTTGAAGCAATCTGCGCCTCGCAGCTGGCAAAAAAAATGCAACGTTTGAAATTCTGGGCACTACCGACTCTACAATTGCCGCCACGCGCTGCAGGTACACCCGAAATACATGCACTACCAGCGCCCGATACGCTAGCGGAAACGAAACGCGCCTCCATGCTACTACACACGGCACTCAATAAATGTTGCATACCCAGTTGCGTGTACGCTAAGGCGCCTGCGGAGCGTGCGGTGAGCGCTGATAGTGAAATacagtttttcaattttcccaACGATGCTGAATTGCTCTACAAATGGGTATATAATACACAAGTCAGTATGGTCGCGGCGACGAATGCGCGCATATGCTCGGTACACTTTGAAAAACATTGTATTAATAAACGTTTACGCATGTATGCAGTGCCCACGTTGCTGTTGGGGCACCAGAAAACCGATATCTATAAGAATCCGTCGGATAAGAGAGTGGCGGCGGAAGCGTTGGAGAGCAAGCCACAGAAGTTGCCGAAATATTACGAAGATAATATTGCTGATGACGTTGCAGTTGAAGAAAATGAGGAGTATGATGATGAATTGTTAGCAGAAGAGGCGTCGTcgaaatttgaattgaaaaaatcacaaattactaaatttaaaactgaagCTTTTGACGAACGTGAGCCGCGCTTAGCGCGCAAGTCTGCCGCGTTCGATGAGAAGAAGCTTGACAGAGCGCTTCTAGAGCCAATGTTAGTAGTAAAAACCGAATTGAAAGAGCGCAAAGAAACCACAACGCCCACAACAAGTAAACcagcaacacaacaaaaatctTATCACCAgccttttattataaatattaaacaagaaaAGGATATCGAAGAAAGCTATACTGCGGAAGGCattgaaaatcaaatgaagCAAGGCTTACTCGATATGTTTAATAGTTTTGGCGATACTGGCGGTGAGCAAGAGCCCGATGACATTAACGAGCTATGCGACGAAGCTACACAAATGGAACGCGACACCGCCCGTCATTGTCGCATACACGGTTGCAATAGTTACGCGCGCAATGCGGGTGTCACGCTCTTCAAATTTCCCTTCCCACTCGATCAATTCCGCAAATGGTTACATAACACACAATTAGAGGTGGACTACACACGCCGATGGCGTTATCGCATTTGTCATCGTCACTTCGAACCGATTTGTATGCAATTCCGTAAGTTACCGCCCGGCACCATGCCCACACTGAATTTGGGTCCCAAGCGCCCAGcgcatatatatgaaaatgaattcgATGTAAATagtttaagcaaatataaaacgaaaacgcaacaacaaaacctaGCAACGAGCACCAATGTGCTGAGTGAATTAAATGACGAAGCCGACACGAATAGCTCCTTCGCTGTGCCCACACACAATGACAATATAGAAGCCGGCGTTGATTATAATGATGACTATGTTGATTTTGTAGACAATACGCCGATAGACCGCGACACATCGCGACATTGTCGCATACCCGATTGCAATAGTCACGCTAAAGATCCCGGCGTAACGCTCTTTAAATTCCCAATGTCTGAGTATCTCTTTCACAAATGGCTCTACAATACCCAACTTAAGGTGGATTATACGCGGCGTTGGCGATATCGCATTTGTCAGCGCCACTTCGAACCGATCTGCATGCAGTTCCGTAAGCTACCGCCCGGCACAATGCCAACGCTGAATTTGGGTCCTTCCCGTCCGGAGCGCATCTATGAGAATAGCTTCGACATAAATCAtctgaagaaatttaaaactaaattgaaaaataacacaacagcaacaacaagtggcGGTCCTGCGGCAACAACAAGCCCTGCTATAATGCAATCAAGCCAAGTCGATTACACTGATGACTTAGAAACGAACAGCTCCTATGCCATTGATGCGAATTCCAGTAATCCCGCACAACTGCCGCTACTCTCATGTACCGTACGCAATTGCACGAGTCAATATCATGCCCTACACGAGGGCttacatttgcataaattgcCGACGAATATTATGTTGCGCGAGAAATGGATTTATAACTGTCGCTTCTCGGAAGAGACGCTCATCAGCATGAGTTCGCGCATACGCATCTGTTCGCTACACTTTACACCTAATTGTTATTATGGCGTTAAGCGACAATTGAAATTCGGCTCAGTGCCCACTTTGCGTTTGGGTCATACAGATCCCAACATTTATCCGCATGCTTTTAGTAATGAAAACGAAGCACAATTGGTGGGTCAAAGGCACAATGCACAGCACAATTGGCGCTCCAGCCGATTGCAGTCGTCTACCGATGCCAATCAAGATATTTGCTGCCTCATCAATTGCCGGCACAGTAGACGTGAATACACGCGTCATTTCGCTTTTCCCACTGAGCGGGAGCTACTCGATCAATGGCTCAATGCGCTCGGTATGGAATTTAATAGTTCACGTCCAGACgactataaaatatgtgaatggCATTTTAAGGCCTCCGATTTCGATGGAGAAGTGCTCCGCGCCGACGCAGTGCCTACGCGCAATTTGAAGATAGACGATGCCAGTCAAACGGATGATGACgatgataatgatgatgaAGATGATGAGCTGGGTTGGAATACAAACGAAGACCCGCTGGATGAACGCCCCTCAACATCTGCAGCAGCCGCTGCCGCAGCGTCAACTATCACCCCCGATGGctacaacaaattaattccCGGCTCGCGACGATGTTGCCTGGCGCACTGCCGCAAACAACTATTCCAAGATAACGTGCGCACCTTCAAATTCCCCACAATGCATGAGCAATTCGAGAAATGGGTGCATAATTTAGGCATTAAATACGATGGTGATGCGCCTTGGCGCTATCAAATATGCAGCGAACACTTTGAAGACCAATGCATTATACATTATGAGAACAAAGCCAAGTTGTTTAGATGGGCTGTGCCCACTATGAAATTGGGCAAACATGCACCAGCCATACTCTTCACGAATGAGAATCCtaaaaaactacaacaaagaGATGCAGAATATGAGCGCGTCTATAGAAATTCGACTGCGGGCGGCTATGCGGACAACATAGAGAATGAAGAGACTATGGACACGACAAATGATGAGTTGAGTTCCACGGCACAACTCGCAGATGTACCAAAACATATGGCTTATCAGCAAGAAGATATGGATTTGCTGGCGCCAATTGAGCGGCCGCCACACAAAATGACCAGCGCAACGAAGCGCAACAATTATGCATATACCGCGAGCAATGAGGACGGGGATGATTACtataatgatgatgatgatgatgcttACGATATGGGTGATGGTGAAAATTCACTATTGAATGTAATAAGGGAAGAGAAACCGAGCGCTGTGAAAGAAGGTACGCCTGCTTCCTCATTCTTCTCATTGCAATTGGTACGCGGCGGTTCTGCTAAAGTGCGCGCCTGCTGCCTGCCGCATTGTGGACGCACGCGTGACTCCGGTGTGCGTTTATTCCGCTTCCCTACAGAACCGGTCTTCCTCAAACGCTGGGAATACAATTTACGCGTACTCTTCAACGAGTCACAGCGCAATACGCACTTAATATGTAGCGCGCATTTCGAGCGCGGACAATATAACAAACGTTTAGTGGTAGATGCCATACCGACATTAAATTTGGGACACAACAGCAACGATATTTATCGTAATGGACAATATGAGCCAACCAGAATGCATAAGCGCGGATTGACCGCATCACCACCGCGCATACCATCGTCAAACAGCATGCCGTTGAGTAGCCACAAACCACTGCATTGTAATGTACCCGCTTGCGCTGATGCACAAAGCAAGCGTCGCTTATTTCCCTTCCCCAGTAATCATGTGTTCGTGAAAATTTGGACCGAGCGCACACAAATCCCTTATGACGCGCGTCAACATGCCGAGTTGCGCGTTTGTGAGCTCCACTTTGAGACCGATTGCTTTAGCGCTCAAGGTCTAAACAATAATGCTGTGCCCACTTTGTTTTTGCCTGCACCAAATGCACTGCCAACGCCTGTTTCAATTATACCagccgcaacaacaaatgctctAACGCCTATTGGCCGCCCAGTGGCAGGGCCCACTGCTATGCCAAAGCTGACTGCAATTGCTTGTAGTGTTACCAATTGCGGCAATAGTACCGCTACACGTAcggatttgaaaatattctcgAAATTCCCGGATGATTTTGAATTATTCACCAAATGGtgtttcaatttgaaaatcgaTCCGCGCACCTATGTGGATGGCAGTTATAATGTGTGCAGCGAACATTTCGAACCATTCTGCATTGGCGGTCATAGTTTGCGCGTGTGGGCTGTGCCCACGCTGTGTCTGGGTCACAACAGCAAACTCATACACAATGTCGAGCGTCCTGCGGAGATGGAAACGAAATGTTGTTTACCCCATTGTGGACGCAAGAAGAGCAAAGATAGCGTGGAGCTCTATAACTTCCCTAAAGGTGATATCTACCGGCAGTGGtgtcaaatattgaaaatcgaTGAAGGACTCTACCGCAACAGCGAAAAGAAGATCTGTAGTGCACATTTCCGTGCAGACTGTTTCAATAGTAACGGTACACTGCGACTGGGCGCACGCCCAACGCTCCTACTACGCAATCGCACCGCCACAGCGGCCGCACATATGCTCAAACCGCCCGCGCCATATCGCAGCAAGTGCATCGTGCGCATCTGCCATGAAATGCAACAGCTCTACAGCTTCCCGGCGCAACGGAATCTCTGCACGAAATGGTGTCACAATCTGAAAATCGACTACTACCCAAAACTACATGAGAATATGAACTTCAAAATCTGTCGACGTCACTTCGAGCCGAATTGTTTGCTGAGCGGCGGCAAACTGCATGCCGAGGCGGTGCCGACGGTGCAGTTGGGACACAGCGATGTCAATATCTATCAAAATCTAGTGGGCATAAAGCAGAGCGCCAGCACTCCCAGCTATGATGACAACAGCAGTTTACGAACGAGCGTCAGCACGGTACACACATGGCTGATGGATGTCGATGCGGAAACAAATGCTGCTAACAATATACCTGCTGCGAGTAGTGGCGATGCGggtgctgttgttggtggtgcTGTTGGAGATGCTGATGATGACATGGTGCGCATGGAATACGAGCCACCAGTCGATTTGGAGCCAACAGTTGTGACTGAGAATATTGCCGATGATAATCTGGACTTGACCGACAGCGCCTATATGCAAATGGAGGATGACACCTACTATGCGGACTTTGAAGAGCAACGTTTGTTGCCGCAAAGCAGCACTTTTATAGCGGCTGAGAGCGCGGAAGTGATTGATTTGGACGCTGTGGATGCTGTGCAAGAGCAATTTCCCAATTGGTCGCAGGACGATGCTGTGCTGGTGGATGATGATGACGAGGAGGAGGATGATGATGCGCTGTTGTGGCcgttgaattaa
Protein Sequence
PLNFPRRKVQTERSDTLPICQRCKQVFFKKQSYSKHVSQSLCEIVEYDFKCSICPMSFTSSEEMQTHEQLHRENMYFCHKYCGKYFDTIELCEVHEYMQHEYVNYICNVCSAGFASRDLLFAHMPQHRNQPRYDCPVCRLWFHTAIQLHQHRLQAPYFCGKFYRRAGGAASTMPVGGAPFGPVPPTHSTNYNLQDCSMGIIEMPNSNSFSSYLQNRAFHHQHPAPPLPPQQLHPHLARQQQQQHLQRQQQQLHLQQQHPQFPTQPLHGAPQQQPTPAQPDFMRRSSITGATGSATTEFQMPQIKTEIKVEPDLYGTSDYPLQTPLPPPPLPPPPLSAQQQSALVSPSRQRRFGDFASESFGAVGASGDTSLAFGTHNNSGGSNNHSNDFSNSNASNLQRNMSANNNNDNGGSHKQTSFPVGNFHFPTTPSVSRTRVVDTGEDGAVCCVPHCGVTKQSSPTLQFFTFPKDEKYLHQWLHNLKMFPEPDSTYSQYRICSLHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNREITNTFTIGDRAQCSMPGCPSQRGESNCKFYNFPNDMKTRIKWCQNARLPVHSREPRHFCSRHFEDRCFGKFRLKPWAVPTLHLGTPYGRIHDNPGVFYLEEKKCCLPHCKRTRSSDFNLSLYRFPRDEVLLRRWCYNLRLDPAIYRGKNHKICSAHFIKEALGLRKLSPGAVPTLNLGHNDRFNIYENELPTPPVVPPSKMFNFHNVSAPPSVYNSHRHSVASSSSHSERLYPNPVSKLKFSISNITAGDVSAMSGSATQNMLDSLEVCFVPSCKRNRNIDNVTLHTIPRRPEQMRKWCHNLKIGIETLHKGVRICSAHFEPYCIGGCMRPFAVPTLNLGHDDPNIYRNPDVIKKLNIRETCCVQDCKRNRDRDHANLHRFPSNFEMLTKWCENLLKPVPDGTKLFNDAICEVHFEERCIRNKRLEKWAVPTVKLGHSEEPKHRLPTDEEIAEQWPKPVMPNKGIEEGECCVSTCRRDPKIDDVKIYRTPEDPDLLAKWAHNLQTETEDLTSLRICNLHFEEHCFSKKRRLHYWALPTLNLANNVEQLYENPEPLVPQVIIKSETKRERRRMRDSNEPLKPWTPRCCLSHCRKRRDTDHVQLFRFPVLNRPMLAKWCHNLQLPLVGNGHRRLCSTHFEPRVLSKRCPIPMSVPTLDLNTPSGYKIYMNPARLKAVKLQQVCCISSCSRTRADGVQLFRFPHSRSVLRKWCHNTRQHARGEYRVCSRHFEPHAFGAKRLMPGAIPTLNLDPEVEDVYANEAQSFAETQCVVSGCVASKELDGVRLFKFPSDDVDLLWKWCNNLKMNPLDCRGVRICNRHFEPECVGPKMLYKWAVPTLALGHNDADIKLVSVPPPEARYSDVVTKCCVPTCGKSRKYDDAQMNSFPKNLRLFRCWKHNLKLDFLDFKDREKYKICSDHFEPVCLGKLRLNYGAIPTLNLGHEYTDDLYPVDPAQIQVSLFGKMRPRIVSEKDASEFELTGCVSKDDIKCSYPNCTATKMLLSEPYDMPAALPLHALWCAQMKVDAEALSETPKLCGLHFIKLYKSSLDAANALAEGDNDLSDAIKTLTTTYEKCCVSPIVCSAQCCVAGCSSNQISGSRTAQRLYLFPTAGGSSMELLEKWCHNVDVAASDLSRFTHKVCARHFEAQCIGPTQRLRSWAIPTLQLKLKPEHKIHAIPELGSAARPHEARCCVATCARERGDFETMRLFSFPIIDEVLEKWLHNLQLTRNQCARLRICEQHFEPRCIVKARLLRWAVPTLLLERNAEELLQNEPPMQAAVPAAPNVDEVDAIYAASNNEGDEDLRDADDDEDDNGSGELVAVPNVKMKKSLGIVKCCVSSCRRTRLQHGVRLFQFPTSHQILRKWCHNLKISLEKASDPVLRICSLHFHKRCIDGKELRPWAKPTNALGHSGPIYENPKNLPGVFLPKCCLAHCRQRRTLENDLRTFGFPKDETLMRKWCANLRMEPRANHARICMSHFPPEAMGNKKLRANAVPTLNLGHNEPLEYDNALLIATAMKARNPLEEKKILTSDASDVELGNSAEYDDEMDDDDNDDDDYRIDDEEEENDFYASAGVGNSDEEEEETVEDFTPSISAATGDDDELYSNAEARDVDNFIVDSADEDEEGAEIEDEDEEDVYGNEDDDDEDDSDESDESVTENVYNNAAEEDDSDSESEINDNADSKEAEVDDDDVEMLIDADDDEDDKSSYNVCDPLDFVECVNQSDDTQQHEAAATSTKISNIRFEGKNVPKRPISSTPTNYEDDSNTNYSNADETTRFTATTSNSASAVTSSASATASNAHHCGAEKISRSVFRLCCLRHRRKKKAPPDPPPDMRNARISDEAPAAHRTLMNSITRASARRHQQRAYCAVRRCGRAAGAAVTLYRFPLVGSRYCRRWCAKLKVKMSHASRLRICQRHFSYSLVDRRRRSLRFGAIPTRNLHNTTGQFKRNPLYGLHKNTKQLTATSEHDTLSTQATAAQLNVYNRCCVPHCGKSHQVDGVTLFRFPKLRSLYLQWATNLRLMPTMRLVQVYKVCSDHFERNCLSYQRNDRAKLKYGSVPTLKLGHNDTLKICKQNTLSPQKRRGVGRRKWRPQKGVNECAVHDCKVAQFLQMQLFALPMALKLQERWCNYFKLNFTGVSKDATEFFQNVRLCALHYMEGYQMATYSDGARKGSSAALDELEANYARITSSTRIQMLKCCVPNCSTKFTDNLRLTAFPSAEEVRAKWQHNTQVSFSPSHRYLYKVCALHFEERCFARKRLFMWAIPTLHLPKPQSQDPAHKLFENPNVDVVGSVQCCIEDCATNKVKEEPAAIDDEKVCKAAVRFWHFPQDEALRDKWCHNLGLGAHTNEISHTSRRWRLCGRHFEPFCIGKTLRSWAVPTLYLPKPVKHAKTGKRSTYIYQNPDSAALYYRCCIKTCHQLCDLDAGIRLYAFPKKDTMLQKWAHNIRMPAVKCRYARICTLHFEAQCLRPQMQSWALPTIDLGHDEADIFRVPKVKLMVTSERCCLPLCSKRRSRDNVHLFTFPRDKCVLDKWWHNLAIGVQDVKRRLICESHFEPRCISKRRLKRWAIPTLNLGHTNETLENPTPAEVLAYENNTSMRRSQTPSKIARSKSTQPTPTSTLQKCSIGGCARGAESSELYRFPKPDWLRKKWCDNTRISEEAAKLAKICTRHFESHVMGNRKPRPWALPTLELGTFENGGPLPAVHANPKQLSRIHPEEHEWGELRYVRANHCSIISCMKSKKDGVTLFNYPTNRQMLQKWAENCRHYPYQAKRYRFQLCGAHFTADCFKRESTRLRKGAVPTLNLGHDDAQIHQSEFESISAIKIETKILKSCSVPQCGRTNLHDGVRLFKFPYERAETLEKWCHNLRMNASDCHNALICNMHFEPRCVGGGQRGLLLRAIPTLLLGHNDTDILHNPETFERPEKVISCCVPDCINTKQTAGITLSAFPKLRSHFEKWAHNLQLPVSTTVWHTFKVCSAHFESYCYEHRRIKVGAMPTLKLGHSNAIDLYTVSEEAMSHAFKRKRVAPKNDASKEPHEACCYPECREMELRLTNQVFEFPELDAIRRAWHESIGLGAEEQDEEARTPIEVEAQNTGRVTEIPQEDAVAAALEKVTKKSPKLCPMHFKLLYIEHAALLDTLKSDSPPETLHVLQKLDETYTRVCDLSCVRRISCAVQGCNSNYLTTKSLKFFKFPDNAEMRAKWCHNTQATVDPDRLYCYKICELHFEAICASQLAKKMQRLKFWALPTLQLPPRAAGTPEIHALPAPDTLAETKRASMLLHTALNKCCIPSCVYAKAPAERAVSADSEIQFFNFPNDAELLYKWVYNTQVSMVAATNARICSVHFEKHCINKRLRMYAVPTLLLGHQKTDIYKNPSDKRVAAEALESKPQKLPKYYEDNIADDVAVEENEEYDDELLAEEASSKFELKKSQITKFKTEAFDEREPRLARKSAAFDEKKLDRALLEPMLVVKTELKERKETTTPTTSKPATQQKSYHQPFIINIKQEKDIEESYTAEGIENQMKQGLLDMFNSFGDTGGEQEPDDINELCDEATQMERDTARHCRIHGCNSYARNAGVTLFKFPFPLDQFRKWLHNTQLEVDYTRRWRYRICHRHFEPICMQFRKLPPGTMPTLNLGPKRPAHIYENEFDVNSLSKYKTKTQQQNLATSTNVLSELNDEADTNSSFAVPTHNDNIEAGVDYNDDYVDFVDNTPIDRDTSRHCRIPDCNSHAKDPGVTLFKFPMSEYLFHKWLYNTQLKVDYTRRWRYRICQRHFEPICMQFRKLPPGTMPTLNLGPSRPERIYENSFDINHLKKFKTKLKNNTTATTSGGPAATTSPAIMQSSQVDYTDDLETNSSYAIDANSSNPAQLPLLSCTVRNCTSQYHALHEGLHLHKLPTNIMLREKWIYNCRFSEETLISMSSRIRICSLHFTPNCYYGVKRQLKFGSVPTLRLGHTDPNIYPHAFSNENEAQLVGQRHNAQHNWRSSRLQSSTDANQDICCLINCRHSRREYTRHFAFPTERELLDQWLNALGMEFNSSRPDDYKICEWHFKASDFDGEVLRADAVPTRNLKIDDASQTDDDDDNDDEDDELGWNTNEDPLDERPSTSAAAAAAASTITPDGYNKLIPGSRRCCLAHCRKQLFQDNVRTFKFPTMHEQFEKWVHNLGIKYDGDAPWRYQICSEHFEDQCIIHYENKAKLFRWAVPTMKLGKHAPAILFTNENPKKLQQRDAEYERVYRNSTAGGYADNIENEETMDTTNDELSSTAQLADVPKHMAYQQEDMDLLAPIERPPHKMTSATKRNNYAYTASNEDGDDYYNDDDDDAYDMGDGENSLLNVIREEKPSAVKEGTPASSFFSLQLVRGGSAKVRACCLPHCGRTRDSGVRLFRFPTEPVFLKRWEYNLRVLFNESQRNTHLICSAHFERGQYNKRLVVDAIPTLNLGHNSNDIYRNGQYEPTRMHKRGLTASPPRIPSSNSMPLSSHKPLHCNVPACADAQSKRRLFPFPSNHVFVKIWTERTQIPYDARQHAELRVCELHFETDCFSAQGLNNNAVPTLFLPAPNALPTPVSIIPAATTNALTPIGRPVAGPTAMPKLTAIACSVTNCGNSTATRTDLKIFSKFPDDFELFTKWCFNLKIDPRTYVDGSYNVCSEHFEPFCIGGHSLRVWAVPTLCLGHNSKLIHNVERPAEMETKCCLPHCGRKKSKDSVELYNFPKGDIYRQWCQILKIDEGLYRNSEKKICSAHFRADCFNSNGTLRLGARPTLLLRNRTATAAAHMLKPPAPYRSKCIVRICHEMQQLYSFPAQRNLCTKWCHNLKIDYYPKLHENMNFKICRRHFEPNCLLSGGKLHAEAVPTVQLGHSDVNIYQNLVGIKQSASTPSYDDNSSLRTSVSTVHTWLMDVDAETNAANNIPAASSGDAGAVVGGAVGDADDDMVRMEYEPPVDLEPTVVTENIADDNLDLTDSAYMQMEDDTYYADFEEQRLLPQSSTFIAAESAEVIDLDAVDAVQEQFPNWSQDDAVLVDDDDEEEDDDALLWPLN

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2