Basic Information

Gene Symbol
-
Assembly
GCA_000789215.2
Location
NW:90465-146297[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 38 7.5e-16 6.7e-13 48.7 4.8 1 86 844 916 844 917 0.86
2 38 2.4e-14 2.1e-11 43.9 6.5 1 87 944 1013 944 1013 0.82
3 38 2.1e-16 1.9e-13 50.5 0.3 1 87 1035 1107 1035 1107 0.85
4 38 2.2e-13 1.9e-10 40.8 1.7 1 87 1200 1269 1200 1269 0.81
5 38 1.7e-13 1.5e-10 41.2 5.3 1 86 1293 1364 1293 1365 0.80
6 38 3.7e-13 3.3e-10 40.1 1.3 1 87 1401 1471 1401 1471 0.77
7 38 1.4e-11 1.2e-08 35.0 2.9 1 85 1517 1585 1517 1587 0.75
8 38 2.5e-15 2.2e-12 47.0 2.1 1 87 1612 1678 1612 1678 0.82
9 38 1.8e-12 1.6e-09 37.9 0.4 1 86 1699 1768 1699 1769 0.81
10 38 2.6e-13 2.3e-10 40.6 1.9 1 87 1796 1868 1796 1868 0.86
11 38 0.001 0.9 9.8 0.0 1 58 1919 1969 1919 1995 0.81
12 38 1.7e-12 1.5e-09 38.0 0.1 1 86 2015 2093 2015 2094 0.83
13 38 7.7e-13 6.8e-10 39.1 1.4 1 86 2120 2189 2120 2190 0.80
14 38 1.1e-12 9.6e-10 38.6 4.9 1 85 2265 2334 2265 2336 0.81
15 38 9.1e-15 8e-12 45.2 1.3 1 87 2358 2427 2358 2427 0.83
16 38 4.2e-11 3.7e-08 33.5 0.7 1 86 2800 2870 2800 2871 0.78
17 38 1.1e-15 9.8e-13 48.2 0.3 1 86 2921 2996 2921 2997 0.80
18 38 3.6e-05 0.032 14.5 0.0 1 61 3034 3094 3034 3114 0.76
19 38 6.7e-13 6e-10 39.2 4.4 1 87 3133 3203 3133 3203 0.82
20 38 1.6e-14 1.4e-11 44.4 5.0 1 87 3230 3319 3230 3319 0.80
21 38 7.1e-12 6.3e-09 36.0 1.6 1 86 3349 3417 3349 3418 0.79
22 38 2.8e-14 2.4e-11 43.7 4.9 1 87 3441 3510 3441 3510 0.82
23 38 2.9e-11 2.6e-08 34.0 1.5 1 86 3562 3628 3562 3629 0.80
24 38 6.6e-13 5.8e-10 39.3 1.2 1 87 3670 3742 3670 3742 0.80
25 38 3.5e-12 3.1e-09 37.0 0.2 1 86 3771 3842 3771 3843 0.77
26 38 2.4e-15 2.1e-12 47.1 1.3 1 86 3867 3937 3867 3938 0.80
27 38 0.00073 0.65 10.3 0.1 1 54 3978 4023 3978 4026 0.79
28 38 2.6e-16 2.3e-13 50.2 2.1 1 86 4117 4193 4117 4194 0.82
29 38 4.1e-13 3.6e-10 39.9 0.1 1 86 4231 4306 4231 4307 0.79
30 38 6.9e-13 6.2e-10 39.2 0.8 1 87 4515 4586 4515 4586 0.80
31 38 4.8e-13 4.3e-10 39.7 0.9 1 87 4676 4747 4676 4747 0.82
32 38 1.7e-13 1.5e-10 41.1 0.7 1 86 4829 4905 4829 4906 0.83
33 38 4.4e-13 3.9e-10 39.8 0.4 1 86 4954 5023 4954 5024 0.80
34 38 1e-13 9.2e-11 41.8 4.3 1 86 5087 5161 5087 5162 0.82
35 38 7.7e-14 6.8e-11 42.3 0.4 1 87 5324 5393 5324 5393 0.81
36 38 1.2e-10 1e-07 32.1 0.8 1 87 5440 5509 5440 5509 0.82
37 38 9e-13 8e-10 38.8 0.8 1 86 5552 5624 5552 5625 0.79
38 38 0.068 60 4.0 0.9 1 25 5646 5667 5646 5667 0.78

Sequence Information

Coding Sequence
ATGTcgcaaaatcaaaacaaacaacatttacaacaacagcaacaacttgaGCAtcaccaactacaacaacaacaacagcaacaccaccaccatcaACAACAGCTCTTacaccatcatcatcagcagcaacaacagcaacgacaatggcatcaacaatatcaacagcaacaacaacagctacatcatcaccatcagcagcaacaacaacaagcacaatatgctgctgctgccgcagCACATGCACATGTTGCTGCGGCCACCGGTATGAGGAGTGGTAATGGCATGCCTGGCATGTTTGGAAATACACTAGGTAGGGGTGGTTGTTTATATGACACGAGTGGTGGTAGCCTCGTTGGTGGTGGGAACGGTGGCGGCGGCGGAGTCGGCGGCTTTGACCTTGATATGGCTGCCCATCATATAGGTAGTGGAAATGCCTCGGCAATTACGGCCACGTCTACAGCACCAGCTGTTGGTGTAGGCATTGCGGGTGGCGGAGTGGGTGGTTATTTGCCTTCACATGTCAATGTTTCTTCTGTCGCCGGTGGCGCGTACAACGCCGGTATGTCTGGCATGCCGACTTCTAGTGAAAGTGGAAGGAGAGCTAGTGTGGGCGGTtgtgcatacacacaggcaTCGCAGTCACAACCACAGCCGCCACCACAGCAaccgcagcagcaacaacagcacacaatTTTGCCACCGGAAAGAATAAAAATGGAGCCACTGGAGCAAATTCTCACACCAACTATCGAAATGGAAgaattgataataaaaaCCGAACCCACAGATGATGCGTACAATAAAGTTAGTACGGTCGATGAAGCCATTGCTACAGGAAGCACAAATATGAATCCTAACGCTGGTTATATGAATTTTCCTAAGCATTTACAACCATATCCtcagcaccagcagcaacaacaacagcaactgttGGAACAGCAAAAACAGCGCCAGAAAttgcaacatcaacaacatgcattgcaacaacaacaacatctgcagcaacaacaacagcaacatctgcagcaccaacaacaccaacatctgcagcagcaacaacaacaacaacgtgagcaacaattgcaacagcaaatGTTATCGCACGATGCCATGGCCGAAGCCGGCTTGTTAACAGTAAAACTGGAACCACAAATTCACATAAAAGAGGAAATGCCGGAAGCGTCACAGAATAAGCCGCTCAACTTTCCACGCCGCAAAGTTCAAACGGAACGTTCCGACACATTGCCAATTTGCCAGCGCTGCAAACAGGTCTTCTTCAAAAAGCAAAGCTACAGCAAACATGTTTCTCAAAGTCTTTGCGAAATTGTCGAATATGATTTCAAGTGTTCTATATGCCCCATGTCGTTCACATCCAGCGAGGAGCTGCAAACGCACGAACAACTACATCgcgaaaatatgtatttctgtcACAAATACTGTGGCAAATATTTTGACACTATCGAACTGTGCGAGGTGCACGAGTACATGCAGCATGAATATGTCAATTATATCTGTAATGTCTGTTCGGCGGGCTTTGCTAGTCGCGATTTGCTATTCGCGCATATGCCACAACATCGCAATCAGCCGCGCTATGATTGCCCGGTATGCCGTTTGTGGTTTCACACAGGCATACAATTGCATCAGCACCGCATACAAGCGCCGTATTTTTGTGGGAAATTTTATCGTCGCGCCGGTGCTGCAGCACCTCCCATGCCTGCTGGTGGCGCACCATTTGGGCCGGTGCCGCCAACCCATTCAGCTAATTATAATCTACAAGACTGCTCCATGGGCATTATAGAAAtgcCCAACAGCAATAGCTTCTCTTCATATCTACAAAATCGCGCTTTTCACCATCAACACCCAGCTCCGCCACATCCGCCGCTGCCACCGCAGCAAATGCATCCGCATctcgcgcaacaacaacaacacctgcaacgtcaacaacagcaattacatCTGCAACAACAGCATCCACAATTTCCAACACAACCGTTACATGCAGCGCCACAACAACCACAGCCAACACCCGCTCAGCCAGATTTTATGCGCCGCAACAGCATTACTGGTGCCACCGGCAGCGCGACAACAGAATTTCAAATGCCACAAattaaaaccgaaataaaagtGGAACCTGATCTTTATGTCGCGCCAGATTATCCGCTCCAGACACCACTGCCGCCACCTCCTTTGCCGCCACCGCCGTTAAGCACCGCACAACAGTCGCCATTGGCCTCACCATCACGCCAGCGTGGTCGCTTTGGCGACTTCACTAATGAGTCCTTTAGCGCAGGTGGCGCAAGCGGTGACACTTCGCACGCATTTGGCACGCACAATCACAGCAGTGGTAGCAATGGCAACAGTAATGATTTTTCCAACAGCAACACTTCGAGTTTacagcacaacaacagtaatGACAATAACGGCAGCAAACATAAGCCGTCAGCATTTCCTGTTGGCAATTTTCACTTCCCCACCACACCAGCGGTGTCGCGCACACGCGTGGTGGACACCGGCGAGGATGGCGCCGTTTGCTGTGTGCCCCATTGCGGCGTCACTAAGCAGTCCAGTCCCACGCTGCAATTTTTCACATTCCCGAAAGATGAAAAGTATTTGCATCAATGGttgcataacctcaaaatgtttCCCGAACCCGATTCCACGTACAGCCAATATCGCATTTGTAGTTTACATTTTCCCAAACGTTGCATTAATCGCTACTCGTTGTGCTATTGGGCAGTGCCCACATTCAATTTGGGTCATGACGATGTTGCGAACTTATATCAGAATCGTGAGATCACCAACACATTTACGGTCGGCGAACGCGCTCAGTGCAGCATGCCCGGTTGTCCGAGTCAGCGTGGTGAAAGTAATTGCAAATTCTACAACTTTCCCAATGACATGAAGACGCGCATAAAGTGGTGTCAGAATGCGCGCCTGCCCGTCAATAGTCGTGAGCCGCGTCATTTTTGCAGTCGTCACTTTGAAGATCGCTGCTTTGGCAAGTTTCGGCTGAAGCCATGGGCTGTGCCCACTCTACATTTGGGCACACCATATGGCAGAATACACGATAATCCGGGCGTATTCTATTTGGAAGAGAAGAAATGTTGTCTGCCACATTGCAAGCGTACGCGCTCATCCGATTTCAATCTATCGTTATATCGTTTTCCACGCGATGAAGTGCTGCTGCGCCGCTGGTGCTATAACTTGCGCCTGGATCCGGCAATATATCGTGGCAAGAATCACAAAATTTGTAGCGCGCACTTTATCAAAGAGGCGCTGGGCTTGCGAAAACTGTCGCCAGGCGCTGTGCCAACCTTGAATTTGGGGCACAACGACCGCTTCAATATCTATGAGAATGAGCTGCCAACACCACCGGTGGTACCACCgtctaaaatgtttaatttccaTAACGTCTCCGCACCGCCACCGGTCTACAGCAGTCACCGCCACAGCATAGCGTCGAGCAGCAGTCACAGCGAACGCTTGTATCCCAATCCAGTATCGAAGCTCAAATTTAGCATATCGAATATAACGGCCGGTGATGTGAGCTCCATGAGCGGCAGCGCGACACAAAATATCTTGGATTCATTGGAAGTCTGCTTTGTGCCAAGTTGTAAGCGTAATCGCAATATAGACAACGTCACATTACACACCATACCGCGTCGTCCCGAGCAGATGCGCAAGTGGTGTCACAATCTGAAGATCGGCATCGAAACACTGCATAAGGGTGTGCGTATTTGTAGCGCGCACTTCGAGCCATACTGCATTGGCGGCTGTATGCGTCCATTTGCAGTGCCCACCTTGAATTTGGGCCACGACGATCCGAATATCTATCGCAATCCAGATGTTATTAAAAAACTGAATATACGCGAAACATGTTGTGTGCAAGATTGCAAACGCAACCGTGATCGCGATCATGCTAATCTGCATCGTTTCCCGTCGAATTTCGAAATGCTCACTAAATGGTGTGAGAATCTGTTGAAGCCAGTGCCCGATGGCACTAAACTCTTCAATGACGCCATATGTGAGGTACATTTCGAAGAGCGTTGCATACGTAATAAACGTTTGGAGAAGTGGGCCGTCCCCACAGTGAAACTTGGTCATGCCGAAGAGCCGAAACATCGTTTGCCAACGGATGAAGAGATTGCCGAACAGTGGCCGAAGCCGCTAATGCCAAATAAGGGTATAGAAGAGGGTGAATGTTGTGTGTCCACATGCCGACGTGATCCGAAGATCGACGATGTTAAACTCTATCGCACACCCGAAGATCCCGAGTTGTTGGCGAAGTGGGCGCATAACTTGCAAACGGAAACGGAAGATCTTACCACATTACGCATTTGTAATTTACACTTTGAAGAGCTTTGCTTTAGCAAAAAGCGTCGCCTACACTATTGGTCATTACCCACACTCAATTTGGCTACGAATGTCGAGCAATTATATGAGAATCCCGAACCGCTAGTACCGCAGGTGATCATAAAGTCGGAGACGAAACGCGAACGTCGCCGCATGCGCGATTCCAATGAGCCATTGAAGCCGTGGACACCACGTTGCTGCCTGCCACACTGCCGCAAACGACGTGATACGGATCACGTGCAACTTTTCCGTTTCCCCGTACTCAATCGACCCATGTTGGCTAAATGGTGCCACAATTTGCAGCTACCGCAAGTCGGTAATGGTCATCGTCGTTTGTGTTCCACACATTTCGAGCCACGCGTCTTGTCGAAACGTTGTCCCATACCGATGTCAGTGCCCACTTTGGATCTGAATACACCACCAGGTTATAAAATCTATATGAATCCGGCGCGTCTAAAAGCCGTCAAGTTACAACAAGTCTGCTGCATTTCATCGTGCAGTCGTACACGTGCTGATGGCGTACAGCTCTTCCGATTTCCGCATAGTCGCAGCGTGCTGCGCAAGTGGTGTCACAATACGCGGCAACATGCACGCGGTGAATATCGCGTATGTTCGCGTCACTTTGAACCGCACGCATTCGGCGCTAAACGTCTAATGCCCGGGGCCATACCCACATTGAATTTGGATCCGGAAGTTGAGGATGTGTATGCGAATGAAGCGCAGAGCTTCGCTGAAACGCAGTGTGTAGTAAGCGGTTGCGTCGCCAGTAAGGAGTTAGATGGCGTACGACTATTCAAATTCCCCAGCGATGATGTCGATTTACTATGGAAGTGGTgtaacaatttgaaaatgaacCCGGTGGACTGTCGTGGCGTACGTATCTGTAATCGGCACTTCGAACCCGAATGCGTCGGTTCAAAGATGCTGTACAAGTGGGCTGTACCAACACTAGCACTGGGCCATAACGATGCCGACATTAAGCTGGTGTCCGTGCCACCGCCAGAGGCACGTTACAGCGATGTGGTGACCAAGTGTTGTGTCCCCACATGCGGCAAGTCACGTAAATTTGATGATGCACAAATGAACAGTTTTCCGAAGAATTTACGCCTATTCCGCTGTTggaaacataacctcaaattggACTTTTTAGACTTTAAGGAtcgtgaaaaatataaaatttgtagtgATCACTTCGAACCGGTTTGTCTGGGCAAACTGCGTCTCAATTATGGCGCTATACCTACACTTAATCTGGGACATGATGACACAGACGATTTGTATCCCGTCGATCCTGCGCAGATACAAGTATCGCTCTTCGGCAAAATGCGTCCACGCATTGCTAGCGATAAGGATGCGGCTGAATTTGAGCTGGCTGGTTGTGTTGGCAAAGATGACATAAAATGCTCCTATCCCAATTGTACTGCAACGAAAATGCTGTTAAGCGAACCTTACGACATGCCAGCCGCCTTGCCGCTACATGCGCTTTGGTGCGCACAAATGAAAGTCGACGCAGAGGCGCTCAGCGAAACGCCAAAACTGTGCGGTCTGCATTTCATAAAGCTCTATAAAGCCACATTAGAGGCGGCTAACGCGTTAGCGGAGGGTGACAACGAACTTAGTGAGGCCATGAAATCGCTGAAcgaaacttatgaaaagtgttgcaCCTCACCAATTGTTTGTAGTGCGCAGTGTTGCGTTGTCGGTTGTGGTAGTAATCAGGTTAGTGGCAGCCGCACTGCACAACGCCTCTATTTATTTCCTACTGCGGTCGGAAGCGGCATAGACTTATTAGAGAAATGGTGTCACAATGTAGATGTTAGCGCAGCGGACCTCAGTCGTTTTACGCACAAAGTATGCGCACGTCATTTCGAAGCACAATGTATTGGCCCAACACAACGGTTGCGCTCCTGGGCCATACCCACTTTGCAGCTGAAACAAAAACCGGAACACAAATTACATACCATACCAGAATTGGGTAGCGCAGCGCGTCCCCATGAAGCGCGTTGCTGTGTAGCCACGTGCGCACGCGAACGCGGTGATTTCGAAACAATGCGTCTTTTCAGTTTTCCCATTATCGATGAGGTGCTCGAAAAGTGGTTGCACAACTTGCAATTGACACGCAATCAGTGCGCACGCTTGCGCATCTGTGAGCAACATTTTGAGCCGCGTTGCATTGTCAAGACGCGCTTGCTGCGCTGGGCCGTACCAACATTGCTGTTGGAGCGCAATGCCGAGGAGCTGCTACAGAACGAACCGCCTGTACAAGCCGCAGCGCCTGCTGCACCAATCGCTGATGAAGTCGATGCAATTGATGCTGCAAGTAATATCGAAGGCGATGAAGATATGCGTGATGttgatgatgaagatgatgacAATAATAGCGGCGAATTGGCGGCAGTGCCTAACGTAAAGATGAAAAAATCACTTGGCATTTTTAAGTGTTGTGTACCGAGCTGCAGACGTACCCGCCTGCAACACGGCGTGCGCCTCTTTCAATTTCCCACAAGTCATCAAATCTTGCGCAAATGGTGTCACAATCTAAAAATTTCGCTCGAAAAGGCTTCAGATCCCGCCTTACGTATTTGCAGTATACACTTTCACAAACGTTGCATTGACGGTAAAGAGTTACGTCCCTGGGCCAAGCCGACCAATGCGCTCGGTCACAGCGGTCCCATCTATGAGAATCCCAAAAATCTACCCGGCGTCTTTTTACCAAAATGCTGTTTGGCACATTGCCGTCAACGGCGCACACTCGAAAATGACTTACGTACATTTGGCTTCCCCAAAGATGAGGCTCTTATGCGCAAGTGGTGTGCAAATTTACGCATGGAGCCACGCGCCAATCACGCACGCATCTGCATGTCACACTTCCCGCCCGAGGCGATGGGCAATAAGAAATTGCGCGCCAATGCGGTGCCTACGCTAAATTTGGGGCACAACGAACCTTTGGAGTATGACAACGCTTTGCTGATCGCGACGGCGATGAAGGCAAGAAATcCGTTAGAGGAGAAGAAAGTTTTGAATTCGGATGCTAGTGACACCGAATTGGGAAACATCGGCGAATACGATGATGAAATGGATGACGAAGAcaatgatgacgatgatgactATCGCATCGATGAAGAAGATGACGAAAACGATTTCTATGCTTCCGCCGGCGTTGGTAATAGCGATGAAGAGGAAGAGGAAACGGTTGAGGACTTTACGCCGAGTATTAGTGCCGCAACCGGTGACGACGATGAGTTGTATAGTAACAACGAAGCGCGTGATGCCGACAATTTCATAGTCGATTCAGCAGATGAAGATGAGGAGGCCGAAGTAGAAGATGAGGATGAAGAAGACATCTACGGTAACGAAGAAGACGATGATGAGGACGATAGTGATGAGAGCGATGAAAGTGTTGccgaaaatatatacaacagtCCTGCAGAGGAGGATGACAGCGACAGTGAAAGCGAAGTTAACGCCATTGACAACAACGCTGACAGCAAAGACGCCGAGgtcgatgatgatgatgtggaAATGCTTATCGATGCCGATGAGGATGAAGATGATAAAAGCAGTTACAATGCCTGCGATCCACTCGACTTTGTGGAATGTGTTAACCAAAGCGATGATACGCAACAAAATGACGCGCCGGCCAACTCAACAAAAGTACGCAATCTCCACAAAGCAGGAAAAAATATGCGTGAGCGTCCAATAACGACGTCGCCGACAAATTATGAAGATGACTCCAATACAAATTACAGTAATGCCGATGAAACGACACGTTTCACTGCAACCACTTCGAATTCCGCATCAGCAACCACATCGTCGGCATCAGCCTCAGCTGCGGCCACGTCCAACGCACCAGCGAGTACGCGTCATTGTGGCGCTGACAAAATAAGTCGTTCGGTTTTTCGGCTTTGTTGCCTCAGACATCGTAGGAAGAAGAAAGCACCACCAGATCCACCACCTGATATGCGATGCGCGCGCGCCAAAGCGCCAGCAGTACATCGGACGTTGATGAATAATATTGGACGCCCTCGTCGACAGCGCGCTTACTGTGCAGTACACCGTTGCGGACGCACCGCCGGTGCTGCTGTCACGCTATATCGTTTTCCATTAAGCGGCAGTCGATACTGTCGTCGTTGGTGCGCAcagttaaaaatcaaaatgacgCACACATCACGCCTGCGCATTTGTCAAAGACACTTCCCCTACAGTTTAGTGGATCGGCGTCGTAGACGCTTGCGTTTTGGCGCAATACCAACACGCAATTTATATAATACCACGGGACAATTCAAGCGTAATCCACTTTATgacttatataaaaatataacacaatCGGCAGCGAACGCGTCAGAGCGCGCGGCTAGCGCGGATACAACCCAAACCCCCACCGCACAATTGAATGTTTATAATCGTTGTTGCGTGCCACATTGCGGCAAATCGCATCAAGTGGATGGCGTAACGCTCTTCCGCTTCCCAAAACTACGCTCACTCTATCTCCAATGGGCTACGAATTTGCGCTTAATGCCAACCATGCGTTTGGTGCAAGTCTACAAAGTTTGTAGCGACCACTTCGAACCCCAATGCCTCAACTATCAACGTAAAGATCGCGCCAATCTGAAATATGGCTCTGTGCCTACACTGAAGTTGGGTCATAACGACACCTCTAAAATCTATAAGCAAAACACGCTACTGCCACAAAAGCGCCGCGGTCTTGGTAGACGCAAATGGTATCCGCAAAAAGGTGTCAATGAATGCGCTGTGCATGATTGTAAGATGGCGCAGTTTCTACAAATGCAACTGTTTGCGCTGCCAGCGACGCTCAAACTGCAAGAGCGCTGGTGTAactatttcaaattaaacttcACCGGCGCATCAAAGGACGCCACCGAATTCTTTGAAAATGTGCGTCTATGTGCTTTGCACTACATGGAGGGTTATCAAATGGCGACTTACAGCGACGATGCGCGCAAAGGCTCATCTGCCGCGTTGGACGAACTCGAGGCGAATTATGCGCGCATAACTAGTTCGACGCGCATACAAATGCTGAAATGTTGTGTGCCCAACTGTTCGACGAAATTCACGGACAACGTGCGTTTGGCCGCGTTTCCCAGCGCGGAAGAGCTGCGTGCTAAATGGCAGCACAATACACAAGTGTCATTTAGTCCATCGCATCGTTATTTGTATAAAGTATGCGCATTGCATTTCGAAGAGCGTTGCTTCGCCAAAAAACGTCTCTTTATGTGGGCTATACCGACATTGCATTTACCGCGACCTCAAAATCAAGATACAGAACATAAGCTGTTTGAAAATCCCATCTTTGACAGCGTGAGTGGCGCCCAATGTTGTATTGAAGATTGCGCAACGAATCAAGTCAAACAGGAGGCGGCGGCAGCGAATGATGATGACAAGGAAGTGTGTAAAGCGGCAGTGCGTTTTTGGCATTTTCCACAAGACGACGCATTACGCGAAAAATGGTGCCATAATTTGGGACTCAACGCGCTAACTAACGAAATCAGCCATACCAGTCGGCGATGGCGCATTTGCAGTCGCCACTTCGAGCAATTCTGCATCGGTAAAACGTTACGCAGCTGGGCTGTGCCCACATTGCATTTACCAAAACCGGTCAAACATGCTAAGAGCGGTAGACGTTCCacatatatttaccaaaatcCTGATAGCGCTGCAGTCTACTATCGCTGCTGCATTAAAACGTGCCATCAGCTGCGCGATCTTGATGCTGGTATACGCCTATATGCATTCCCCAAAAAAGATACAATGCTACAAAAATGGGCGCATAACATACGCATGCCCGCGGAGAAGTGTCGCTACGCGCGCATCTGCACGTTGCACTTCGAAGCGCAATGCTTGCGTCCGCAAATGCAATCGTGGGCGCTACCCACAATCGATTTGGGACATGACGAAGCCGATATTTTCCGCGTGCCGAAGGTGAAGTTAATGGTTACGAACGAACGCTGCTGCCTACCGCATTGCAGCAAACGACGCAGCCGCGATAATGTACATCTTTTCACCTTTCCACGCGACAAAGATGTCTTGGACAAATGGTGGCACAATTTAGCGATTGGCGTACAGGATGTAAAACGACGTCTCATTTGTGAATCGCATTTTGAGCCGCGCTGCATTAGCAAGCGCCGTTTGAAACGTTGGGCCATACCAACATTACATTTGGGACACACAAACGACACGCTCAAAAATCCAACACCAGCTGAAGTGGCAACTTATGAAAGTAACACAACAGCGCGACGCTCACAAACACCCTCGAAAATGACGCGCGCCAAATCAACGCAACCAACAGCGCCTACAAGCACTTTACAGAAATGTTCAATCGCGCATTGCGCACGTGGCGTCGAAAGCAGCGCGCTGTATCGCTTTCCCAAACCAGATTGGTTACGCAAGAAATGGTGTGACAATACGCGCATAGACGAAGAGACCGCCAAACAGGCGAAAATTTGCGCGCGCCATTTCGAACCACATGTTATGGGCAATCGCAAACCGCGTCCATGGGCGTTGCCGACCATGGAATTGGGTGCTGACGAAAACGGCGTGCCATTGCCTGTCGTACATGCAAATCCCAAACAGCTGTCACGTTTTCATCCTGAAGAGCATGAATGGGGCGAGCTGCGTTATGTGCGCGCTAACCATTGTTCTATAATATCCTGcttgaaatcgaaaaaagatGGCGTAACACTTTTCAACTATCCCACGAAACGCCACATGTTGCAGAAATGGGCTGAGAATTGTCGCCATTATCCGTATCAAGCCAAACGCTATCGTTTCCAACTGTGCGGCGCACATTTCACTGCCGATTGTTTCAAACGCGAAGGCACGCGTTTACGCAAAGGCTCAGTGCCTACACTAAATTTGGGACATGATGATGCGCAGATACACCAGAGTGAATTCGAAAGCGTAAGCgccataaaaattgaaatgaaagaaCTAAAAAGTTGCAGCGTACCACAATGTGGACGCACAAATTTGCACGAAGGCGTGCGGCTCTTCAAATTTCCCTACGAACGCGCCGAGACGCTAGAAAAGTGGTGCCATAACTTACGCATGAACGCGGCCGATTGTCGCAACGCGCTAATCTGTAATATGCATTTTGAGTCGCGTTGCGTTGGTGGCGGTCATCGCGGCTTGCTTTTGCGCGCCATACCCACATTACTGCTGGGACACAATGAGAGCGATATTTTCAACAATCCGGAATCATTTGATCGCCCTGAGAAATTAATCAGCTGCTGCGTACCCGGTTGTACTAACACCAAACAAACCGAAGGCATAACGCTAAGCGCTTTTCCGAAGTTGCGTAGCCATTTTGAGAAGTGGGCACACAACCTACAGTTGCCAGTCAGCACCACCGTCTGGCATACGTACAAAGTATGCAGCGCACACTTTGAGAGCTATTGCTATGAGCACGGACGTATCAAGGTGGGCGCAATGCCGACACTAAAGTTGGGACACAGCAACGCAATTGACTTGTACACCGTCAGTGAAGAAACAATGAGTCAAGCATTTAAGCGTAAACGTGTCGGGCCGAAAACGGAGACGCCAAAAGCGCCACACGAAGAGTGTTGTTATCCGGAATGTAGAGAAATGGAATTGCGTTTGACAAATCAACTTTTTGAGTTTCCAAGTTTGGATGATATACGACGAGTTTGGTACAAGAGTGTTGGTTTGAGCGCTGAAGAGCAGGCGGAAGAAGTATGTGTGGCAACTGAAAATGAAACACCAAAAACCGCGCCCGACAATAGTATGCCACAGGATGCTgatgctgctgcagctgctgctccTGCTCTTGCTCCGGTAAAAGTTGCGAAAATATCTCCCAAATTATGTCCAATGCATTTCAAATTACTCTACATTGAGCATGCCGGTTTGCTGGATACACTCAAATCAGACAGCACAGCAGACACGCAGCGTATTTTGCATAAGCTGGACGAGACCTACACAAAAGTGTGCGATATATCGTGCGTGCGTCGCATTAGCTGTGCCGTGCCAGGTTGCAATTCGAATTATCTCACCACCAAGTCGctgaaattcttcaaatttccCGATAATGCCGAGATGCGCGCCAAATGGTGTCACAACACACAAGTCATAATCGATCCGGATCGCTTGTATTGCTACAAAATATGTGAATTACATTTCGAAGCAAGCTGCGCCCCGCAATTGACCAAAAAAATACAACGTTTGAAATCTTGGGCGCTACCCACGTTACAATTGCCGCCACGCGCTGCTGACGCGCCCGATTTATATGCATTACCAGCGCCCGATACGCTAGCCGAAACAAAACGCGCCTCATTGCTACTGAACGCGCCACTCAATAAATGTTGTATAGCCAGTTGTGTGTACGCTAAAGCGCCGGCAGAGCGCGCGCTCAGCGCTGATGGTGAAATacagtttttcaattttcccaACGATGCAGAATTACTCTACAAATGGGTATATAATACACAAATCAGTATGGTCGCGGCTACCAATGCGCGCATCTGCTCGCTACACTTTGAAAAACACTGTATTAATAAACGCTTGCGCATGTATGCGGTGCCAACGCTACTGTTGGGCCACCAGAAAACCGATATCTATAAGAATCCCTCGGATAAGAGAGTAGCGGCGGAGACAATGGAGAGCAAACCGCCCAAGTTGCCCAAGTATTACGAAGACAACGTTGCTGATGACGTTgaagttgaagaaaaagaagaaaatgaggatgatgatgatgatgatgatgatgtgtTGTTATCGGATGTTAAGGCGCAtaaaatgaagttgaaaaaatCCGACATAACTAAATACAAAACTGAAGTGTTTGGCGAAAGTGAACCGCGCACAGTGCGCAAGTCTGCCGCGCACGATGAGAAGAAGCTAGAGCGCGCGTTGCTTGAGCCAATGCTAGTAGTGAAAACCGAATTGAAGGAGCGGAAAGAaatggtaacaacaacaacaacaaataaaccgGCAACGCAGCCCAAACCATATCACCAGcctttcattataaatattaaacaagaaaaGGATATCGAAGAGAACTACACCGCCGAAGGCattgaaaatcaaatgaaacaaGGCTTACTCGATATGTTTAATAGTTTTGGCGACACTGGCGCTGAACAGGAGCCCGATGACATTAACGAGCTGGGCGACGAAGTGACGCAAATGGAACGCGATACCGCGCGCCATTGTCGCATTGTCGGTTGTAATAGCTACGCGCGCAATGCGGGCGTCACGCTGTTCAAATTTCCCTTTCCACTCGACCAATTCCGCAAATGGTTGCATAACACACAATTAGAGGTGGACTATACACGCCGTTGGCGTTATCGCATTTGTCATCGTCACTTCGAACCGATTTGCATGCAATTCCGCAAGCTACCGCCCGGCACCATGCCCACGCTGAATTTGGGTCCTAAGCGTCCCGCgcatatatatgaaaatgaattcGATGTGAATagcttaagtaaatataaaacgaaaccacaacaacaaaacctaaCCACAACAACTAGTAATGTGTTGAGTGACTTAAATGACGAGGCGGACACGAGTAGCTCCTTCGCCGCGCCCACACACCACAACAATGATAATATCGATCCCAGCGCTGACTATAACGATGACTATGCAGATTTTGTAGACAATACGCCGATCGATCGCGATACATCGCGTCATTGTCGCATTCCCGATTGCAATAGTCACGCAAAAGATCCCGGTGTAACGCTCTTCAAATTCCCCATGTCTGAGTATCTCTTTCACAAATGGCTCTACAATACCCAACTTAAGGTAGATTATACGCGCCGTTGGCGCTATCGCATCTGTCAGCGCCACTTCGAACCGATTTGCATGCAATTTCGTAAGCTACCGCCCGGCACCATGCCAACGCTGAATTTGGGTCCTTCGCGTCCAGCGCGCATCTATGAGAATAGTTTCGATATCAATcacttgaaaaaattcaaaactaaattgaaaaatacaactacaacaacaacaacaagtgcagcTATTGTTCCTACATCCACTTCGGCCATAATGCACTCAAGCCATGCCGATTACACCGATGACTTGGAAACGAGCAGCTCGTACGCCGTCGATGCAAATGTCAGCAACACCTCTCAATTGCCGCTACTCTCATGTACCGTACGCCATTGTACGAGTCAATATCACGCGCTACACGAGGGtttacatttgcataaattgccGACGAATATTATGTTGCGCGAGAAATGGATTTACAATTGTCGCTTCTCGGAAGAGACGCTCATCAGCATGAGTTCACGCATACGCATCTGTTCGCTGCACTTTACACCTAACTGCTATTATGGCGTTAAGCGTCAATTGAAATTTGGCTCAGTGCCCACTTTGCGTTTGGGTCATACCGATCCCAATATATATCCGCATGGTTTTAGCAATGAAAGCGAGGCCCAAATGGCGGGCAAACAGCATACAGCACAGCACAATTGGCGCACCAAACGCTTGCAGTCGTCCACTGATGCCAATCAAGATATTTGCTGCCTCATCAATTGTCGACATAGCAGGCGTGAGTACACGCGACATTTCGCCTTTCCCAGCGAGCGCGAGCTACTCGATCAATGGCTCAATGCACTCGGCATGGAGTTTAATAGTTCGCGTCCAGATGATTATAAAATCTGTGAATGGCATTTCAAAGCGTCCGATTTCGATGGTGAAGTACTGCGCGCCGACGCGGTGCCCACGCGTAATTTGAAGATAGACGATGCCAATCAAACGGATGATGAGGATGATAATGAGGACGAAGATGATGATGTGGTTTGGAATGCAAACGAAGAGCCGCTAGATGAACGTCCATCCACTTCCGCAGCAGCCGCTGCCGCAGCGTCAACTACTACACCCGATGGCTATAACAAATTAATCCCGGGCTCGCGACGATGCTGCCTGGCACACTGTCGCAAGCAACTTTTCCAAGATAACGTGCGCACCTTCAAATTCCCCACAATGCATGAACAATTCGAGAAGTGGGTGCACAAtttaggcatcaaatatgaCGGTGACGCACCTTGGCGCTATCAAATTTGCAGCGAACACTTTGAAAACCAATGCATTATACATTACGAAAACAAAGCCAAGTTGTTTAGATGGGCTGTGCCGACTTTGAAATTGGGCAAACATGCGCCAACCATACTCTTCACGAATGAAAATCCCAAAAACCTGCAACAACGGGATGCAGACTATGAGCGCGTCTATAGAAATTCGAATGCCGGCGGCTATGCGGACAACATAGATAATGAGGAGACCATGGATACGACAAATGATGAGTTGAGTTCCACTGCGCCGCCCGCAGATGCGCCCAAACATATGGCTTATAAGCCAGAAGATATGGATTTGTTGGCACCAATTGAGCGACCGCCACACAAAATGGGCGGCGCAACGAAACGCAGCAACTATGCGTATACCGCGAGCAATGATGACGGCGACGATTACTacgatggtgatgatgatgcttACGATATGGGTGATGGTGAAAATTCACTCTTGAATGTCATACGCGAGGAAAAACCGAGCGCTGTGAAAGAAGGTACACCAGCCTCGTCGTTCTTCTCATTGCAACTGGTACGCGGCGGTTCCGCTAAAGTGCGCGCCTGCTGTCTACCGCATTGTGGACGCACTCGTGAGTCCGGTGTGCGTTTATTCCGTTTCCCCACAGAACCGGTCTTCCTCAAACGCTGGGAATACAATTTACGCGTACTCTTCAATGAGTCACAGCGCAATACGCACCTAATATGTAGCGCGCATTTCGAGCGTGGCCAATATAATAAACGTTTAGTCGTCGATGCCATACCGACATTGAATTTGGGCCACAACAGCACGGATATCTATCGTAATGGACAATATGAGCCAACTAGAATGCATAAGCGCGCATTGACGGCATCACCACCGCGCATACCATCGTCAAACAGTTTGCCGCTGAGTAGTCACAAACCGTTGCATTGTAATGTGCCCGCTTGCGCTGATACACAAAGTAAACGTCGCCTGTATCCCTTCCCTAGCAATCATGGGTTCGTGAAAATTTGGGTCGAACGCACACAGATCGCTTACGATGCGCGTCATCATGCTGAATTGCGCGTTTGTGAGCTGCACTTTGAGACGGATTGCTTTAGCGCGCATGGACTGAACAATAATGCTGTGCCCACATTGTTTTTGCCGGCACCAAATGCACTGCCCACACCTGCTTCGATTGCGCCAGTCGCATCAACAAAAATTCCACCGCCCATTGGTCGTTCAGCGGCAGCGTCCACTACGATGCCGAAGCTGAGTGCCATCACTTGTAGCGTTACCAATTGCAGCAATAGTACCGCTACACGTACGGACTTGAAAATATTCTCGAAATTCCCCGATGATTTTGAATTATTCACCAAATGGtgtttcaatttgaaaatcGATCCACGCACCTATGTGGATGGCAGTTATAATGTGTGCAGCGAACATTTCGAACCATTCTGCATTGGTGGCCATAGTTTGCGTGTCTGGGCTGTACCCACGCTGCGTTTAGGTCACAACAGCAAACTCATACATAGTGTCGAACGTCCTGCCGAGATGGAAACGAAGTGTTGTTTGCCGCATTGTGGACGCAAGAAGAGCAAAGATGGCGTGGAGTTCTATAGCTTCCCTAAAG
Protein Sequence
MSQNQNKQHLQQQQQLEHHQLQQQQQQHHHHQQQLLHHHHQQQQQQRQWHQQYQQQQQQLHHHHQQQQQQAQYAAAAAAHAHVAAATGMRSGNGMPGMFGNTLGRGGCLYDTSGGSLVGGGNGGGGGVGGFDLDMAAHHIGSGNASAITATSTAPAVGVGIAGGGVGGYLPSHVNVSSVAGGAYNAGMSGMPTSSESGRRASVGGCAYTQASQSQPQPPPQQPQQQQQHTILPPERIKMEPLEQILTPTIEMEELIIKTEPTDDAYNKVSTVDEAIATGSTNMNPNAGYMNFPKHLQPYPQHQQQQQQQLLEQQKQRQKLQHQQHALQQQQHLQQQQQQHLQHQQHQHLQQQQQQQREQQLQQQMLSHDAMAEAGLLTVKLEPQIHIKEEMPEASQNKPLNFPRRKVQTERSDTLPICQRCKQVFFKKQSYSKHVSQSLCEIVEYDFKCSICPMSFTSSEELQTHEQLHRENMYFCHKYCGKYFDTIELCEVHEYMQHEYVNYICNVCSAGFASRDLLFAHMPQHRNQPRYDCPVCRLWFHTGIQLHQHRIQAPYFCGKFYRRAGAAAPPMPAGGAPFGPVPPTHSANYNLQDCSMGIIEMPNSNSFSSYLQNRAFHHQHPAPPHPPLPPQQMHPHLAQQQQHLQRQQQQLHLQQQHPQFPTQPLHAAPQQPQPTPAQPDFMRRNSITGATGSATTEFQMPQIKTEIKVEPDLYVAPDYPLQTPLPPPPLPPPPLSTAQQSPLASPSRQRGRFGDFTNESFSAGGASGDTSHAFGTHNHSSGSNGNSNDFSNSNTSSLQHNNSNDNNGSKHKPSAFPVGNFHFPTTPAVSRTRVVDTGEDGAVCCVPHCGVTKQSSPTLQFFTFPKDEKYLHQWLHNLKMFPEPDSTYSQYRICSLHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNREITNTFTVGERAQCSMPGCPSQRGESNCKFYNFPNDMKTRIKWCQNARLPVNSREPRHFCSRHFEDRCFGKFRLKPWAVPTLHLGTPYGRIHDNPGVFYLEEKKCCLPHCKRTRSSDFNLSLYRFPRDEVLLRRWCYNLRLDPAIYRGKNHKICSAHFIKEALGLRKLSPGAVPTLNLGHNDRFNIYENELPTPPVVPPSKMFNFHNVSAPPPVYSSHRHSIASSSSHSERLYPNPVSKLKFSISNITAGDVSSMSGSATQNILDSLEVCFVPSCKRNRNIDNVTLHTIPRRPEQMRKWCHNLKIGIETLHKGVRICSAHFEPYCIGGCMRPFAVPTLNLGHDDPNIYRNPDVIKKLNIRETCCVQDCKRNRDRDHANLHRFPSNFEMLTKWCENLLKPVPDGTKLFNDAICEVHFEERCIRNKRLEKWAVPTVKLGHAEEPKHRLPTDEEIAEQWPKPLMPNKGIEEGECCVSTCRRDPKIDDVKLYRTPEDPELLAKWAHNLQTETEDLTTLRICNLHFEELCFSKKRRLHYWSLPTLNLATNVEQLYENPEPLVPQVIIKSETKRERRRMRDSNEPLKPWTPRCCLPHCRKRRDTDHVQLFRFPVLNRPMLAKWCHNLQLPQVGNGHRRLCSTHFEPRVLSKRCPIPMSVPTLDLNTPPGYKIYMNPARLKAVKLQQVCCISSCSRTRADGVQLFRFPHSRSVLRKWCHNTRQHARGEYRVCSRHFEPHAFGAKRLMPGAIPTLNLDPEVEDVYANEAQSFAETQCVVSGCVASKELDGVRLFKFPSDDVDLLWKWCNNLKMNPVDCRGVRICNRHFEPECVGSKMLYKWAVPTLALGHNDADIKLVSVPPPEARYSDVVTKCCVPTCGKSRKFDDAQMNSFPKNLRLFRCWKHNLKLDFLDFKDREKYKICSDHFEPVCLGKLRLNYGAIPTLNLGHDDTDDLYPVDPAQIQVSLFGKMRPRIASDKDAAEFELAGCVGKDDIKCSYPNCTATKMLLSEPYDMPAALPLHALWCAQMKVDAEALSETPKLCGLHFIKLYKATLEAANALAEGDNELSEAMKSLNETYEKCCTSPIVCSAQCCVVGCGSNQVSGSRTAQRLYLFPTAVGSGIDLLEKWCHNVDVSAADLSRFTHKVCARHFEAQCIGPTQRLRSWAIPTLQLKQKPEHKLHTIPELGSAARPHEARCCVATCARERGDFETMRLFSFPIIDEVLEKWLHNLQLTRNQCARLRICEQHFEPRCIVKTRLLRWAVPTLLLERNAEELLQNEPPVQAAAPAAPIADEVDAIDAASNIEGDEDMRDVDDEDDDNNSGELAAVPNVKMKKSLGIFKCCVPSCRRTRLQHGVRLFQFPTSHQILRKWCHNLKISLEKASDPALRICSIHFHKRCIDGKELRPWAKPTNALGHSGPIYENPKNLPGVFLPKCCLAHCRQRRTLENDLRTFGFPKDEALMRKWCANLRMEPRANHARICMSHFPPEAMGNKKLRANAVPTLNLGHNEPLEYDNALLIATAMKARNPLEEKKVLNSDASDTELGNIGEYDDEMDDEDNDDDDDYRIDEEDDENDFYASAGVGNSDEEEEETVEDFTPSISAATGDDDELYSNNEARDADNFIVDSADEDEEAEVEDEDEEDIYGNEEDDDEDDSDESDESVAENIYNSPAEEDDSDSESEVNAIDNNADSKDAEVDDDDVEMLIDADEDEDDKSSYNACDPLDFVECVNQSDDTQQNDAPANSTKVRNLHKAGKNMRERPITTSPTNYEDDSNTNYSNADETTRFTATTSNSASATTSSASASAAATSNAPASTRHCGADKISRSVFRLCCLRHRRKKKAPPDPPPDMRCARAKAPAVHRTLMNNIGRPRRQRAYCAVHRCGRTAGAAVTLYRFPLSGSRYCRRWCAQLKIKMTHTSRLRICQRHFPYSLVDRRRRRLRFGAIPTRNLYNTTGQFKRNPLYDLYKNITQSAANASERAASADTTQTPTAQLNVYNRCCVPHCGKSHQVDGVTLFRFPKLRSLYLQWATNLRLMPTMRLVQVYKVCSDHFEPQCLNYQRKDRANLKYGSVPTLKLGHNDTSKIYKQNTLLPQKRRGLGRRKWYPQKGVNECAVHDCKMAQFLQMQLFALPATLKLQERWCNYFKLNFTGASKDATEFFENVRLCALHYMEGYQMATYSDDARKGSSAALDELEANYARITSSTRIQMLKCCVPNCSTKFTDNVRLAAFPSAEELRAKWQHNTQVSFSPSHRYLYKVCALHFEERCFAKKRLFMWAIPTLHLPRPQNQDTEHKLFENPIFDSVSGAQCCIEDCATNQVKQEAAAANDDDKEVCKAAVRFWHFPQDDALREKWCHNLGLNALTNEISHTSRRWRICSRHFEQFCIGKTLRSWAVPTLHLPKPVKHAKSGRRSTYIYQNPDSAAVYYRCCIKTCHQLRDLDAGIRLYAFPKKDTMLQKWAHNIRMPAEKCRYARICTLHFEAQCLRPQMQSWALPTIDLGHDEADIFRVPKVKLMVTNERCCLPHCSKRRSRDNVHLFTFPRDKDVLDKWWHNLAIGVQDVKRRLICESHFEPRCISKRRLKRWAIPTLHLGHTNDTLKNPTPAEVATYESNTTARRSQTPSKMTRAKSTQPTAPTSTLQKCSIAHCARGVESSALYRFPKPDWLRKKWCDNTRIDEETAKQAKICARHFEPHVMGNRKPRPWALPTMELGADENGVPLPVVHANPKQLSRFHPEEHEWGELRYVRANHCSIISCLKSKKDGVTLFNYPTKRHMLQKWAENCRHYPYQAKRYRFQLCGAHFTADCFKREGTRLRKGSVPTLNLGHDDAQIHQSEFESVSAIKIEMKELKSCSVPQCGRTNLHEGVRLFKFPYERAETLEKWCHNLRMNAADCRNALICNMHFESRCVGGGHRGLLLRAIPTLLLGHNESDIFNNPESFDRPEKLISCCVPGCTNTKQTEGITLSAFPKLRSHFEKWAHNLQLPVSTTVWHTYKVCSAHFESYCYEHGRIKVGAMPTLKLGHSNAIDLYTVSEETMSQAFKRKRVGPKTETPKAPHEECCYPECREMELRLTNQLFEFPSLDDIRRVWYKSVGLSAEEQAEEVCVATENETPKTAPDNSMPQDADAAAAAAPALAPVKVAKISPKLCPMHFKLLYIEHAGLLDTLKSDSTADTQRILHKLDETYTKVCDISCVRRISCAVPGCNSNYLTTKSLKFFKFPDNAEMRAKWCHNTQVIIDPDRLYCYKICELHFEASCAPQLTKKIQRLKSWALPTLQLPPRAADAPDLYALPAPDTLAETKRASLLLNAPLNKCCIASCVYAKAPAERALSADGEIQFFNFPNDAELLYKWVYNTQISMVAATNARICSLHFEKHCINKRLRMYAVPTLLLGHQKTDIYKNPSDKRVAAETMESKPPKLPKYYEDNVADDVEVEEKEENEDDDDDDDDVLLSDVKAHKMKLKKSDITKYKTEVFGESEPRTVRKSAAHDEKKLERALLEPMLVVKTELKERKEMVTTTTTNKPATQPKPYHQPFIINIKQEKDIEENYTAEGIENQMKQGLLDMFNSFGDTGAEQEPDDINELGDEVTQMERDTARHCRIVGCNSYARNAGVTLFKFPFPLDQFRKWLHNTQLEVDYTRRWRYRICHRHFEPICMQFRKLPPGTMPTLNLGPKRPAHIYENEFDVNSLSKYKTKPQQQNLTTTTSNVLSDLNDEADTSSSFAAPTHHNNDNIDPSADYNDDYADFVDNTPIDRDTSRHCRIPDCNSHAKDPGVTLFKFPMSEYLFHKWLYNTQLKVDYTRRWRYRICQRHFEPICMQFRKLPPGTMPTLNLGPSRPARIYENSFDINHLKKFKTKLKNTTTTTTTSAAIVPTSTSAIMHSSHADYTDDLETSSSYAVDANVSNTSQLPLLSCTVRHCTSQYHALHEGLHLHKLPTNIMLREKWIYNCRFSEETLISMSSRIRICSLHFTPNCYYGVKRQLKFGSVPTLRLGHTDPNIYPHGFSNESEAQMAGKQHTAQHNWRTKRLQSSTDANQDICCLINCRHSRREYTRHFAFPSERELLDQWLNALGMEFNSSRPDDYKICEWHFKASDFDGEVLRADAVPTRNLKIDDANQTDDEDDNEDEDDDVVWNANEEPLDERPSTSAAAAAAASTTTPDGYNKLIPGSRRCCLAHCRKQLFQDNVRTFKFPTMHEQFEKWVHNLGIKYDGDAPWRYQICSEHFENQCIIHYENKAKLFRWAVPTLKLGKHAPTILFTNENPKNLQQRDADYERVYRNSNAGGYADNIDNEETMDTTNDELSSTAPPADAPKHMAYKPEDMDLLAPIERPPHKMGGATKRSNYAYTASNDDGDDYYDGDDDAYDMGDGENSLLNVIREEKPSAVKEGTPASSFFSLQLVRGGSAKVRACCLPHCGRTRESGVRLFRFPTEPVFLKRWEYNLRVLFNESQRNTHLICSAHFERGQYNKRLVVDAIPTLNLGHNSTDIYRNGQYEPTRMHKRALTASPPRIPSSNSLPLSSHKPLHCNVPACADTQSKRRLYPFPSNHGFVKIWVERTQIAYDARHHAELRVCELHFETDCFSAHGLNNNAVPTLFLPAPNALPTPASIAPVASTKIPPPIGRSAAASTTMPKLSAITCSVTNCSNSTATRTDLKIFSKFPDDFELFTKWCFNLKIDPRTYVDGSYNVCSEHFEPFCIGGHSLRVWAVPTLRLGHNSKLIHSVERPAEMETKCCLPHCGRKKSKDGVEFYSFPK

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00192124;
90% Identity
iTF_00191395;
80% Identity
-