Basic Information

Gene Symbol
-
Assembly
GCA_031772095.1
Location
CM062651.1:22700445-22723558[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) that probably also binds a zinc ion. It has a conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features are its location at the N-terminus of proteins, its size of about 90 residues, a C-terminal AVPTIF box, and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably in worms and flies, but not in other eukaryotes or in any prokaryotes [1].
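The C2CH consensus in the description maps directly onto a regular expression. The sketch below is a minimal illustration, assuming any residue may fill the spacer positions; the helper name `find_c2ch` is hypothetical. A match only flags a candidate C2CH signature. It does not confirm a THAP domain, which this record assigns with a profile HMM (PF05485) rather than a pattern.

```python
import re

# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His
# Assumption: spacer positions accept any character (no residue filtering).
C2CH = re.compile(r"C.{2,4}C.{35,50}C.{2}H")


def find_c2ch(protein):
    """Return 0-based, end-exclusive (start, end) spans of non-overlapping matches."""
    return [m.span() for m in C2CH.finditer(protein.upper())]


# Fragment copied from the Protein Sequence field of this record.
fragment = ("CCVPHCGVTKQSSPTLQFFTFPKDEKYLHQWLHNLKMFPEPDSTYSQYRICSLHF"
            "PKRCINRYSLCYWAVPTFNLGHDD")

for start, end in find_c2ch(fragment):
    print(start, end, fragment[start:end])
```

Because `finditer` reports non-overlapping matches only, closely spaced or nested candidates would need a lookahead-based scan instead.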
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 40 1.1e-15 1.6e-12 48.7 4.8 1 86 562 634 562 635 0.86
2 40 3.2e-14 4.7e-11 44.0 6.7 1 87 662 731 662 731 0.82
3 40 3.1e-16 4.4e-13 50.5 0.3 1 87 753 825 753 825 0.85
4 40 3.1e-13 4.5e-10 40.8 1.7 1 87 918 987 918 987 0.81
5 40 5e-13 7.3e-10 40.1 5.5 1 86 1011 1082 1011 1083 0.79
6 40 2.9e-13 4.2e-10 40.9 2.2 1 87 1119 1189 1119 1189 0.78
7 40 4e-11 5.7e-08 34.1 2.2 1 85 1235 1303 1235 1305 0.74
8 40 3.6e-15 5.1e-12 47.0 2.1 1 87 1330 1396 1330 1396 0.82
9 40 5.5e-12 7.9e-09 36.8 0.3 1 86 1417 1486 1417 1487 0.80
10 40 3.7e-13 5.3e-10 40.6 1.9 1 87 1514 1586 1514 1586 0.86
11 40 0.0034 4.9 8.6 0.0 1 58 1637 1687 1637 1706 0.79
12 40 9.5e-13 1.4e-09 39.3 0.2 1 86 1733 1811 1733 1812 0.84
13 40 2.1e-12 3e-09 38.2 1.0 1 86 1838 1907 1838 1908 0.80
14 40 9.7e-13 1.4e-09 39.2 5.5 1 85 1981 2050 1981 2052 0.82
15 40 1.4e-14 2e-11 45.2 1.5 1 87 2074 2143 2074 2143 0.83
16 40 2.3e-10 3.3e-07 31.6 0.7 1 86 2517 2587 2517 2588 0.77
17 40 5.5e-16 7.9e-13 49.6 0.3 1 86 2637 2712 2637 2713 0.81
18 40 5.7e-05 0.081 14.3 0.0 1 60 2750 2809 2750 2829 0.77
19 40 9.7e-13 1.4e-09 39.2 4.4 1 87 2849 2919 2849 2919 0.82
20 40 8.1e-15 1.2e-11 45.9 2.4 1 86 2946 3032 2946 3033 0.79
21 40 2.2e-11 3.1e-08 34.9 1.5 1 86 3063 3131 3063 3132 0.79
22 40 2.4e-14 3.4e-11 44.4 3.4 1 87 3155 3224 3155 3224 0.82
23 40 1.1e-10 1.6e-07 32.7 0.6 1 86 3275 3341 3275 3342 0.79
24 40 1.4e-13 2e-10 42.0 1.2 1 87 3383 3455 3383 3455 0.80
25 40 4.6e-12 6.6e-09 37.1 0.1 1 86 3484 3555 3484 3556 0.78
26 40 1.1e-13 1.6e-10 42.2 1.4 1 87 3580 3651 3580 3651 0.80
27 40 0.00041 0.6 11.6 0.1 1 44 3691 3731 3691 3741 0.83
28 40 3.9 5.7e+03 -1.2 0.0 46 80 3768 3794 3754 3800 0.62
29 40 4.8e-15 6.9e-12 46.6 2.3 1 86 3825 3901 3825 3902 0.84
30 40 4.9e-13 7.1e-10 40.2 0.2 1 86 3939 4014 3939 4015 0.79
31 40 3.7e-12 5.3e-09 37.4 1.1 1 87 4216 4287 4216 4287 0.78
32 40 6.9e-13 1e-09 39.7 0.9 1 87 4380 4451 4380 4451 0.82
33 40 7.5e-14 1.1e-10 42.8 0.8 1 86 4531 4607 4531 4608 0.83
34 40 4.7e-13 6.7e-10 40.3 0.4 1 86 4658 4727 4658 4728 0.80
35 40 9.5e-14 1.4e-10 42.5 4.2 1 87 4791 4866 4791 4866 0.82
36 40 1.1e-13 1.6e-10 42.3 0.4 1 87 5023 5092 5023 5092 0.81
37 40 2e-09 2.8e-06 28.6 0.8 1 86 5139 5207 5139 5208 0.81
38 40 1.5e-12 2.2e-09 38.6 0.5 1 86 5252 5324 5252 5325 0.77
39 40 8.5e-12 1.2e-08 36.2 0.3 1 86 5346 5416 5346 5417 0.75
40 40 1e-10 1.5e-07 32.7 2.9 1 86 5440 5508 5440 5509 0.83
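
Each row of the table above has 13 whitespace-separated fields, so it can be parsed by position. The sketch below is illustrative only: the `DomainHit` type, the `parse_hits` and `significant` helpers, and the 1e-3 cutoff on the independent E-value are assumptions, not part of this record or of HMMER. The three sample rows are copied from the table.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int        # "#": domain number within this protein
    total: int        # "of": total number of domains reported
    c_evalue: float   # conditional E-value
    i_evalue: float   # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# One converter per column, in table order.
_CASTS = (int, int, float, float, float, float,
          int, int, int, int, int, int, float)


def parse_hits(text):
    """Parse domain rows; header and malformed lines are skipped."""
    hits = []
    for line in text.strip().splitlines():
        fields = line.split()
        if len(fields) != len(_CASTS) or not fields[0].isdigit():
            continue
        hits.append(DomainHit(*(cast(f) for cast, f in zip(_CASTS, fields))))
    return hits


def significant(hits, max_i_evalue=1e-3):
    """Keep hits whose independent E-value is at or below the (assumed) cutoff."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


rows = """
1 40 1.1e-15 1.6e-12 48.7 4.8 1 86 562 634 562 635 0.86
11 40 0.0034 4.9 8.6 0.0 1 58 1637 1687 1637 1706 0.79
28 40 3.9 5.7e+03 -1.2 0.0 46 80 3768 3794 3754 3800 0.62
"""

hits = parse_hits(rows)
for h in significant(hits):
    print(h.index, h.ali_from, h.ali_to, h.i_evalue)
```

With this assumed cutoff, rows 11, 18, 27 and 28 of the full table would be dropped and the other 36 kept.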

Sequence Information

Coding Sequence
ATGAATCCTAACGCTGGTTATATGAATTTTCCAAAGCATTTACAAGCATATCCTCAGCATCAGCATGCAcatcaccagcaacaacaattgttgcaacaacagaaacaacaacaaaaattgcaacatcaacaacaattgcaacaacaacaacatctacatcaacaacaacaacagctgcagcaacagcaacaacaactgcagcaacaacgtgaacaacaattgcaacagcaacagcaacagcagcagcaaatgtTATCACACGATGCCATGGCCGAAGCCGGCTTGTTAACAGTAAAACTTGAACCGCAAATACACATTAAAGAGGAAATGCCGGAAGCGTCACAGAATAAGCCCCTCAACTTTCCACGCCGCAAAGTTCAAACGGAACGCTCCGAAACGTTGCCGATTTGTCAGCGCTGCAAACAGGTCTTCTTCAAAAAGCAAAGCTACAGCAAACATGTTTCGCAGAGTCTCTGCGAGATTGTCGAATATGACTTCAAATGTTCTATATGCCCCATGTCGTTCACATCCAGCGAGGAGCTGCAAACGCACGAGCAACTGCATCGCGAGAATATGTATTTCTGTCACAAATACTGTGGCAAATATTTTGACACAATCGAACTGTGCGAGGTGCACGAGTATATGCAGCACGAATATGTCAATTATATCTGTAATGTTTGTTCGGCGGGCTTTGCTAGCCGCGATTTGCTATTCGCGCACATGCCACAACACCGTAATCAACCGCGTTATGATTGCCCGGTATGTCGTTTGTGGTTTCACACAGCCATACAATTGCATCAGCACCGTTTGCAAGCGCCATATTTTTGTGGGAAATTTTACCGACGCGCCGCTGGTGCAGcaCCAAGTGGCGCGCCATATGGTGGCCCAATGCCGCCAACACATTCGGCTAATTACAATCTACAAGACTGTTCCATGGGCATTATAGAAAtgCCTAGCAGCAATAGCTTCTCTTCATATCTACAAAATCGCGCCTTTCACCATCAACACCCAGCGCCGCCACATCCGCCGTTGCCACCACAGCAATTGCATCCACATCCACATTTAGCGCAACAGCAACACCTGCAACGTCAACAATTAcatctgcaacaacaacatccgcAATTTCCAACACAACCGCTACATCCAGcgacacagcaacaacagccaaCCCCCGCTCAGCCAGATTTTATGCGCCGCAACAGCATTACTGGTGCTACCGGCGCTGCCACAACAGAATTTCAAATGCCACAAATTAAAACCGAAATCAAAGTGGAGCCAGACATCTATGGCACATCAGATTATCCACTACAGACGCCACTGCCACCACCACCTTTACCACCACCGCCGTTAAGCGCCGCACAACAACAATCTGCGTTGGTCTCACCATCACGTCGCTTTGGTGATTACTCAAATGAGTCCTTCGGCGCTGGTGGTGGTGCTGGCGGCGATTTGTCGCATGCATTCGGCCCACACAATAACAGTGGCGGAAGCAATAGTAACAGTAATGACTTCTCAAACAGCAACCCGTCGAGTCTGCAACGCAGCATGAGCGCCGCCAATAGCAGCAATGACATTGACATCAAGCATAAACCGACACCATTTCCTGTTGGCAATTTTCATTTTCCCACCACACCATCAGTGTCGCGTACACGTGTGGTCGATACCGGTGAAGATGGCGCAGTTTGCTGTGTGCCACATTGCGGCGTCACTAAACAGTCCAGTCCAACACTGCAATTCTTTACATTCCCGAAAGATGAAAAGTATTTGCATCAATGGTTGCACAATCTCAAAATGTTTCCCGAGCCCGATTCCACCTACAGCCAATATCGCATTTGTAGTTTACATTTTCCCAAACGTTGCATTAATCGCTATTCGTTGTGCTATTGGGCAGTGCCCACATTCAATTTGGGTCACGACGATGTCGCGAATTTGTATCAGAATCGTGAAATCACCAACACATTTACGATTGGTGAACGCGCTCAGTGCAGCATGCCCGGCTG
TCCGAGTCAGCGTGGTGAGAGTAATTGCAAATTCTACAATTTCCCCAATGATATGAAGACGCGCATTAAGTGGTGTCAGAATGCGCGACTGCCGGTGAACAGTCGCGAACCGCGTCACTTTTGTAGTCGTCACTTTGAGGAACGCTGCTTCGGCAAGTTTCGGCTGAAGCCTTGGGCTGTGCCCACACTACATTTGGGCACACCATATGGCAGAATACACGATAATCCGGGCGTATTCTATTTGGAGGAGAAGAAATGCTGTCTGCCACATTGTAAGCGCACGCGTTCATCTGATTTCAATCTATCGTTATATCGTTTTCCACGTGATGAAGTGCTGTTGCGACGTTGGTGCTACAATTTGCGGCTGGATCCGGCAATATATCGTGGCAAGAATCACAAAATTTGTAGCGCACATTTTATCAAAGAGGCGTTGGGATTGCGTAAGCTGTCGCCagGCGCAGTACCGACTCTCAATTTGGGTCACAACGACCGCTTTAATATCTACGAGAATGAGTTGCCAACACCACCGGTAGTACCGCCGTCGAAAATGTTCAATTTCCATAATGTCTCCGCACCGCCACCAGTCTACAATAGCCACCGACATAGCATTGCGTCGAGCAGCAGTCACAGCGAACGCATGTATCCCAATCCAGTGTCGAAACTCAAATTTAGCATATCGAACATAACGGCGGGTGATGTGAGTGCCATGAGCGGTAGCGCCACACAGAATATGTTAGACTCGTTGGAAGTTTGCTTTGTGCCGAGTTGTAAGCGTAATCGCAATATAGACAACGTCACACTGCACACCATACCGCGTCGTCCAGAGCAGATGCGCAAATGGTGTCACAATCTCAAGATCGGCATCGAAACGCTACATAAAGGTGTACGTATTTGCAGCGCACACTTCGAACCCTATTGCATTGGCGGCTGTATGCGTCCATTTGCGGTGCCCACTTTGAATTTGGGACATGACGATCCGAATATATACCGCAATCCGGATGTCATTAAGAAACTGAATATACGCGAAACATGTTGTGTGCAAGATTGCAAGCGTAATCGTGATCGTGATCATGCAAATCTACATCGTTTCCCATCGAATTTCGAATCGCTCACCAAATGGTGTGAGAATCTGTTGAAGCCGGTGCCCGATGGCACAAAACTCTTCAACGATGCCATATGTGAGGTGCATTTCGAAGATCGTTGCATACGCAATAAGCGTTTGGAGAAGTGGGCCGTGCCCACGGTGAAACTGGGACACAGCGAGGAGCCGAAACATCGTTTGCCAACGGATGAAGAGATAGCCGAGCAATGGCCAAAGCCGGTAATGCCTAATAAGGGTATAGAAGAGGGTGAATGTTGTGTGTCCACATGCCGACGCGATCCGAAGATTGATGATGTTAAACTCTACCGCACACCCGAAGATCCGGAGCTGTTGGCGAAATGGGCGCATAACTTGCAAACCGAAACAGAAGATCTTACCACCTTGCGCATTTGTAATCTACACTTTGAAGAGCATTGCTTTAGCAAAAAGCGTCGTCTACACTATTGGTCCTTACCCACACTCAATTTGGCTAATAATGTAGAACAATTATACGAGAATCCCGAACCACTCGTGCCACAGGTGATCATAAAATCGGAGACGAAACGTGAACGCAGACGCATGCGCGATGCCAATGAGCCACTGAAGCCGTGGACGCCACGCTGCTGCCTACCGCACTGTCGCAAACGACGTGACACGGATCATGTGCAGCTTTTCCGTTTCCCCATACTCAATCGACCAATGTTGGCTAAGTGGTGCCACAATTTACAACTACCGCTGGTCGGTAATGGACATCGCCGTTTGTGCTCTACACATTTTGAGCCGCGCGTACTGTCGAAACGCTGTCCCATACCTATGTCAGTGCCCACATTGGATCTGAATACACCACCAGGTTATAAGATCTATATGAATCCGGCGCGTCTAAAGGCGGTCAAGTTGCAACAAGTCTGCTGCATTTCAT
CCTGCAGTCGTACACGCGCCGATGGCGTACAGCTCTTCCGCTTTCCGCATAGTCGCAGTGTGTTGCGCAAATGGTGTCACAATACGCGGCAGCACGCGCGCGGTGAATATCGCGTGTGTTCGCGTCACTTTGAACCTCACGCATTCGGCGCTAAGCGACTAATGCCTGGCGCAATACCCACATTGAATTTGGATCCGGAAGTTGAGGATGTGTATGCGAATGAAGCGCAGAGTTTCGCTGAAACGCAGTGTGTGGTAAACGGTTGCGTCGCAAGCAAGGAATTAGATGGCGTGCGGCTCTTCAAGTTCCCCAGCGATGATGTCGATTTACTATGGAAGTGGTGtaacaatttgaaaatgaacCCGTTAGACTGTCGCGGCGTGCGTATCTGTAATCGACACTTCGAAGCCGAATGTGTCGGACCGAAGATGCTTTATAAGTGGGCTGTGCCGACACTGGCTTTGGGTCATAACGATGCCGACGTCAAGTTGGTGTCCGTACCGCCGCCAGAGGCGCGTTATAGCGATGTGGTTACCAAATGTTGCGTTCCCACATGCGGAAAGTCACGCAAATTTGATGACGCACAAATGAACAGCTTCCCGAAGAATTTGCGCCTATTCCGCTGCTGGAAGCATAACCTCAAATTGGACTTTTTAGACTTCAAAGatcgtgaaaaatataaaatttgcagTGATCACTTCGAACCCGTTTGTCTGGGTAAATTGCGTCTCAACTATGGCGCTATACCCACACTCAATCTTGGACATGACGACACAGATGATTTATATCCCGTGGATCCCGCGCAGATTCATGTGTCGCTCTTCGGCAAAATGCGCCCACGCATTGCCAGCGACAAAGATGCAGCTGAATTCGAAATGGCTGCTAGTGTTGGTAAAGATGACATCAAATGCACTTATCCCAATTGTACTGCAACGAAGTTGCTGTTAAGCGAACCCTATGATATGCCAACCGCTTTGCCGTTACATGCGCTTTGGTGCGCACAAATGAAAGTCGACGCGGAGGCGCTCAGCGAGACGCCAAAATTGTGTGGACTACATTTCATAAAGCTCTATAAAACCACTTTAGATGCAGCTAATGCGCTAGCGGAGGGCGATAACGACCTGAATGAGGCCATGAATGATTTGAACGCCACTTATGAAAAATGTTGCGTATCACCAATTGTTTGCAGCGCCCAGTGTTGCGTTGCCGGTTGTAACAGTAATCAGGTTAGCGGCAATCGTACCGCACAACGCCTCTACCTATTCCCTACTGCGGGCGGTAGTGGCCTGGAATTACTTGAGAAATGGTGCCACAATGTAGATGTTTCTGCCGCGGACCTTGTTCGCTTTACGCACAAAGTTTGCGCGCGCCATTTCGAAGCACAATGTATTGGCCCCACACAACGGTTGCGCTCATGGGCCATACCGACTTTACAGCTCAAATCCAAACCGGACCATAAGATACATGCCATACCAGAATTGGGTAGCGCGGCACGTCCACATGAGGCGCGTTGCTGTGTTGGCACTTGTGCACGTGAACGCGGTGATTTCGAAACAATGCGTCTTTTCAGTTTCCCCATTATCGACGAAGTGCTCGAAAAGTGGTTGCACAACTTACAATTGACACGCAATCAGTGCGCACGTTTACGTATCTGCGAGCAACATTTCGAACCGCGTTGTATTGTGAAGACGCGCTTGTTGCGTTGGGCAGTACCAACATTGCTGTTGGAGCGCAATGCAGAGGAGCTGCTACAGAACGAACCACCTATGCATGCGGCAGTGCCTACCGCACCAATCGTTGATGAAGTGGATACAATCGATGCGGCAAGTAATAACGATGGCGAAGATATTCGCGATGTTAATGATGATGAGGATGACAATGACGACGAATTGGCAGCAGTGCCTAATGTTAAGATGAAAAAGTCACTTGGTATCGTTAAGTGTTGTGTTCCAAGCTGCAAACGCACACGTTTACAACACGGCGCGCGACTTTTCCAGTTT
CCCACAAGTCATCAAATCTTACGCAAATGGTGTCACAATCTGAAAATATCAATCGAAAAGGCTACAAATCCCGCCTTACGTATTTGCAGTCTACACTTCCACAAACGTTGCATTGATGGCAAAGAGTTACGTCCCTGGGCCAAGCCGACTAATGCGCTCGGCCATAGCGGTCCAATTTATGAGAATCCTAAAAATCTACCCGGCGTCTTTTTACCCAAATGCTGTTTGGCTCACTGCCGTCAGCGGCGCACGCTGGAAAATGACTTACGAACATTTGGCTTCCCCAAAGATGAGACGCTTATGCGCAAATGGTGTGCGAATTTACGCATGGAGCCACGCGCTAATCACGCACGCATCTGCATGTCTCACTTCCCGCCCGAGGCGATGGGCAATAAGAAGTTGCGCGCCAATGCGGTGCCTACGCTAAATTTGGGCCACAACGAACCGCTGGAGTATGATAACGCACTGTTGATCGCGACAGCGATGAAAGCCAGAAATccATTGGAGGAAAAGAAAACTTTGATTTCAGAAGCCAGTGATCCTGAACTGGGAAATAACGGCGAATACGATGATATAATGGATGACGAAgacaatgatgatgatgatgactatCGCATAGATGAAGAAGATGATGAGAACGACTTCTACGCCTCCGCCGGTATGGGCAATAGTGATGAGGAGGAAGAGGAAACAGTTGAAGACTTTACGCCGAGTATTAGTGCCGCAACCGGCGACGACGATGAGCTGTTTAGTAATGCCGAAGCGCGTGATGCCGACAATTTCATAGCCGATTCGGCAGATGAGGAGGATGAGGAGGGCGCGGAAGTTGATGATGAGAATGAAGAAGATGTATACGGtaatgacgatgatgatgatgaggatgACAGTGATGAGAGCGATGAAAGTGTTGCCGAAAATGTTTACAACAATGCCGTAGAGGATGATGAGAGCGACAGCGAAAGCGAACTTAACGCTGAAGACAACAACGCTGATAGTAAAGATGCCGAAGtggatgatgatgatgtggaAATGCTCATCGAtgccgatgatgatgatgatgataagaGCAGTTACAATGCTTGCGATCCACTCGATTTTGTGGAATGTGTTAATCAGAGCGATGATACGCAACAACATGAAGTGGCCATTACAACAAAAAGCGCCAAAATGCGACATGCTAATAAAAATCTACGCAAGCGTCCAATATCAACAACATTGAAAAATTACGAAGACGACTCCAATACAAACTACAGTAATGCTGATGAAATGACACGTTTCACAGCGACCACTTCAAATTCCGCCTCAGCAACCACATCGTCGGCATCACCTTCAGCTGCCGCTGCGACCACTGCAAGGGGTGCACATCATTGTGGTGCTGAGAAAATAAGTCGTTCGGTTTTTCGGCTTTGTTGCCTCAGACATCGTAGGAAGAAGAAAGCGCCACCAGATCCACCACCTGACATGAGAGACAAAGCGCCAACAGCATATCGGACGTTGATGAATAGTATGGCACGCGTTAGCACGCAGCGTCGACAACAGCGCGCCCACTGTGCTGTGCGCCGTTGCGGACGTGCTGCCGGCGCTGCTGTCACCTTGTATCGCTTTCCATTAGTCGGCAGTCGTTACTGTCGGCGTTGGTGCGCACAGTTAAAAGTTAAATTGTCGCACACTTCACGCCTGTGTATTTGCCAGCGCCACTTTGCTTACAGTTTAGTGGATCGGAGCCGTAGACGTTTACGTTTCGGCGCAATACCAACACGCAATTTACATAATACTCCGGCACAATTCAAGCGTAATCCACTCTAtggcttaaataaaaatacaacacaaTTGACGGCAACAGCTCACCACGACGCCACCGCGCATGCAATTCAAACGACCGCAGCGCAATTGAACGTCTATACTCGTTGTTGCGTGCCACATTGCGGTAAGTCGCATCAAGTGGATGGCGTTACGCTCTTCCGCTTCCCAAAGCTGCGCTCACTCTATCTGCAATGGGC
TACGAATTTGCGCTTAATGCCTACCACGCGTTTAGTGCAAGTCTACAAAGTTTGCAGCGACCACTTCGAGCGCGATTGTCTCAGTTATCAGCGTAATGACCGCGCTAAACTAAAATATGGCTCTGTGCCGACActgaaattgggtcataacgACACATTGAAAATCTACCAGCACAACACGCTCTCGCCACAAAAGCGACGCGGCGTTGGCAGACGTAAGTGGCGCCCACAAAAAGGCGTGAACGAATGCGCCGTGCACGATTGTAAGGTAGCACAATTTCTACAAATGCAACTCTTTGCTTTGCCAGCCGCGCTAAAACTGCAAGAGCGTTGGTGTAATtacttcaaattaaatttcagcGGCGCGTCGAAAGAAGCCACTGAATTCTTTGAGAATGTGCGCTTATGCGCTTTGCACTACATGGAGGGTTATCAAATGGCGACCTATAGCGATGGTGCGCGCAAAGGCACATCAGCTGCTTTGGACGAACTGGAAGCGAACTATGCGCGCATCACCAGTTCAACGCGCATACAAATGCTGAAATGTTGCGTCCCCAATTGTTCGACAAAATTTACGGACAACGTGCGCTTGGCGGCATTTCCCAGCGCGGAAGAGTTGCGCGCTAAATGGCAACACAATACACAGGTGTCATTCAGTCCATCGCATCGTTATTTGTATAAAGTATGCGCATTGCATTTCGAAGAGCGTTGCTTCGCCAAAAAGCGTCTCTTTATGTGGGCCATACCGACATTGCATTTACCGAAACCGCAAAATCAAGATCCAGCGCATAAGCTTTTTGAAAATCCCAACGTGGAAGTGGCGAGCACCGCGCAGTGCTGTATTGAAGGTTGCGCCACTAATGCGGTGAAGCAAGAGTCTGCGGTGAATGTTGACGAAAAAGTGTCGAAAGCTGCTGTGCGCTTTTGGCATTTCCCACAAGACGACGCGTTACGCGACAAATGGTGTCATAATTTGGGACTCGGCGCGCAAACTAACGAGATCAGCCATACAAGTCGACGCTGGCGTATATGCAGTCGCCACTTCGAGCCATTTTGTATCGGTAAAACGTTGCGCAGCTGGGCTGTGCCTACACTGGAATTACCCAAACCGGTTAAACATGCGAAGAGCGGCAAACgctctacatatatttatcaaaatccGGACAGCGCGGCGGTCTACTATCGCTGCTGTATTAAAACATGCCATCAGCTGCGCGATCTCGATGCTGGCATACGTCTATATGCATTCCCCAAAAAGGATACAATGCTACAAAAATGGGCGCATAACATACGTATGCCCGCGGTGAAATGTCGCTACGCGCGCATCTGCACACTGCACTTTGAAGCGCAATGTTTGCGACCGCAAATGCAATCCTGGGCGCTACCGACAATCGATTTGGGACATGACGAGGCGGATATTTTCCGCGTACCAAAGGTGAAGTTGATGGTTACGAGTGAACGATGCTGTCTACCACACTGCAGCAAGCGACGCAGTCGTGACAATGTGCATTTGTTCGGTTTTCCGCGCGATAAACGTGTATTGGACAAATGGTGGCGTAATTTAGCGATTGGTGTGCAGGACGTTAAACGTCGTCTCATATGTGAAGCGCATTTTGAGCCGCGTTGCATTAGCAAGCGGCGTCTTAAACGTTGGGCCATACCAACACTCAATTTGGGACATACAAACGAGACGCTCAAAAATCCAACGCCAGCTGAAGTGTTGGCTAATGAAGGCAATACAACAATGCGACGCTCACAAACGCCCTCGAAAATGACGCGCGCCAAGTCAGCACAACCAACAACGCCAAGCAGTTTACAGAAATGCTCCATCAGCGCTTGTGCACGCGGCGCTGATAACAACGCGCTCTATCGCTTTCCGAAACCGGCTTGGTTACGTAAGAAATGGTGTGATAATACGCGTTTAAACGAAGAGGCCGCCAAGCTTGGTAAAATTTGCGCACGTCACTTCGAAACACATGTTATGGGTAATCGTAAACCGC
GCCCATGGGCGTTGCCAACTTTGGAATTGGGCGCTGATGAAAATGGCGCAGCATTGCCAGCCGTGCATGCAAATCCCAAACAGCTGTCGCGTTTCCATCCCGAAGAGCATGAATGGGGCGAGTTGCGCTATGTGCGTGCCAATCATTGCTCCATAATCTCTTGCATGAAATCGAAAAAAGATGGCGTTACACTTTTCAATTATCCCACAAAGCGTCACATGTTGCAGAAATGGGCTGAGAACTGTCGTCATTATCCATATCAAGCGAAACGATATCGTTTCCAACTATGCGGCGCGCATTTCACTGCCGATTGTTTCAAACGTGAAGGTACGCGCCTACGTAAAGGCGCAGTGCCTACCTTAAATTTGGGGCATGATGATGCGCAGATACACCAGAGTGAATTCGAGAGCGTAACCGGTATAAAAATCGAaacgaaaataatgaaaagttGCAGCGTGCCACAATGTGGACGCACAAATCTGCATGATGGCGTACGACTCTACAAGTTTCCTTACGAACGCGCTGAGACGCTGGAAAAGTGGTGCCATAATTTACGCATGGATGCGTCGGATTGTCGCAACGCGCTCATTTGTAATATGCATTTTGAGCCGCGTTGCATTGGTGGTGGACAGCGCGGTTTGCTCTGGCGCGCCATACCCACATTGCTGTTGGGACACAACGACGCCGATATCTTGCACAATCCAGAAACATTTGAGCGTCCGGAGAAGGTAATTAGTTGTTGCGTGCCCGATTGTATTAACACCAAACAAACAGTTGGCATAACGCTGAGCGCCTTTCCAAAGCTGCGCAGCCATTTCGAGAAATGGGCACACAACCTGCAGCTGCCGGTCAACACCGTCGTCTGGCATACATACAAAGTGTGCAGCGCGCACTTTGAGCGCTACTGCTATGAGCACGGACGCATCAAGGTGGGCGCAATGCCGACACTAAATTTGGGGCACAACAACGCCATTGATTTGTACACCGTCAGTGAAGAATCAATGAGTAATGCATTTAAGCGCAAACGCATCGCACCCAAAAATGAAGCGCCGAAAATGCCAAACGAAGTGTGTTGCTATCCGGAATGTAGAGAAATGGAGTTGCGTTCGACGAATCAAGTGTTTGAGTTTCCCAAAATGGGTGCCATACGACGCGCTTGGTACGAGAGTATTGGCTTAAGCAGCGAAGAGGCGCAAGTCAAAGAAGAGTGTGAGGCAATAGCAGTAGACGCACAAAATACAACGTGCGTCACTGAAATGCCGCAAGAGGACGCTACTGTTGCTGATGCAATAAAAGTTACGAAAAAATCACCTAAATTATGTCCGATGCATTTCAAATTGCTCTACATTGAACATGCCGCACTGTTGGATACGCTCAAGTCGGACAATGCACCTGACACGCAGCACATGTTACAGCAGCTCGAAGAGACTTACGCAAGAGTTTGTGATATGTCATGCGTGCGTCGCATTAGCTGCGCCGTGCCCGACTGCAATTCAAATTATCTTACCACTAAAGCGCTGAAATTCTTCAAATTCCCCGATAACGCAGAAATGCGCGCCAAATGGTGCCACAACACGCAAGTCACCATCGATGCCGATCGCTTGTATTGCTATAAAATATGTGAATTACACTTCGAAGCCATTTGCGCCTCGCAGTTGGCTAAAAAAATACAACGTTTGAAATTCTGGGCCCTACCGACGCTACAATTACCGCCACGCGCTGAAGGCGAACCCGAAATATATGCACTACCAGCGCCCGATGCGCTAGCCGAAACGAAACGCGCCTCATTGCTGCTACACACGCCACTCAATAAATGTTGTATAGCGAGTTGTGTGTACGCTAAAGCGCTAGCAGAGCGCGCGGTTAGCGCTGATAgtgaaatacagtttttcaATTTTCCGAACGATGCGGAATTACTCTACAAATGGGTATATAACACACAAATCAGTATGGTTGCCGCGACCAATGCGCGCATATGCTCGCTACACTTTGAAAAACAC
TGTATTAATAAACGTTTACGCATGTATGCGGTGCCCACGCTGTTGTTGGGGCATGAGAAAACCGATATCTACAAGAATCCCTCGGATAAGCGAGTGGCGGCGGAAGCGTTGGAGAGCAAGCCACAGAAGCTGCCGAAGTATTACGAAGATAATATTGCCGATGACGTTGTAGATGCAGAAAATGACGAGTATGAGGATGAATTGTTAGCGGAAGACGCGTcagagaaattgaaatttaaaaaatctggcATAACTAAATATGAAACTGAAGCGTTTGACGAGGGTGAGCCAGGCGCGGCGTGCAAGTCAGCCTCAAGCGATGAGCAGCAGCTAGACAGTTCACTTCTAGAGCCAATGCTAATGGTGAAAACCGAGTTCAAAGAGCGCAAAgaaacaccaccaccaccaacaacaagtaaagcgacaacacaacaaaaacctTATCACCAAcctttcattataaatattaaacaagaaAAAGATGTGGAAGAAAACTACACCGCTGAAGGCATCGAAAATCAAATGAAGCAGGGCTTACTCGATATGTTCAGCAGTTTTGGTGAGATTGGCGGTGAGCAGGAGCCGGATGACATTAACGAGCTGGATGACGAGAGTACGCAAATGGAACGCGACACCGCGCGTCATTGCCGCATACATGGTTGCAATAGTTATGCGCGCAATGCCGGCGTCACGCTCTTCAAATTTCCCTTTCCACTCGATCAATTCCGCAAATGGTTGCACAATACGCAATTAGAGGTGGACTATACGCGCCGTTGGCGTTATCGCATCTGTCATCGTCACTTCGAACCGATTTGTATGCAATTCCGTAAGCTACCGGCAGGCACCATGCCCACGCTGAATTTGGGTCCTAAGCGTCCAGCGCATATCTATGAAAATGAGTTCGATGTGAATagtttaagcaaatataaaatgaaaacgcAGCAACAAaacataacagcaacaacaacggcaaataCTAGCAATGTATTGTGTGAGTTAAATGACACTGAAGCCGACACGAATAGTTCCTTCGCCGCGCCCACACAACATGATACTGTCGATACTGGCGCTGACTACAATGAAGATTATGCGGACTTTGTAGACAACACACCGATAGATCGCGATACATCGCGTCATTGCCGCATACCCGATTGCAATAGTCATGCTAAAGACCCCGGCGTAACACTCTTCAAATTCCCCATGTCTGAGTATCTCTTTCACAAGTGGCTCTACAATACACAACTTAAGGTGGATTATACGCGTCGCTGGCGCTATCGCATCTGTCAGCGCCACTTCGAACCGATTTGCATGCAATTCCGTAAGCTACCGCCCGGTACAATGCCCACACTGAATTTGGGTCCTTCGCGTCCGGCGCGCATCTATGAGAATAGCTTCGATATaaatcatttgaaaaaattcaaaattaaaatgcaaaataataaagcaaCAGCGAATACTGCGCAATCGGCCACAACAACCTCAGCCATAATGCAGTCAAGTCACATTGATTACGCTGACGACATAGAAACGAACAGTTCCTACGCCATGGATGCAAATGCCAGCAACCCCGCTCAGCTGCCGCTACTCTCATGTACCGTACGCAATTGCACGAGTAAATATCACGCGCTACACGAGGGCTTACATTTGCATAAAATGCCGACGAATATTATGTTGCGTGAGAAATGGATCTATAATTGCCGCTTCTCAGAGGAGACACTTGTCAGCATGAGTTCACGCATACGCATTTGTTCGCTCCACTTTACACCGAACTGCTACTATGGCGTTAAACGTCAATTAAAGTTCGGTTCAGTGCCCACTTTGCGTTTGGGTCACACAGATCCCAATATCTATTCGCATGGTTTTGGTAATGAAAGTGAGACCGCCCACTTGGCGGTTCAACAACACAACGCCACACAACATAATTGGCGCTCCAACCGCCTGCAATCGTTAACTGATGCCAATCAAGATATTTGCTGTCTCATCAATTGTCAGCATAGCAG
GCGTGAATATACACGGCATTTCGCCTTTCCCGCTGAACGTGAGCTACTCGAACAATGGCTCAATGCGCTCGGCATGGAGTTCAATAGTTCACGTCCGGATGATTATAAAATCTGTGAATGGCATTTCAAGGCGTCCGATTTCGATGGAGAAGTGCTACGCGCCGATGCGGTGCCCACACGTAATTTGAAGATAGACGATCCCAATCAATCGGATGATGAAGATGAAAATGATGACGAAGATGATGAGTTGGGTTGGAATGCAAACGAAGAGCCGCTCGATGAACGCCCTTCGACTTCTGCAGCAGCGGCGGCTGCAGCATCAACCACCACCCCCGATGGCTACAACAAATTAATCCCCGGCTCGCGACGTTGTTGCTTGGCGCATTGCCGCAAACAACTATTCCAAGACAATGTGCGCACCTTCAAATTCCCAACTATGCATGAACAATTCGAAAAGTGGGTGCATAATTTAGGCATCAAATACGATGGCGACGCACCTTGGCGCTACCAAATTTGCAGCGAACACTTTGAAAACCAATGCATCATACATTATGAGAACAAAGCCAAGTTGTTTAGATGGGCGGTGCCCACCTTGAATTTGGGCAAACATGCGCCAGCCATATTATTCACGAATGAGAATCCCAGAAAACTACAGCAAAAAGATGGAGACTATGAGCGCAATCCGAGCGGCTATGCGGAACAGTTAGACAATGATGAGACCATGGATACTACAAATGATGAGTTGAGTTCCACAGCACAGCATGTAGATGCGCCAAAACATAAGGCTTATCAGCAAGAAGATATGGATTTGCTAGCGCCAATTGAGCGGCCGCCACAAAAAGTGAGCGCCACAACGAAGCGCAGCCACTATGCGTACACCGCAAGCAATGAGGATGGTGATGATTACTATGATGGCGATGATGATGGCTACGATATGGGTGATGGtgaaaattcacttttgaaTGTCATACGCGAGGAGAAACCGAGCGCCGTGAAAGAAGGTACACCCGCTTCGTCGTTCTTCTCATTGCAACTGGTACGCGGCGGCTCAGCCAAAGTACGCGCCTGCTGTTTGCCGCATTGTGGCCGTACGCGCGAGTCCGGTGTGCGTCTGTTCCGTTTCCCCACCGAGCCGGTCTTCCTGAAACGCTGGGAATACAATTTACGCGTACTCTTCAATGAGTCGCAGCGCAATACACACCTCATATGTAGCGCGCATTTCGAGCGCGGTCAGTATAATAAACGTTTAGTCGTCGATGCCATACCCACATTGAATTTGGGACACAACAGCACCGATATCTATCGTAATGGCCAATATGAAGCAACGCGTATGCATAAGCGTCCACTGATAGCATCACCACCGCGCATACCATCGGTAAGCAGCATGCCGTTGAGCAGCCATAAACCACTGCACTGTAATGTGCCCGCTTGTGCTGACACGCAAAGTAAGCGTCGTCTATTCCCCTTCCCCGGCAATCATGTGTTCGTGAAAATTTGGTCGGAGCGCACACAGATCGCTTACGATGCGCGTCACCATGCCGAATTGCGCGTTTGTGAGCTTCATTTTGAGAGCGATTGCTTTAGCGCGCATGGTCTGAACAATAATGCTGTGCCCACGTTGTATTTGCCTGCACCAAACGCACTGCCAACGCCTCACACAACGATTGCACCAGCCGCACCAACAAAAGCTCTCACGCCCATTGGCCGTCCAGCAGCTGCGCTTACTGCAATGCCAAAGCTGAGCGCAATTGCTTGTAGCGTTACCAATTGTGGCAATAGTACCGCTACACGCACGGATTTGAAAATATTCTCGAAATTCCCAGATGATTTCGAATTGTTCACCAAATGGTGTTTCAATTTGAAAATCGATCCACGCACCTATGTGGATGGCAGCTATAATGTGTGTAGCGAACATTTCGAAACGTTCTGCATTGGTGGCCATAGTTTGCGTGTCTGGGCGGTGCCCACTCTGCGTTTGGGTCACAACAGCAAACTTATAC
ACAGTGTCGAACGTCCAGCGGAGATGGAAACGAAATGTTGCTTGCCACATTGCGGACGCAAGAAGAGCAAAGATGGCGTGGAGTTCTATAGCTTCCCTAAAGGTGATATCTACCGCCAATGGTGTCAAATACTGAAGATCGACGAAGGTCTCTACCGCAACAGTGATAAGAAGATATGTAGTGCACATTTTCGCGCCGATTGCTTTAACACTAACGGTACACTGCGACTGGGCGCACGCCCAACAATGTTACTACGCAATCGCACCGCCACAGCAGCCGCACATATGCTCAAACCGCCTGCGCCGTACCGCAGCAAGTGCATCGTGCGCATCTGCCATGAAATGCAACAGCTCTACAGCTTCCCAGCGCAACGGAATCTCTGCACGAAATGGTGTCACAATCTGAAAATCGACTACTACCCGAAACTGCACGAGAATATGAACTTCAAAATCTGTCGGCGTCACTTCGAGCCGAATTGTCTGCTGAGCGGCGGCAAACTGCATGCTGAAGCGGTGCCGACGGTGCAGTTGGGACATAACGATGTCAATATCTATCAAAATTTGGTGGGCATCAAGCAGCACGCCAGTACGCCCAGCTACGACGACAACAGCAGCTTACGCACAAGCGTCAGCACCGTACATACCTGGCTGATGGATGTCGATGCGGAAACGAATGCGGCCAACAATATGCCGACTGCGATTGGTGGGCGCGATGCGGGTGCTGGTGGTAGTGGTGGTGGCGCTGTTGGCGATGCCGAGGATGACATGGTGCGCATGGAATACGAGCCACCAGTCGACTTGGAGCCCACTGTTGTTACGGAGAATATTGCCGATGATAATCTGGACTTGACCGACAGCGCCTATATGCAAATGGAGGATGACACCTACTATGCGGACTTTGAGGAGCAGCGCTTGCTGCCGCAAAGCAGCACTTTTATAGCGGCTGAGAGCGCGGAGGTGATTGACTTGGACGCTGTCGATGCTGTGCAAGAGCAATTTCCCAATTGGTCGCAAGACGATGCTGTGCTGGTGGATGATGATGACGAGGAGGAGGATGACGATGCGCTCTTGTGGCCgttgaattaa
Protein Sequence
MNPNAGYMNFPKHLQAYPQHQHAHHQQQQLLQQQKQQQKLQHQQQLQQQQHLHQQQQQLQQQQQQLQQQREQQLQQQQQQQQQMLSHDAMAEAGLLTVKLEPQIHIKEEMPEASQNKPLNFPRRKVQTERSETLPICQRCKQVFFKKQSYSKHVSQSLCEIVEYDFKCSICPMSFTSSEELQTHEQLHRENMYFCHKYCGKYFDTIELCEVHEYMQHEYVNYICNVCSAGFASRDLLFAHMPQHRNQPRYDCPVCRLWFHTAIQLHQHRLQAPYFCGKFYRRAAGAAPSGAPYGGPMPPTHSANYNLQDCSMGIIEMPSSNSFSSYLQNRAFHHQHPAPPHPPLPPQQLHPHPHLAQQQHLQRQQLHLQQQHPQFPTQPLHPATQQQQPTPAQPDFMRRNSITGATGAATTEFQMPQIKTEIKVEPDIYGTSDYPLQTPLPPPPLPPPPLSAAQQQSALVSPSRRFGDYSNESFGAGGGAGGDLSHAFGPHNNSGGSNSNSNDFSNSNPSSLQRSMSAANSSNDIDIKHKPTPFPVGNFHFPTTPSVSRTRVVDTGEDGAVCCVPHCGVTKQSSPTLQFFTFPKDEKYLHQWLHNLKMFPEPDSTYSQYRICSLHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNREITNTFTIGERAQCSMPGCPSQRGESNCKFYNFPNDMKTRIKWCQNARLPVNSREPRHFCSRHFEERCFGKFRLKPWAVPTLHLGTPYGRIHDNPGVFYLEEKKCCLPHCKRTRSSDFNLSLYRFPRDEVLLRRWCYNLRLDPAIYRGKNHKICSAHFIKEALGLRKLSPGAVPTLNLGHNDRFNIYENELPTPPVVPPSKMFNFHNVSAPPPVYNSHRHSIASSSSHSERMYPNPVSKLKFSISNITAGDVSAMSGSATQNMLDSLEVCFVPSCKRNRNIDNVTLHTIPRRPEQMRKWCHNLKIGIETLHKGVRICSAHFEPYCIGGCMRPFAVPTLNLGHDDPNIYRNPDVIKKLNIRETCCVQDCKRNRDRDHANLHRFPSNFESLTKWCENLLKPVPDGTKLFNDAICEVHFEDRCIRNKRLEKWAVPTVKLGHSEEPKHRLPTDEEIAEQWPKPVMPNKGIEEGECCVSTCRRDPKIDDVKLYRTPEDPELLAKWAHNLQTETEDLTTLRICNLHFEEHCFSKKRRLHYWSLPTLNLANNVEQLYENPEPLVPQVIIKSETKRERRRMRDANEPLKPWTPRCCLPHCRKRRDTDHVQLFRFPILNRPMLAKWCHNLQLPLVGNGHRRLCSTHFEPRVLSKRCPIPMSVPTLDLNTPPGYKIYMNPARLKAVKLQQVCCISSCSRTRADGVQLFRFPHSRSVLRKWCHNTRQHARGEYRVCSRHFEPHAFGAKRLMPGAIPTLNLDPEVEDVYANEAQSFAETQCVVNGCVASKELDGVRLFKFPSDDVDLLWKWCNNLKMNPLDCRGVRICNRHFEAECVGPKMLYKWAVPTLALGHNDADVKLVSVPPPEARYSDVVTKCCVPTCGKSRKFDDAQMNSFPKNLRLFRCWKHNLKLDFLDFKDREKYKICSDHFEPVCLGKLRLNYGAIPTLNLGHDDTDDLYPVDPAQIHVSLFGKMRPRIASDKDAAEFEMAASVGKDDIKCTYPNCTATKLLLSEPYDMPTALPLHALWCAQMKVDAEALSETPKLCGLHFIKLYKTTLDAANALAEGDNDLNEAMNDLNATYEKCCVSPIVCSAQCCVAGCNSNQVSGNRTAQRLYLFPTAGGSGLELLEKWCHNVDVSAADLVRFTHKVCARHFEAQCIGPTQRLRSWAIPTLQLKSKPDHKIHAIPELGSAARPHEARCCVGTCARERGDFETMRLFSFPIIDEVLEKWLHNLQLTRNQCARLRICEQHFEPRCIVKTRLLRWAVPTLLLERNAEELLQNEPPMHAAVPTAPIVDEVDTIDAASNNDGEDIRDVNDDEDDNDDELAAVPNVKMKKSLGIVKCCVPSCKRTRLQHGARLFQF
PTSHQILRKWCHNLKISIEKATNPALRICSLHFHKRCIDGKELRPWAKPTNALGHSGPIYENPKNLPGVFLPKCCLAHCRQRRTLENDLRTFGFPKDETLMRKWCANLRMEPRANHARICMSHFPPEAMGNKKLRANAVPTLNLGHNEPLEYDNALLIATAMKARNPLEEKKTLISEASDPELGNNGEYDDIMDDEDNDDDDDYRIDEEDDENDFYASAGMGNSDEEEEETVEDFTPSISAATGDDDELFSNAEARDADNFIADSADEEDEEGAEVDDENEEDVYGNDDDDDEDDSDESDESVAENVYNNAVEDDESDSESELNAEDNNADSKDAEVDDDDVEMLIDADDDDDDKSSYNACDPLDFVECVNQSDDTQQHEVAITTKSAKMRHANKNLRKRPISTTLKNYEDDSNTNYSNADEMTRFTATTSNSASATTSSASPSAAAATTARGAHHCGAEKISRSVFRLCCLRHRRKKKAPPDPPPDMRDKAPTAYRTLMNSMARVSTQRRQQRAHCAVRRCGRAAGAAVTLYRFPLVGSRYCRRWCAQLKVKLSHTSRLCICQRHFAYSLVDRSRRRLRFGAIPTRNLHNTPAQFKRNPLYGLNKNTTQLTATAHHDATAHAIQTTAAQLNVYTRCCVPHCGKSHQVDGVTLFRFPKLRSLYLQWATNLRLMPTTRLVQVYKVCSDHFERDCLSYQRNDRAKLKYGSVPTLKLGHNDTLKIYQHNTLSPQKRRGVGRRKWRPQKGVNECAVHDCKVAQFLQMQLFALPAALKLQERWCNYFKLNFSGASKEATEFFENVRLCALHYMEGYQMATYSDGARKGTSAALDELEANYARITSSTRIQMLKCCVPNCSTKFTDNVRLAAFPSAEELRAKWQHNTQVSFSPSHRYLYKVCALHFEERCFAKKRLFMWAIPTLHLPKPQNQDPAHKLFENPNVEVASTAQCCIEGCATNAVKQESAVNVDEKVSKAAVRFWHFPQDDALRDKWCHNLGLGAQTNEISHTSRRWRICSRHFEPFCIGKTLRSWAVPTLELPKPVKHAKSGKRSTYIYQNPDSAAVYYRCCIKTCHQLRDLDAGIRLYAFPKKDTMLQKWAHNIRMPAVKCRYARICTLHFEAQCLRPQMQSWALPTIDLGHDEADIFRVPKVKLMVTSERCCLPHCSKRRSRDNVHLFGFPRDKRVLDKWWRNLAIGVQDVKRRLICEAHFEPRCISKRRLKRWAIPTLNLGHTNETLKNPTPAEVLANEGNTTMRRSQTPSKMTRAKSAQPTTPSSLQKCSISACARGADNNALYRFPKPAWLRKKWCDNTRLNEEAAKLGKICARHFETHVMGNRKPRPWALPTLELGADENGAALPAVHANPKQLSRFHPEEHEWGELRYVRANHCSIISCMKSKKDGVTLFNYPTKRHMLQKWAENCRHYPYQAKRYRFQLCGAHFTADCFKREGTRLRKGAVPTLNLGHDDAQIHQSEFESVTGIKIETKIMKSCSVPQCGRTNLHDGVRLYKFPYERAETLEKWCHNLRMDASDCRNALICNMHFEPRCIGGGQRGLLWRAIPTLLLGHNDADILHNPETFERPEKVISCCVPDCINTKQTVGITLSAFPKLRSHFEKWAHNLQLPVNTVVWHTYKVCSAHFERYCYEHGRIKVGAMPTLNLGHNNAIDLYTVSEESMSNAFKRKRIAPKNEAPKMPNEVCCYPECREMELRSTNQVFEFPKMGAIRRAWYESIGLSSEEAQVKEECEAIAVDAQNTTCVTEMPQEDATVADAIKVTKKSPKLCPMHFKLLYIEHAALLDTLKSDNAPDTQHMLQQLEETYARVCDMSCVRRISCAVPDCNSNYLTTKALKFFKFPDNAEMRAKWCHNTQVTIDADRLYCYKICELHFEAICASQLAKKIQRLKFWALPTLQLPPRAEGEPEIYALPAPDALAETKRASLLLHTPLNKCCIASCVYAKALAERAVSADSEIQFFNFPNDAELLYKWVYNTQISMVAATNARICSLHFEKH
CINKRLRMYAVPTLLLGHEKTDIYKNPSDKRVAAEALESKPQKLPKYYEDNIADDVVDAENDEYEDELLAEDASEKLKFKKSGITKYETEAFDEGEPGAACKSASSDEQQLDSSLLEPMLMVKTEFKERKETPPPPTTSKATTQQKPYHQPFIINIKQEKDVEENYTAEGIENQMKQGLLDMFSSFGEIGGEQEPDDINELDDESTQMERDTARHCRIHGCNSYARNAGVTLFKFPFPLDQFRKWLHNTQLEVDYTRRWRYRICHRHFEPICMQFRKLPAGTMPTLNLGPKRPAHIYENEFDVNSLSKYKMKTQQQNITATTTANTSNVLCELNDTEADTNSSFAAPTQHDTVDTGADYNEDYADFVDNTPIDRDTSRHCRIPDCNSHAKDPGVTLFKFPMSEYLFHKWLYNTQLKVDYTRRWRYRICQRHFEPICMQFRKLPPGTMPTLNLGPSRPARIYENSFDINHLKKFKIKMQNNKATANTAQSATTTSAIMQSSHIDYADDIETNSSYAMDANASNPAQLPLLSCTVRNCTSKYHALHEGLHLHKMPTNIMLREKWIYNCRFSEETLVSMSSRIRICSLHFTPNCYYGVKRQLKFGSVPTLRLGHTDPNIYSHGFGNESETAHLAVQQHNATQHNWRSNRLQSLTDANQDICCLINCQHSRREYTRHFAFPAERELLEQWLNALGMEFNSSRPDDYKICEWHFKASDFDGEVLRADAVPTRNLKIDDPNQSDDEDENDDEDDELGWNANEEPLDERPSTSAAAAAAASTTTPDGYNKLIPGSRRCCLAHCRKQLFQDNVRTFKFPTMHEQFEKWVHNLGIKYDGDAPWRYQICSEHFENQCIIHYENKAKLFRWAVPTLNLGKHAPAILFTNENPRKLQQKDGDYERNPSGYAEQLDNDETMDTTNDELSSTAQHVDAPKHKAYQQEDMDLLAPIERPPQKVSATTKRSHYAYTASNEDGDDYYDGDDDGYDMGDGENSLLNVIREEKPSAVKEGTPASSFFSLQLVRGGSAKVRACCLPHCGRTRESGVRLFRFPTEPVFLKRWEYNLRVLFNESQRNTHLICSAHFERGQYNKRLVVDAIPTLNLGHNSTDIYRNGQYEATRMHKRPLIASPPRIPSVSSMPLSSHKPLHCNVPACADTQSKRRLFPFPGNHVFVKIWSERTQIAYDARHHAELRVCELHFESDCFSAHGLNNNAVPTLYLPAPNALPTPHTTIAPAAPTKALTPIGRPAAALTAMPKLSAIACSVTNCGNSTATRTDLKIFSKFPDDFELFTKWCFNLKIDPRTYVDGSYNVCSEHFETFCIGGHSLRVWAVPTLRLGHNSKLIHSVERPAEMETKCCLPHCGRKKSKDGVEFYSFPKGDIYRQWCQILKIDEGLYRNSDKKICSAHFRADCFNTNGTLRLGARPTMLLRNRTATAAAHMLKPPAPYRSKCIVRICHEMQQLYSFPAQRNLCTKWCHNLKIDYYPKLHENMNFKICRRHFEPNCLLSGGKLHAEAVPTVQLGHNDVNIYQNLVGIKQHASTPSYDDNSSLRTSVSTVHTWLMDVDAETNAANNMPTAIGGRDAGAGGSGGGAVGDAEDDMVRMEYEPPVDLEPTVVTENIADDNLDLTDSAYMQMEDDTYYADFEEQRLLPQSSTFIAAESAEVIDLDAVDAVQEQFPNWSQDDAVLVDDDDEEEDDDALLWPLN
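
The Protein Sequence is the conceptual translation of the Coding Sequence. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses a hypothetical `translate` helper; lowercase bases are upper-cased before lookup. It checks the first twelve codons of the coding sequence against the start of the protein.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code; amino acids ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # "X" marks codons that contain ambiguous bases such as N.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases of the Coding Sequence field.
print(translate("ATGAATCCTAACGCTGGTTATATGAATTTTCCAAAG"))  # MNPNAGYMNFPK
```

To translate the full entry, the line-wrapped sequence chunks need to be joined into one string first.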

Similar Transcription Factors

Clusters based on sequence similarity, computed with MMseqs2

100% Identity
iTF_00192124;
90% Identity
iTF_01562932;
80% Identity
-