Basic Information

Gene Symbol
-
Assembly
GCA_963662105.1
Location
OY759210.1:93846494-93866193[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 44 1.3e-14 1.1e-11 46.1 2.8 1 86 586 658 586 659 0.83
2 44 5.2e-13 4.4e-10 41.0 2.5 1 87 681 751 681 751 0.73
3 44 1.1e-13 8.9e-11 43.2 0.1 1 86 772 843 772 844 0.83
4 44 2.7e-11 2.2e-08 35.5 3.9 1 87 887 956 887 956 0.82
5 44 8e-15 6.7e-12 46.8 2.1 1 86 980 1052 980 1053 0.83
6 44 2.6e-14 2.2e-11 45.2 1.1 1 87 1090 1161 1090 1161 0.85
7 44 7.1e-11 6e-08 34.1 5.0 1 87 1190 1262 1190 1262 0.78
8 44 8.8e-13 7.4e-10 40.3 1.2 1 86 1289 1358 1289 1359 0.82
9 44 4.7e-12 4e-09 37.9 0.4 1 86 1382 1451 1382 1452 0.79
10 44 1.1e-13 9.4e-11 43.1 2.0 1 87 1481 1552 1481 1552 0.82
11 44 4.3e-07 0.00036 22.0 0.0 1 58 1603 1653 1603 1674 0.83
12 44 7.7e-15 6.5e-12 46.9 2.1 1 87 1699 1770 1699 1770 0.81
13 44 1.7e-14 1.4e-11 45.8 3.5 1 86 1800 1871 1800 1872 0.82
14 44 1.3e-10 1.1e-07 33.3 2.0 1 87 1936 2006 1936 2006 0.80
15 44 1.2e-15 9.7e-13 49.5 0.3 1 87 2046 2116 2046 2116 0.83
16 44 1.6e-14 1.3e-11 45.8 0.4 1 87 2178 2257 2178 2257 0.80
17 44 1.5e-13 1.3e-10 42.7 4.5 1 86 2290 2359 2290 2360 0.74
18 44 6.5e-13 5.5e-10 40.7 1.3 1 86 2378 2446 2378 2447 0.82
19 44 1.1e-13 8.9e-11 43.2 0.2 1 86 2475 2543 2475 2544 0.79
20 44 1.2e-14 9.7e-12 46.3 2.0 1 87 2569 2637 2569 2637 0.78
21 44 1.3e-05 0.011 17.3 0.1 5 62 2670 2725 2662 2743 0.70
22 44 5.2e-13 4.3e-10 41.0 4.3 1 86 2772 2842 2772 2843 0.81
23 44 1.9e-12 1.6e-09 39.2 0.1 17 86 2888 2952 2864 2953 0.68
24 44 4.7e-16 3.9e-13 50.8 3.4 1 86 2978 3048 2978 3049 0.83
25 44 2.3e-13 1.9e-10 42.1 5.1 1 86 3073 3143 3073 3144 0.80
26 44 2.5e-13 2.1e-10 42.0 0.3 1 87 3172 3248 3172 3248 0.82
27 44 2.4e-10 2e-07 32.4 0.1 1 86 3269 3340 3269 3341 0.81
28 44 5.2e-13 4.4e-10 41.0 1.3 1 87 3365 3436 3365 3436 0.80
29 44 2.1e-11 1.7e-08 35.9 4.0 1 86 3462 3527 3462 3528 0.81
30 44 3e-13 2.6e-10 41.7 0.2 1 86 3562 3634 3562 3635 0.77
31 44 5.6e-14 4.7e-11 44.1 0.6 1 86 3664 3737 3664 3738 0.82
32 44 1.1e-15 8.9e-13 49.6 0.6 1 86 3763 3838 3763 3839 0.83
33 44 1.2e-05 0.0099 17.4 0.1 1 61 3882 3935 3882 3949 0.77
34 44 8.6e-14 7.2e-11 43.5 0.7 1 87 3977 4052 3977 4052 0.80
35 44 4.8e-12 4e-09 37.9 0.5 1 87 4085 4171 4085 4171 0.85
36 44 3e-13 2.5e-10 41.8 0.1 1 87 4428 4501 4428 4501 0.78
37 44 1.2e-15 1e-12 49.5 3.0 1 87 4631 4704 4631 4704 0.81
38 44 2.6e-14 2.2e-11 45.2 0.3 1 86 4724 4795 4724 4796 0.80
39 44 9.1e-13 7.6e-10 40.2 0.0 1 87 4887 4962 4887 4962 0.80
40 44 3.7e-13 3.1e-10 41.5 0.3 1 87 5053 5123 5053 5123 0.84
41 44 9.7e-13 8.1e-10 40.1 0.7 1 87 5175 5246 5175 5246 0.83
42 44 5.6e-12 4.7e-09 37.7 0.7 1 87 5276 5348 5276 5348 0.80
43 44 6.2e-11 5.2e-08 34.3 0.2 1 86 5369 5438 5369 5439 0.78
44 44 1.1e-11 9.7e-09 36.7 3.0 1 86 5477 5551 5477 5552 0.80

Sequence Information

Coding Sequence
ATGtcacaaaaccaaaataaacaaagttacGCACATGGCTTTAGTGGGAGTGGGGTTGTTGCTGGTAGTGTTGGTGGCATGAGAATGGGTGGTACCGCAGCAGCATCTGATTCTGGCAGAATGCACGAACAGAGGAATGAATATAATTTCGGAGGAGGTGGTGGCGGTGAATACGCCGGCGCCTAttggcagcaacagcaataccAACAGCAGCAAATGCATCAACAAGTATTTcaacaacgacaacagcaacaactacaacagcgACAACAACCTCAACAATTACAACAGCGACAAACACCTCAACAATTACAGCTGCGACAACAGCAACAGTTACAACAGCGACAACAATTACTACAGCGGCAACAACCGCAGCAATTACAACAGCggcaacaaccacaacaaccacaacaattacaacagcgacaacaattacaacagcgagaacaacaacagcgacaacaacagcaacaattacaacagCGCGAACAACAGCAACTACAGCAGCgccaacaacatcatcaattaCAACAGCGACAACAGCAATTAcagcagcgacaacaacaaatgcaattaCAACACCGACAACAATTGCAATTACAAcagcgtcaacaacaacaacaacaacagcgtcaACTTGATGCTATGACTATGAAATCAGATGCATTAGAATCAAATGTTCAAGTACCAACTATTGAAATGGACGAGTTGATCATAAAAACTGAACCACCTGACGATACGGGTTTTGTTAATATAATGGAAAACCAAGACAGCAAAAGTGTAACCAATGCAATAATTCATCGCTCAACTTATTTGAACAATACTTCATTTCCTCATATAAAGGAAGAACCCTCCACATCACAACAAAAGacTCTAAATTTTCCACGTCGTAAACTACAAACAGAACGCGCTGAAACATTACCAATTTGCCAGCGATGTAaacaagtttttcttaaaaaatctaCATATATGAAACATGTGCGTCATAGTTCCTGTTCTATATTGGAATATAATTTCAAATGCCTTATATGTCCAATGTCATTTATGTCTCAAGAAGAATTGCAATCACATGAAAGTGTTCATCGTGCAGATcagtttttttgtcaaaaatattgtggaaaGTATTTTGATACTTTAGATTTATGTGAAGCGCATGAGTTTATGTTTCACGATTTCTCCATATTTACTTGTAATGTTTGTAACGCAAACTTCAAAACACGTGATGTTCTTTTTGCGCACAAAAGTCAACATCGCCATTTAAATCGATTTGATTGTCCCATATGCCGCCAGTGGTTTTCAACTCCGAGTGAGTTGCGGAGGCATCGCTTAGAAGCGCCTTTCTACtgtggaaaattttatgctgaaaaagaagaaactcCCCCCAGCTCTAACATTTCATCTAATCGCACtcaaactaataaaaacaaCGATATCATTAGCAATTTTCAACCAGATATAACGATTAAATCTGAACCAGATTTTGATAATGATTATTATACACCACCTCAATCTTCTGCGTCACCTATTGCTCCTCATCCTGCTCGTACTAATCCGCGGACTGATGCATTTTACCATAACGCTCCAACGTCCTATCCAAATACCGATTTCAATAATTATACAAACATTGATGATCACGCATACAATGACTTCAAACCTCATTATCAACATTTGGAGCGCCAACATGACAATAATTTCTCTATTTTTTCGCCGCCATTTATACCAGCTACACCAGCAGTAAGTGAAAATGATGCAATTTGTTGTGTACCACATTGTGGTAAACGCAAAAGCTCAAATCCTACATTACAATTCTTCAAGTACCCTAAAGATGAAAAATATCTCAAGCAATGGCTTCAtaatatgaaattaatatataatgCTGAAGAATCATATGAAGACTATCGTATTTGTAATTTACATTTTCCGAAACGTGTTATTAACTTACATTCACTTTGCTATTGGGCTGTACCCACCTATCATCTGGGACATGATGATTtcgcaaatatttttcataatcgTGAGCAAAGTTTATCTTCCGTGCAATGTAGCATACGTGGATGTGAAAGTGTACGTGGAAaaacaaatgtgaaattttttaattttccatctACTGAAACCCAAGCATACATGAAATGGTGCCAAAATTCGCGTCTATCACTTACTTCAACGGAGATACGTCAAATTTGCAGTCGTCACTTTGAAGAACAATGTTTTGGAAAATTACGCCTGAAAAATTGGGCATTACCTACATTGCATTTAGGTGGATTTCCTGTAATTCATGATAACCCAAGGATACTATTCACAGAAGATAAAAAATGCTGCTTACAGCATTGTCGTCGTAAACGTTCAGAAGATATTACACTTTCGTTTTATGGTTTTCCGCGTGATGAGCAATTATTATTGCGTTGGTGTTATAATTTTAGACTTGATCCCAACGAGTTTcgtggaaaattttataagatatGTAGCGCACATTTTGTTAAGGATGTAATCGGAACTGTTAAACTTTTACCAGGTGCCCTTCCCACGCTTGAATTGGGACATAATGATCCAAATATATATACCAACGAGCTCTATCCGAACAGCTTAAACACACATAATACCACTTCCACATCATCCAATTCCAGTGCCGTTGTTGGCATGCAAGAAACACCAGACACATGCAGTCTTCCCACGTGTAAGCGAAATCGTTTTTCCGACAATGTTTCAATGCACACCATGCCAGGACGTTCTGAGCAGTTTAAAAAGTGGTgtcacaatttaaaaataaatccagCAATGCTTAACAAACACTTTCGTGTTTGCAGTATACACTTTGAACCCTATTGCATTGGTGGCTGCATGCGTCCATTTGCAGTGCCCACACTATTTTTGGGTCATGATGACAAGAACATTTATCATAAcccaaaagttattaaaaaattaagtttacgCGAAACATGCTGTGTGGCCGTTTGTCGACGCAATCGTGAACGAGATAGAGCTAATCTGCATAGTTTCCCACTGCATAAACcggaacttttacaaaaatggTTGCATAATTTACAACGTCCAATGCCAGATGGAACACGCCTCTTCAATGATGCCATTTGTGAACAACATTTTGAGGACCGGTGTATACAAAATAGGCGGTTAGCGAAATGGGCATTACCAACACTACTTCTAGGGCACGATGACGATATTCTGCAACTACCAAGCGAAGCAGAAGTGGAAGAAATAATGGCCAAGACTGCGATAAATGAGGTAAATAATGGTGATGCTGGTGGAGAATGTTGTGTAGAGACATGTAAACGTAATACCAATAGCGAAGAAAATGTTAAACTATATTTTCTACCGGACGATGAAGAAGTTTTAGAAAAGTGGGCACATAATCTACAACAGCCGGACTTAAAAGACTATGCAACAACTTTACGCATTTGTAATCTACATTTTGAATCGTATTGCATTGGTAAGAAATTACACTCATGGGCCATTCCAACACTTAATTTGTCACATCAGCTAGAGCATATTCATGAGAATCCTAGTGATATGAAGGTGCTAGGCGGACTCAAAGCACCAAATGCACCGCGTTGTTGTCTGCCAAGCTGTCGCAAAATGCGCGTCGATGGTGAAGTGCAGCTATTTCGTTTTCCCACCTACagcaaaagtatttttaaaaaatggtgccaCAATTTACAAATATCAATGGCATCAGCAAGCAGCTCACATAAGCGCTTGTGTTCGACACATTTTGAGCCATGTGTGCTGACAAAACGTTGCCCAATGCCGCAGGCTTTGCCAACTTTAAATCTAAACTGTccagaaaattataaaatatatcaaaatccaGAGCGTCTTAAACTTGCAAAATTGCGGAATGAACGTATCTGCTGTCTGGCGAGCTGTCGTAAGGGCAAAAAAGACGGTATACAGCTGTTTCTGTATCCACATAACCGTAGTCTACTTCGCAAGTGGGCGCACAATACGAAACAAAAAGCAACAGACTGCACTCGTGCGCAACTGCGACTTTGTAGTGATCATTTTGAGGACAACTGTTTTGGACCTTCACGTTTGATTCCAGGAGCAATACCAACCCTTAAGTTGGGACACGATGATGACGATATATTTCCCAATACTCTACCTGAGATAAGTGAGGACGAACAACAATGCGCCATTAGTGGTTGTGGACGTAAGCGCGCCGTTGACGGTGTTAGACTTTTCAGATTTCCTGCTGACGATGAAGACTTGCTATGGAAATGGTGTATCAATTTGAAGTTAAATCCAATTGATTGTCCACAGCAGCGTATTTGCAATATGCATTTTGAGCATGAGTGTATTGGTTCGAAAGGGCTGCACAAATGGGCTATACCAACAATGCTGTTAGGGCACAACGATACCGAATTGCAATTGCATACCAATCCAAAACCTGAAGAAAGAGTTTGCCCAGAGCCTGTGTTGCTAAAATGTTGCGTTGCAACGTGTGGAAAGACTCGCAAGCACGACGATATAATGTTGAACAGCTTTCCTAAGACGCCAGACCTTTTTAATCGCTGGGCACACAATTTGCAAATGGATATACATTTTAGTGAAcgtgaaaaacataaaatttgtaaTGATCATTTCGAATCGTTCTGCGTTGGTCAAGCTAGATTACGTTCTGGTGCTATACCAACACTTAATTTAGGCCATGATAACACAGTCGATATCCATCCCATTGAACGTGCTGACTTGCAAGGTTTTTGCAAACCAATACGACATTCTACCGAGAGGTCAGtgataaaaaaattccatatagCAGAAGAAATAGAAGCTGAAGTAGAATTAGCTAAATGTTGCTATCCCGATTGCACAGCGTCAAAATCTTTGCAGCGTGATTTGCTCTTCGACTTCCCAGAAGCTGAATTATTGGCAAACGAATGGTTCAAAGCTATTGATTTGAAACAATCTGAAATTTCAAACCCAAAATTGTGTGGCATACATTTTATAGTCGTATATGAAGAAACTGCTGAAGCAAGAGCAATCTTTGCAGAAACCAATCCCGAACTTAATGACGATTTGCAAATGCTAGCAAACTGCTATTATCGCTGCTGTTCTTCCTCGCTTATTCGTAGCATGCAATGTTGTATACCAGAATGTAAGACAAATTCACTTAGCGGCACAAAACTCTATCAATTTCCATATCAACGTGAGCTCAAGGAAAAATGGGCACACAACACTGGAGTTGAATTAGATGAGAATCAGCGTCATCTATACAAAGTTTGTGCACTACATTTTGAAACACATTGTATGACAGAATCGCAACGTTTGCGTCCGTGGGCAATACCAACGCTGCATTTACAGCACGACGTCGCAATTGAACTTCATCAAAATCCTGACCTTTTGTCTTTAGAGCGACGTCTATTAGGACCACAAACTAATAAGTGCTGCGTGCCTAGTTGCGAGAGGAACCAAACTGTAGATGACAATGCCAGGCTATTTAATTTTCCTATTGAGGAATCAATGATCGATAAATGGCTGCATAATTTACGGGTGGTAGAACGTGAAAAATGTCCACTCTATCGCATTTGTAGTTGGCATTTTGAAAAGCGTTGCATAAGTAAATCAGGACGCTTACGCTCGTGGGCTATACCTACATTGAAACTAAGAAATAACAACAGCGATAGCGATGACAATGGTGATGAAGATGATGACATCATTCCAAACCCAGAAGGTCCTAGTACTTCTGCAGTAGCTTCCACAGAACCTCACGAAACTGACGAATCAGTAGCCAACAGTAGCCCAGATGTGGTCAAAGGctgtaaaatgaaaaagaatctTGACCTCATAAAATGCTGTGTGCCCACATGTCGTAAAAGTCGTTTACAGCATGGTGTGCGTTTATTCACCTCTCCAAGCGAtgtaaaaatgttaagaaaatggGCACATATTTTGCTTTTACCCGTTACGGCGCTAAAAAAACAAGTACATATCTGTAATCAACATTTTGCAAAGCGTTGCATTGATGGCAAACAATTGCATGCATGGGCAATACCTACAATGAATTTGGGCAACAGTAGCAGCGACACAAAAGCAATACCAACcccaaaacttgaaaaaatcgTCACACAATTGGAGATTAAAGAACAACCTACACCACAATTTAATCAGTACTTGCCTATATGTTGCGTACCTAATTGCGGTAGACGACGCACAGTCGAGAATGCTATGCGCACTTTTGGGTTCCCGCTAAATCCGAAATTGTTGCGACGTTGGTGCACGAACTTAAAGATGGATACATCAACGCTCAAAGGACAGCGTGTTTGTATATATCACTTTGAAGAAGAATTACGAGGCAAGACCAAACTTAAGAGCGGTGCAGTGCCGACGCTAAATTTAGGACATGATGAAGGAGCAACATATGATAACAAAGCGCTATTGTTAGcgttaaaagaaaaagataatTCAACGATAGCAGAATCAAGCGTAGAAAATACGAATACTAATATGGAATCCAACATCGAAATGAATTTAGAAGAGAGCACCGAAAATATTACTGAAATAAGCAATGAGAACTGCTGTGCCGTGACTGGttgcacaacaacaaaatctgCCGATGATGTTgcagatgatgatgacgatgataaTATACAACTTTTTAACTTTCCAACAGACAGCAACTTGCTGCAAAAATGGTGTGAAAATTTACAATTGAAGCCCGCTGCTTGTGACGCTTTAAAAATATGCCAGCGTCATTTCGAGTTGCAATGCTTTGGTAAACGCACTTCATTATATATGTGGGCTATACCAACACTACATTTAGGGCTCGACGAACTCACGCCATTACCACATGACAattctaaaaatatcaaaattgtaCAAACTGATAACAACTTATTGGATAACTTTAAGCGTTGTTGCATTGCCACATGTCGTAAAACGAGGCGGAGTCATGGTGTTACACTCTTCTCCTTTCCGAAACCCAGTCAATTACTGCTTCGAAAATGGTGCCACAACACGCGGCAAGCGTTTAAACGTCGCCAAAAACGACGCATTTGCTCAGATCATTTTGAAAAGTATGCAATGTCCAATGGTAAACTAAGACCTTGGGCACTACCAACCCTACAGCTGGGACACACAGAACCCATTTATGAAAATCCAACAAATATGTTTGACAAATGCTGTGTAACCACATGTGAATCTAACGAAAATACTCAACAGCTATTTAGTTTTCCGAAACGCAGTTCACAGTTACGCAAGTGGGCGCACAACACCAAACAGGATGCCTATTTGGCGATACGTCGAAACTATAAAGTTTGCATTTtgcattttgagaaaaattgcaTTGGCGCCAATGGCTTGAATTTCGGTGCTATACCAACTTTGCAACTGGGTCAGCACGATGTTGATGCTGACTTACAAGACAACTTTGCCAACGAAGTGGTTACTACTGGGGATGTGGATGTTAGCTGCAGCATTTCCAATTGTGGACGTAAATTAGCCAAGGATGGCATTATACTCTTTCCCTTTCCAAAGCAACAAGAACTGCTTGAGAAATGGTGTCACAATCTCAAATTAGAAACTGAGAAATGTGAAGACCTGTACATTTGTAATATGCATTTTGAATCCGGACAAGTTCGCAAGTATAAGCTTTTGGAAGAGGCCGTACCTACGCTACTGCTTGGACACAATGACAAATCACTTGAATTGATAAAAACACCAGAATCATCAAATACAGCTCCTGTTATTCACTGCTGCGTGCAAAATTGTAATAAATCCAAACCTAATGACGATGTACAAATGATGGGATTTCCTAAGAATCGCACACTCTATCTTAAATGGTTGCATAATCTGAAAGTTAAACCGAATCAGTCAGTAAGCAGGGTTTGTAGTGATCATTTCGAAGACATCTGTATGGCAGCATCGCGTATAAAGCATGGCTCAGTACCCACGCTAAATTTGGGCCACGATGATCCCAATATTTATCACAATAATAAAAAGGTTTTGTTGACCGCTGAAGAAAAACGCAAGTTAGAGAAATGTGTTGCATCTGATGATTGTCACGTTTCCAAAATgctacagctgaaactgtttgAAATGCCGAAACTAGAAAGTATTTTACAACGCTGGTTGACACATTTTGGTGTTGAGTTTAACGATAATATTGATATGGAACCGAAAGTATGTGCAATTCATTTTATGGATGCGTATGAAAAACTTTGTCAATCAAAAGATATTGATACAGTTCAAGAGACTGATCCACTACAGCTAGATTCCCAAGGCGAGGACAAGCAGGAGCTTGAAAACTTAGCTGTTACATATAAACATATTGCTAACTTACCACGCTTGCGTATACTTTTGTGCAGCGTTCCTGGTTGCCATACAAAATTTACCGACAGTACTATCCGATTAATAAGATTTCCACTAAATCCTGAAGTATGCGCTAAGTGGTGTCACAATACAAAGGCGAATTTTGACGAAGAACGTcgatatttgtttaaaatttgctCGCTGCACTTCGAAGAATATTGCTATGTTCATACTCGCTTACATTCCTGGGCCTTACCAACACTTAGACTGCCACAAACTGATGATATCTATCAAAATCCTGAGCCAGAAGAAGATCAGCCCGAAGTTTGTTGTCTAGAAAGCTGTAGACCAACGAGTTCCATCAAGTTGAACGACATCGTCGATGTAGATGATAGTACAGAAGTGCCGGAGGAAAGCGCTGAAGTAGGCAAGAAGCTTTTCTCTTTTCCCGCTGACGAAGAGCTTTTACAGAAATGGTGCCACAATTTAGATATGCCGCCTAATGCAGAAACTAGAGGTCTTAAAATATGCAGTGTACACTTTGAAAACTATTCCGTTGGTAAAGTTTTGCGGCCATGGGCAATACCAACACTCAAACTGAATCGTGACCCCACAACACTTTATACCAATCCTGAAATTAAAGACTTGCCCTTTGGGCGTACATCAACCTGTTGCGTGCGAGACTGTCGTCAGCAACGTGATGACGAAGCTGGCATAAAACTATTTGCATTCCCCCTGAAGCAGGAGTTGCGTCAAAAGTGGTTACATAATTTAGAACTAAAAGATCATGCAGATGATTGCAGAAGCTATCGTGTTTGTAGTTTACATTTTGAACGGCACTGTGTAGGCAAACGACTGCATAATTGGGCACTACCTACACTGCGCCTGTGCCAAGGCAACGAAAcagttttgtataaaaatccCAGAATGAGCAAGAAAAAACTAAATGCAAAATGTTGTCTAAAACATTGCCGTAAGAGTCGTCGTCAGAATCAAGTTCGTTTATTTAGTTTTCCAGAAACAGAATCACAAGTACTAAAAAAATGGATGCACAATTTAAAACTTGATGATGCTGCGCTTAAACGCAAGCTATGTAGCAGTCACTTTGAAAAGTTTTGCTTTTATAAAGGCATGAAAAGAGTTCACCCCTGGGCGATACCAACTCTGGACCTGGGTCATAAAGATGTCATCTTTGAAAACCCtaaaggaaaaaaattgaagcTGCAACAATTGCCTACcgcaaaagaaaaatgtgcGTTAACCTTTTGTGATAGTCAACAAAACTTATTAGCTGCAGGTGAGACATCCATCAGATTCTTTGGCTTTCCTTGCGTGATAGACTTACTGCAACAATGGTGCGATAATTTAAAACTCGATATGACGACTTATCGCTGTACAAAACAAAGAATTTGCGAACACCATTTTGAGCCGCAGGTGTTAAATGATAAAAAGCTGCTTCCTGGTGCAGTACCAACGCTCAACGTAGGCCATGAAGAAGTAATTAAACATCAAAACCCAGCGGATATGTCCAAATACCGCTGCTGTAGTCTGGCATATTGTAAAGCTGAAATGTCCGATTTAATTAAGCTTTTTGATTTGCCTCTAAACAATATGGCATTACTACGACAATGGTGTGAAAATTTGCAACTCAATGAGGAAGCATGCAAAAGTGGGCAGTATATTAAAGTATGCAGTGCACATTTTGTGCCCGCAGTTATATCGGGTAATACACTTAGGCCGCATGCCGTTCCAACGCTTAAACTTGGCCACGACAATGCAATAACACATGAAAATTCCTCACAGACATTACACTCTGAAGTGATTAAACAATGTGTTGTAACAGACTGTGCCGCTCATCTTGAGCCACACgcaaatttgtttgattttccaaaaaaacccAAACGTCTACAAAAATGGTGTGAAAATCTCAAACTTAGTGTTGCAGATGGCACTGAGTACCACATATGTAGAGAGCATTTTGACTCATCTTGCTTTGATCCAAGCGATGATCAAAGTTTATCAGACTGGGCATTGCCTACGCTTAACTTAGGACATACTGAGCAAATACCATATGAACGTAGAAAATCAAGCAGCAAATTGTCATTACAGAAGCAGTCACAATGTTGTGTTAATACTTGTCAGTCTACTACCGATGTGCAGCTATACAAGTTTCCAACTATAAATTggcttattaaaaaatggtgcTATAATTTGCGTCGCTCGGAGGAGGGTGCggaacatttgaaaatttgcgAGAAACACTTCGAAAAGCATTGCTTGGGCATATCTAAACCAAGGCCCTGGGCCATTCCTACCATTGAGCTGGGACATACGGATGAAATACATCAAAGCATGAAAATACACAAGTATCATCCGGAAGAGAAAGCATGTGGAGAAATGAAATTCGTACGTAGTAACTATTGCGCAATTGTTTCGTGTTTGAAATCAAAACAAGACGGTGTACAGCTCTATAGATATCCCAGCAAGGGTACAATGCTACATAAGTGGGCACATAATTGCCGCCATCGTGCTTATCAAGCAGCTCGCTATATTTTCCGTATTTGTAGCGAACATTTTGAGGAGCCATGTTTAAGTGGCATAGCAGAAAATCGTTTGCGCCCAGGTGCTATACCAACCCTCAAACTCGGACATGATGACATTGATATACATCCACATGAGGTTTTAGATTGTCCTACGCCGTCAAACTTAACTGAATTGAATGTGGCATGCGATGTGCCAAACTGTGGACGAACCAAATTAGTGGATAATGTAAGGCTTTTCAAATTTCCCAAGGATATAGAAATGCTTACAAAATGGTGCTACAATCTCCATTTGGAAATGATGAACGAGAAAGATGCTAAAAATAAGCGTATTTGCAATTTACATTTTGAGGAGCGCTGCTTTAGTTTGAGAAAAATTCGCCTTTTGTTACGTGCAGTTCCTACACTTTTATTGGGGCACACTGATATTGATGACATTTATGAAAATCCTGAATCATTTGAACGTCCTGAAAAAGTCTTACGTTGCTGTGTGCCCGAGTGTGGAAGAGCAAAATTCGAGGACAATGTGACACTGTTTTGTTTCCCAAAAAGTCGTAAGCTCTTCGGTCAATGGGCGGAAAACATCAGATTGGAATTAAAACCTACAACAGAAACTTATATGTATACGCGTGTTTGCAGCATGCATTTTGAGAACTACTGCAGTTTTGGACAAAGCGTACGTAGGCTGTTGTTTGGAGCCATACCAACATTAAAACTAGGACATAATGATGAGAATATTCACCATGTCAAACGCGAGATGCTATCAAGTACATTAGGTGAATTACGCAAACGTAAAAGGGAAGAAGATTCAACAAGACCTATGGTAAAGCGCGATGCTCAATGTTGTGTACCAAAATGCAATGCAAAAGACCATCACATCTATGCGTTGCCACAACATGCAACTCTACAGCACATGTGGTATTCGGCCTTAGATCTGCAAGGAGCTGAGGACATAATTTACGATGACGAAAAACAGATTTGTGCTATACATTTCAAAATAGCATATATTAAAACTCATAAACAAATGTTAGCGGTTACAGTGGATGATGCATTTGTTGCTGTTTTAAAAGATTTGACAAAGGCTTACAATGAGATATCCAGCTCTTCACGTATACGTAGTATATGGTGCAGTGTATCAGGTTGTTGTGGCAACACATTTAATGGTGGTGCTGAATTGAAATTATACTCCTTCCCTCACAACAGTGAAATCTACGAAAAGTGGATACATAATACTCAAGTGGAAGTAGATGATAAGCAAAGGTACCTCTTTAAAGTATGTTCCCTGCACTTTGAAATTCGTTGCGTTAGAGAGCCCTCAAAACGCTTGCATCCATGGGCTTTACCTACACTAAATTTACCATTAAAAGACATACCAACAGTTATTATACAAAATCCCAGCCCAGAGGCGTTGGAGTCAAATAATTCAAGCTTTACTAACTTGGACACAAACTGTTGCATTGCCACATGCGTTAACgccaaaaagaaagaaaatgataaaaaggaCAATGAAGATGACGACGATGAAGATGAAAAACCTTTATCTAGTATTATACTGTATAAATTTCCACGTGATGTACACTTATTCCGAAAATGGCTCTACAATACCCAAATGGATGCTAACGATGCTATACACGCGCGTATTTGTGGACGACACTTTGTGCCAGCTTGCATTGGGAAAATTTTGAGTTCATGGGCGATACCTACACTGCATTTGGGGCATACTAAGCCAAATATACATCAAAATCCAAAAGAAGAAATGACAGTTGACGATGATGtaatacaaaatgaaattaagttaGGAAAGGAACTAGAAACAAAAACactacaatttgaaaaaaacagcACACTGAAAGTGAACGAAGCTTCAGTTGATAAAAAACCTACAAAAGAGCTACTCCAAAGTTTGCTAAAAAAGGAGGAAATCAAATGTGCAAGTTTAGAATCACTGCTCTCGCCAAAGTGGGAACAACATAAAATAGCCCGTACTTCCACGCCTAAAGAAGAAATTAGCAAAAAACCAATAATTGTCAAAAGTGGCAAGCAAGAAAACTTCAGAGCATCTCCGCGAACAACCAAtaacaaaactgaaaaaactGGCGATGgcaatatttctgaaaaatcgCTTTCAAGCATGGAAGATGAAAGGATTATCAGCAGAGTAGAAGAGGAACAAGTGAACGAAATAGAAGATGTCGATGGAATTTTCGCAGTAGAAGATGAAGACGACGAAATAATTTCAGAAGAAAATGAATTGCTCGTAGTCGGTGAAAGAGTAATCGGAGAAGAAGATGAAGTTGATGAAGAAGGAATGCTAGAAGACGAAATGGATGAAGAAGATGCCTTCGATGAAACAGCAGAATTTGATGACCAACAATTTGGCAGCACAGATTTTAAAGGGTATAACGATGAAGTGTTTTACAGTTTAGGTGATGAACCCGCTGATCTGCTCTACGCAACTCCAACAGAAGATATACAAAGTTGTCGCGTTGATGGCTGTACAAGCCGTGTAGGCCAACCAGATATTGTGTTACATAAATTTCCAAGTAATGCACTATTACGCAATAGATGGTTACACAATACTAAAGTCAAATGGGACGCACTGCGTCCGTGGGCTTACAAAATATGTAGTCTCCATTTTGAGCCACAATATTATACTTCAATGACGAATCGTTTAACGGCATATGCAATACCAACATTAAACTTGGGTCACACGAGGCCAGAAAAAATTTACGATACTGCTATTGCATCAAATTGGAAACCAGGGGCAAAATCTTCAACACTACAAATCGGAAAGAAATCAGAACTGAAGCCGCTGAAACCAAAACCATATCCAATGTCCGGATTAATTGAACACTCTCGTCAAACACCGCCAACTTCAAGCTTCAACTTTAAGATAACAGCAATTGAAAGTCTCTTGAATGCATCAACTAAAGCTGGAGATGTTGCCATAAAGGGCGGCGATGACAACTTTGATATGGATAGCGATAGCAATTCGGTAAGCTCTACCGAATATCAGCTGCAGCAGTATTCGGATGCCACAAATCCATCAAGTGCCAGTGTTAGATCATCAAATACAACGAATCTCAGCTGCAGCATACCATCATGTCAGCGAAAAACCAACCAGGATTATGTACAATTACACATGCTACCAAAAGATACTTTATTGCTGCAGCAATGGTGTCACAATTGTAAGCTCTCGCCATCGCCACATGAACTGTACACAACCCGTGTATGCAGCTATCACTTTGAGGATCAATGTTTTACAAGAAGTCGCCGTCAATTAAAACTTGGTTCCTTGCCTACACTGAATCTTGGTCATAATGATGAAATCTATCAGAATTTACGGCAACCCAGTGCTCCATTACGGTGTTGTATAGGAGAATGTGGTCGACGACGTGTGGAGGATCAGGCGCGTCTTTTTAGATTTCCCACCGATAAACGTATGCTATCGAAATGGTTGTACAATTTGCAAATGGAATTTAATGCGGCACGTCCATGGAATTACAATGTTTGCGAAAAACATTTTGAAGATTATTGTTTTGGTAAGGCGGGTAGTATTATGCCATGGGCAGTGCCTACACTAGAACTTGGCAGCAATGGTCGAGAACAACGAGACTTACGTTTAAATGAACGTCCACTGGAAGACCAGATAAATGATGATGACCTCTTTATTATGCCTGATAGCACGCCAACGGAAGTTGAAGGCCTTGTCTCAGCACCTGAACCGAACTTTAATAAGAAATTGTCAACTGATTTTAAATCAAACAATGTTTTCTATGAACAACACTTTACCTTCAAAAAGAACATGCTGCAACTAAAagatgatattaaaatttctacatCTAACTGCTTCAGATGTATTGTACCTAATTGTGGACTTATTCGGAGTGGCGATATTAGAATGCAAATGTTTGCACTCCCCAAAGATTCACAACTCAgacaaaaatggatttttaatatcaaactTTTAAGTTACACTTATAATCCAGACAACTCTATCGATAAAATTTGTGCGCGACATTTTGAGAGAAGTGCAATTATTGGTCCAAATATGCTAAAAAAATCTGCAGTGCCAACTCTTAACCTTGGACACGGTGATCACAATATTTACGTAAACGTTGAAGAGGAACCGAAATATCTAAACCACGCTTATGGAAGCGCAGCAGATGCCTTACAAGCTATggcgaaaataaaaatagaaccATTTGAGGAACAACAGCGCTCAAAAGCCGATTCAACACTGCTACAATTTCAGCAATATGAAAATCAGTTCAGCTCTTCATCATTTCCGGCAGACCCTGATCCAAATAGTTCATTCAATATAACAGCAGTAGTGaattcaaattcaataaaatgtttccTAAAGCATTGCAAACGTAAGCGCGTAGATGGTGTCAAACTGTTTCGTTTTCCACGTGAACCGACGCTGCAAGCGCAATGGGAACATAATCTTCAAATGGTGTTCGATGATACATTACGTGAACGACTGTTCTTGTGTAGCACACACTTTGAGCCGCAATTTATGGGACGCAAATTTCTAATGAAGGGTGCAATACCTACATTAAATTTGGGGCATAATGACTCGAACATTTTTAGAGCCGATGCTTGGGCATTGTCTGAAGAGTCAAGTGAATCATCAGTAACGCCATTGCGTAATGGTGCTGGCTCCGGCTCTTTTACCACATTACCTGGGGCGATAAAGCAATCAACGCAAGTGTCGTGTTGTGTACCTGGTTGTGGCCTCAAAGAAACCGGAAGTGATAATGTTTTGTTCTTTCCGTTTCCATTGGATGAAAAGTACGCAAGTATATGGCAGCACCGTTTTCGTATCAATTACGCAAGTAGAAGAGCAAAGCTGCGTGTATGCAGTATACATTTTGAACCGCATTTGATATGTAGAAAATCTTTAGTGCAACATGCGGTGCCAACTTTAAACTTACCACCACCTGTGCCACATATGAATAAACAGCTGTCGCGCGTGGTATCTGCTCCAATAACTTCACCATCCACTTCCAGTAATACATGCAACGTAACCGGTTGCCACAAAAATACATTACAAGACAATGTGAAAATGTTCTCGAAGTTTCCCGATGATTTCGAATTATTTACCAAATGGTGttacaatttgaaaattgatCCTAGAAAATATATCGATGGCAAATATAACGTGTGTAGTTCACACTTTGAATCGTATTGTATTGGTGGGCATAGTTTGCGTTTGTGGGCTGTGCCCACACTTAATTTGGGTCACAATGATGCCAATATACATCAAGTAAAGCGACCTGCAGAAATGGAAGCGAAATGCTGTTTGCCACATTGTGGACGGCGGCGTAGTACAGATGgtgttaatttttacaattttccaAAGGgtGATTTGTATAGAGCTtggtgtaaaattttaaatattgatgaGCTCTTATATCGCAATACAGATAAGAAAATCTGTAGTGCACACTTCACTGCTGATTGCTTCAATGGTTTAACGCTTAAAGCAGGCGTTAAACCGACTTTGTATTTGCGTGCTAGGCCAGCAGTTGCAGCAGCATCAGTTGCAACAACACAACATCATCAACCAAGCAATCAACAACTGTTTAAAATCCATGATTCACAAACAAATGTACCAGGCTGTTTGCTAACACATTGCTCTAATAATTCAACTACTTCATCCTCTGTGCAATTTTATGCATTTCCAGATAAACGCTATCTATGCATGAAATGGTGTCACAATTTGAGATTAAATTATAGTTCACAAATGTTGTATAATAAACGAATTTATAAAATctgttttaaacattttgagCCCAGCTGTATTTATAATGCAAAATTACACCTTGAAGCTGTACCCACACTGGAGCTCGGTCACGCCgatgttaatatttttcaaaatactgGCTCTTTTTCAACGTTAACAAATGATAGCAGCAGCATTAGCAGTAGCTCTACTGGATTTGCGAACGCAATGGGTGGTGGTAGCAGTGACTATGACAGCAGCTTAATGTATGTGAAAGAAGAAGTAATGGATCAAGATGAGATGGAGCCATTGACTGAAGTGCCAGATGTTGATATGTTGAATAATTTTACATCCACATTTTGTGAGCCAGAAATGTGA
Protein Sequence
MSQNQNKQSYAHGFSGSGVVAGSVGGMRMGGTAAASDSGRMHEQRNEYNFGGGGGGEYAGAYWQQQQYQQQQMHQQVFQQRQQQQLQQRQQPQQLQQRQTPQQLQLRQQQQLQQRQQLLQRQQPQQLQQRQQPQQPQQLQQRQQLQQREQQQRQQQQQLQQREQQQLQQRQQHHQLQQRQQQLQQRQQQMQLQHRQQLQLQQRQQQQQQQRQLDAMTMKSDALESNVQVPTIEMDELIIKTEPPDDTGFVNIMENQDSKSVTNAIIHRSTYLNNTSFPHIKEEPSTSQQKTLNFPRRKLQTERAETLPICQRCKQVFLKKSTYMKHVRHSSCSILEYNFKCLICPMSFMSQEELQSHESVHRADQFFCQKYCGKYFDTLDLCEAHEFMFHDFSIFTCNVCNANFKTRDVLFAHKSQHRHLNRFDCPICRQWFSTPSELRRHRLEAPFYCGKFYAEKEETPPSSNISSNRTQTNKNNDIISNFQPDITIKSEPDFDNDYYTPPQSSASPIAPHPARTNPRTDAFYHNAPTSYPNTDFNNYTNIDDHAYNDFKPHYQHLERQHDNNFSIFSPPFIPATPAVSENDAICCVPHCGKRKSSNPTLQFFKYPKDEKYLKQWLHNMKLIYNAEESYEDYRICNLHFPKRVINLHSLCYWAVPTYHLGHDDFANIFHNREQSLSSVQCSIRGCESVRGKTNVKFFNFPSTETQAYMKWCQNSRLSLTSTEIRQICSRHFEEQCFGKLRLKNWALPTLHLGGFPVIHDNPRILFTEDKKCCLQHCRRKRSEDITLSFYGFPRDEQLLLRWCYNFRLDPNEFRGKFYKICSAHFVKDVIGTVKLLPGALPTLELGHNDPNIYTNELYPNSLNTHNTTSTSSNSSAVVGMQETPDTCSLPTCKRNRFSDNVSMHTMPGRSEQFKKWCHNLKINPAMLNKHFRVCSIHFEPYCIGGCMRPFAVPTLFLGHDDKNIYHNPKVIKKLSLRETCCVAVCRRNRERDRANLHSFPLHKPELLQKWLHNLQRPMPDGTRLFNDAICEQHFEDRCIQNRRLAKWALPTLLLGHDDDILQLPSEAEVEEIMAKTAINEVNNGDAGGECCVETCKRNTNSEENVKLYFLPDDEEVLEKWAHNLQQPDLKDYATTLRICNLHFESYCIGKKLHSWAIPTLNLSHQLEHIHENPSDMKVLGGLKAPNAPRCCLPSCRKMRVDGEVQLFRFPTYSKSIFKKWCHNLQISMASASSSHKRLCSTHFEPCVLTKRCPMPQALPTLNLNCPENYKIYQNPERLKLAKLRNERICCLASCRKGKKDGIQLFLYPHNRSLLRKWAHNTKQKATDCTRAQLRLCSDHFEDNCFGPSRLIPGAIPTLKLGHDDDDIFPNTLPEISEDEQQCAISGCGRKRAVDGVRLFRFPADDEDLLWKWCINLKLNPIDCPQQRICNMHFEHECIGSKGLHKWAIPTMLLGHNDTELQLHTNPKPEERVCPEPVLLKCCVATCGKTRKHDDIMLNSFPKTPDLFNRWAHNLQMDIHFSEREKHKICNDHFESFCVGQARLRSGAIPTLNLGHDNTVDIHPIERADLQGFCKPIRHSTERSVIKKFHIAEEIEAEVELAKCCYPDCTASKSLQRDLLFDFPEAELLANEWFKAIDLKQSEISNPKLCGIHFIVVYEETAEARAIFAETNPELNDDLQMLANCYYRCCSSSLIRSMQCCIPECKTNSLSGTKLYQFPYQRELKEKWAHNTGVELDENQRHLYKVCALHFETHCMTESQRLRPWAIPTLHLQHDVAIELHQNPDLLSLERRLLGPQTNKCCVPSCERNQTVDDNARLFNFPIEESMIDKWLHNLRVVEREKCPLYRICSWHFEKRCISKSGRLRSWAIPTLKLRNNNSDSDDNGDEDDDIIPNPEGPSTSAVASTEPHETDESVANSSPDVVKGCKMKKNLDLIKCCVPTCRKSRLQHGVRLFTSPSDVKMLRKWAHILLLPVTALKKQVHICNQHFAKRCIDGKQLHAWAIPTMNLGNSSSDTKAIPTPKLEKIVTQLEIKEQPTPQFNQYLPICCVPNCGRRRTVENAMRTFGFPLNPKLLRRWCTNLKMDTSTLKGQRVCIYHFEEELRGKTKLKSGAVPTLNLGHDEGATYDNKALLLALKEKDNSTIAESSVENTNTNMESNIEMNLEESTENITEISNENCCAVTGCTTTKSADDVADDDDDDNIQLFNFPTDSNLLQKWCENLQLKPAACDALKICQRHFELQCFGKRTSLYMWAIPTLHLGLDELTPLPHDNSKNIKIVQTDNNLLDNFKRCCIATCRKTRRSHGVTLFSFPKPSQLLLRKWCHNTRQAFKRRQKRRICSDHFEKYAMSNGKLRPWALPTLQLGHTEPIYENPTNMFDKCCVTTCESNENTQQLFSFPKRSSQLRKWAHNTKQDAYLAIRRNYKVCILHFEKNCIGANGLNFGAIPTLQLGQHDVDADLQDNFANEVVTTGDVDVSCSISNCGRKLAKDGIILFPFPKQQELLEKWCHNLKLETEKCEDLYICNMHFESGQVRKYKLLEEAVPTLLLGHNDKSLELIKTPESSNTAPVIHCCVQNCNKSKPNDDVQMMGFPKNRTLYLKWLHNLKVKPNQSVSRVCSDHFEDICMAASRIKHGSVPTLNLGHDDPNIYHNNKKVLLTAEEKRKLEKCVASDDCHVSKMLQLKLFEMPKLESILQRWLTHFGVEFNDNIDMEPKVCAIHFMDAYEKLCQSKDIDTVQETDPLQLDSQGEDKQELENLAVTYKHIANLPRLRILLCSVPGCHTKFTDSTIRLIRFPLNPEVCAKWCHNTKANFDEERRYLFKICSLHFEEYCYVHTRLHSWALPTLRLPQTDDIYQNPEPEEDQPEVCCLESCRPTSSIKLNDIVDVDDSTEVPEESAEVGKKLFSFPADEELLQKWCHNLDMPPNAETRGLKICSVHFENYSVGKVLRPWAIPTLKLNRDPTTLYTNPEIKDLPFGRTSTCCVRDCRQQRDDEAGIKLFAFPLKQELRQKWLHNLELKDHADDCRSYRVCSLHFERHCVGKRLHNWALPTLRLCQGNETVLYKNPRMSKKKLNAKCCLKHCRKSRRQNQVRLFSFPETESQVLKKWMHNLKLDDAALKRKLCSSHFEKFCFYKGMKRVHPWAIPTLDLGHKDVIFENPKGKKLKLQQLPTAKEKCALTFCDSQQNLLAAGETSIRFFGFPCVIDLLQQWCDNLKLDMTTYRCTKQRICEHHFEPQVLNDKKLLPGAVPTLNVGHEEVIKHQNPADMSKYRCCSLAYCKAEMSDLIKLFDLPLNNMALLRQWCENLQLNEEACKSGQYIKVCSAHFVPAVISGNTLRPHAVPTLKLGHDNAITHENSSQTLHSEVIKQCVVTDCAAHLEPHANLFDFPKKPKRLQKWCENLKLSVADGTEYHICREHFDSSCFDPSDDQSLSDWALPTLNLGHTEQIPYERRKSSSKLSLQKQSQCCVNTCQSTTDVQLYKFPTINWLIKKWCYNLRRSEEGAEHLKICEKHFEKHCLGISKPRPWAIPTIELGHTDEIHQSMKIHKYHPEEKACGEMKFVRSNYCAIVSCLKSKQDGVQLYRYPSKGTMLHKWAHNCRHRAYQAARYIFRICSEHFEEPCLSGIAENRLRPGAIPTLKLGHDDIDIHPHEVLDCPTPSNLTELNVACDVPNCGRTKLVDNVRLFKFPKDIEMLTKWCYNLHLEMMNEKDAKNKRICNLHFEERCFSLRKIRLLLRAVPTLLLGHTDIDDIYENPESFERPEKVLRCCVPECGRAKFEDNVTLFCFPKSRKLFGQWAENIRLELKPTTETYMYTRVCSMHFENYCSFGQSVRRLLFGAIPTLKLGHNDENIHHVKREMLSSTLGELRKRKREEDSTRPMVKRDAQCCVPKCNAKDHHIYALPQHATLQHMWYSALDLQGAEDIIYDDEKQICAIHFKIAYIKTHKQMLAVTVDDAFVAVLKDLTKAYNEISSSSRIRSIWCSVSGCCGNTFNGGAELKLYSFPHNSEIYEKWIHNTQVEVDDKQRYLFKVCSLHFEIRCVREPSKRLHPWALPTLNLPLKDIPTVIIQNPSPEALESNNSSFTNLDTNCCIATCVNAKKKENDKKDNEDDDDEDEKPLSSIILYKFPRDVHLFRKWLYNTQMDANDAIHARICGRHFVPACIGKILSSWAIPTLHLGHTKPNIHQNPKEEMTVDDDVIQNEIKLGKELETKTLQFEKNSTLKVNEASVDKKPTKELLQSLLKKEEIKCASLESLLSPKWEQHKIARTSTPKEEISKKPIIVKSGKQENFRASPRTTNNKTEKTGDGNISEKSLSSMEDERIISRVEEEQVNEIEDVDGIFAVEDEDDEIISEENELLVVGERVIGEEDEVDEEGMLEDEMDEEDAFDETAEFDDQQFGSTDFKGYNDEVFYSLGDEPADLLYATPTEDIQSCRVDGCTSRVGQPDIVLHKFPSNALLRNRWLHNTKVKWDALRPWAYKICSLHFEPQYYTSMTNRLTAYAIPTLNLGHTRPEKIYDTAIASNWKPGAKSSTLQIGKKSELKPLKPKPYPMSGLIEHSRQTPPTSSFNFKITAIESLLNASTKAGDVAIKGGDDNFDMDSDSNSVSSTEYQLQQYSDATNPSSASVRSSNTTNLSCSIPSCQRKTNQDYVQLHMLPKDTLLLQQWCHNCKLSPSPHELYTTRVCSYHFEDQCFTRSRRQLKLGSLPTLNLGHNDEIYQNLRQPSAPLRCCIGECGRRRVEDQARLFRFPTDKRMLSKWLYNLQMEFNAARPWNYNVCEKHFEDYCFGKAGSIMPWAVPTLELGSNGREQRDLRLNERPLEDQINDDDLFIMPDSTPTEVEGLVSAPEPNFNKKLSTDFKSNNVFYEQHFTFKKNMLQLKDDIKISTSNCFRCIVPNCGLIRSGDIRMQMFALPKDSQLRQKWIFNIKLLSYTYNPDNSIDKICARHFERSAIIGPNMLKKSAVPTLNLGHGDHNIYVNVEEEPKYLNHAYGSAADALQAMAKIKIEPFEEQQRSKADSTLLQFQQYENQFSSSSFPADPDPNSSFNITAVVNSNSIKCFLKHCKRKRVDGVKLFRFPREPTLQAQWEHNLQMVFDDTLRERLFLCSTHFEPQFMGRKFLMKGAIPTLNLGHNDSNIFRADAWALSEESSESSVTPLRNGAGSGSFTTLPGAIKQSTQVSCCVPGCGLKETGSDNVLFFPFPLDEKYASIWQHRFRINYASRRAKLRVCSIHFEPHLICRKSLVQHAVPTLNLPPPVPHMNKQLSRVVSAPITSPSTSSNTCNVTGCHKNTLQDNVKMFSKFPDDFELFTKWCYNLKIDPRKYIDGKYNVCSSHFESYCIGGHSLRLWAVPTLNLGHNDANIHQVKRPAEMEAKCCLPHCGRRRSTDGVNFYNFPKGDLYRAWCKILNIDELLYRNTDKKICSAHFTADCFNGLTLKAGVKPTLYLRARPAVAAASVATTQHHQPSNQQLFKIHDSQTNVPGCLLTHCSNNSTTSSSVQFYAFPDKRYLCMKWCHNLRLNYSSQMLYNKRIYKICFKHFEPSCIYNAKLHLEAVPTLELGHADVNIFQNTGSFSTLTNDSSSISSSSTGFANAMGGGSSDYDSSLMYVKEEVMDQDEMEPLTEVPDVDMLNNFTSTFCEPEM

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01486219;
90% Identity
iTF_01486219;
80% Identity
-