Basic Information

Gene Symbol
-
Assembly
GCA_035046045.1
Location
JAWNOI010000024.1:1193090-1210087[-]
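
The location string above packs contig, coordinates and strand into one field. A minimal sketch of how it could be split apart is shown here. The names `Location`, `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this entry does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for "contig:start-end[strand]" as it appears in this entry.
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


if __name__ == "__main__":
    loc = parse_location("JAWNOI010000024.1:1193090-1210087[-]")
    print(loc, span_length(loc))
```

The span covers the whole locus on the contig, so it need not equal the length of the coding sequence listed further down.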

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
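
The C2CH consensus quoted in the description can be read literally as a regular expression. The sketch below does only that and is not a substitute for the Pfam HMM (PF05485), which scores many more positions. The function name `find_c2ch_candidates` and the toy sequence are hypothetical and are not taken from this entry.

```python
import re

# Literal reading of the consensus: Cys, 2-4 residues, Cys, 35-50 residues,
# Cys, 2 residues, His. Wrapping it in a lookahead lets candidates that start
# at different positions be reported even when they overlap.
C2CH_PATTERN = re.compile(r"(?=(C.{2,4}C.{35,50}C.{2}H))")


def find_c2ch_candidates(protein: str) -> list[tuple[int, int]]:
    """Return 1-based inclusive (start, end) spans of C2CH-like stretches."""
    spans = []
    for match in C2CH_PATTERN.finditer(protein.upper()):
        hit = match.group(1)
        start = match.start() + 1        # convert 0-based index to 1-based
        end = match.start() + len(hit)   # inclusive end position
        spans.append((start, end))
    return spans


if __name__ == "__main__":
    # Synthetic toy sequence built to satisfy the spacing rules.
    toy = "MM" + "C" + "AA" + "C" + "A" * 40 + "C" + "AA" + "H" + "KK"
    print(find_c2ch_candidates(toy))
```

A pattern this loose will also match unrelated cysteine-rich stretches, so any hit is only a candidate to be checked against the hmmscan coordinates.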
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 28 5.5e-15 8.7e-12 45.5 4.3 1 86 610 682 610 683 0.85
2 28 3.1e-15 4.9e-12 46.2 4.6 1 87 710 779 710 779 0.83
3 28 7.7e-16 1.2e-12 48.2 0.4 1 87 801 873 801 873 0.85
4 28 6.7e-16 1.1e-12 48.4 5.7 1 87 976 1046 976 1046 0.83
5 28 8.7e-15 1.4e-11 44.8 3.6 1 86 1070 1141 1070 1142 0.82
6 28 4.5e-12 7e-09 36.1 0.8 1 87 1177 1245 1177 1245 0.80
7 28 1.1e-10 1.7e-07 31.7 1.5 1 86 1292 1361 1292 1362 0.76
8 28 2.8e-15 4.4e-12 46.4 0.1 1 86 1389 1458 1389 1459 0.82
9 28 5.3e-13 8.2e-10 39.1 1.6 1 86 1480 1549 1480 1550 0.81
10 28 3.9e-15 6.2e-12 45.9 1.7 1 86 1577 1648 1577 1649 0.85
11 28 5.8e-13 9.1e-10 39.0 4.3 1 85 1729 1797 1729 1799 0.82
12 28 5.9e-12 9.2e-09 35.8 0.0 1 86 1822 1890 1822 1891 0.81
13 28 1.8e-13 2.9e-10 40.6 1.4 1 87 2024 2093 2024 2093 0.81
14 28 2.1e-11 3.3e-08 34.0 0.1 1 61 2155 2210 2155 2234 0.83
15 28 0.0043 6.7 7.4 0.0 1 58 2243 2293 2243 2313 0.76
16 28 1.3e-13 2.1e-10 41.0 1.1 1 86 2332 2401 2332 2402 0.85
17 28 6.1e-14 9.6e-11 42.1 2.9 1 87 2468 2538 2468 2538 0.83
18 28 1.5e-12 2.4e-09 37.7 1.0 1 86 2573 2644 2573 2645 0.81
19 28 3e-12 4.7e-09 36.7 0.9 1 87 2655 2727 2655 2727 0.78
20 28 2.7e-14 4.2e-11 43.3 0.1 1 86 2757 2831 2757 2832 0.76
21 28 0.00032 0.51 10.9 0.3 1 58 2863 2917 2863 2941 0.78
22 28 1.3e-14 2.1e-11 44.2 0.3 1 86 2955 3027 2955 3028 0.82
23 28 4.8e-14 7.6e-11 42.4 1.8 1 86 3160 3232 3160 3233 0.79
24 28 8.1e-13 1.3e-09 38.5 3.4 1 87 3293 3364 3293 3364 0.82
25 28 1.8e-14 2.8e-11 43.9 2.6 1 86 3476 3546 3476 3547 0.85
26 28 2.6e-12 4.2e-09 36.9 0.1 1 87 3631 3701 3631 3701 0.83
27 28 1.2e-08 1.9e-05 25.1 0.5 1 58 3718 3766 3718 3777 0.86
28 28 5e-10 7.9e-07 29.6 1.6 6 87 3776 3841 3772 3841 0.71
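
Each row of the table above has 13 whitespace-separated fields: the hit number and total, the conditional and independent E-values, score and bias, then from/to pairs for the HMM, alignment and envelope coordinates, and finally the accuracy. The sketch below shows one way such rows could be parsed and filtered. The class `DomainHit`, its field names and the cutoff `I_EVALUE_CUTOFF` are hypothetical choices for illustration, and the cutoff is not a threshold used by the database.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int       # "#": ordinal of this domain hit
    total: int       # "of": total number of domain hits reported
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(line: str) -> DomainHit:
    """Parse one whitespace-separated domain row with exactly 13 fields."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        int(fields[6]), int(fields[7]), int(fields[8]), int(fields[9]),
        int(fields[10]), int(fields[11]), float(fields[12]),
    )


# Three rows copied from the table above (hits 1, 15 and 21).
ROWS = """\
1 28 5.5e-15 8.7e-12 45.5 4.3 1 86 610 682 610 683 0.85
15 28 0.0043 6.7 7.4 0.0 1 58 2243 2293 2243 2313 0.76
21 28 0.00032 0.51 10.9 0.3 1 58 2863 2917 2863 2941 0.78
"""

# Hypothetical cutoff for illustration only.
I_EVALUE_CUTOFF = 0.01

hits = [parse_domain_row(row) for row in ROWS.splitlines()]
confident = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

if __name__ == "__main__":
    for h in confident:
        # Print hit number, alignment start/end and aligned length.
        print(h.index, h.ali_from, h.ali_to, h.ali_to - h.ali_from + 1)
```

With this cutoff, hits 15 and 21 are dropped because their independent E-values (6.7 and 0.51) exceed it. Both also cover only part of the HMM (positions 1 to 58), so they may be truncated or degenerate copies of the domain.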

Sequence Information

Coding Sequence
ATgtcacaacacaacaacaacaacccccTGTCGCCGCCGCATCTTcagcatcatcaccatcactactaccagcagcagcagcagcagcagcaacaacaactgcagcagcagcagcagcagcagcagcaacaacatcatcatacgcagcagcagcagcagcaaccacacacacagctgcagcacaaacaaatacagcagcagcacagttgGTACTCACATGTTGCTTCCTACCCTCCCCACCATCCGCACGCCGGGGCCTTTGCGTCGCCCTGCAAagccaataacaacaataccaataccaataccaatgctaatgccagcagcagcaacaacaacaacgacaacaacagcattatGAATGCATACGGcgcggcagcaggagctgctgcataCTACGGCCCTGCTGCAGGGGGTGTGGTGGGCGGCTATAACCTTGAGTCCAATAATGTTGCCTATGCGCAcaaccagctgctgcagtaccaacaacaacaacaacagcaacaacaacaacaacatcagcttAATCAACGCTCGTATATGCAGCACGGCTTAATGCATAGCGCGTATCCCTACATCAAGAGCGAGCCTTTGGAGCTGCCCGATGATagacagcgccagcaacaacagcaacaacaacaacatcaacagcagcagcaacaacagcattaCCAGAGCCCAATGGCACCGCCGCCGGCACCCGCCGCCAATCGCCATGCTCTCGATGCCAGCGGCGAGATGATAATAAAATCGGAGCCCATTGAGGAACATGCTTACAAGTCCAGCTATATCGATGATAACACGCCGTTCGCCGATTTTAGTAAGTTCCCCGAATTTGGCGACGACATGCTGAACCCCAAGGTGGAGCTGACTGTCAAGGATGAGGCCTACGGCAATCAGAAGAACGCACTCAACTATCCGCGGCGCAAGCTGCAATCGGAACGTGCCTCGGAGAGCCTGCCTATTTGCCAGCGCTGCAAGGAGGTGTTCTTTAAGAAGCAAGTCTACTTGCGGCATGTGGCagagagcaactgcagcatgcaGGAGTACGACTTCAAATGCAACATCTGCCCCATGTCCTTTATGAGCAccgaggagctgcagaagcACAAGCATCTGCACCGCGTCGAGAAGTTCTTCTGCCACAAATACTGCGGCAAGTACTTCGACACGATTGCGGAATGCGAGTCGCACGAGTACATGCAGCACGAGTATGAAAGCTTTATTTGCAACATGTGCTCCGTTACGTTTGCCACGCGGGAGCAACTCTATGCGCACTTGCCGCAGCACAAGTTCCAGCAGCGTTTCGACTGTCCCATCTGCCGTCTGTGGTATCAGACGGCGTTGGAGCTGCATGAGCATCGCCTGGCAGCGCCGTACTTCTGTGGCAAGTACTACGCaggcgtgcagcagcagcagcagcagcagcgtgtgcagcatcagcagcagcagcagcatcatcaccaGCAGAATACCAACTACAAGCTGCAGGACTGTCACATGGCAACCATGGAAATGCAAACGACGCACCATAAGGCGAATGCGAGTGCTGCCTTGCCTGCAACCGCGGCTCTCAACTCGTTGTTGCAACAGCGCCAGGCGAATGCCGATGGTGCCGCCTTGTTTGCGTCCAGCATGAAGAACGAGGTGAACGTCAAGTTGGAGCGGAGCTATAGCAATTCTACAAGCGAGTCGGGTTACAGTCTGCACGACAGCAGCTACAACGCAGCCTACGGCAGCGACACCTCGTTGCAtggcactgctgcagctggagcaatAGCTGGACCGCAGGCACACTCCTCGACGCTGGACGATTCGGAGGACGCGCTCTGCTGCGTGCCACTGTGTGGCGTGCGGAAGAGCACCAGCCCGACGCTGCAGTTCTTCACGTTCCCCAAGGATGAGAAGTACCTGCATCAGTGGCTGCACAATCTCAAGATGTTCCATATTCCGGCGTCGAGCTATGCCACCTTCCGCATCTGCAGCATGCACTTTCCGAAGCG
CTGCATCAATCGCTATTCGCTGTGCTATTGGGCTGTGCCCACGTTCAATTTGGGCCACGATGATGTTGCCAATCTGTACCAGAATCGCGAGCTGACCAACACCTTCACCACCGGCGAGGTAGCGCGTTGCAGCATGCCCAACTGCACTAGTCAGCGGGGCGAGAGCAACCTGAAGTTCTACAACTTCCCCAAGGACATCAAGAGTTTGATCAAGTGGTGCCAGAatgcccgtctgcccgtccaGGCCAAGGAGCCGCGTCACTTCTGCAGCCGCCACTTCGAGGAGCGCTGCATCGGCAAGTTCCGGCTGAAGCCGTGGGCCGTGCCCACGCTGCACCTGGGCGCGCAGTACGGCAAGATCCATGACAACCCCAAGAACCTGTACGTTGAGGAGAAGCGCTGCTGCCTTAACTTCTGTCGGCGCAGCCGCTCCTCGGACTTCAACATGTCGCTCTATCGCTTCCCGCGCGATGAGGTCCTCCTGCGACGCTGGTGCTACAATCTTCGTCTCGATCCTGCTGTCTATCGCGGCAAGAATCACAAGATATGCAGTGCCCACTTCATCAAGGAGGCGCTAGGACTGCGCAAGCTCTCGCCGGGAGCTGTTCCTACGCTGCACCTGGGGCACAACGACACGTTCAACATCTACGAGAACGAACTGTGGCCTCCGCCAACGGCGTCGACGCCAacgcagctccagcagcagcagcaccagcagcatcaaattcagcagcagcatcagctacaacagcagcagagcagctatCACCAGGGCAAGTATCAGCGCCATTCGGCTGCGTCCACATCCTCGTCGGCAAGTTCGGCGTCGCACTATGTGGACGCGGAGCAAGGCGCCTCGTacatgggcatgggcagctCGGTGGGATCGACGTCGGGTCTGAATGTGAGCGACAGCATGGACGTCTGCTGCGTGCCGAGCTGCGAGAGCAAGCGacacaacaacgagaacatcACATTCCATACGATACCGCGGCGGCCGGAGCAGATGCGCAAATGGTGTCACAATCTCAAGATACCCGAGGACAAGATGCACAAGGGCATGCGCATCTGCAGCTTGCACTTCGAGCCCTACTGCATTGGTGGCTGTATGCGGCCCTTTGCGGTGCCCACGCTGCATCTGggccacgacgacgacgacattcATCGCAATCCGGACGTGATCAAGAAACTCAACATACGCGAAACCTGCTGTGTGGCTGTCTGCAAGCGCAATCGGGACCGCGACCATGCGAATCTGCACCGTTTCCCCAGCAATGTGTCGCTGCTGAGCAAGTGGTGCGCCAACCTGCAGCGCCCAGTGCCGGACGGCAGCAAACTCTTCAATGACGCCATTTGCGAGGTGCACTTCGAGGATCGCTGCCTGCGCAACAAGCGGCTGGAGAAATGGGCAGTGCCCACTCTGGTGCTGGGACACGAGAACATTGCGTatccgctgccgacgccggaGCAGGTGGCCGAGTTCTATGCCCGTCCCAGTGTGCCCAACAACGGCGACGAGCAGGGCGAGTGCTGCGTGGAGACCTGCAAGCGTAATCCCAGCGTGGACGACATCAAGCTCTATCGCCCGCCCGAGGAGTCATCGGCGCTGGCCAAGTGGGCGCACAATCTCGAAGTGGACGTCGCGCAGTTGCCGAGCCTGCGAATCTGCAATCTGCACTTCGAGTCCCACTGCATTGGCAAGCGCATGAGGCCTTGGGCCATTCCGACGCTCAATCTGTCCAGCAATATTGAGAATCTCTACGAGAATCCCGAGCAGCACATGCTCTACAAGCGTCGCACACATCTCAAGGCGGAGCGGGCAGCAACCGCCTCAGCCGGCGCGGGCATGACGAAGCCGACTTGGGTGCcacgctgctgcctgccgcatTGCCGCAAGGTGCGCGCCCTGCACAATGTGCAGCTCTATCGTTTCCCCAAGCTCAATCGTTCCACGCTCGCCAAGTGGGCGCATAATCTGCAGGTGCCGCTGGTGGGCAGTG
CACAGCGGCGACTCTGCTCCGCGCACTTCGAGCCGCATGTGCTGAGCAAGAAGTGTCCGGTGCCGCTGGCGGTGCCCACGCTGGAGTTGAATGCACCACCCGGCTACAAGATATACCAGAATCCCGCCAAGCTGAAGGCCAACAAGCTGTGCCTGCAGCGCGTCTGCATCGTGGAGAGCTGCCGGCGCCAGCGCGGTCCCGGCGTGCAGCTGTTCCGCCTGCCGCACAGTCCCACGCAGCTGCGCAAATGGATGTACAACATACGCATGCGCGCCAGGGGCGCTATTCGGCAGCAGTACCGCATCTGCTCCACGCACTTCGAGTCGCACTCGTTCAACGGGAAGCGACTGAGCGCTGGCGCCATTCCCACACTGCAGCTTGGCCACGACGACGAGGATATCTATCCGAACGAGGCGCAGTCGTTCGTCGAGGAGCACTGTGCCATCGAGGGCTGCGACGCCTCCAAGGAGCAGCCGGACGTGCGGCTCTTCCGCTTCCccaacgacgacgaggactTGCTCTGGAAGTGGTGCAACAATCTGAAGATGAATCCCGTCGATTGCGTGGGTGTCCGCATATGCAACAAGCACTTCGAGCCGGACTGCATCGGGCCCAAGCATCTGTTCAAGTGGGCGATACCCACGCTGCAGTTGGGCCACGACGATGCACAGATCGAGCTGATACCCAACCCCAAGCCAGAGGATCGCTACGTCGATCCCGTGTTCAAGTGCTGCGTGCCCACCTGTGGCAAGACGCGCAAATTCGATGAGGTGCAGATGAACAGCTTCCCCAAAGATCCCGAGCTCTTCCGGCGCTGGCGCCACAATCTGCGGCTCGAGCATCTGAACTTCAAGGAGCGCGAGCGCTACAAGATCTGCAATGCGCACTTCGAGGACATTTGCATTGGCAAGACACGGTTGAACATCGGCTCGATTCCCACGCTCGAGCTGGGTCACGACGAGAGCGAAGACCTCTTCCAGGTAAATCCCGCAGAGCTGCAGAGCAATCTGTTTGGCCGCCAGAGGCGCGTACAAGATTCCATGGACGTCGCTATCAAGCAGGAGGAGACCTCGGACatagaggaggaggaggacagCGTCAAACCGACGCACCTCGCCATGGCGGAGGCAACGGACCTGAACACCTCGCAGGTGAAGATAAAGAAGGTCTTGGACGACTACAAGTGCTGTGTGCCGACCTGCGGGCGCAGTCGCCTCGAGCACGGCGCCCGCCTGTTCCCCTTTCCCaacggcaagcagcagcagtgcaagTGGCGCCACAACTTGCGCCTGGAGGCGCACGAGGTAGACCGCAGCTCGCGTGTGTGCAGCGCGCACTTCAATCGACGCTGCATCGACGGCAAGCAGCTGCGGGCCTGGGCGATACCAACGCAGCAGCTGGGCCATGAGGAGCAGCCGCTCTACGAGAACCCAAAGAACATACCAGGCTTCTTCACGCCCACCTGTGCGCTGGCGCATTGTCGCAAGAGGCGGAGCATCGACAACGATTTGCGCACCTATCGCTATCCGCGCGGCGAAGAGCTGCTGGAGAAGTGGCGCGTGAATCTGCGCTTGGCTCCGGACCAGTGCCGCGGTCGCATCTGTGCCGATCACTTTGAGCCGATGGTGCGGggcaagctgaagctgaagacTGGCGCGGTGCCTACGCTCAAATTGGGCCACGACGAGGGAGTGGTCTTCGACAATGAAGCCATCAAGGCGGGTCTGCAGCAGGACGCAGCCGAGGCGGAGCCCGAGGAAGGCGACGTCAGCAGGGAATCGTTGCTGAAGATCAAAACGGAGAGGCTCGATTCAGACGAGCAGGAGGACCATGCGCCGTTGGACAACGAAGCGGACGACGAGGAGCATGAGGAAGTGCCCGACGACCACGAAGATCATGGCTATTTCGATCCCCTCGAACTGGTCGAAACCTATGCCGAGGACGATGAGAATGCCGACATTCCTGGCGATgacgacgagctgctgctgccggataCCCTG
CAGGTGAAACTGGAGAATCCGTACCCCGTGCTGCGTCGCGAGAAGACCGTGAACAACGTAACGCCCATCTGCTGCCTGAAGCACTGTCGCAAGGAGCGAACTGCGAAGCATCAGCTGAGCACCTTCGGCTTTCCCAAgagccgccagcagctgctcaagtGGAGCGCCAACCTGCAGCTCTCACTCGAGGACTGCGTGGGGCGCGTCTGCATCGAGCATTTCGAGGCCGAAGTGCTGGGCACGCGCAAGCTGAAGCAGAACGCTGTGCCCACACTGAATCTGGGCCACGACACGCCCCTGATCTACAAGTGCAACGGCCTGCCCACGGGCCTGGGCATCTTCGacgagcagccgcagcattCGGTTTTTCGGCTTTGGAGCCTGAAACACTGTCGCAAAAGGAAGCTGCAAGCAATGGAACCGCCGGATCTTCTACCTGCAGCGATTAAGTCCGAGcaccgctgctgcctgccccacTGCGGCAAGCTGCCGGTGCTGCATAACGTTAAGCTCTACCGTCTGCCCCGGAACCTCTCGCATCTACACAAATGGCTGCACAACCTCAGGCTACCCGAGCGCGTGCAGGTGAGCAGTGAGACGCGCGTCTGCAGCGATCACTTCGAGTTGCAGCCGGGCGAGGCGCTAGTCGACTCTTTGCCCACCTTGAAGCTGGGCCatagcgacagcgacgtctaTCGCACTTGCAACACGGCCACCTGCTCGTCCTGCCTGGTGCCCAGCTGTCCGTGTGCTCGGCTCAATCTCTACCGCGGCTGCGACCTGCCCGCGCATCCTCTCGTCCAGCAGTCCTGGCTGCAGCACCTCCAGCTGCCCAAGCCCCAGCTGGCCAGCGATGGGCAGCTCTGTGTGATGCACTTCATGCAGCTGTACGAGcaggtgccgctgccgcaagagctgccggcggcggtgcTGCGTCGACTGCAGGAGACCTACGAGCAGATCGCCAGTTCGCCGATGGCAATgaagctgcgctgcgctgtgcccGGCTGTTCCTCTAAGTACACGGACAACATTAGGCTCACCAAGCTGCCCGCGTGCCAGGACATGTTCGCCAAGTGGCTGCACAACACCCGCATCAACTACGAGTCGAGCAAGCGGCACCTTTATCGCATCTGCATGCTGCACTTCGAGCACAGTTGCCTGGGCCAGGTGCGTCCCAAGGCGTGGGCGGTGcccacgctgcagctgcagcacgacgACGCGAATATCTATCGGAATCCCAAGCAGGAGAGCGGCATTGAGGAAACCCAAGATCAGTCGCAGGACTCCCTGCTGCAGCCTGTGCcggtgcagctgccgctgcgcatCAAAACCGATTTGGCGCACTCAGGAAGTCCGAGCGCAAGCGCCAGTCCAAGTCCGCGCGGCAAGCAATGCACTTGCTGCATAGCTAGCTGCGCGCAGCAGGCCAGCGCGCAGACACGCCTCTTCCGCTTTCCCACCAATGAGATGTTGCTGCTCAAGTGGCTGGTGAACACGCAGCAGAAGCCGCGCCTGGTCGACCCGCAGCATCTGTTCGTCTGCCAGCATCACTTCGAGGCGGAGGCCATCTGCAAGAAGCAGCTCAGCAGTTGGGCAGTGCCCACGCTGCACTTGGGCCACGACGGTCATGTCATTCCGAATGCCAAGCACAACGGCAATATCACCGACAGACAGGAGAACAGGGAGGCGCTGCAGTTCATCTGGGCGAACTACTGCTCTGTTCTCACCTGCCTGGAGCGGCGCAGTGAGCAGCTGCGGCTCTATGCTTACCCCCAGGATCGGCCCAGCATACGCAAGTGGGCTGCCAACTGCAAGCATCGCTCCACGCTGGCCAGCAGCCATGGTTTTCAGGTCTGTCAGTCGCATTTTGAGCCGCATTGCTTTGACCCGGACACGGGTGAGTTGCGCGAGGATGCGGTGCCCACGCTGCAGCTGAGTTGCATCACCACAACGGTGAACTGCGTGGTGCCCGGCTGTGTGGAGGAGGAGAACCCGTT
GGGCCACCGCTTCTTCAAGGTGCCCAGACATCCGACCCAGTTGATCGAGTGGTGTCACAACCTCCAACTGAATCCGGCCGTCATGATGCTGGAGAAGAGTTTTGTGTGTGATCGCCACTTTGAGAAGCATTCGTTGAACGCGCAGAGACTGCCGCGTCGTGGCGCACGGCCAACGCTGCACTTGGGCCACAGTGCGGACATTGAGCTATTGCCCAATCCGACTGACTGGGCCAGATGCACAGAGGTGCCCGAGTCCCCAGCCGAGTTCTGCTGTGTGCCCAATTGTGGACGCTACAAGGACGcggacgaggaggacgacgacgtgCGGCTCTTCAACTTCCCCAAGCTGCGTACACTGTCCGACAAATGGCTAAAGAATATCCGCCTCGAGGTGAACAGGGagctgctggccgagctgaGAGTTTGCAATGCTCACTTCCACATAGACTGCCTGGAGAACGGAGAGCCCAAGCTGGGAGCGATGCCCACTCTGCGACTGGGCCACACCTACCAGGACATACATCAGTCCTTCAGCCTGGCGGGAAAGCTGAAGCGTTACTGCAACCGCAATGGCTGCAGCTACGAGTGCTGCTTCCCGCAGTGCGTGGAGCTGCAGAAGAGCTTCCTGCGCATCAGCTATGATCTGCCGCATACGGAGGCCATGCGCAGCGCCTGGTTGGCGTACATGGAGATCGAGGAGACGCAGGAGCAGCGCcagatgcagcagctgaaactgTGCCCGTTCCACATGGTCGTGCTGTTCGAGCACAGCGTCGAGCACTTCCCGGAGCACACGACCGAAGTCCTGCTGGAGGACAACTATGCTGCTTCGCGAAAGAGTGTGCGACTGCGTGTCATCAGCTGCGTGGTGCGAGGCTGCAAGACGCTGCAGACTCGAGACGGCGGTATTCTCCATATGCTGCCCCACCGTCCGGATCTGTTGCAGATGTGGCAGCACAACATGCAGCTGGTCTTCCACGAGAAGCAGCGCTACATGATCAAGATCTGCGACAAGCACTTTGAGCCCACGTGCTTTACCGACAACTCGCGCCGCCTGAAGCCATGGAGCGTTCCAACCTTGGGGATGCCCGATCGTGAGCCCGGTCATCCGGACCTCTATCAGAATCCCACCGAGGCCGAGTGGCAGCTGATGAATGAACAATGGGCAAATCAGCAATCGCAGCCAATTCAGCCCGTAGAGGCGGAGCAACCGCAGGTGAAGCAGGAACTCGAATTCGAGCCCGATCACCAGGAGTTGATGGCGTACGAGGAAGATTACTCACAGTCGgagtcgcagcagccgctggaAATGGAAGCGCTTGAGGTGATGCTCGAGGTGGGCCACGTGGAGAAGTGCAAGACGTACGAGCAAATGGACACGAAGGCAGATCTCAGCTATGCAGCCGTGCAGCCGGCGCTTAGCCTGCCCACGAGTGCCATCCTCACCAGCAATGGAAGCCGATACACTCCGCGGCGCTGCAGTGTCCGCGGCTGCGACATTACCATGGATGATGTCTGCAGCAATCTACGGCTGCATACGTTTCCCACGTCGTGCGAGGCCAGGAAGAAGTGGATGCACAACACGCAGGTGCGTGTGACCAAGAGTGTCTTCTGGCGCTATCGGATTTGTAGCTATCACTTCACGGATGAATGCTTTTACGGCTCGCGCCTGAAAGCTGGCGCCATGCCCACACTTCAGCTGGGCCCCGATCATCCGCAGCAAATCTTCTACAGTGAGTTTGATACGCACCCCGAGGGCGACATGGAGCAGTCCGCACAGTTGCAGTTAGCAGGTGGCGCCGTTAGTGGTGCTGGTGAGAATGCCATCAGTCTGTGCCTGCCCAAGCCGGCGCCGCCGCACAAATCGAGCAAATTCTGCCAGGTGGAGGGATGCTCCAATCACGTGAGCAGCGAGAATCTGAAGCTGCACAAGTTCCCCCACTCGCTGGACATGTGTGTCAAGTGGCAGCACAACACGCAGGTGCCGTTCGATCCGGTCT
TTCGCTGGCGCTATCGCATCTGCAGCGCGCACTTCGAGCCCGTTTGCCTGATGAAGATGCGCATTCTGCACGGCAGCGTGCCCACGCTGCATCTGGGTCCGCGTGCTCCGGCACAGTTGTTCGACAACGACTTTGAGGAGATAGGCATGCGCTTGGACAGGCAGCAGAGCCTGTCCGAGCATCAAGTATTCATCAAGCAGGAGACACTGCACGAAACGCCTGATCAGCActgtgagcagcagcaggaagaggaggagccAGAGCAGGAGGATGACCTGAGCTTTCTGGTGCCTGAGATGCAGCTGCACGAGCAACAGCCGGAGAGTCAGGGTTACTACAACAGCTGGAGGGATCTGCGTTTGCCCAGCATTAAACAGGAGCAAACCATGACCTGCTACAATCCCGTCAAGTCCGGCTACGACAAGTGCTCCCTGGTGCACTGTCAGCGGCAACGCGCTCAGCATGGCGTCCACATCTACAAGTTTCCACGCTCgcgccaactgcagctgcagtggatGCACAATCTGCGGATCCGCTACGACGAGCGACGCCCCTGGAAGACGATGATATGCAGCATGCACTTCGAGCCGCAGTGCATTCGTCTGCGCAAGCTGCGCCCGTGGGCGGTGCCGACGCTAGAGCTGGGCGACAATGTGCCGCTGGAGCTGTATACGAACgagcagagccagcagcactTTGGAGAGGGAGAAGCGGATGAGGATGCAGAGAATCCCATGCTGTGTGAGGAGTCCGTCGAAGATGAGGACTACGCTGACCATGATGATTATGAAGATGAAGACGGTCATCACGTGGATGATGACAGTGATAGCGTTTTGCCACAGCCAGATTGGAAGATGCGGCCAGATCCTCGCCTGCTGAAGATTAAAACCTGCTCATTGCCCTACTGTCGCCGTCCACGAGGCGACGGCATCAAGCTCTTCCGACTGCCCAGCAGGGTCAACGACATACGCAAATGGGAGCAGGCCACGGGCATGCACTTCACTGAGTCGCAGCGGAACACGAAGCTCGTCTGCAGCCGGCACTTTGACCCGCAGCTGGTTGGCGTGAGGCGCCTCATGTTCAACGCGGTGCCGACGCTCCACTTGGGTCCAGATAGCCAAACGGAAGATGAAGCCTTGCCAAGACCGCGCTGTTTCGTCTCCGACTGCCAGGAGGatgtgcagcagcggctgcacaAGTTTCCAGCGTATCCTCAACTGCTCAGGAAGTGGTGCGAGGCGCTTAATCTGCCGGCAACTGAAAACTATAGTGACAAGTACGTTTGTGCCGCCCACCTGCCCCCGCCGGCCGCTTGCTGTCTCCTCTGCAGTgcggaggagctgcagctgcccttGCTGGACTTTCCGGAGAGCCGCAATCAGCGCGCCAAATGGTGCCACAATCTCAAAATCAATCCCATACCGAAGTGGGACAGCTCGAAGCGCATCTGCAGTCGGCACTTTGAGAGCTACTGCTTCGTCCAGCCCGGCCAGCTGGTCGATGAGGCGGCGCCCACGCTGCATTTGAAGCACACGGACACAAACATATTCCTCAACGACTACGCCATGGATCCCAGCAAGCTGCTGCGCATCAAGGACGAACCCTTGGAGAGCGACGATCTGATGTTGTAA
Protein Sequence
MSQHNNNNPLSPPHLQHHHHHYYQQQQQQQQQQLQQQQQQQQQQHHHTQQQQQQPHTQLQHKQIQQQHSWYSHVASYPPHHPHAGAFASPCKANNNNTNTNTNANASSSNNNNDNNSIMNAYGAAAGAAAYYGPAAGGVVGGYNLESNNVAYAHNQLLQYQQQQQQQQQQQHQLNQRSYMQHGLMHSAYPYIKSEPLELPDDRQRQQQQQQQQHQQQQQQQHYQSPMAPPPAPAANRHALDASGEMIIKSEPIEEHAYKSSYIDDNTPFADFSKFPEFGDDMLNPKVELTVKDEAYGNQKNALNYPRRKLQSERASESLPICQRCKEVFFKKQVYLRHVAESNCSMQEYDFKCNICPMSFMSTEELQKHKHLHRVEKFFCHKYCGKYFDTIAECESHEYMQHEYESFICNMCSVTFATREQLYAHLPQHKFQQRFDCPICRLWYQTALELHEHRLAAPYFCGKYYAGVQQQQQQQRVQHQQQQQHHHQQNTNYKLQDCHMATMEMQTTHHKANASAALPATAALNSLLQQRQANADGAALFASSMKNEVNVKLERSYSNSTSESGYSLHDSSYNAAYGSDTSLHGTAAAGAIAGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYATFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTASTPTQLQQQQHQQHQIQQQHQLQQQQSSYHQGKYQRHSAASTSSSASSASHYVDAEQGASYMGMGSSVGSTSGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDDDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVSLLSKWCANLQRPVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLVLGHENIAYPLPTPEQVAEFYARPSVPNNGDEQGECCVETCKRNPSVDDIKLYRPPEESSALAKWAHNLEVDVAQLPSLRICNLHFESHCIGKRMRPWAIPTLNLSSNIENLYENPEQHMLYKRRTHLKAERAATASAGAGMTKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLELNAPPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRGPGVQLFRLPHSPTQLRKWMYNIRMRARGAIRQQYRICSTHFESHSFNGKRLSAGAIPTLQLGHDDEDIYPNEAQSFVEEHCAIEGCDASKEQPDVRLFRFPNDDEDLLWKWCNNLKMNPVDCVGVRICNKHFEPDCIGPKHLFKWAIPTLQLGHDDAQIELIPNPKPEDRYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPELFRRWRHNLRLEHLNFKERERYKICNAHFEDICIGKTRLNIGSIPTLELGHDESEDLFQVNPAELQSNLFGRQRRVQDSMDVAIKQEETSDIEEEEDSVKPTHLAMAEATDLNTSQVKIKKVLDDYKCCVPTCGRSRLEHGARLFPFPNGKQQQCKWRHNLRLEAHEVDRSSRVCSAHFNRRCIDGKQLRAWAIPTQQLGHEEQPLYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRGEELLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEGVVFDNEAIKAGLQQDAAEAEPEEGDVSRESLLKIKTERLDSDEQEDHAPLDNEADDEEHEEVPDDHEDHGYFDPLELVETYAEDDENADIPGDDDELLLPDTL
QVKLENPYPVLRREKTVNNVTPICCLKHCRKERTAKHQLSTFGFPKSRQQLLKWSANLQLSLEDCVGRVCIEHFEAEVLGTRKLKQNAVPTLNLGHDTPLIYKCNGLPTGLGIFDEQPQHSVFRLWSLKHCRKRKLQAMEPPDLLPAAIKSEHRCCLPHCGKLPVLHNVKLYRLPRNLSHLHKWLHNLRLPERVQVSSETRVCSDHFELQPGEALVDSLPTLKLGHSDSDVYRTCNTATCSSCLVPSCPCARLNLYRGCDLPAHPLVQQSWLQHLQLPKPQLASDGQLCVMHFMQLYEQVPLPQELPAAVLRRLQETYEQIASSPMAMKLRCAVPGCSSKYTDNIRLTKLPACQDMFAKWLHNTRINYESSKRHLYRICMLHFEHSCLGQVRPKAWAVPTLQLQHDDANIYRNPKQESGIEETQDQSQDSLLQPVPVQLPLRIKTDLAHSGSPSASASPSPRGKQCTCCIASCAQQASAQTRLFRFPTNEMLLLKWLVNTQQKPRLVDPQHLFVCQHHFEAEAICKKQLSSWAVPTLHLGHDGHVIPNAKHNGNITDRQENREALQFIWANYCSVLTCLERRSEQLRLYAYPQDRPSIRKWAANCKHRSTLASSHGFQVCQSHFEPHCFDPDTGELREDAVPTLQLSCITTTVNCVVPGCVEEENPLGHRFFKVPRHPTQLIEWCHNLQLNPAVMMLEKSFVCDRHFEKHSLNAQRLPRRGARPTLHLGHSADIELLPNPTDWARCTEVPESPAEFCCVPNCGRYKDADEEDDDVRLFNFPKLRTLSDKWLKNIRLEVNRELLAELRVCNAHFHIDCLENGEPKLGAMPTLRLGHTYQDIHQSFSLAGKLKRYCNRNGCSYECCFPQCVELQKSFLRISYDLPHTEAMRSAWLAYMEIEETQEQRQMQQLKLCPFHMVVLFEHSVEHFPEHTTEVLLEDNYAASRKSVRLRVISCVVRGCKTLQTRDGGILHMLPHRPDLLQMWQHNMQLVFHEKQRYMIKICDKHFEPTCFTDNSRRLKPWSVPTLGMPDREPGHPDLYQNPTEAEWQLMNEQWANQQSQPIQPVEAEQPQVKQELEFEPDHQELMAYEEDYSQSESQQPLEMEALEVMLEVGHVEKCKTYEQMDTKADLSYAAVQPALSLPTSAILTSNGSRYTPRRCSVRGCDITMDDVCSNLRLHTFPTSCEARKKWMHNTQVRVTKSVFWRYRICSYHFTDECFYGSRLKAGAMPTLQLGPDHPQQIFYSEFDTHPEGDMEQSAQLQLAGGAVSGAGENAISLCLPKPAPPHKSSKFCQVEGCSNHVSSENLKLHKFPHSLDMCVKWQHNTQVPFDPVFRWRYRICSAHFEPVCLMKMRILHGSVPTLHLGPRAPAQLFDNDFEEIGMRLDRQQSLSEHQVFIKQETLHETPDQHCEQQQEEEEPEQEDDLSFLVPEMQLHEQQPESQGYYNSWRDLRLPSIKQEQTMTCYNPVKSGYDKCSLVHCQRQRAQHGVHIYKFPRSRQLQLQWMHNLRIRYDERRPWKTMICSMHFEPQCIRLRKLRPWAVPTLELGDNVPLELYTNEQSQQHFGEGEADEDAENPMLCEESVEDEDYADHDDYEDEDGHHVDDDSDSVLPQPDWKMRPDPRLLKIKTCSLPYCRRPRGDGIKLFRLPSRVNDIRKWEQATGMHFTESQRNTKLVCSRHFDPQLVGVRRLMFNAVPTLHLGPDSQTEDEALPRPRCFVSDCQEDVQQRLHKFPAYPQLLRKWCEALNLPATENYSDKYVCAAHLPPPAACCLLCSAEELQLPLLDFPESRNQRAKWCHNLKINPIPKWDSSKRICSRHFESYCFVQPGQLVDEAAPTLHLKHTDTNIFLNDYAMDPSKLLRIKDEPLESDDLML

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-