Basic Information

Gene Symbol
-
Assembly
GCA_958296145.1
Location
OY282514.1:76200539-76215580[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 34 2e-15 3.8e-12 47.3 1.1 1 86 462 534 462 535 0.84
2 34 2.4e-15 4.4e-12 47.1 4.9 1 86 562 630 562 631 0.80
3 34 3.3e-15 6.1e-12 46.7 0.3 1 87 652 724 652 724 0.82
4 34 3.8e-14 7e-11 43.3 2.7 1 86 803 871 803 872 0.78
5 34 4.8e-16 9e-13 49.3 5.9 1 87 896 968 896 968 0.81
6 34 4.5e-13 8.3e-10 39.8 0.5 1 87 1003 1071 1003 1071 0.81
7 34 1.6e-12 2.9e-09 38.1 4.2 1 87 1112 1182 1112 1182 0.77
8 34 6.9e-16 1.3e-12 48.8 1.2 1 86 1209 1278 1209 1279 0.81
9 34 3.8e-14 7.1e-11 43.2 0.2 1 87 1301 1371 1301 1371 0.80
10 34 2.1e-13 3.9e-10 40.9 2.8 1 87 1399 1471 1399 1471 0.86
11 34 2.5e-05 0.046 15.0 0.0 1 70 1542 1598 1542 1615 0.71
12 34 9.8e-11 1.8e-07 32.3 0.3 1 87 1638 1710 1638 1710 0.80
13 34 5.6e-13 1e-09 39.5 0.9 1 85 1740 1810 1740 1812 0.81
14 34 4.9e-13 9.1e-10 39.7 0.9 1 87 1858 1930 1858 1930 0.81
15 34 3.5e-14 6.5e-11 43.4 1.4 1 87 1953 2022 1953 2022 0.81
16 34 7.5e-14 1.4e-10 42.3 0.1 1 87 2216 2285 2216 2285 0.80
17 34 1.7e-13 3.2e-10 41.2 1.2 1 86 2338 2419 2338 2420 0.80
18 34 4.9e-12 9.1e-09 36.5 2.0 1 87 2453 2526 2453 2526 0.79
19 34 8.5e-14 1.6e-10 42.1 1.3 1 87 2552 2623 2552 2623 0.80
20 34 1.1e-14 2.1e-11 45.0 1.1 1 87 2646 2716 2646 2716 0.84
21 34 2.3e-15 4.3e-12 47.1 2.1 1 86 2737 2810 2737 2811 0.84
22 34 3.9e-05 0.072 14.4 4.3 1 61 2828 2877 2828 2897 0.74
23 34 1.1e-12 2e-09 38.6 1.7 1 86 2908 2977 2908 2978 0.82
24 34 3.8e-14 7.1e-11 43.2 2.4 1 87 3003 3075 3003 3075 0.79
25 34 5.7e-12 1.1e-08 36.3 1.0 1 86 3095 3165 3095 3166 0.78
26 34 2.4e-10 4.5e-07 31.1 1.5 1 87 3186 3255 3186 3255 0.77
27 34 5.3e-11 9.9e-08 33.2 3.4 1 87 3498 3571 3498 3571 0.80
28 34 1.3e-07 0.00025 22.3 0.6 1 86 3594 3663 3594 3664 0.73
29 34 1.2e-14 2.2e-11 44.8 0.6 1 86 3695 3766 3695 3767 0.83
30 34 9.4e-05 0.17 13.1 0.3 1 59 3803 3852 3803 3869 0.79
31 34 9.4e-12 1.8e-08 35.6 4.2 1 87 3895 3972 3895 3972 0.81
32 34 1.7e-14 3.2e-11 44.3 0.6 1 86 3996 4067 3996 4068 0.83
33 34 1.4e-15 2.5e-12 47.9 4.2 1 87 4234 4308 4234 4308 0.81
34 34 4.7e-11 8.7e-08 33.3 0.5 1 86 4327 4394 4327 4395 0.80

Sequence Information

Coding Sequence
ATGAAAGAGGAACgtaaacaccagcaacaacaacaacgagtaCAACAGGCACCATCGGCACAAGagcaaaaacatcatcagcagcataatgctcatcatcagcatcatcatcaacaacaacaccatcagcagcaacaccaccaccaacaacagcaacataatCATCAttatcaacaacagcagcagtcaCAACAGCCTCAACCTCAGCCTCTGTCACAgccgcatcatcatcatccacatCAACATCATGCGAATGAAAATAtatcacagcaacaacaaagacaGCAGCTTCAGCCGCCgccacgacaacaacagcaacagcaaccacCGGCCGCAGTAAGCACAGAAAATTCCAACATACCATCAACGGCTGAGGCGAAGCAACTACAAACATCCTTGGCTAATATTAAGACCGAACCAAAGcCCCTGAACTTTCCTCGCCGCAAATTACAAACAGAACGTTCCTCCACACTGCCCATATGCCAGCGATGTAAACAGGTCTTTTTGAAACGTCAGAACTACACACAACATGTGGCTCTATCCTGTTGCAATATTGTCGAATACGACTTCAAATGTTCCATCTGCCCCATGTCCTTTATGTCCAACGAGGAGTTGCAGGCCCACGAGCAATTGCATCGTTTGAATCGGTATTTCTGTCAGAAATACTGTGGCAAATACTATGAGACCACTGAGGAGTGCGAACAACATGAGTTTGGCCAACAtgaatatgaaatgtttaaatgtaaTATATGTTGTGTGTCATTTCCGAAACGTGAACAATTGTTCGCCCACCTAATGGAGCATCGTCAGCAGCCGCGTTACGATTGTTGCATTTGTCGTTTATGCTTTCAGACATTGTTGGATTTGGAGGATCATTATGTGGGGAATCCGGATTTTTGTGGCAAATTTTATGATAAGGAAGgttttaaaaatctgaaaattttttccaaacccAATAAAACTACAACACAGCAAACAACAAATCGTTCGGAGAATCTCACAAGTTTCATGATAAAGGATATAACCTCTATCAAAATGCCGGAAGCGGGACCTTTACCttctaaacaacaaaacactcaTCATATTTCTCCGTCGGCAAGTTTTGATGACATACCAGATTTCGCGGCACCGCATGTAGAAGTTAAGACCGAAATTAAAGTAGAACCCGATTTTTATCCACCTATGGATCAGACTGATTTTGCCGGTTTCGATAATGATTATTCCAATTCGCAAGACTTTTCGCAGAGCTCAAATCAGAATCTAACATTCTTACAAGATTTCCACGACAATGCTTCAAATTCCACCAACTCATCGTACTCATTCAATACTACCGGTTCGACCAGCAAAAGCAATGAAGCCACCAATCAGGAAGAGGATGCAATTTGTTGTGTGCCAAAATGCGGCGTACGAAAATTTACCTCGCCCACATTGCAGTTCTTTCCATTTCCCCGCGATGAAAAGTATCTCTCACAATGGCTGCATAATTTGAAAATGACCTACGATCCGAATGTGAATTATGGGATATATCGCGTGTGTAGTTTGCATTTTCCGAAACGTTGCATTGCCAGATATTCGTTGAGTTATTGGGCTGTGCCCACTTTTAATCTCGGCCACGATGATGTGGGTAATTTATATCAGAATCGCGAGAGTTCAGGGGGGTTTCCGTCGGGTGAAATGGCCCGCTGCTATATGCCCGGCTGTCAGTCGCAAAGAGGCGagacaaatgtaaaatttcacAGTTTTCCCCGAGATCTGAAGACTTTGATTAAGTGGTGTCAGAATTCACGCCTACCCGTTCATAGCAAAGAGAATCGTTTCTTTTGTTCACGTCACTTTGAAGAAAAGTGCTTTGGCAAATTCCGTTTGAAGCCTTGGGCAATACCAACACTTCGCTTGGGAACGGTATATGGAAAGATTCACGATAATCCCAATATTTATCAGGAAGAAAAGAAATGCTTTCTACCGTTTTGTCGACGCAGCAGATCATATGATTGCAACTTGTCCCTATACCGATTTCCCCGGGATGAGACATTACTGAGGAGATGGTGCTATAACTTAAGGTTAGACCCTGAAATGTATCGTGGAAAAAACCATAAGATTTGCTCATCTCATTTCGTTAAGGAGGCCTTGGGATTAAGGAAACTGAATCCCGGCGCTGTGCCAACAATGAACTTAGGCCATAATGAtcgtttcaatatttacgaAAATGAATTGTATACACCGCCgccacctccaccaccaccacaaccctCAACATCGGCCAAGGCCCAAAAGTTCGCTGAAATGTTCAAACAGGAAATGGGATCGGCTTCCGCCATATATGATGAGGTCTTCATGAACTCTATGATTCAGAAATTCTCTGGTTCGTCATCAGCAAATGCCTCCAATCTAGACTTGGGAGATGTGTGCTTGGTACCTTCATGTAAAAGAACCCGGCATTCGGATGATATTACTCTGCATACCGTGCCCAAGCGGGCGGAGCAATTGAAGAAGTGGTGTCACAATTTGAAAATGGATTTGGAAAAAATGCACAAGAGCGTACGTATTTGCAGTGCTCACTTTGAGAGCTATTGTATCGGCGGATGCATGAGACCTTTCGCGGTACCCACTTTGGAATTGGGTCACGACGACACCGATATATACCGTAATCCAGATGTCATTAAGAAACTAAACATTCGAGAAACCTGTTGTGTTCCTTCGTGCAAAAGGAATCGCGATCGCGATCATGCGAATCTACATCGTTTCCCTACTCATCCGGAACTTTTGCAGAAATGGTGTGAGAACTTACAGAAACCCGTTCCGGATGGTACGAAACTTTTCAATGATGCCGTGTGTGAAGTCCACTTCGAGGAGAGATGTCTGCGCAACAAGCGGTTGGAAAAATGGGCCATACCCACAATGAATTTGGGCTACGACGATTTGGCGCACAACTTACCCTCCGAAGAAGAGATATcggagttttggacaaaaccttttGCCCCCAATAATGGTGACGAACAAGGCGACTGCTGTGTATCCTCCTGCAGACGCAATCCACAAATTGATGATGTCAAGTTATACCGACCGCCTGAAGATGCTGAACAATTGCTGAAATGGGCGCACAATCTTCAACTCGATGCAGCGGAATTGCCAAATCTTAAGATTTGTAGTCTACATTTTGAGTCTCACTGTATAGGAAAACGTTTATTGAATTGGGCAATGCCCACGTTAAATTTAGGCTCTAGGGTGGAACATCTGTTCGAAAATCCTCCGCCTACTCAAGTCGTatataagaaaaaagaaaaacatggacGTCTATCTTCCAATCATGAGGTCATCAAATGGGCTCCGAGATGTTGTTTGCCTCATTGTCGCAAAACGCGCACGCAGGATAACGTGCAGCTCTTCCGATTCCCTTACGCTAACCGACAAACGCTGGCCAAATGGTGCCATAATATACAGTTGCCTTTGGTGGGTAGTTCACACAGACGCATTTGTTCGGCTCATTTCGAAACAGCGGTTTTGACAAAGAGATGTCCCATGAATATGGCTGTGCCAACGTTGCATCTCAATACTTCGGCCGGCTATAAAATCTATCAGAATCCTGCGCGTCTGAAACAAGTCAAGGTGGGAGTTCAACGACAGTGCATCATCGAATCGTGCCATAAAACCAAAGCGGATGGTGTGGTACTCTTCCGTTTTCCCAACAATCGCACAATTCTACAGAAATGGCGGCACAATATCAAAAATTGGCCCAAATGTAAATTGAGTTCTCAGTTAAGAGTATGTTCCGACCACTTTGAATCACACTCAGTGGGCGGAAAACGTTTATCGCCCGGTGCCATACCGACATTGAAATTGGGACACGACTCCGACGATCTATATCCCAATGAAACACGGTCCTTCTTTGATCTGGAAAAATGTGTGATCAATGGTTGTGAGTCAAGAAAAGATATGGAGGATGTCCGTCTGTTCCGTTTCCCCAGAGACGATGAGGAGCTTCTgcagaaatggtgtaataatttgaaaatgaattcaTTGGACTGTGTGGGCATACGTATCTGTGGTAAACATTTCGAGGTCGAATGTCTGGGTCCCAAACTTCTGTACAAGTGGGCAATACCCACTCTAAATCTGGGTCACAAAGAAGAGGACAACGtggaaataatacaaaatcCACCGCCCGAACAAAGATCGGGAGagtatattttcaaatgttgcGTACCGAGCTGTGGAAAGACGCGTAAATATGACGACGCCCAAATGAATAGTTTTCccaaacatttgaaattgttccGCAAGTGGAAACATAACCTCAAATTGGACTTTCTCAATTTCAAAGAAAGggaaaagtataaaatttgCAATGATCACTTTGAGCCGGTATGCGTTGGTAAGACCCGTTTGAATTTCGGAGCTCTACCCACCCTGAACTTGGGCCACAATGACAGTGAAGATCTGTACAAAATTAATCCCGATCGCATACGTcccaatttgtttattaaacaaaagGACATAGAACGTATGGAAAGGAAACAGATGCGATTGGAGGAAATGAAGGTAAATATGGATATGGATGAACAACAAGATATGGATCAGGACCAAGATGACGATGCCTTGGATCCTCTAAGTACACCAGCAGAATGCTGCGTGGAAGACTGCAAGGCTCCGAAATCTATAATGCGGGAACCTTACGATTTACCCGAGACACTGGAATTACGAAAGCTCTGGagtaaagaaataaatagagaTGTTGGCGATTTATCGGCAGACAGCAAATTGTGCGGTTTGCATTATCAACAGCTCTTCAGTGACCTAAAAGACGAAATGGAAGCcttaaaagaagaaaatccagACGTGAAATTAGATTATGGCAAACTGTTTTTTGCCTATCAAAAATCAGAAATCTCGCTGGTCATCAATGGTTTCCAGTGCAGAGTTGAAAATTGTCCCACAAATTTACTGAATTCCAATCATCGCTTGTATTTCTTCCCATATGGCAAAGATATTATCAGCAAATGGACGTACAATACCGGCATAATACCCGATGAGAATCGGCGCTACATGAACAAAGTGTGCACTTTGCATTTTGAACCCTACTGTGTAACCGAAACTCAGCGCCTAAGATCGTGGGCCATAccaactttaaatttaaatcactCCGATCCGGAAAGCATCTACAAGAATCCTGATCTAACACGCATCGATAGAAGAATGCTAGGGCCGCAGATATTGAAATGTGCCGTTACAAATTGTGATAGTGCGAAGGCGACGGAAACAGCGGAAACGACCAAACTGTTTAACTTCCCCACAGACGATGTTTTGCTGCGCAAATGGTgtgtgaatttgaaaatgtcacggCATCTGACACCACTGCTCAAGATCTGCTCTTTACATTTTGAGAAATTGTGCTTTGGCAGTTCGCGTATCCGTTCCTGGGCTATACCCACAAAGATGTTGGGCCACGACGCGGAACCGGAATATTTCAATAAGACCACAATCAAACAAGAAGTCTATGGGGAGCCATCAAACAAAAACgaacaactacaactgcaacaaGTGAAAATCAAGAAGTCCCTAGACTCGGTGAAATGTTATGTGGCCTCATGTCGCAGGTCACGCCTGCAGCATGGAGTCCGATTTTACGGCTTGCCCGTCAAcggcaaaatgaaaaagaaatggtTGCACAATCTTCAAGTTCCATCCAGCAAAGCCGGAAAGGTTTTGAATTTAAGAATCTGTAATTTGCATTTTCACAAGCGATGTTTGGAAGGCAAACACTTAAAGACATGGGCAGTGCCCACCATGCATTTGGGTCACACCGAACCCATTTTCGATAATCCTCGACGATTGCAAAACCCCTTGACCGTACAACGTTGTGCTTTGCCACATTGCAGTAATCATTCCATAGGAAATGAACATTTACGTACATTTGTATTTCCCAAATCCACGGAATTCCTTgaaaaatggtcgaaaaatcTCAAGTTAGACGTGTCCAAATGTAAAGGTCGCCTGTGTCACGAACACTTTGAAGCCGACGTTAAAGGTGAGAAGAAGCTCAAGAACGGTGCTGTGCCCACTATAAATTTGGGTCACGACGACGAAATCCCCTTCAATAATTGTGAGCTGATAGGCAAACTTGAAATAAATTCCACTCAAAAAGAAACGGAAGCCACAGGTGAACAAACGCCGTACGAAGATGATGAAATGGGCGAAGACGAGGAGGAGGAAGAGGAATTCGAGGATGATGAATATGTAGGCGACGGTGAGGGAATAGAAGATAATGAAGAAGATGAAATGGAAGACGACCATGATGAGGATATGGATAGAGAAgaggatgatgacgatgacgacgaagaTGAAGAAGATGTGGATATGGACAAAGTTCGCATTCGTGGTACTCTGCAGCACTGGAgttcaataaaaatgaaagaattAAGAGTTACACTTGTACCCATACGACAAGAAGACATTCTGGAAATGTCCTCCGTGTCGTCGTATGAACGTGATCGCCGTTCCATAACACCGGCCAACAGCATCAGAGATCTTCGTTCCGAAACTCCAGCCAGTATTGGTGGTGGTCACACTTCTTCCGAATATAACGATGAAAATTCTTCCAGTACGCCTTTAAGAACCGACAAACCCCTAAATAGCATAGCACCCATGTGTTGCCTCAAGCACTGCGGCAAAGAAAAAACTCCCGAGCAGCATCTTACCACATACGGATTCCCTAAAGATCCGCTACTCTTACAGAAATGGTGTGACAATTTGGGCCTACAGCCCGAAGAATGTATAGGCCGCGTGTGTATTGACCATTTTGAGCTACGGGTGATAGGAACGAGAAGACTTAAGCTGGGTGCCGTTCCCACCTTAAATCTGGGTAGCTCTCGAGTGCCGAAACACACCAACGATGAGCCAAAGAAAATTGTAAGCAACGATTCCGAACAAAAATTACCAGATAATGAGCAGATACTTACACCACCTCCGCCGTATTGCAATCCTAAATCAGGCAAACAATCGGTTTTTCGGCTATGCTGCCTCAAACACTGTCGGCGCAAGAAACAACCCGAATCGGAGATGGATCAGAGCCATGAATTAGCACAGCCACTGCTGCTATTTAAGTTTCCCACGGAGTATGAGACTCTCAAGAAATGGTCGGCGAATCTACGATTACCGGAGAAAACCTGCGGCCGCAAAGAATTGCGCGTGTGTTCCAAACATTTTGAAACCTTCGTTATTGAGGGCGACACTCTCAAGCCCAACGCTTTGCCGACTTTAGACTTAAGTTATTCACAACGCCCtccagtttttaaaaacaatcgcAAAGACTTTGAACAGAAACCCTTAGAAGTAGATGCAACGACCGAAACGACAAAGTGTTTTCTGGCACACTGTGGTAAAACAACCGAAGACCCTGAAACCTTTCTAATCAATTTTCCGAAACATGATCTGTCGATGCAGCGGAAATGgtttaaaaatctcaaattgGACAGTAAgattcaaaaatataaacatttaaagaTTTGCAATCACCACTTCGAGACGTATGCTTTCTTTAAACAGCGCAATCTTAAGACCGGAGCCGTGCCCACCCTAAATCTGGGACATACAGATCgcatttgtaaaaatttaccGAAACTACGCCGAAAAGTAAGAACACAGCCCATAGAAACCTGTTGCATTAAGACCTGTGGCAATCAGGACCAATCGCGGAAACTCTACGCCTTTCCCAAGAACTCCGAGTTAAGACGAATATGGTGCAGTAACTTGCAGATTGAGCTGAGAGAAGCTTTAAGGTGCCACTATAAACTGTGTGGTACACACTTTTCCCCAGAGAGCTTTGAGGCCGCTACCGATATTCTAAAGATAAACGCCGTGCCCTCATTGAATCTGGGCGTTGAAGCAGACAAACTTACGGTTTTATGCAAACTAGAGACTGAGAACGATAAATTCAAATGTGTGGTGGAAAGCTGTCAAAAGTCCGCCAGCGTTGATAAAGTCAAATTGTTTGGCTTTCCCAAATCCAAAGACCTTCTCAAGAAATGGTTATTCAATTTGAATCTCTCTCCCGATATAGAAGTAGACCAGACGCGCATTTGCAATCGCCATTTTGAGAAGATGTGCGTCAAACATGGCAATTTGCATGAAAAAGCCGTACCCACTATGTTCCTGAAGGCTAAATCCTGGATCTACCAGAACGATGACGATGTGTTTGAGGAGAACTATCAGTGTTGTGTATTAAATTGCAGCTACCATACCAACGAAGAGGAGTACCGGCCGATGTACCGGTTTCCCAAGCAAAAGGTGGACATAGACAAATGGTTGCACAATCTAAGACTGAAATTCGAAGACGACCAGACAGAAGTTAAAGATTTGCGTATCTGCTCAGTACATTTCGAAGATACATGCAAAACTAAGGAACATTTGCAACCTGGCACTGTGCCCACGTTACAGCTGGGCCACGAGCAAATGGAAGACATTTACCGCAATCATATTCAAAAATGCTGTATAGAGAATTGCTGCTGGACAGGCTTTACGTGTCATAAGCTACCCGATGCCGAGACACTTAAGAGCTGTTGGCAAAAGACTTTGGACCCAGAATCCCGCGTTAATAATTCCGAATATATTTGCTCAATACACTTTGTGTCTTGCTATGATCGAAGTGGGGCCACCCAAGAAAATGAAACTCTAAGGGAGCTGTACGATCAGCTACGGATATTGCCGGAATTGTCCACATTCAAGTGTTCGGTATCGACGTGCGAAACgggatttaaattgtacacgaagctttttaaatttcccaaagATGTTACACTTTTCCAGAAATGGTTACACAATTCTTCGCTGACCTTTGCTTATGCCGATCGACCGCAGTATCGTATATGTGCACAGCATTTCGAAGAACGCTGTCTGAGTGAGAAGAAGCTACATCGTTGGTCGTTGCCCACACTGTCCTTGCCATTCAATTTAAGTTTATATGTGAATCCACCGGAGGCTTTACCATCGAATCATGAAAATCTCAAACATTGCTGCGTCGCCTCCTGTAATACCGAAAAAGGACCCTTCTTTAAATTCCCCGCCAAATCCAACGACGTCAAGAAATGGATTCATAATCTAGGCCTGGGCACCCAACAGTGCACTTTGAATCTGAGAGTGTGTCATAGGCATTTTGAAAGTTACTGCCTGTCCAGAGATGAGGGGGgaagcattaccaaactgaagCATTGGTCGGTGCCAACATTGAATCTAACACGAACATGCGATCTAAATCTCAATCCTCCGGAGAAAGTCGATTATTTTGCCTGCTGTGTATGCCgtgaattacaaaacaaagcaGAAGGCCTGTATCTCTTTCGCTTTCCCACCAGACTGTCCAGTTTTCTGAAGTGGTTGCACAATCTCCGATTGCAACGTCAAGACTATCGCGACAGTATGCGCATTTGTATTAAACATTTCGAAAGCGATTGTTTCGACAAGACTCTGAAGCTGTTGCGTAAACATTCTGTGCCCACCATAGGTGTTGCCTGTTCCAGCCAAGATATGTTTACGAATCCCTCGCGGCGCCCCAACTCCAAATGTTGCATACCATCGTGCGATGGGCCATGGTTACACTTGAATGTATTCCCAAAAGATAAAATGCTGCTAAAGAAATGGTGCTTCAATTTAAATGTTGGTGAAGGTGATTTGGAAACAttaagaaattggaaaatttgccAGAAACATTTTGAGAACAAGTGTCTGAATGCTTTCGGTCTGATAAGACCCACAGCTGTGCCTACTTTAAATTTGGGCCatactaataaaatatttaaaaacactaGGTTCACAAAGAAAGTAAACCAATTGAAGGTGAAAAATAGCACGGCCGTAAAAGCACCCCGAAGATTCCTATTAAAGAAGGGAATAAAAATTGAAGAACAAACGCCTAACACGCCGGTAAAGCGAAAATCAGCCAAGCTTTTGGAGAATGTAGTAGCAAAGAACCCCAAGAGAGCAGTAAAAGCTTCCAAATTCTTGTTAAACAAGAAAATCAAGATTGAGAATCAAAAGTGTGCCGGGAAATCAAAACAATTACCTGAAGTAGCAGATGAAATGCCAAGTGCCGAAGCGTCGGCTGAAGATTTGGATAAATATGAGACGTTAGGTGATTACATCAgcgcaaagaaaaaaatcaaagcatTGCGCAAGAAACAAACAGTTGACATTACAAAGAGCTTGCCGAAAGTAAATAAGAGCCGGCAGATAAGTTGCAAGTGGTTACCGTTTAAACCTGAACCGGTCGAACGGCCTTTACTGGATATAATTATCGATAACGGAACCTACGAAGCCTCAAATCCGGAATCGAATATACTGCAAGATTTCGCAGTCAAGCAAGAATTCTGTAATGTGACAGAAAATGCACTCGAGGGCCTAGTGGTGAAGATGGAACCCATCGATGAAACTACACAAAAGTTCCTGGAAGGACTCAACGAAAGGCCATCATCTGCACAGAGACTGGGTCTACATCACAAATGCTCCATATCAAGCTGCACGAATGTTTCCAGCATACCTGGGTGTTCGTTTTTCAAATACCCACCAAATCCACGAATTCGTACGATATGGGCTCGGCATTGCCAGGAAACCTGCAAATTTATTATGACTCTGACAAAGAAACGTCGAATCTGTGTGGAACATTTCGAAGATCAGTGTTTTATTGAGAAACGCCTGTTACTGGGCGCCATACCTACGTTAAACCTAGGCCACCCTGAATCAAATTTAGATGATTGTGCCGAGGTGTTGAAACGATTTCGCAGCTTAAGATGTCGCGTGGACGAATGTCAGCGATCGGTAGAGCTAGAGCAGGTAAACAAAATACCCTTCCCCCCTGGTGAACTGAAACAGAAATGGTGTTTCAATTTGAATCTAAATGAAGCTGATATTGCAGTCGCGGACTGGATTTGCCATAGACATTTCGAGAGGCGAGTCCTCATCCGAAGTAAACGTATTAAGGACAATGCAGTGCCTACTCTGTTACTGGGCTCTCAAGCCAAACCGCAACATGAATTGTATAAAAATCCTGAATATATtaaccaacaacaatatttcattCAACTGCGCCAAATGTGTTGCGTATCGGCCTGCACGAATACCAGAAAAATGGCCGGCGTTCGCCTGGCCACCTTTCCCAAAAGAAAAGACATTTATGAGAAATGGTTGCACAATTTAGATTTAGATGATTCTCCGGAGGTACGCAATGGTTATCACATCTGCTGGCAACATTTTGAGGACAAATGTTATACGAAATATAACTACTTGAAGATAGGTTCCGTACCTACATTAAAGTTGGACAAGCACGAAGATCTGATAGGCCTCGACGAAAGCCTCTTAGTGGACGTGGATAGTTTTGGCAAACGAATCAAATATCTACAACAACACAAGAAATATAAATGCTTCCATCCACAATGTACAGAATGGAAGACACATGTGTACTCGGCACCGGCAATTGACCAGCTGAGAACGTTATGGCTGAAAAGCTTACAGCTGGAGCTTTCCCCAAGTGAAGAGCCAGAAGTACATTTCTGTGATGAACATTTCTATGAGCTGTACAAGAACTTTGAAGATCACCTGCCACTAGTGGACAGTGACATTTACCACAACGAGTTGGAAGCTCTGGAGAAAACATTCAAAGAGCTGGcggcaaaaaataaattcttcacCAAACAGTGTTTTGTACCTCAATGTCCCACGGATCTCTACTTCAGGCCCTACAAAGACAACAAATTGTACAACTTTCCGCATAATTCCGAGTTATCGCAAAAATGGTGCTATAACTGCAACATCGATTACAGCAAGCTGGACAAGGATAAATTGAATTATTACAAGATCTGTGACAGACATTTCGAAAGCTATTGCTTCAGCAAACGTACCATATTTTCCTGGGCCTTGCCCACATTAAACTTGCCGGACACCCGGCCGACACAGATTGTGGAAAATGATCCGGATGACAAAAACGCCTACACGGGAGAGTGTTGTATACGCTCGTGTATAAATGCCAATGGCTATAAACTAGAATCCAAAACTAGATTATTTAAATTTCCCGAAAACACGGACATCTTGGCCAAGTGGTTAAATAACATCAATTGTGAAATCTTAAGCGCAAATGATACGAAAATATGCGGCCTCCATTTTCGCCAGAGCTATATCAAGAAGAGGAAACTAACGGATGAAGCTGTACCGACATTGCGACTGGGTCGCTTTGATGGGTCGGAGATATATCCGAAAGTCATCAACGATACGGGAATATTTATTAAAGCCGAACCAGCGGATGAGTTCTTGTTGGATGAAGGTCTGTCCTACGACGATGACGAAGAGGAGGAAAAAGAAATGCTGGTGGAGCAAAAGGTCAATGTTAACGACGGCTGGAATGATCATGATTATTGTCATGAAATTAAACCAGACATATCTCAAGATCCTCCAAAATTCGAGGGGATTTCCATAAAACAGGAAATCATCGAGGATCAATATGGCCTCGTGGAGCAGCAACAGTTCGAAACACCGCCGGCCATTAAACAAGAAATCATAGAAATTGAAGAATCCCACACGTATGACCAGCAAGACGGCTATGTCTACTTCGATGAACTGGTGGGGGAGAACGTCACCCTTCCTCTGGCAGGCGCGGCAACCACATCCAATGATTATCGCAGTCTGATCATATCGGAAGTAAAATCACACATATTTCTGTGTTGTGTGCCGAAATGTAAAAACTCCTCGGAATCGACCAACATAAAACTCTACACTGAATTTCCCAGCGATTCGGAGATATTCATTAAATGGTGTTTTAATATCAAAATCGATCCACGTAACTACAAAGAGAACCAATACACCATATGCAGTCAACACTTCGAAGCAATTTGTTTCAGTGACAACGACTTGACACTGCATCCCTGGGCTGTGCCCACCTTAAACCTAAATCTAACGGAAAATTCGTTCATCCATCGCAATGATCCGCCAAGCGAACAGTGCATAGTCTATGGCTGCATACAGCCGATGCCGCCACTTTATAAATTTCCACTGAGGTTGGATTTATGCCAAAAATGGTTTGCCAACCTAAAATTAGACCTTACGGACTTTAGGGCCCACACGTACAGGATATGTCGAAGGCATTTCGCCCCTGAGTGCTTCGATACAAATCATATACTCAAAACAGAATCAATTCCAACGCTTTATTTGGGGCATGCCGATAGCATAGCGCATTTAAATGCATTTGAGGTTGGGCCACACGAGCAAGAAGGAACCTTAGGCGGCGACCTTATGGCCGGCTTAGTTGCCGGAGGCCTTGAGAATAGTCGGGGCAGCAGTCAGGGTTCGATTGTGCGACATTTGATATCACCCAATGATCTAGAAGATCATGACAGTAGTTATTATGAGGATTTCGAAGAATGCTATGGTGCAGACGATTGA
Protein Sequence
MKEERKHQQQQQRVQQAPSAQEQKHHQQHNAHHQHHHQQQHHQQQHHHQQQQHNHHYQQQQQSQQPQPQPLSQPHHHHPHQHHANENISQQQQRQQLQPPPRQQQQQQPPAAVSTENSNIPSTAEAKQLQTSLANIKTEPKPLNFPRRKLQTERSSTLPICQRCKQVFLKRQNYTQHVALSCCNIVEYDFKCSICPMSFMSNEELQAHEQLHRLNRYFCQKYCGKYYETTEECEQHEFGQHEYEMFKCNICCVSFPKREQLFAHLMEHRQQPRYDCCICRLCFQTLLDLEDHYVGNPDFCGKFYDKEGFKNLKIFSKPNKTTTQQTTNRSENLTSFMIKDITSIKMPEAGPLPSKQQNTHHISPSASFDDIPDFAAPHVEVKTEIKVEPDFYPPMDQTDFAGFDNDYSNSQDFSQSSNQNLTFLQDFHDNASNSTNSSYSFNTTGSTSKSNEATNQEEDAICCVPKCGVRKFTSPTLQFFPFPRDEKYLSQWLHNLKMTYDPNVNYGIYRVCSLHFPKRCIARYSLSYWAVPTFNLGHDDVGNLYQNRESSGGFPSGEMARCYMPGCQSQRGETNVKFHSFPRDLKTLIKWCQNSRLPVHSKENRFFCSRHFEEKCFGKFRLKPWAIPTLRLGTVYGKIHDNPNIYQEEKKCFLPFCRRSRSYDCNLSLYRFPRDETLLRRWCYNLRLDPEMYRGKNHKICSSHFVKEALGLRKLNPGAVPTMNLGHNDRFNIYENELYTPPPPPPPPQPSTSAKAQKFAEMFKQEMGSASAIYDEVFMNSMIQKFSGSSSANASNLDLGDVCLVPSCKRTRHSDDITLHTVPKRAEQLKKWCHNLKMDLEKMHKSVRICSAHFESYCIGGCMRPFAVPTLELGHDDTDIYRNPDVIKKLNIRETCCVPSCKRNRDRDHANLHRFPTHPELLQKWCENLQKPVPDGTKLFNDAVCEVHFEERCLRNKRLEKWAIPTMNLGYDDLAHNLPSEEEISEFWTKPFAPNNGDEQGDCCVSSCRRNPQIDDVKLYRPPEDAEQLLKWAHNLQLDAAELPNLKICSLHFESHCIGKRLLNWAMPTLNLGSRVEHLFENPPPTQVVYKKKEKHGRLSSNHEVIKWAPRCCLPHCRKTRTQDNVQLFRFPYANRQTLAKWCHNIQLPLVGSSHRRICSAHFETAVLTKRCPMNMAVPTLHLNTSAGYKIYQNPARLKQVKVGVQRQCIIESCHKTKADGVVLFRFPNNRTILQKWRHNIKNWPKCKLSSQLRVCSDHFESHSVGGKRLSPGAIPTLKLGHDSDDLYPNETRSFFDLEKCVINGCESRKDMEDVRLFRFPRDDEELLQKWCNNLKMNSLDCVGIRICGKHFEVECLGPKLLYKWAIPTLNLGHKEEDNVEIIQNPPPEQRSGEYIFKCCVPSCGKTRKYDDAQMNSFPKHLKLFRKWKHNLKLDFLNFKEREKYKICNDHFEPVCVGKTRLNFGALPTLNLGHNDSEDLYKINPDRIRPNLFIKQKDIERMERKQMRLEEMKVNMDMDEQQDMDQDQDDDALDPLSTPAECCVEDCKAPKSIMREPYDLPETLELRKLWSKEINRDVGDLSADSKLCGLHYQQLFSDLKDEMEALKEENPDVKLDYGKLFFAYQKSEISLVINGFQCRVENCPTNLLNSNHRLYFFPYGKDIISKWTYNTGIIPDENRRYMNKVCTLHFEPYCVTETQRLRSWAIPTLNLNHSDPESIYKNPDLTRIDRRMLGPQILKCAVTNCDSAKATETAETTKLFNFPTDDVLLRKWCVNLKMSRHLTPLLKICSLHFEKLCFGSSRIRSWAIPTKMLGHDAEPEYFNKTTIKQEVYGEPSNKNEQLQLQQVKIKKSLDSVKCYVASCRRSRLQHGVRFYGLPVNGKMKKKWLHNLQVPSSKAGKVLNLRICNLHFHKRCLEGKHLKTWAVPTMHLGHTEPIFDNPRRLQNPLTVQRCALPHCSNHSIGNEHLRTFVFPKSTEFLEKWSKNLKLDVSKCKGRLCHEHFEADVKGEKKLKNGAVPTINLGHDDEIPFNNCELIGKLEINSTQKETEATGEQTPYEDDEMGEDEEEEEEFEDDEYVGDGEGIEDNEEDEMEDDHDEDMDREEDDDDDDEDEEDVDMDKVRIRGTLQHWSSIKMKELRVTLVPIRQEDILEMSSVSSYERDRRSITPANSIRDLRSETPASIGGGHTSSEYNDENSSSTPLRTDKPLNSIAPMCCLKHCGKEKTPEQHLTTYGFPKDPLLLQKWCDNLGLQPEECIGRVCIDHFELRVIGTRRLKLGAVPTLNLGSSRVPKHTNDEPKKIVSNDSEQKLPDNEQILTPPPPYCNPKSGKQSVFRLCCLKHCRRKKQPESEMDQSHELAQPLLLFKFPTEYETLKKWSANLRLPEKTCGRKELRVCSKHFETFVIEGDTLKPNALPTLDLSYSQRPPVFKNNRKDFEQKPLEVDATTETTKCFLAHCGKTTEDPETFLINFPKHDLSMQRKWFKNLKLDSKIQKYKHLKICNHHFETYAFFKQRNLKTGAVPTLNLGHTDRICKNLPKLRRKVRTQPIETCCIKTCGNQDQSRKLYAFPKNSELRRIWCSNLQIELREALRCHYKLCGTHFSPESFEAATDILKINAVPSLNLGVEADKLTVLCKLETENDKFKCVVESCQKSASVDKVKLFGFPKSKDLLKKWLFNLNLSPDIEVDQTRICNRHFEKMCVKHGNLHEKAVPTMFLKAKSWIYQNDDDVFEENYQCCVLNCSYHTNEEEYRPMYRFPKQKVDIDKWLHNLRLKFEDDQTEVKDLRICSVHFEDTCKTKEHLQPGTVPTLQLGHEQMEDIYRNHIQKCCIENCCWTGFTCHKLPDAETLKSCWQKTLDPESRVNNSEYICSIHFVSCYDRSGATQENETLRELYDQLRILPELSTFKCSVSTCETGFKLYTKLFKFPKDVTLFQKWLHNSSLTFAYADRPQYRICAQHFEERCLSEKKLHRWSLPTLSLPFNLSLYVNPPEALPSNHENLKHCCVASCNTEKGPFFKFPAKSNDVKKWIHNLGLGTQQCTLNLRVCHRHFESYCLSRDEGGSITKLKHWSVPTLNLTRTCDLNLNPPEKVDYFACCVCRELQNKAEGLYLFRFPTRLSSFLKWLHNLRLQRQDYRDSMRICIKHFESDCFDKTLKLLRKHSVPTIGVACSSQDMFTNPSRRPNSKCCIPSCDGPWLHLNVFPKDKMLLKKWCFNLNVGEGDLETLRNWKICQKHFENKCLNAFGLIRPTAVPTLNLGHTNKIFKNTRFTKKVNQLKVKNSTAVKAPRRFLLKKGIKIEEQTPNTPVKRKSAKLLENVVAKNPKRAVKASKFLLNKKIKIENQKCAGKSKQLPEVADEMPSAEASAEDLDKYETLGDYISAKKKIKALRKKQTVDITKSLPKVNKSRQISCKWLPFKPEPVERPLLDIIIDNGTYEASNPESNILQDFAVKQEFCNVTENALEGLVVKMEPIDETTQKFLEGLNERPSSAQRLGLHHKCSISSCTNVSSIPGCSFFKYPPNPRIRTIWARHCQETCKFIMTLTKKRRICVEHFEDQCFIEKRLLLGAIPTLNLGHPESNLDDCAEVLKRFRSLRCRVDECQRSVELEQVNKIPFPPGELKQKWCFNLNLNEADIAVADWICHRHFERRVLIRSKRIKDNAVPTLLLGSQAKPQHELYKNPEYINQQQYFIQLRQMCCVSACTNTRKMAGVRLATFPKRKDIYEKWLHNLDLDDSPEVRNGYHICWQHFEDKCYTKYNYLKIGSVPTLKLDKHEDLIGLDESLLVDVDSFGKRIKYLQQHKKYKCFHPQCTEWKTHVYSAPAIDQLRTLWLKSLQLELSPSEEPEVHFCDEHFYELYKNFEDHLPLVDSDIYHNELEALEKTFKELAAKNKFFTKQCFVPQCPTDLYFRPYKDNKLYNFPHNSELSQKWCYNCNIDYSKLDKDKLNYYKICDRHFESYCFSKRTIFSWALPTLNLPDTRPTQIVENDPDDKNAYTGECCIRSCINANGYKLESKTRLFKFPENTDILAKWLNNINCEILSANDTKICGLHFRQSYIKKRKLTDEAVPTLRLGRFDGSEIYPKVINDTGIFIKAEPADEFLLDEGLSYDDDEEEEKEMLVEQKVNVNDGWNDHDYCHEIKPDISQDPPKFEGISIKQEIIEDQYGLVEQQQFETPPAIKQEIIEIEESHTYDQQDGYVYFDELVGENVTLPLAGAATTSNDYRSLIISEVKSHIFLCCVPKCKNSSESTNIKLYTEFPSDSEIFIKWCFNIKIDPRNYKENQYTICSQHFEAICFSDNDLTLHPWAVPTLNLNLTENSFIHRNDPPSEQCIVYGCIQPMPPLYKFPLRLDLCQKWFANLKLDLTDFRAHTYRICRRHFAPECFDTNHILKTESIPTLYLGHADSIAHLNAFEVGPHEQEGTLGGDLMAGLVAGGLENSRGSSQGSIVRHLISPNDLEDHDSSYYEDFEECYGADD

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00817465;
90% Identity
-
80% Identity
-