Basic Information

Gene Symbol
-
Assembly
GCA_963513945.1
Location
OY740717.1:21159397-21221928[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 34 4.9e-15 5e-12 47.3 1.1 1 86 555 627 555 628 0.84
2 34 1.4e-14 1.4e-11 45.8 4.2 1 86 655 723 655 724 0.80
3 34 7.9e-15 8e-12 46.6 0.3 1 87 745 817 745 817 0.82
4 34 1.7e-13 1.7e-10 42.4 2.4 1 86 896 964 896 965 0.78
5 34 1.2e-15 1.2e-12 49.2 5.7 1 87 989 1061 989 1061 0.81
6 34 1.1e-11 1.1e-08 36.5 0.5 1 87 1096 1164 1096 1164 0.81
7 34 3.1e-11 3.1e-08 35.1 3.5 1 86 1205 1274 1205 1275 0.75
8 34 1.4e-15 1.4e-12 49.0 0.3 1 86 1302 1371 1302 1372 0.81
9 34 1.1e-13 1.1e-10 43.0 0.2 1 87 1394 1464 1394 1464 0.79
10 34 4.8e-13 4.8e-10 40.9 2.9 1 87 1492 1564 1492 1564 0.86
11 34 4e-05 0.04 15.5 0.1 1 80 1637 1703 1637 1710 0.75
12 34 1.7e-10 1.7e-07 32.7 0.1 1 87 1733 1805 1733 1805 0.79
13 34 1.3e-12 1.3e-09 39.6 1.2 1 86 1835 1905 1835 1906 0.79
14 34 1.4e-12 1.4e-09 39.4 0.8 1 87 1952 2024 1952 2024 0.82
15 34 9.7e-13 9.8e-10 39.9 1.8 1 87 2047 2116 2047 2116 0.81
16 34 1.9e-13 1.9e-10 42.2 0.1 1 87 2305 2374 2305 2374 0.80
17 34 9.5e-14 9.5e-11 43.2 3.1 1 86 2427 2507 2427 2508 0.79
18 34 2.1e-12 2.1e-09 38.9 1.1 1 86 2539 2610 2539 2611 0.80
19 34 3.5e-12 3.5e-09 38.2 1.2 1 87 2637 2707 2637 2707 0.81
20 34 3.7e-12 3.7e-09 38.1 1.7 1 87 2730 2800 2730 2800 0.81
21 34 2.8e-17 2.8e-14 54.5 1.9 1 86 2821 2894 2821 2895 0.83
22 34 2.7e-06 0.0027 19.3 4.1 1 58 2912 2958 2912 2983 0.83
23 34 1.5e-12 1.5e-09 39.3 1.4 1 86 2998 3067 2998 3068 0.82
24 34 7.8e-13 7.9e-10 40.2 2.0 1 87 3093 3165 3093 3165 0.79
25 34 9.2e-12 9.3e-09 36.8 2.6 1 86 3185 3255 3185 3256 0.78
26 34 2e-10 2e-07 32.5 2.9 1 87 3276 3345 3276 3345 0.80
27 34 1.2e-09 1.2e-06 30.0 3.2 1 86 3588 3660 3588 3661 0.79
28 34 3.9e-08 3.9e-05 25.2 1.0 1 86 3684 3754 3684 3755 0.75
29 34 5.7e-13 5.7e-10 40.7 0.2 1 86 3786 3857 3786 3858 0.82
30 34 0.0002 0.2 13.3 0.1 1 58 3894 3943 3894 3960 0.84
31 34 7.8e-12 7.8e-09 37.0 3.0 1 87 3987 4064 3987 4064 0.86
32 34 2.7e-13 2.7e-10 41.7 4.8 1 86 4088 4159 4088 4160 0.84
33 34 5e-14 5.1e-11 44.0 3.9 1 87 4329 4403 4329 4403 0.82
34 34 1.5e-11 1.5e-08 36.1 0.2 1 86 4422 4489 4422 4490 0.81

Sequence Information

Coding Sequence
ATGTGCGTATGCATGTATCATGCAAATTTCGGATTTTTGCTAGAGGGTTGTTCAACCCTTATTCCGTCGATATCGCATGATTTTCAAAGCTTTTTACAATCAGCTTGCTGCAACATTAAAGAGGAAAAATCTGAACCTATGGATGAAATGCCATTTCACAAAAATGCGCcacaaattgaaaacaatacgTTTACTATGAATGAGGAACgtaaacagcagcaacaacaacgagtACAACAGGCACAAGAGCAaaaacatcatcatcagcataaTCCTCCACCGCCttctcatcatcatcagcatcaacatcagcaacaacaacaacaccaccatcaccaacaacaacaacaacaacagcataacCATCATTATCAACACCACCTCCAGCAGCAGCAGTCACAGTCACAACAGCCTCAACCTCCGCCTCAGCCTCCGTCGTCCCAGCATCATCATCCACATCAGCATCAGTCGAATGAAAATAtatcacagcaacaacaacgacagcagcagcaccagccgccgccgccgcaacgacagccacaacaacaacatcaacaacaaacgACGGCAGCAGTAAGCACAGAAAATTCCAACATACCATCAACGGCAGAGGAAAAACAACTACAGACATCCTTGGCTAATATAAAAGCCGAACCAAAGGTAAAAGAGAATGGGACAAAAACACCAAACACTGTAAAAAACAtgCCCCTGAACTTTCCTCGCCGCAAATTACAAACGGAACGTTCCTCCACTCTGCCCATATGCCAGCGATGTAAACAGGTCTTTTTGAAACGCCAGAGCTACACACAACATGTTGCTCTGTCCTGTTGCAATATTGTCGAATACGATTTCAAATGCTCCATCTGCCCCATGTCCTTTATGTCCAACGAGGAGTTGCAGGCCCATGAGCAATTGCATCGTTCGAATCGCTATTTCTGCCAGAAATACTGTGGCAAATACTATGAGACCATTGAGGAGTGCGAGCAGCATGAGTTTGGCCAACATGAATACGAAATGTTCAAATGTAATATATGTTGTGTGTCGTTCCCGAAACGTGATCAGTTGTTCTCCCATCTTATGGATCATCGTCAGCAGCCACGTTACGATTGTTGCATTTGTCGTTTGTGTTTTCAGACCTTGGAGGATTTGGAGGATCATTACGTGGGGAATCCGGAATTTTGTGGCAAATTTTATGATAAGGAAGGtttcaaaaatctgaaaattttttctaaacCCATTAAGACTACAACACCGCAAGCAACAAATCGTTCGGAGAATCTCACCAGTTTCTTGATAAAGGATATAACCTCTATTAGAATGCCTGAACCGGGACCTTCACcatctaaacaacacaaaactgATCATATTTCACCTGCGGCAACTTTCGATGATATACCCGACTTTGCTGCCCCCCATGTAGAGGTTAAGACAGAAATTAAAGTGGAACCCGATTTTTATCCACCCATGGATCAGTCGGAATTTTCCGGTTTCGACAATGACTATTCAAATTCGCAAGAGTTTTCGCAAGGCTCAAATCAGAACTTGACATTTTTACAAGATTTTCACGACAACGCTTCGAATTCCACCAACTCATCGTACTCATTCAATACTACCGGATCGACCAGCAAGAGCAATGATACCAACCAGGAGGAAGATGCTCTTTGCTGCGTGCCTAAGTGTGGCGTACGAAAGTTTACCTCGCCCACATTGCAGTTCTTTCCTTTTCCCCGCGATGAAAAATATCTATCACAGTGGCTGCATAATTTGAAAATGACATACGATCCGAATGTGAATTATGGTATATATCGAGTGTGCAGTTTGCATTTTCCTAAGCGTTGTATCGCCAGATATTCGCTGAGTTATTGGGCTGTGCCCACTTTTAATCTAGGACATGAAGATGTGGGTAATTTATATCAGAATCGAGAAAGTTCGGGGGGGTTTCCTTCGGGTGAAATGGCTCGTTGCTATATGCCCGGCTGTTTGTCACAGAGAGGAGAGACTAATGTAAAATTTCACAGCTTTCCGCGGGATTTGAAAACTTTGATTAAGTGGTGCCAAAATTCCCGCCTGCCCGTGCATAGTAAAGAGAATCGCTTCTTTTGTTCACGCCACTTTGAGGAGAAATGCTTTGGCAAATTTCGCTTAAAACCTTGGGCCATACCAACACTTCGCTTAGGAACGGTTTATGGAAAGATACACGATAATCCCAATATTTACCAGGAAGAAAAGAAATGCTTTCTACCGTTTTGTCGACGTAGCAGATCATATGATTGCAACTTGTCCCTCTACAGATTTCCCCGGGATGAAACATTACTTAGGAGATGGTGCTACAATTTAAGGCTGGACCCCGAAATGTATAGggggaaaaaccataaaatctgCTCATCACATTTTGTCAAAGAGGCCTTGGGCTTAAGGAAACTGAATCCCGGAGCAGTGCCTACCATGAATTTAGGCCATAATGATCgatttaatatttatgaaaatgaattaTATACACCACCGCCACCTCCTCCACCACCACAGCCCTCTACATCGGCCAAAGCCCAAAAGTTCGCTGAAATGTACAAACAAGAAATGGGATCGGCTTCCGCCATATATGATGAGGTCTTCATGAACTCGATGATTCAAAAATTCTCGGGTTCATCGGCGCCAAATTCCAATAATCTCGACTTGGGAGATGTATGTTTAGTCCCGTCATGTAAGAGAACGCGGCATTCGGATGACATTACCCTTCACACTGTGCCCAAGCGGGCGGAACAGCTGAAAAAGTGGTgccacaatttgaaaattgacTTGGATAAATTGCACAAGAGTGTACGCATTTGCAGTGCACATTTTGAGAGCTATTGCATCGGGGGCTGCATGAGACCTTTTGCGGTTCCCACTTTGGAATTGGGTCACGATGACCCGGATATATTCCGTAATCCGGATGTCattaagaaattaaatataCGCGAAACCTGTTGTGTACCTTCCTGTAAAAGGAATCGTGATCGCGATCATGCAAATCTGCATCGTTTCCCCACTCACCCGGAATTGTTGCAGAAATGGTGTGAGAACTTGCAGAAGCCTGTGCCGGATGGtacaaaacttttcaacgatgCGGTGTGTGAAGTCCACTTCGAAGACAGATGTTTGCGTAATAAGCGATTAGAGAAATGGGCCATACCCACAATGAATTTGGGCTATGATGAAATAGTACATCACTTACCCTCTGAAGAAGAGATTTCGGAGTTCTGGACAAAACCTTTTGCCCCCAACAATGGAGACGAACAAGGCGAATGTTGTATTTCCTCGTGCAGACGTAATCCTCAAATTGATGATGTAAAACTGTACCGACCGCCTGAGGATGCCGAACAATTACTGAAATGGGCGCACAATCTGCAACTCGATGCGGCGGATTTGCCAAATCTAAAGATTTGCAATTTACATTTCGAGTCACACTGTATAGGAAAACGATTATTGAATTGGGCTATGCCCACACTAAATTTGTCCTCAAAGGTGGAGCATCTGTTTGAAAATCCTCCGCCTACTCAGGTCGTCTATAAGAAAAAAGAGAAACCATTACGTTTGTCTTCCAACCACGAAATAATAAAATGGGCTCCCAGATGTTGTTTACCTCACTGTCGCAAAACGCGCTCGTTAGACAACGTACAGCTCTTCCGATTCCCCTACGCTAACAGACAAACTATGGCCAAATGGTGCCATAACATACAGTTGCCTTTGGTGGGTAGTTCCCACAGACGCATTTGTTCTGTGCATTTTGAAACGGCAGTGTTAACTAAGAGATGTCCCATTAATTTGGCAGTGCCAACGTTGCATCTCAATACTCCACCCGGCTACAAGATCTATCAGAACCCGGCTCGCCTGAAACAAGTTAAAGTGGGTGTTCAAAGACAGTGTATCATAGAATCGTGCCACAAAACTAAAGTGGATGGCGTGGTACTCTTCCGTTTCCCCAACAATCGCACAATTTTACAGAAATGGCGGCACAATATCAAAAATTGGCCTAAAGGCAAATTAAGTTCTCAGCTTAGAGTTTGTTCGGAACATTTTGAATCCCACTCAGTGGGAGGGAAACGTTTATCGCCCGGTGCCATACCGACCCTTAAATTGGGCCATGACTCCGAGGATCTATACCCTAATGAAACGCGATCCTATTTTGATATGGAAAAATGTGTGGTCACTGGTTGCGATTCGAAAAAAGACATGGAGGATATTCGTCTCTTCCGATTCCCCCGAGATGACGAAGAGCTTCTGCAGAAATGGTgtactaatttaaaaatgaatacctTAGACTGTGTGGGTATACGGATCTGTGGTAAACATTTCGAGGTTGAATGTCTGGGCCCTAAACTCTTGTACAAGTGGGCCATACCAACTTTACATTTGGGCCATAAAGAGGAAGACAACGTAGAGATTATACAAAATCCTCCGCCCGAACAAAGATCGGgagaatatattttcaaatgttgcGTGCCAAGTTGTGGTAAGACCCGCAAATATGACGATGCCCAAATGAACAGTTTCCCCaaacatttgaaattgttccgcAAATGGAAGCACAACCTGAAATTGGATTTCCTCAATTTCAAAGAAagggaaaaatacaaaatttgcaatgacCACTTTGAGTCGGTATGTGTTGGAAAGACTCGATTGAATTTCGGAGCCTTGCCCACCCTGAACTTAGGCCACACAGACAGTGAAGATCTGTACAAAGTAAATCCTGATCGAATACGCCCCAATCTGTTTATTAAACAAAGGGATATAGAACGAATGGAAAGAAAACAGTTGCGATTAGAAGAAATGAAACTAACTATGGATTTGGATGAACAACAAGATATTGATCAGGAtcaagatgatgatgatgatgcattGGATCCGCTAAGTACACCAGCTGAATGCTGCATTGAAGACTGCAAGGCACCCAAGTCCATAATGAGAGAACCTTACGATTTGCCGCAGACTCCGGAGTTACGAAAGCTCTGGTCTGAAGAAATGAAGATAGATGCTGGCGATTTGCCGGCGGATAGCAAATTATGCGGTTTGCATTATCAACAGTTTTTCTGCGACCTGAAAGATGAAATGGAAGCCTTAAAGGAAGAGAGTTCCGAAGTGAAATTAGATTATGGCAAACTTTTGTTTGCCTATCAGAAATCAGAAATCTCGCTGGTAATTAATGGTTTCCAGTGCAGAGTTGAAGGCTGTCCCACAAATTTACTCAATTCAAAtaatcgtttatattttttcccctATGGCAAAGATATTATCAGTAAATGGACCTATAATACGGGTATAATACCAGATGAGAATCGGCGATACATGAACAAAGTGTGCGCTTTGCACTTTGAATCCTACTGTTTAACCGAAACCCAGCGCCTGCGATCGTGGGCCATACCTACTTTGAATTTAAACCACTCCGATCCAGACAGCATTTACAAGAACCCGGATCTAACCAGAATCGATAGACGAATGCTGGGACCTCAGATCTTAAAATGTGCTGTTGCTAATTGCGACAGTGCAAAGACAACGGAAATGGAATCGACCAAATTGTTTAACTTCCCCACAGACGACGTTTTGTTGAGAAAATGGTGTGTGAATCTGAAAATGTCACGACATCTAACACCACTGCTCAAGATCTGCTCTTTGCATTTTGAGAAAATGTGCTTTGGCAGTGCACGTATCCGTTCCTGGGCAATACCCACAAAGAATTTAGGCCATGACGACGAGCCGGAATATTTCAATAAGACCACCATCAAACAGGAAGTCTATGAGGAACCATCGACGCAAATCGAGCAACTACAACTGAAACAAGTGAAAATCAAGAAATCCTTGGACTCCGTAAAGTGTTACGTAGCGTCATGTCGAAGGTCACGTCTGCAACATGGTGTCCGCTTTTACGGCTTGCCCACGAACGGCAAAATGAAAAGGAAATGGTTGCACAATCTTCAAGTACCCTCCAATAAGGCGggaaaagttttgaatttaagAATCTGCAATTTGCATTTTCACAAGCGTTGCTTGGATGGCAAGCATTTGAAGGTATGGGCAGTGCCCACAATGCATTTGGGCCACACGGAACACATTTTCGATAATCCGCGCAGGTTGCAAAACCCCCTGGCCGTACAGCGTTGCGCTTTGACACATTGCCGGAATCACTCAATCGGAAATGAGCATTTGCGtacatttgtattcccaaaatCTACAGAATTTCTGGAGAAATGGTCGAAAAACCTTAAATTAGACGTGTCGAAATGTAAAGGTCGCCTATGTCACGAACACTTCGAACCGGCCGTTAAAGGTGAGAAGAAACTCAAAAATGGAGCAGTGCCCACAGTTAACTTGGGTCACGACGACGAAATTCCCTACGATAATTTGGAGCTGTTGAGCAAACTGCAATCAAAGGCTTTGCAAAAAGATACGGAATCCAAGAGCGAACAAATGCATGATGATGACGAAATGGAAGAAGAGGAGGAGGATTATGACGAGGATGAATTCGAAGAAGATGAATTTATAGGCGAAGGCGATGGAATAGAGGAGGATGAAATGGAAGAGGAAGATGAGGAAGAAGAAGATGTGGACAAGGAAGATGAAGATGACGAAGAAGAAGTGGACATGGACAAAGTTCGCATACGTGGTACTTTGCAGCACTGGAGttcaataaaaatgaaagaatTACGAGTTACCCTTGTACCCATAAGACAAGAAGACATTTTGGAAATGTCCTCAGTCTCGTCGTACGAACGAGATCCGCGTTCCATAACACCCGCTAACAGTGTCCGAGATCTTCGTTCCGAAACTCCAGCCAGTGTGGGTGGTGGCCATACTTCTTCGGAATATAACGACGAAAATTCTTCGAACACACCCTTGAGAACGGACAAACCCTTAAACAGTATTGCACCCATGTGTTGTCTCAAACACTGCGGCAAAGAAAAAACCCCCGAACAACACCTGACCACATATGGCTTCCCTAAAGACCCGCTGCTCTTACAGAAATGGTGTGATAATTTGGGCCTACAACCAGAAGAATGCATAGGCCGTGTGTGTATAGACCATTTTGAGCTAAGAGTTATAGGAACGAGAAGACTCAAGATAGGAGCTGTGCCCACCTTAAATTTAGGCAACTCTCGTGTGGCGAAACACACCAACGACGAGCCGAAGAAAATCGTAAGCACCGAGTCCGAACAAAAGTTAGCAGATAATGAACAGATACTCACGCCGCCACCGCCGTATGGCAATCCGAAAACGGGCAAACAATCGGTTTTTCGGCTATGTTGCCTCAAACATTGTCGGCGCAAGAAACAGCCTGTGCCGGAGAAGGATCAGAGCCATGACTCATCGCCACAACTGTTTTTTAAGTTTCCCAAAGACTATGAGACTCTCAAGAAATGGTCATCGAATTTACGTTTGCCGGAGAAAACCTGCGGCCGTAAAGAATTGCGTGTGTGTTCGAGACATTTTGAACCGTTCGTTATTGAAGGCGACTCACTGAAGCCCAACGCATTACCCACTTTAGACTTAAGTTACTCAAAACGCCCtccagtttttaaaaacaatcgcaAAGAGTTTGAACATAAACTCGTTAATGTGTCTAGCGACACGACCAAGTGTTTCCTGCCACACTGTGGTAAGAAAGAGGATCTGGAAACCTTCCTAATCAGTTTTCCTCAACATGATCTGTCCATGCAGCGCAAATGGTTTAAAAATCTCAAACTGGATAGTAagattaaaaactataaacatttaaagatCTGTAACCATCACTTCGAGACATACGCCTTTTACAAACAGCGAAATCTCAAGTCAGGAGCCGTACCCACCCTCATGCTGGGTCATACGGACCGCATATGTAAAAACTTACCGAAACTACGCCGAAAAGTAAGAACACAGCCGAAGGAAACATGTTGTATTAAGACTTGTGACAATCAGGGCAACAAAAAACTCTATGCCTTTCCCAAAAACTCTGAGCTGAGGAGAATATGGTGCTCGAACTTGCAGATCGATCTGAGGGAAGCTTTAAGATGCCACTTTAAACTCTGTGGAAAACACTTTTCCCTCGAGAGCTTTGAGGCGGGTAcggatgttttaaaaataagcgcTGTGCCCTCATTGAATTTGGGTGTTGAAGTTGATAAACTTAAGGTGTTATGCAAAATAGAGACTGAGGACGATAAGTTTAAGTGTGTGGTGGAGAGCTGTCAAAAGTCGGTGAGCGTTGACAAAGTGAAACTGTTTGGTTTTCCGAAATCCCGAGATTTCCTCAAGAAATGGTTATTCAATTTGAATCTCTCGCCTGATATAGAAGTAGACAAGACACGCATTTGCAATCGCCATTTCGAGAAGGTATGCATTAAGCATGGCATTTTGCATGAAAAAGCCGTACCCACCAAGTTCCTGAAGGCCAAATCCTGGATATACCAGAACGACGATGATGTATTCGACGAGAGCTACCAGTGTTGTGTGCCGAATTGCAACTACCAAAACAACGAAGAGGAATATCGCTCCATGTATAGGTTCCCCaagcaaaaggaggacattgaCAAGTGGATTCATAATCTAAGACTGCCAGTTGAAGATGACCAGACGGAAGTTAAGGATTTAAGAATATGCTCAGTACATTTCGAAGATGCTTGCAAAACGAAGGAGCATTTGCAACCAGGCACAGTGCCCACTTTACAATTGGGATATGAACAAACGGAAGACATTCACCGCAATCACATTCAGAAATGCTGTGTAGACAATTGCGGCTGGGCGGGTTTCACATGTCATAAACTACCCGAAAATGAGACACTCAAAAGCTGTTGGTTAAAGGCCTTTGAAAACGATTCCTGCGTTAATAATTCCGAATATATTTGCTCAATACATTTTGTATCTTGTTATGAACGAGTTGAGGCGGCACCACCACCCGTAATCACCGATGAGAATGAGACTTTAAAGAAGCTGTACGATCAGCTGCGTGTTTTACCCGAATTGGCGACATTCAAGTGTTCGGTACCGACGTGCGCGACGGGATTCAAACTCTCAACGAAGCTGTTTAAATTCCCCAAAGAAGTTACACTCCTCCAGAAATGGCTGCACAATTCTTCGCTGACTTTCGATTATGCATATCGTACGCAGTATCGTATTTGTGCCCAGCATTTCGAAGAACGCTGCCTGAGTGAAAAGAAATTACATCGCTGGTCGTTACCCACTCTGTCATTGCCGTACAATTTGAGTTTATACGTGAATCCACCGGAGGCTTTACCATCGAATCATGAAAATCTTAAACACTGCTGCGTTTCTTCGTGTGACACCGATAAAGGACCCTTCTTCAAATTCCCCGCCAAATCGTTAGAGGTCAAAAAGTGGATTCATAATCTAGACTTGGGCGCCCAACAGTGCACCTTGAATCTGAGAGTATGTCATAAGCATTTTGAAAGTTATTGTTTGTCCACCGACGAGGGCGGAAGCATTACAAAACTGAAACACTGGTCTGTGCCAACGTTGAATCTAACAAAAACAACCGATCTACATCCAAATCCGCCAGAGAAGGTGGACTATTTTGCCTGCTGTGTCTGCCGGCAATTACAAAACAAATCGGAGGGCTTGCATCTGTTTCGCTTTCCCACGAGACTGTCCAGTTTTTTGAAATGGCTGCACAATCTGAGATTAAAGCGTCAAGATTATCGCGACAGTATGCGTATTTGCATTAAACATTTCGAAACGCAGTGTTTCGATAAGACTCTTAAGCTGCTGCGCAAACATTCTGTGCCCACCATAGGTGTGGCCTGTTCCGACAAGGACATGTTCACGAACCCTTCGCGACGACCCAACTCGAAATGTTGTGTGCCCTTGTGCGAAGGCCCTTGGCTACATTTGAATGTTTTTCCAAAAGATAAAatgctgctcaaaaaatggtgCTTCAATTTAAACCTAAAAGAATCGGATTTGGAAACATtgagaaattggaaaatttgccagaAACATTTCGAAAACAAGTGTCTCAATGCTTTCGGGCTGATAAGACCCACAGCCGTGCCTACTTTAAATTTGGGtcataacaataaaatatttaaaaattctaggCTGGCGAAGAAAGTAAACGAAGGACTTGCGGTGAAAGGTGGTATTAAGGCAGTCAAAGCTTCGAGGTTCCTAGTgaagaatgaaattaaaaaggaaGAACAAAAGTCGATGGCGCTAAGCTTGAAACAAGCCAAGTTGGTAGAGAAAACTTCCAAAGGGGCAATTAAAGCTTCCCGTTTCCTGTTAAATAAGAAAATCAAGATTGAAGGCCAAAAAAGTGTAGCTAGAAAAGTCGTGAGGAAACCAAAACAGGATGAAGTACCGAGTGCCGAAGTATCTGCTGAAGATTTAGATAACTATGAAACGCTGGGCGATTACATCAGCGAAATGAGAAAACTCAAAGCCTTGAAAAAGCGGCAAACCGACGTAAGCAACACTACGAAGATCATGCAAAAGGTGGATAAGAGTAAGATTAGTAAAAAGTGGTTACCCTGCAAACCCGAGCCAGAGGACAAACCTTTGTTTGATATTATTGTCGAAACGTGTCCAATCGCAGAAATCACTGATTCACTGTCAGACTTGCCCATCAAAGATGAATTCTTAAATGAAGCCCAAAACTACCTAGACGGTCCAGACAACGCTCAAGGGTTAGTTGTAAAGTTAGAGCCCGTAGATGAAACGACACAAAAGTTCCTCGATGGACTGAGCGAAAGGCCAGCGTCGGCCCAAAAACTGGGTCTCCAACAAAAGTGTTCCATATCAACATGCACGAATGTCACAAATTATCCCGGTTGCGTATTCTTCAGATATCCGCCCAACATAAGGCTTTGTTCGATCTGGGCCAGGCATTGTCAAGAGAGCACCAAATTTATCGTGACTTTAACGAAAAAACGTAAGGTCTGTGCGGaacatttcgaagagcagtgcTTTGTTGAGAAACGCCTATTACTGGGCGCCATACCAACTTTGCGGCTAGCCCATGAAAATTCGAAGTTAGATGATTGTGGCGAACTGTTGAAAAGCTTTCGCAACTTGCGCTGTCGCATGGACGAATGTCAGAGATCGGTGGAGCTGGATCAGATAAACAAAATCCCCTTTCCACAGGGTGGTGAATTAAAACAGAAATGGTGTTTCAATTTGAACCTCAATGAAGCTGATGTAGCGCCGCATGACTGGATTTGCCATAAGCATTTCGAGCGGCGGGTCCTTATCAAAAATAAACGCACCAAGGACAATGCAGTGCCTACCTTGTTGCTGGGCTCTCAAGCCAAACCGGCGGAGGAACTTTACAAGAATCCTGAGTACATCAATCAACAACAATATTTCATACAACTGCGCCAAGTGTGCTGTGTGCTGAGCTGCCCAAATACCAGACAAATGCCGGGTGTACGTTTGACCACCTTTCCCAAACGAAAAGACATTTATGAGAAATGGTTGCATAATTTGGACTTAGAGGACGCGCTGGAAGTTCGCACGGGCTACCACATTTGCTGGCAACATTTTGAGGAAATATGTTACACGAAATATAACTATTTGAAAATTGGTTCCATACCCACGCTAAAGTTAGACAAGCAAGACGATTTGATGGCCTTGGACACTAAGCAATTGGTGGAAGTGGATAGCTTTGGCAAAAGAGTCAAAAGTCATATGCAGCACAAAAAGTATAAATGTTTCTATCCGGAATGTACCGAGTGGAAGACACAGGTATACAAGGCACCGGAAATTGATGAACTGCGCACATTATGGCTAGATAGTTTAGAACTGCAGGTTCTCACCTTAAATACCGATCAGGATGTTCACTTTTGTGATGAACATTTCTACATGCTGTACAAGAAGTTTGAACACCAGCTGCCACTGATCAACAGCGAGCTTTATGGCAATAAATTGGAATCTCTGAGGAAAACATTCCAAGAACTGGCGGCAAAAGGTAAATTCTTTACCAAACAGTGTTTGGTACCACAATGTCCCACAGATCATCTCTTTGAGCCCTACAAAGATATCAGACTGTATAACTTTCCGCAGAATGCCGAAATATCTTCGAAATGGTGTTACAACTGCAGCATTGACTACAGCAAACTCGAAAAGGATAAATACAATTACTACAAAATCTGCGATCGACATTTCGAAAGCTATTGCTTTAACAAACGTTCCATATTTTCCTGGGCCCTGCCCACCTTAAACTTGCCGGACACGAGGCCAGTAGAGATACTGGAAAATGACCCGGAGGATAAGAACGCCTATACTGGCGAGTGTTGCATACGCTCGTGCATAAATGCCAATGGCTATAAACTAGAATCGAAAACTAGACTATATAAATTCCCCGAGAACAAGGAGACCCTTGAGAAATGGCTCCACAATGTCAATTGTGAAAACTTTTgcgaaaatgaaacaaaaatatgtggCCTTCATTTCCGCTGTAGCTATATTAAAAAGAGGAAATTAACCGAGGAGGCCATACCGACACTGAGACTAGGTCATTTCAATGAAACAGAGCTAATTACCAAAGTCAAAGTCGATCCGGTAATTGTAATCAAAGAAGAGCCCACGGATGAGTTGATGCAGGAAGACGTAACCAGAGCATCGTCCGTGGAGGAAGAAGAGGACAAAGAAATGCTAATGGAGCAAAAAGTCAATGTCATCGAGGGATGGAATGATCACGATTATTGCCATGAAATTAAACCGCACAAATCTCCAAATGCTCAAAAATTCGAGGTGATAGCTATAAAGCAGGAAATCATTGAAGATCAATACGGCCTCGAtgagcagcagcaacagtaCGAAACACCGACtattaaacaagaaattatcgAAATTGAAGAATCACACACCTATGACCAGCAGGAAGAGTATATTTACTTCGATGAGAACGTCGTGGGGGAGAACGTAACTCTGCCACCGGCCGAACCAACAACATCCTCCCATTCGATGTACCGCAGCCTAGTAATATCGGAAGTTAAATCGCACATTTTCCTCTGTTGCGTGCAGAAATGTACCAACTCCTCCGAGAGCAAGGATATTAAATTGTACACCGAATTTCCCAACGATTCGGAGATATTCATCAAATggtgttttaatattaaaatcgaTCCGCGCAATTACAAAGAGAATCAATATGCCATATGCAGCCAACATTTCGATTCGATTTGCTTCAGTGACAGCGATTTGACACTGCATACGTGGGCAGTGCCCACCCTAAACTTGAACCTGCCCGAAAATTCATTTATCCATCACAATGATCCGCCTAGCGAACAGTGCATAGTCTATGGCTGCATACAGCCGACGCAACCGCTGTATAAATTTCCATTGCGATTGGACTTAAGCCAGAAATGGTTTGCCAATCTTAAATTAGAACTAACCGACTATAGGGCGCATAGTTATAGGATATGTCGTAGACATTTTGCCCCCGAGTGTTTTGATACAAATCATGTACTCAAAACCGAAGCTATACCGACATTGTATTTAGGGCACGCCGATAGCATTGCACACTGGAATGCTTTTGAGGTCAGGACCCATGAACAGGAGGGCGGCGGTGATGGTGTTGGCGTCGGCGCTGGTGACATCATCGCCGGTTTTGTGGGTGGCCTCGACAATAGTCGTGGCAGTAGTCAGGGTTCAATAGTGCGACATTTGATATCACCCCATGATCTGGAAGATCACGACAGCAGTTATTTCGAGGACTTTGAGGAATGCTATGGTGCGGACGATTGA
Protein Sequence
MCVCMYHANFGFLLEGCSTLIPSISHDFQSFLQSACCNIKEEKSEPMDEMPFHKNAPQIENNTFTMNEERKQQQQQRVQQAQEQKHHHQHNPPPPSHHHQHQHQQQQQHHHHQQQQQQQHNHHYQHHLQQQQSQSQQPQPPPQPPSSQHHHPHQHQSNENISQQQQRQQQHQPPPPQRQPQQQHQQQTTAAVSTENSNIPSTAEEKQLQTSLANIKAEPKVKENGTKTPNTVKNMPLNFPRRKLQTERSSTLPICQRCKQVFLKRQSYTQHVALSCCNIVEYDFKCSICPMSFMSNEELQAHEQLHRSNRYFCQKYCGKYYETIEECEQHEFGQHEYEMFKCNICCVSFPKRDQLFSHLMDHRQQPRYDCCICRLCFQTLEDLEDHYVGNPEFCGKFYDKEGFKNLKIFSKPIKTTTPQATNRSENLTSFLIKDITSIRMPEPGPSPSKQHKTDHISPAATFDDIPDFAAPHVEVKTEIKVEPDFYPPMDQSEFSGFDNDYSNSQEFSQGSNQNLTFLQDFHDNASNSTNSSYSFNTTGSTSKSNDTNQEEDALCCVPKCGVRKFTSPTLQFFPFPRDEKYLSQWLHNLKMTYDPNVNYGIYRVCSLHFPKRCIARYSLSYWAVPTFNLGHEDVGNLYQNRESSGGFPSGEMARCYMPGCLSQRGETNVKFHSFPRDLKTLIKWCQNSRLPVHSKENRFFCSRHFEEKCFGKFRLKPWAIPTLRLGTVYGKIHDNPNIYQEEKKCFLPFCRRSRSYDCNLSLYRFPRDETLLRRWCYNLRLDPEMYRGKNHKICSSHFVKEALGLRKLNPGAVPTMNLGHNDRFNIYENELYTPPPPPPPPQPSTSAKAQKFAEMYKQEMGSASAIYDEVFMNSMIQKFSGSSAPNSNNLDLGDVCLVPSCKRTRHSDDITLHTVPKRAEQLKKWCHNLKIDLDKLHKSVRICSAHFESYCIGGCMRPFAVPTLELGHDDPDIFRNPDVIKKLNIRETCCVPSCKRNRDRDHANLHRFPTHPELLQKWCENLQKPVPDGTKLFNDAVCEVHFEDRCLRNKRLEKWAIPTMNLGYDEIVHHLPSEEEISEFWTKPFAPNNGDEQGECCISSCRRNPQIDDVKLYRPPEDAEQLLKWAHNLQLDAADLPNLKICNLHFESHCIGKRLLNWAMPTLNLSSKVEHLFENPPPTQVVYKKKEKPLRLSSNHEIIKWAPRCCLPHCRKTRSLDNVQLFRFPYANRQTMAKWCHNIQLPLVGSSHRRICSVHFETAVLTKRCPINLAVPTLHLNTPPGYKIYQNPARLKQVKVGVQRQCIIESCHKTKVDGVVLFRFPNNRTILQKWRHNIKNWPKGKLSSQLRVCSEHFESHSVGGKRLSPGAIPTLKLGHDSEDLYPNETRSYFDMEKCVVTGCDSKKDMEDIRLFRFPRDDEELLQKWCTNLKMNTLDCVGIRICGKHFEVECLGPKLLYKWAIPTLHLGHKEEDNVEIIQNPPPEQRSGEYIFKCCVPSCGKTRKYDDAQMNSFPKHLKLFRKWKHNLKLDFLNFKEREKYKICNDHFESVCVGKTRLNFGALPTLNLGHTDSEDLYKVNPDRIRPNLFIKQRDIERMERKQLRLEEMKLTMDLDEQQDIDQDQDDDDDALDPLSTPAECCIEDCKAPKSIMREPYDLPQTPELRKLWSEEMKIDAGDLPADSKLCGLHYQQFFCDLKDEMEALKEESSEVKLDYGKLLFAYQKSEISLVINGFQCRVEGCPTNLLNSNNRLYFFPYGKDIISKWTYNTGIIPDENRRYMNKVCALHFESYCLTETQRLRSWAIPTLNLNHSDPDSIYKNPDLTRIDRRMLGPQILKCAVANCDSAKTTEMESTKLFNFPTDDVLLRKWCVNLKMSRHLTPLLKICSLHFEKMCFGSARIRSWAIPTKNLGHDDEPEYFNKTTIKQEVYEEPSTQIEQLQLKQVKIKKSLDSVKCYVASCRRSRLQHGVRFYGLPTNGKMKRKWLHNLQVPSNKAGKVLNLRICNLHFHKRCLDGKHLKVWAVPTMHLGHTEHIFDNPRRLQNPLAVQRCALTHCRNHSIGNEHLRTFVFPKSTEFLEKWSKNLKLDVSKCKGRLCHEHFEPAVKGEKKLKNGAVPTVNLGHDDEIPYDNLELLSKLQSKALQKDTESKSEQMHDDDEMEEEEEDYDEDEFEEDEFIGEGDGIEEDEMEEEDEEEEDVDKEDEDDEEEVDMDKVRIRGTLQHWSSIKMKELRVTLVPIRQEDILEMSSVSSYERDPRSITPANSVRDLRSETPASVGGGHTSSEYNDENSSNTPLRTDKPLNSIAPMCCLKHCGKEKTPEQHLTTYGFPKDPLLLQKWCDNLGLQPEECIGRVCIDHFELRVIGTRRLKIGAVPTLNLGNSRVAKHTNDEPKKIVSTESEQKLADNEQILTPPPPYGNPKTGKQSVFRLCCLKHCRRKKQPVPEKDQSHDSSPQLFFKFPKDYETLKKWSSNLRLPEKTCGRKELRVCSRHFEPFVIEGDSLKPNALPTLDLSYSKRPPVFKNNRKEFEHKLVNVSSDTTKCFLPHCGKKEDLETFLISFPQHDLSMQRKWFKNLKLDSKIKNYKHLKICNHHFETYAFYKQRNLKSGAVPTLMLGHTDRICKNLPKLRRKVRTQPKETCCIKTCDNQGNKKLYAFPKNSELRRIWCSNLQIDLREALRCHFKLCGKHFSLESFEAGTDVLKISAVPSLNLGVEVDKLKVLCKIETEDDKFKCVVESCQKSVSVDKVKLFGFPKSRDFLKKWLFNLNLSPDIEVDKTRICNRHFEKVCIKHGILHEKAVPTKFLKAKSWIYQNDDDVFDESYQCCVPNCNYQNNEEEYRSMYRFPKQKEDIDKWIHNLRLPVEDDQTEVKDLRICSVHFEDACKTKEHLQPGTVPTLQLGYEQTEDIHRNHIQKCCVDNCGWAGFTCHKLPENETLKSCWLKAFENDSCVNNSEYICSIHFVSCYERVEAAPPPVITDENETLKKLYDQLRVLPELATFKCSVPTCATGFKLSTKLFKFPKEVTLLQKWLHNSSLTFDYAYRTQYRICAQHFEERCLSEKKLHRWSLPTLSLPYNLSLYVNPPEALPSNHENLKHCCVSSCDTDKGPFFKFPAKSLEVKKWIHNLDLGAQQCTLNLRVCHKHFESYCLSTDEGGSITKLKHWSVPTLNLTKTTDLHPNPPEKVDYFACCVCRQLQNKSEGLHLFRFPTRLSSFLKWLHNLRLKRQDYRDSMRICIKHFETQCFDKTLKLLRKHSVPTIGVACSDKDMFTNPSRRPNSKCCVPLCEGPWLHLNVFPKDKMLLKKWCFNLNLKESDLETLRNWKICQKHFENKCLNAFGLIRPTAVPTLNLGHNNKIFKNSRLAKKVNEGLAVKGGIKAVKASRFLVKNEIKKEEQKSMALSLKQAKLVEKTSKGAIKASRFLLNKKIKIEGQKSVARKVVRKPKQDEVPSAEVSAEDLDNYETLGDYISEMRKLKALKKRQTDVSNTTKIMQKVDKSKISKKWLPCKPEPEDKPLFDIIVETCPIAEITDSLSDLPIKDEFLNEAQNYLDGPDNAQGLVVKLEPVDETTQKFLDGLSERPASAQKLGLQQKCSISTCTNVTNYPGCVFFRYPPNIRLCSIWARHCQESTKFIVTLTKKRKVCAEHFEEQCFVEKRLLLGAIPTLRLAHENSKLDDCGELLKSFRNLRCRMDECQRSVELDQINKIPFPQGGELKQKWCFNLNLNEADVAPHDWICHKHFERRVLIKNKRTKDNAVPTLLLGSQAKPAEELYKNPEYINQQQYFIQLRQVCCVLSCPNTRQMPGVRLTTFPKRKDIYEKWLHNLDLEDALEVRTGYHICWQHFEEICYTKYNYLKIGSIPTLKLDKQDDLMALDTKQLVEVDSFGKRVKSHMQHKKYKCFYPECTEWKTQVYKAPEIDELRTLWLDSLELQVLTLNTDQDVHFCDEHFYMLYKKFEHQLPLINSELYGNKLESLRKTFQELAAKGKFFTKQCLVPQCPTDHLFEPYKDIRLYNFPQNAEISSKWCYNCSIDYSKLEKDKYNYYKICDRHFESYCFNKRSIFSWALPTLNLPDTRPVEILENDPEDKNAYTGECCIRSCINANGYKLESKTRLYKFPENKETLEKWLHNVNCENFCENETKICGLHFRCSYIKKRKLTEEAIPTLRLGHFNETELITKVKVDPVIVIKEEPTDELMQEDVTRASSVEEEEDKEMLMEQKVNVIEGWNDHDYCHEIKPHKSPNAQKFEVIAIKQEIIEDQYGLDEQQQQYETPTIKQEIIEIEESHTYDQQEEYIYFDENVVGENVTLPPAEPTTSSHSMYRSLVISEVKSHIFLCCVQKCTNSSESKDIKLYTEFPNDSEIFIKWCFNIKIDPRNYKENQYAICSQHFDSICFSDSDLTLHTWAVPTLNLNLPENSFIHHNDPPSEQCIVYGCIQPTQPLYKFPLRLDLSQKWFANLKLELTDYRAHSYRICRRHFAPECFDTNHVLKTEAIPTLYLGHADSIAHWNAFEVRTHEQEGGGDGVGVGAGDIIAGFVGGLDNSRGSSQGSIVRHLISPHDLEDHDSSYFEDFEECYGADD

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00816591;
90% Identity
-
80% Identity
-