Basic Information

Gene Symbol
-
Assembly
GCA_035045865.1
Location
JAWNOQ010000083.1:4015181-4029649[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 32 10 3e+04 -7.5 5.7 21 67 393 444 374 459 0.54
2 32 5.1e-15 1.5e-11 45.6 4.4 1 86 594 666 594 667 0.85
3 32 1.3e-14 3.8e-11 44.3 5.2 1 87 694 763 694 763 0.83
4 32 3.8e-15 1.1e-11 46.0 0.3 1 87 786 858 786 858 0.84
5 32 7.7e-16 2.3e-12 48.2 5.5 1 87 967 1037 967 1037 0.82
6 32 6.1e-15 1.8e-11 45.3 3.2 1 86 1061 1132 1061 1133 0.81
7 32 7e-13 2.1e-09 38.7 0.6 1 87 1168 1236 1168 1236 0.81
8 32 1e-10 3e-07 31.8 1.6 1 86 1279 1348 1279 1349 0.77
9 32 2.5e-16 7.5e-13 49.7 0.4 1 86 1376 1445 1376 1446 0.83
10 32 1.8e-12 5.4e-09 37.4 2.3 1 86 1467 1536 1467 1537 0.81
11 32 9e-15 2.7e-11 44.8 1.6 1 86 1564 1635 1564 1636 0.85
12 32 6.6e-13 2e-09 38.8 2.0 1 85 1716 1784 1716 1786 0.82
13 32 3.7e-12 1.1e-08 36.4 0.1 1 86 1810 1878 1810 1879 0.82
14 32 8.9e-14 2.6e-10 41.6 2.8 1 87 2005 2074 2005 2074 0.80
15 32 4.8e-11 1.4e-07 32.8 0.3 1 86 2157 2223 2157 2224 0.82
16 32 0.035 1e+02 4.4 0.0 1 58 2243 2290 2243 2313 0.73
17 32 1.4e-12 4.1e-09 37.8 0.2 1 86 2320 2389 2320 2390 0.83
18 32 9.4e-14 2.8e-10 41.5 1.2 1 87 2456 2526 2456 2526 0.82
19 32 2.6e-12 7.8e-09 36.9 1.0 1 86 2561 2632 2561 2633 0.80
20 32 3.3e-11 9.9e-08 33.3 0.4 1 87 2645 2718 2645 2718 0.78
21 32 5.8e-13 1.7e-09 39.0 0.2 1 86 2744 2816 2744 2817 0.81
22 32 3.2e-07 0.00094 20.6 0.5 1 58 2853 2904 2853 2921 0.86
23 32 8.2e-13 2.4e-09 38.5 0.1 1 87 2942 3014 2942 3014 0.81
24 32 1.1e-16 3.2e-13 50.9 2.8 1 86 3066 3137 3066 3138 0.83
25 32 5.2e-05 0.15 13.5 0.2 1 58 3169 3218 3169 3237 0.79
26 32 4.2e-13 1.3e-09 39.4 0.3 1 87 3256 3328 3256 3328 0.82
27 32 1.1e-14 3.2e-11 44.5 0.4 1 87 3471 3544 3471 3544 0.83
28 32 2.2e-12 6.4e-09 37.2 2.4 1 86 3609 3679 3609 3680 0.81
29 32 1e-14 3.1e-11 44.6 4.5 1 86 3783 3853 3783 3854 0.85
30 32 7.4e-13 2.2e-09 38.7 0.1 1 86 3934 4003 3934 4004 0.85
31 32 1.4e-11 4.1e-08 34.6 0.5 1 58 4030 4079 4030 4095 0.86
32 32 1.2e-10 3.4e-07 31.6 1.1 18 87 4096 4155 4085 4155 0.77

Sequence Information

Coding Sequence
ATGTCACAACATAATCCACATTATCATCCCCACCCCCATCCCCTACActatcagcaacaacagcagcagcagcagctgcatcaCCACCATACCTCtcttcaacagcaacaacataaacaaatacaacacAGCAATTGGTACTCACATGTTGCTTCCACCTCTTCCGCTCCCTACCCTCATCACCCCTCCTCGACCACCTCATCGGTGGCGGCGTCAACTTCAGGCGCTAACAACAATCACATAATGAATGCCTATGGAACACATGGATATTATGGTGCCGCTGGCGGTGGCCTCAATGTCAATGCTGTGGGTGTAGGTGTtgggggtggtggtggtggtgggggaAGTTCAAACAGTTATAACCTTGAGGCGGCCAATACAGTGGCCTATGCCCACAACCAGCTGCTGCagtatcaacaacaacaacagcatcaacaacaacatcagcaacagcatcaacaacaacaacagcagcagcaacagcaacaacaccaacatcaTCTCAATGCAAGATCTTATATGGGAGGTCATCATCATGGTATATATCCCTATATTAAAAGTGAACCCATGGAATATACCCATAACACAATGGCTCCACCTCCAGCACCTACTACAGCAACCACAGAAATGAGAATTAAATCGGAACCCATTGACGAACTGGCCTACAAATCGTCCAATTATATTGATGATAATACTCCATTTGCTGACTTTTCGAAATATAATGAATTTAGTGAGAATATGTTGAGTCCCAAAGTGGAATTAACTGTGAAAAATGAATCACCCTACGGCAAgcATCCTAATAATTATCCACGGCGTAAATTACAAACGGAACGCTCATCGGAAAATTTACCCATATGTCAACGTTGCAAAGAAGTCTTCTTCAAGAAGCAATCGTATCTACGTCATGTGGCCGAAAGTAGTTGTAGCATTCAGGAATATGAATTCAAATGCAACATTTGTCCCATGTCCTTTATGAGTGGCGAAGAATTGCAAAGGCATAAACATCTCCATCGGGCTGATAAATTCTTTTGTCATAAATATTgtggaaaatattttgatacaATTGCCGAATGTGAATCCCATGAATATATGCAACATGAATATGATAGTTTTGTTTGTAATATGTGTTCGTTGACATTTGCCACCAGGGAGCAGCTTTATACCCATTTACCACAACATAAGTTCCAGCAGCGTTACGATTGTCCCATTTGTCGTTTATGGTATCAGACGGCTGTCGAACTCCATGAGCATCGTCTGGCGGCACCTTACTTCTGTGGCAAATATTATaatcagcagcatcatcatcagtcacaacagcagcagcaacatcaccatcagcagcaacagaatcaCCAACAACAGACGCatcaaacaaattataaattgcAGGATTGTCATATGGCTACCATGGAAATGCCCACAGCACCACCGCCATCATCAGCGGTAACACATCACAAGTCTAATGCATCCGGAACATCTTCTACATTACCAGCAACGGCAGCTTTGAGTTCTCTGCTCCAACAACGTCAGGCCAATGCAGATGGTGCGGCcatgtttgctgctgctgcctcctCAACATCCCTCAAAGGGGAAGTCAACGTGAAGTTGGAACGAAGTTATAGCAACTCCACAAGTGACTCTTCTTTTGGTGGAATGCATGAATCCaactataataataataataatgcctATGGCAGTGATAATTCCATTCATGGATCTGGTGCCGTTGGTGGGCCACAAGCTCATTCCTCAACGCTGGATGACTCTGAGGATGCTCTATGCTGTGTGCCCATGTGCGGTGTAAGCAAAAGCACTAGTCCCACACTCCAGTTTTTCACATTCCCCAAAGATGACAAATATCTCCATCAATGGCTACACAATTTAAAGATGTTCCACATACCCGCCTCAAGCTATTCGACATTTCGTATCTGTAGCATGCATTTCCCAAAACGTTGCATCAATCGGTATTCGTTATGCTATTGGGCAGTGCCTACCTTCAATTTGGGACACGATGATGTCGCCAATCTCTATCAGAATCGCGAGCTAACAAATACCTTTACCACCGGCGAGGTCGCACGCTGCAGCATGCCGCACTGTAATAGCCAGCGGGGTGAGAGTAATCTCAAGTTCTATAACTTTCCCAAGgatattaaaagtttaatcAAATGGTGTCAGAATGCTCGGCTGCCTGTTCAGGCCAAGGAGCCCCGACACTTTTGTAGCCGTCACTTTGAGGAGCGTTGCATTGGCAAATTTCGTTTAAAACCCTGGGCAGTGCCCACACTACATCTGGGTGGTGCCCAATATGGGAAAATCCATGATAATCccaaaaatttgtatgtagAGGAGAAGCGCTGTTGTCTTAACTTTTGTCGTCGCAGCCGTTCAACGGATTTCAATATGTCGCTTTATCGTTTCCCAAGGAATGAGGTATTATTACGACGCTGGTGCTATAATCTGAGACTCGATCCGGGTGTATATCGGGGCAAGAATCATAAAATATGCAGTGCACACTTTATTAAAGAGGCATTGGGTTTAAGAAAACTGTCGCCGGGTGCTGTTCCTACACTTCATTTGGGTCACAATGATACCTTTAATATCTATGAAAATGAATTATGGCCACCGCCGACGCCAAGTTCCTCAACGCCACATCATcagcaccatcatcatcagcagcagcagcagcaacatggcCATGGGCATGGTCATgcacagcaacatcatcatcatcacaacAAAGCAGCGTATCATCGTCAAACGGCAGCTTCGACTTCATCATCGGCTAGCTCAACTTCGCACTACGTGGATCCGGATAATATGGGCAGCGGAGCATATCTTGGCATGGGTGGTGCTAACTCCCTTTCTGGTGGAATGAATGTCAGCGATAGCATGGACATTTGCTGTGTACCAAGTTGTGAGAGTAAGCGACATAATAGCGAGAACATCACATTCCATACGATACCCAGAAGGCCCGAGCAGATGAGGAAATGGTgtcacaatttaaaaatacccGAGGATAAAATGCACAAGGGCATGCGGATATGTAGTCTACATTTCGAGCCGTATTGCATTGGCGGCTGCATGCGTCCATTTGCAGTGCCAACTCTTCATCTGGGACATGACGATAAGGATATTCATCGTAATCCGGATGTGATTAAGAAACTTAATATAAGGGAAACTTGTTGTGTGGCAGTCTGTAAAAGGAATCGTGATCGTGATCATGCCAATCTCCATCGGTTCCCTAGCAATGTGGCCCTATTAACGAAATGGTGTGCCAATCTGCAAAGGCCTGTCCCAGATGGCAGTAAACTCTTTAACGATGCCATATGCGAAGTGCATTTCGAAGATCGTTGTTTGCGCAACAAGAGATTGGAGAAATGGGCAGTGCCGACGTTAATGTTGGGTCATGAGGATATTGCGTATCAGTTGCCCACATCCGAGCAAGTGGCAGAGTTCTATGCACGTCCAAATGCACCGAATAATGGCGAGGAGCAGGGAGAATGTTGTGTGGAAAGCTGTAAGCGTAATCCCAGTGTGGATGACATAAAACTATATCGTCCACCCGAAGAGTCAGATATACTGGCCAAATGGGCGCATAATCTTGAACTGGATGTGGCCGAGTTGCCAAATATGAGGATATGCAATCTACATTTCGAATCCCATTGCATTGGTAAACGGATGAGACCGTGGGCCATACCAACATTAAACCTATCTTCTAATATTGAGAATCTGTACGAGAATCCAGAGCACTCAATGTTGTACAAGAGGAGAACGAAGCGAGATCCAAATCGAGACGTATCCCTAGCGGCAACGAAACCAACTTGGGTTCCTAGATGCTGTTTGCCGCATTGTCGCAAGGTCCGAGCTCTGCATAATGTTCAACTCTATCGATTCCCCAAACTGAATCGTTCCACATTGGCCAAATGGGCACACAATCTACAAGTGCCAATGGTGGGCAGTGCCCAACGGAGACTCTGTTCGGCACATTTCGAACCTCATGTATTAAGTAAAAAGTGCCCTGTACCATTGGCTGTACCCACGATCGATTTAAATGCCCCGCCAGGTTACAAAATCTATCAAAATCCAGCCAAACTTAAAGCCAGCAAATTGTGCCTGCAAAGAGTTTGCATTGTGGAGAGTTGCCGTCGCACCAGGGCTCAAGGAGTCCAGCTCTTCCGTTTGCCTCACAGTCCGACGCAGTTAAGGAAATGGATGCACAACATCAAGACACGTCCACGGGCAGCTACAAGATCGCAGTATCGCATCTGTTCGATACACTTTGAATCGCATTCGTTTAATGGCAAAAGATTAAGTGCTGGAGCCATTCCCACCTTGGAATTGGGTCATGACGATGACGACATCTATCCGAATGAGGCACAAGCATTTGTGGATGAGCATTGTGTGGTCGAGAGTTGTGAATCGTCAAAGGATCAACCCGAAGTGCGTTTATTCCGTTTCCCCACCGAAGATGATGATCTTCTGTGGAAATGGTGCAACAATCTCAAAATGAATCCAGTTGATTGTGTAGGAGTGCGTATTTgtaataaacattttgaagCTGATTGCATTGGTCCCAAACACCTATTCAAATGGGCCATACCCACTATGGAGCTGGGACACGATGACAGTGAAATCGAACTGATACCAAATCCCAAGCCTGAAGAGCGATATGTTGATCCAGTTTTTAAGTGTTGTGTACCAACTTGTGGCAAGACCAGGAAATTTGATGAGGTGCAAATGAATAGTTTTCCGAAAGATCCTTTGCTCTTCCAGCGCTGGCGTCACAATCTGCGTTTGGATCACCTGAATTTTAAGGAGCGGGAACGCTACAAGATTTGCAATGATCACTTTGAGGATGTTTGCATTGGCAAAACTCGACTTAATATAGGCTCCATACCCACCCTTCAGTTGGGTCACAATGAGACGGAGGATCTGTATCAAGTCAATCCTGCGGAATTGCAAAGTAATCTCTTTGGCAGACCACGTAGATTACATGGTGGGGTTGACATTAAGCTAGAATATGCGGAGGATTCCGAGGCAGAATCAGGACTGCAGGATGTTAAACCAAATATCTATGAGATGGCCGAAGCCACCGATATAAATATCAGGCAGGTGAAGATTAAGAAATCTCTCGCTGATCTAAAGTGTTGTGTACGCAGCTGTGGTCGTAGTCGCCTGGAGCATGGTGCTCGCCTCTTCCCCTTCCCCAATGGCAAGCAACAGAATCTGAAATGGCGTCACAATCTCCAACTTGAACCGGAAGAAGTGGACAAAATGACACGCGTCTGCAGTGCGCATTTCAATCGGCGTTGCATAGATGGCAAACATCTGCGGGGATGGGCCATACCCACACAACAATTGGGACACCATCATGAACAGCCAATTTATGAAAATCCCAAAAATATTCCAGGCTTCTTTACCCCAACATGTGCCCTAAGCCACTGTAGACAGAGGCGAAGCATTGATAATGATTTGCGCACCTATCGCTATCCGAGAAGTGAGGATCTATTAGAGAAATGGCGTGCCAATTTACGTTTGGCGCCAGATCAATGCCGTGGACGGATTTGTGCTGATCACTTTGAGCCGTTGGTTAGGGGCAAACTGAAATTGAAGACTGGAGCAGTGCCCACTCTGAAATTAGGACATGATGAGGAATTAGTTTACGATAATGAAGCTATCAAAGCTAATCTAGTGGATGAAGAGGATGTCAGTTTGGAATCACCACCGCAAGTAATAACTAAAAAGGAGATTTTGGAAGAGgaagatgatgaagaagaTCTGCAAGAgcatgaggatgatgatgaggaggaggaggaggaagaaaACGATCCACCAGAAGAGGATTCACATTCCGATTATTTCGATCCCCTAGAATTGGTAGAGACATATGCCGATGATCAAGTACCAGAAGATGAATATAGTGCACCCGCTCATCAACTCCCGGCACCACCATCAGTAGCTGCTCCACCTTTTGGCAGGCGTGAAAAGGTGGCGAATAATGTAACACCCATTTGTTGTTTGAAGCATTGTCGAAAGGAACGCACTCCCACCCATCACTTGAGTACTTTTGGCTTTCCCAAAGATCATCAGCTTTTGCTGAAATGGTGTGCCAATCTTCACCTGGAACCCATGGATTGTGTGGGACGTGTTTGCATTGAGCATTTTGAAGCGGAAATGTTAGGAACACGCAAGCTAAAGCAAAATGCTGTTCCCACCATTAATGTGGGACATCAGATGCCTTTACCGTATACCTGCAACGGCCAGGAGCGTAGCGATGAGAAGGAGGATAATTCGGTTTTTCGGCTTTGGAGCCTGAAACATTGTCGCAAGAGGAAACTAATGGAACCACCAGATATTCGCCTAAAAGTGGAGAAGATGGATCCGATGGGTCTAGTGAAAGTGAAGAAggagaaaatggaaatggaggaGGAGAAAgagacaatgatgatgatgactaaACCTAAGAGATGTTGCCTTAACCAATGTGAGCAAACTGCagaattgcagaaatttccaaGAGATTTCAATTTGCTAAGAAAATGGTTGCACAACCTCAAGTTGACCCTTAACGAGGATTTGGATCCCTCACAGCTGCGTTTGTGTCTAAGGCACTTTGAAGGTCATTTGGTACGAAATGGACATCTTTCAAAAGAGGCATTACCCACTCTGGAACTGGGTCATCAGGATAAGAATATTTATAGAACAACTGTAGCAACTTCTGGTGGTTGCTTGGTGGCCAGTTGTCCATGTGCTCGTCTCAATCTCTATCGAAGTTATGCTCTACCCAAGGAGCCCTATATTAAAGAGGCGTGGCTAAACTATCTAAAGCTGCCGGCAATCACCCATGGACAACTCTGTGTAATGCACTATATGCAACTGTACGAGGAGATGCCGTTCAAGGAATTGCGTCATATCTATGAATCCATTGCCAATTCCACACAGGCTCTGAAATTGCGCTGTGCCGTACCCGGCTGTCGATCAAAGTACACGGATAATATACACTTGACCAAGTTGCCGCAAAATCAAAGCTTACTTACCAAATGGTTGCATAACACCATGTTGACCTATGATCCCAGCAAACATTCAATTTATCGCATTTGTTTGCTGCACTTTGAGCCATTCGCATTGGGTCCAGCATGTCCCAAGCCATGGGCAGTACCCACCTTGGAATTAAATTATCAGAATGACATTTATTTGAATCCTTCGAAAGAGGAATTGGCTAACATAACAGACTATCCCCGAATTAGTACTCCGCTGCAAATTAAAACAGAATTTACTTTACCATTGAGAATAAAAACGGAATTAGCCGCCTTAAGCAGTCCCAGTGTTGGTTCCACACCTAGTCCACGGGGCAAGGTCAGAATTTGTTGCATACAATCATGTCTGCAGCAAGCCAATTCCCAGTTACGTCTCTATCGTTTTCCCAATACAGAATCCGCTCTACTCAAGTGGCTGGTCAATACGCAGCAGCAACCACGTCTTGTGGATCCCACACAGTTGTATGTGTGTCAATCCCACTTCGAACTTGAAGCTATCTGTAAGAAACAATTGAGAAGTTGGGCTGTGCCCACATTAAATTTAGGACATGATGGTCATGTCATACCCAATGCCAGGCATAATGGAAATATTGCCGATAGCCAGGAAACGGAACAGGCAATGGAATTTATTAGGGAAAACTATTGTTCCGTGCTAAGTTGCTTTCAGCCAAAGAGTGAGGCTCTGCGTTTGCATCCCTATCCCAAGGATATGCCTACCATACGGAAATGGGCTGCCAATTGTAAGCATCGTTCCATGCAGGCCAGCAGTCATGGATTCCAGGTCTGTCAATTGCATTTTGAAGCAGATTGCTTTCATCCGGATACTGGTGACTTACGTGAGGGATCTGTACCCACTCTGGATCTAACAGTGACTCGGCTAAACAGCGAGTTGCGTTGCCTGGTCACTGGCTGTGTCAAAGATGAAACTCAGCCGCGACGTCGTTACTACAAACTACCTAAGCGACCTGCTTTGCTCAGTGAATGGTGCAGAAATCTCGGTTTAGTTCCTTCTGGACTCCTACATGGTGCTGATCATCACGTTTGCGAACGTCACTTTGAATCTCGTTGCTTCAACATCCACAAACAGTTGCGTTCAGGATCACGTCCGACCCTGAATTTGGGTCACAATGAAAATATTACGTTGCTGCCAAATCCAGAGATATTCTGTGATGAGATTGACGACGTCAGTACTTGCTCTGTGCCAAATTGTGGTCAATCCAAGCTAACGGATGAAACACTTCAACTAAATAGTTTGCCCAGAATGCGTAAGTTGGCGGAGAAATGGTTGCATAATCTGCATCTACCATACACTGGAAAGGAGCAACTGGCCAAGTTTCGTGTCTGCCAGAAACACTTTGATCCATCTTGCTTTGAAAACGGGTTTTTGCGTCAGGGAGCCCTGCCCACCTTGGAGTTGGGTCATGAGTCTGTGGACATTTATCAAACAGATGACCAGAGTGTGGGCAAATACAGAAAGCACCAAAAAGTATTGTCTGGCGTACGTGTATCGGGGCACGACTGTTGTTATCCCCAATGTGTGCAACAGCAAAAGAATTACCAACGAATGGTGTACGACTTGCCCAAAGAGGAGAAGCTGCGTCAGAGATGGCTACAGCATTTGGAAATTGATGaaagagaaagggaaagaCCTTTGATATTATGTCCACTCCATTATATATTCCTATACGATTATAGTGTGAAAAACTTTGAAGAACATGTTCCAAATGATCTGCTGGAAAGCAACTATGAAGATGCAAGAAATGGCTCTAGAATCCGGCTTATCAGTTGTGCTGTGCGAGGATGTGGAACACTTCAGCCACGTGATGGTGGCAGATTGCATGGTCTGCCCACGAATCCAGAGATCTTCCAGATGTGGTTGGATAACACTGAATTGGTTGTATATGAGCCACAGCGTTACATGATCAAAGTCTGTAGCAAACACTTTGAGTCTATATGTTTTACGGATATTCGCAAATTGAAATGCTGGAGTGTGCCCACTCTTCATCTACCCGGTGAGGCAGTGCATCAAAATCCAACCGAAGAGGAATGGTTAAAGATAAACGAAAGAATAGCTGTATCAGCCGCTCAGCCAGGGGAACCCTGTGAGGACAATTCAATGCTGGAACCAGTTGTTATAATGGAAGAAGAGGACTGTGTCTGTTGTGTACCCAATTGTGGACGGTCCAAGCAAATGGATAATTCCATTCAGTTTACAAGCTTCCCCAAGAACAACATGCTGGCCGAGAAATGGATTCTTAATTTTCATCTGAAAGTGACCAAAGATCAGTGGTCCAATCTTCGTGTATGCAATCGGCATTTTGAGACAACTTGTTGGGAAAACGGTCGATTGCGAAGGGGAGCCATGCCGACCCTAGAATTGGGTCATGAGAGCAGTGATATTTATCAAACCGACGAGCTAGATCTCTTCAAGAGTCGCAAGCAAACCAAGAGGACATATGGCCAGGGATGTTGTTTTCCTCAGTGCGTGGaacttttaaagaatttcCAACGTATGGTCTATGATTTGCCAAGAGAAGCTCAACTGCGACAACGCTGGTTACAATATATGGAATTGACGGAATCAGAGCAGCCATTAAAAATGTGCCCACTCCATTATATTATTCTATATGATCACAGTGTAAAAAACTTTGAGGAACATGCTCCGGAAAAGCTGcttgattttaattatgaaaatgcTAGAAATTGTGTGAGAATTCGGATTATTAGCTGTGCGGTGGAAGGATGTAATACACTGCAGCCACGAGACGGAGGTCGCATGCATGGTCTGCCACCAAGATCAGATATACTCCAGATGTGGCTGGACAACACAAGATTAGTCTTCCATGAGCATCAACGTTACATGCTAAAAGTGTGCAGTAAGCATTTTGAGCCAAAATGTTTTACGGATATTCGTAAATTGAAGAGCTGGAGTATTCCGACGCTTCATCTGCCCGATGAGGTTGTGCATCAAAATCTCACCGAAAGAGAATGGCAGCAAATGAATGAGAGACTTGCCGTGCAAAACAATCGGGAAGAGGAAAgttttgatgaaaactcaaTGCTAGAACCGATTGTTATGATGGAGCACGCCGAATCCGAAGCGGAAATGGAGGAGCAGGTCGAAACCATGCCTCAGCAAAAACTAGTGACCCATGATAAATTAAAGCACGAGTCCCAAGATGATAATGgcaataatgatgatgaaatgCAAGCATTGGAAGTACTCCTCGAAGTGGGTCATGTTGAAAAATGTTCCAGTTATGAGAAAATGGACAATAAATCACATTTACCATACTCCGAGACGAGTCCATTGAGTCCTTCGATGGGATCTATGCCACCGGGTCAACGCGGTGGTCATTATAATGCTCGTCACTGCAGTGTCCAGGGCTGTCAGATAACTGCCAATGATGTAGACGGTAATATCAAGCTGCACAAGTTCCCCACCTCCGTGGAGGCCACTGAAAAGTGGATGCATAACACCCAGGTAGATGTGGATGAGAACTATTCCTGGCGGTATCGCATTTGCAGTTACCATTTCGAACAGGAATGCTTCAATGGGGCCCGTATACGGCGGGGATCTATGCCCACATTGCATTTGGGTCCACTTCGACCCAAGGATATCTTTAGGAATGAGTTCCCGcaattggaaatggatgaAACTATGGAAGAATCAATTCCTAAAGTTACTCCCACTGTTGAACAGGAACCTGGGGCTCAGCCTATAAAGAGTAAGGTGACACAACTATGCCTGCCACGTCCTGCTCCGCCTCGAAAATCGAGCAAATTCTGTCAGATTGAAGGCTGTTCGAATCATTTGACCAGCGAGAATATGACTTTGCACAAGTTTCCCCACTCCCTGGATATGTGTGCCCGCTGGCAGCACAATACTCAGGTGCCATTTGATCCAGAGTATCGTTGGCGCTACCGCATCTGTAGTATCCATTTTCATCCAGTCTGTTTGGTCAATATGAGATTATTGCATGGCAGTGTGCCTACTTTAAAACTGGGCCCTAGAGCTCCCGCTCAACTGTTTGACAATGATTTCGATGCCATTAATATGAGATTGGATAAGAGATCACATTTGGAGCAGGGAGGTAGCAAGGTCAAGCAAGAGAGACCCCACCATCAACAGCAATCCGATGAATTCTATTTAGAGccagaaatggaaatggaagtAGATGATGAGGAGCAAGACCCAGATCAATCCCAATCCATGACATCATTTGAAAGCTGGAGACATCAACTTCGCCTACCAACTGTTAAGCAAGACAAGGTCGCCTACAATCCCATCAAATCTGGCTACGATAAATGCTCCCTAACACACTGTCAGCGTCAGAGATCCCTGCATGGCGTCCACATATACAAATTCCCACGATCGAAACGCCATCAGCAGCGATGGATGCACAATTTGCGCATACGTTATGATGAGAAGAAACCATGGAAATACATGATCTGCAGTGTTCACTTTGAACCAAATTGTATACGCCTGAGAAAACTTCGTCCATGGGCTGTGCCCACTTTGGAATTGGGTTCGAATGTGGCAGATCAGATTTACACCAATGAACAGTGCCAGGAAATGGCTTCAGATGTAAGTGAAGAAGAGGAAACCGGACCAGAAGAAAGTGGAcaagaagaagatgatgacgatgaagtAGATGACGATGGAGATACTGGTGCAGAGGCCCACATAAAGCGTGAAAGACGCCCTTGGGGAACGTCCGGAGCCGCCGGTGGTCAAATGGCTCCTTGGAAAGTAAAACAATGTTGTCTGCCCTATTGTCGTCGACCACGAGGGGATGGTATCAAACTATTCCGACTGCCCGGCAATCCTACTTCCATACGTAATTGGGAAAAGGCCACGGGGATGACATTTAAAGCATCGCAACGGAACACACGACTCATTTGTAGTCGTCACTTTGAGCCGGAATTGATGGGGGTACGCCGTTTGATGCGGAATGCCATACCCACCAGACATCTATATCACCAAAGGGACAGCTATAGCCCAGAATTGGTGATACCCACAAACACTCCAACTCCTATTGGTCCCCGTTGCTGCATTCCTGATTGCCCCCCACACGATGGGTCGTCTCAACTTCATCGATTTCCCAGTGatcCACAATTGTTGAAGCAATGGTGTGAATCTCTTAAACTTACGGATTTCCAACGCTATAGTGGACAATACGTTTGCTCTAATCATCTTCCCGCCCAGGATTTAGCATGCATTATCTGTGGCGTGGATGATATACAATTGCCGCTTCTTGATTTTCCCGAGAATCGCAATTATCGGGCTAAATGGTGTTATAATCtcaaaattgaaacaataCCCAAATGGGACAACTCCAAGCATATTTGCTCGAAACACTTTGAATCCTATTGCTTCAGTCAGCAAACCGGTGAACTGCATCCAGAGGCAGCACCTACATTGCATTTAAATCACAATGATACGAATATATTCCTCAACGAGTATGCCATAGAACAGCATTCTTTGATGAGGATTAAAGACGAGCCCTTGGACAACGATGAGATGTTGTTGGCTTAA
Protein Sequence
MSQHNPHYHPHPHPLHYQQQQQQQQLHHHHTSLQQQQHKQIQHSNWYSHVASTSSAPYPHHPSSTTSSVAASTSGANNNHIMNAYGTHGYYGAAGGGLNVNAVGVGVGGGGGGGGSSNSYNLEAANTVAYAHNQLLQYQQQQQHQQQHQQQHQQQQQQQQQQQHQHHLNARSYMGGHHHGIYPYIKSEPMEYTHNTMAPPPAPTTATTEMRIKSEPIDELAYKSSNYIDDNTPFADFSKYNEFSENMLSPKVELTVKNESPYGKHPNNYPRRKLQTERSSENLPICQRCKEVFFKKQSYLRHVAESSCSIQEYEFKCNICPMSFMSGEELQRHKHLHRADKFFCHKYCGKYFDTIAECESHEYMQHEYDSFVCNMCSLTFATREQLYTHLPQHKFQQRYDCPICRLWYQTAVELHEHRLAAPYFCGKYYNQQHHHQSQQQQQHHHQQQQNHQQQTHQTNYKLQDCHMATMEMPTAPPPSSAVTHHKSNASGTSSTLPATAALSSLLQQRQANADGAAMFAAAASSTSLKGEVNVKLERSYSNSTSDSSFGGMHESNYNNNNNAYGSDNSIHGSGAVGGPQAHSSTLDDSEDALCCVPMCGVSKSTSPTLQFFTFPKDDKYLHQWLHNLKMFHIPASSYSTFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCNSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSTDFNMSLYRFPRNEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPSSSTPHHQHHHHQQQQQQHGHGHGHAQQHHHHHNKAAYHRQTAASTSSSASSTSHYVDPDNMGSGAYLGMGGANSLSGGMNVSDSMDICCVPSCESKRHNSENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDKDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRPVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLMLGHEDIAYQLPTSEQVAEFYARPNAPNNGEEQGECCVESCKRNPSVDDIKLYRPPEESDILAKWAHNLELDVAELPNMRICNLHFESHCIGKRMRPWAIPTLNLSSNIENLYENPEHSMLYKRRTKRDPNRDVSLAATKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTIDLNAPPGYKIYQNPAKLKASKLCLQRVCIVESCRRTRAQGVQLFRLPHSPTQLRKWMHNIKTRPRAATRSQYRICSIHFESHSFNGKRLSAGAIPTLELGHDDDDIYPNEAQAFVDEHCVVESCESSKDQPEVRLFRFPTEDDDLLWKWCNNLKMNPVDCVGVRICNKHFEADCIGPKHLFKWAIPTMELGHDDSEIELIPNPKPEERYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPLLFQRWRHNLRLDHLNFKERERYKICNDHFEDVCIGKTRLNIGSIPTLQLGHNETEDLYQVNPAELQSNLFGRPRRLHGGVDIKLEYAEDSEAESGLQDVKPNIYEMAEATDINIRQVKIKKSLADLKCCVRSCGRSRLEHGARLFPFPNGKQQNLKWRHNLQLEPEEVDKMTRVCSAHFNRRCIDGKHLRGWAIPTQQLGHHHEQPIYENPKNIPGFFTPTCALSHCRQRRSIDNDLRTYRYPRSEDLLEKWRANLRLAPDQCRGRICADHFEPLVRGKLKLKTGAVPTLKLGHDEELVYDNEAIKANLVDEEDVSLESPPQVITKKEILEEEDDEEDLQEHEDDDEEEEEEENDPPEEDSHSDYFDPLELVETYADDQVPEDEYSAPAHQLPAPPSVAAPPFGRREKVANNVTPICCLKHCRKERTPTHHLSTFGFPKDHQLLLKWCANLHLEPMDCVGRVCIEHFEAEMLGTRKLKQNAVPTINVGHQMPLPYTCNGQERSDEKEDNSVFRLWSLKHCRKRKLMEPPDIRLKVEKMDPMGLVKVKKEKMEMEEEKETMMMMTKPKRCCLNQCEQTAELQKFPRDFNLLRKWLHNLKLTLNEDLDPSQLRLCLRHFEGHLVRNGHLSKEALPTLELGHQDKNIYRTTVATSGGCLVASCPCARLNLYRSYALPKEPYIKEAWLNYLKLPAITHGQLCVMHYMQLYEEMPFKELRHIYESIANSTQALKLRCAVPGCRSKYTDNIHLTKLPQNQSLLTKWLHNTMLTYDPSKHSIYRICLLHFEPFALGPACPKPWAVPTLELNYQNDIYLNPSKEELANITDYPRISTPLQIKTEFTLPLRIKTELAALSSPSVGSTPSPRGKVRICCIQSCLQQANSQLRLYRFPNTESALLKWLVNTQQQPRLVDPTQLYVCQSHFELEAICKKQLRSWAVPTLNLGHDGHVIPNARHNGNIADSQETEQAMEFIRENYCSVLSCFQPKSEALRLHPYPKDMPTIRKWAANCKHRSMQASSHGFQVCQLHFEADCFHPDTGDLREGSVPTLDLTVTRLNSELRCLVTGCVKDETQPRRRYYKLPKRPALLSEWCRNLGLVPSGLLHGADHHVCERHFESRCFNIHKQLRSGSRPTLNLGHNENITLLPNPEIFCDEIDDVSTCSVPNCGQSKLTDETLQLNSLPRMRKLAEKWLHNLHLPYTGKEQLAKFRVCQKHFDPSCFENGFLRQGALPTLELGHESVDIYQTDDQSVGKYRKHQKVLSGVRVSGHDCCYPQCVQQQKNYQRMVYDLPKEEKLRQRWLQHLEIDERERERPLILCPLHYIFLYDYSVKNFEEHVPNDLLESNYEDARNGSRIRLISCAVRGCGTLQPRDGGRLHGLPTNPEIFQMWLDNTELVVYEPQRYMIKVCSKHFESICFTDIRKLKCWSVPTLHLPGEAVHQNPTEEEWLKINERIAVSAAQPGEPCEDNSMLEPVVIMEEEDCVCCVPNCGRSKQMDNSIQFTSFPKNNMLAEKWILNFHLKVTKDQWSNLRVCNRHFETTCWENGRLRRGAMPTLELGHESSDIYQTDELDLFKSRKQTKRTYGQGCCFPQCVELLKNFQRMVYDLPREAQLRQRWLQYMELTESEQPLKMCPLHYIILYDHSVKNFEEHAPEKLLDFNYENARNCVRIRIISCAVEGCNTLQPRDGGRMHGLPPRSDILQMWLDNTRLVFHEHQRYMLKVCSKHFEPKCFTDIRKLKSWSIPTLHLPDEVVHQNLTEREWQQMNERLAVQNNREEESFDENSMLEPIVMMEHAESEAEMEEQVETMPQQKLVTHDKLKHESQDDNGNNDDEMQALEVLLEVGHVEKCSSYEKMDNKSHLPYSETSPLSPSMGSMPPGQRGGHYNARHCSVQGCQITANDVDGNIKLHKFPTSVEATEKWMHNTQVDVDENYSWRYRICSYHFEQECFNGARIRRGSMPTLHLGPLRPKDIFRNEFPQLEMDETMEESIPKVTPTVEQEPGAQPIKSKVTQLCLPRPAPPRKSSKFCQIEGCSNHLTSENMTLHKFPHSLDMCARWQHNTQVPFDPEYRWRYRICSIHFHPVCLVNMRLLHGSVPTLKLGPRAPAQLFDNDFDAINMRLDKRSHLEQGGSKVKQERPHHQQQSDEFYLEPEMEMEVDDEEQDPDQSQSMTSFESWRHQLRLPTVKQDKVAYNPIKSGYDKCSLTHCQRQRSLHGVHIYKFPRSKRHQQRWMHNLRIRYDEKKPWKYMICSVHFEPNCIRLRKLRPWAVPTLELGSNVADQIYTNEQCQEMASDVSEEEETGPEESGQEEDDDDEVDDDGDTGAEAHIKRERRPWGTSGAAGGQMAPWKVKQCCLPYCRRPRGDGIKLFRLPGNPTSIRNWEKATGMTFKASQRNTRLICSRHFEPELMGVRRLMRNAIPTRHLYHQRDSYSPELVIPTNTPTPIGPRCCIPDCPPHDGSSQLHRFPSDPQLLKQWCESLKLTDFQRYSGQYVCSNHLPAQDLACIICGVDDIQLPLLDFPENRNYRAKWCYNLKIETIPKWDNSKHICSKHFESYCFSQQTGELHPEAAPTLHLNHNDTNIFLNEYAIEQHSLMRIKDEPLDNDEMLLA

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00604131;
90% Identity
iTF_00577817;
80% Identity
-