Dgua004321.1
Basic Information
- Insect
- Drosophila guanche
- Gene Symbol
- -
- Assembly
- GCA_900245975.1
- Location
- NW:2792211-2806257[-]
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 27 5.8e-15 5.7e-12 45.8 3.6 1 86 597 669 597 670 0.85 2 27 1.9e-14 1.9e-11 44.1 4.8 1 87 697 766 697 766 0.82 3 27 1.6e-15 1.6e-12 47.6 0.2 1 87 788 860 788 860 0.85 4 27 9.6e-16 9.6e-13 48.3 5.0 1 86 963 1032 963 1033 0.82 5 27 6.5e-15 6.4e-12 45.6 3.2 1 86 1057 1128 1057 1129 0.82 6 27 1e-12 1e-09 38.6 0.5 1 87 1164 1233 1164 1233 0.82 7 27 3e-11 3e-08 33.9 2.0 1 86 1273 1342 1273 1343 0.77 8 27 6.7e-17 6.6e-14 52.0 0.4 1 86 1370 1439 1370 1440 0.82 9 27 5.3e-13 5.3e-10 39.5 0.8 1 86 1461 1530 1461 1531 0.79 10 27 4.1e-14 4.1e-11 43.0 0.8 1 86 1558 1629 1558 1630 0.85 11 27 7.6e-13 7.6e-10 39.0 2.0 1 85 1705 1773 1705 1775 0.82 12 27 5.8e-12 5.7e-09 36.2 0.0 1 86 1798 1866 1798 1867 0.82 13 27 7.4e-14 7.4e-11 42.2 0.5 1 87 2042 2111 2042 2111 0.78 14 27 4.4e-09 4.4e-06 26.9 0.0 1 86 2184 2254 2184 2255 0.76 15 27 0.0047 4.7 7.6 0.0 1 58 2269 2313 2269 2327 0.79 16 27 9e-12 9e-09 35.5 0.0 1 86 2349 2418 2349 2419 0.82 17 27 4.4e-13 4.4e-10 39.7 0.1 1 86 2549 2617 2549 2618 0.83 18 27 2.4e-10 2.4e-07 31.0 0.2 1 84 2653 2722 2653 2724 0.80 19 27 1.2e-11 1.2e-08 35.2 0.2 1 87 2736 2808 2736 2808 0.81 20 27 1.6e-14 1.6e-11 44.3 1.7 1 86 2830 2903 2830 2904 0.82 21 27 0.00042 0.42 11.0 0.0 1 58 2932 2987 2932 3004 0.77 22 27 2e-12 2e-09 37.7 0.3 1 87 3022 3094 3022 3094 0.80 23 27 1e-12 9.9e-10 38.6 0.5 1 85 3227 3296 3227 3298 0.76 24 27 8.2e-12 8.1e-09 35.7 4.0 1 86 3355 3425 3355 3426 0.80 25 27 1.9e-14 1.9e-11 44.1 5.2 1 86 3558 3628 3558 3629 0.84 26 27 4.5e-12 4.5e-09 36.5 0.2 1 86 3726 3795 3726 3796 0.84 27 27 4.4e-10 4.4e-07 30.1 1.3 1 87 3817 3899 3817 3899 0.79
Sequence Information
- Coding Sequence
- ATGTCACAGCATAATCCACATGCCCATCCGCACTACCATCACCACCCACTGCACCAGCATCAGAcacaacaccaccaccaccatcatcagctgcagttgcagcagcagcaacaacaacaacagcaacaacaacaacaacaacaacaacagcagcattcGCAAATGCCACACAGTACACACAATTGGTACTCACATGTTGCTTCCTacccgccaccgccaccccaCCACCATGCGACGGCTACCTCGGCATTTGCCGCCACCTCCACACCTTGtaagggcagtggcagtggcagtgccaacaacaacaacatgagTGCCTATGGAGCAGGCAGCACGCATGGatattatgctgctgctgctgccgccgccgctggcgGTGGGCTCAATGTTAATGCTGTGGGTGGCGTTGGGGGTGGGTCAGCAGCCTATAACCTTGAGGCCAACACAGTGGCTTATGCCCACAACCAGCTGCTGCAGTACCAACAgcaccatcagcatcagcaacagcagcaacaccatcatcaacagcagcagcatctcagTCAGCGTTCCTATATGGGGCATGATATAATGTCTGGGACATATCCGTACATCAAAAGCGAACCAATGGATGTATCAGCGGCGTATCAGCAGCCACCCAATCCGATGGCACCACCCCCAGCGCCAGATATGATAATAAAATCGGAACCCATGGATGAACATCCGTACAAGTCCAACTACATAGACGACAATACGCCCTTTGctgattttaataaattcaacGAATTCAGCGGCGATATGCTCAGCCCCAAAGTTGAGCTAACCATCAAAGATGAGACCTATGGCAAGACctccagtggcagcagctttgCACGCCGCAAAgttctgcagcagccacaacagcaacagccaacgACAGATCGTTCGGCGGAGAGTCTGCCCATTTGTCAGCGCTGCAAGGAAGTGTTCTTCAAGAAGCAAGCCTACCTTAGGCATGTGGCCGAGAGCAGTTGTGGCATACAGGAGTACGATTTCAAGTGCAACATTTGCCCCATGTCCTTCATGAGCAGCGAagagctgcagcggcacaaGCATTTGCATCGCGCGGACAAGTTCTTCTGCCACAAATACTGCGGCAAGCATTTCGATACGATAGCCGAGTGCGAAACGCATGAGTACATGCAGCACGAGTATGAGAGTTTTGTGTGTAATATGTGTTCGGGCACGTTTGCCACACGCGAGCAGCTGTACGCCCACTTGCCGCAGCACAAGTTTCAGCAGCGCTACGACTGTCCCATCTGTCGTTTGTGGTATCAAACGGCCGTCGAATTGCACGAGCATCGGCTGGCGGCTCCATACTTTTGCGGCAAGTACTACAccaatcaacagcaacagcagcagcagcagggcaactATAAGCTGCAGGACTGTCATATGGCCACCATGGAAATGCCCACAGCACCACTGCTTAAAGTACCAACGCCTGCGAATGCCTCGGCCCTGCCAGCCACAGCGGCGTTAAGctctctgctgcagcagcgacaagcGAATGCCGATGGAGCAGCGGCCatgtttgcttctgctgcctcctccaccGCTGCCGTCGCCTCCACCGCCGTGTCGCTGAAGAGGGAGGTAAGCGTCAAGCTGGAGCGCAGCTACAGCAACTCCACCAGCGAGTCCTCGTACAGCCATCAGGAcaacagcagctacaacaatgCCTATGGCAGTGATAGCTCCATTCATGGCGGCGCTCTAGCCGGACCCCAAGCGCACTCCTCAACGCTGGACGACTCCGAGGATGCTTTGTGCTGTGTGCCGCTGTGCGGCGTGCGCAAGAGTACCAGTCCCACGCTGCAGTTCTTTACGTTCCCCAAGGACGAAAAGTATCTGAACCAGTGGCTGCACAACCTCAAGATGTTCCACATACCAGCGGCCAGCTATGCGACATTTCGCATCTGCAGCATGCACTTCCCAAAGCGCTGCATTAATCGCTATTCGCTGTGCTACTGGGCGGTGCCCACGTTCAATCTGGGCCACGACGATGTCGCCAATTTGTACCAGAATCGCGAGCTTACCAACACATTTACCACCGGCGAGGTGGCGCGCTGCAGCATGCCACACTGCACTAGCCAGCGGGGCGAGAGCAATCTGAAGTTTTACAACTTCCCCAAGGACATCAAGAGCCTGATAAAGTGGTGCCAGAATGCCCGCCTGCCGGTGCAAGCCAAGGAGCCGAGACACTTTTGCAGCCGCCACTTTGAGGATCGATGCATTGGCAAGTTCCGACTGAAGCCCTGGGCCGTACCCACTTTGCATCTGGGTGCGCAGTACGGCAAGATCCATGACAATCCCAAGAACCTGTATGTGGAGGAGAAGCGCTGCTGCCTGAACTTTTGCCGTCGGAGCCGCTCCTCGGACTTTAACATGTCGCTGTATCGTTTTCCACGCGATGAGGTGTTGCTGCGACGCTGGTGCTACAACTTGAGACTGGATCCGGGCGTCTATCGTGGCAAGAATCATAAAATATGCAGTGCACATTTCATCAAGGAAGCGCTGGGCTTGAGGAAACTGTCGCCAGGTGCCGTGCCCACGCTGCACTTGGGACACAATGACACGTTTAATATCTACGAGAATGAGTTGTGGCCACCGCCATCGCCCACTGGACAGCATGGGCAGCAGCTTATGCCGCAGCAATCGACGTCGCAGCATCATCATtcctcgctgcagcagcagcagcatccggcAATGCTCAGCAAATCCTATCAGCGCCATTCGGCAGCCTCCACATCATCCTCCGCCAGTTCGGCCTCGCATTATGTGGATCCCGAGATGAGTGGCTCGTACTTGAGCCTCTCTGCGGGTAGCTCCTCGGGCGGCATGAATGCCAGCGACTGCATGGACGTGTGCTGCGTGCCCAGCTGCGAGAGCAAGCGGCACAACAGCGAGAACATCACATTCCACACGATACCGCGGAGGCCCGAGCAGATGCGCAAGTGGTGCCACAATCTAAAGATACCCGAGGACAAGATGCACAAGGGCATGCGGATATGCAGCCTGCACTTTGAGCCATACTGCATTGGCGGCTGCATGCGTCCGTTCGCCGTGCCCACACTGCAGCTGGGgcacgaggacgaggacatACACCGCAATCCGGATGTGATCAAGAAGCTGAACATACGCGAGACCTGCTGCGTGGCCGTGTGCAAGCGAAATCGGGACAGAGACCACGCCAACCTCCATCGCTTCCCCAGCAATGTGGCGCTGCTGACCAAGTGGTGTGCGAACCTCCAACGCACAGTGCCCGATGGCAGCAAACTCTTCAACGATGCCATCTGTGAGGTGCACTTTGAGGATCGCTGTCTGCGCAACAAGCGGCTGGAGAAGTGGGCTGTGCCCACGCTAATACTCGGCCACGAGGACATTGCCTATCAGCTGCCCACGCCCGAGCAGGTGGCCGAATTCTATGCACGGCCTACCGCTCCCAACAATGGCGAGGAGCAGGGCGAGTGCTGTGTGGAGACATGCAAGCGGAATCCCAGTGTGGATGACATCAAGCTGTATCGCCCGCCAGAGGACACCTCGGTGCTGGCCAAATGGGCGCACAATCTACAAACGGAGGCCGCGGTGCTCACCAATGGATTGCGGATATGCAATCTGCACTTTGAGGCTCACTGCATTGGAAAGCGCATGCGTCCGTGGGCCATACCCACGCTCAATCTGGCCGGCAACATTGAGAATCTGTACGAGAACCCCGAGCACTCGATGCTGTACAAGCGGCGGACGCATCTCAAGCAGAAGTCCCTGCCCGTGGCGAAGCCCACGTGGGTGCCGCGTTGCTGTCTGCCGCATTGCCGCAAGGTGCGTGCCCTGCACAATGTCCAGCTGTATCGCTTCCCCAAACTGAATCGCTCCACGCTGGCCAAGTGGGCGCACAATCTACAGGTGCCGCAGGTGGGCAGTGCCCAGCGGCGTGTGTGCTCCGCACACTTTGAGCCGCATGTTCTCAGCAAAAAGTGCCCGGTGCCGCTGGCGGTGCCCACACTGGACCTGAACTCGCCACCCGGCCACAAGATCTACCAGAATCCGGCCAAGCTGAAGGCCAACAAGCTGTGCCTGCAGCGCGTGTGCATCGTGGAGAGCTGCCGCAAGACCAGGGCCCAGGGCGTGCAGCTCTTCCGGCTGCCGCACAGCCCCACGCAGCTGCGGAAGTGGATGCACAACATCAGGACACGCCCGAGGGCGGCCATGAGGAGTCAGTATCGCGTGTGCTCGCGCCACTTTGAGACGCACTCCTTCAACGGGCGGAGACTGAGCGCCGGTGCCATACCCACCCTGGAGCTGGGCCACGACGACGAGGATATCTTCCCGAACGAAGCGCAGGCCTTTGCCGACGAGCACTGCGCCGTCGAGGGCTGTGAATCGtccaaggagcagctggaggtgcGTCTCTTCCGCTTCCCCacggacgacgacgacatgcTGTGGAAGTGGTGCAACAATCTCAAGATGAATCCCGTCGATTGCATCGGTGTGCGGATCTGCAACAAGCATTTCGAAGCGGATTGCATCGGACCCAAGCATCTGTACAAGTGGGCCATACCCACGATGCTGCTCGGCCACGACGACTCACAGATCGAGCTGATACTAAATCCCAAGCCGGAGGAGCGCTACGTGGATCCGGTGTTCAAATGCATTGTGCCCACGTGCGGCAAGACGCGTCGCTTCGACGAGGTCCAAATGAATAGCTTCCCGAAGGATGCGGAACACTTTCAGCGCTGGCGGCACAACCTCCGCCTGGAGCATCTCAGCTTCAGTGAGCGCGAAAAGTACAAGATTTGCAATGCCCACTTCGAGGACATGTGCATTGGCAAGACGCGCCTGAACATTGGTTCGATACCCACGCTGGAGCTGGGCCACGACGAAACGGAGGATCTGTTCAAGGTGAATCCCGCAGAGTTGCAGAGCAACCTGTTTGGGCGTCCGCGACGCCTGTTGCGTGGATTCAACAATGTGACCATCAAACAGGAGCTGCCAGACACGGAGACGGAGCCAGACTTCCGGGACGTAAAGCCCGATGTGAGCACGGATTTTACACAGGTTAAAATCAAGAAATCTCTGGGCGATGTCAAGTGCTGTGTGCGCACCTGTGGCCGCAGTCGTCTGGAGCATGGAGCGCgtctctttccctttcccacCGGCAAGCAGCAACACCTAAAGTGGCGCCACAATTTGCGCCTCGAGCCGGACGAAGTGGACAAGTCTACTCGCGTTTGCAGCGCACACTTCAACAGGCGCTGCATCGATGGCAAGCAGCTCAGGGGCTGGGCCATGCCCACGCAGCAGTTGGGGCACCAGGAGCAGCCAATCTACGAGAACCCCAAGAACATTCCTGGCTTCTTTACGCCCACCTGTGCGCTGGGGCACTGCCGCAAGCggcggagcattgacaatgatTTGCGCACGTATCGGTATCCGCGCGCCGAGGATCTACTCGAGAAGTGGCGTGCAAATCTCGGCCTATCGCTGGATCAGTGCCGTGGGAGGATCTGTGCTGATCACTTCGAGCCGCAGGTGCGGGGCAAACTGAAGCTCAAGACAGGCGCTGTGCCCACGCTAAAGCTGGGTCATGAGGAGCCGCTGGTCTACGACAATGAGGCTATCAAGGCGGGCgtggccgaggaggaggcagccTCGCCGCTGGTCACACCCAAAACGGAAGTGCTGGACGACGAGGAGCgcgaggaggacgaggaggaggaggagaatccCGAAGAGGAGCCGCAGGAGACGCACGAAGAGGAGAAGGACGAGCATGAGGATGACACGCCCGagggagcagagcagctgAGAGATGCTGATGACGCGGAAGAAGACGATGAGGATCCAGGCAACTATTTTGATCCGTTGGAGCTGGTGGAGACGTACGCAGAGCATCTCagcgacgatgacgacaacCACAGCCATGAGGCACGCGAAGATGATGCCCGAGGggtggatgaggaggagccAGAAACTCTTCTGCCTGACACACCACATAAACCGGATCCAGTGGCAGGCCTTGGCATGCCCAAACCCTGGGACAGAGCTGCCGCCGTTGTGCCTGTGCGCCGTGAGAAGCGTCCGAATAACGTGGATCCCATCTGCTGCCTCAAGCACTGCCGCAAGGAACGCTCAGCCATGTATCTGCTGAGCACGTTTGGCTTCCCCAAggaccagcagctgctgctcaagtgGTGCGCCAATCTGCAAATGGATCCCTCGGGCTGCATTGGTCGCGTGTGCATCGAGCATTTCCAGTCGGAGGTGCTGGGCACTCGCAAACTCAAACAGAATGCGGTGCCCACGCTCAATGTGGGTCACGATGTGCCACTGCGTTACAGCTGCAATGGCCAGGATTTTCCCGATGAAATGCCACAGCATTCGGTTTTTCGGCTTTGGAGCCTAAAACACTGCCGCAAGAGGAAGCTGTCGGAgagtccagctccagctccagcagcgatcaaggaggaggagcagcagcagcagcagcatcagatgccgatgccgatggaGACTAAACCAAAGATTTGCTGCCTACCCAGCTGCGGCAATGTGGAGGGTTACGGACCAGGCGGACAATTCCAGCCCCTGCCCATGGACCAAAGAGTGCTCAAAAAGTGGCAGCACAATCTGCGGCTGCCAGTCATTAATCCTGCAGCGGATATGCATCTGTGCATGGAGCACTTTGAGGCGTGTCAGATCGAGAATGGAACAGCAATGCGCATGGCAGTGCCGACCCTCAAGCTGGGCAACACCAGTCCGAATGTGTTCCGGAACAGGAAGTGCCTGTGGCCCACGTGTCCGCCCAATCGCAAGCTGTGCTACGATCTGCCGGAGCACGAGTCGGTACGTGTGGCCTGGCTGTCGTTTATGCGACTGCCGCTGGACAGCCAAGGTCGACTGTGTGGCGTGCACTTTCTGCAGCTGTACGAGGAGCTGGATCTCCCAGGCGATGTGCCCGAAGAGGTGCTCGAGCGGCTGCAGGTGACCTACGATCAAGCCTCCATCTCGCTGAAGTTTCAGTGCTCCGTTGTGGGCTGTGGCTCCAAGTACAAGCAGGACACGCACCTGGCGAAGCTGCCAGGCGACCCGCAGCTGCTCGCCAGGTGGCTGCACAACACGAGGATATCGTATGATCGCTCGCTGCATTTTAGTTATCGCATTTGTCTGCTGCACTTTGAGGAATATTGCCTGAACGGCGTGCGTCCACAGCAATGGGCCATACCCACACTGCAGCTAAATCACGATGGAGAGATCCACCAGAATACCATGAAGCAGGATGCGCTCGGGACTTCCCCCGTGAAGCGGGAGTTGGCCGAGACTTCACCCGTCAAGCAGGAAGCGCTCGAGAGTCCCCCCGTGAAGCAGGAGTTGCTCGAGAGTCCCCCCATGAAGCAGGAGATGCCCGAGACTTCCCCCGTCAAGCAGGATGCGCTCGAGAGTCCACCCGTGAAGCAGGAGTCGCAGGAGAAAGCTCCCTTAAGCAGCAGCTTGAGTCTCTCCATTCCGCTGCACATCAAGACGGAGCAGCAGGGTCCTGTGCTGCGCTCTCGAGGCACATCTTCGCTGAGCAGTCCCTGcctcagtggcagcagtgccAGTCCTCGCCTCAAGAACAGAATTTGCTGCATTGCCAACTGCGGCGAGGGTGCCAGATCGCAGCGGCTCCATCCCTTTCCAAGCGACGAACCAACGCTGCTCAAGTGGCTGGTAAACACGCAGCAAAAGCCCGGACTGGTGGACATCCAGAGTCTGTTTGTGTGCCAGCTGCACTTCGAGGCGGATGCCATCAACAAGACGCAGCTAAGGAGCTGGGCCTTGCCCACACTGCGGCTGGGCCACGACGGGCATGTCATACCGAATGCCCGGCACAATGGGAACATTGCCAACAGCCAGGAGACGGAGCAGGCCATGGAGTTTATTCGGGCCAACTACTGCTCCGTGCTGAGCTGCTTCCAGCCAAAGACTGAGGGTGTGCGCTTCCACAAGTATCCCAGCGACATTGCCATGGTGCGCAGGTGGTCCACCAATCTCAAGCATCGCTCAATGCAGGCCAGCAGTCATGGTTTTCTCGTCTGCCAGTCGCACTTCCAGCCCGAATGCTACGATCCGCAGACGGGCGACCTACTCGAGGAGGCCGTACCGCTCGCGGTGGGTGTGAAAACCGAAGGCCAGCTGCTCCATTGCCTCGTCAGGGGATGCCCTGTGACGAGTAACTCCAGCAGGGGATTGCTTCTCTTCAGGGTGCCAAAGAGGACGCGTGTGAGGGAAGTGTGGGGCCACAATCTATGGATGTATCCGATGCAGCTGGTGGGCGAGCATTACATCTGCGAGCGACACTTTGAGCCGCACTGCGTGAatgagcagaagcagctgcacgCGGGCTCGGTGCCAACGCTCCACCTGGGCCACAACGAGCAGCCGGAGCTGCTGCCCAATCCAGCGTCCTTCCAAGAGGAGTGCGAATGCTGTGTGCCCGGCTGTTTGCGCAGCAATCGCAAGGACCCCGATCTGCAGTTTAGCAAATTTCCCAAGTGGCGCGTGCTGTCCGACAAGTGGCTGCACAACTTCCGCTTGGAAAAGCCCCGCTTGGATCAGCGCATTAAAGCGCTGCGAGTGTGCCACACGCACTTCGAGGAGAGCTGCTTCGATGATCAGGGCGTGCGGCGGGGTGCCATGCCCACGCTGGAGCTGGGACACTCGCATTCGGACATTTATCACACCGACATGGGTTCAGCTATGTTGAAGCCGTTGCACAAGAGTTTCAGCGACTGCTGCTATCCCGACTGCTATGAGGACTGCCAAAAGGCCAACATCCATCGCATGGCCTACGACTTGCCCAGCGATGGGCCGGTGCGTGAGGCCTGGCTGCAGCACATGGGCATACCCAGCAGCGGCGACGATTGCCCCTCAGGGCTCAGGCTGTGTCCGGTTCACTACATCATGCTGTATGAGCGTAGCTTCCCCGATCGAGCCGACAACCTCGTGCTGGACCGCAGCTATGAGCATGCTCGTCAGTTGCCACAGCTGCGGCGCTTCATGTGCGTCGTGCAGGGATGCCGCCACCTGCACCAGCGGGATGGCGGAACGATGCATGGAATGCCCAGAAGTGAGCAGATCCTGCGCATGTGGGTGGAGAACGCACAGCTGCGGCTGAACGAGCATGAGATTTACATGACGAAGATGTGCAGCAAACACTTTGAGGCCAGCTGCCTGTACGAGGGCAGGAGATGTTACCCGTGGAGCGTGCCAACGCTGCATCTGCCAGCGCTGCTGCCGGGACAGGTGCTGCACCAGAATCCCACCCCTGAGGAGTGGCAGCAAATGAAGCAGCAAATGAAACTGGAGGAGCAGGCACAGAAAGCCGAGCAGCAGATAGCAGAGGGACTGCCAATGGAGCCGTACGTGAAGATGGAACCCCACGACAATGACGACGAAGAAGACGCTTCGCAAACGGAATCGGAGCTGCTGATAAATGAAAGCACCATGGACTCGCAGGAGACGGACTTTCAGCCGAATGAGCCGCTGGAGGTGCTGCTCGAGGTGGGCCATGTCGAGCGGCTGGAGAGCTACGAGAAGCGGGATTACTCGACGGATATCCGCCACAGCCGCTATGGGCCTGGCAAGCGTTTTCGCCATCCGTACAGCGCCCACAAGTGCTGTGTCGAGGGCTGTGGCGTGTCGGTGGAGGATTTGGGTTGGAATCTGAAGCTGCACAAGCTGCCCAGCTCCACGGAGGTGGCCAGAAAGTGGCTGTACAACATACAGGTGGACACAGAGGACACGTGGCGCATACGCGTCTGCAGCCATCACTTTGACCGTCAGTGTCTCAATGGCTCGCGACTCAGGCGAGGATCGATGCCTACGCAGCATCTGGGGCCACGTGTGCCCGAAACTATCCACCAAAATGAGTTcttacagctgcagctggacgaTGCGCCAGCGCAGAATGACCAACACGTGGAGCCGGTGCGAACAATTGGCAAGGTGGTGAATCCGTGCGTGCCGCGACCCTCACCGCCGCGCAAGTCCAGCAAATTTTGCCAGATCGAGGGATGTTCGAATCATCTTACCAGCGAGAACATGACACTCCACAAGTTCCCGCACTCGTCGTGGATCTGCGTCAAGTGGCAGCACAACACACAGGTTCCCTTCGATCCAGAGTATCGCTGGCGCTATCGCATCTGCAGCGCCCACTTCCATCCCGTGTGCATGGTGAacatgcggctgctgcacgGCAGCGTGCCCACCCTCAAGCTGGGCCCCCGCGCACCCGCCGAGCTCTTTGACAGCGACTTTGAGGCCATCAACATCAAGATGGATAAGTTGGAGAAGTTGGAGAGAAAATCAGAGGCGCGTAGAGCTACCACGGGAGATGGCGTTGGTTATCCCGGCATGCTGGACATGGGGGAGAAGAAGGTTAAGactgagctggagctggacgaTGGAGGAGCACTCgacgaggaggaagaggagcaggaggatgcCATGCTCTACCTGGAGCCCGAGATGCATATATACGAGGATCAagaggaacaacagcaaaagcggGCACCAAAGGCAAATCCAAATGGTGGCTGGAAAACGGAGCTCCGTTTGCCTTCCAAGAGCCGCGTGACCCTCAATCCGGTGAGATCTGGCTACGACAAGTGCTCGCTGATGCATTGCCAGCGCCAGAGATCAAAGCACGGCGTCCACATCTACAAGTTCCCACGCTcggaggagcaccagcagcgctGGATGCACAATCTGCGCATCCGCTACGACGAGAAGCGCCCGTGGAAGTTTATGGTGTGCAGCGTGCACTTTGAGCCGCACTGCATACGGCTAAGGAAGCTGCGACCCTGGGCAGTGCCCACTCTGGAGCTGGGCGACAATGTACCCGAGGATATCTACACGAATGAGCAGTGCCAAATGTTTGCCAGTGGACAGGCGGCAGGAGAGTCAAATTGCAACTATAGCGATGCAGAGgcgcaggcagaggcagagcctgAGGCGGATAGCGATGGGAATGATGACGAGGATGGCCTGCAAGAGGACGAGGATGAAGAGACAGATGAGCAGGAAGCGAGCACCGCCAAGAAGCGTCGTCGTTCGCGTCTGGCGGATGCCTCCTGGCCACTGGGCCAAGTGCCACCGTGGAAAGTGAAGCAATGCTGTCTGCCCTACTGCCGCAGTCCACGCGGCGATGGCATCAAGCTGTTCCGACTGCCCAACAAAGTCAACTCCATTCGCAATTGGGAGCTGGCCACCGGCATGAAGTTCAAGGAGTCGCAGCGCAACACTCGACTCATCTGCAGCCGCCACTTTGAACCAGAACTGATTGGAGTGCGGCGTCTCATGCGCAATGCCATTCCCACGCGGCACCTGGAACCCACGGGCGATGTGAAGCCAGTGCTGCCACCACCGACAACAGCTGGCCCTAGATGCTGCATGGCCGACTGTGCCTACGATGTGGCGGATGTGAAGCTGCACAAATTTCCCAGCAATCCCAAGTTGCTGAGGGAGTGGTGCCAGGCATTGAGGGTTACCGACCTGCAAAGGTATCGCGGCAAGCACATTTGCTCCGCCCATCTGCCTGTCCACAAGGCCGTGCACTGCATTGTTTGTGGCGCGGATAGGGCGCCGCAGTTGCCGATGCTTAGCTTCAAGCCCGAGGCGGCGCCCACACTGCATTTGAATCATAATGATACAAACATATTCCTTAACGATTGTGCCATAAATCCCGCCTACACGGGAGTACGGGTAAAGGATGAGCCCATGGACAATCAGGTCCTGTCGTTGTAG
- Protein Sequence
- MSQHNPHAHPHYHHHPLHQHQTQHHHHHHQLQLQQQQQQQQQQQQQQQQQQHSQMPHSTHNWYSHVASYPPPPPHHHATATSAFAATSTPCKGSGSGSANNNNMSAYGAGSTHGYYAAAAAAAAGGGLNVNAVGGVGGGSAAYNLEANTVAYAHNQLLQYQQHHQHQQQQQHHHQQQQHLSQRSYMGHDIMSGTYPYIKSEPMDVSAAYQQPPNPMAPPPAPDMIIKSEPMDEHPYKSNYIDDNTPFADFNKFNEFSGDMLSPKVELTIKDETYGKTSSGSSFARRKVLQQPQQQQPTTDRSAESLPICQRCKEVFFKKQAYLRHVAESSCGIQEYDFKCNICPMSFMSSEELQRHKHLHRADKFFCHKYCGKHFDTIAECETHEYMQHEYESFVCNMCSGTFATREQLYAHLPQHKFQQRYDCPICRLWYQTAVELHEHRLAAPYFCGKYYTNQQQQQQQQGNYKLQDCHMATMEMPTAPLLKVPTPANASALPATAALSSLLQQRQANADGAAAMFASAASSTAAVASTAVSLKREVSVKLERSYSNSTSESSYSHQDNSSYNNAYGSDSSIHGGALAGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHIPAASYATFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEDRCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPSPTGQHGQQLMPQQSTSQHHHSSLQQQQHPAMLSKSYQRHSAASTSSSASSASHYVDPEMSGSYLSLSAGSSSGGMNASDCMDVCCVPSCESKRHNSENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLQLGHEDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRTVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHEDIAYQLPTPEQVAEFYARPTAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEDTSVLAKWAHNLQTEAAVLTNGLRICNLHFEAHCIGKRMRPWAIPTLNLAGNIENLYENPEHSMLYKRRTHLKQKSLPVAKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPQVGSAQRRVCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGHKIYQNPAKLKANKLCLQRVCIVESCRKTRAQGVQLFRLPHSPTQLRKWMHNIRTRPRAAMRSQYRVCSRHFETHSFNGRRLSAGAIPTLELGHDDEDIFPNEAQAFADEHCAVEGCESSKEQLEVRLFRFPTDDDDMLWKWCNNLKMNPVDCIGVRICNKHFEADCIGPKHLYKWAIPTMLLGHDDSQIELILNPKPEERYVDPVFKCIVPTCGKTRRFDEVQMNSFPKDAEHFQRWRHNLRLEHLSFSEREKYKICNAHFEDMCIGKTRLNIGSIPTLELGHDETEDLFKVNPAELQSNLFGRPRRLLRGFNNVTIKQELPDTETEPDFRDVKPDVSTDFTQVKIKKSLGDVKCCVRTCGRSRLEHGARLFPFPTGKQQHLKWRHNLRLEPDEVDKSTRVCSAHFNRRCIDGKQLRGWAMPTQQLGHQEQPIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRYPRAEDLLEKWRANLGLSLDQCRGRICADHFEPQVRGKLKLKTGAVPTLKLGHEEPLVYDNEAIKAGVAEEEAASPLVTPKTEVLDDEEREEDEEEEENPEEEPQETHEEEKDEHEDDTPEGAEQLRDADDAEEDDEDPGNYFDPLELVETYAEHLSDDDDNHSHEAREDDARGVDEEEPETLLPDTPHKPDPVAGLGMPKPWDRAAAVVPVRREKRPNNVDPICCLKHCRKERSAMYLLSTFGFPKDQQLLLKWCANLQMDPSGCIGRVCIEHFQSEVLGTRKLKQNAVPTLNVGHDVPLRYSCNGQDFPDEMPQHSVFRLWSLKHCRKRKLSESPAPAPAAIKEEEQQQQQHQMPMPMETKPKICCLPSCGNVEGYGPGGQFQPLPMDQRVLKKWQHNLRLPVINPAADMHLCMEHFEACQIENGTAMRMAVPTLKLGNTSPNVFRNRKCLWPTCPPNRKLCYDLPEHESVRVAWLSFMRLPLDSQGRLCGVHFLQLYEELDLPGDVPEEVLERLQVTYDQASISLKFQCSVVGCGSKYKQDTHLAKLPGDPQLLARWLHNTRISYDRSLHFSYRICLLHFEEYCLNGVRPQQWAIPTLQLNHDGEIHQNTMKQDALGTSPVKRELAETSPVKQEALESPPVKQELLESPPMKQEMPETSPVKQDALESPPVKQESQEKAPLSSSLSLSIPLHIKTEQQGPVLRSRGTSSLSSPCLSGSSASPRLKNRICCIANCGEGARSQRLHPFPSDEPTLLKWLVNTQQKPGLVDIQSLFVCQLHFEADAINKTQLRSWALPTLRLGHDGHVIPNARHNGNIANSQETEQAMEFIRANYCSVLSCFQPKTEGVRFHKYPSDIAMVRRWSTNLKHRSMQASSHGFLVCQSHFQPECYDPQTGDLLEEAVPLAVGVKTEGQLLHCLVRGCPVTSNSSRGLLLFRVPKRTRVREVWGHNLWMYPMQLVGEHYICERHFEPHCVNEQKQLHAGSVPTLHLGHNEQPELLPNPASFQEECECCVPGCLRSNRKDPDLQFSKFPKWRVLSDKWLHNFRLEKPRLDQRIKALRVCHTHFEESCFDDQGVRRGAMPTLELGHSHSDIYHTDMGSAMLKPLHKSFSDCCYPDCYEDCQKANIHRMAYDLPSDGPVREAWLQHMGIPSSGDDCPSGLRLCPVHYIMLYERSFPDRADNLVLDRSYEHARQLPQLRRFMCVVQGCRHLHQRDGGTMHGMPRSEQILRMWVENAQLRLNEHEIYMTKMCSKHFEASCLYEGRRCYPWSVPTLHLPALLPGQVLHQNPTPEEWQQMKQQMKLEEQAQKAEQQIAEGLPMEPYVKMEPHDNDDEEDASQTESELLINESTMDSQETDFQPNEPLEVLLEVGHVERLESYEKRDYSTDIRHSRYGPGKRFRHPYSAHKCCVEGCGVSVEDLGWNLKLHKLPSSTEVARKWLYNIQVDTEDTWRIRVCSHHFDRQCLNGSRLRRGSMPTQHLGPRVPETIHQNEFLQLQLDDAPAQNDQHVEPVRTIGKVVNPCVPRPSPPRKSSKFCQIEGCSNHLTSENMTLHKFPHSSWICVKWQHNTQVPFDPEYRWRYRICSAHFHPVCMVNMRLLHGSVPTLKLGPRAPAELFDSDFEAINIKMDKLEKLERKSEARRATTGDGVGYPGMLDMGEKKVKTELELDDGGALDEEEEEQEDAMLYLEPEMHIYEDQEEQQQKRAPKANPNGGWKTELRLPSKSRVTLNPVRSGYDKCSLMHCQRQRSKHGVHIYKFPRSEEHQQRWMHNLRIRYDEKRPWKFMVCSVHFEPHCIRLRKLRPWAVPTLELGDNVPEDIYTNEQCQMFASGQAAGESNCNYSDAEAQAEAEPEADSDGNDDEDGLQEDEDEETDEQEASTAKKRRRSRLADASWPLGQVPPWKVKQCCLPYCRSPRGDGIKLFRLPNKVNSIRNWELATGMKFKESQRNTRLICSRHFEPELIGVRRLMRNAIPTRHLEPTGDVKPVLPPPTTAGPRCCMADCAYDVADVKLHKFPSNPKLLREWCQALRVTDLQRYRGKHICSAHLPVHKAVHCIVCGADRAPQLPMLSFKPEAAPTLHLNHNDTNIFLNDCAINPAYTGVRVKDEPMDNQVLSL
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00536072;
- 90% Identity
- iTF_00601081;
- 80% Identity
- -