Dpeg000535.1
Basic Information
- Insect
- Drosophila pegasa
- Gene Symbol
- -
- Assembly
- GCA_035046225.1
- Location
- JAWNOD010000072.1:1728993-1743338[-]
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 29 4.1 4.7e+03 -1.7 2.2 49 62 345 365 320 380 0.61 2 29 1.4e-15 1.6e-12 47.9 4.3 1 86 566 638 566 639 0.87 3 29 4.4e-15 5.1e-12 46.3 4.6 1 87 666 735 666 735 0.83 4 29 1.3e-15 1.6e-12 47.9 0.4 1 87 757 829 757 829 0.84 5 29 9.7e-16 1.1e-12 48.4 5.1 1 86 935 1004 935 1005 0.82 6 29 1.3e-14 1.6e-11 44.7 2.7 1 86 1029 1100 1029 1101 0.82 7 29 2.7e-12 3.2e-09 37.3 0.7 1 87 1136 1204 1136 1204 0.80 8 29 1.2e-10 1.4e-07 32.0 1.3 1 86 1249 1318 1249 1319 0.76 9 29 2.3e-16 2.7e-13 50.4 0.1 1 86 1346 1415 1346 1416 0.83 10 29 1.4e-13 1.6e-10 41.5 0.6 1 86 1437 1506 1437 1507 0.81 11 29 1.7e-13 2e-10 41.2 2.2 1 86 1534 1605 1534 1606 0.86 12 29 9.1e-13 1.1e-09 38.8 2.0 1 85 1659 1727 1659 1728 0.82 13 29 2.5e-12 2.9e-09 37.4 0.1 1 87 1752 1821 1752 1821 0.82 14 29 8.9e-14 1e-10 42.1 0.8 1 87 2001 2070 2001 2070 0.79 15 29 1.9e-12 2.2e-09 37.8 0.1 1 62 2143 2205 2143 2223 0.77 16 29 0.00017 0.2 12.3 0.2 1 58 2225 2277 2225 2291 0.82 17 29 1.9e-12 2.2e-09 37.8 0.7 1 87 2314 2384 2314 2384 0.86 18 29 3.2e-13 3.7e-10 40.3 2.2 1 86 2448 2517 2448 2518 0.82 19 29 6.3e-13 7.3e-10 39.3 1.4 1 86 2553 2624 2553 2625 0.81 20 29 6.3e-12 7.3e-09 36.1 1.8 1 87 2635 2707 2635 2707 0.82 21 29 4.1e-13 4.8e-10 39.9 0.1 1 86 2730 2799 2730 2800 0.78 22 29 1.6e-05 0.019 15.6 0.1 1 58 2831 2882 2831 2904 0.78 23 29 3.1e-14 3.7e-11 43.5 0.3 1 86 2917 2989 2917 2990 0.81 24 29 7.8e-14 9e-11 42.3 0.3 1 86 3127 3199 3127 3200 0.83 25 29 2.3e-14 2.6e-11 44.0 1.9 1 86 3257 3327 3257 3328 0.83 26 29 1.9e-14 2.3e-11 44.2 3.9 1 86 3438 3508 3438 3509 0.85 27 29 9.7e-12 1.1e-08 35.5 0.1 1 86 3593 3662 3593 3663 0.85 28 29 4.7e-07 0.00055 20.5 1.1 1 58 3680 3728 3680 3741 0.84 29 29 1.5e-09 1.8e-06 28.5 1.8 19 87 3746 3803 3734 3803 0.75
Sequence Information
- Coding Sequence
- ATGTCACAACACAATAATCAACCGCATtcgcatcagcatcagcatcaacaacaacacctcaactactaccagcagcagcaacagcaacatcacttacagcaacagcagcagcagcagcctttgcaacataaacaaatacagcagcagcatagtTGGTACTCACATGTTGCTTCCTACCCGTCCCAGCAGCCGCACCCCTATGCGACGCCgtgcaagagcagcaacaataacattcacagcaacaacaacaacaacattatgAATGCATACGGCACGGGAGGGGCGCACTATTacggtgcagctgctgctactgctgcaggGGTGGGATATAACCTTGAAGCTAATACTGTGGCCTATGCGCAcaaccagctgctgcagtaccaacagcaacaacagcagcaacaacaacaacaacagcagcagctcggtcAACGCTCGTATATGCCGCAGAGCATAATGCACAGCTCCTATCCCTATATTAAGAGCGAGCCATTGGAGCTGCCCGATGACAGACAACGCCATCAACAACACCaaacgcaacaacagcaacagcagcagcagctacaacaacaacaacattttcaaaatCCAATGGCACCGCCGCCAGCACCCTCCTCCAGCCGCCACACGCTTGATTCCAGCGGcgaaatgataataaaatcgGAACCCATTGACGAACATGCGTTCAAGTCCAGCTATATTGATGATAATACGCCCTTCGCCGATTTTAGCAAATTTCCGGAATTCGGCGACGACATGCTGAGTCCCAAGGTGGAGCTCACTGTAAAGGATGATGCCTATGGCAGCCAAAAGaaTCCGCTCAGCTTTCCACGCCGTAAGCTGCAAACGGAACGGTCCTCGGAGAGTCTGCCTATTTGCCAGCGCTGCAAGGAGGTGTTCTTTAAGAAACAAATCTATCTGCGCCATGTGGCTGAGAGCAGCTGTACGATGCATGAGTACGACTACAAGTGCAGCATCTGCCCCATGTCCTTCATGGGCGCCGAAGAGCTGCAGAAGCACAAGCAACTGCATCGCGCGGATAAGTTCTTCTGCCACAAGTACTGTGGCAAGCACTTTGACAACATTGCCGACTGCGAGTCGCACGAGTACATGCAGCATGACTACGATAGCTTTGTGTGCAACATGTGCTCAGTGGCATTTGCTACGCGTGAACAGCTTTATGCTCATTTGCCGCAGCACAAGTTTCAGCAGCGCTACGATTGCCCCATCTGCCGTCTGTGGTATCAGACTGCGCTAGAGTTGCACGAGCATCGTCTAGCGGCGCCTTATTTTTGCGGCAAGTACTATGCAGCAacacgccagcaacagcagcaccagcagctatCGGAGCAGAGCAGCTACAAGCTACAGGACTGCCACATGGGCACAATAGAAATGACTGCGCCGCAGCATAAAGCCTTGCCTGCCACGGCGGCTCTTAGTTCCTTACTGCATCAGCGCCAAACGAATGCTGACGGTGGCgcctctctcttttcctcCACCTTAAAGAACGAACCGAGCGTTAAGCTGGAGCGAAGCTACAGCAACTCGACGAGCGAGTCTGGATATAGTATGCACGAGAGCAGCGGCTACAACAATGCATATGGGAGTGACAACTCGCTGCATGGCAGCGGTGTTGTACTTGGCGGACCACAGGCACATTCTTCCACGCTAGATGAATCGGAGGATGCGCTGTGCTGTGTGCCCTTGTGTGGTGTGCGCAAAAGTACCAGCCCCACTTTGCAGTTCTTTACGTTTCCCAAGGACGAAAAGTATTTGCATCAATGGCTGCACAATCTGAAAATGTTTCATATACCTGCGTCTAGCTACACCAGCTATCGCATCTGTAGCATGCATTTTCCCAAGCGTTGCGTAAATCGGTATTCGCTATGCTATTGGGCAGTTCCCACATTCAATCTGGGCCACGATGACGTCGCCAATTTGTATCAGAATCGAGAGTTAACCAACACCTTTACCACCGGCGAGGTGGCCCGCTGCAGTATGCCCAATTGCACGAGCCAGCGTGGTGAAAGCAATCTGAAGTTTTACAACTTTCCGAAGGACATCAAGAGCCTAATCAAATGGTGCCAGAACGCACGTCTGCCCGTCCAGGCCAAAGAGCCGCGTCACTTTTGCAGTCGCCATTTCGAAGAGCGTTGCATTGGCAAATTTCGACTGAAACCCTGGGCTGTGCCTACGCTCCATTTGGGTGCGCAGTATGGCAAGATTCACGACAATCCTAAGAACCTGTACGTGGAGGAGAAGCGTTGCTGCCTAAATTTTTGTCGTCGCAGTCGATCCACCGACTTCAACATGTCCCTCTACCGTTTTCCCAGAGACGAGGTACTACTGCGTCGCTGGTGCTATAATCTAAGACTTGACCCGGCCGTATACCGTggaaaaaatcacaaaatatgCAGCGCTCACTTTATTAAGGAAGCTCTCGGCTTGCGCAAACTGTCGCCAGGCGCTGTTCCAACTCTGCACCTGGGCCACAATGACACATTCAACATCTACGAAAATGAACTGTGGCCCCCTCCAACGGCTTCGACGCCAACCCTCCAAaatcagcatcaacagcagcaactacagcagcatAATATGCAGGAGCTTcaacagcatcatcagcagcagcaggttcAACACAAATATCAGAGGCACTCGGCTGCATCGACTTCCTCGTCAGCCAGCTCCACATCCCACTACGTAGATCCAGAGCTAAGTGCCTCTTATATGGCGATGGGAATGGGCTCTGGCGCCTCATCCTCCGGGCTGCTGAATGTGAGCGATAGCATGGACGTCTGCTGTGTGCCCAGCTGCGAGAGCAAGCGACACAACAATGAGAACATAACATTCCATACCATACCGAGGCGTCCGGAGCAAATGCGCAAATGGTGTCACAATCTTCGTATACCCGAGGACAAGATGCACAAGGGTATGCGTATATGTAGTCTACACTTCGAGCCGTATTGCATTGGCGGTTGCATGCGTCCGTTTGCGGTGCCCACGCTACAATTGGGCCACGACGAGGAGGACATTCACCGCAATCCGGACGTAATAAAGAAGCTGAACATACGTGAAACCTGTTGCGTGGCTGTTTGCAAACGGAATCGAGATCGTGACCATGCCAATCTGCATCGCTTTCCCAGCAATGTGGCCCTGCTAACCAAGTGGTGTGCAAATTTGCAGCGAACCGTGCCGGATGGGATTAAGCTCTTCAACGATGCTATCTGTGAGGTGCATTTCGAGGATCGCTGCTTGCGCAACAAGCGGCTGGAGAAGTGGGCAGTGCCCACTCTTTTGCTTGGCCATGAAAACATTCCCTATCCGCTGCCCACTCCGGAGCAAGTGGCCGAGTTCTATGCCCGACCAAGTGCTCCAAACAATGGCGAAGAGCAGGGTGAATGCTGTGTGGAGACCTGCAAGCGCAATCCCAGCGTGGACGACATCAAGCTTTATCGACCGCCGGAGGAGTCGCAAGTGCTGAGCAAATGGGCGCATAATCTGCAGATAGACATCGCTCAACTACCTGGAATGCGCATTTGTAATCTGCATTTCGAATCGCATTGCATAGGCAAGCGAATGCGCCCATGGGCCATACCCACACTCAATCTGGCCAGCAATATAGAAAATCTCTTCGAGAATCCCGAGCAGCACATGATATACAAGCGACGCACCTCCCATCTCAGCGTGCAAAGAGAAGGCTCAGATGGAAGCATTAGCAAGCCCACCTGGGTGCCACGCTGCTGTCTGCCACATTGCCGCAAGGTGCGCGCGCTGCACAATGTCCAATTGTACCGCTTTCCCAAACTCAATCGGGCCACGTTGGCTAAGTGGGCGCATAACCTGCAAGTGCCGATGGTTGGAAGTGCTCAGAGGCGACTCTGCTCCGCGCACTTTGAGCCGCATGTGCTTAGCAAGAAGTGCCCAGTGCCGTTGGCGGTGCCCACGCTCGATCTCAATACGCCGCCCGGCTACAAGATTTACCAGAACCCATCCAAGATTAAAGCCAACAAGTTGTGCATGCAACGCGTGTGCATTGTGGAGAGCTGTCGTAGGCAGCGCGGCCAGGGTGTGCAACTCTTCCGCCTGCCCCACAACCCCACGCAGCTGCGCAAGTGGATGCACAACATACGAATGCGACCGCGAGGCGTAATGCGGCAACAGTATCGCATCTGCTCCCTCCACTTTGAGACGCACTCGTTCAATGGCAAGCGGTTGAGCACTGGCGCCATTCCAACGCTGGAGCTGGGCCATCAGAACGACGATATCTATCCGAACGAGGCGCAACCGTTTGTCGAGGAGCACTGTGCAATCGAGGGCTGTGATGCTTCCAAGGAGCAGCCGGATGTGCGTCTTTTCCGCTTTCccaacgacgacgaggatCTGCTGTGGAAGTGGTGTAACAATCTTAAAATGAATCCGATTGATTGTATTGGCATGCGCATCTGCAACAGGCACTTTGAGCCGGATTGCATCGGTGCCAAGCATCTCTACAAGTGGGCCATACCCACGTTGATGCTGGGCCACGACGATGTCCAGATTGAGTTGATATCGAATCCTAAGCCCGAGCAGCGCTACGTAGATCCGGTGTTTAAGTGTTGTGTTCCTACCTGTGGCAAGACTCGCAAATTCGATGAGGCGCAAATGAACAGCTTTCCGAAAGATCCGATACTCTTTCAGTGCTGGCGACATAATCTCCGACTCGATCATCTTAATTTCAAAGAGCGCGAGcgctataaaatttgtaatgctCATTTCGAAGACATTTGCATTGGCAAGACCCGCCTCAACATAGGCTCCATACCCACACTAGAGCTAGGTCACGACGAGACGGAAGATCTGTATCGCGTGAATCCCAACGACATTCAGAGCAATCTGTTTGGACGGCAGCGACGCGTGAGTACCAGGCTAGATCCAGAGGTACACTTTACTAGACAGGTCAAGCTCAAGCGCTCCCAACCGGAGCCCAAGTGCTGTGTCGCCACCTGTGTACGCAGTCGAGTAGATAATGGAGCGCGTTTGTTTCCCTTTCCGTCGGGCAAGCAACAACATACAAAGTGGTGCCACAATCTTCGCCTCAAAGCTGACGAAGTGGATGGGACGACGCGTGTTTGCAGCGCCCATTTCAATCAACGCTGCATAGAAGGCAAAATGTTGAAAGGGTGGGCCATGCCCACACAGCTGCTGGGCCACCAGGAGACAAACATCTATGAGAATCCCAAGAATATTCCGGGCTTCTTCACTCCTACCTGTGCATTGGCCCATTGTCGTAAAAGGCGGAGCATTGACAATGATCTGCGTACCTATCGATATCCGCGCAGCGAGGATATGCTCGAGAAATGGCGTGTGAATCTTCGCCTAGCACCAGATCAATGTCGCGGACGCATTTGTGAGGATCACTTTGAGCCCATGGTACGGGGAAAGTTGAAACTGAAGACGGGTGCAGTGCCTACGCTGAACTTGGGTCACGATGAGGGTGCAGTCTATGACAATGAGGCTATTAAGGGTGGTCTGAAACCCGAAGAGGAGGAGCACGACGGTGGAGAGGCAGTCTGTGACAATGAGGCTGTTAAGGTTGTTCTGACGCCCGAAGAGGAGGAGCTCGACGGTGGAGAGGCAGCAAACAGCTTGGACTCGCTTGTTAAGGTGAAGAAcgagcagcaagaagaagacCAGCATATGGAAAGCGACGAGCTGGCAGAAAAACAGGATGAtgaagaggaggagcagcaggatcaggacgaagaagaggaggagcagcaagAGCAGGGCTATTTTGACCCCTTAGAGCTTGTGGAAACTTTTGCCGAACAGCACAGCGATGATGAACAACTCTATAATGCCGAAGATGATGATTTTGATGATAATGAAGAAGAGGAGGACATACCTGGCAACGatgatgagctgctgctgcccgatACTCTGCCTATGGCTCAGCTGGCTTTTCCCCCAAGACgtgagaaaagagagaaggcCGTGAACAATGTAACGCCCATATGTTGTCTCAAGCATTGTCGTAAGGAGCGCACTGCTACCCACCCGCTGAGCACCTTTGGCTTTCCCAAGGATCAGCAGCAACTACTCAAATGGAGTGCGAATTTGCAACTAGACCTCTCCGATTGTATTGGACGTGTTTGCATTGAACACTTCGAGACGGAGGTGTTGGGCACACGCAAACTGAAACAAAACGCGGTGCCCACCTTGAATCTGGGCCACTCCACACCTCTTAGCTACCGCTGCAATGGGCAAGCTGTGGGCATCGCAGatgcacagccacagcagcattCGGTTTTTCGGCTTTGGAGCCTAAAACACTGCCGCAAAAGGAAATCACCGctgacgacaacgacgacgacgacggcgacgacgacaacgacgacgacgacgacgacgaatccagcagcgacgacaactAATCGTAGCTGCTGCGTGCCCAATTGCGGCAAGCAACTGGAGTTGCATGGCGTTCAGCTGAGACGCATGCCTAAAGAACATTTACTGTTGCGCAAGTGGCTGCACAATCTAAAGCTCCCTGAGGACGTGAACATCAACCGAGCGTATGTGTGCGAGGAGCATTTCGAAGAGCAGGCCATGCTGCCCACCCTGAAACTAGGTCACACGGACAGCAATATTTAtcgtagcagcaacagctcaatGGAAGCCAATAAGTGCCTGGTACCGAGCTGTCCATGTGCACGTCTTCATCATTATCAATGCTACGCATTGCCCGAGCATCCCCAAGTGCAAGAAGCCTGGCTGCAGTGGCTGCAACTACCGCCTCGTTCGCCTAACTTACCCAGCGATTCCCAGTTGTGCGCCATGCACTATATGCAGCTGTTTGAGGAAGTGCCACTGCCGGATCAGCTTCCCGAGATGGTGCTATGCGAGCTGAGGCTGACGTATGATCATATCGCCAACTCCACCATAGCGTTGCGCTGCGCTGTGCCCGGCTGCCGCATTAAATACACGGATAATGTGCATCTCACTAGGCTACCCCTTTGCGAGCGCACCCTCGCCCAATGGTTGCATAATACAAAGATCGACTACGATCCAGATCGCAGAAGGATGTATCGCATCTGCCTGCGGCATTTCGAGCCCCATTGCCTGGGTGCAGTGCGTCCAAGGCAATGGGCGGTGCCCACGTTACACCTCAACCACGAAGATCCAGATATTTATCAGAATACCAAGCTGGAAGACAACGCTCTTGGCGTAGCAACCCAGCCAGGTGTACCTCTGGAAATGCCGCTGCGCATTAAGACGGAGCTGCCGCCGTTACTCTCTCTCAGTCCAGCGGCCAGCCCGAGTGCAAGTATAAGTCCTCGGGTTAAAATACGCACCTGTTGCATTTCCAGCTGTGTCCAGCAGGCTAGCGCTCTCACCCGTCTCTTTCGCTTCCCCAGCGCCGAGCCTGCGCTGCTCAAATGGCTGGTGAatacacagcagcagccgcgcttCGTGGATGCGCAGCATCTGTTTATTTGCCAGGATCATTTCGAGCCGGAGGCCATTTGCAAGAAGCAGTTGCGCAGCTGGGCGGTGCCGACGCTGAAACTGGGTCACAGTGACTATGTCATACCCAACGCTCGGCACAATGGCAATATTGCTGACAGCCAAGAGAACAAGCAGGCGTTGCAGTACATCTGGGCCAACTTCTGCTCCGTTCTTACCTGCTTCCAAAAGCGCAGCGAACAGTTGCGTCTCTTCGCCTATCCCACGGATCGACCCACCATTCGCCGATGGGCTGCCAACTGTAAGCACCGCTCCATGCAGGCCAGCAGTGATGGATTTCAGGTCTGCCAGTCGCATTTCGCGCCAGATTGCTTTGAGCCTGAGACCGGGCAGTTGAAGGAGAATGCTGTGCCCACGCTCGAGCTGAGCCGACCCGTCTATCAAATGCGTTGTTTGGTCAAAGGCTGTGTTAAGGAAGAGGATGCGACGCGCCGTCGTTTATTTAAGATGCCAAAGCGACCATATCAGTTGGAGGATTGGTGTCACAATTTGCGCATCGATGCGGCGTCTATAATTGGCCAGGACCCACATGTGTGCGAACGGCACTTCGAATCGAACTGTTTCAATGCGTTCAAGCTGCTACGTCAGAATGCGCGACCCACACTGCATTTAGGCCACAACGACCAGTTGGATGTGCTGCCCAATCCGGCCAAATGGGAGGAAGATGCTCAGGTATGCTTTGTGCCTAGCTGTGGACGATCGAAGGATGCGGATAATGTGCAGCTGTTCGGGCTACCAAAGCATAAGGTGTTAGAGAAATGGCTGCAAAACTTCCGTCTCGATCTGAGTAAGGAGCAACTGAATGGCCGGCGCATATGTAGCATGCACTTCGAGCCCAGCTGCATAGATAATGAACAACTACACATGGGCTCCATGCCTACACTCCAGCTGGGTCACGACGAGACGGACAATATACATCAAAGCGTCAAGGTACTGCCATGGTCGATGCGGTTCAAACGAATGCGGATCTACTACGACTGCTGCTATCCAGATTGCGCGGAGCTGCAAAAGACATATCAAAGAATTACCTATGAGCtaccacaacagcagcgactgcgTGAAATGTGGCTGTCGTATATGGGACTGGAGCAGTTCAACCTGCAGCAGGCGTTCAAACTCTGCCCGCTGCATTTGGTAATGTTGTATGAGCAGTTCCCTGAGCAATCAGCGAAGGAACAGGTGCTAATGTCTAACTACGAGGCTTCTCGCAACAGCTTGCGCATACGCCTGGTCACCTGTGCTGTGCAAGGCTGCATGACCCTAAAGCCAAGAGACGATTATCGCTTGCATTCACTGCCCACGCGTTGGGATATGCTCGAGCTGTGGCTGGACAATTTACAGATACCTAACTACGATAATCAGCGTAACAGGTACCGCGTGTGTGACAAGCACTTCGAAGCTAATTGCATGATGGAGACGACGCGACGTCTAAAGCCTTGGAGTTTGCCCACGCTACAGCTGCCAAATCCTCCATTGCACGAAAATCCCACGGAGGAGCAGTGGCATCGAATGAATGAGCATTATGCTCCGCTGCATCGGTTAGAATCCAATGTCAAGCAGGAGTACGTCTATGACCAAAAGAAGGAGCAGCTCAGACAGGAATTACGGCTTAagcaggagctgcagttgAGACACCAGCCACAGCCTGTATATGAGAATGACTTTGACGAGCAACAGCCACATGAAATGCAAGAGCTGGAGGTGCTCCTCGAGGTGGGGCACGTAGAGACGTGCACCACGTAtgagcaaatgcaaacaaacccCTCCAGGAACTATGGCGAACAGTTGCACAATCCTCGAGGTGCGGCACATTCATTAGGTAGCGGCACAGTAGTCAGCAATGGAGCTCACTACAATGCTCGACATTGCAGCGTGCGAGGCTGCGATGTGACGGCGCTCGACGTGGACGGCAACCTAAAGCTGCACAAGTTTCCCACATCGAAAGATGCTATGGAGAAGTGGATGCACAATACACAGGTAGATGTGGATATCAACTTCGCTTGGCGTTTTCGCATTTGCAGCCATCACTTTGTCGCTGAGTGTTTCAATGGATCGCGCATTAGACGGGGAGCCATGCCCACGTTGCGTTTGGGTTCGCGACGTCCTGCGCATATTTACGACAATGAATTTAATGTTCAGCAGGAGCAGACAACAGCGACGATCGTGGAGGAGCAGCTTGCGCCGACTGTGGACTCAAAGTTTGATATAAATCTGCGCCTGCCCTGTCCCGCTCCGCCCCGCAAATCCAGTAAATTCTGCCAGATTGATGGCTGCTCGAACCATCTGACCAGCGAGAATGTGACTCTGCATAAGTTCCCCCATTCGGTGGACATGTGCGCCAAGTGGCAACACAACACACAGGTGCCCTTCGATCCGGAGTATCGTTGGCGCTATCGTATTTGCAGCGCACACTTTGAGCCTAGTTGCCTGGGCAACATGCGACTGATGCACGGAAGTGTGCCCACCTTGAAGCTGGGTCCGCGGGCTCCCAAACAGCTCTTCGACAATGACTTTGCAGCTCAAAACTTGCGCTTGGATAAAGAGAAGTGCAGCGAGCAAGACTTGACTGTAAAACAGGAGCCACTACAGCAATTCGAGGAGGTTGATCAggtgcaggagcaggagcaggaggaatTAAATCTGCTAGAGCCAGAGCTGCAGTTGCTTGAAGGTGAGGACGAGGAGGAAGAGCAGGACGATTATTTAAACTGCAGTAACAATTGGAGTGATGTGAGCATTAAGCAGGAGAAGTCTAGCGACAGCAGCTACTATAATCCGGTCAAGTCGGGCTATGATAAGTGCTCCCTAATACATTGTCAGCGCCAGCGTTCGCAGCACGGTGtgcacatttataaatttccacGCTCgcgtcagctgcagcaacactGGATGCATAATTTGCGTATTCGATACGACGAACGGCGTCCATGGAAGACCATGATATGCAGCGTGCACTTTGAGCCGCAGTGCATACGACTGAGGAAGCTGCGTCCCTGGGCAGTGCCCACCTTGGAGCTGGGCGACAATGTGCCTGAGCAACTGTACACGAACGAacagagccagcagcagatgAAGGCGGGCagcgagggagagggcgatTATGAGGATTATGATGTGGACGACACTATGCTGGAGGACTACGATGATGATTTTGAGGGCACTGAGTCTGAGCCGATGCCGGCGGAACCCCATATCAAACGAGAGCGCCGTTCACGCTGCGATCCTTTACCTCCTGGACAGCTGCCGCCGTGGAAAATAAAGCAATGCTGTCTACCCTATTGCCGCAGACCACGGGGCGATGGCATCAAACTCTTTCGCCTGCCGAACAACATTGGCGCCATACGCACGTGGGAGCAAGCGACAGGGATGCGCTTCTGTGAATCGCAGCGCAACACAAAGCTCATTTGTAGTCGTCACTTTGATCCGCAGCTAATTGGTGTGCGTCGCCTGATGTCTAATGCGGTGCCGACAAGGAACCTTGGCCCGAATAGCCAAGAGCCGGAGCTGCCGGCAAGTGGTCCACGCTGCTGTATCACGGATTGCGAACACCATGGGCTAGTCAAGCTGCACAAGTTTCCCAGTGATCCTTTGCTGCTCCATCAATGGTGTCAAGCGCTGGATTTGCCGGAGGTACAATGCTACGCTGGTAAACACATTTGTGCCATCCATCTGCCTGCTAAGGCTTTTAGTTGCCTTATCTGTGGTGTTGAGGACGTGCAGCTGCCTATGCAAGACTTTCCCGAGAATCGCAATCAGCGTGTCAAATGGTGCTATAATCTTAAAATCGAACCCATTGCCAAGTGGGACAACACGAAACACATCTGCTCTAAACATTTTGAGAGTTGCTGCTTTATACAGCCGGGTCGTCTGCTCCCGGAAGCAATGCCGACATTACATTTGAAGCACactgacaaaaatatattcctcAATGAGTATGCTATAAACAAGACTAAGATGCTCTGCATCAAGGATGAGCCCATAGAGAGCGAGGATCTGATGCTATAA
- Protein Sequence
- MSQHNNQPHSHQHQHQQQHLNYYQQQQQQHHLQQQQQQQPLQHKQIQQQHSWYSHVASYPSQQPHPYATPCKSSNNNIHSNNNNNIMNAYGTGGAHYYGAAAATAAGVGYNLEANTVAYAHNQLLQYQQQQQQQQQQQQQQLGQRSYMPQSIMHSSYPYIKSEPLELPDDRQRHQQHQTQQQQQQQQLQQQQHFQNPMAPPPAPSSSRHTLDSSGEMIIKSEPIDEHAFKSSYIDDNTPFADFSKFPEFGDDMLSPKVELTVKDDAYGSQKNPLSFPRRKLQTERSSESLPICQRCKEVFFKKQIYLRHVAESSCTMHEYDYKCSICPMSFMGAEELQKHKQLHRADKFFCHKYCGKHFDNIADCESHEYMQHDYDSFVCNMCSVAFATREQLYAHLPQHKFQQRYDCPICRLWYQTALELHEHRLAAPYFCGKYYAATRQQQQHQQLSEQSSYKLQDCHMGTIEMTAPQHKALPATAALSSLLHQRQTNADGGASLFSSTLKNEPSVKLERSYSNSTSESGYSMHESSGYNNAYGSDNSLHGSGVVLGGPQAHSSTLDESEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYTSYRICSMHFPKRCVNRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSTDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTASTPTLQNQHQQQQLQQHNMQELQQHHQQQQVQHKYQRHSAASTSSSASSTSHYVDPELSASYMAMGMGSGASSSGLLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLRIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLQLGHDEEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRTVPDGIKLFNDAICEVHFEDRCLRNKRLEKWAVPTLLLGHENIPYPLPTPEQVAEFYARPSAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESQVLSKWAHNLQIDIAQLPGMRICNLHFESHCIGKRMRPWAIPTLNLASNIENLFENPEQHMIYKRRTSHLSVQREGSDGSISKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRATLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNTPPGYKIYQNPSKIKANKLCMQRVCIVESCRRQRGQGVQLFRLPHNPTQLRKWMHNIRMRPRGVMRQQYRICSLHFETHSFNGKRLSTGAIPTLELGHQNDDIYPNEAQPFVEEHCAIEGCDASKEQPDVRLFRFPNDDEDLLWKWCNNLKMNPIDCIGMRICNRHFEPDCIGAKHLYKWAIPTLMLGHDDVQIELISNPKPEQRYVDPVFKCCVPTCGKTRKFDEAQMNSFPKDPILFQCWRHNLRLDHLNFKERERYKICNAHFEDICIGKTRLNIGSIPTLELGHDETEDLYRVNPNDIQSNLFGRQRRVSTRLDPEVHFTRQVKLKRSQPEPKCCVATCVRSRVDNGARLFPFPSGKQQHTKWCHNLRLKADEVDGTTRVCSAHFNQRCIEGKMLKGWAMPTQLLGHQETNIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEDMLEKWRVNLRLAPDQCRGRICEDHFEPMVRGKLKLKTGAVPTLNLGHDEGAVYDNEAIKGGLKPEEEEHDGGEAVCDNEAVKVVLTPEEEELDGGEAANSLDSLVKVKNEQQEEDQHMESDELAEKQDDEEEEQQDQDEEEEEQQEQGYFDPLELVETFAEQHSDDEQLYNAEDDDFDDNEEEEDIPGNDDELLLPDTLPMAQLAFPPRREKREKAVNNVTPICCLKHCRKERTATHPLSTFGFPKDQQQLLKWSANLQLDLSDCIGRVCIEHFETEVLGTRKLKQNAVPTLNLGHSTPLSYRCNGQAVGIADAQPQQHSVFRLWSLKHCRKRKSPLTTTTTTTATTTTTTTTTTNPAATTTNRSCCVPNCGKQLELHGVQLRRMPKEHLLLRKWLHNLKLPEDVNINRAYVCEEHFEEQAMLPTLKLGHTDSNIYRSSNSSMEANKCLVPSCPCARLHHYQCYALPEHPQVQEAWLQWLQLPPRSPNLPSDSQLCAMHYMQLFEEVPLPDQLPEMVLCELRLTYDHIANSTIALRCAVPGCRIKYTDNVHLTRLPLCERTLAQWLHNTKIDYDPDRRRMYRICLRHFEPHCLGAVRPRQWAVPTLHLNHEDPDIYQNTKLEDNALGVATQPGVPLEMPLRIKTELPPLLSLSPAASPSASISPRVKIRTCCISSCVQQASALTRLFRFPSAEPALLKWLVNTQQQPRFVDAQHLFICQDHFEPEAICKKQLRSWAVPTLKLGHSDYVIPNARHNGNIADSQENKQALQYIWANFCSVLTCFQKRSEQLRLFAYPTDRPTIRRWAANCKHRSMQASSDGFQVCQSHFAPDCFEPETGQLKENAVPTLELSRPVYQMRCLVKGCVKEEDATRRRLFKMPKRPYQLEDWCHNLRIDAASIIGQDPHVCERHFESNCFNAFKLLRQNARPTLHLGHNDQLDVLPNPAKWEEDAQVCFVPSCGRSKDADNVQLFGLPKHKVLEKWLQNFRLDLSKEQLNGRRICSMHFEPSCIDNEQLHMGSMPTLQLGHDETDNIHQSVKVLPWSMRFKRMRIYYDCCYPDCAELQKTYQRITYELPQQQRLREMWLSYMGLEQFNLQQAFKLCPLHLVMLYEQFPEQSAKEQVLMSNYEASRNSLRIRLVTCAVQGCMTLKPRDDYRLHSLPTRWDMLELWLDNLQIPNYDNQRNRYRVCDKHFEANCMMETTRRLKPWSLPTLQLPNPPLHENPTEEQWHRMNEHYAPLHRLESNVKQEYVYDQKKEQLRQELRLKQELQLRHQPQPVYENDFDEQQPHEMQELEVLLEVGHVETCTTYEQMQTNPSRNYGEQLHNPRGAAHSLGSGTVVSNGAHYNARHCSVRGCDVTALDVDGNLKLHKFPTSKDAMEKWMHNTQVDVDINFAWRFRICSHHFVAECFNGSRIRRGAMPTLRLGSRRPAHIYDNEFNVQQEQTTATIVEEQLAPTVDSKFDINLRLPCPAPPRKSSKFCQIDGCSNHLTSENVTLHKFPHSVDMCAKWQHNTQVPFDPEYRWRYRICSAHFEPSCLGNMRLMHGSVPTLKLGPRAPKQLFDNDFAAQNLRLDKEKCSEQDLTVKQEPLQQFEEVDQVQEQEQEELNLLEPELQLLEGEDEEEEQDDYLNCSNNWSDVSIKQEKSSDSSYYNPVKSGYDKCSLIHCQRQRSQHGVHIYKFPRSRQLQQHWMHNLRIRYDERRPWKTMICSVHFEPQCIRLRKLRPWAVPTLELGDNVPEQLYTNEQSQQQMKAGSEGEGDYEDYDVDDTMLEDYDDDFEGTESEPMPAEPHIKRERRSRCDPLPPGQLPPWKIKQCCLPYCRRPRGDGIKLFRLPNNIGAIRTWEQATGMRFCESQRNTKLICSRHFDPQLIGVRRLMSNAVPTRNLGPNSQEPELPASGPRCCITDCEHHGLVKLHKFPSDPLLLHQWCQALDLPEVQCYAGKHICAIHLPAKAFSCLICGVEDVQLPMQDFPENRNQRVKWCYNLKIEPIAKWDNTKHICSKHFESCCFIQPGRLLPEAMPTLHLKHTDKNIFLNEYAINKTKMLCIKDEPIESEDLML
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00519952;
- 90% Identity
- -
- 80% Identity
- -