Dpec002335.1
Basic Information
- Insect
- Drosophila pectinifera
- Gene Symbol
- -
- Assembly
- GCA_008042775.1
- Location
- VNKC01000420.1:227378-244829[-]
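The location string above packs contig, coordinates, and strand into one field. A minimal sketch of splitting it apart follows; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for strings shaped like "contig:start-end[strand]".
LOCATION = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)

def parse_location(text):
    """Split a location string into contig, start, end, strand, and span."""
    m = LOCATION.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m["start"]), int(m["end"])
    return {
        "contig": m["contig"],
        "start": start,
        "end": end,
        "strand": m["strand"],
        # Assumption: 1-based inclusive coordinates, so span = end - start + 1.
        "span": end - start + 1,
    }

loc = parse_location("VNKC01000420.1:227378-244829[-]")
print(loc)
```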
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 30 | 2.9 | 5.8e+03 | -2.0 | 1.3 | 44 | 60 | 315 | 332 | 300 | 350 | 0.55 |
| 2 | 30 | 2.7e-15 | 5.3e-12 | 46.2 | 4.0 | 1 | 86 | 544 | 616 | 544 | 617 | 0.85 |
| 3 | 30 | 1e-14 | 2e-11 | 44.3 | 5.0 | 1 | 87 | 644 | 713 | 644 | 713 | 0.83 |
| 4 | 30 | 9.3e-16 | 1.8e-12 | 47.6 | 0.2 | 1 | 87 | 735 | 807 | 735 | 807 | 0.85 |
| 5 | 30 | 6.3e-16 | 1.2e-12 | 48.2 | 5.1 | 1 | 87 | 903 | 973 | 903 | 973 | 0.82 |
| 6 | 30 | 2.2e-15 | 4.3e-12 | 46.4 | 3.6 | 1 | 86 | 997 | 1068 | 997 | 1069 | 0.82 |
| 7 | 30 | 7.9e-12 | 1.6e-08 | 35.0 | 0.5 | 1 | 87 | 1104 | 1172 | 1104 | 1172 | 0.80 |
| 8 | 30 | 8.2e-11 | 1.6e-07 | 31.8 | 1.4 | 1 | 86 | 1214 | 1283 | 1214 | 1284 | 0.76 |
| 9 | 30 | 5e-17 | 1e-13 | 51.7 | 0.4 | 1 | 86 | 1311 | 1380 | 1311 | 1381 | 0.82 |
| 10 | 30 | 4.6e-12 | 9.1e-09 | 35.8 | 1.9 | 1 | 85 | 1402 | 1470 | 1402 | 1472 | 0.78 |
| 11 | 30 | 7.7e-15 | 1.5e-11 | 44.7 | 0.5 | 1 | 86 | 1499 | 1570 | 1499 | 1571 | 0.85 |
| 12 | 30 | 8.5e-13 | 1.7e-09 | 38.1 | 3.3 | 1 | 85 | 1644 | 1712 | 1644 | 1714 | 0.82 |
| 13 | 30 | 2.4e-12 | 4.8e-09 | 36.7 | 0.1 | 1 | 86 | 1737 | 1805 | 1737 | 1806 | 0.83 |
| 14 | 30 | 6.8e-13 | 1.3e-09 | 38.5 | 2.5 | 1 | 87 | 1949 | 2018 | 1949 | 2018 | 0.80 |
| 15 | 30 | 2e-11 | 3.9e-08 | 33.8 | 0.1 | 1 | 86 | 2107 | 2178 | 2107 | 2179 | 0.75 |
| 16 | 30 | 1.1e-05 | 0.022 | 15.4 | 0.0 | 1 | 60 | 2194 | 2245 | 2194 | 2265 | 0.78 |
| 17 | 30 | 2.9e-12 | 5.8e-09 | 36.4 | 0.1 | 1 | 87 | 2273 | 2343 | 2273 | 2343 | 0.79 |
| 18 | 30 | 6.8e-13 | 1.3e-09 | 38.5 | 0.4 | 1 | 87 | 2395 | 2465 | 2395 | 2465 | 0.82 |
| 19 | 30 | 2.7e-11 | 5.3e-08 | 33.3 | 0.1 | 1 | 86 | 2500 | 2574 | 2500 | 2575 | 0.81 |
| 20 | 30 | 2.9e-12 | 5.8e-09 | 36.4 | 0.0 | 1 | 87 | 2585 | 2659 | 2585 | 2659 | 0.79 |
| 21 | 30 | 1.1e-10 | 2.2e-07 | 31.4 | 0.0 | 1 | 61 | 2684 | 2739 | 2684 | 2758 | 0.70 |
| 22 | 30 | 0.00033 | 0.65 | 10.6 | 0.1 | 1 | 58 | 2785 | 2835 | 2785 | 2858 | 0.80 |
| 23 | 30 | 3.1e-11 | 6.1e-08 | 33.1 | 0.2 | 1 | 86 | 2875 | 2946 | 2875 | 2947 | 0.81 |
| 24 | 30 | 1.8 | 3.5e+03 | -1.4 | 0.0 | 19 | 61 | 2952 | 3003 | 2949 | 3016 | 0.51 |
| 25 | 30 | 4.1e-16 | 8.1e-13 | 48.8 | 0.4 | 1 | 86 | 3059 | 3131 | 3059 | 3132 | 0.81 |
| 26 | 30 | 1e-12 | 2e-09 | 37.9 | 3.3 | 1 | 86 | 3195 | 3265 | 3195 | 3266 | 0.80 |
| 27 | 30 | 3.1e-12 | 6.1e-09 | 36.3 | 5.2 | 1 | 86 | 3358 | 3428 | 3358 | 3429 | 0.84 |
| 28 | 30 | 2.1e-11 | 4.1e-08 | 33.7 | 0.1 | 1 | 86 | 3509 | 3578 | 3509 | 3579 | 0.84 |
| 29 | 30 | 2.3e-10 | 4.5e-07 | 30.4 | 0.5 | 1 | 58 | 3604 | 3652 | 3604 | 3666 | 0.83 |
| 30 | 30 | 7e-10 | 1.4e-06 | 28.8 | 1.4 | 18 | 86 | 3670 | 3727 | 3660 | 3728 | 0.75 |
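Not every row in the hmmscan output is an equally convincing THAP hit: rows 1 and 24, for example, have i-Evalues in the thousands and negative scores. A rough sketch of filtering the per-domain rows by i-Evalue is given below. The names `DomainHit`, `parse_hits`, and `significant` are hypothetical, the 1e-3 cutoff is an illustrative assumption rather than a threshold stated by this record, and only five rows from the output are embedded as sample input.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    index: int       # "#" column
    i_evalue: float  # independent E-value
    score: float     # bit score
    ali_from: int    # alignment start on the protein
    ali_to: int      # alignment end on the protein

def parse_hits(text):
    """Parse whitespace-separated rows with the 13 per-domain columns."""
    hits = []
    for line in text.strip().splitlines():
        fields = line.split()
        if len(fields) != 13 or not fields[0].isdigit():
            continue  # skip headers or malformed lines
        hits.append(DomainHit(int(fields[0]), float(fields[3]),
                              float(fields[4]), int(fields[8]), int(fields[9])))
    return hits

def significant(hits, max_i_evalue=1e-3):
    """Keep hits at or below the cutoff (the default is an assumed value)."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]

# Rows 1, 2, 16, 22 and 24 copied from the output above.
SAMPLE = """
1 30 2.9 5.8e+03 -2.0 1.3 44 60 315 332 300 350 0.55
2 30 2.7e-15 5.3e-12 46.2 4.0 1 86 544 616 544 617 0.85
16 30 1.1e-05 0.022 15.4 0.0 1 60 2194 2245 2194 2265 0.78
22 30 0.00033 0.65 10.6 0.1 1 58 2785 2835 2785 2858 0.80
24 30 1.8 3.5e+03 -1.4 0.0 19 61 2952 3003 2949 3016 0.51
"""

for hit in significant(parse_hits(SAMPLE)):
    print(hit.index, hit.ali_from, hit.ali_to, hit.i_evalue)
```

A stricter or looser cutoff changes which borderline rows (such as 16 and 22) are kept, so the threshold should be chosen to suit the analysis at hand.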
Sequence Information
- Coding Sequence
- ATGTCACAACACAACCCCAATCACGCCCACCCACACTACCACTACCCGTCCCATCCAACGCCGCTggctctgcagcagcagcagcaccagcagcagcatcagcaggagcagcacggCAGTAGTTGGTACTCACATGTTGCTTCCTACCCAGCATCCTCCCACTCCCACAACCTCTCTCACTCGGCCTTTGGCCCTGCGCCCCCTTGCAAGGccagcatcagcaacaacaacaccattATGGGTGCctacggaggaggaggagggggtGCTGGCTCGCATGGATATTTCGGCGCCGCTGGCGGTGGCCTCAATGTCAGCGGGGCGGGGGGTGGTGCTGGGTCGACCTACGGCCTTGGGGCCAACACGGTGGCATATGCTCACAACCAGCTGCTGCAGTaccagcatcatcatcagcaacaccagcagcagcagcacctgggTCTGAGCCAGCGATCCTATATGGGCCACGATGTCATGGCCGGGAGCTATCCCTATATCAAGAGCGAACCTTTGGAGGGCTTCCAGCAACCGGCCAATCCAATGGCCCCACCCCCGGCCCCagaaatgataataaaatCGGAACCCATTGACGAACTTGCCTACAAGTCAAACTACATTGACGATAATACGCCATTTGCAGACTTTAGCAAGTTTAGCGAATTCGGCGAGGACATGCTGAGTCCCAAAGTCGAGCTGACAGTTAAGAATGAGTCCTACGACAGGAATCCCAATAGCTTTTTACGCCGTAAGCAACAATCTGATCGGTCGACAACAGAGAGCCTGCCCGTCTGCCAGCGATGCAAGGAGGTGTTCTTCAAGAAGCAGACTTATCTTCGCCACGTCGCCGAGAGCAACTGCGGCATCCAGGAGTACGACTTCAAGTGCAGCATATGTCCTATGTCCTTCATGACCGCCGAGGAGTTACAACAGCATAAGCAACAGCATCGAGCGGACAGGTTCTTTTGTCACAAGTACTGTGGAAAGCACTTTGGTACCATCGCAGAGTGCGAGACTCACGAATACATGCAACACGAATACGATAACATAGTTTGCAACATGTGTTCGGGAACGTTCACCACGAGGGAACAACTGTATGCCCACTTGCCGCAGCACAAGTTCCAGCAGCGCTTCGACTGCCCCATATGCCGCTTGTGGTACCAAACGGCTCTTGAGCTGCACGAGCACCGCTTGGCAGCACCTTACTTTTGCGGGAAATACTATACGGGCGGACAGTCTCCGTCGTCCtctcaacagcaacagcaccagaaCCAGACGAACTACAAGCTGCAAGATTGTCATATGGCAACAATGGAAATGCCAAGCGCGCCACTCCTTAAGACGAACCCATCCGACTCGCCTGCTTTGCCCGCGACTGCGGCGCTTAATTCACTATTGCAGCAGCGTCAGGCAAATGCCGATGGAGCGGCCATTTTTGCCGCATCTACACTTAAGAACGAGGTCACTGTGAAACTGGAGCGCAGCTACAGTAACTCGACAAACGAATCGTCTTATAGCGCTCAGGAGAGCGGCTAcaacaatatttatagcaGCAGCGACACCTCAATCCACGGCTCCCTCGCCGGACCGCAGGCACACTCTTCGACGCTGGACGACTCTGAGGATGCGCTCTGCTGTGTGCCGCTGTGTGGGGTGCGGAAGAGCACAAGTCCCACCTTGCAGTTTTTCACGTTCCCAAAGGACGAGAAGTATCTCAACCAGTGGCTACATAACCTCAAGATGTTTCATATACCCGCCTCCAGCTACGTTAGCTTCCGCATCTGCAGCATGCACTTCCCGAAGCGATGTATAAACCGCTACTCCCTGTGCTACTGGGCGGTGCCAACGTTCAACCTAGGCCACGATGACGTAGCCAATCTCTACCAGAATCGTGAACTGACCAACACCTTTACCACTGGCGAAGTGGCGCGCTGCAGCATGCCTCATTGCACTAGCCAGCGAGGCGAGAGCAACCTTAAGTTTTACAACTTTCCAAAGGAC
ATCAAAAGCTTGATTAAGTGGTGCCAGAATGCCCGGCTTCCGGTGCAGGCGAAGGAACCGCGACATTTCTGCAGCCGCCACTTCGAGGAGCGATGCATTGGCAAGTTTCGACTAAAACCGTGGGCGGTGCCCACACTGCACCTCGGCGCCCAGTATGGCAAGATACACGACAATCCGAAAAATCTTTACGTTGAAGAGAAACGCTGTTGCCTTAACTTTTGCCGTCGAAGTCGATCGTCTGATTTCAATATGTCGCTATATCGATTTCCTAGGGACGAAGTTCTTTTACGTCGCTGGTGCTACAATCTTCGCCTCGATCCGGGAGTATACCGCGGCAAGAATCACAAAATATGCAGCGCACACTTTATCAAGGAGGCATTGGGTCTCCGTAAACTATCTCCTGGTGCCGTGCCCACGCTTCATCTGGGCCACAATGACACATTCAACATCTACGAGAACGAACTATGGCCGCCGCCGACACCAACACCCTCTACTTGCCACTTGCAACAGCAGTCATCACTTCACTCACTACAACAGCAGATGCACAACAAATCCTACCAGCGCCGTTCGGTAGCATCCACTTCGTCGTCAGCGAGTTCAGCAGCCTCGCATTACGTGGACCCGGAGATGAGCGCCTCTTACCACCTAGCCATGTCCACCTCCGCCAGTGGCTCTGCGGCAATAAACGCCAGCGACAGCATGGATGTCTGCTGCGTGCCTAGTTGCGAAAGCAAGCGGCATAACAGCGATAACATCACATTCCACACGATACCGCGACGGCCCGAGCAGATGCGTAAATGGTGTCACAACCTTAAGATAGCTGAGGACAAGATGCACAAGGGCATGCGGATCTGTAGCCTCCACTTCGAGCCCTACTGCATCGGCGGATGTATGCGACCATTTGCGGTGCCCACGCTTCACTTGGGTCACGAAGACGAGGACATTCACCGCAATCCGGACGTGATCAAGAAGCTTAACATTCGGGAAACATGCTGCGTGGCTGTGTGCAAGCGGAATAGGGATAGGGACCATGCTAATCTGCATCGTTTCCCCAGCAACGTGGCCTTACTGAAAAAATGGTGCGCCAATTTGCAGCGCAGTGTTCCCGATGGCAGTAAACTTTTCAATGATGCCATCTGTGAGGTGCACTTTGAGGATCGCTGCCTACGCAACAAGAGGCTGGAGAAGTGGGCAGTGCCCACTCTGATCCTTGGACACGAGGACATCGCCTATCCGCTGCCCACACCAGAACAAGTGACCGAGTTCTATGCCCGGCCTACAGCTCCTAACAATGGCGAGGAACAGGGCGAGTGTTGCGTGGAAACCTGCAAGAGGAATCCCAGCGTGGACGATATTAAGCTATACCGCCCGCCGGAGGAGTCCACCGTGCTGGCAAAGTGGGCGCACAATCTGCAGACGGAGGCCAGTCAGCTGATAGGCATGAGGATCTGCAACCTTCACTTCGAGGCGCATTGCATCGGCAagaggatgcggatgtgggCAATACCAACTCTAAATCTAGCTGGCAACATCGAGAATCTCTACGAGAATCCAGAGCAATCGTTGTTGTACAGGCGGCGGACGACTCACTTAAAGACGAAGCTGCCATCGATCTCCACAAAGCCCACCTGGGTTCCCAGGTGCTGTCTTCCACACTGTCGCAAAGTCAGAGCCCTGCACAACGTCCAGCTTTATCGCTTCCCCAAGCTCAACCGCTCCACATTGGCTAAGTGGGCGCACAATCTCCAGGTTCCAATGGTGGGCAGTGCCCAGCGCAGGCTATGCTCAGCTCATTTCGAGCCGCATGTGCTTAGTAAAAAGTGCCCGGTGCCGCTGGCCGTGCCAACGCTTGACCTGAATTCACCACCAGGCTTGAAAATCTACCAGAATCCGGTGAAGCTAAAGGCCAGCAAACTGTGTCTGCAGCGGGTCTGCATCGTCGAGAGCTGCCGCAAGACGCGGGCGCAGGGCGTGCAGCTTTTCCGGCTACCGCACAGTCC
CACACAGCTGAGGAAATGGATGCACAACATAAGGACGCGGCCACGTGCAGCTATGAGGGCTCAATACCGGGTCTGTTCCCGCCACTTTGAGACACACTCCTTTAATGGCCGAAGACTAAGTGCAGGTGCTATTCCGACTTTGGAACTGGGTCACGATGGTGACGATATCTATCCTAATGAAGCGCAGGCATTTGTGGATGAACATTGTGCTGTCGAAGGCTGCGAGGCGTCCAAGGAGCACCCGGAGGTGCGACTTTTCCGTTTCCccaccgacgacgacgacatgTTGTGGAAATGGTGCAACAACCTAAAAATGAATCCTGTGGACTGCATTGGGGTGCGTATCTGCAACAAGCACTTCGAGGTCGATTGTATCGGTCCCAAGCACCTGTACAAGTGGGCCATTCCCACCAAGGAGCTGGGCCACGACGACGCACAGATAGAGCTGATCCCGAATCCAAAATTAGAGGAGAGGTATGTGGATCCAGTATTCAAATGCATCGTTCCCACCTGCGGCAAGACGCGACGCTTTGATGAGGTGCAGATGAACAGCTTCCCGAAGGATCCGGATCTATTTCAGCGTTGGCGGCACAACCTGCGCTTAGAACACCTCAGTTTCCAAGAACGTGAGCGCTACAAGATTTGTAACGCTCACTTTGAAGAGATCTGTATTGGGAAAACACGGCTGAACATTGGATCCGTACCAACCTTGGAGCTGGGCCATGACGATGAGGATGATATTTTCCAAGTGAATCCGGCGGAACTGCAGAGCAACTTATTCGGGCGACAACGTCGACTGCTTGAGAGATTCGGCGAAGTGAGAGTCAAACAAGAACTGTCCGAGACGGAAGACAACGGAAATGCGGACTTGATGGCCACAGGCTCAAATCCCAAAAAGGTTAAGATCAAGAGACCTATTTCGGATCTAAAGTGTTGTGTGCGCAGCTGTGGAAGAAGTCGATTGGAACACGGGGCACGGCTGTTTCCCTTTCCAACGggcaagcagcagcacctgaAGTGGCGTCATAATCTGCACCTGGAACCAGAGGAGGTGGACCGATCAACGCGAGTTTGCAGCGCCCACTTCAATCGGCGTTGCATCGAGGGCAAACAGCTGAGGAGCTGGGCGATGCCCACTCAGCAGTTGGGACACCACGACCAGCCGATATATGAGAACCCGAAGAATATACCTGGATTCTTCACACCTACCTGTGCCCTAGGACACTGTCGCAAGAGGAGGAGTATTGACAACGATCTGCGTACCTATCGATATCCAAGGAGCGAAGATCTGCTGGAAAAATGGCGAGCTAATTTACGTCTGGCTCCGGATCAGTGTCGTGGCCGGATTTGTGCGAATCACTTCGAGCCGCAGGTGCGGGGCAAGCTAAAGTTAAAGACGGGAGCCGTGCCCACATTAATACTGGGACACGATGAGGGATTAGTCTATGACAATGAAGCTATAAAGGCGGGTATGGTTGACGAAGAGGAAGGCATCACCACAGAATTCCAGCGactgaaacaaaaaaatgagatgttcgatgaggaggaggagggtgaAGAGAATGATGGCGAAAAGCAGCACCCAGATGAACAGGACGAGGCAGACGAAGATGAAAAAGACGACCACTACTTTGATCCTCTTGAACTGGTAGAGACTTTTGCTGAACATCGCAGCGATGATGATGCccaagatgatgatgaagaagaAGAGGGTCGAGTTGACTCCCCCTCCGCTTACGAGGTCAAGGAGGAGATAGAACAGCTTCCAAGCACCCCGCCTTCACCTTTACCCCGACGCCACTACGCTCCGCGTCGAGACAAGCCGGCTAATAATGTGACTCCCATATGTTGTCTGAAACACTGCAGGAAGGAACGCACTGCCTTCCACCTTCTGAGCACTTTCGGCTTCCCAAAGGATCGCCAATTGCTGCTAAAGTGGTGTGTCAACCTGCATTTAAACCCGGATGACTGCGTTGGGCGGGTTTGCATCGAGCACTTCC
AGCCGGAGGTGCTCGGAACCCGTAAGCTTAAGCAGAATGCGGTGCCCACTGTTAATGTGGGACATGAGGAGCCGCTTAGGTACTCGTGTCATGGGGTGGACCAGAATCTCGAGGAGCAAGACCCACAGCCCCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGCAAAAGGAAGCTAACGGAGCCGCCAGATATTCCCCTAGCCAAGAGGAAAGCGCTGGGGATGCCGATGATGAAGCGGGAATgggagatggagatgcagAAAGAGCGGGAGTTGAGGAAAATGACTCAAACGGATAGTGAGTCAAAGAGATGCTGTGTTAGCAGTTGCGGGAACGAAGAAGCAAGCCAATTGCTGCCTTTGCCCGTGGAGAAATCCTTGCTAAGAAAGTGGAGTCACAACTTAAAGCTGTCCACTGAGACTGACACTCTTtctttaagccaaaaaagagTTTGCTTGGCCCATTTTGAGTCGCAGCTGTTGGAGAATGGAAAACTCTCGAAGGAATCAGAGGCAGTTCCCACTTTAAAACTTGGCCACCGCAGTTGGAACCTATACAGGAGCAATGGGATCTGCCTGGTGCCTAACTGCACACACAACACCATGGGTCGCTTAAGCTTCATCGATCTGCCggataatacaataattagaGAAGCTTTCTTTTCCTACCTCAACCTACCTAATCCTCCCAAGGAACAGGCAAGATTATGCGGTATCCATTTTATGGAGGTATACAAGAACTTAAGTCTTCCCAAGGTTTTGCACTCCCAAGATATAATGCAGCTGCAAAGTGTTGAAGACGAATTGCAATGCGCAGTGCCTGGCTGCTTCGAAAATACTGGTCAGAGTTTTCAGCTACTCCAGATTCCAGATAACAAAGAGGTGCTGTCCAAGTGGCTGCACAACACCAAGATCCCCTACGATCCTTCTAGGCACCGAAGCTATCGCATCTGCAGACTACACTTTGAAGCAGAGTACTTAGAAGACGCTTCGTCGCTAAACTGGGCTATACCAACACTCCACCTAAACCAAGACGATGAGATCTACTTAAATACTAAGCCCTTGCAAGAGGAAGAGGTCTCTATGTTGACTCCATTGCGGATAAAGACGGATCTGGCCTTGTTGGGCAGtccaagtgcaagtgcaagcCCCAGTCCTCGGGGCAGGATCCGTATATGTTGCATTCCCACATGTGGACAGATTGGAAGCAATCAAGTAAGGCTCTATCGATTTCCCACCGAGGAGCAGGCGTTACTTCGGTGGCTGGTAAATACGCAACAGCAGCCAAGACTTGTAGATCCCATGGACTTGTATGTCTGCCAGTCGCATTTTGAGCCTGAGGCCATTTGTAAAAAACAGCTCCGCAGCTGGGCCGAGCCCACCTTGAACCTGGGACACGACGGATACGTAATTCCGAATGCCAAACACAATGGAAACATTTCTGACAGCCAGGATACCGAGCAAGCAATGAAGTTCATTCGCGAACGCTTCTGCTCCGTCATTTCATGTTTTCAATCAAAAGGACAGGAAGAGGGAGGAGTGAGGTTGTACGACTATCCCGAAGATATGGCTACTACTCGAAAGTGGGCAGCCGCATGCAGACATCGCTCCATGCAGGCCAGGAGCCATGGGTTCAAGGTGTGTCAGTTGCACTTCTCTATGGAATGCTTTGACCCAGTTACTGGAAATTTGATTGAAGGCTCAGTGCCCACCCTGGAGTTGAGCAGGGATGATATGGAGAGGCAGTGTCTTGTAACTGTATGCGTAAGGAATGATCCTAATGGAGCCCGCCTCCGATACTACAAGATACCAAAAACTACTGCTCAATTGGAAGCGTGGAGCAACAACCTTAAGATCCACCCAACGGATCTAATGCAAGGGGAACAGCAGTACATCTGCGAGAAGCACTTTGAGGCGTTCTGCTTTGGAGCCAACAAGGGACTGCGTTCTGGTGCTCTCCCAACTCTCTTTCTGGGCCATGATGAGGAGGTC
GAGATGCTTCCAAATCCGGAAAGTCTCTTCTCCCAGATCAAAACGGACAAGTGCTGCGTACCAGGTTGCGGACGTATCTGGCAGACTGGTGACCGTAAGTTCCGTGGCTTTCCGAAATTGTTGACCATGGCTAAAAAATGGAGGCATAACCTTCGTTTAGTTGCGACTATGGAGCAACTGGGCAAGCTCAAGGTTTGCAGTGCTCACTTTGAGGCCACCTCCCCCCACGTCATTACAAATGGATTAAGTCCTAGTACTTCGATACCCACCTTGGAATTGGGTCATTCTTCTCCGGATATTTACCAAGCGGACACGAGCTTAAAGTTCCAAAAGCGGTCCGTAATGGTGCGTTATTGCTGTTATCCCAAGTGCGAGGAAATCTGTCTGCCCAAGAATCTGTCTTATGGGCTTCCTGAAGAGGAGCATCTGCGAAATGCCTGGCTAAGCCACATGAACATAGAAGATCCGAAAGATGGAGCAGACGCACAACTATGTCCGCTGCACTATGTCATCCTCTACCAGCACAGTGCCACAAACTATCCCGAGTATCACGCTTCAAGCCGATTGCTTCTTGATGATAATTACAAGGATGCGCGGAACAACAGACGCGTGAAGATTGTGAGCTGTGCAATCAAGGGCTGTGACATGGTTAAGCCCCGGGATGGGGTACTACTGCACGGGATGCCGCAAAGCCAGGACATCCTGCAGATGTGGATAGATAATGGTCAGTTTGAGTTTTTAGAGCAGCAGCGGTACATGCTCAAGGTGTGCCACAATCATTTTGAGTCATGCTGCTTCTTCGACGATAGACGCCTGCTCTCATGGAGCGTGCCGACCTTGCGCCTACCTGGCAAAACATTTCACCAAAATCCTACGGCCGAACAGTGGCAGAACATGATCAACAAGCCTGCAGCAGAAAAAATCAATGCAGATGAGAAAGAGGAGCCAGATCTTGATACGGATGTGGATAAGAGTGAGCCCATTGTAAAGACGGAGCATTTTGAATCCGAAGATGAAAATATAAACTCGGAGATGCAGGCCCTAGAGGTCCTCCTAGAAGTTGGCCACGTGGAACGAATGGAGAGCTATGAGAACGTGGATAAATCACCGGTAATCTATACCGAAAATTCACCCTTCCGATCGTCACCCATACGTTGCCAATACAATGCTAACCACTGTGCCGTAGAGGGATGCCAGGTGACCGTCGAGGATGTGGACGGCACAATAAAGCTGCACAAATTCCCCGCATCGCAGGAAGCCGCACAGAAGTGGATGCACAACACCCAAGTTGACATGGACGAAAAGTTCTGGTGGCGCTACCGCATATGCAGTTACCACTTCGATCAAGAGTGCTTTCAGAGCGCAAGAATTCGAAAAGGCGCGATGCCCACGCTCTTGTTAGGACCTCGGCGACCGGATAAGGTGTACGATAATGAATTCGCACAACCAGAGACGGAGGAGCCTTTTCTAGAGCCCCCTGGAATTCAGCTGGAGGAAAGTATGACTGCGGCGTCTAAAGTTCGTAAGGAAGTTTCCAGTTTATGCCTTCCGCCAAGGGCGCCGCCTCGAAAGTCGAGCAAGTTTTGCCAGATTTATTCTTGTACGAACCACCTGACAACTGAGAACATGACACTTCACAAGTTTCCTCACTCGGAGGACATGTGCCTCAAGTGGCAGCACAACACTCAAGTACCATTCGATCCCTACTACCGCTGGCGTTACCGCATCTGCAGTGCGCATTTTCATCCGATGTGTTTGGTCAACATGCGTCTAGTTCACGGAAGCGTTCCCACTTTGAAGCTAGGTCCCAAGGCTCCATCCGAACTGTTTGACAACGATTTTGATGCCATTAACCAAAGGTTGGACAAAAGGTTGACGGAGTCGAATGCCAATGTGTATATCAAACATGAAAGGAGAGAGGAGGATGAAGACTCGATGATGTTGCCGGAGCCCGAGCTCCAGTTTCACGAGGATCAAGATGATAAGATATCAGCATGGAA
CAGCAAACTGCAATTGACACCTGTAAAGCTGGAGAAAAGTATCTATAGCCAGATGAAGTCCGGCTATGATAAGTGTTCGCTGGCTCACTGCCAACGCCAAAGGTTCCAGCATGGCGTCCACATTTATAAGTTTCCCAGATCGAGGCGCCAGCAGGAGCGTTGGATGCACAACCTCTGTATCCGCTATGATGAGCGTACTCCGTGGAAATTTATGATATGTAGCGTTCATTTCGAACCGCACTGCATCATCTTAAGGAAGTTGCAACCTTGGGCGGTGCCCACACTGGAGCTGGGCGACAATGTGCCAAAGAAGATCTATTCCAACGAACAGTgtgaggaggaggtggtgacTGATCGCAGTGACCTGGAGAGCGACGCCGAGGAAGAAGACGGCTTAcaggaggatgatgatgatgaagacgAGGACGATCTGAAGCCGGATAATGTTGGCATAAAAAGGCGAAGACGTTTAAAGATAGATTCCGCTTGCCCTCCTACCCCGACTGCACCCTGGAAAGTCAAGCAATGCTGCCTTCCCTATTGCCGTGCCTTCCGAGGCGATGGCATCAAGCTATTTCGGCTTCCGAGCAACCGAAATTCCATTAGCAACTGGGAACTGGCCACGGGAATGGTATTCAAGGAGTCGCAACGGAATACTCGTTTGATCTGTAGCCGTCACTTTGAGCCAGAGCTGATTGGAGTCAGGCGTCTAATGCGTAATGCCATTCCCACTAGGCACTTAAACCCCCAAGGAGCTAACCAGATCCGTTctaaaaaagagaagaaaccTCAAGCTGCTGTTATTCCCATCTGCTGCATGGCGGACTGCCACTACAATGGAAATGTGAAGCTGCACAAGTTTCCAAATGATCCCACACTGCTTAGACAGTGGTGCCAGGCTCTTCGGCTCACCGATACGCAGCGGTATTTGGGCAAGCACATTTGTTCGATGCACCTGCCGATGAACAAAACATTGACCTGTGTTATCTGCGGTGGAGACAACGTAGAGTTGCCGATGCTTGAGTTTCCGGAGAACCGCAACCAGCGCGCCAAATGGTGTTACAATCTCAAGATTGAGACAATACCAAAGTGGGACCACTCAAAGCAAATTTGCTGCCGGCATTTCGAGTCCCATTGCTTTGATAAGCCGGGTGAACTACGTCCAGGAGCGGCTCCCACGCTCTATCTGAATCACGATGACTCAAACATATTCTTCAGCGACTATGCCACTGGCCTTCCGTCCTCGCCAATACGAAGTCGAATTAAAGACGAGCCGCTGGAATCGGAGTCTGACGAGATGTTGCCGGAGTAG
- Protein Sequence
- MSQHNPNHAHPHYHYPSHPTPLALQQQQHQQQHQQEQHGSSWYSHVASYPASSHSHNLSHSAFGPAPPCKASISNNNTIMGAYGGGGGGAGSHGYFGAAGGGLNVSGAGGGAGSTYGLGANTVAYAHNQLLQYQHHHQQHQQQQHLGLSQRSYMGHDVMAGSYPYIKSEPLEGFQQPANPMAPPPAPEMIIKSEPIDELAYKSNYIDDNTPFADFSKFSEFGEDMLSPKVELTVKNESYDRNPNSFLRRKQQSDRSTTESLPVCQRCKEVFFKKQTYLRHVAESNCGIQEYDFKCSICPMSFMTAEELQQHKQQHRADRFFCHKYCGKHFGTIAECETHEYMQHEYDNIVCNMCSGTFTTREQLYAHLPQHKFQQRFDCPICRLWYQTALELHEHRLAAPYFCGKYYTGGQSPSSSQQQQHQNQTNYKLQDCHMATMEMPSAPLLKTNPSDSPALPATAALNSLLQQRQANADGAAIFAASTLKNEVTVKLERSYSNSTNESSYSAQESGYNNIYSSSDTSIHGSLAGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHIPASSYVSFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPTPSTCHLQQQSSLHSLQQQMHNKSYQRRSVASTSSSASSAASHYVDPEMSASYHLAMSTSASGSAAINASDSMDVCCVPSCESKRHNSDNITFHTIPRRPEQMRKWCHNLKIAEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHEDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLKKWCANLQRSVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHEDIAYPLPTPEQVTEFYARPTAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESTVLAKWAHNLQTEASQLIGMRICNLHFEAHCIGKRMRMWAIPTLNLAGNIENLYENPEQSLLYRRRTTHLKTKLPSISTKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGLKIYQNPVKLKASKLCLQRVCIVESCRKTRAQGVQLFRLPHSPTQLRKWMHNIRTRPRAAMRAQYRVCSRHFETHSFNGRRLSAGAIPTLELGHDGDDIYPNEAQAFVDEHCAVEGCEASKEHPEVRLFRFPTDDDDMLWKWCNNLKMNPVDCIGVRICNKHFEVDCIGPKHLYKWAIPTKELGHDDAQIELIPNPKLEERYVDPVFKCIVPTCGKTRRFDEVQMNSFPKDPDLFQRWRHNLRLEHLSFQERERYKICNAHFEEICIGKTRLNIGSVPTLELGHDDEDDIFQVNPAELQSNLFGRQRRLLERFGEVRVKQELSETEDNGNADLMATGSNPKKVKIKRPISDLKCCVRSCGRSRLEHGARLFPFPTGKQQHLKWRHNLHLEPEEVDRSTRVCSAHFNRRCIEGKQLRSWAMPTQQLGHHDQPIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRYPRSEDLLEKWRANLRLAPDQCRGRICANHFEPQVRGKLKLKTGAVPTLILGHDEGLVYDNEAIKAGMVDEEEGITTEFQRLKQKNEMFDEEEEGEENDGEKQHPDEQDEADEDEKDDHYFDPLELVETFAEHRSDDDAQDDDEEEEGRVDSPSAYEVKEEIEQLPSTPPSPLPRRHYAPRRDKPANNVTPICCLKHCRKERTAFHLLSTFGFPKDRQLLLKWCVNLHLNPDDCVGRVCIEH
FQPEVLGTRKLKQNAVPTVNVGHEEPLRYSCHGVDQNLEEQDPQPQHSVFRLWSLKHCRKRKLTEPPDIPLAKRKALGMPMMKREWEMEMQKERELRKMTQTDSESKRCCVSSCGNEEASQLLPLPVEKSLLRKWSHNLKLSTETDTLSLSQKRVCLAHFESQLLENGKLSKESEAVPTLKLGHRSWNLYRSNGICLVPNCTHNTMGRLSFIDLPDNTIIREAFFSYLNLPNPPKEQARLCGIHFMEVYKNLSLPKVLHSQDIMQLQSVEDELQCAVPGCFENTGQSFQLLQIPDNKEVLSKWLHNTKIPYDPSRHRSYRICRLHFEAEYLEDASSLNWAIPTLHLNQDDEIYLNTKPLQEEEVSMLTPLRIKTDLALLGSPSASASPSPRGRIRICCIPTCGQIGSNQVRLYRFPTEEQALLRWLVNTQQQPRLVDPMDLYVCQSHFEPEAICKKQLRSWAEPTLNLGHDGYVIPNAKHNGNISDSQDTEQAMKFIRERFCSVISCFQSKGQEEGGVRLYDYPEDMATTRKWAAACRHRSMQARSHGFKVCQLHFSMECFDPVTGNLIEGSVPTLELSRDDMERQCLVTVCVRNDPNGARLRYYKIPKTTAQLEAWSNNLKIHPTDLMQGEQQYICEKHFEAFCFGANKGLRSGALPTLFLGHDEEVEMLPNPESLFSQIKTDKCCVPGCGRIWQTGDRKFRGFPKLLTMAKKWRHNLRLVATMEQLGKLKVCSAHFEATSPHVITNGLSPSTSIPTLELGHSSPDIYQADTSLKFQKRSVMVRYCCYPKCEEICLPKNLSYGLPEEEHLRNAWLSHMNIEDPKDGADAQLCPLHYVILYQHSATNYPEYHASSRLLLDDNYKDARNNRRVKIVSCAIKGCDMVKPRDGVLLHGMPQSQDILQMWIDNGQFEFLEQQRYMLKVCHNHFESCCFFDDRRLLSWSVPTLRLPGKTFHQNPTAEQWQNMINKPAAEKINADEKEEPDLDTDVDKSEPIVKTEHFESEDENINSEMQALEVLLEVGHVERMESYENVDKSPVIYTENSPFRSSPIRCQYNANHCAVEGCQVTVEDVDGTIKLHKFPASQEAAQKWMHNTQVDMDEKFWWRYRICSYHFDQECFQSARIRKGAMPTLLLGPRRPDKVYDNEFAQPETEEPFLEPPGIQLEESMTAASKVRKEVSSLCLPPRAPPRKSSKFCQIYSCTNHLTTENMTLHKFPHSEDMCLKWQHNTQVPFDPYYRWRYRICSAHFHPMCLVNMRLVHGSVPTLKLGPKAPSELFDNDFDAINQRLDKRLTESNANVYIKHERREEDEDSMMLPEPELQFHEDQDDKISAWNSKLQLTPVKLEKSIYSQMKSGYDKCSLAHCQRQRFQHGVHIYKFPRSRRQQERWMHNLCIRYDERTPWKFMICSVHFEPHCIILRKLQPWAVPTLELGDNVPKKIYSNEQCEEEVVTDRSDLESDAEEEDGLQEDDDDEDEDDLKPDNVGIKRRRRLKIDSACPPTPTAPWKVKQCCLPYCRAFRGDGIKLFRLPSNRNSISNWELATGMVFKESQRNTRLICSRHFEPELIGVRRLMRNAIPTRHLNPQGANQIRSKKEKKPQAAVIPICCMADCHYNGNVKLHKFPNDPTLLRQWCQALRLTDTQRYLGKHICSMHLPMNKTLTCVICGGDNVELPMLEFPENRNQRAKWCYNLKIETIPKWDHSKQICCRHFESHCFDKPGELRPGAAPTLYLNHDDSNIFFSDYATGLPSSPIRSRIKDEPLESESDEMLPE
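The C2CH consensus quoted in the domain description (Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His) can be written as a regular expression and scanned across a protein sequence such as the one above. The following is only a sketch: `find_c2ch` is a hypothetical helper, a regex match is a much cruder test than the Pfam HMM behind the hmmscan results, and the synthetic input string is made up purely to exercise the pattern.

```python
import re

# Consensus from the description:
# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His.
# The lookahead lets matches that overlap each other all be reported.
C2CH = re.compile(r"(?=(C.{2,4}C.{35,50}C.{2}H))")

def find_c2ch(protein):
    """Return (1-based start, matched span) for each start position that fits."""
    return [(m.start() + 1, m.group(1)) for m in C2CH.finditer(protein)]

# Synthetic example (not taken from the record): one motif with a 40-residue spacer.
synthetic = "M" + "CAAC" + "A" * 40 + "CAAH" + "K"
for start, span in find_c2ch(synthetic):
    print(start, len(span), span[:10] + "...")
```

To apply it to this record, the two protein sequence lines would first need to be concatenated into a single string, since a motif may straddle the line break.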
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00525910
- 90% Identity
- -
- 80% Identity
- -