Dpar015159.1
Basic Information
- Insect
- Drosophila parthenogenetica
- Gene Symbol
- -
- Assembly
- GCA_035047505.1
- Location
- JAWNPM010000241.1:5995241-6009001[-]
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 28 3.2e-15 7.1e-12 46.1 5.1 1 86 578 650 578 651 0.86 2 28 2.9e-15 6.3e-12 46.2 4.6 1 87 678 747 678 747 0.83 3 28 6.2e-16 1.4e-12 48.4 0.4 1 87 769 841 769 841 0.84 4 28 9.5e-16 2.1e-12 47.8 5.3 1 87 946 1016 946 1016 0.82 5 28 5.6e-15 1.3e-11 45.3 3.0 1 86 1040 1111 1040 1112 0.82 6 28 1e-12 2.2e-09 38.1 1.2 1 87 1147 1215 1147 1215 0.80 7 28 9.8e-11 2.2e-07 31.7 1.4 1 86 1261 1330 1261 1331 0.76 8 28 5.3e-16 1.2e-12 48.6 0.1 1 86 1358 1427 1358 1428 0.83 9 28 4.1e-13 9.2e-10 39.3 1.2 1 86 1449 1518 1449 1519 0.81 10 28 1.5e-14 3.2e-11 44.0 1.4 1 86 1546 1617 1546 1618 0.85 11 28 2.4e-14 5.3e-11 43.3 1.2 1 86 1702 1771 1702 1772 0.81 12 28 3.3e-12 7.2e-09 36.4 0.1 1 86 1795 1863 1795 1864 0.81 13 28 7.4e-14 1.6e-10 41.7 0.9 1 87 2011 2080 2011 2080 0.81 14 28 2.3e-13 5.2e-10 40.1 0.4 1 62 2136 2193 2136 2214 0.79 15 28 0.0081 18 6.3 0.0 1 58 2233 2283 2233 2299 0.78 16 28 8.9e-13 2e-09 38.2 0.4 1 86 2322 2391 2322 2392 0.84 17 28 1.2e-14 2.8e-11 44.2 1.6 1 86 2473 2542 2473 2543 0.82 18 28 4.7e-12 1e-08 35.9 0.9 1 86 2578 2649 2578 2650 0.80 19 28 4.3e-12 9.4e-09 36.1 2.9 1 87 2660 2734 2660 2734 0.82 20 28 1.9e-15 4.3e-12 46.8 0.2 1 86 2757 2827 2757 2828 0.81 21 28 2.8e-05 0.062 14.2 0.1 1 58 2861 2911 2861 2941 0.70 22 28 6.4e-14 1.4e-10 41.9 0.0 1 86 2949 3021 2949 3022 0.80 23 28 1.2e-13 2.7e-10 41.0 1.5 1 86 3200 3272 3200 3273 0.80 24 28 1.3e-13 3e-10 40.9 1.6 1 86 3339 3409 3339 3410 0.82 25 28 3.9e-14 8.6e-11 42.6 3.3 1 86 3505 3575 3505 3576 0.84 26 28 9.7e-13 2.2e-09 38.1 0.1 1 87 3660 3730 3660 3730 0.85 27 28 9.2e-10 2e-06 28.6 2.3 1 58 3749 3795 3749 3801 0.88 28 28 3.8e-12 8.5e-09 36.2 1.1 5 87 3804 3871 3800 3871 0.74
Sequence Information
- Coding Sequence
- ATGTCACAACACAACAACCACCCCccgcatcatcatcatcatcactactaccagcagcagcagcaacaacagcaacaacaacagcaacatcataatcatcatcaacagcagcagcagcagcagcagctacaacataaacaaatacagcAGCACAGTTGGTACTCACATGTTGCTTCCTACCCTCCCCACCATCCGCACGCCGCAGCCTTTGCGGCGCCCTGCAAAaccaataataacaacaacaacaatattatgAATGCATATGGTACGGGAGCTGGTGGAGCGCATGCAGCATATTATGGCTCTGGTGGGGTGGGGGGCTATAACCTTGAGGGCAATACTGTGGCCTATGCGCACAACCAGCTGCtgcaataccaacaacaacaacagcagcagcagcaacaacaacagcagcatcatcagctCAGTCAACGCTCTTATATGGCGCACAGTTTAATGCATGGCTCGTATCCCTATATCAAGAGCGAGCCACTGGAGCTGTCTGATGATAgacaacgccaacaacaacaacagctgcaacaccaccagcaacagcagcaacaacaacaacaacacttccAGAATCCTATGGCACCGCCGCCAGCACCCGCGAATCGTCACAGTCTCGATGCTAGCGGCGAAATGATAATTAAATcGGAACCAGTTGATGAACATGCCTTCAAGTCCAACTACATCGATGATAACACGCCCTTCGCTGATTTCAGCAAATATCCGGAATTCGGCGACGACATGTTGAGTCCCAAGGTTGAGCTATCGGTCAAGGACGAGGGCTATGGCAATCACAAGGTTCCCAATCCGCTCAGCTATCCTAGACGCAAGCTGCAGTCGGACCGCTCATCGGAAATTCTTCCCATTTGTCAGCGCTGCAAGGAGGTGTTCTTCAAGAAGCAGATCTACATACGGCATGTAGCTGAGAGCAGTTGCACCATACAGGAGTATGACTTCAAGTGCAACATCTGCCCCATGTCCTTTATGAGCACCGAGGAACTGCAGAAGCACAAGCATCTACACAGAGCGGACAAGTTCTTTTGCCACAAATACTGTGGCAAGTACTACGATACAATAGCCGACTGTGAGTCGCATGAGTATATGCATCATGAGTACGATAGCTTTGTCTGTAATATGTGCTCTGTTACGTTTGCCACACGGGAGCAGCTCTACGCCCATTTGCCGCAACACAAATTCCAACAACGCTACGATTGCCCCATTTGCCGCTTGTGGTACCAGACAGCTCATGAGCTGCATGAGCACCGTCTGGCTGCACCCTATTTCTGTGGCAAGTACTACGTTGGCTCACAGTCAGCGGCAgcacatcagcagcagcaacaacaacatccacaTCAGCAGCAGGTCGGCTACAAGCTGCAGGACTGTCACATGGGCACCATGGAAaTGGCCACTCCGCATCACAAGACCAATGCAGCTGCCACCGCATTGCCAGCCACTGCAGCTCTGAGTTCATTGCTGCAGCAACGACAGGCGAATGCCGATGGTGCCGCTCTCTTCGCCTCAAATCTTAAGAACGAATCGAATGTGAAGCTGGAGCGAAGTTATAGCAATTCCACAAGCGAGTCTGGCTACAGTCTACATGACAGCAGCTATAACAATGCCTATGGCAGCGATAACTCATTGCATGCTGGTGGTGGAAGTGGTGCACTTGGTGGTCCACAGGCGCACTCCTCAACCCTGGACGATTCGGAAGATGCGTTGTGCTGTGTGCCGCTCTGTGGCGTGCGCAAGAGCACCAGTCCAACCCTGCAGTTCTTTACTTTCCCCAAGGATGAGAAGTACTTGCATCAGTGGCTGCATAAtctgaaaatgtttcacatacCGCAATCCAGTTACTCGACTTTTCGCATCTGCAGCATGCATTTCCCAAAGCGTTGCATCAATCGTTATTCCTTGTGCTATTGGGCGGTGCCCACGTTCAATCTGGGCCATGATGATGTGGCCAATTTGTATCAGAATCGTGAACTGACCAACACCTTCACCACTGGCGAGGTGGCGCGCTGCAGCATGCCCAACTGTACCAGCCAGCGCGGCGAAAGTAATCTTAAGTTCTATAACTTTCCCAAGGATATTAAGAGTCTGATCAAATGGTGCCAGAATGCCCGTCTGCCTGTCCAGGCCAAGGAGCCTCGACACTTCTGCAGTCGTCACTTCGAGGAGCGCTGCATAGGCAAGTTTCGTCTGAAGCCCTGGGCTGTACCCACTTTACATTTGGGTGCCCAGTACGGCAAAATCCATGACAATCCCAAGAACTTGTATGTGGAGGAGAAACGCTGCTGTCTGAATTTCTGTCGTCGCAGTCGATCCTCTGACTTCAACATGTCCCTCTACCGCTTTCCTCGGGATGAGGTGCTGCTACGTCGCTGGTGCTACAATTTGCGATTGGATCCTTCTGTTTATCGCGGAAAGAATCACAAAATATGCAGTGCACACTTTATCAAAGAGGCTTTGGGACTTCGCAAGTTATCGCCAGGAGCTGTGCCCACGTTGCACTTGGGACACAACGACACATTCAACATATACGAGAATGAACTGTGGCCACCACCTGCGCCGACTACGCCCACcaatcatcagcagcaactgcagcagcatcagttgcagcagcaccaacaacaacaccagcagcatcAATCACATCATGGTCATCATGGCAACAAGTATCAACGTCATTCGGCAGCGTCGACCTCCTCGTCGGCCAGCTCAGCATCACATTATGTGGATCCGGAGATGAGTGGATCGTATATGGGAGTAGGCAACTCTGGAGGATCGGGATCAGGGCTGAATGTGAGCGACAGTATGGATGTGTGCTGTGTGCCAAGCTGTGAGAGTAAGCGGCACAACAGTGAGAACATTACATTCCACACCATACCCAGGCGGCCGGAGCAGATGAGCAAATGGTGCCACAATCTCAAGATACCCGAGGACAAGATGCACAAGGGAATGCGCATCTGTAGCCTGCACTTTGAGCCCTATTGTATTGGTGGTTGCATGCGTCCATTCGCTGTGCCCACACTGCACCTGGGCCACGACGACGAGGACATTCATCGCAATCCGGATGTGATCAAGAAGCTTAACATACGCGAGACCTGCTGTGTGGCTGTCTGCAAACGCAATCGCGATCGTGACCACGCCAATCTTCACCGCTTCCCTAGCAATATCGCGCTGCTGACCAAGTGGTGCGCCAATCTGCAACGCCCAGTACCGGATGGCACCAAGCTCTTTAACGACGCCATCTGTGAGGTTCACTTCGAGGATCGTTGTTTGCGCAACAAGCGATTGGAGAAATGGGCTGTGCCCACTTTGATACTGGGCCATGAGAACATAGCGTATCCACTGCCCACGGCGGAGCAGGTGGCCGAGTTCTACGCTCGTCCCAGTGCACCCAACAATGGCGAGGAGCAGGGCGAGTGCTGCGTGGAGACTTGTAAGCGTAATCCCAGTGTGGATGACATCAAGCTCTATCGTCCACCCGAGGAGTCACAGGTGCTGGCCAAGTGGGCACACAACCTGCAGGTAGATGTCGCCAAGTTGACCAGTTTGAGGATTTGTAATCTCCACTTTGAATCCCATTGCATTGGTAAACGGATGCGTCCTTGGGCTATACCCACGCTCAACCTCTCCACGAACATTGAGAATCTCTACGAGAATCCCGAGCATCAAATGCTCTACAAGCGTCGCAAGCATCTCAATTCTGGACGCAGTGCAGCCACCCATGACGGTGGCATGGTGAAACCGACGTGGGTTCCACGTTGCTGCTTGCCACATTGCCGCAAGGTGCGTGCCCTGCACAACGTTCAGCTCTATCGCTTCCCCAAACTCAATCGTTCCACGCTGGCCAAGTGGGCACACAATCTGCAGGTGCCGCTGGTGGGCAGTGCACAGAGACGTCTCTGTTCCGCTCACTTTGAGCCGCATGTGCTGAGCAAGAAGTGCCCCGTGCCACTGGCAGTGCCCACACTGGATTTGAACTCACCACCCGGCTACAAGATCTACCAGAATCCCGCCAAGCTCAAGGCCAACAAGCTGTGCCTGCAGCGCGTCTGCATCGTCGAGAGCTGTCGTCGGCAGCGTGGTCAAGGTGTGCAGCTCTTCCGTCTGCCCCATAATCCCACCCAGCTGCGCAAGTGGATGCACAACATAAAGATGAGACCCAGAGGTGCTATGCGGCAACAGTATCGCATGTGTTCCATACACTTTGAGACGCACTCCTTCAATGGCAAGCGATTGAGTGCTGGAGCGATACCTACGCTGGAGCTGGGCCATGACGATGACGATATCTACCCCAATGAGGCGCAATCCTTTGTCGAGGAGCACTGTACTGTAGAGGGTTGTGATGCGTCCAAGGAGCAGCCGGATGTACGGCTCTTCCGCTTCCCCACAGACGACGAGGATTTGCTCTGGAAGTGGTGCAACAACCTCAAGATGAATCCCGTCGATTGTGTGGGTGTACGCATCTGTAACAAACACTTTGAGCCGGACTGCATCGGGCCCAAGCATCTGTACAAGTGGGCGATACCCACCATGGAGCTGGGACACGATGATGCCGAAATCGAGCTCATCGACAATCCCAAGCTCGAGGAACGTTACGTGGATCCGGTGTTCAAGTGCTGTGTGCCCACCTGCGGCAAGACCCGCAAGTTCGATGAGGTCCAGATGAATAGCTTCCCCAAAGATCCCATTGTCTTCCAGCGCTGGCGACACAATCTTCGTCTGGAACATTTGGATTTCAAAGAGCGTGAACGCTACAAGATCTGCAATGCTCACTTCGAGGACATTTGCATTGGCAAGACACGACTCAACATTGGTTCGGTACCTACGCTGGAGCTAGGACACGACGAGACTGAGGATCTCTATCAGGTAAATCCAGAGGAGCTGCAAAGCAATCTCTTTGGACGTCAGCGCCGCGTTCGCGAATCCATGGGCATAGCCATCAAGCAGGAGGAGGGAGAGGAGGAGGAAGggcaggaggaggaggagcacTCAGAACTGGATGAGGACATCAAGCCAGACATTAACATGCTGCGAGCAAAGGCATCTCAGGCTAAAATGAAGCGATCCCTGTCGGATCTCAAGTGCTGTGTGCCCAGCTGTGGTCGCAGCCGTTTGGAGCACGGCGCACGTCTCTTCCCATTCCCCAGTggcaagcaacagcaaaccAAGTGGCGCCACAATCTTCGCCTGACGCTGAACGAAGTGGATCGCAGCACACGTGTTTGCAGCGCTCACTTCAATCGTCGCTGCATCGATGGCAAGCAGCTGAGAGGATGGGCCATGCCCACACTACAGTTGGGCCACCAGGATCAGCCCATTTATGAGAATCCCAAGAATATTCCAGGCTTCTTTACGCCAACCTGTGCGCTGGCCCATTGCCGTAAACGTCGAAGCATTGACAACGATCTACGCACCTATCGTTATCCCCGTAGCGAGGATTTGCTCGAGAAATGGCGTGTGAATCTACGCTTGGCACCGGATCAGTGCCGGGGACGCATTTGTGCCGATCACTTTGAGCCGATGGTGCGTGGCAAACTAAAGCTGAAGACGGGAGCGGTGCCCACATTGAAATTGGGACATGATGAAGGCGTTGTCTTTGACAACGAGGTCATTAAGGCGGGTCTACAGCAGGAGGCTGATGAAAATGAGGCAAGCATGGATTCGCTAGTCAAGGTCAAACAGGAGAAGATCGAAgcagatgatgatgaggacgaGGAGACGGTAGAGAAGCACCAACAGGAGGAGGATCATGGCTACTTCGATCCCCTGGAGTTGGTGGAAACCTTTGCGGAGCAGAACAGCGAAGAAAACACAGCCTTCGATGAGCCCaacgatgatgaggatgatgacgatgacgacgatgtgaatgatgatgaggaggacgacgaggatgaggatgaggctggtgatgatgatgagctgCTGTTGCCGGATACACCGCCAATTCGCATGCCTCTGCCTCCAATGCCAGTGCGTCGCGAGAAGGCGGTGAACAATGTCACGCCCATTTGTTGTCTCAAGCATTGCCGGAAAGAGCGCACAGCTACGCATCCACTGAGCACCTTTGGCTTTCCCAAGGATCACCAGCAGCTGCTTAAGTGGAGCGCCAATCTGCAGCTGCCGCTGGAGGATTGCATGGGACGCGTCTGCATCGAACACTTTGATCCCGAGATGCTTGGCACTCGTAAACTAAAGCAGAATGCGGTGCCCACACTGAATCTGGGACACACGACACCGCTCAAATACAGCTGCAATGGTCAATCTGTGGCCGTCTATGGGGATCAGCCGGAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACATTGTCGCAAACGGAAGCTTCCAATGGAGGCACCGAATCAGACAGCGACTAAGCGACGACGCTGCTGCCTGCCCAGCTGTGGCAAGCAGCCGGATCTCCATGGTGTACAGCTGCATCGATTGCCCACCAATCGCATACAGCTGCGCAAGTGGCTGCATAATCTAAAGCTATCACCCATGGCGGATAACAGCCAGGCGCGAATTTGTAGCGATCACTTTGAGGCGCAGCTGGAGACGCTCGAGGAATGTGTGCCCACGCTGCGTCTGGGTCACGACGACACACATATCTATAGGAATCGTGGAAGTGCCAGTGGTAGTGCTAGTGTCAGTGGCATCTCGGCCTCATCCAGTGGCTGCATGGTGCCCAGTTGTCCATGTGCGCGTCTTAATCTGTATCGCTGCTATGATCTGCCCGAGAATCGCCTGGTGCAACAGGCCTGGCTGGAGTGGTTACAGCAGCCCATGCCTCAGTTGGCCAGTGATGGCAAGCTCTGTGTGATGCACTACATGCAGCTCTACGAGCAGGTTCCATTGCCCCAGGAGTTGCCCCAGTTGGTGCTACGTCGTCTACAGGAAACCTATGACTTAATCGCCGGCTCCTCGATGGCTATGAAGCTGCGCTGTGCCGTGCCGGGCTGTTACTCCAAGTACACGGATAACATACGGCTCACCAAGCTGCCCATGTGCCCAGAGACCTGCGCCAAGTGGGTGCACAACACCAAGATCAGCTATGAGCCATCCCGTCACTATATTTATCGCATTTGTATGCTGCACTTTGAGCCACGCAGCTTGGGTCCAGTGCGTCCCAAGCAGTGGGCGGTGCCGACGCTGCTATTGAATCACCAAGATCAAAAAATTTATCGGAATCCCATGCTGGACGTAAGCGCTGATAACCAATCAGGTTCTGCTTTGTCGTTGTCGGATCCCATAGCTATCTCCACGCCTGTGCCCATTGCCATGTCCGCGTCGGTGCCCGTGGAGCTGCCGCTGCGCATTAAAACGGAGGTCGCCTTCAGTGGTAGTCCCTGTGCCAGCGCCAGTGCCAGTCCCAGTCCACGTGGCAGGCTGCGTATCTGCTGCATTCCAAGTTGTGCCCAGCAAGCCTCTAGTCAGACACGTCTCTACCGCTTTCCCACCGCCGAGACTGCGCTGCTCAAGTGGCTGGTAAACACACAGCAGCAACCCAGATTGGTGGACACCACGCAGTTGTTTGTGTGCCAGGATCACTTCGAGACGGAGGCCATTTGCAAGAATCAATTGCGTAGCTGGGCTGTGCCCACACTTAAACTGGGACACGATGGTCATGTCATACCAAATGCCAGGCACAATGGCAACATTGCCGATAGCCAGGAGAACAAACATACGCTGCAGTTCATCTGGGAGAACTATTGCTCGGTGCTGAGCTGCTTCCAGCCACGCTCGGAGCAACTGCGTCTATATCCCTACCCCAATGATCGGCGCACCATACGCAAGTGGGCTACCAACTGCAAGCATCGTTCCATGCAGGCCAGCAGTGATGGCTTTCAGgTCTGTCAATCGCACTTTGGACCACATTGCTTTGACCCTGATACAGGTGAGCTGCGAGAGGATGCGGTGCCCACATTGGAGCTCAGTCGTTGCCTCAATGAGATGCGCTGTATTGTGCCCAGCTGTCATCCTGTGCACACAGAGGACGCACGCCGACGACGCTTCTTCAAGTTGCCCAAACGCTGCGCTCAGCTAAACATTTGGTGTTATAATCTTGGCATAGACTCCGCTACGTTGGGCTCTAGTGAAAATCATGTCTGCGATCGTCACTTTGAGACGCAGTGTTTCAATCAACAGAAGCTACTACGTCCCGGTGCCCGCCCCACACTGCATTTGGGGCATGAACAGCATATAGAATTGTTACCTAATCCGGCAATTTTTGCCAGCTACGTGGAAGTCTGCTGTGTGCCCGACTGTGAACAGCCTAAGAATGACGAAGAAGTGCAGCTCTTCGGATTTCCCAAAATCCGTGCACTGTCAGAGAAGTGGCTACAGAACCTACGCCTAGACGCGGGCAGGGAACAGTTGCTCAAAATGAAGATATGTGCAGCACACTTTGAACCCAATTGCATGGATAACAAGCGACTCCAGATGGGTGCCATGCCCACACTGCGACTGGGACACAATGAACACCACAATATACATCGAAGTGGAGAGCCGAATGTGGGCAAAGTCAAGAAATATTGTAATCGCAGTGGTTCCAGCTATGAATGCTGTTATCCCAACTGTGTGGAGCTGCGGAAGACATATCTGAGGATTAGCTATGATCTGCCAGAGGGCGAGGCACTGCGTCAGAAGTGGCTGGAGCATATGGACATAGTGGATGAGGAGCAGAAGCCTTTAAAACTCTGTCCGATGCACTTGATCATACTCTATGATCACAGTGACGAGCACTTTGAGGAGCACACCAAGGAGGAACTGCTGGACAACAACTATGAGGATGCACGGAGCAGTGTCCGCATACGTGTCATCAGTTGTGCGGTGCGTGGTTGTCGTACCCTAAAACCCAGAGACGGTGGACGATTGCATGGTTTGCCCCAGCGTCGGGATGTGCTCGAGATGTGGCTCTATAACATTCAGTTGGTATTCTATGAGCAACAGCGATATTTGTATAAGATTTGCAGTCGACATTTTGAGCCCAGTTGCTTCATGGACACCACGCGACGTCTGAAGCCTTGGACAATGCCTACACTGGAGCTGCCAGAGCGTAAGCCGGGCGAGGCACCAGTCTATCAGAATCCCAGCGAGGAGGAATGGCAGCGCATGAATGAACTGTTTGCCGTAGAGCAGCAGGAACTGCAGCTGGAGCTGGAACAggaccagcagcagcaagaactaGCACAGGAGCAGTACCAGGAGGAACAATATGACGAAGACTGCGATTTGTTGGAACCCATTGTGAAGATTGAACACATGGGGAATGAAGATCAGTTCGATGAAGAGGCTGAGTATCAGCAGGAGGAGCAGCGGCGGGAAGGTGTGGAGGAGGATGactacgacaacgacaactccCAGCAGCCACTCGAGGTGCTACTCGAGGTGGGACACGTTGAGAAGTGCGCAACCTACGAGCAAATGGACTCCGAAGCCGATCAAAGCTATGCCGCTGCCGCAGCcgagcagcagctacagctgAGCAGTTTTAGTGCAACAGCAGCGTCAACGCCGCATCATGGTAGTGGTTCCATGATCAATAATGGCCAAATAATTAAGTACAATGCCCGGCGCTGCAGCGTGCAGGGATGTGATGTTACGGCCAATGATTTGAATAACACAATCAAGCTGCACAAGTTCCCCACATCACTGGATGCACTGAAGAAGTGGATGCACAACACCCAGGTGGATGTGGACACGAATTTCGCTTGGCGTTTTCGCATTTGCAGCTATCACTTTAGTTGTGAATGCTTCAATGGTTCACGCATTCGACGTGGTTCAATGCCCACCATGAGTCTCGGTCCACGTCGTCCACGGAAAATCTACGACAATGAGTTCAACACATCGCTGCAACTGGACCAGGAACAGTCCAATGATGTGGGCAGCGAGGATCAACCACAAACGGAGTTGGAAACGACGAAGGGACGGACAAAGGGAAAGGTCACTCTGCGACTGCCACATCCCGCTCCACCACGTAAATCCAGTAAATTTTGTCAGATCGATGGTTGTTCCAATCACTTGACCAGCGAGAATCTCACCCTGCACAAGTTCCCGCATTCGGTGGACATGTGCGCCAAGTGGCAGCACAACACTCAGGTGCCATTTGATCCGGAGTTCCGTTGGCGCTATCGCATCTGCAGTGCCCACTTTGAGCCCATTTGTCTGATGAACATGCGCCTAATGCACGGCAGTGTGCCCACCCTGTTGCTGGGTCCACGTGCTCCTCGCCAGCTCTTCGAGAGTGACTTTGAGGCCATAAGCCTGCGATTGGATAAACAGAAGAGCGGATTGGAGCAACAGGAGCACATGATGCTGGAGCCAGAGGATGGGGATGAAATGGGTTTGCTGGAGCCGGAGATGCAACTGCTGGAGGATGCCGAAACGGAGCAATCTGATAATCCCAATGATAGCTGGAAGGATGATCATTTACCTGGTATTAAGCAGGAGCCGGGGACCAGCTATAATCCTGTGAAGTCTGGCTACGACAAGTGCTCCCTAGTGCACTGCCAGCGTCAGCGTTCCATGCACGGTGTCCACATCTACAAGTTCCCACGCTCACGGCAACTACAGCAACACTGGATGCACAATCTACGCATTCGCTACGATGAGCGTCGTCCCTGGAAGACCATGATATGCAGCGTACACTTTGAAGCCAATTGCATACGTCTGCGCAAGCTGCGTCCCTGGGCGGTGCCCACTTTGGAGCTGGGCGATAACGTGCCCGACGAGATCTATACGAATGAGCAGAGCCGCCAGCAGCTGGAGACGGGCAGTAGTGACAACGAGGAAGTAGGTCTAGATGTGTCCATGGAGGAGGACCTGGCGTTGGAAGACGACGACTGTGCTGATGAAGACGACGACGTGCTGGCCACGGAGCCATTGGTAAAGATTGAACGTCGCTCTAAATTTGATGCATATCCACCGGGTCAGTTGCCGCCTTGGAAACTCAAATTCTGTTCCTTGCCCTACTGCCGTCATCCACGCGGCGATGGCATCAAGCTCTTCCGGCTGCCTACTAACATCAGTTCCATACGCAAATGGGAGCATGCGACGGGCATTAAATTTACTGAATCCCAGCGCAACACGAAGCTCATCTGCAGTCGTCATTTTGATCCGCAGCTTATTGGAGTGCGTCGTCTTATGTACAATGCGGTGCCTACGCTGCATTTGGGCCCAATGAGTGAGAGGGTTGAACGAGCGCTAGCTGCTCCGGTTGCACGTTGCTGTATGCCCAATTGCCAGGAGGAAGTGAAGCTGCACAAGTTTCCAAGTGATCCTGCACTAAGACAGCAGTGGTGTCACTTACTGAATCTCTCGAATGCCGATAACTACCGCGACAAACACATCTGTGCCTCCCATTTGCCTGATCAGGGACCATGCTGCTTGGTCTGTGGCACAGAGGACATGCAGTTGCCATTGCTAAACTTCCCGGAGAATCGCAATCAACGCGCCAAATGGTGTCACAATCTCAAAATCGAATCCATAGCCAAGTGGGATACCTCTAAGCAGATCTGCAGCAAGCACTTTGAGAGCTACTGCTTTGCACAGCCGGGTCAACAGCTGCTACCTGACGCAGCGCCCACGTTGCACCTAAGacacaacgataacaacataTTCCTCAATGATTACGAGGCCTCAGAACACGATCAAAGCAGATTGTTGGCCATTAAGGATGAGCCCTTAGACAGCAGCGATGATCTGATGCtgtaa
- Protein Sequence
- MSQHNNHPPHHHHHHYYQQQQQQQQQQQQHHNHHQQQQQQQQLQHKQIQQHSWYSHVASYPPHHPHAAAFAAPCKTNNNNNNNIMNAYGTGAGGAHAAYYGSGGVGGYNLEGNTVAYAHNQLLQYQQQQQQQQQQQQQHHQLSQRSYMAHSLMHGSYPYIKSEPLELSDDRQRQQQQQLQHHQQQQQQQQQHFQNPMAPPPAPANRHSLDASGEMIIKSEPVDEHAFKSNYIDDNTPFADFSKYPEFGDDMLSPKVELSVKDEGYGNHKVPNPLSYPRRKLQSDRSSEILPICQRCKEVFFKKQIYIRHVAESSCTIQEYDFKCNICPMSFMSTEELQKHKHLHRADKFFCHKYCGKYYDTIADCESHEYMHHEYDSFVCNMCSVTFATREQLYAHLPQHKFQQRYDCPICRLWYQTAHELHEHRLAAPYFCGKYYVGSQSAAAHQQQQQQHPHQQQVGYKLQDCHMGTMEMATPHHKTNAAATALPATAALSSLLQQRQANADGAALFASNLKNESNVKLERSYSNSTSESGYSLHDSSYNNAYGSDNSLHAGGGSGALGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPQSSYSTFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPSVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPAPTTPTNHQQQLQQHQLQQHQQQHQQHQSHHGHHGNKYQRHSAASTSSSASSASHYVDPEMSGSYMGVGNSGGSGSGLNVSDSMDVCCVPSCESKRHNSENITFHTIPRRPEQMSKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNIALLTKWCANLQRPVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHENIAYPLPTAEQVAEFYARPSAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESQVLAKWAHNLQVDVAKLTSLRICNLHFESHCIGKRMRPWAIPTLNLSTNIENLYENPEHQMLYKRRKHLNSGRSAATHDGGMVKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRGQGVQLFRLPHNPTQLRKWMHNIKMRPRGAMRQQYRMCSIHFETHSFNGKRLSAGAIPTLELGHDDDDIYPNEAQSFVEEHCTVEGCDASKEQPDVRLFRFPTDDEDLLWKWCNNLKMNPVDCVGVRICNKHFEPDCIGPKHLYKWAIPTMELGHDDAEIELIDNPKLEERYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPIVFQRWRHNLRLEHLDFKERERYKICNAHFEDICIGKTRLNIGSVPTLELGHDETEDLYQVNPEELQSNLFGRQRRVRESMGIAIKQEEGEEEEGQEEEEHSELDEDIKPDINMLRAKASQAKMKRSLSDLKCCVPSCGRSRLEHGARLFPFPSGKQQQTKWRHNLRLTLNEVDRSTRVCSAHFNRRCIDGKQLRGWAMPTLQLGHQDQPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEDLLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEGVVFDNEVIKAGLQQEADENEASMDSLVKVKQEKIEADDDEDEETVEKHQQEEDHGYFDPLELVETFAEQNSEENTAFDEPNDDEDDDDDDDVNDDEEDDEDEDEAGDDDELLLPDTPPIRMPLPPMPVRREKAVNNVTPICCLKHCRKERTATHPLSTFGFPKDHQQLLKWSANLQLPLEDCMGRVCIEHFDPEMLGTRKLKQNAVPTLNLGHTTPLKYSCNGQSVAVYGDQPEHSVFRLWSLKHCRKRKLPMEAPNQTATKRRRCCLPSCGKQPDLHGVQLHRLPTNRIQLRKWLHNLKLSPMADNSQARICSDHFEAQLETLEECVPTLRLGHDDTHIYRNRGSASGSASVSGISASSSGCMVPSCPCARLNLYRCYDLPENRLVQQAWLEWLQQPMPQLASDGKLCVMHYMQLYEQVPLPQELPQLVLRRLQETYDLIAGSSMAMKLRCAVPGCYSKYTDNIRLTKLPMCPETCAKWVHNTKISYEPSRHYIYRICMLHFEPRSLGPVRPKQWAVPTLLLNHQDQKIYRNPMLDVSADNQSGSALSLSDPIAISTPVPIAMSASVPVELPLRIKTEVAFSGSPCASASASPSPRGRLRICCIPSCAQQASSQTRLYRFPTAETALLKWLVNTQQQPRLVDTTQLFVCQDHFETEAICKNQLRSWAVPTLKLGHDGHVIPNARHNGNIADSQENKHTLQFIWENYCSVLSCFQPRSEQLRLYPYPNDRRTIRKWATNCKHRSMQASSDGFQVCQSHFGPHCFDPDTGELREDAVPTLELSRCLNEMRCIVPSCHPVHTEDARRRRFFKLPKRCAQLNIWCYNLGIDSATLGSSENHVCDRHFETQCFNQQKLLRPGARPTLHLGHEQHIELLPNPAIFASYVEVCCVPDCEQPKNDEEVQLFGFPKIRALSEKWLQNLRLDAGREQLLKMKICAAHFEPNCMDNKRLQMGAMPTLRLGHNEHHNIHRSGEPNVGKVKKYCNRSGSSYECCYPNCVELRKTYLRISYDLPEGEALRQKWLEHMDIVDEEQKPLKLCPMHLIILYDHSDEHFEEHTKEELLDNNYEDARSSVRIRVISCAVRGCRTLKPRDGGRLHGLPQRRDVLEMWLYNIQLVFYEQQRYLYKICSRHFEPSCFMDTTRRLKPWTMPTLELPERKPGEAPVYQNPSEEEWQRMNELFAVEQQELQLELEQDQQQQELAQEQYQEEQYDEDCDLLEPIVKIEHMGNEDQFDEEAEYQQEEQRREGVEEDDYDNDNSQQPLEVLLEVGHVEKCATYEQMDSEADQSYAAAAAEQQLQLSSFSATAASTPHHGSGSMINNGQIIKYNARRCSVQGCDVTANDLNNTIKLHKFPTSLDALKKWMHNTQVDVDTNFAWRFRICSYHFSCECFNGSRIRRGSMPTMSLGPRRPRKIYDNEFNTSLQLDQEQSNDVGSEDQPQTELETTKGRTKGKVTLRLPHPAPPRKSSKFCQIDGCSNHLTSENLTLHKFPHSVDMCAKWQHNTQVPFDPEFRWRYRICSAHFEPICLMNMRLMHGSVPTLLLGPRAPRQLFESDFEAISLRLDKQKSGLEQQEHMMLEPEDGDEMGLLEPEMQLLEDAETEQSDNPNDSWKDDHLPGIKQEPGTSYNPVKSGYDKCSLVHCQRQRSMHGVHIYKFPRSRQLQQHWMHNLRIRYDERRPWKTMICSVHFEANCIRLRKLRPWAVPTLELGDNVPDEIYTNEQSRQQLETGSSDNEEVGLDVSMEEDLALEDDDCADEDDDVLATEPLVKIERRSKFDAYPPGQLPPWKLKFCSLPYCRHPRGDGIKLFRLPTNISSIRKWEHATGIKFTESQRNTKLICSRHFDPQLIGVRRLMYNAVPTLHLGPMSERVERALAAPVARCCMPNCQEEVKLHKFPSDPALRQQWCHLLNLSNADNYRDKHICASHLPDQGPCCLVCGTEDMQLPLLNFPENRNQRAKWCHNLKIESIAKWDTSKQICSKHFESYCFAQPGQQLLPDAAPTLHLRHNDNNIFLNDYEASEHDQSRLLAIKDEPLDSSDDLML
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00478760;
- 90% Identity
- iTF_00470904;
- 80% Identity
- -