Hdun007440.1
Basic Information
- Insect
- Hirtodrosophila duncani
- Gene Symbol
- -
- Assembly
- GCA_037043425.1
- Location
- JBAMBM010000490.1:2704939-2722299[-]
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 30 5.2 1.1e+04 -2.7 1.9 50 64 331 347 312 362 0.61 2 30 7.6 1.7e+04 -3.2 0.1 6 58 363 416 358 434 0.52 3 30 3.5e-15 7.8e-12 46.0 4.4 1 86 561 633 561 634 0.86 4 30 2.6e-15 5.7e-12 46.4 4.8 1 87 661 730 661 730 0.83 5 30 7.8e-16 1.7e-12 48.0 0.4 1 87 752 824 752 824 0.84 6 30 2.1e-15 4.8e-12 46.6 5.1 1 87 916 986 916 986 0.82 7 30 3.3e-15 7.3e-12 46.0 3.5 1 86 1010 1081 1010 1082 0.82 8 30 1.2e-12 2.8e-09 37.8 0.9 1 87 1117 1185 1117 1185 0.80 9 30 4.1e-11 9.1e-08 32.9 1.5 1 86 1228 1297 1228 1298 0.76 10 30 2.6e-16 5.9e-13 49.5 0.1 1 86 1325 1394 1325 1395 0.82 11 30 2.5e-12 5.6e-09 36.8 2.7 1 87 1416 1486 1416 1486 0.81 12 30 1.9e-14 4.3e-11 43.6 1.0 1 86 1513 1584 1513 1585 0.84 13 30 2.2e-14 4.8e-11 43.4 1.5 1 86 1661 1730 1661 1731 0.84 14 30 5.5e-13 1.2e-09 38.9 0.4 1 86 1754 1822 1754 1823 0.83 15 30 2.4e-14 5.4e-11 43.3 1.4 1 87 1937 2006 1937 2006 0.80 16 30 3.9e-13 8.7e-10 39.4 0.4 1 62 2063 2126 2063 2150 0.77 17 30 0.00095 2.1 9.3 0.1 1 58 2159 2210 2159 2233 0.78 18 30 8.3e-13 1.8e-09 38.3 0.8 1 86 2249 2318 2249 2319 0.85 19 30 1.5e-14 3.3e-11 43.9 0.3 1 87 2392 2462 2392 2462 0.82 20 30 1.5e-12 3.3e-09 37.5 1.2 1 86 2497 2568 2497 2569 0.80 21 30 3e-13 6.6e-10 39.8 0.8 1 87 2579 2655 2579 2655 0.78 22 30 1.9e-15 4.3e-12 46.8 0.2 1 86 2681 2754 2681 2755 0.81 23 30 3.3e-06 0.0072 17.2 0.1 1 58 2788 2842 2788 2863 0.82 24 30 1.5e-14 3.3e-11 44.0 0.5 1 87 2880 2952 2880 2952 0.81 25 30 1.4e-15 3.2e-12 47.2 0.7 1 87 3108 3182 3108 3182 0.85 26 30 3.6e-14 7.9e-11 42.7 3.5 1 87 3254 3325 3254 3325 0.83 27 30 3e-14 6.7e-11 42.9 5.1 1 86 3464 3534 3464 3535 0.85 28 30 9.4e-13 2.1e-09 38.2 0.2 1 86 3612 3681 3612 3682 0.84 29 30 7.7e-11 1.7e-07 32.0 4.0 1 58 3709 3759 3709 3764 0.87 30 30 6.9e-10 1.5e-06 29.0 0.4 19 87 3778 3835 3767 3835 0.72
Sequence Information
- Coding Sequence
- ATGTCACAACACAATCCGCATCACCACCCCCACCCCCATGCGCATTtacaatggcagcagcagcagcaacaacaacaccatcaacataaacaaatacaacagcagcagcacagttGGTACTCACATGTTGCTTCCTACCCACCGCATCACCATGCGTCCTTTACATCACCCTGtaagagcaacaacatcatGAATGCATATGGAGTGGGAGCAGGAGGTCCTACAGGCACGCATGCATATTACGGCAATGTCAATGCTGCGGGGGGTGGTAGTGGAGCTGGGAGTGGTAACATGTCAGGTGGAGGTAGTAACATGTCAGGTGGAGGCTATAACCTTGAGGCAAATAACGTTGCCTATGCCCACAACCAACTGCTGCAgtatcaacaacagcaacaacatcaacaacagctcAGTCAAGGTTCCTCTTATATGGCGCAAAGTCTAATGCACGGCCCATATCCCTATATCAAAAGCGAGCCCATGGAGCAAATGCagtcccagcagcagcagcaacattatCAACCGCCATCGAATCCTATGGCACCGCCGCCAGCGCCAACATCAACTCATCAGTCCGTGGATGATACGATAATAAAATCGGAACCCATTGACGAACATGTCTACAAATCCAACTATATCGATGATAATACACCTTTTGCTGATTTTAGTAAATTTCCTGAATTCGGAGACGACATGCTAAGTCCAAAAGTCGAGCTAACCGTCAAGGATGAGTCCTACGGCAATAATGTAAAGAACCCACTCAATTATCCACGGCGCAAGCTGCAAAATGATCGCTCTTCGGATAGCCTACCCATATGTCAGCGCTGTAAGGAAGTTTTCTTCAAgaagcaaatttatttgcgCCATGTGGCCGATAGCAGCTGCATTATCCAAGAATACGACTTCAAGTGCAGCATTTGTCCCATGTCGTTCATGGGCGCCGAGGAACTGCAGGATCATAAGCTTCTCCACCGAGCTGACAAGTTCTTCTGCCATAAGTACTGTGGAAAGCACTTTGATACCATCGCCGAGTGTGAGTCCCATGAGTACATGCAGCATGAATACGAGAATTTCGTCTGCAATATGTGCTCGAGCACGTTTACCACACGAGAACAGCTTTATGCCCATCTGCCACAGCATAAGTTTCAGCAGCGGTACGATTGTCCAATTTGCCGATTGTGGTATCAAACGGCTTTGGAACTGCACGAGCATCGGTCGTCAGCACCTTACTTCTGCGGCAAGTACTACAGGGGAACTGAATCCCAGTCAACATCGCAGCAGCGTCAACAGCACCAGCCGCAGGAGGAACAGGCCAACTACGAACTGCAGGACTGCCATATGGGTACCATTGAAAtgccATCAACGCACCACAAGTCGAATTCATCCGCCTCAGCCTTGCCCGCAACGGCAGCTCTCAATTCGTTGCTGCAACAACGTCAGGCGAATGCCGATGGTGCCGCGGCTTTATTTGGAGCCTCGTCTTCGTCCTCATCGTTGAAGGGCGAAGTTAATATAAAGCTGGAGAGAAGTTATAGCAACTCCACTAATGAGTCCACCTACAGCTTGCACGataacagctacaacaatgcTTACGGCAGCGACACCTCAATTCATGGAGGTGGCGCTATCGGTGGGCCACAAGCGCATTCCTCGACATTGGACGATTCTGAGGACGCATTATGCTGTGTGCCACTGTGTGGGGTGAGGAAGAGCAGTAGCCCCACTCTGCAGTTTTTTACGTTTCCTAAGGACGAAAAGTATTTGCATCAATGGCTGCACAACCTTAAGATGTTCCACATACCGGCCTCCAGTTATGCGAGTTTTCGCATTTGTAGTATGCACTTCCCGAAGCGCTGCATCAATCGCTACTCTCTCTGCTATTGGGCGGTGCCCACGTTTAATCTTGGCCACGACGATGTAGCCAATCTCTATCAGAACCGCGAGCTGACTAACACCTTTACCACTGGTGAGGTGGCTCGCTGCAGCATGCCCAACTGCAATAGTCAGCGGGGAGAAAGCAATTTGAAGTTTTACAACTTTCCCAAGGACATCAAAAGCTTGATTAAGTGGTGCCAAAATGCTCGTCTTCCGGTCCAAGCCAAGGAGCCACGTCACTTTTGCAGTCGCCACTTTGAAGAACGCTGCATTGGCAAGTTCAGGTTGAAGCCCTGGGCAGTACCCACCTTGCACCTGGGTGCTCAGTATGGCAAGATTCATGACAATCCCAAGAACCTCTACGTAGAGGAGAAGCGATGTTGCTTGAATTTCTGTCGTCGGAGCCGCTCAACGGACTTTAATATGTCACTATACCGATTTCCGAGGGACGAAGTGCTTCTGCGACGTTGGTGCTATAACCTTCGGCTGGATCCGTCCGTCTATCGGGGAAAGAATCATAAGATATGCAGTGCCCACTTCATTAAAGAGGCATTGGGACTTCGGAAGCTGTCACCTGGCGCTGTGCCCACGCTTCATCTTGGTCACAACGACACCTTCAACATATATGAGAACGAACTCtggccaccaccgccgcccagcaatcatcaccagcatcagcagatgCAACACCATTCGCTTCAGCATATTTCACATCACGGCAAATCCTATCAACGACACTCGGCTGCCTCTACTTCATCATCGGCCAGTTCCGCCTCGCATTATGTGGATGCAGAGCTGAGCGCCGCATATACCAACATGTCtgctggcggcggcggttCTTCCTTGAATGTGAGCGACAGTATGGATGTGTGCTGTGTGCCCAGTTGCGACAGCAAACGTCACAACAATGAGAGCATCACATTCCATACAATTCCCAGGAGGGCAGAGCAGATGAGCAAGTGGTGCCACAATCTGAAGATACCGGAGGAGAAGATGCATAAGGGAATGAGGATTTGCAGCCTGCACTTTGAGCCCTACTGTATTGGGGGCTGCATGCGGCCATTTGCAGTTCCCACGCTACATTTGGGTCATGACGACGAAGACATTCACCGAAACCCGGATGTGATCAAGAAGCTCAACATAAGGGAAACATGCTGTGTCGCTGTGTGCAAGCGGAACAGGGATCGGGATCATGCGAATCTGCACCGCTTCCCCAGCAATGCGACCCTGCTAACAAAATGGTGTGCCAATCTCCAGCGACCCGTCCCAGATGGCAGCAAACTCTTCAACGACGCCATATGCGAGGTGCACTTCGAGGATCGTTGTTTGCGTAACAAGCGACTGGAGAAGTGGGCAGTGCCTACCTTGGTTTTGGGTCACGACGATATCGCTTATCCTCTGCCCACGCCGGAGCAGGTGGCCGAGTTTTATGCCCGGCCCAGTGCCCCCAATAACGGCGAGGAGCAGGGCGAGTGTTGTGTGGATACCTGCAAGCGTAATCCTAGTGTGGATGACATTAAGCTGTATCGACCACCGGAGGAGGCTGAGGTGCTCGCCAAGTGGGCCCATAACCTGCAGCTGGAGGTAAGCCAGTTGCCGAGCCTCAAAATTTGCAATCTTCACTTTGAGTCCCATTGCATTGGCAAGCGGATGAGGCCTTGGGCCATACCCACTCTCAATCTCGCTGGCAACATTGAGAATCTTTACGAGAATCCTGAGCACACCATGCTGTACAAGCGTCGCACCCACGTCAAATCAGATGCGACCTCCGTTGGTTCCAGTTCAAAGCCCACTTGGGTGCCTCGGTGCTGCCTTCCTCACTGCCGCAAGGTGCGTGCCCTGCACAATGTTCAACTGTATCGCTTCCCCAAACTCAACCGCTCCACATTGGCCAAATGGGCGCATAACCTGCAGGTGCCTATGATGGGCAGTGCGCAGCGTCGTGTTTGCTCCGCTCACTTTGAACCGCACGTCCTCAGCAAGAAGTGCCCCGTGCCGCTTGCGGTGCCCACTCTGGATCTGAATAGCCAACCGGGCTATAAAATCTATCAGAATCCTGCCAAGCTGAAAGCCAATAAGCTGTGTTTGCAGCGCGTTTGCATCGTGGAGAGCTGCCGTCGACAGCGGGCCCAAGGGGTGCAGCTGTTCCGACTTCCCCACAGTCCGACGCAGCTGCGTAAGTGGATGCACAACATCCGGATGCGCCCGAGAGGAGCGATGCGTGCCCAGTATCGAATCTGTTCCATGCACTTTGAGTCGCATTCGTTCAATGGCAAGCGGCTGAGCGCCGGAGCCATACCCACACTGGAGCTGGGCCATCAGGATGACGACATCTATCCAAACGAAGCTCAGGCCTTTGCCGATGAGCATTGTGCTGTGGAGCGATGTGACGCCTCCAAGGAGCAGGCCGAAGTGCGGCTTTTCCGCTTCCCCACGGATGACGACGATCTCCTATGGAAATGGTGCAACAATCTCAAAATGAATCCAGTAGACTGTGTCGGTGTTCGCATCTGCAACAAACACTTCGAACCAGACTGCATCGGGCCCAAGCACTTGTTTAAATGGGCAATTCCCACAATGCATTTGGGCCATGATGATGCTCAGATCGAGCTCATTAGTAACCCTAAGCCCGAGGAGCGCTATGTGGATCCCGTGTTTAAGTGTTGCGTTCCCACCTGCGGCAAGACCCGCAAGTTTGACGAGGTTCAAATGAACAGCTTTCCCAAGGACCCCATACTCTTTCAGCGGTGGCGCCACAATCTCCGACTCGAACATCTCGACTTTAAGAGGCGAGAGCTCTACAAGATTTGCAATGCTCACTTTGAGGACATTTGCATTGGTAAAACTCGCTTAAATATTGGCTCCATACCGACTCTAGAATTGGGCCATGATGAGACGAACGACTTGTTCCAAGTTAATCCGGCCGAATTGCAAAGCAACCTGTTTGGACGCCAACGACGACTGCTCGATTCACCGACGGATGGCAGCTTTAAAGAGGAGCTGGACGAGGATAGGGACACCAAGTCGATTGTAACGACAATGGCAGAGGCTACCGATTTGAACACCTCGCAGgtaaaaattaagaaatccCTTGCTGATTTGAAGTGTTGCGTGTCCAGTTGTGGACGCAGCCGATTGGAGCATGGAGCCCGTTTATTTGCTTTCCCCAatggcaagcagcagcaaaataaGTGGCGCCATAATTTGCGAATGGAGCCCGGTGAAGTCGATAAAATGACACGGATATGCAGTGCCCACTTCAATAGGCGCTGTGTTGATGGCAAGCAGCTACGGAGCTGGGCCATGCCCACCCTGCAGTTGGGTCATCAGGAGCAGCCCATCTACGAGAATCCGAAAAATATTCCTGGCTTCTTTACACCAACTTGCGCCCTGAGCCATTGCCGAAAAAGACGCAGCATTGACAATGATTTACGCACCTATCGATATCCTAGGTGCGAGGAGCTGTTGGAAAAGTGGCGAGCCAATTTAAGATTGGCCCCAGATCAGTGTCGTGGCCGCATTTGTGCCGATCACTTTGAGCCGCAGGTTAGGGGGAAGCTGAAATTGAAGACTGGAGCTGTGCCTACTCTGAAATTAGGTCACGATGAGGGAGTGGTTTACGATAACGAGGCTATTAAGGTGGATGAGGATGCCAGCATAGACTCGCCTTTAATTAAACCCAAGCCTGAGCCGAAAGAAGAGGATGAAGGGCGTAGGAGTccgcaggaggaggaggaagatgACCTGGATCATAGTTACTTCGATCCCCTCGAGTTGGTGGAAACATATGCCGAGCACAACAGCGATAATGACGATGATCCCGATGAATTAGAACCGGAGCAATTACTGCCAGACACGCCCCCACCTATTCTGCGTCCGGCTCCAATTCCGACTCGCCCTCGAGAAAAGCCACCCAACAATGTGACTCCTATTTGCTGCCTGAAACACTGCCGCAAGGAACGCACCCCTAGCTATCAGCTGAGCACCTTTGGCTTTCCAAAGGATCAACAGTTGCTACTTAAATGGTGTGCGAATCTTCACTTGGAGCCTTCTGATTGCATTGGTCGCGTCTGCATTGAGCATTTTGAGGCAGAGGTGCTGGGAAGTCGCAAGCTGAAGCAGAATGCAGTGCCCACTTTGAACGTGGGTCATGCCAGCCCTCTGCGCTACACCTACAATGGCCAGGAACGAGCCATCCATGATGAGAAGCCTCAGCACTCGGTTTTTCGGCTTTTCAGCCTGAAACACTGCCGCAAAAGGAAGCTAACGGAGCCGCCGGACATGAGCGAGGCAGTGAGTAAGCCTAAGCGCTGTTGCCTCCCCTCATGCGGCAAGCAAGAGGATGTAGATGGAGTGCAGCTCCATCAGCTACCCAACGATCATTCGCTGCGTCGCAAGTGGCTGCAAAATCTTAAGCTATCTCCGCACTATGAGGTGGAATTGACCAGCAATGAATATGTTTGCGTGGAACATTTTGAGACGCACCTGTTTCAGGATGACGGAGAGCCAATGGGACATGCCAATTCCACTGTAAAACAAGAGCATCCCGATAGGAATATTTTCCGTAATATTATTAGCTCCCCAACCATCTCCAACTCCTGCCTGGTGCCCAGCTGCCCATGTGCGCGGCTCAATCTCTATCGGTGCTTTGACCTGCCCGCCAATGCCGAAGTGCAAAAGGCTTGGCTGCATTGGCTCCAGCTTGATGAAGATGAGGCGACCATCTGTCAAGGGCAGCTTTGTGTGATGCACTATATGCAGCTCTATGAGGAAGTGCCCATGCCAGACACAGTCTCATCCGAGGAGCTAGAACGAATGCAAGCAACCTATGGGCAGATTGCTAACTCGTCAATGGCCCTAAAGCTTCGCTGTGCAGTTCCCGGCTGCCGTTCCAAGTACACGGACAATATTCGACTGACCAAGCTGCCCCAATGTCAAGATACATACTCTAAGTGGTTACACAACACTAAGATACCTTATGATCCGTCCAGGCATTACATTTACCGCATATGTATGCTCCATTTCGAGCCACACTGTCTGGGCCCTGTGCGTCCCAAGGCGTGGGCTGTTCCAACTTTGCGATTGAATCATAAGGACGCCATATATTCGAATCCCAAGCTGGAGCAGGATGAAACGTCAActgcagcatcatcatctacATTCAGACCAATCCTCAGTTCCACAGATGCTCTAAAGATCAAGACAGAGCTGGGACCACTGCGAATAAAGACGGAACTGCCCCTGTTGAGTAGTCCGAGCATCAGTGCCAGCCCTAGTCCACGTGGCAAACTGAAGTTCTGCTGCATCCCCAGCTGTGGCCAGCAGGCCAACGCCCTGACACGCCTTTATCGCTTTCCCATCGCAGAGCCGGCACTCCTCAAGTGGCTGGTCAACACCCAGCAGCAACCGCGCTACATGGATGCGGTGAATCTGTACGTTTGCCAGGCACACTTCGAGCCCGACGCCATCTGCAAAAAGCAGCTCCGCAGTTGGGCTGTTCCAACTCTAAATCTCGGCCACGATGGCCATGTTATACCCAATGCCAGGCACAATGGGAACATCGCCGACAGCCAGGAGAACGAGCAGGCCATGGAGTTCATCAGGGAAAACTACTGCTCGGTGCTGACCTGCTTCCAGCCACGCAGTGAAGAGCTCCGTCTGTATGCGTATCCAAAAGATATGCCCACCATTCGGAAATGGGCAGCCAACTGCAAACATCGATCCATGCAGGCCAGCAGCCATGGCTTCCAGGTATGCCAATCGCACTTCACTACTGATTGCTTTAATCCGGACACCGGAGAATTGCACGAAGACGCTGTACCCACTTTGGAGCTAGCTAGGAGCGGCACCGAACTGCGCTGTGTGGTGCCGGGTTGTATGAAGGACGAATCCTCGGCTACGCCTTCAGGGCGTCGACGTTTTCAGAAGATACCCAAGCGGACTGCGCAGCTCAGCGAGTGGTGCCATAATCTTCGTCTGGATGCGGCCACACTGGCCGTGGGTGATCATCACATATGCGAGCGTCATTTTGAGGCAAACTGCTTCAATGCGTATAAACAACTGCGTCCTGGAGCTCGACCGACCCTCCATTTGGGTCACGACGATATTGATGATTTGCTGCCTAATCCAGCGGTTTTCAGCGTGGATGAAACCGCCGAGGACGTGTGCTGTGTTCCTGGATGCGGTCGCTCGAATCAGCAGGCAGATGAAGACCACATCCAATTCAGTAGCTTTCCCAGGATGCGTGTACTGGCCGAGAAGTGGATGTATAATATCCGATTGGAGCCGGACAAAGAGCAGTTCGGCAAGCTGAAGGTTTGCAGCGTACACTTCGAATCTAGTTGCTATGAAAATGGGAAGATGCGAATGGGATCGATGCCCACGCTTATGCTGGGTCACGATAATCCCGATATCTATCAAACGGATGAGCAATCCTTTGCTAAGAATCGGAAGCACCCCAAAAAGTCCAGCGGCTCGGGTATGGATTGCTGCTTCCCGCACTGCGTGGAACTTCAGAAAAACTACCAGCGCATGGTTTACGACTTGCCGCTGACGGAGGAACTGCGTCAAGCTTGGCTAAACCACCTGGATGTGGCAGAGGAGAATTTTGGTTTAGAGGAGCAGTCGCTGAAACTCTGCCCACTGCACTTTATCATGCTTTACGATCACAGCGTTTCCCACTTCCCAGAGCATGCCGCGGAGGAGCTCATGGATAGAAACTACGACGATGCCAGAAAAAGTGTGCGAATACGCCTCATCAGTTGTGCCGTTAGGAGCTGTAGTACCCTGCAGCCAAGGGATGGCGGCCGCTTGCATGGTCTACCGCAGCGCAGGGACCTGCTCCAAATGTGGTTGCACAATACCCAGTTGGAGTTCTCTGAGCAACAGCGATATATGCTGAAGATTTGCAGCAAGCACTTCGAGAGCAGATGTTTCACAGATATCCGGAAGCTAAAACCCTGGAGCATTCCGACCCTCCACCTGCCAGATCTGGAAGAAACCGATGTTAATGCATTCGTAGCTTATCAAAATCCCACAGAGGTGGAATGGcaagaaatgaatgaaaagatGGCCAAGGAATCGAATGCAGTGAATGTAAAACCAGAGCAGGAAGATAATTCTCTGCTGAAGCCAACTGTGCTCATAGAGCATATGGATGATAACGAACAGGAGGAGCTTCCGTTTCCGTCGGTGAAGCAGGAACTGATTACCAACGacgattatgatgatgatgcggaGAGTACTAATTTTCAGCAGCCACTTGAAATGCAGGCCTTGGAAGTCCTTCTGGAAGTGGGACATGTGGAGAATTGCTCAAGCTACGAACTAATGGATACTGAAGGACTTTTAGACGACAGTAGTCCTGCGCCTTTCACTCCCCGAAAGGCAACTAATGCTTCGGTATCCACATCAACTCCAAATCGTGGTCAGTATAATGCACGCCATTGCAGCGTTCAGGGCTGCAAGGTGACCGCCAGCGACACGGATGGGATCTCTGTTAAGCTTCACAAATTCCCCACGTCCGTGGAGGCCACACAGAAGTGGATGCACAACACTCgggtggatgtggatgcaaATTTTTCATGGCGCTATCGCATCTGCAGCTATCATTTCGAGCAGGAGTGTTTCAATGGCTCGAGGATCAGACGTGGCTCGATGCCCACCCTGCATTTGGGACCGCAGATGCCTCAGGAGATCTACTACAATGAGTTTACCCAGCCCGGAACGGACCTGTCAGTGGAAGATACCGTTGAGGCAGCCCCCACTAACGCCATGGACACCGCCAATCGTGTTGGTGAGTTGAATGGCAATGGGGATGGCACTGGAAACGTTATAAAGATGCTGCTGCCACGCCCAGCTCCGCCACGTAAGTCCAGCAAGTTCTGTCAAATTGAGGGATGTCAGAATCACCTAACCAGTGAGAATATGACTCTCCATAAGTTCCCCCATGCACCAGAGATGTGCGCCAGATGGCAGCATAACACCCAGGTGCCCTTTGACCCAGAATTCCGCTGGCGCTATCGCATTTGCAGTGCCCACTTTCAACCGTCGTGCCTGGTCAACATGCGTCTGCTTCATGGCAGTGTGCCCACTCTGCACCTTGGACCACGTGCGCCGTCACAGCTCTTTGACAATGATCTGGAGATTATGATGATGAGAATGGATAAGAAAACAGCAATAGCACACCATGGCAAGCTGGCAGTGGTGAAGCAAGAGCAGAGGGTTAAGTGGGACCGTGAGCAAAAAGTGGAGGAATTTCAGCATGATTGGGTAGAGGACGATGAACACAGTTTCCTGGTGCCTGAGCTGCAACTGCAGGAACAGGAGGAGGAGTGGGAGGGGGAGGAGCTACAGCAGCAAGAGCCATCCCCTAGTTTGAGTCTTTCAGGTTGGAGGCAGCAATTACGTTTGCCCGTCATAAAGCGGGAGGGGGCCGAACAGCAACTAAAGCTGCCTCACAATCGGCATCAAAAGGTGATGTCATACAATCCCATTAAGTCTGGTTACGACAAGTGCTCGCTGGTGCATTGCCAGCGCCAGAGATCCCAGCACGGAGTCCACATCTACAAGTTTCCGCGGTCCAGGCAGCATCAACAGAGATGGATGCACAATCTGCGCATCCGCTATGATGAAAGGCGACCCTGGAAGACCATGATATGCAGTGTACACTTTGAACCGCACTGCATCCGTCTGCGCAAGCTCCGTCCCTGGGCAGTTCCCACTCTGGAACTGGGTGAGAATGTGCCGGAGCACATCTACACCAATGAGCAGTGCCAGGAAATGGCCTACAACGATAACAGTGGGATTGAAACTGAGTGTGAGGACGATGATGGTCTccaagaggaggaggaggatgacGATAACGAGGAAGAAGACGCAGTGCCCCATATCAAAAAGGAGCGGCGATCTCGACTGGACTGGCCCCCAGGCCAACTGCCACCCTGGAAAATCAAACAATGCTGCCTGCCCTACTGCCGCAGACCGCGCGGAGATGGCATCAAGCTCTTCCGGCTGCCCAACAACATCAGTTCGATTAGAAAGTGGGAAAAGGCTACGGGCATGGTGTTCAAGGAGTCTCAACGCAATACCAAACTAATCTGCAGTCGTCATTTTGATCCGGAGCTGATTGGTGTGCGTCGGCTGATGCGTAATGCCATTCCCACACGGCATCTGGGCCCGGTCAGCGATAAGAATGACCAGATGGCAGAGCCCCCGCCCCCGAGATATTCTGCACCACTTACGCCTAAATGCTGTATTCCGAACTGTCATCAGCAATCCGAAGCAGATGCCACAATGCACAAGTTTCCCAGTGATCCGTTGTTGCTCCATCAATGGTGTCAGGCCTTAAAAATCTCCAATGTCCAAGGTCATCGTCACAAGCACGTATGCTCTGCTCACCTGCCCCCCCTAAAGGTGCTCAGCTGTGTGGTTTGTGGTGTCGACAGTGTCCAGCCGCCCCTGCTTGGCTTCCCCGAGAATCGCAACCAGAGAGCCAAATGGTGCTACAATCTGAAAATAGAGCCCATACCCAAATGGGATAACTCCAAAAACATATGCAGCAAACATTTCGAGTCCTTTTGCTTCATCCAGCCGGGAATCCTTCTTCCAGATGCAGCACCCACGCTGCATTTGAATCACGATGACAGTAACATATTCCTCAATGATTATGCCATAGACTTGGACTCTACCCGGACGATGCTGATGCGCATCAAGGATGAGCCCTTGGACAACGACGAGCTGTTGGTGTAG
- Protein Sequence
- MSQHNPHHHPHPHAHLQWQQQQQQQHHQHKQIQQQQHSWYSHVASYPPHHHASFTSPCKSNNIMNAYGVGAGGPTGTHAYYGNVNAAGGGSGAGSGNMSGGGSNMSGGGYNLEANNVAYAHNQLLQYQQQQQHQQQLSQGSSYMAQSLMHGPYPYIKSEPMEQMQSQQQQQHYQPPSNPMAPPPAPTSTHQSVDDTIIKSEPIDEHVYKSNYIDDNTPFADFSKFPEFGDDMLSPKVELTVKDESYGNNVKNPLNYPRRKLQNDRSSDSLPICQRCKEVFFKKQIYLRHVADSSCIIQEYDFKCSICPMSFMGAEELQDHKLLHRADKFFCHKYCGKHFDTIAECESHEYMQHEYENFVCNMCSSTFTTREQLYAHLPQHKFQQRYDCPICRLWYQTALELHEHRSSAPYFCGKYYRGTESQSTSQQRQQHQPQEEQANYELQDCHMGTIEMPSTHHKSNSSASALPATAALNSLLQQRQANADGAAALFGASSSSSSLKGEVNIKLERSYSNSTNESTYSLHDNSYNNAYGSDTSIHGGGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSSSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYASFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCNSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSTDFNMSLYRFPRDEVLLRRWCYNLRLDPSVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPPPSNHHQHQQMQHHSLQHISHHGKSYQRHSAASTSSSASSASHYVDAELSAAYTNMSAGGGGSSLNVSDSMDVCCVPSCDSKRHNNESITFHTIPRRAEQMSKWCHNLKIPEEKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNATLLTKWCANLQRPVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLVLGHDDIAYPLPTPEQVAEFYARPSAPNNGEEQGECCVDTCKRNPSVDDIKLYRPPEEAEVLAKWAHNLQLEVSQLPSLKICNLHFESHCIGKRMRPWAIPTLNLAGNIENLYENPEHTMLYKRRTHVKSDATSVGSSSKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMMGSAQRRVCSAHFEPHVLSKKCPVPLAVPTLDLNSQPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRAQGVQLFRLPHSPTQLRKWMHNIRMRPRGAMRAQYRICSMHFESHSFNGKRLSAGAIPTLELGHQDDDIYPNEAQAFADEHCAVERCDASKEQAEVRLFRFPTDDDDLLWKWCNNLKMNPVDCVGVRICNKHFEPDCIGPKHLFKWAIPTMHLGHDDAQIELISNPKPEERYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPILFQRWRHNLRLEHLDFKRRELYKICNAHFEDICIGKTRLNIGSIPTLELGHDETNDLFQVNPAELQSNLFGRQRRLLDSPTDGSFKEELDEDRDTKSIVTTMAEATDLNTSQVKIKKSLADLKCCVSSCGRSRLEHGARLFAFPNGKQQQNKWRHNLRMEPGEVDKMTRICSAHFNRRCVDGKQLRSWAMPTLQLGHQEQPIYENPKNIPGFFTPTCALSHCRKRRSIDNDLRTYRYPRCEELLEKWRANLRLAPDQCRGRICADHFEPQVRGKLKLKTGAVPTLKLGHDEGVVYDNEAIKVDEDASIDSPLIKPKPEPKEEDEGRRSPQEEEEDDLDHSYFDPLELVETYAEHNSDNDDDPDELEPEQLLPDTPPPILRPAPIPTRPREKPPNNVTPICCLKHCRKERTPSYQLSTFGFPKDQQLLLKWCANLHLEPSDCIGRVCIEHFEAEVLGSRKLKQNAVPTLNVGHASPLRYTYNGQERAIHDEKPQHSVFRLFSLKHCRKRKLTEPPDMSEAVSKPKRCCLPSCGKQEDVDGVQLHQLPNDHSLRRKWLQNLKLSPHYEVELTSNEYVCVEHFETHLFQDDGEPMGHANSTVKQEHPDRNIFRNIISSPTISNSCLVPSCPCARLNLYRCFDLPANAEVQKAWLHWLQLDEDEATICQGQLCVMHYMQLYEEVPMPDTVSSEELERMQATYGQIANSSMALKLRCAVPGCRSKYTDNIRLTKLPQCQDTYSKWLHNTKIPYDPSRHYIYRICMLHFEPHCLGPVRPKAWAVPTLRLNHKDAIYSNPKLEQDETSTAASSSTFRPILSSTDALKIKTELGPLRIKTELPLLSSPSISASPSPRGKLKFCCIPSCGQQANALTRLYRFPIAEPALLKWLVNTQQQPRYMDAVNLYVCQAHFEPDAICKKQLRSWAVPTLNLGHDGHVIPNARHNGNIADSQENEQAMEFIRENYCSVLTCFQPRSEELRLYAYPKDMPTIRKWAANCKHRSMQASSHGFQVCQSHFTTDCFNPDTGELHEDAVPTLELARSGTELRCVVPGCMKDESSATPSGRRRFQKIPKRTAQLSEWCHNLRLDAATLAVGDHHICERHFEANCFNAYKQLRPGARPTLHLGHDDIDDLLPNPAVFSVDETAEDVCCVPGCGRSNQQADEDHIQFSSFPRMRVLAEKWMYNIRLEPDKEQFGKLKVCSVHFESSCYENGKMRMGSMPTLMLGHDNPDIYQTDEQSFAKNRKHPKKSSGSGMDCCFPHCVELQKNYQRMVYDLPLTEELRQAWLNHLDVAEENFGLEEQSLKLCPLHFIMLYDHSVSHFPEHAAEELMDRNYDDARKSVRIRLISCAVRSCSTLQPRDGGRLHGLPQRRDLLQMWLHNTQLEFSEQQRYMLKICSKHFESRCFTDIRKLKPWSIPTLHLPDLEETDVNAFVAYQNPTEVEWQEMNEKMAKESNAVNVKPEQEDNSLLKPTVLIEHMDDNEQEELPFPSVKQELITNDDYDDDAESTNFQQPLEMQALEVLLEVGHVENCSSYELMDTEGLLDDSSPAPFTPRKATNASVSTSTPNRGQYNARHCSVQGCKVTASDTDGISVKLHKFPTSVEATQKWMHNTRVDVDANFSWRYRICSYHFEQECFNGSRIRRGSMPTLHLGPQMPQEIYYNEFTQPGTDLSVEDTVEAAPTNAMDTANRVGELNGNGDGTGNVIKMLLPRPAPPRKSSKFCQIEGCQNHLTSENMTLHKFPHAPEMCARWQHNTQVPFDPEFRWRYRICSAHFQPSCLVNMRLLHGSVPTLHLGPRAPSQLFDNDLEIMMMRMDKKTAIAHHGKLAVVKQEQRVKWDREQKVEEFQHDWVEDDEHSFLVPELQLQEQEEEWEGEELQQQEPSPSLSLSGWRQQLRLPVIKREGAEQQLKLPHNRHQKVMSYNPIKSGYDKCSLVHCQRQRSQHGVHIYKFPRSRQHQQRWMHNLRIRYDERRPWKTMICSVHFEPHCIRLRKLRPWAVPTLELGENVPEHIYTNEQCQEMAYNDNSGIETECEDDDGLQEEEEDDDNEEEDAVPHIKKERRSRLDWPPGQLPPWKIKQCCLPYCRRPRGDGIKLFRLPNNISSIRKWEKATGMVFKESQRNTKLICSRHFDPELIGVRRLMRNAIPTRHLGPVSDKNDQMAEPPPPRYSAPLTPKCCIPNCHQQSEADATMHKFPSDPLLLHQWCQALKISNVQGHRHKHVCSAHLPPLKVLSCVVCGVDSVQPPLLGFPENRNQRAKWCYNLKIEPIPKWDNSKNICSKHFESFCFIQPGILLPDAAPTLHLNHDDSNIFLNDYAIDLDSTRTMLMRIKDEPLDNDELLV
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -