Dlim015793.1
Basic Information
- Insect
- Drosophila limbata
- Gene Symbol
- -
- Assembly
- GCA_963924055.1
- Location
- OZ001353.1:6898681-6916043[+]
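
The Location field packs the sequence accession, start, end and strand into one string. A minimal sketch of splitting it into parts is shown here; the `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'seqid:start-end[strand]' location string."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Accession, colon, start-end, then the strand in square brackets.
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Parse a location string such as 'OZ001353.1:6898681-6916043[+]'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("OZ001353.1:6898681-6916043[+]")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans `loc.length` base pairs on the forward strand of OZ001353.1.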
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  29         9   2.8e+04   -5.1   1.2        13      42        20      50        14      55  0.56
 2  29     5e-15   1.6e-11   45.4   4.3         1      86       577     649       577     650  0.85
 3  29   2.9e-15   8.8e-12   46.2   4.6         1      87       677     746       677     746  0.83
 4  29   6.2e-16   1.9e-12   48.4   0.4         1      87       768     840       768     840  0.84
 5  29   6.1e-16   1.9e-12   48.4   5.7         1      87       946    1016       946    1016  0.83
 6  29   5.8e-15   1.8e-11   45.3   3.1         1      86      1040    1111      1040    1112  0.82
 7  29   8.8e-13   2.7e-09   38.2   1.0         1      87      1147    1215      1147    1215  0.80
 8  29   9.8e-11     3e-07   31.7   1.4         1      86      1260    1329      1260    1330  0.76
 9  29   4.8e-16   1.5e-12   48.7   0.1         1      86      1357    1426      1357    1427  0.83
10  29   3.1e-13   9.6e-10   39.7   1.3         1      86      1448    1517      1448    1518  0.82
11  29   4.7e-15   1.4e-11   45.6   1.7         1      86      1545    1616      1545    1617  0.85
12  29   1.3e-13   3.9e-10   41.0   1.5         1      85      1699    1767      1699    1768  0.82
13  29   3.2e-12     1e-08   36.4   0.1         1      86      1792    1860      1792    1861  0.81
14  29   1.8e-14   5.5e-11   43.7   0.6         1      87      2009    2078      2009    2078  0.79
15  29   2.7e-12   8.3e-09   36.7   0.1         1      60      2133    2186      2133    2209  0.80
16  29    0.0027       8.4    7.8   0.0         1      58      2221    2271      2221    2287  0.80
17  29   7.3e-12   2.3e-08   35.3   0.3         1      86      2310    2379      2310    2380  0.84
18  29   8.6e-15   2.6e-11   44.7   2.3         1      86      2460    2529      2460    2530  0.83
19  29   2.9e-13   8.9e-10   39.8   1.2         1      86      2565    2636      2565    2637  0.81
20  29   4.6e-13   1.4e-09   39.2   1.9         1      87      2647    2719      2647    2719  0.80
21  29     1e-15   3.2e-12   47.6   0.1         1      86      2747    2820      2747    2821  0.80
22  29   0.00042       1.3   10.4   0.0         1      58      2856    2906      2856    2934  0.73
23  29   3.2e-14   9.9e-11   42.9   0.1         1      86      2944    3016      2944    3017  0.79
24  29   4.5e-15   1.4e-11   45.6   0.2         1      86      3174    3246      3174    3247  0.83
25  29   2.4e-13   7.4e-10   40.1   2.6         1      87      3314    3385      3314    3385  0.82
26  29   1.3e-14     4e-11   44.1   3.7         1      86      3495    3565      3495    3566  0.84
27  29   6.2e-13   1.9e-09   38.8   0.0         1      87      3660    3730      3660    3730  0.85
28  29   4.6e-09   1.4e-05   26.3   2.1         1      58      3749    3795      3749    3802  0.88
29  29   5.4e-10   1.7e-06   29.3   0.9        17      87      3811    3870      3798    3870  0.73
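
Each hmmscan domain row above carries 13 whitespace-separated fields, so the rows can be read into records and filtered. The sketch below parses a few rows copied from this record and keeps those whose independent E-value (i-Evalue) falls under a cutoff. The `DomainHit` class, the helper names and the `1e-3` cutoff are assumptions made for illustration; they are not thresholds stated by this database.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DomainHit:
    """One per-domain hmmscan row (field names chosen for this sketch)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# A handful of rows copied from the Hmmscan Out table of this record.
SAMPLE_ROWS = """\
1 29 9 2.8e+04 -5.1 1.2 13 42 20 50 14 55 0.56
2 29 5e-15 1.6e-11 45.4 4.3 1 86 577 649 577 650 0.85
16 29 0.0027 8.4 7.8 0.0 1 58 2221 2271 2221 2287 0.80
22 29 0.00042 1.3 10.4 0.0 1 58 2856 2906 2856 2934 0.73
28 29 4.6e-09 1.4e-05 26.3 2.1 1 58 3749 3795 3749 3802 0.88
"""


def parse_row(line: str) -> DomainHit:
    """Convert one whitespace-separated row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]), float(f[2]), float(f[3]), float(f[4]),
        float(f[5]), int(f[6]), int(f[7]), int(f[8]), int(f[9]),
        int(f[10]), int(f[11]), float(f[12]),
    )


def significant_hits(hits: List[DomainHit], max_i_evalue: float = 1e-3) -> List[DomainHit]:
    """Keep hits with i-Evalue below the cutoff (the cutoff is an assumption)."""
    return [h for h in hits if h.i_evalue < max_i_evalue]


hits = [parse_row(line) for line in SAMPLE_ROWS.splitlines() if line.strip()]
kept = significant_hits(hits)
for h in kept:
    # Aligned length on the protein, treating ali coordinates as inclusive.
    print(h.index, h.i_evalue, h.ali_to - h.ali_from + 1)
```

With this cutoff, the sample keeps domains 2 and 28 and drops domains 1, 16 and 22, whose i-Evalues are above 1. A stricter or looser cutoff may be more appropriate depending on the analysis.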
Sequence Information
- Coding Sequence
- ATGTCACAACACAACAACCCCccgcatcatcatcaacatcactactaccagcagcagcagcaacaacaacagctacaactacaTCACCAGCATCCacaccatcagcagcagcaacaactacaacataaacaaatacaacagCACAGTTGGTACTCACATGTTGCTTCCTACCCTCCCCACCATCCGCACGCCGCAGCCTTTGCGGCGCCCTGCAAAACCAataataacagcagcagcaacaacaacaacataaatgCATACGGTGCGGGAACTGGAGGCACGCATGCAGGATATTATGgctctgctgttgctgcagcagGTGGGGGGGTGGGCTATAACCTTGAGGGCAACACTGTGGCCTATGCGCACAACCAGCTGCtgcaataccaacaacaacaacagcagcaccagcagcagcaacaacaacatcagctcAGTCAACGCTCGTATATGGCGCACAGTTTAATGCACAGCTCTTATCCCTATATTAAGAGCGAGCCATTGGAGCTGCCTGATGATAGACAACGCCAacagcatcatcaacaacaacaacaccagcaacaacaacatttccaAAATCCTATGGCACCGCCGCCAGCTCCCGCCAATCGACACAGTCTCGATGCTAGCGGCGAAATGGTAATAAAATcgGAACCCATTGACGAACATGCCTACAAGTCCAACTATATCGATGATAACACGCCCTTTGCGGATTTTAGTAAATATCCGGAATTCGGCGACGACATGTTGAGTCCCAAGGTGGAGCATTCGGTCAAGGATGAAGGATATGGGAGTCAAAAAGTTCCGAACCCGCTCAGCTATCCGAGACGCAAGCTGCAATCGGAGCGCTCTTCGGAAAGTCTTCCCATTTGTCAGCGTTGCAAGGAGGTGTTCTTCAAGAAGCAAATCTACTTGCGTCATGTGGCCGAGAGCAGTTGCAGCATACAGGAGTATGATTTCAAGTGCAACATTTGCCCCATGTCCTTTATGAGCACCGAGGAGTTGCAGAAGCACAAGCACCAGCACAGAGCAGACAAATTCTTTTGCCACAAATACTGTGGCAAGTACTTTGACACGATCGCCGAGTGCGAGTCGCATGAGTACATGCAGCACGAGTATGAGAGCTTTGTTTGCAACATGTGTTCCGTTACGTTCGCCACGCGGGAACAGCTGTATGCCCATTTGCCGCAACACAAATTCCAGCAGCGTTACGATTGCCCCATCTGTCGCTTGTGGTACCAAACTGCTTTGGAGCTGCACGAACATCGTCTAGCTGCTCCTTATTTTTGTGGCAAGTATTACGCAGCCGCACAATCGGCATcacaccaacagcagcagcaacagcatccacagcagcatcagcaacaggcCAACTACAAACTGCAGGACTGTCACATGGCCACCATGGAAATGCCAACGCCGCATCACAAGGCAAATGCCGCTGCCACCGCATTGCCGGCAACGGCGGCGTTGAGTTCGTTGTTGCAACAACGTCAGGCGAATGCCGATGGAGCCGCTTTGTTTGCCTCAACGCTGAAGAACGAGCCGAATGTGAAGCTGGAGCGAAGCTATAGTAATTCCACAAGCGAGTCGGGCTACAGTTTGCACGACAGCAGCTATAACAATGCCTATGGCAGCGACACCTCGCTGCATGCTAGTGGTGGGGCTGGTGCGATTGGTGGTCCACAGGCGCACTCCTCGACGCTGGACGATTCGGAGGATGCACTGTGCTGTGTGCCGCTGTGCGGTGTCCGTAAGAGCACCAGTCCGACGCTGCAGTTCTTCACTTTCCCCAAGGATGAGAAGTACTTGCATCAGTGGCTCCATAACCTTAAGATGTTCCATATTCCGGCATCGAGTTATGCGACTTTTCGTATTTGCAGCATGCACTTTCCCAAGCGCTGCATCAATCGATATTCCCTGTGCTATTGGGCTGTACCCACGTTCAATTTGGGTCACGATGATGTTGCCAATTTGTATCAGAATCGGGAACTGACC
AACACCTTCACAACCGGCGAGGTGGCCCGTTGCAGCATGCCCAACTGCACCAGCCAGCGGGGAGAGAGCAATCTCAAGTTCTACAACTTTCCCAAGGATATTAAGAGTCTGATCAAGTGGTGCCAGAACGCCCGTCTGCCGGTCCAGGCCAAGGAGCCGCGTCACTTTTGCAGTCGCCACTTCGAGGAGCGCTGCATTGGCAAGTTCCGCCTGAAGCCCTGGGCAGTGCCCACATTGCATTTGGGCGCCCAATACGGCAAGATCCATGACAATCCGAAGAACTTGTATGTGGAGGAGAAACGCTGCTGCCTTAATTTCTGTCGTCGCAGTCGCTCCTCTGACTTTAACATGTCACTGTATCGCTTTCCCCGGGACGAGGTACTCCTTCGACGCTGGTGCTACAATCTTCGTCTGGACCCTTCCGTCTATCGCGGCAAGAATCACAAAATATGCAGCGCTCACTTTATCAAAGAGGCTTTGGGACTTCGCAAATTGTCACCAGGAGCTGTCCCCACGCTGCATTTGGGCCACAACGACACCTTCAACATCTACGAGAATGAACTGTGGCCCCCACCAACGCCCGCCACGCCCACAgctcaccagcagcagcagcaacaacaacagttgcatcagcagcaccagcaacagttgcagcaacaacattcgCATAATCATGGCCACAAATATCAACGTCATTCGGCAGcatcgacgtcgtcgtcggccAGCTCGGCGTCGCACTATGTGGATGCGGACATGGGTGGCTCGTATATGGGAATGGGCAACTCGGGCGGATCTGGAGGATCGTCTGGGCTGAATGTGAGCGACAGCATGGACGTGTGCTGTGTGCCCAGCTGTGAGAGTAAACGgcacaacaacgagaacatcACATTCCATACGATACCGAGGAGGCCGGAGCAGATGAGGAAGTGGTGTCACAACCTGAAGATACCCGAGGATAAGATGCACAAGGGAATGCGCATCTGTAGTCTGCACTTTGAGCCCTACTGCATTGGTGGTTGCATGCGTCCATTTGCGGTGCCCACGTTGCATCTGGGGCACGATGACGAGGACATCCATCGTAATCCGGATGTGATCAAGAAGTTGAATATACGTGAAACCTGTTGTGTAGCTGTCTGCAAACGCAACCGGGATCGGGATCATGCCAATCTGCATCGCTTCCCCAGCAATGTCGCTCTTCTGACCAAGTGGTGCGCCAATCTGCAGCGTCCTGTTCCAGATGGCACCAAGCTCTTCAACGATGCCATCTGTGAGGTGCACTTTGAGGATCGCTGTCTGCGCAACAAGCGACTGGAGAAATGGGCGGTGCCAACGTTAATCCTGGGCCATGACAATATTGCCTATCCGCTGCCCACGGCGGAGCAGGTGGCCGAGTTCTATTCCCGTCCCAGTGCACCCAACAATGGCGAGGAGCAGGGTGAGTGCTGTGTGGAGACCTGTAAGCGCAATCCCAGCGTGGACGACATTAAACTCTATCGTCCGCCAGAGGAGTCGCAGGTGCTGGCGAAATGGGCACATAATCTGGAGCTGGACGTCGCCCAGTTGCCCAACATGAGGATCTGTAATCTGCACTTTGAGTCCCATTGCATTGGCAAGCGGATGCGACCCTGGGCCATACCCACGCTCAATCTCTCCACCAATATTGAGAATCTCTTCGAGAATCCCGAGCGACAGATGCTGTACAAGCGTCGCACGCATCTCAATTCCAATTCGAATCCCGATCGAGGAGCTGGCATCGTGAAGCCGACGTGGGTGCCACGTTGCTGCTTGCCACATTGTCGCAAGGTGCGCGCCCTGCACAATGTGCAGCTGTATCGCTTCCCCAAGCTCAATCGCTCCACGCTCGCCAAGTGGGCGCACAATCTGCAGGTGCCGCTGGTGGGCAGCGCCCAGAGACGCCTCTGCTCCGCCCACTTTGAGCCGCATGTGCTGAGCAAGAAGTGCCCGGTGCCGCTGGCGGTGCCCACGCTGGACCTTAACTC
ACCGCCTGGCTACAAGATCTATCAGAATCCCGCCAAGCTGAAGGCCAACAAGCTCTGCCTGCAGCGGGTCTGCATTGTCGAGAGCTGTCGGCGGCAACGTGGTCAGGGGGTTCAGCTCTTCCGGCTGCCCCACAATCCAACCCAGCTGCGCAAGTGGATGCACAACATAAAGATGCGGCCGAGAGGCGCCATGCGGCAACAGTATCGCATGTGCTCCATTCACTTTGAGTCGCACTCGTTCAATGGGAAGCGGCTGAGTGCCGGGGCGATTCCAACTCTGGAACTGGGACATCAGGATGACGATATCTATCCGAACGAGGCGCAATCCTTTGTCGAGGAGCACTGCACTGTCGAGGGCTGTGATGCGAACAAGGAGCAGCCGGATGTGCGTCTTTTCCGATTCCCCACCGAAGACGAGGATCTGCTCTGGAAATGGTGCAACAATCTCAAGATGAATCCCGTCGACTGTGTCGGCGTTCGCATCTGCAACAAGCATTTCGAGGCGGACTGCATCGGGCCCAAGCATCTGTACAAGTGGGCGATACCCACTCTGGAGCTGGGGCACGATGATGACCAGATTGAGCTCATACACAATCCGAAGCCGGAGGATCGCTACGTTGATCCGGTGTTCAAGTGCTGTGTTCCCACGTGCGGCAAGACGCGCAAGTTCGACGAGGTGCAAATGAACAGCTTCCCCAAGGATCCAACACTCTTCCAGCGCTGGCGACACAATCTCCGTCTCGATCATCTCAATTTCAAGGAGCGCGAACGCTACAAGATCTGCAACGTCCACTTTGAGGACATTTGCATTGGGAAGACTCGGCTCAACATTGGCTCCGTTCCCACACTGGAGCTGGGGCACGACGAGACCGAAGATCTCTTCCAGGTGAATCCCGCGGAACTGCAAAGCAATCTGTTCGGGCGTCAACGACGTGTCCATGGATACATCAAGCAGGAGGAACCAGACGACGATCAGGAGGAACAGGAGGAGCACTCCGAGCTGGATGAGGACATCAAGCCGGACATTAGCATGTCGGAGGCGACGGACTTGAATACATCGCAGGTTAAAATAAAGAGATCTCTGGTAGATTTCAAGTGCTGTGTGCCAAGCTGTGGACGCAGTCGCTTGGAGCATGGAGCACGCCTCTTCCCCTTCCCCAGtggcaagcagcagcagagcaaGTGGCGTCACAATCTCCAACTGTCGTCCGCCGACGTGGACAGATCGACTCGTGTGTGCAGCGCCCATTTCAATCGACGGTGCATCGACGGGAAGCAGCTGCGGGGCTGGGCAATGCCCACTCAGCTGCTGGGACACCAGGAGCAGCCCATCTATGAGAATCCAAAGAATATACCGGGCTTCTTCACGCCCACCTGTGCCCTGGCCCATTGCCGCAAGCGTCGGAGCATTGACAATGATCTGCGCACCTATCGGTATCCGCGCAGCGAGGATCTCCTCGAGAAATGGCGCGTCAATCTGAGACTGGCGCCGGATCAGTGTCGCGGACGCATTTGTGCCGATCACTTTGAGCCCATGGTGCGTGgcaagctgaagctgaagacGGGAGCGGTGCCCACCCTGAAGCTGGGACATGATGAGGGCGTGGTCTTTGACAATGAGGCTATTAAGGCGGGTCTGCAGCAGGAGGCGGAGGTGGAGGCGGACGAGGGCGAGGCATTCATGGAGTCGCTTGTCAAGGTCAAGCAGGAGAAGGTGGATCCGGAGGACGAGCAGCCAGATAACGATGATGAGGATATAGATGAGCAGCCGGACAAAGATCAGGAAGATCATGGCTACTTTGATCCCTTAGAGTTGGTGgaaacctttgcggatcatcaCAGCGACGACAACGCCTCCGaagacaatgatgatgatgatgaggatggGGAAGGGGATGACgctgacgatgacgatgaagCTGTCGATGATGACGAGCTGTTGCTGCCGGATACGTCGCCAGTTCCAGTGGCGCCACTTCCATTGCCGCCACGCCGAGAGA
AGCCTGTGAACAATGTGACGCCCATTTGTTGCCTGAGACACTGTCGCAAGGAGCGCACGGCAAGTCATCTGCTGAGCACCTTTGGCTTCCCCAAAGATCCGCAGCAACTGCTTAAGTGGAGCGCCAATCTGCAGCTGGCACCCGACGATTGCGTCGGTCGCGTCTGCATCGAACACTTTGAGTCCGAGATGCTGGGCACGAGGAAGCTGAAGCAGAATGCGGTGCCCACACTGAACCTGGGACACAGCACGCCCCTGCGCTACAGTTGCAATGGCCAGTCGGTGGGCATTCATGATGCCCAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTAAAGCATTGCCGCAAACGGAAGCTGCCAATGGAGCCGCCGGATCAGACAGCGATTAAGCGACGCTGCTGCCTGCCCAGCTGTGGCAAGCAGCCGGATCTCCATGGCGTCCAGCTGCATCGTCTGCCCAGCAACCGCATCCTGCTGCGCAAGTGGCTCCACAACCTGAAGCTGCCACCAATGATGGACACCAGCCAGGCGCGTCTCTGTAGCGAACACTTTGAGACGCCGCTGGAGGATTGTGTGCCCACACTGCGGCTGGGTCATGACGATACCCACATTTATCGCAATCGTGGCAGCATCTCGGCAGCATCGTCGTCATCCAGTGGCTGCATGGTGCCCAGCTGTCCCTGTGCCCGCCTCAATCTCTATCGCTGCCATGATCTGCCCGAGAATCGTCTGGTGCAGCAGGCCTGGCTGCAGTGGCTCCAACTGCCGATGCCCCAGCTGGCCAGCGATGGCAAGCTCTGCGTGATGCACTACATGCAGCTGTACGAGCAGGTGCCACTGCCCCAGGAGCTGCCAGAGCCAGTGCTCCGTCAGCTGCAGGAGACCTATGACCTAATCGCCAGTTCCTCCATGGCCATGAAGCTGCGCTGTGCCATTCCCGGCTGTTACTCCAAGTACACGGACAACATCCGGCTGACCAAGCTGCCAATGTGCCCGGATATGTCTGCCAAGTGGCTGCACAACACCAAGATCACGTACGATCCTGCCCGACATTATATTTACCGCATCTGCATGCTTCACTTCGAGGCACACTGCCTGGGTCCAGTGCGCCCGAAGCCGTGGGCGGTGCCAACGCTACAGCTGAATCACAGCGATCCTCATATCTACCATAATCCCAAGCTGGATGACAACAGGCAACCAGTGTCCGCGCCAGTTCCCGTTCCTGTGGCCATCTCCACGCCCGTTCCCGATCCCGTTCCCGTGTCTTTGTCCACGTCTGTGCCCGTGGAGCTGCCGTTGCGTATTAAAACGGAGCTGGCCTTCTGCGGCAGTCCCAGCGCCAGTGCCAGTCCGAGTCCACGTGGCAAGCTGCGCATCTGCTGCATTCCCAGCTGCGCCCAACAGGCCACATCGCAGACGCGTCTCTTTCGCTTTCCCAGCGCCGAGACGTCGCTGCTCAAGTGGCTGGTGAAtacgcagcagcagccgcgATTGGTGGACGCACAGCATCTGTTCGTCTGTCAGGACCACTTCGAGGAGGACGCCATTTGCAAGAAGCAACTGCAAAGTTGGGCCGTGCCCACGCTGAGGCTTGGACACGATGGCCATGTCATACCAAATGCGAGGCACAATGGCAACATCGCCGACAGCCAGGAGAACAAGCAGGCGCTGCAGTTCATCTGGGAGAACTACTGCTCCGTCTTGAGCTGCTTCCAGCCGCGCAGCGAGCAACTGCGTCTCTTTGCCTACCCCAAGGATCGACCCACCATACGCAAGTGGGCGGCCAACTGCAAGCATCGTTCCATGCAGGCCAGCAGCGATGGCTACCAGGTCTGTCAATCGCACTTTGCGCCGCATTGCTTTGACCCGGACACGGGTGAGTTGCGGGAGAATGCGGTGCCCACGTTGGAGCTCAGTCGTTGCATGAATGAGGTGCGCTGCGTGGTGCCCGGTTGTGTCAAGGATGAGGATGCGCCGCGTCAACGCTTTTACAGGATG
CCCAAGCGCTTCTCTCAGCTCATCACTTGGTGTCACAATCTTCGACTGGATTCCACAGCCATGGCAAGCGGGGATCATCATGTGTGCGATCGTCACTTTGAGACGCACTGCTTCAACCAGCAGAAACTTCTGCGTCCTGGAGCGCGTCCCACGCTGCACTTGGGCCATGACGAGCAAAAAGACGTGTTGCCCAATCCGGCTGAGTGGGATGCCCCAGATGCGGTGGCTGCCACGAACGCCTGCTGCGTGCCCAACTGTGGACACTCCAAGGAAGAGGAGGATCCCGTCCAGGTGCAGCTCTTTGGCTTCCCAAAACTGCGGGTCCTGGCGGAGAAATGGCTACAGAATATTCGCCTGGACCTTGGCAAGGAGCAGCTGGCCAAGCTGAAGATCTGTTCGGCACACTTTGAGCCCAGTTGCCTGGAAAATGGACGACCCCAGTTGGGTGCCATGCCCACGCTGCAGCTGGGCCACGAGGAGCTCCACAATATACATCGCAGCCGGAGCACAGAGCCAACGGCCGGCAAGGCCAAGAAGTATTGCAACAGGAGTGGCTCCAGCCATGACTGCTGCTATCCGCAGTGCGTGGAGCTGCAGAAGACTTACCTGAGGAATAGCTACGATCTGCCGCAGGGCGAGGCACTGCGTCGCAGGTGGCTGGACTACATGGCCATAGAGGAAGTGGAGGATAAACCGCTCAAGCTCTGTCCGCTGCACTTGATCCTTCTCTACGATCACAGTGACGAGCATTTTGAGGAGCACACGAAGGAGCAGCTGCTGGACACCAACTACGAGGACGCACGCAGCAGTGTCCGCATCCGTGTCATCAGCTGTGCGGTGCGGGGTTGTCGGACCCTGAAGCCACGAGACGGTGGGCGACTACACGGATTGCCCCAGCGGCGGGATGTGCTCGAGATGTGGCTGCACAACATGCAGCTGGTGTTCTACGAGCACCAGCGATACCTATACAAGATATGCAGCAAACACTTTGAGGCCAGTTGCTTCATGGACACGACACGTCGTCTCAAGCCCTGGACAATGCCCACGCTGGAGCTGCCGGAGCGGGAGCCGGGCGAGGCGCCTGTCTATCAGAATCCCACTGAAGACGAATGGCGGCGCATGAATGAGCTGCTGGCAgcggagcagcagcagcaagaacaGCAGCTCATCCCGGAGGATCATGAAGGCGAAGATATTGATATGCTGGAGCCAATAGTGAAGATCGAGCATCAGCTAGACGATGAAGAGGAGGaagagcagctgcagcagcagccagaGGGGGAAGAAGATTATGACAACTCACAGCAGCCGCTGGAAGTGCTCCTCGAGGTGGGACACGTTGAGAAGTGCACCACCTATGAGCAAATGGATTCAGAGGCCAATCTTGGCTATGCCGCCGAGCAGCAGACGCAGATGAGCAGCAGCTTTAGTCCGTCAGTGTCCATGTCGATGTCCGCGTCTGCATCGCAATATGGGAGTGGCGCCATGGTCAGCAATGGATTCAAGTACAATGCCCGCCATTGCAGCGTCCGAGGATGCGATGTGACGGCCAATGATGTGAGTGGCAGCATCAAATTGCACAAGTTCCCCACCTCGCTGGATGCCATGCAGAAATGGATGCACAACACCCAGGTGGATGTGGACACGAACTTCGCATGGCGTTTTCGTATTTGCAGCTATCACTTCACCGGCGAATGCTTCAATGGATCGCGCATAAGGCGTGGTGCCATGCCAACGCTTAGTCTAGGACCATGTCGTCCTCCGACCATCTACGACAACGAGTTCAACACATCGCTGCAGCCGGAACTGGAGCAGGCCAATGAGGTGACTGGCGAGGAGCAGCTGATCCCTGATGCGGAACCAAGTGAGACGCGCACAAAGGGCAGTGAGATTAGCCTGCGTCTGCCACATCCGGCACCGCCACGTAAATCCAGCAAGTTCTGCCAGATCGAGGGTTGTCCCAATCATTTAACCAGCGAGAACCTGACACTCCACAAGTT
TCCGCATTCGGTGGACATGTGCGCCAAGTGGCAGCACAATACCCAGGTCCCGTTTGATCCGGACTTCCGTTGGCGCTATCGCATCTGCAGCGCCCACTTCGAGCCCATCTgtctgattaacatgcgtctGGTGCACGGCAGCGTGCCCACACTTCATCTGGGGCCACGTGCTCCGCGCCAGCTCTTCGAGAGTGACTTTGAGGCGATAAACATGCGATTGGATAAACAGAAGAGCAGCTCGGATCAGCAGCTGGAGCAGCGGGAGCAACTGCAGCTTCAGGAAGaagaggatgaggatgagaatGCACTGAGTTATCTGGTGCCAGAGATGCAATTGCATGAGGATGCCGATGTCGATGGCGATGCGGATCATTCGGACACTCCCGTGACTTACAGCAACAAATGGAAGGATCTGCGCTTGCCCAGCATCAAGCAGGAGAAGACAATGACACTAACGAATCCAGTCAAATCTGGCTATGACAAGTGTTCCCTGGTGCATTGCCAGCGCCAGCGTTCACAGCATGGCGTCCATCTCTACAAGTTTCCACGCTCGAAGCAACTGCAGAAGCACTGGATGCACAACTTGAGGATCATATACGACGAGCGACGTCCCTGGAAGACCATGATATGCAGCGCACACTTTGAGCCCAACTGCATCCGACTCCGCAAGCTGCGTCCGTGGGCAGTGCCCACGCTGGAGCTGGGCGACAATGTGCCGCAACAGATCTACACAAATGAACAGAGCCGGATGCAGCTGGAGATGGGCAGCGATAACGAGGAATTGGAATTGAGCACGGAcgtgtccatgtccatgtccgtGTCCGTGGAGGATACTCAGTTAGAGGGAGCCTACGAAGACGACGATTATGATGATGAAGACGATGACTATTTGGCTACGGAGCCATTGGTAAAAAGGGAGCGTCGCTCACGCTTCGATCCCTTGCGGCCAGGTCAGTTGCCGCCTTGGAAAATGAAATTCTGCTCCTTGCCCTACTGTCGCAGCCCACGCGGTGATGGCATCAAGCTCTTCCGGCTGCCCAATAACATCAGTTCCATTCGCAAATGGGAAAAGGCGACCGGCATTCACTTCACTGAATCCCAACGCAATACGAAGCTCATCTGCAGTCGCCACTTTGATCCACAGCTGATCGGGGTACGTCGGCTCATGTACAATGCCGTGCCCACACTTCATCTGGGTCCAATAAGTGCAGTTAGTCAACCCGTGGCAGCTCCTGCAGCTCCACGCTGCTGTATGCCTGATTGTCAGGAGGAAGTCAAGTTGCACAAGTTTCCCAGTGATCCCCTGCTGTTGCATCAATGGTGTCACGCGCTAGATCTCTCGGATACTCAGCGATATAGTAGCAAACACATCTGCGCCAGACATCTGCCCGCCAAAGCTCCGAGTTGCATCGTCTGTGGCGTGGAACATATCAAGATGCCATTGCTAGACTTCCCGGAGAATCGCAATCAGCGCGCCAAATGGTGTTACAATCTCAAAATCGAGCCCATCCCCAAGTGGGATAGCTCAAAGCAGATATGCAGCAAGCACTTCGAGAGCTACTGCTTCGTTCAGCCGGGTCAACTGCTGCCCGAGGCAGCTCCCACGTTGCACTTACAGCACAACGATAGCAACATATTTCTGAACGATTATACCATAGATCACAGCAAGATGCTGCGCATTAAGGATGAGCCCTTGGACAGCGAGGACCTGATGCtgtaa
- Protein Sequence
- MSQHNNPPHHHQHHYYQQQQQQQQLQLHHQHPHHQQQQQLQHKQIQQHSWYSHVASYPPHHPHAAAFAAPCKTNNNSSSNNNNINAYGAGTGGTHAGYYGSAVAAAGGGVGYNLEGNTVAYAHNQLLQYQQQQQQHQQQQQQHQLSQRSYMAHSLMHSSYPYIKSEPLELPDDRQRQQHHQQQQHQQQQHFQNPMAPPPAPANRHSLDASGEMVIKSEPIDEHAYKSNYIDDNTPFADFSKYPEFGDDMLSPKVEHSVKDEGYGSQKVPNPLSYPRRKLQSERSSESLPICQRCKEVFFKKQIYLRHVAESSCSIQEYDFKCNICPMSFMSTEELQKHKHQHRADKFFCHKYCGKYFDTIAECESHEYMQHEYESFVCNMCSVTFATREQLYAHLPQHKFQQRYDCPICRLWYQTALELHEHRLAAPYFCGKYYAAAQSASHQQQQQQHPQQHQQQANYKLQDCHMATMEMPTPHHKANAAATALPATAALSSLLQQRQANADGAALFASTLKNEPNVKLERSYSNSTSESGYSLHDSSYNNAYGSDTSLHASGGAGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYATFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPSVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPATPTAHQQQQQQQQLHQQHQQQLQQQHSHNHGHKYQRHSAASTSSSASSASHYVDADMGGSYMGMGNSGGSGGSSGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRPVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHDNIAYPLPTAEQVAEFYSRPSAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESQVLAKWAHNLELDVAQLPNMRICNLHFESHCIGKRMRPWAIPTLNLSTNIENLFENPERQMLYKRRTHLNSNSNPDRGAGIVKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRGQGVQLFRLPHNPTQLRKWMHNIKMRPRGAMRQQYRMCSIHFESHSFNGKRLSAGAIPTLELGHQDDDIYPNEAQSFVEEHCTVEGCDANKEQPDVRLFRFPTEDEDLLWKWCNNLKMNPVDCVGVRICNKHFEADCIGPKHLYKWAIPTLELGHDDDQIELIHNPKPEDRYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPTLFQRWRHNLRLDHLNFKERERYKICNVHFEDICIGKTRLNIGSVPTLELGHDETEDLFQVNPAELQSNLFGRQRRVHGYIKQEEPDDDQEEQEEHSELDEDIKPDISMSEATDLNTSQVKIKRSLVDFKCCVPSCGRSRLEHGARLFPFPSGKQQQSKWRHNLQLSSADVDRSTRVCSAHFNRRCIDGKQLRGWAMPTQLLGHQEQPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEDLLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEGVVFDNEAIKAGLQQEAEVEADEGEAFMESLVKVKQEKVDPEDEQPDNDDEDIDEQPDKDQEDHGYFDPLELVETFADHHSDDNASEDNDDDDEDGEGDDADDDDEAVDDDELLLPDTSPVPVAPLPLPPRR
EKPVNNVTPICCLRHCRKERTASHLLSTFGFPKDPQQLLKWSANLQLAPDDCVGRVCIEHFESEMLGTRKLKQNAVPTLNLGHSTPLRYSCNGQSVGIHDAQPQHSVFRLWSLKHCRKRKLPMEPPDQTAIKRRCCLPSCGKQPDLHGVQLHRLPSNRILLRKWLHNLKLPPMMDTSQARLCSEHFETPLEDCVPTLRLGHDDTHIYRNRGSISAASSSSSGCMVPSCPCARLNLYRCHDLPENRLVQQAWLQWLQLPMPQLASDGKLCVMHYMQLYEQVPLPQELPEPVLRQLQETYDLIASSSMAMKLRCAIPGCYSKYTDNIRLTKLPMCPDMSAKWLHNTKITYDPARHYIYRICMLHFEAHCLGPVRPKPWAVPTLQLNHSDPHIYHNPKLDDNRQPVSAPVPVPVAISTPVPDPVPVSLSTSVPVELPLRIKTELAFCGSPSASASPSPRGKLRICCIPSCAQQATSQTRLFRFPSAETSLLKWLVNTQQQPRLVDAQHLFVCQDHFEEDAICKKQLQSWAVPTLRLGHDGHVIPNARHNGNIADSQENKQALQFIWENYCSVLSCFQPRSEQLRLFAYPKDRPTIRKWAANCKHRSMQASSDGYQVCQSHFAPHCFDPDTGELRENAVPTLELSRCMNEVRCVVPGCVKDEDAPRQRFYRMPKRFSQLITWCHNLRLDSTAMASGDHHVCDRHFETHCFNQQKLLRPGARPTLHLGHDEQKDVLPNPAEWDAPDAVAATNACCVPNCGHSKEEEDPVQVQLFGFPKLRVLAEKWLQNIRLDLGKEQLAKLKICSAHFEPSCLENGRPQLGAMPTLQLGHEELHNIHRSRSTEPTAGKAKKYCNRSGSSHDCCYPQCVELQKTYLRNSYDLPQGEALRRRWLDYMAIEEVEDKPLKLCPLHLILLYDHSDEHFEEHTKEQLLDTNYEDARSSVRIRVISCAVRGCRTLKPRDGGRLHGLPQRRDVLEMWLHNMQLVFYEHQRYLYKICSKHFEASCFMDTTRRLKPWTMPTLELPEREPGEAPVYQNPTEDEWRRMNELLAAEQQQQEQQLIPEDHEGEDIDMLEPIVKIEHQLDDEEEEEQLQQQPEGEEDYDNSQQPLEVLLEVGHVEKCTTYEQMDSEANLGYAAEQQTQMSSSFSPSVSMSMSASASQYGSGAMVSNGFKYNARHCSVRGCDVTANDVSGSIKLHKFPTSLDAMQKWMHNTQVDVDTNFAWRFRICSYHFTGECFNGSRIRRGAMPTLSLGPCRPPTIYDNEFNTSLQPELEQANEVTGEEQLIPDAEPSETRTKGSEISLRLPHPAPPRKSSKFCQIEGCPNHLTSENLTLHKFPHSVDMCAKWQHNTQVPFDPDFRWRYRICSAHFEPICLINMRLVHGSVPTLHLGPRAPRQLFESDFEAINMRLDKQKSSSDQQLEQREQLQLQEEEDEDENALSYLVPEMQLHEDADVDGDADHSDTPVTYSNKWKDLRLPSIKQEKTMTLTNPVKSGYDKCSLVHCQRQRSQHGVHLYKFPRSKQLQKHWMHNLRIIYDERRPWKTMICSAHFEPNCIRLRKLRPWAVPTLELGDNVPQQIYTNEQSRMQLEMGSDNEELELSTDVSMSMSVSVEDTQLEGAYEDDDYDDEDDDYLATEPLVKRERRSRFDPLRPGQLPPWKMKFCSLPYCRSPRGDGIKLFRLPNNISSIRKWEKATGIHFTESQRNTKLICSRHFDPQLIGVRRLMYNAVPTLHLGPISAVSQPVAAPAAPRCCMPDCQEEVKLHKFPSDPLLLHQWCHALDLSDTQRYSSKHICARHLPAKAPSCIVCGVEHIKMPLLDFPENRNQRAKWCYNLKIEPIPKWDSSKQICSKHFESYCFVQPGQLLPEAAPTLHLQHNDSNIFLNDYTIDHSKMLRIKDEPLDSEDLML
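
The C2CH consensus quoted in the Description (Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His) can be written as a regular expression and run over the protein sequence. The sketch below does this for a short fragment copied from the protein sequence above; `find_c2ch` and `FRAGMENT` are hypothetical names. A regex hit is only a candidate spacing match and is much weaker evidence than the profile HMM scores reported by hmmscan.

```python
import re
from typing import List, Tuple

# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His
C2CH_RE = re.compile(r"C.{2,4}C.{35,50}C.{2}H")

# Fragment copied from the protein sequence in this record (illustration only).
FRAGMENT = (
    "CCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYATFRI"
    "CSMHFPKRCINRYSLCYWAVPTFNLGH"
)


def find_c2ch(protein: str) -> List[Tuple[int, int, str]]:
    """Return (start, end, text) for non-overlapping C2CH-like matches.

    Coordinates are 0-based with an exclusive end, as in Python slicing.
    """
    seq = protein.upper()
    return [(m.start(), m.end(), m.group(0)) for m in C2CH_RE.finditer(seq)]


c2ch_hits = find_c2ch(FRAGMENT)
for start, end, text in c2ch_hits:
    print(start, end, text)
```

Because `finditer` reports non-overlapping matches from left to right, nearby candidates that overlap an earlier match are not listed; a lookahead-based pattern would be needed to enumerate every overlapping candidate.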
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00601834;
- 90% Identity
- iTF_00601834; iTF_00585998; iTF_00585233;
- 80% Identity
- -