Dmur007042.1
Basic Information
- Insect
- Drosophila murphyi
- Gene Symbol
- GA10450
- Assembly
- GCA_018904325.1
- Location
- JAEIFX010001243.1:25092766-25108980[-]
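The Location value packs the contig accession, a coordinate range and the strand into one string. A minimal sketch for splitting it into fields is given below. The `Location` type and the `parse_location` and `span_length` helpers are hypothetical names introduced here, and the length calculation assumes 1-based, inclusive coordinates, which this entry does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Layout assumed from this entry: <contig>:<start>-<end>[<strand>]
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


def span_length(loc: Location) -> int:
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


if __name__ == "__main__":
    loc = parse_location("JAEIFX010001243.1:25092766-25108980[-]")
    print(loc, span_length(loc))
```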
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
 #  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to   acc
 1  28   7.6e-15   1.4e-11   44.9   4.1         1      86       756     828       756     829  0.85
 2  28   2.9e-15   5.3e-12   46.2   4.6         1      87       856     925       856     925  0.83
 3  28   7.2e-16   1.3e-12   48.2   0.4         1      87       947    1019       947    1019  0.85
 4  28   6.3e-16   1.1e-12   48.3   5.7         1      87      1114    1184      1114    1184  0.83
 5  28   8.8e-15   1.6e-11   44.7   3.4         1      86      1208    1279      1208    1280  0.81
 6  28   1.4e-12   2.5e-09   37.6   1.2         1      87      1315    1383      1315    1383  0.80
 7  28   3.1e-11   5.5e-08   33.3   1.9         1      86      1431    1500      1431    1501  0.77
 8  28     4e-17   7.2e-14   52.2   0.3         1      86      1528    1597      1528    1598  0.82
 9  28   3.5e-12   6.3e-09   36.3   1.3         1      86      1619    1688      1619    1689  0.80
10  28   1.4e-15   2.5e-12   47.2   1.7         1      86      1716    1787      1716    1788  0.85
11  28   2.1e-13   3.8e-10   40.2   1.6         1      85      1864    1932      1864    1934  0.82
12  28   3.2e-12   5.7e-09   36.5   0.1         1      86      1957    2025      1957    2026  0.82
13  28   5.1e-14   9.3e-11   42.2   0.9         1      86      2175    2243      2175    2244  0.82
14  28   9.1e-12   1.7e-08   35.0   1.0         1      61      2297    2351      2297    2372  0.80
15  28   1.7e-05      0.03   14.9   0.1         1      59      2378    2430      2378    2453  0.79
16  28   3.2e-11   5.9e-08   33.2   0.2         1      86      2468    2537      2468    2538  0.83
17  28   2.8e-14   5.1e-11   43.1   1.3         1      87      2596    2666      2596    2666  0.81
18  28   3.7e-13   6.7e-10   39.5   0.8         1      86      2701    2772      2701    2773  0.82
19  28   1.6e-13     3e-10   40.6   1.3         1      87      2783    2855      2783    2855  0.81
20  28   8.7e-14   1.6e-10   41.5   0.1         1      87      2878    2949      2878    2949  0.77
21  28   6.5e-06     0.012   16.2   0.1         1      58      2982    3035      2982    3054  0.84
22  28   7.6e-15   1.4e-11   44.9   0.1         1      86      3073    3145      3073    3146  0.80
23  28   4.7e-14   8.5e-11   42.3   1.4         1      86      3280    3352      3280    3353  0.81
24  28   1.6e-14   2.8e-11   43.9   2.4         1      87      3416    3487      3416    3487  0.83
25  28   8.8e-15   1.6e-11   44.7   4.0         1      86      3600    3670      3600    3671  0.85
26  28   2.2e-13   3.9e-10   40.2   0.1         1      87      3763    3833      3763    3833  0.85
27  28   6.2e-10   1.1e-06   29.1   0.5         1      58      3850    3898      3850    3914  0.86
28  28   9.4e-09   1.7e-05   25.3   2.2        18      87      3915    3973      3904    3973  0.75
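The C2CH consensus quoted in the Description (Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His) can be written as a regular expression. The sketch below is only an illustration and is not the method behind the hmmscan hits listed above. The pattern, the `find_c2ch` helper and the `FRAGMENT` constant are assumptions introduced here; the fragment itself is copied from the protein sequence of this entry. A plain regex ignores the AVPTIF box and the other conserved residues, so it will likely both miss and over-call domains compared with a profile HMM.

```python
import re

# C2CH consensus from the Description, written as a regex (an assumption of
# this sketch): Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His.
# Note that "." also matches X, the placeholder for unknown residues.
C2CH_RE = re.compile(r"C.{2,4}C.{35,50}C.{2}H")

# Fragment copied from the protein sequence of this entry.
FRAGMENT = "CCVPLCGVRKSTSPTLQFFTFPKDDKYLHQWLHNLKMFHIPASSYATFRICSMHFPKR"


def find_c2ch(protein: str) -> list[tuple[int, int]]:
    """Return 0-based, end-exclusive spans of non-overlapping C2CH matches."""
    return [m.span() for m in C2CH_RE.finditer(protein)]


if __name__ == "__main__":
    for start, end in find_c2ch(FRAGMENT):
        print(start, end, FRAGMENT[start:end])
```

Because `finditer` reports non-overlapping matches only, closely spaced candidate motifs would need a lookahead-based pattern instead.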
Sequence Information
- Coding Sequence
- ATGGTTCAACTGTTTAAATTCTTGCTTAAATCGACAAAATCGCCGGCGCGTGCTCATTTTCGGCCCAGCTTCCTGGATACACTGCGTCTAACGGTGCGTGGGGGACATGGCGGCAACGGTTTGCCCAAATACGGAGGAGTTGGTGGCCAAGGTGGCTGCGTCTACTTTGTGGCCAAGGAGGGGCACACGCTGCGTAAGGTGGCGCAGAGCCTGAAGGATAAACGCGTGTATGCCACAAGCGGCGAGGACAGCAGCAAGCTGAGCATATTCGGGCGTCGAGGCGTCGATCAGCGCATAGAAGTTCCTGTTGGAGTTCAGGTCTACGATGAAAAGCACAAACTGCTGGCCGATCTCAATGAGGACGAGGCCAGTTGCATTGTGGCAGGCGGCGGCACTGGCGGCTGCACGGGCAACAATTTCCTCGGTCGTCCCGGCGAGAGTCGCATTGTGAATCTGGATCTTAAACTGATTGCAGATGTGGGCCTAGTTGGGTTTCCCAATGCCGGCAAAAGCACCCTGCTGAAAGCTATTTCCAATGCCAAGCCCAAAATAGCAGCCTATCCCTTCACCACGATTCGTCCGCAGATTGGAACGATTGAGTACAGTGATTTGCGCTCCATCAGCCTGGCCGATCTGCCGGGTCTCATTGAGGGCGCTCACGCCAACTTCGGCATGGGACACAAGTTTCTGAAACACATTGAACGCACACGTTTGCTGCTCTTCATGGTGGACATCTTTGGGTTTCAGCTCAGTCCGCGGCATCCGCATCGCGATTGTTTGGCCAACATCTATGCGCTGAACAAGGAGCTGGAGTTATACGATCCGACGCTGCTGGAGAAGCCCTGCTTGCTGTTGCTTAACAAAATGGATAAGGAGGGTGCACAGGACATCCTGGCAAAAGTGAAGCCTAGCATAAGCGATCTGTCTGTGGGTCTGGCAGAGTGTCCAGAGGAAGTGCGTCCCAGTAGAGTGTTGAAATTCGAACATATTTTGCCCATATCCGCCAAAAATGCAACACGAATAAGCCAAGTAAAGGAACAATTGCGCGAAACGCTTGACGAGCTGGCCGAGGAGCAACTGGTAGTTGATGGTCAGGTGTTGAAAGAGCAACTGCACCAGCGGAATCCTATGGCACCGCCGCCAGCACCCGCCATTGCTAATCGTCATTCGCTCGATGCTAGTGGCGAAATGATAATTAAATCGGAACCCATTGACGAACATGCTTTCAAGTCCAACTATATCGATGATAATACTCCCTTTGCCGATTTTAGTAAATATCCCGAATTCGGCGACGATATGCTAAGTCCCAAACTAGAGCTAAACGTCAAGGATGAGGCCTATGGAAACCACAAAAACCCGCTGAACTACCCACGTCGTAAGCTCCAAACGGATCGCTCCGCGGAGATTATGCCCATTTGTCAGCGCTGCAAAGAGGTGTTCTTCAAGAAGCACATTTACCTGCAGCATGTGGCCGAGAGCAATTGCAGCATACACGAGTATGAATTCAAGTGCAACATCTGTCCCATGTCCTTTATGGGCGGCGAGGAGCTGCAGAAGCACAAGCATCTGCATCGAACCGACAAGTTCTTCTGCCACAAATACTGTGGCAAGCACTACGACTCGATTGCAGAGTGCGAATCGCACGAGTACATGGAGCACGAGTACGATAGCTTTGTGTGCAATATGTGCTCTGTTACGTTCCCCACACGGGAACAGCTGTATGCCCATTTGCCGCAACACAAGTTCCAACAGCGTTACGATTGCCCCATTTGCCGATTGTGGTACCAAACGGCATTAGAGCTGCACGAGCATCGACTGGCGGCGCCCTACTTTTGTGGGAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTGGGCACCATTGAAATGACTCCACCGCAGCACAAGGCAAATGCGGCATTA
CCGGCAACGGCGGCGCTCAATTCGCTGTTGCAGCAACGCCAGGCGAACGCTGATGGCGCCGCTTTATATGCCTCGGCGTTGAAGAGCGAGACGAACGTGAAACTGGAGCGCAGCTATAGCAATTCCACCAGCGAGTCTGGTTACAGCATCCACGAGAGCAGCTATAACAATGCCTACGCCAGCGACAATTCTCTGCATGGCGGGGGCGGGGCAATTGGTGGTCCGCAGGCGCATTCCTCGACGCTGGACGATTCGGAGGATGCGCTGTGCTGTGTGCCACTTTGCGGAGTACGCAAGAGCACAAGCCCAACGCTGCAATTCTTTACGTTTCCCAAAGATGACAAGTACTTGCATCAGTGGCTGCACAACCTCAAGATGTTTCACATTCCGGCGTCGAGCTATGCTACCTTTCGCATCTGCAGCATGCACTTCCCTAAGCGTTGTATCAATCGTTACTCTCTGTGCTATTGGGCGGTGCCCACATTTAATCTGGGCCACGACGATGTGGCCAATCTCTATCAGAATCGTGAGCTGACCAACACATTCACCACAGGCGAAGTGGCCCGCTGCAGTATGCCAAACTGTACTAGTCAGCGTGGTGAGAGTAATCTGAAGTTCTACAACTTTCCCAAGGATATCAAGAGTTTGATTAAGTGGTGCCAAAACGCTCGCCTGCCCGTCCAGGCCAAGGAGCCGCGTCACTTCTGCAGTCGCCACTTCGAGGAGCGTTGCATCGGCAAGTTCCGGCTGAAGCCTTGGGCAGTGCCCACCTTACATCTTGGCGCCCAGTACGGCAAGATTCATGACAATCCCAAAAATCTGTATGTGGAGGAGAAGCGCTGCTGCCTCAACTTTTGTCGTCGCAGTCGCTCCTCCGACTTCAACATGTCGCTGTATCGCTTCCCCAGGGATGAGGTGCTACTGCGTCGTTGGTGCTACAATCTACGCCTTGATCCGGCTGTCTATCGTGGGAAGAATCACAAAATTTGTAGCGCTCACTTTATCAAAGAAGCATTGGGATTGCGCAAGCTATCTCCGGGCGCTGTGCCCACGCTGCATCTGGGTCATAATGACACCTTTAACATCTACGAGAACGAACTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTCGGCTGCGTCCACATCCTCGTCGGCCTCGTCAACATCGCATTATGTGGATCCGGAGCTAAGTGCATCCTACATGAGCATGGGCGCTGGAGGCTCAGCCTCTGGCCTTAATGTCAGCGACAGCATGGATGTCTGCTGTGTGCCCAGCTGCGAGAGCAAGCGCCACAACAATGAGAACATCACATTCCACACAATACCCAGGCGGCCAGAGCAGATGCGCAAGTGGTGTCACAATCTTAAGATACCCGAGGACAAGATGCACAAAGGCATGCGGATATGTAGCTTGCACTTTGAGCCCTACTGCATTGGCGGCTGCATGCGTCCGTTTGCGGTGCCGACATTGCATCTGGGCCACGACGACGAGGACATTCACCGCAATCCGGATGTGATCAAGAAGCTCAACATACGCGAAACTTGCTGCGTGGCTGTTTGCAAACGCAATCGTGACCGGGACCATGCCAATCTGCATCGCTTTCCCAGCAATGTGCCGCTGTTGACCAAATGGTGCGCAAATCTGCAGCGCCCTGTGCCGGATGGCAGTAAACTGTTCAACGATGCCATCTGTGAGGTGCACTTTGAGGATCGATGCCTGCGCAATAAACGGCTAGAGAAGTGGGCAGTGCCCACACTCATCCTTGGCCATGAGAATATACCCTATCCGCTTCCCACGCCGGAGCAAGTTACCGAGTTCTATGCGCGTCCCACTGCGCCTAACAATGGCGAGGAGCAGGGCGAGTGCTGTGTGGAGACGTGCAAGCGTAATCCCAGTGTTGATGACATCAAGCTATATCG
CCCGCCCGAGGAGTCGCAGGTGCTGGTAAAGTGGGCGCACAATCTCCAACTGGAGATTGCCCAGCTGCCCAATATGAGAATATGCAATCTGCATTTCGAAACCCACTGCATTGGCAAGCGGATGCGTCCCTGGGCAATACCCACGCTCAATCTGGCAACTAACATAGAGAATCTCTACGAGAATCCCGAACACCAGATGCTCTACAAGCGGCGCACGCATCTCAAGCCGGGCAGAGCAGCGCGAAGCTCTGAAGCAAGCGCTGGTGGTGTGAAGCCCACCTGGGTGCCACGCTGCTGCTTGCCACACTGCCGCAAGGTGCGTGCCACACACAATGTCCAGCTGTATCGCTTCCCCAAACTCAATCGTTCCACGCTGGCCAAGTGGGCGCATAATCTGCAGGTGCCGCTCGTGGGCAGCGCTCAGCGTCGCCTCTGCTCCGCACACTTTGAGCCGCATGTACTTAGCAAGAAATGCCCGGTGCCCATGGCGGTGCCCACACTGGACCTCAATACACCATCCGGTTACAAGATCTATCAGAATCCGGCCAAGCTCAAGGCGAATAAGCTGTGCTTGCAGCGTGTCTGCATTGTGGAGAGCTGCCGGCGTCAGCGGTCGCAGGGGGTGCAGCTCTTCCGTCTGCCTCACAGCCCCACCCAGCTGCGTAAGTGGATGCACAACATCCGGATGCGGCCCCGAGGAGCCATGCGACAACAGTATCGCATCTGCTCGAAGCACTTCGAGACGCACTCGTTCAATGGGAAGAGACTCAGTGCGGGTGCAATTCCAACGCTTGAGTTGGGCCATGAGGACGAAGACATATTTCCGAATGAGGCGCAGTCTTTCGTGGAGGAGCACTGCACCGTCGAGGGCTGCGATGCCGTCAAGGAGCAACCGGATGTGCGTCTCTTCCGCTTCCCCAACGACGATGAGGATCTGCTCTGGAAGTGGTGCAACAATCTGAAAATGAGTCCGGTCGACTGCATCGGCGTTCGCATCTGCAACAGACACTTCGAGACTGATTGCATTGGACCAAAGCACCTGTTCAAGTGGGCTATTCCTACGCTCTCCCTCGGCCACGATGATGATGACATCGAGTTGATGCTAAATCCCAAGCCGGAGGAGCGCTATATTGATCCGGTATTCAAGTGCTGTGTGCCCTCGTGCGGCAAGACGCGTAAATTCGATGAAGTGCAGATGAACAGTTTTCCCAAAGATCCGGAGCTCTTTCAGCGCTGGCGCCACAATCTCCGCCTCGAGCATCTCAACTTCAAGGAGCGCGAACGCTATAAGATCTGCAATGCCCACTTCGAGGACATTTGCATTGGTAAGACGCGCTTGAACATTGGCTCCATACCGACACTGGAGCTTGGCCATGACGAGACTGATGACTTCTTCCAAGTCAACCCCGAGGAGCTACAGAGCAATCTCTTTGGACGCCAGAGACGCGTGCAGGATTCCATGAGGATCAACATTAAGCAGGAGGCGCACTCCGACCTCGATGAAGACACTAAACCGGACATTAACATGTCGGAGGCCACAGATTCAAATACAACACAGGCTAAAATCAAAAAATCTATGACCGATTTCAAGTGCTGTGTGACGAACTGTGGTCGCAGTCGTCTGGAGCATGGTGCCCGCCTCTTTCCGTTTCCGAACGGGAAACAGCAGCAGAGTAAGTGGCGCCACAATCTCCGGCTGCCTGCTGCCGACGTGGACAAGACGACGCGCATCTGCAGCGCCCACTTCAATCGCCGTTGCATTGATGGCAATCAGCTGAGGGGCTGGGCAATGCCCACACAGCAGCTGGGACATCAGGAGCTGCCGATCTATGAGAATCCAAAGAATATACCGGGCTTCTTTACGCCCACCTGTGCGCTGGCGCACTGCCGAAAACGTCGCAGCATTGACAACGATCTGCGTACCTATCGCTATCCACGCAGTGAGGAGCTGCTCGAGAAGTGGCGTGTCAATCTGCGCTTGTCGCCGGACCAATGCCGCG
GACGCATTTGTGCGGATCACTTCGAGCCACTGGTGCGTGGCAAGCTTAAGCTGAAGACTGGAGCAGTGCCTACGCTCAAATTGGGACACGACAAGGGCGTAGTCTTCGATAATGAGGGCATTAAGGCGGGTCTGCAGCTGGAGGAGGAGGCGGAGGAAGAAGAGGGCAATGCCAGCTTGAAGTCGTTGGTCAAAGTAAAGACTGAGCAGGAGGATGAGCAGGAGCTAGAGAATGAAGATGAAGAGCAGCTGGAGCAGGAGCAGGATCAAGATATGGACGAAGATGGGGAAGAGCACCAAGACTCTGAGGAACATGGCTATTTTGATCCCTTGGAACTTGTGGAAACCTACGCTGAGCACCATAGCGATGATAACTCTGCCGGACATGATAATCTCGACGATGATGATGACGAAGATGGGGACATTCCCGGCAATGACGATGAGCTGCTTCTGCCTGATACGCGGCCACTTCGAATGACAATGGCTCCGCGGCGCGAGAAGGCTGTGAATAATGTGACGCCTATTTGCTGTCTGAAGCACTGTCGCAAGGAGCGCACCGCCATCCATCATCTGAGCACCTTTGGCTTTCCCAAAGATCCGCAGCTGCTGCTCAAGTGGAGCGCCAATTTGCAGCTACCATTGGAGTCGTGCATGGGTCGTGTATGCGTCGAGCACTTTGAGCCCTCGATGCTGGGCACGCGCAAGCTGAAGCAGAATGCGGTGCCCACCTTGAAACTGGGCCATGCCACACCGCTCACCTACAGCTGCAATGGCCGGATGCTGTCGGGCATTTACGATGAACAGCCACAGCACTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGCAAACGGAAACCGGATCTGGCGGAGATTAAGCCCGGTCGTCGCTGTTGCCTGCCAAGTTGCGAAAAGCAGTCGGAGTCGCACGGCGTCCAGCTGCAGCGTCTGCCGAAGGATCGTCTGATGCTGCGCAAATGGTTGCACAACCTCAAGCTGCCTCCAACGATGGACTGCACCCAAATGTTCCTCTGCAGCGATCACTTTGAGCTGAATGCGCCGTGTCCCACTTTGAAACTGGGACACTCGGATACCAATATTTATCGCCACAGTGTGGCTAGCACCAGTGGCAGCTGCCTGGTGCCCAAATGTACTTGTGCTCGTCTCAATCTCTATCGCGGCTATGATCTGCCTGCGCATCCGCAGGTGCAACAGGCTTGGCTACACTGGCTGCAGCTGCCCCATCCGCAGCCGTCGCCCAAGCACGCCCAGCTGTGTGTGATGCACTTTATGCAGCTCTACGAACTGGTGCCGCTGCCCAAATCGGTGCCAGATGTTGTGCGCAGGCAGCTGCGGGAGACTTACGAACTGATATCCAGTTCCAGCATGGCCATGAAGCTGCGTTGCGCTGTGCCCGGCTGCTACTCGAAGTATACGGACAATGTGCGTCTGACCAAGCTGCCCGTTTACCCCGACACCTGCGCCAAGTGGGTGCACAACACCAAGATTCAATATGATTCGGCCCGACATTATGTCTATCGCATTTGCATGTTGCACTTCGAGCCAGGTTGCCTGGGCCCAGTGCGTCCTAAAGTGTGGGCAATGCCAACGCTGCAGCTGCACCACAAGGATGCCAACATCTATTTAAATCCCAAACTGGATGGCAGCCAAACACAGTCGGCCGTGCCGCTGGACCTGCCACTGCGTATTAAAACTGAGCTGCCTATGTGCAACAGTCCCAGCTTTAGTGCGAGTGCCAGTCCCAGTCCGCGTGGCAAGCTGCGCACTTGCTGCATTCCCAGCTGCGGTCAGCAGGCCTCGGCCCTGACGCGACTCTTTCGCTTTCCTAGCGCAGAGACATCGATGCTGAAGTGGCTGGTGAATACCCAGCAGCAGCCACGCTTTGTCGATGCACAACGGCTGTTCGTCTGCCAGGATCACTTCGAGGCGGAGGCCATTTGCAAGAATCAGCTGCGCAGCTGGGCGGTACCAACACTGAAT
CTAGGACACGATGGACACATCATACCGAATGCCAAGCACAATGGCAACATTGCCGACAGCCAGGAGAACAAGCAGACGCTGCAGTTCATCTGGGCCAACTACTGTTCAGTGCTGCCCTGCTTCCAGAAAAGTAGCGAGCAGCTGCGTCTCTACCAATACCCCACGGATCGGCCAACCATCCGCAAGTGGGCCGCCAATTGTAAGCATCGCTCCATGCAGGCCAGCAGTGATGGATTCCAGGTGTGTCAGTCGCATTTTACGCCGGATTGCTTTGATTCTGATACCGGGGAGCTGAAAGAGGACGCTGTGCCCACACTGGCGCTGAGCCGGTCTGTCACTGAGGTGCGCTGTGTGGTCAATGGTTGCGTTAAGGACGAAGATGCATCGCGTCGCCGTCTGTTCAAGATGCCCAAGCGTAACCCACAGATATTGGATTGGTGCCACAATTTGCGGCTGGATCAGGCGGCCATGAACGGCTCGGAACAGCACGTTTGTGAACGTCACTTCGAGGCGAACTGCTTCAATGCGTCTAGAGTGCTGCGTCCAGGAGCACGACCCACACTTCATTTAGGTCATGAGGACCTAGACGATGTGATACCCAATCCAGCGAACTGGGAAGAGGATGTGATCGTGTGCTGTGTCCCCCACTGCGAAAGCTCCAAGGATGCGGATGAAGTCCAACTGTTTGGGCTGCCAAAGGTGCGCCAGTTGGCGGACAAGTGGCTGCAAAATGTGCACCTCGATCCGAGCAAAGAACAACTGGCCGGCCTGAAGATCTGCAGTGTGCACTTTGAGGCGAGCTGCATGGAGAATGGACGACCCACCTATGGTGCAATGCCCACACTCCATCTCGGTCACGATGAGCTCGACAATATACACCCAAACGTAGAGTCGGTGCCGACGCAGCAGAAGCGCTACTGCAATAGAGATGGCGCCAGTCACGATTGCTGCTATCCGCAGTGCGTGGAGCTGCAGAAGAGCTATCTGCGGGTCACCTACGAACTGCCCCAGGAGCAGGAGCTCCGTCAGCAGTGGCTCTCTTATATGGGCCTGGAAGCGCATCAGCTCGATAAACAGCAGCTGCCCAAGCTCTGTCCACTCCACCTAATCTTGCTCTACGATCACAGTGCGGATCACTTTTCGGCACACGCCGCTGAGGAGCTGTTGGACTCTAATTATGAAGCAGCGCGCAGCAGCGTTCGCATACGCGTTGTCAGCTGTGCTGTGCGCGGCTGCAGAACGCTCAAACCACGCGACGGTGGTCGGCTGCATGGTTTGCCCACGCGGCGAGATCTGCTGGAGATGTGGCTGCACAACATGCAGCTGGTGTTTTACGAGCAACAGCGTTATATGTACAAGATTTGCAGCAAGCACTTTGAGTCCACATGTTTCACGGAGACAACCAAGCGGCTGAAGCCGTGGAGTATGCCTACCCTCGAGTTGCCGGAGCGCCAACCGGGCGAGCTGCCTGCCTATCAGAATCCCACAGAGTTGGAGTGGCAACACATGAATGAACTGCAGGTCAGCGAGAAAGTTGTTGAGGCTCAGCCGGAGCCATTACTTAATCTGGAGCCGTTGCACAAGAAGGAGCCACCACCACCGCAGGTTGTGGAATATGAAGAGGATTGCGACAATAACTCACAGCAGCCACTGGAAATGCAGGCGCTGGAGGTGCTGCTCGAGGTGGGCCATGTCGAGAAGTGCACCACCTACGAGCAAATGGATACCGAGGCAAATCTCAACTATGCCGAGCAGTTCTCGCACAATCCCCTCAGTCCAGGTCCACCTCAATGCCGTATCCCCGTTGTCCAGAATGGACTCCACTACAGTGCACGCCACTGCAGCGTGCATGGCTGCAATGTCACCTCCAATAATCTGAGCAGCAGCATCAAGCTACACAAGTTCCCCGTCTCGCTGGATGCCATGCAAAAGTGGATGCACAACACCCAGGTGCTCGTGGACGTCAAATTCGCTTGGCGTTTTCGAATCTGCAGTCATCA
TTTCATCGAGGATTGCTTTCACGGCTCGCGCATCAGGCGTGGGGCGATGCCCACGTTGCGACTGGGCTCACGTCGACCGAAGCATATCTATGATAATGAGTTCAACGCCCAACTGCAACTGGAACAGTCTAAAGAAGAGGCCAGGGAGGCTCTCGCTGCCCCGCTGGAGTCTCAGCAACAGTTGCTCTCTGCGAATGTAGGTCTGCGTCTGCCGCGTCCAGCCCCGCCCTGCAAATCCAGCAAATACTGTCAGATCGAAGGCTGCTCCAATCATTTGACCAGCGAAAATGTGACTCTGCACAAGTTCCCCCATTCGTCGGATATGTGCGCCAAGTGGCAGCACAACACTCAGGTGCCCTTCGATCCCGAGTTCCGCTGGCGCTATCGCATCTGCAGCGCACACTTTGAGCCCATTTGTCTAGGCAATGTGCGACTGATGCACGGCAGTGTGCCCACCCTGAATCTGGGGCCGCTTGCGCCCAAGAAACTGTTTGAGAATGAATTCTTGCGTCTGGACAAGCCAATGAGCAGTTCGGAGCTGGGTATGACCGTCAAACAAGAACAAATGGAGCAATTTGATCAAATGGAGCTGGAAGATGGCAACCAGGAGCAGGATGATTTCAGTCTGCTGGAGCCCGAGCTGCAGTTGCACGAGGATAGCGAGGAAGAGCAAGAATATGACAATCATTTTAGCCAAAACGATTCCCATAGCTGGTCCGATCAGCAGCTGCGTCTGCCCAGCATTAATCAGGAGAAGTGCACCACCATCTACAATCCAGTCAAGTCCGGCTATGATAAGTGCTCACTGGTCCACTGCCAACGACAGCGTTCGCAGCACGGCGTGCACATCTACAAGTTTCCACGCTCGCGTCAGCTACAGCAACGATGGATGCATAATTTGCGCATCCAATACGATGAGCGACGGCCGTGGAAGACAATGATATGCAGTGTCCATTTCGAGCCACACTGCATCCGTCTACGCAAGCTGCGTCCCTGGGCGGTGCCCACGCTGGAACTTGGGGACAATGTGCCGCTGGAGATCTTTACGAATGAGCAGAGCCAGCAGCTGTTTGCTCAGTCCGAAGCAGGCAGCGAGTGTGATGACGTTGAAGTGGATGTTGAGGACACCATACTGGAGGACATGGATGATGACTATGATGACAATGACGCTGATATGAATGTGAATGCTGATGATCAAATGCGAACAGCTCCATATGTCAAAAGAGAGCGTCGCTCTCGATTTGATCCTCTGCCACCGGGTCAGCTGCCACCGTGGAAGATCAAATGCTGCTGTTTACCCTATTGCCGCAGTCCTCGCGGTGATGGCATCAAGCTCTTTCGACTTCCCAACAATATCAGCTCCATACGTAAATGGGAGCAGGCCACAGGCATGCGCTTCTATGAGTCCCAGCGAAACACAAAGCTCATCTGCAGTCGACACTTTGATCCGCAGCTTATAGGCGTGCGTCGCCTCATGTCCAATGCGGTACCCAGCCTCCATTTGGGCCCAGACAGCGCAGAGCCCGAGCTGCCTCCTGTGGGACCACGTTGCTGCATGCCCGATTGCTCTGAGGATGTCAATGTCCAGCTGCACAAGTTTCCCAAAGATCCCATGCTGCTGCATCAATGGTGTCAGGCGCTCAATCTACCGGATGTTCAAAGCTACTCCGGCAAATTCATTTGTGCGTCACATCTGCCCTCCAACGCGATGAGCTGTCTAATTTGTGGCGTGGACGATGTACAGCTGCCAATGCTGGACTTTCCCCAAAATCGCAATCAGCGCACCAAGTGGTGCCACAATCTGAAAATCGAGTCTCTGCCCAAGTGGGACAACTCAAAGCAAATTTGCTGCAAACACTTTGAGAGCTTTTGCTTTATCCAGCCGGGTCAACTTCTGGCGGAAGCATTGCCCACTCTACACTTGGAGCACGGGGATTGCAACATATTCCTAAACGATGACACCATGGATAACAGCAAGTTGTTGCGCATCAAGGACGAGC
CCATGGAGAGCGAGGATCTGATGCTGTAA
- Protein Sequence
- MVQLFKFLLKSTKSPARAHFRPSFLDTLRLTVRGGHGGNGLPKYGGVGGQGGCVYFVAKEGHTLRKVAQSLKDKRVYATSGEDSSKLSIFGRRGVDQRIEVPVGVQVYDEKHKLLADLNEDEASCIVAGGGTGGCTGNNFLGRPGESRIVNLDLKLIADVGLVGFPNAGKSTLLKAISNAKPKIAAYPFTTIRPQIGTIEYSDLRSISLADLPGLIEGAHANFGMGHKFLKHIERTRLLLFMVDIFGFQLSPRHPHRDCLANIYALNKELELYDPTLLEKPCLLLLNKMDKEGAQDILAKVKPSISDLSVGLAECPEEVRPSRVLKFEHILPISAKNATRISQVKEQLRETLDELAEEQLVVDGQVLKEQLHQRNPMAPPPAPAIANRHSLDASGEMIIKSEPIDEHAFKSNYIDDNTPFADFSKYPEFGDDMLSPKLELNVKDEAYGNHKNPLNYPRRKLQTDRSAEIMPICQRCKEVFFKKHIYLQHVAESNCSIHEYEFKCNICPMSFMGGEELQKHKHLHRTDKFFCHKYCGKHYDSIAECESHEYMEHEYDSFVCNMCSVTFPTREQLYAHLPQHKFQQRYDCPICRLWYQTALELHEHRLAAPYFCGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTIEMTPPQHKANAALPATAALNSLLQQRQANADGAALYASALKSETNVKLERSYSNSTSESGYSIHESSYNNAYASDNSLHGGGGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDDKYLHQWLHNLKMFHIPASSYATFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSAASTSSSASSTSHYVDPELSASYMSMGAGGSASGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVPLLTKWCANLQRPVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHENIPYPLPTPEQVTEFYARPTAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESQVLVKWAHNLQLEIAQLPNMRICNLHFETHCIGKRMRPWAIPTLNLATNIENLYENPEHQMLYKRRTHLKPGRAARSSEASAGGVKPTWVPRCCLPHCRKVRATHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPMAVPTLDLNTPSGYKIYQNPAKLKANKLCLQRVCIVESCRRQRSQGVQLFRLPHSPTQLRKWMHNIRMRPRGAMRQQYRICSKHFETHSFNGKRLSAGAIPTLELGHEDEDIFPNEAQSFVEEHCTVEGCDAVKEQPDVRLFRFPNDDEDLLWKWCNNLKMSPVDCIGVRICNRHFETDCIGPKHLFKWAIPTLSLGHDDDDIELMLNPKPEERYIDPVFKCCVPSCGKTRKFDEVQMNSFPKDPELFQRWRHNLRLEHLNFKERERYKICNAHFEDICIGKTRLNIGSIPTLELGHDETDDFFQVNPEELQSNLFGRQRRVQDSMRINIKQEAHSDLDEDTKPDINMSEATDSNTTQAKIKKSMTDFKCCVTNCGRSRLEHGARLFPFPNGKQQQSKWRHNLRLPAADVDKTTRICSAHFNRRCIDGNQLRGWAMPTQQLGHQELPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEELLEKWRVNLRLSPDQC
RGRICADHFEPLVRGKLKLKTGAVPTLKLGHDKGVVFDNEGIKAGLQLEEEAEEEEGNASLKSLVKVKTEQEDEQELENEDEEQLEQEQDQDMDEDGEEHQDSEEHGYFDPLELVETYAEHHSDDNSAGHDNLDDDDDEDGDIPGNDDELLLPDTRPLRMTMAPRREKAVNNVTPICCLKHCRKERTAIHHLSTFGFPKDPQLLLKWSANLQLPLESCMGRVCVEHFEPSMLGTRKLKQNAVPTLKLGHATPLTYSCNGRMLSGIYDEQPQHSVFRLWSLKHCRKRKPDLAEIKPGRRCCLPSCEKQSESHGVQLQRLPKDRLMLRKWLHNLKLPPTMDCTQMFLCSDHFELNAPCPTLKLGHSDTNIYRHSVASTSGSCLVPKCTCARLNLYRGYDLPAHPQVQQAWLHWLQLPHPQPSPKHAQLCVMHFMQLYELVPLPKSVPDVVRRQLRETYELISSSSMAMKLRCAVPGCYSKYTDNVRLTKLPVYPDTCAKWVHNTKIQYDSARHYVYRICMLHFEPGCLGPVRPKVWAMPTLQLHHKDANIYLNPKLDGSQTQSAVPLDLPLRIKTELPMCNSPSFSASASPSPRGKLRTCCIPSCGQQASALTRLFRFPSAETSMLKWLVNTQQQPRFVDAQRLFVCQDHFEAEAICKNQLRSWAVPTLNLGHDGHIIPNAKHNGNIADSQENKQTLQFIWANYCSVLPCFQKSSEQLRLYQYPTDRPTIRKWAANCKHRSMQASSDGFQVCQSHFTPDCFDSDTGELKEDAVPTLALSRSVTEVRCVVNGCVKDEDASRRRLFKMPKRNPQILDWCHNLRLDQAAMNGSEQHVCERHFEANCFNASRVLRPGARPTLHLGHEDLDDVIPNPANWEEDVIVCCVPHCESSKDADEVQLFGLPKVRQLADKWLQNVHLDPSKEQLAGLKICSVHFEASCMENGRPTYGAMPTLHLGHDELDNIHPNVESVPTQQKRYCNRDGASHDCCYPQCVELQKSYLRVTYELPQEQELRQQWLSYMGLEAHQLDKQQLPKLCPLHLILLYDHSADHFSAHAAEELLDSNYEAARSSVRIRVVSCAVRGCRTLKPRDGGRLHGLPTRRDLLEMWLHNMQLVFYEQQRYMYKICSKHFESTCFTETTKRLKPWSMPTLELPERQPGELPAYQNPTELEWQHMNELQVSEKVVEAQPEPLLNLEPLHKKEPPPPQVVEYEEDCDNNSQQPLEMQALEVLLEVGHVEKCTTYEQMDTEANLNYAEQFSHNPLSPGPPQCRIPVVQNGLHYSARHCSVHGCNVTSNNLSSSIKLHKFPVSLDAMQKWMHNTQVLVDVKFAWRFRICSHHFIEDCFHGSRIRRGAMPTLRLGSRRPKHIYDNEFNAQLQLEQSKEEAREALAAPLESQQQLLSANVGLRLPRPAPPCKSSKYCQIEGCSNHLTSENVTLHKFPHSSDMCAKWQHNTQVPFDPEFRWRYRICSAHFEPICLGNVRLMHGSVPTLNLGPLAPKKLFENEFLRLDKPMSSSELGMTVKQEQMEQFDQMELEDGNQEQDDFSLLEPELQLHEDSEEEQEYDNHFSQNDSHSWSDQQLRLPSINQEKCTTIYNPVKSGYDKCSLVHCQRQRSQHGVHIYKFPRSRQLQQRWMHNLRIQYDERRPWKTMICSVHFEPHCIRLRKLRPWAVPTLELGDNVPLEIFTNEQSQQLFAQSEAGSECDDVEVDVEDTILEDMDDDYDDNDADMNVNADDQMRTAPYVKRERRSRFDPLPPGQLPPWKIKCCCLPYCRSPRGDGIKLFRLPNNISSIRKWEQATGMRFYESQRNTKLICSRHFDPQLIGVRRLMSNAVPSLHLGPDSAEPELPPVGPRCCMPDCSEDVNVQLHKFPKDPMLLHQWCQALNLPDVQSYSGKFICASHLPSNAMSCLICGVDDVQLPMLDFPQNRNQRTKWCHNLKIESLPKWDNSKQICCKHFESFCFIQPGQLLAEALPTLHLEHGDCNIFLNDDTMDNSKLLRIKD
EPMESEDLML
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00595971;
- 90% Identity
- iTF_00598913; iTF_00543779; iTF_00577092; iTF_00616228; iTF_00498183; iTF_00518472; iTF_00542472; iTF_00511308; iTF_00564684; iTF_00501837; iTF_00582991; iTF_00522205; iTF_00525161; iTF_00499616;
- 80% Identity
- -