Dbip009583.1
Basic Information
- Insect
- Drosophila bipectinata
- Gene Symbol
- -
- Assembly
- GCA_018153845.1
- Location
- JAECYF010000051.1:234497-247512[-]
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  29       2.4   6.5e+03   -2.2   3.3        38      62       334     361       325     377  0.59
 2  29     2e-15   5.4e-12   46.1   4.1         1      86       570     642       570     643  0.85
 3  29   7.6e-15     2e-11   44.3   5.0         1      87       670     739       670     739  0.83
 4  29   6.9e-16   1.8e-12   47.6   0.2         1      87       761     833       761     833  0.85
 5  29   3.8e-16     1e-12   48.5   5.8         1      87       932    1002       932    1002  0.81
 6  29   4.3e-15   1.2e-11   45.1   3.5         1      86      1026    1097      1026    1098  0.81
 7  29   7.1e-13   1.9e-09   38.0   1.2         1      87      1133    1201      1133    1201  0.81
 8  29   4.9e-11   1.3e-07   32.1   1.6         1      86      1241    1310      1241    1311  0.75
 9  29   1.7e-17   4.6e-14   52.8   0.3         1      86      1338    1407      1338    1408  0.82
10  29   6.3e-13   1.7e-09   38.2   1.5         1      85      1429    1497      1429    1499  0.79
11  29     1e-14   2.8e-11   43.9   1.0         1      86      1526    1597      1526    1598  0.85
12  29   3.4e-14   9.2e-11   42.2   2.0         1      86      1678    1747      1678    1748  0.82
13  29   4.4e-13   1.2e-09   38.7   0.1         1      86      1771    1839      1771    1840  0.82
14  29   1.2e-13   3.2e-10   40.5   1.2         1      87      1967    2036      1967    2036  0.81
15  29   3.9e-08    0.0001   22.8   0.0         1      86      2129    2194      2129    2195  0.77
16  29     4e-07    0.0011   19.6   0.0         1      58      2210    2257      2210    2273  0.81
17  29   1.7e-12   4.6e-09   36.8   0.2         1      87      2287    2359      2287    2359  0.80
18  29   2.7e-14   7.2e-11   42.5   0.5         1      87      2416    2486      2416    2486  0.81
19  29   2.8e-10   7.4e-07   29.7   0.0         1      86      2518    2589      2518    2590  0.79
20  29   1.8e-13   4.9e-10   39.9   0.0         1      87      2600    2672      2600    2672  0.79
21  29   1.1e-15   2.9e-12   47.0   0.2         1      85      2689    2759      2689    2761  0.82
22  29     2e-06    0.0055   17.3   0.1         1      58      2793    2840      2793    2866  0.85
23  29   1.2e-12   3.2e-09   37.3   0.1         1      87      2878    2950      2878    2950  0.82
24  29   2.9e-15   7.9e-12   45.6   0.2         1      86      3056    3128      3056    3129  0.80
25  29   6.4e-13   1.7e-09   38.1   3.3         1      86      3190    3260      3190    3261  0.82
26  29   1.1e-13   2.9e-10   40.6   2.9         1      86      3331    3401      3331    3402  0.85
27  29   6.7e-12   1.8e-08   34.9   0.1         1      87      3485    3555      3485    3555  0.84
28  29   3.1e-10   8.4e-07   29.5   1.8         1      58      3583    3631      3583    3639  0.85
29  29   1.8e-09   4.8e-06   27.1   1.5        18      86      3649    3706      3638    3707  0.73
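The per-domain rows in the hmmscan output above are whitespace-separated and have 13 fields each, so they can be read programmatically. The following is a minimal Python sketch, not part of the database's own pipeline: the names `DomainHit` and `parse_domain_row` and the i-Evalue cutoff of 1e-3 are assumptions made for illustration, and only four rows copied from the output are used as sample input.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One row of the hmmscan per-domain table (hypothetical container)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(line: str) -> DomainHit:
    """Parse one whitespace-separated domain row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# Four rows copied from the hmmscan output above (domains 1, 2, 9, 22).
ROWS = """\
1 29 2.4 6.5e+03 -2.2 3.3 38 62 334 361 325 377 0.59
2 29 2e-15 5.4e-12 46.1 4.1 1 86 570 642 570 643 0.85
9 29 1.7e-17 4.6e-14 52.8 0.3 1 86 1338 1407 1338 1408 0.82
22 29 2e-06 0.0055 17.3 0.1 1 58 2793 2840 2793 2866 0.85
"""

I_EVALUE_CUTOFF = 1e-3  # assumed threshold, chosen only for this example

hits = [parse_domain_row(line) for line in ROWS.splitlines() if line.strip()]
significant = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
best = max(hits, key=lambda h: h.score)

for h in significant:
    # ali_from/ali_to are 1-based, inclusive positions on the protein.
    print(h.index, h.ali_from, h.ali_to, h.ali_to - h.ali_from + 1)
```

With this cutoff, domain 1 (i-Evalue 6.5e+03) and domain 22 (i-Evalue 0.0055) are excluded from `significant`; a different cutoff would change which rows are kept.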
Sequence Information
- Coding Sequence
- ATGTCACAACATAACCAACCCCACCAAGTTCCCCCGCAACCCCATCCGCACTATCCTTACCACCACGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTATATGGGCCACGAAGTCATGGCCGGCAGCAGCTATCCCTACATCAAGAGCGAACCCATGGAGGCATTCCAGCACCCGCCAAACCCCATGGCACCGCCACCACCCCTGCCTCCGGCCCCGGAAATGATCATAAAATCTGAACCCATGGACGAACAGGCCTACAAGTCCAACTATATAGACGACAACACCCCGTTTGCGGACTTTAGCAAGTTCAACGAATACAGCGAGGATATGCTGAGTCCCAAAGTGGAGCTTACCGTTAAGGACGAGTCCTACGGCAAGAACCATAATAGTTTTCCTCGTCGCAAGCCACCAAATGATCGTCTCGCCGGCAATGAGAGCCTGCCGATCTGCCAGCGCTGCAAGGAGGTGTTCTTCAAGAAGCAGACCTACTTACGCCACGTTGCCGAGAGCAGTTGCAGCATCCAGGAGTATGACTTCAAGTGCAACATCTGCCCCATGTCCTTCGTGAGCGCTGAGGAGCTGCAGCGCCACAAAAACCATCATCGGGCCGACCGATTCTTCTGCCACAAATACTGCGGCAAGCACTTTGAAACGATTGCCGAGTGCGAGGCGCATGAGTACATGCAGCACGAATACGACAGCTTTGTCTGCAACATGTGCTCTGCCACTTTTGCCACAAGGGATCAGCTTTACTCCCACCTGCCGCAGCACAAGTTTCAGCAGCGCTTCGACTGCCCCATTTGCCGCCTGTGGTACCAGACCGCTCTCCAGCTCCACGAGCACCGGCTAGCGGAACCCTACTACTGCGGGAAGTACTACGGGGCAGGGCTGAACACGGCGACACCTCAGCAGCAACATCACCACCAGAGCCAGACCAACTATAAGCTACAGGATTGCCATATGGCCACAATGGAGATGCCCAACACATCGCAGCACAAGCCAAACTCCTCCAACTCCACCTTGCCGGCCACGGCGGCTCTCAGTTCCTTGCTGCAACAGCGGCAAGCGAATGCTGATGGCGCTGCCATGTTTGCTGCCTCGGCGGCGGTCAAGGCGGAGATGAACGTGAAGCTGGAGCGGAGCTACAGTAACTCGACCAGTGAGTCATCGTACGGGGTGCAGGATGGCGGCTACAACAACTCCTTCTCCGGAGAAACTTCCATGCACAGTGGTGCCATCGCTGGGCCACAGGCCAACTCCTCGACGCTGGACGACTCGGAGGACGCGCTGTGCTGTGTGCCATTGTGTGGAGTGCGCAAGAGCACCAGCCCTACGCTGCAGTTCTTCACGTTCCCCAAAGACGAAAAATACCTCAACCAGTGGCTGCACAACCTCAAGATGTTCCATGTGCCGGCCTCCAGCTACGCCAGCTTCCGCATCTGCAGCATGCACTTCCCCAAGCGCTGCATCAACCGATACTCGCTGTGCTATTGGGCAGTTCCGACATTCAACCTGGGCCACGACGACGTGGCCAATCTCTACCAGAACCGAGAGCTGACCAACACCTTCACCGTCGGTGAA
GTGGCCAGGTGCAGCATGCCCCACTGCACCAGCCAGCGGGGCGAGAGCAACCTCAAGTTCTACAACTTTCCCAAGGACATCAAGAGCCTGATCAAGTGGTGTCAGAACGCCCGTCTCCCAGTCCAGGCCAAGGAGCCGCGCCACTTCTGCAGCCGTCACTTTGAGGAGCGTTGTATTGGCAAGTTTCGTCTTAAGCCCTGGGCTGTGCCCACTCTCCATCTGGGCGCCCAGTACGGAAAGATCCACGACAATCCGAAGAACCTGTATGTGGAGGAGAAACGGTGCTGTCTCAACTTCTGCCGCAGGAGCAGGTCCTCCGACTTTAATATGTCGCTGTATAGATTTCCCAGAGACGAAGTCCTCCTCCGCCGTTGGTGCTATAACCTTCGCCTAGATCCCGGAGTATATCGCGGCAAGAATCACAAAATATGCAGTGCCCACTTCATCAAGGAGGCGTTGGGCTTGCGGAAGCTATCACCTGGGGCGGTGCCAACATTGCATTTGGGCCACAACGACACCTTCAACATCTACGAGAACGAGCTGTGGCCGCCGCCAACTCCCTCCACCAGCCACGGCAGTGGCCAAGTGCACATGCAGCATCAGCAACATATCCCGTCGCACCACTCGCTCCAGCACCAGCTGCATCTTGGACAAAGCAAGTCCTATCAACGGCACTCGGCCGCGTCCACTTCGTCCTCGGCGAGCTCCACCTCGCACTACGTGGATCCGGAGGTGAGTGCTTCGTACCTGGCGATGGGCGGATCCTCGGCGAACGCCAGCGACAGTATGGATGTCTGCTGTGTGCCCAGCTGTGAGAGCAAACGGCACAACGCCGAGAACATTACCTTCCACACGATTCCCCGAAGACCCGAGCAGATGCGCAAGTGGTGCCACAACCTGAAGATACCCGAGGACAAGATGCACAAGGGCATGCGGATCTGCAGCCGGCATTTCGAAGCCTACTGCATCGGCGGGTGCATGCGTCCGTTCGCAGTGCCCACACTGCATCTGGGTCACGACGACGAGGACATCCACCGAAATCCGGACGTTATAAAGAAGCTAAACATCCGCGAGACCTGCTGCGTGGCTGTCTGCAAACGAAACCGCGACCGGGACCATGCCAACCTGCACCGCTTCCCCAGCAACGTGGCGTTGCTGACCAAGTGGTGTGCCAATCTCCAGCGGCCCGTTCCGGACGGCAGCAAGCTCTTCAACGACGCCATTTGCGAGGTGCACTTCGAGGACCGTTGCCTGCGGAACAAGCGCCTGGAAAAGTGGGCGGTGCCTACTCTGACCCTGGGCCACGACGACATTGCCTATCCCCTGCCCACGCCGGAGCAGGTTGCCGAGTTCCACTCTCGGCCCTCAGCCCCCAACAACGGGGAGGAGCAGGGCGAGTGCTGCGTGGAGACCTGCAAGCGAAACCCCAGCGTGGATGACATTAAACTGTACCGCCCTCCGGAGGAGGCCTCTGTGCTGGCCAAGTGGGCGCACAACCTACAGACGGAGGCGGCACAGCTGGTGAGCCAGCGAATCTGCAATCTGCACTTCGAGGCCCACTGCATCGGCAAGCGGATGCGGCCATGGGCCATACCCACCCTCAACCTGGCCGGCAACATTGAGAATCTCTACGAGAATCCGGAGCCTTCGATGCTCTACAAGCGGCGGATGCACACCAAAGCGAAACTGTCGGTCTCTGCGAAACCCACCTGGGTGCCGCGTTGCTGCCTGCCTCATTGCCGCAAGGTGCGGGCCCTCCACAATGTCCAGCTCTACCGATTCCCGAAGCACAACCGCTCCACGCTGGCCAAGTGGGCGCATAACCTGCAGGTGCCCATGGTGGGCAGTGCCCAACGCCGGGTGTGCTCGGCCCACTTTGAGCCTCTTGTGCTGAGCAAGAAGTGCCCGGTGCCGCTGGCGGTGCCCACACTGGACCTGAACGCCCCGGCAGGGCATATGGTGTACCAGAATCCGGCCAAGCTGAGGGCCAGTAAGCTGTG
CCTGCAGCGCGTGTGCATCGTGGAGAGCTGTCGCAAGACTCGGGCACAAGGAGTGCAACTCTTCCGGCTCCCGCACAACCCGTCCCAGCTACGGAAGTGGATGCACAATATCCGGACCCGTCCGCGGGGTTCCATGCGGTCTCAGTACCGGATCTGCTCCCGCCACTTTGAGACTCACTCGTTTAACGGGCGAAGGCTCAGTGCTGGGGCCATTCCCACACTGGAGCTGGGCCACGACGACGACGACATCTACCCCAACGAGGCGCAGGCTTTTGTGGACGAACACTGCGCCGTGGAGGGATGCGGGGCATCCAAGGAACAGCCGGAGGTGCGACTATTCCGCTTCCCCACTGACGACGATGACATGTTGTGGAAGTGGTGCAACAACCTGAAGATGAACCCCGCGGACTGCACGGGCGTGCGCATCTGCAACAAGCACTTCGAGGCGGACTGCATTGGACCCAAGCACCTATTTAAGTGGGCCATTCCCACCCAAGAGCTGGGCCACGACGATGCCCAGATAGAACTCATTCCAAACCCCAAGCCGGAGGATCGGTACGTGGACCCAGTGTTCAAGTGTGTGGTCCCCACCTGTGGCAAGACGCGGCGCTTTGACGAAGTCCAGATGAACAGTTTCCCCAAGGACCCGGAGCTCTTCCAGCGGTGGCGACACAACCTCCGCTTGGACCACTTGCACTTCCACGAGCGAGAGCGCTACAAGATCTGCAACGCCCACTTCGAAGACGTGTGTATTGGCAAGACCCGCTTGAACATCGGCTCGATACCCACACTAGAGCTGGGCCACGACGAGACCGAGGACCTGTTCCAAGTCAATCCCGCGGAGTTGCAGAGCAACTTGTTTGGTCGCCAACGACGGCTGCTCGACGGATCGGAATCCGGCGAGGTGGTGGTCAAGCAGGAGCTTCCGGATGAGGAGACCGAGCCCGAGGACATCAAGCCGGACATTCGAGAACTACTAGTGTCCAGACCCAGACAGGTGAAGTCCAAAAAAGGATCGCTGGGGAATCTGAAGTGCTGTGTCCGGAGCTGCGGAAGGAGCCGGCTCCAACATGGTGCTCGTCTGTTTGCCTTTCCGACGGGCAAGCAGCAGCACCTTAAGTGGCGCCACAATCTACGCCTGGAGCCAGAGGACGTGGACAGGTCTACGAGGGTGTGCAGCGCTCACTTCAATCGCCGGTGCATCGACGGCAAGCAGCTTCGTAGCTGGGCCATGCCTACCCTGCAGCTGGGCCATCGGGAGCAGCCCATCTACGAGAACCCCAAGAACATACCGGGCTTCTTCACGCCCACCTGTGCCCTGAGCCACTGCCGCCAGAGAAGGAGCATCGACAACGACCTCCGAACTTACCGGTACCCGCGAACGGAAGACCTGCTCGAGAAGTGGCGGGCAAATCTTCGCCTGACTCCGGATCAGTGCCGCGGTCGTATCTGTGCGGATCACTTTGAACCTATGGTGCGTGGCAAGCTGAAGCTGAAAACGGGAGCGGTGCCCACCTTGAAGCTCGGCCACGACGAGGGACTGATCTACGATAACGAGGCGATCAAGGCTGGCTTGGCGGAGGAGGAGGAGGTCACCTGCAAGCAGGAGATGGTCGAAGAGGAGGAAGAGGCCGAGGGAGAGGAGTCGCCCGAAGGAGTTCCCGCTGTCAACGAGGATGACGACGACAAGGACGACAGCTACTTCGATCCTTTGGAGTTGGTAGAGACGTTCGCAGAGCGCGCCAGCGACGAAGAAGCGGAAGACCACGAAATGGAGGAGAAGAATGAGCCGGAGGAGGGGGATGAGGAGGAGGCAGAGGAGCTCCTGCCAGACCTGCCACCCACACCGCCACCTGTACCCCAGCGTCGCGAAAAACCCGCCAACAATGTGACCCCCATTTGCTGTCTGAAGCACTGTCGCAAGGAGCGCACGGCCTTCCATCTGCTGAGCACATTCGGCTTCCCGAAGGACCGCAAGCTCTTGCTGAAGTGGTGCGACA
ATCTCCACCTGCTTCCGCATGACGTTGTCGGGCGGGTCTGCATCGAGCATTTCGAACCGGAGGTGCTCGGCACTCGCAAGCTGAAACAGAATGCAGTGCCCACATTGAACGTGGGCCACGACGACCCGTTGCGGTACACCTGCCATGGTGTGGAGCAGGATCAGGACTTGGAGCAGGGACAGCCGCAGCACTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGTCGCAAGAGGAAGCTATCGGATCCGCCGGACATTCGCCCCAGCCACTGGAAGGAACTGAAGCTGCACATGCAGAAGCAGAGGCAGATGGAAATGGTAGAGATGGAGACCGACCTAATGATGAGCACTCCGCCTCAGACACCGGTGAAGATTAAACCCAAAAGATGCTGCGTCATCAGCTGCGGGAGCGAGGATGCTAGAAAATTGGTGGCGCTGCCGGATGAGCGCAGCCTTCTCCGCCGGTGGCAGCACAACCTGAAGCTGTCAGTGCTGACGGATCCAGGTCTTGGCTTGTGCCTGGACCATTTCGAGGAGTCTCTGGTGCAATATGGAAAGCCCATGGAGAGGGCAGTGCCCACCTTAAAGTTGGGTCACAAGAGCGGTAACCTCTACCGAAACAATGCTACTTGTCTGGTCCCCAGCTGTCCCAGTTCCGGCTCCGATAGCACTAGTTTTGTGGCTCTGCCCCACAATCCTGTGATGAAAAGGGCCTGGCTCTCCTACCTCCAACTGCCATTCACTAGCAACGGACTTCTATGTGGCAACCACTTCGTGGAGCTGTACGAGCAGGTGGACTTGCCTGAGGACTTGCCCGTCCAGGATTTGGAGGAACTGGAACGAACTGTCGATGAGCTGCAGTGCGCTGTGCCCGGTTGTGCGTCAAAGAACGCCCGTGATATTCCCGTCCAGCTGGTCCAGTTACCCCACAACGAGAAGGAACTGTCCAAGTGGCTGCATAACACAAAGATCACCTATGACTATTCCCGGCACGGCAGCTATCGGATTTGCCTGCTCCACTTCGACCCGATCTGCCTCGATGAAGACTTTCCCCAGAGTTGGGCAGTGCCTACTCTAAACCTGGGCCACGACGACCAAATCCACTTGAATCCCGTCCAGAATCAGGTTGCTGAGGCCCTAAACGGAACGTCCAATAGCCATAGCCTGAGACCTCTGAGGATTAGGACAGAACTAGCATCCAGCCCGAGTGTGAGTGCCAGTCCCAGTCCGAGAGGAAACATCCGGATTTGTTGCATCCCCACTTGCAACCAGTTTGGGAACAGCCAGGTGCGACTCTATCGCTTTCCCAGCGAGGAGCAGTTCCTCCTCCAGTGGCTGGTCAATACGCAGCAGCAGCCCCGTCTGGTGGATCCCATGGAGCTCTACGTGTGTCAGGCACACTTTGAAACCGACGCCACCTACAAGAAGCACCTTCGCAGCTGGGCCTTGCCGACCTTGAATCTTGGCCACTACGGGCATGTCTTTCAGAACGCCAGGCACAATGGAAACACTGCCGATGTCGAGGAGGCATTGAAGTTTATCCGGGAGCGCTACTGTTCGGTGCTGAGCTGTTTCCAACTCCGAGGAGAGGGAGTCCGCCTGTTCGAGTACCCCGAGGACATGGCCATGATCCGAAAGTGGGCAGTTGCCTGCAAACATCGTTCCATGCACGCCAGGAGCCATGGCCTCCAGGTGTGCCAGGCGCACTTTGCTGCCGACTGCTTTGATCCCGACACTGGAGACCTACTGGAGGGATCAATACCCACGCTGGAACTCAACCGCGAAGACATCGAGAGACACTGCTTGGTGCCAGGTTGTGAGCAGGACGATGCGGGCCCCCGGCTGCGATTCTATAAGCTGCCCAAGATCGGTGAACAGCTCGAGGCGTGGAGCACCAATATAAAGCTTCCGGTCTCAGAACTGAAGCGCGGACACCAGCGCATCTGTGAGCGCCACTTCGAGACGTACTGCTTCGGACCTAGCCGGGGTCTGCGGCTGGGA
GCCTTACCCACTCTGTTCCTGGGTCACGAGGACCTTCTTCTTAATCCCGACAACTTGCGGGAGAACTGCTGCGTACCGGGATGCGGGCGTATCCGGCAGACTGATGACATTCCCTTCTACGGCTTCCCGAAGCATTGGTCCTTGGCCAGGAAGTGGCTGCACAACATCCGCTTGGAAAAGACAAGCAAGGATCAGCTAAACAAACTGAGGGTATGCCCGGCGCACTTTGAGTCGGATGTGCGGGAAAACGACGGACTCCTGCCAGAAGCCATGCCCACCAAGCAGCTGGGGCATTCCTCCGAGGGGATTTTCATCACGGACAGGGGCACGCAGGCTAGAAGTCTTCCGAATCTCAAAAGATCCTCTCCGGAGGTCATTTGCTGTTATCCGGACTGCACTGACTCGTCGAGATTCCAGCTATTGGATTTTCCCGAACAGGCAGAGCTCCGCGATGCATGGCTGGGTCACTTGAAACTCAGGGAGCTACATGATGAAGCCCCACAGCTCTGTCCCCTCCATTATGTGATTCTATATGAGCACAGTGCCAAGGAGTTTCCGGAGCACGTTCCAGACCAGTTGATGGAAGTAAACTACACTAACGCCCGCGCCAACCGGCGGGTCAAGATCGTCAGTTGTGCCATCAAGGGCTGCACAACGGTGAGGCCTAGAGATGGAGTACCGCTGCACGGAATGCCCACGTACAAGGATATCCTGCAGATGTGGGTGGACAACGGGCAGGTGGACTTCTCCGAACCGCAACGGTACATGCTCAAGGTGTGTCACAGGCACTTTGAGCCACGTTGCTTCGTCGACGAACGGCGGCTCAGCTCCTGGAGTGTTCCTACCCTGCATCTTCCCGGTGAGACTGTCCACCAGAATCCCAGCAAGGAGGAGTGGGAGGTCATCAAGCGAGAGAACAAGGAAGAGCCAGAAATCAAGGAGGAACCTCTAGAGACGGAGCCAGAGATGGAGATCGAAACGGAAAACTCTCTACTGGAGCCCATTGTCAAGATGGAACACCTGGAATCCGAGGAGGAGGACTCAGAAATGCAGGCGTTGGAGGTGCTGCTGGAGGTCGGACACGTGGAGCGGCTGGACAGCTATGAAAAGATCGACGAATCCCCCATTGCCTACAAGTCCAATCGAGGGCAGTACAACGCCAACAGCTGTGCCGTGGAAGGGTGTGACGTCACGGCCGAGGACGTGGGTGGAACTATCAAGCTGCACAAGTTTCCCGCCCCAGCGGAAGCCGCCCGCAAGTGGATGCACAACACCCAGGTGGACATGGAGGAGAAGTTCTGGTGGCGCTATCGCATCTGCAGCTACCACTTTCACCAGGACTGCTTCCAGGGGTCTAGAATCCGAAAGGGAGCCATGCCCACGCTACTCCTGGGACCTCGGAGACCGGATGAGGTCTACGACAATGAGTTCGCATCGCAGCCGGAGGTTAAGGATCCACCTCCGCCAGTCGAGATCCTCCCAGTGACCAGTGTGACTGAACGGATAGCGCCCGATGTTACCAATCTCTGCCTTCCTCCGCCGGCTGCGCCCCGAAAATCCAGCAAGTTCTGCCAAATCGAAGGCTGCTCGAATCACCTGACCACCGACAACATAACCCTCCACAAGTTTCCGCACTCGGAGGAGATGTGCATCCGATGGCAGCACAACTCTCAAGTTCCATTCGATCCGAACCATCGCTGGCGGTACAGGATCTGCACCGCCCACTTCGAACCCGTGTGCTTGTCCAACTTGCGCCTGCTCCACGGAAGTGTGCCCACCCTGAAGCTAGGACCCAAAGCTCCTGCGGAGCTCTTCGACAACGATTTCGAGGCCATCAACCAGCGACTGGATAAGAGATCGGCGGCAGAGGTGAAACAGGAACGGGTGGATATGGAAGACGAGCTGCACGAGGACCAAATGGATGTGCCTAGCTTGATGCCTGTGAAGCAGGAGAAGATATCCTTCAACCAGATCAAGTCTGGCTACGACAAGTGCTCCCT
GGCCCACTGCCAGCGCCAAAGATCTCTGCACGGCGTCCACATCTACAAGTTCCCCAGGTCGCAGCTCCAGCAGGAGCGATGGATGCACAACCTCCGCATCCGCTACGATGAGCGCCGTCCCTGGCGATTCATGATCTGTAGCGTCCACTTCGAGCCCCACTGCATCAGCCTCAGGAAGCTGCGTCCCTGGGCAGTTCCTACGCTAGAGCTGGGCACCAATGTGCCGGAGATACTCTTCACCAACGAACAGTGCCTGGAACTGGAGGTGGAACAACCCAGCGATCGTAGCGAAGCGGAGAGCGAAGAGGAGGATGGTCTTGAAGAAGATGACGATGGTGAGGAGGACGAGGCGGAGGAAGAAGGACATGACTCCAATGTCCGCATCAAAAAGGAACGGCGCTCGAGACTGGATCCATATCCTGCTGGTCAGGTTCCGCCCTGGAAAGTGAAGCAGTGCTGCCTTCCCTACTGTCGTGCCTTTCGAGGAGATGGCATCAAGCTCTTCCGGCTCCCCAACAACCGAACCTCTATTCACAATTGGGAGTTGGCCACTGGCATGGTGTTCAAGGAGTCTCAGCGAAACACGCGACTCATTTGTAGTCGGCATTTCGATCCGGAGCTTATCGGAGTGCGTCGCCTCATGCGCAACGCCATTCCAACTCTGCATCTGAATCCGGAAGCCGTAAAGGGCAAGGAGAAAAAGGTTTGGCAGAGCAAACCCAAGGAAACTCCCACACCCATCCCAACCTGCTGCATGGCGGACTGTCATCACAACGGAAATGCCAAGCTGCATAAGTTCCCCAATGATTCCACACACCTGAGGCAGTGGTGCCAGGCCCTCAGACTCACGGATATACAACGTTATCGTGGCAAGTACATCTGCTCGGCCCACCTGCCGACCAACATGACCGTAAGCTGCGTCGTCTGCGGCGTAGATGACGTTCAGCTACCGATGCTGGACTTTCCAGAGAACCGCAACCAGCGGGCCAAATGGTGCTACAACCTAAAAATCGAGACCATACCCAAGTGGGATCGCTCCAAGCACATCTGTTGCCGGCACTTTGAGTCACACTGCTTTGTCCGGCCGGGTGAACTTCGTCCAGGAGCGACCCCAACAGTGGCATTGAACCACAACGATACAAACATATTCCTCAGCGACTACGCCACCGATCCGACGACCTCCTATGCGGGTAATCAGATCAAGGACGAGCCCATGGACGGCGACGAGACGCTCCTGGTCTAG
- Protein Sequence
- MSQHNQPHQVPPQPHPHYPYHHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYMGHEVMAGSSYPYIKSEPMEAFQHPPNPMAPPPPLPPAPEMIIKSEPMDEQAYKSNYIDDNTPFADFSKFNEYSEDMLSPKVELTVKDESYGKNHNSFPRRKPPNDRLAGNESLPICQRCKEVFFKKQTYLRHVAESSCSIQEYDFKCNICPMSFVSAEELQRHKNHHRADRFFCHKYCGKHFETIAECEAHEYMQHEYDSFVCNMCSATFATRDQLYSHLPQHKFQQRFDCPICRLWYQTALQLHEHRLAEPYYCGKYYGAGLNTATPQQQHHHQSQTNYKLQDCHMATMEMPNTSQHKPNSSNSTLPATAALSSLLQQRQANADGAAMFAASAAVKAEMNVKLERSYSNSTSESSYGVQDGGYNNSFSGETSMHSGAIAGPQANSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHVPASSYASFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTVGEVARCSMPHCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPSTSHGSGQVHMQHQQHIPSHHSLQHQLHLGQSKSYQRHSAASTSSSASSTSHYVDPEVSASYLAMGGSSANASDSMDVCCVPSCESKRHNAENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSRHFEAYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRPVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLTLGHDDIAYPLPTPEQVAEFHSRPSAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEEASVLAKWAHNLQTEAAQLVSQRICNLHFEAHCIGKRMRPWAIPTLNLAGNIENLYENPEPSMLYKRRMHTKAKLSVSAKPTWVPRCCLPHCRKVRALHNVQLYRFPKHNRSTLAKWAHNLQVPMVGSAQRRVCSAHFEPLVLSKKCPVPLAVPTLDLNAPAGHMVYQNPAKLRASKLCLQRVCIVESCRKTRAQGVQLFRLPHNPSQLRKWMHNIRTRPRGSMRSQYRICSRHFETHSFNGRRLSAGAIPTLELGHDDDDIYPNEAQAFVDEHCAVEGCGASKEQPEVRLFRFPTDDDDMLWKWCNNLKMNPADCTGVRICNKHFEADCIGPKHLFKWAIPTQELGHDDAQIELIPNPKPEDRYVDPVFKCVVPTCGKTRRFDEVQMNSFPKDPELFQRWRHNLRLDHLHFHERERYKICNAHFEDVCIGKTRLNIGSIPTLELGHDETEDLFQVNPAELQSNLFGRQRRLLDGSESGEVVVKQELPDEETEPEDIKPDIRELLVSRPRQVKSKKGSLGNLKCCVRSCGRSRLQHGARLFAFPTGKQQHLKWRHNLRLEPEDVDRSTRVCSAHFNRRCIDGKQLRSWAMPTLQLGHREQPIYENPKNIPGFFTPTCALSHCRQRRSIDNDLRTYRYPRTEDLLEKWRANLRLTPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEGLIYDNEAIKAGLAEEEEVTCKQEMVEEEEEAEGEESPEGVPAVNEDDDDKDDSYFDPLELVETFAERASDEEAEDHEMEEKNEPEEGDEEEAEELLPDLPPTPPPVPQRREKPANNVTPICCLKHCRKERTAFHLLSTFGFPKDRKLLLKWC
DNLHLLPHDVVGRVCIEHFEPEVLGTRKLKQNAVPTLNVGHDDPLRYTCHGVEQDQDLEQGQPQHSVFRLWSLKHCRKRKLSDPPDIRPSHWKELKLHMQKQRQMEMVEMETDLMMSTPPQTPVKIKPKRCCVISCGSEDARKLVALPDERSLLRRWQHNLKLSVLTDPGLGLCLDHFEESLVQYGKPMERAVPTLKLGHKSGNLYRNNATCLVPSCPSSGSDSTSFVALPHNPVMKRAWLSYLQLPFTSNGLLCGNHFVELYEQVDLPEDLPVQDLEELERTVDELQCAVPGCASKNARDIPVQLVQLPHNEKELSKWLHNTKITYDYSRHGSYRICLLHFDPICLDEDFPQSWAVPTLNLGHDDQIHLNPVQNQVAEALNGTSNSHSLRPLRIRTELASSPSVSASPSPRGNIRICCIPTCNQFGNSQVRLYRFPSEEQFLLQWLVNTQQQPRLVDPMELYVCQAHFETDATYKKHLRSWALPTLNLGHYGHVFQNARHNGNTADVEEALKFIRERYCSVLSCFQLRGEGVRLFEYPEDMAMIRKWAVACKHRSMHARSHGLQVCQAHFAADCFDPDTGDLLEGSIPTLELNREDIERHCLVPGCEQDDAGPRLRFYKLPKIGEQLEAWSTNIKLPVSELKRGHQRICERHFETYCFGPSRGLRLGALPTLFLGHEDLLLNPDNLRENCCVPGCGRIRQTDDIPFYGFPKHWSLARKWLHNIRLEKTSKDQLNKLRVCPAHFESDVRENDGLLPEAMPTKQLGHSSEGIFITDRGTQARSLPNLKRSSPEVICCYPDCTDSSRFQLLDFPEQAELRDAWLGHLKLRELHDEAPQLCPLHYVILYEHSAKEFPEHVPDQLMEVNYTNARANRRVKIVSCAIKGCTTVRPRDGVPLHGMPTYKDILQMWVDNGQVDFSEPQRYMLKVCHRHFEPRCFVDERRLSSWSVPTLHLPGETVHQNPSKEEWEVIKRENKEEPEIKEEPLETEPEMEIETENSLLEPIVKMEHLESEEEDSEMQALEVLLEVGHVERLDSYEKIDESPIAYKSNRGQYNANSCAVEGCDVTAEDVGGTIKLHKFPAPAEAARKWMHNTQVDMEEKFWWRYRICSYHFHQDCFQGSRIRKGAMPTLLLGPRRPDEVYDNEFASQPEVKDPPPPVEILPVTSVTERIAPDVTNLCLPPPAAPRKSSKFCQIEGCSNHLTTDNITLHKFPHSEEMCIRWQHNSQVPFDPNHRWRYRICTAHFEPVCLSNLRLLHGSVPTLKLGPKAPAELFDNDFEAINQRLDKRSAAEVKQERVDMEDELHEDQMDVPSLMPVKQEKISFNQIKSGYDKCSLAHCQRQRSLHGVHIYKFPRSQLQQERWMHNLRIRYDERRPWRFMICSVHFEPHCISLRKLRPWAVPTLELGTNVPEILFTNEQCLELEVEQPSDRSEAESEEEDGLEEDDDGEEDEAEEEGHDSNVRIKKERRSRLDPYPAGQVPPWKVKQCCLPYCRAFRGDGIKLFRLPNNRTSIHNWELATGMVFKESQRNTRLICSRHFDPELIGVRRLMRNAIPTLHLNPEAVKGKEKKVWQSKPKETPTPIPTCCMADCHHNGNAKLHKFPNDSTHLRQWCQALRLTDIQRYRGKYICSAHLPTNMTVSCVVCGVDDVQLPMLDFPENRNQRAKWCYNLKIETIPKWDRSKHICCRHFESHCFVRPGELRPGATPTVALNHNDTNIFLSDYATDPTTSYAGNQIKDEPMDGDETLLV
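The C2CH consensus quoted in the domain description (Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His) can be written as a regular expression and scanned over a protein sequence. The sketch below is only an illustration of that consensus, not the method used to produce the hmmscan hits: the names `C2CH_PATTERN` and `find_c2ch` are hypothetical, the pattern ignores the other conserved features of the THAP domain (such as the AVPTIF box), and a spacing match alone does not establish a functional domain. The test fragment is copied from the protein sequence above.

```python
import re

# Consensus from the description:
# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His
C2CH_PATTERN = re.compile(r"C.{2,4}C.{35,50}C.{2}H")


def find_c2ch(protein: str) -> list[tuple[int, int]]:
    """Return 0-based, end-exclusive spans of non-overlapping consensus matches."""
    return [m.span() for m in C2CH_PATTERN.finditer(protein)]


# Fragment copied from the protein sequence of this entry.
FRAGMENT = (
    "CCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHVPASSYASFRI"
    "CSMHFPKRCINRYSLCYWAVPTFNLGHDD"
)

spans = find_c2ch(FRAGMENT)
for start, end in spans:
    print(start, end, FRAGMENT[start:end])
```

Because `re.finditer` reports non-overlapping matches only, closely spaced or overlapping candidates would need a lookahead-based pattern instead; this sketch does not attempt that.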
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00481571;
- 90% Identity
- iTF_00538995;
- 80% Identity
- -