Dosh016940.1
Basic Information
- Insect
- Drosophila oshimai
- Gene Symbol
- -
- Assembly
- GCA_018150695.1
- Location
- JAECWM010000296.1:108081-125316[-]
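The location string above packs a contig accession, a coordinate range, and a strand into one field. A minimal sketch of splitting it into parts follows. The `Locus` type and `parse_locus` helper are hypothetical names introduced here for illustration, and treating the coordinates as 1-based and inclusive is an assumption that the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for one location field."""
    contig: str
    start: int
    end: int
    strand: str


# Layout assumed from the record: <contig>:<start>-<end>[<strand>]
_LOCUS_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


locus = parse_locus("JAECWM010000296.1:108081-125316[-]")
# Genomic span, ASSUMING 1-based inclusive coordinates.
span = locus.end - locus.start + 1
print(locus, span)
```

The span computed this way covers the whole locus on the contig, introns included, so it is not expected to equal the coding sequence length.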
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
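| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|----|----------|----------|-------|------|----------------|--------------|----------------|--------------|----------------|--------------|-----|
| 1 | 21 | 7.5e-14 | 1.1e-10 | 42.1 | 0.6 | 1 | 86 | 144 | 213 | 144 | 214 | 0.83 |
| 2 | 21 | 3e-10 | 4.3e-07 | 30.5 | 0.0 | 1 | 87 | 239 | 311 | 239 | 311 | 0.78 |
| 3 | 21 | 1.3e-11 | 1.9e-08 | 34.9 | 0.4 | 1 | 86 | 342 | 414 | 342 | 415 | 0.81 |
| 4 | 21 | 9.3e-12 | 1.3e-08 | 35.4 | 0.3 | 1 | 85 | 485 | 550 | 485 | 552 | 0.74 |
| 5 | 21 | 0.00046 | 0.68 | 10.7 | 0.0 | 1 | 58 | 567 | 614 | 567 | 629 | 0.80 |
| 6 | 21 | 1.8e-12 | 2.7e-09 | 37.6 | 0.0 | 1 | 87 | 643 | 713 | 643 | 713 | 0.82 |
| 7 | 21 | 8.6e-08 | 0.00012 | 22.7 | 0.1 | 2 | 58 | 750 | 796 | 749 | 815 | 0.79 |
| 8 | 21 | 4.9e-13 | 7.1e-10 | 39.5 | 0.7 | 1 | 86 | 1325 | 1394 | 1325 | 1395 | 0.83 |
| 9 | 21 | 5.5e-10 | 8.1e-07 | 29.7 | 0.0 | 1 | 87 | 1420 | 1492 | 1420 | 1492 | 0.77 |
| 10 | 21 | 2.3e-11 | 3.4e-08 | 34.1 | 0.4 | 1 | 86 | 1523 | 1595 | 1523 | 1596 | 0.82 |
| 11 | 21 | 6.6e-12 | 9.7e-09 | 35.8 | 0.4 | 1 | 86 | 1666 | 1732 | 1666 | 1733 | 0.75 |
| 12 | 21 | 0.00024 | 0.35 | 11.6 | 0.0 | 1 | 58 | 1748 | 1795 | 1748 | 1810 | 0.81 |
| 13 | 21 | 1.8e-12 | 2.7e-09 | 37.6 | 0.0 | 1 | 87 | 1824 | 1894 | 1824 | 1894 | 0.82 |
| 14 | 21 | 1.6e-12 | 2.3e-09 | 37.8 | 0.6 | 1 | 87 | 1930 | 1996 | 1930 | 1996 | 0.85 |
| 15 | 21 | 1.9e-12 | 2.8e-09 | 37.6 | 0.5 | 1 | 87 | 2031 | 2103 | 2031 | 2103 | 0.82 |
| 16 | 21 | 3.1e-13 | 4.5e-10 | 40.1 | 0.8 | 1 | 86 | 2141 | 2212 | 2141 | 2213 | 0.82 |
| 17 | 21 | 8.7e-14 | 1.3e-10 | 41.9 | 0.1 | 1 | 86 | 2226 | 2295 | 2226 | 2296 | 0.79 |
| 18 | 21 | 1.7e-05 | 0.025 | 15.3 | 0.0 | 1 | 58 | 2329 | 2379 | 2329 | 2410 | 0.83 |
| 19 | 21 | 2.3e-11 | 3.3e-08 | 34.1 | 0.0 | 1 | 87 | 2417 | 2488 | 2417 | 2488 | 0.80 |
| 20 | 21 | 0.8 | 1.2e+03 | 0.3 | 0.4 | 21 | 70 | 2495 | 2551 | 2492 | 2561 | 0.46 |
| 21 | 21 | 5.4e-15 | 7.9e-12 | 45.7 | 1.6 | 1 | 87 | 2598 | 2671 | 2598 | 2671 | 0.84 |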
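The C2CH consensus quoted in the description (Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His) can be written as a regular expression for a quick look at a protein sequence. The sketch below is only an illustration of that spacing rule: `find_c2ch_candidates` is a hypothetical helper, and a regex match is a rough candidate rather than a replacement for the profile HMM search reported in the Hmmscan output. The example fragment is copied from the protein sequence in this record.

```python
import re

# Spacing taken from the description:
# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His
C2CH_PATTERN = re.compile(r"C.{2,4}C.{35,50}C.{2}H")


def find_c2ch_candidates(protein: str):
    """Return (start, end, motif) tuples for non-overlapping matches.

    Coordinates are 1-based and inclusive (a convention chosen for this
    sketch, not one stated by the record).
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2CH_PATTERN.finditer(protein.upper())
    ]


# Fragment copied from the protein sequence of this record.
fragment = "KCCIPNCGQYESTEVRVFRFPAEAQMLVLWLVNTKLKPRLVNPLEMCVCQFHFEPEA"

for start, end, motif in find_c2ch_candidates(fragment):
    print(start, end, motif)
```

Because `finditer` reports non-overlapping matches and the quantifiers are greedy, closely spaced cysteines can give more than one valid reading of the same region; the pattern is best treated as a coarse filter.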
Sequence Information
- Coding Sequence
- ATGAGAATAATTTTAATAAAAAAGAGAAAAGCTGATTACCCAGAAGTCAGTGACGGATCAGAAGAGTTTACAATTTGCCTCGAGCAATTCCAGCTGAACAGTTCTACCCTACATAATGTTTCCCAGCTGGGCTACTCTAGCCCCACTCCAGATCAGGATCAGGCATCGATATCGATCATTCGAGATGCACAGCCCCAGCTGCAGAATTCGGCTTTCCGGAAGGGGAGTCACTTTGAACCCTATGCCCAAATTGAAGAGGGTACACATACGAAGTGGGGTGCTCACATAGAAGAGGTTGTGAAGGAGGAGCCCCGAATTCAAGAGCCTTCACATCTGGAATTAGAAAATAGTGTGGAAATAAAAGAGGAGGCTTCCGTGAGCACTAACACTGAAAGCACTGGCCACCCTCGTCTAAAGAAGATTAGGAAGTGCTGTATACCCAATTGTGGGCAGTATGAGAGCACTGAAGTACGGGTCTTTCGCTTTCCTGCAGAGGCGCAGATGCTGGTCCTGTGGCTCGTGAACACCAAGCTAAAACCCCGGCTAGTTAATCCCCTGGAGATGTGCGTATGTCAGTTCCACTTTGAACCCGAGGCCATCAATAAGGATCAGCTTACCAGCTGGGCAGTGCCCACCTTAAAACTCGGCCACGAAGGACCTGTGGTCCCCAGTCCCAAGCTGAAAAAATACACTCTCGACTGCCGGGAGACAAGGTGTGTGGTGACTGGCTGTGCGGAGGATAAGAGCGAACCCTCCGACGTATACTACAGGTTGCCAAAGAGTCCTGCCCTATTGGATGCATGGAGCTGCAACCTTTGGATATCTCCCTCTGAACTGCTGATGGGTGACCAGCGCATCTGCGGACGCCACTTCGAGGACTACTGCAAAAAAACGACTAAGAGCCTGTATGCTGGATCGCTTCCCACTCTCTTTCTGGGTCACGAAGAAAACATTCTCCCCAATCCCCGCAGCCTCATGGAAAGAGAGGAGAATAAAAGCTATGATAAGTACAGTGCCAGCAGTTGCGCCGTGCCAGGATGCTGGGTGACCTTCAAAGTTATAACTAATACTTTTGGGTTGCACAAGCTGCCCAGCTCCTACGAGGCCGCCGAGATGTGGCTACAAAACATTCACGTGGAGATGGACGAAGGGCTCTGGCCGAGGATTCGCGTCTGCAGCCACCACTTCGAGCAGGATTGCTTCCAAAACTTCAGGATTAGGGAGGGAGCAATGCCCACGCTCTGCTTGGGACCAAGAACTCCTGAATTTCTGCACGAAAATCAGCCACCATCCAACCAGGATCTTGCGACGTCACCGCAAGATCAGGCAAAGACGAACATAACTAGTCCAACGGACATTGCCCAGCGCAAGAGGAACTTGCAGGCGACACCAGAGAAGGAAGGTCCAGTGGAAAAGAAGAAAACTCCAGGTAGTAATGGCGCTTCCAAAAACTGCTGTGTCAAGAGCTGTAGGAACGACGATGAAAGCAAGTTGCAACCAATGCCCGAGGAGCCCTCGCTGCTTAAAAAGTGGCTGCACAACTTAAAGCAGTCAAAAGACGGAGATACCACCCTGCTTCGCGTGTGCCCGCAGCACTTTGAGCCGCACTTGGTGAGGAACGGAACGCCCTTGGAGCAGATGGTGCCCACCTTGAATCTAGGCCACAGCAGTGTAAATATTTACCGGAACAAAGAGGTCTGCCTAATGACCGACTGCCCAAGTGCTCGTTTGAGTCAGTGCGTTTACGTCGATCTTCCGAGAAATACTGCCGTAAAGGAGGCCTGGTTGTCACACCTGCAACTGCCCCTCACCTACAAGGGACGCCTGTGTGGGGCTCACTTTGTGGAACTGTACAAGCTGGATTTACCCAAGGTTCTGTCAATCCAGGATTTGGAAGAGCTGGAGTGTACTTTTATTGAGTTCCACTGCGCTGTGCCAGGCTGCCATTCGAAAAGTGCGCCTGAAACAACCTTAGTCCGGCTCCCGGTCGACGAAACG
CTGGTCTCCAAGTGGTTGCGCAACACAGGAATTACCTATGATCCCTTGAGGCACAGTAGTTATCGACTATGCCTACTTCACTTTCCGCCGAAATGTTTGGAGGAAGGTATTCTACAGAGCTGGGCCATACCCACGTTGCATCTAGGCCAAAGAAGCACGATCTTTTTGAATCCCGAGCATCCAGTGGAGCAGAAGTGGCCCAAAAAAGTCCCGGAAATAAATGGCGTTGTTAGTTTGGAAAGGGTCCGTTGCATTCCCTTATGCGGCAAAGAAAAACGACTCTTCCGCTTTCCCAGCGAGGAACAGCAATTGCTCCTGTGGCTGGTTAACACTCAGCAGCAGCCTCGTGTCGTGGATCCCCGAAAGCTCTTCGTGTGCATGTCCCATTTCATGTATTTCATGTATATTCATGCCAATAAAGAAGAAGAGGAATTACTGGCCACTCCACCTAAGGCTAGCGCACCACCCAAGGTCGCCCTTTCGGGAGCTGCGAGAAAGAGACTCCGCTTCTACCTCAGTAAAGGACACCCCATAGAAGAGGCCAGACGGCTAGCTGCACCACCGATGCAGAAGAAGGAGACGCCTCAGACAGCGAGCAAGAGGATGAGATCGGTCAATACGACTCCCTCTTCTCAATCTCCGAAGAGGGCAATGACACAATCGGACACCACTTCCGATATCTCCAGCGGTATGGCGGCGCCCGCCCCACTGATCACAGTGGAAGTCACTATGGCCCCAGTGGCCACGACAACCACAGCAGCCTCAGCCAAGGAACCAGTGAACAGTGAGGCGGATGTTGCCCAGCCAACGGAGAATGGTGTTGGTCCCTCTTACCGGGACGCTGTCAGCAACACAAAACTGGCAGTCTTCCCGGAAGGTTATCCAAGCACCCTGTACTCGGTGGAGCAGCTAAAAGTGCTACAAAGCGCGATTCTGGACGCGGTGATTGCAGACAGAAAGGGGACCACAAAGCCCCACTTCTCAGGCTGCTCATTCCGCCCAGGATGGCTGGCGGTGTCTTGCAGAGACTCAAACACTGCGGACTGGCTCAAGACTACATTGCCCAAGATCAAGGTGCAACTTGGGGCACGACCCAAAGTAGTTATAGAGTCTGAGATGCCGCGCCCGCAGATATTCGTTGGTTACTTCCCGGAGACCAAGAAGGATATCACGTCGGAAGAGATCCTGGCACTTCTCGAGGGCCAGAACGAAGGCCTCAGAGTATCGGATTGGCGAGTCCTGCGCAGGGAGCTAAAAGGCAACATGGTGAGCCTAACGCTGTCTGTGGACGCGGTATCGGCAAACATGCTGAAAGTGAGCAGCAACAAGGTGGACTTTGCCTGGGGGACTGTTAGTCTTCGGCCAAAAGCCAAACCAACAGTAGCCGCCCAGCAAGCCGACGTAGCTAAGACGGGCGTCGCACCAACCGCCACTGCCAATGTGACAAGCGTGCAGCCTGGCAACAGCAGACAGGCAGCAACCAACAACTTGCCCTCCACAATGGCGCCACAACAACCGCGCATGAGGCCACCGCTGTTTGGACGACTCCCGAGACCCGCAGGACGACAGGGAAAAGCTGAATACCCAGAAGTCAGTGACGGATCAGAAGAGTTTACAATTTGCCTCGAGCAATTCCAGCTGAACAGTTCTACCCTACACAATGTTTCCCAGCTGGGCTACTCTAGCCCCACTCCAGATCAGGATCAGGCATCGATAACGATCATTCGAAATGGACAGCCCCAGCCACAGAATTCGGCTTTCCGGAAGGGGAGTCACTTTGAACCCTATGCCCAAAGTGAAGAGGGTACACATACGGAGTGGGGTGCTCACATAGAAGAGTTTGTGAAGGAGGAGCCTCGAATTCAAGAGCCTTCACATCTGGAATTAGAAAATAGTGTGGAAATAAAAGAGGAGGCTTCCGTGAGCACTAACACTGAAAGCACTGGCCACCCTCGTCTAAAGAAGATTAGGAACTGCTGTATATCCAATTGTGGGCAGTA
TGAGAGCACTCAAGTACCGGTCTACCGCTTTCCTACAGAGGCGCAGATGCTGGCCCTGTGGCTCGTGAACACCCAGCTAAAACCCCGGCTAGTTAATCCCCTGAAGATGTGCGTTTGTCAATTTCACTTTGAACCCGAGCCCATCAATAAGGATCAGCTTACCAGCTGGGCAGTGCCCACCTTAAAACTAGGGCACGAAGGACCTGTAGTCCCCAGTCCCAAGCTGAAAAAATACGCTCTTGACTGCCGGGAGACAAGGTGTGTGGTGACTGGCTGTGCGGAGGATGAGAGCGAACCCTCCGACGTATACTACAGGTTGCCAAAGAGTCCTGCCCTATTGGATGCATGGAGCTGCAACCTTTGGATTTCTCCCTCTGATCTACTGATGGGTGACCAGCGCATCTGCGGACGCCACTTCGAGGACTACTGCAAAAAAACGACAAAGAGCCTGTACACTGGATCGCTTCCCACTCTCTTTCTGGGTCACGAAGAAAACATTCTCTCCAATCCCCGCAGCCTTATGGAAAGAGAGGAGAATGAAAGCTATGATAAGTACAGTGCCAGCAGTTGCGCCGTGCCAGGATGCTGGGTGACCTTCAAAGATGTAACTAAGACTTTTGGGTTGCACAAGCTGCCCAGCTCCTACGAGGCCGCCGAGATGTGGCTACAAAACATTCACGTGGAGATGGACGAAGGGCTCTGGCCGAGGATTCGCGTCTGCAACCACCACTTCGAGCAGGATTGCTTCCAAGACTTCAGGATTAGGGAGGGAGCAATGCCCACGCTCTGCTTGGGACCAAGAATTCCAGAATTTCTGCACGAAAAGCAGCCACCATCCAACCAGGATCTTGCGACGTCACCGCAAGATCAGGCAAAGACGAAAATAACTAGTCCAACGGACATTGCCCAGCGCAAGAGGAACTTGCAGGCGACACCAGAGAAGGAAGGTCCAGTGGAAAAGAAGAAAACTCCAGGTAGTAATGGCGCTCCAAAAAAATGCTGTGTCAAGAGCTGCAGGAACGACGATGAAAGCAAGTTGCAACCCATGCCCGAAGAGCCCTCGCTGCTTAAAAAGTGGCTGCACAACTTAAAGCAGTCAAAAGACGGAGATACCACCCTGCTTCGCGTGTGCCCGCAGCACTTTGAGCCGCACTTGGTGAGAAACGGAAAGCCCTTGGAGCAGATGGTGCCCACCTTGAATCTAGGCCACAGCAGTGTAAATATTTACCGGAACAAAGAGGTCTGCCTAATGACCGACTGCCCAAGTGCTCGTTCGAGTCAGTGCGTTTACGTCGATCTTCCGAGAAATACTGCCGTAAAGGAGGCCTGGTTGTCCCACCTGCAACTGCCCCTCGCCTACAAGGGACGCCTGTGTGGGACGCACTTTGTGGAACTGTACAAGCTGGATTTGCCTAAGGTTCTGTCAATCCAGGATTTGGAAGAGCTGGAGTGTACTTTCATTGAGTTCCACTGCGCTGTGCCAGGCTGCCATTCGAAAAGTGCTCCTGAAACAACCTTAGTCCGGCTCCCGGTCGACGAAACGCTGGTCTCCAAGTGGTTGCGCAACACAGGAATTACCTATGATCCCTTGAGGCACAGCAGTTATAGACTATGCCTACTACACTTTCCGCCGAAATGTTTGGAGGAAGGTATTCTACAGAGCTGGGCCATACCCACGTTGCATCTAGGCCAAAGAAGCACGATCTTTTTGAATCCCGAGCATCCAGTGGAGCAGAAGTGGCCCAAAAAAGTCCCGGAAATAAATGGCGTTGTTAGTTTGGAAAGGGTCTGTTGCATTCCCTTATGCGGCAAAGAAAAACGACTCTTCCGCTTTCCCAGCGAGGAACAGCAATTGCTCCTGTGGCTGGTTAACACTCAGCAGCAGCCTCGTGTCGTGGATCCCCGAAAGCTGTTCGTGTGCATGTCCCACTTCGAGCCCGAAGCCATATACAGAAATCAACTCCGCAGTTGGATGGTACCCACCTTAAACCTCGGACACG
ATGGCTACGTTATGCGCAACGTTCAACACAACGGGAGCATCGCCGACAGCCAGGACATCGAGGCGGCGCTCAAGTTCATCCGCAAGCGCTACTGCTCTGTCCTTAGCTGTTTTCGCATCAAAGGAGACGGAGTGCGGCTCTTTGAGTATCCAAAAGATAAGGACATAATTCGGAAGTGGGCAGCCAAATGCAAGCATCGCTCTTTGCATGCAATAAGTCATGGGTTCAGGGTGTGCCAGGATCACTTTGCCCCTGATTGCTTCGACCCGGACACAGAAAATCTGATAGAAGGATCGGTCCCCACTTTACACCTGTCCAGCGGTGTAGCTGCGGAGACGGAACAGGGGACCTGTATGGTGCCGGACCAGAAGAACTGTCTGGTGCCGGACCAGGAGACCTGTTTGGTGCCGGACCAGGAGACCTGTCTGGTGCCGGGGTGCAGCAACAAGGAAACAGGCCAACGTATTCGCTACTTCAGTGTGCCAAAGCGAGCTTACCTAGTGGACCAATGGTGCCTGAACTTGCGGATTCACTACGCTGATCTCATACATGGCAGTCAGCGCATATGCCATCGCCACTTTGAACCTCACTGCTTCAAAAGCGCCACTGTCCTGCGTCCTAATTCATTTCCAACCCTTTACCTAGGTCCCGGCGTGGACGACGGCAATGACAATAGGTGCTGCGTGCCGGGCTGTGGAAAAGACCAGACTGTTGGTCCTTTATATAGTTTTCCTAGCGTGCGATCCTTGTTCGAGCAATGGATGCATAACATTCGGCTGGACTTAGACAAGGATCAACGGCTTATTCTGAAGGTGTGCAGTGTGCACTTTGAGGCCTTTTGTTTAACTGCTAAAAAAAGGCTGCGCTTGGGCTCCATACCAACCCTGAAGCTGGGCCACTCATACCCGGACATTTACAAGACAAAAATGGGCATCCTGGGCAAAGCACATAAGTTCCACAAGCGACCATTAATGCTGGAAGTCGACTGTTGCTATCCCGAGTGTTCGGAGCAACCGAAGAATTTGGCCTACGATCTCCCACGGGATAATCTTTTGCGCTTAGCCTGGCTGCACCACATGGGCATCGACGAGGCTGCGAAGGACTCACAGGACTCCCCACAACTCTGCCCGCTGCACTACGTCATGCTTTACGATCACAGTGCGAAAAGTGTTCCGGAACGTGTTCCAGACCGAATCCTCACCGATAACTACGAGGATGCCCGGAGAAAAAGGCGGGTGAAGCTCGTCAGCTGTGCCGTCAAAGGTTGCAACACGCTGAAGCCGAGGGATGGTATTCGCCTACACGGATTGCCAACGACTCGGATGATCTTGCAATTGTGGGTGGACAACGGCCAGCTGGAATTGGGCGACCATCTGCAATACACGCGAGTGTGCCGCAAGCACTTTGAGCCACAGTGCCTTGCCACTGAACAGCGCCTGGTCTCCTGGGGCATACCCACCTTGCATCTTCCGGGCAGAGTGGTGCACCAGATTCCTACCAAAGAGCAGTGGAAAGAAATGAAGGCCCAGGAAGAGCAACCAATAGAGCTGAAGCAAGAAGAACATATTGAAACACATGACTGCTCTACAGAGCCCTTCGTTAAGATGGAGCATTTTGATTCAGATGATGAGGACAAGGAGAAGGAGGACGAAGAGATGCAGGACCTTGAGGTGGTACTGGAGGTCAGTAGAGTGGAACACCTTAAGTCCTACGAGAGAACGAACGATTTTAGGGTGGTCAACGCCGTGCAAGCGGCTCCCTCCTTCAAACGCAACAAATACAATGCTGACCACTGCGCAGTGAAAGGATGCCAAGTAACAGTGAAGGACGTTGACGAGAACATAAAGCTTCACGCCTTTCCCACCACCACAGAGACGATACAGATGTGGTCCCACAACACACAAGTGGACATGGATGTAAGGTGCCGATGGCGCTATCGAATCTGCAGCAACCACTTTGAGCAGGAGTGCTTCTTTGGTAGCAGACTCAGAAAGGGATCC
ATTCCTACTCTGTTTCTGGGACCTCAAACACCAAGAAAAGTGTACAGCAGTATGTTTTCTTCAGCCGATGCTAATCCTTCTTTTGAACCAAATAGCGACTTAGCTCCCGAACCGTCATCCAACGACTTTCAGATAAAACTAGAAAAGGATGAGAATGAGATTATCCTAGGGCATCAGGATGTCTACTTACAATAG
- Protein Sequence
- MRIILIKKRKADYPEVSDGSEEFTICLEQFQLNSSTLHNVSQLGYSSPTPDQDQASISIIRDAQPQLQNSAFRKGSHFEPYAQIEEGTHTKWGAHIEEVVKEEPRIQEPSHLELENSVEIKEEASVSTNTESTGHPRLKKIRKCCIPNCGQYESTEVRVFRFPAEAQMLVLWLVNTKLKPRLVNPLEMCVCQFHFEPEAINKDQLTSWAVPTLKLGHEGPVVPSPKLKKYTLDCRETRCVVTGCAEDKSEPSDVYYRLPKSPALLDAWSCNLWISPSELLMGDQRICGRHFEDYCKKTTKSLYAGSLPTLFLGHEENILPNPRSLMEREENKSYDKYSASSCAVPGCWVTFKVITNTFGLHKLPSSYEAAEMWLQNIHVEMDEGLWPRIRVCSHHFEQDCFQNFRIREGAMPTLCLGPRTPEFLHENQPPSNQDLATSPQDQAKTNITSPTDIAQRKRNLQATPEKEGPVEKKKTPGSNGASKNCCVKSCRNDDESKLQPMPEEPSLLKKWLHNLKQSKDGDTTLLRVCPQHFEPHLVRNGTPLEQMVPTLNLGHSSVNIYRNKEVCLMTDCPSARLSQCVYVDLPRNTAVKEAWLSHLQLPLTYKGRLCGAHFVELYKLDLPKVLSIQDLEELECTFIEFHCAVPGCHSKSAPETTLVRLPVDETLVSKWLRNTGITYDPLRHSSYRLCLLHFPPKCLEEGILQSWAIPTLHLGQRSTIFLNPEHPVEQKWPKKVPEINGVVSLERVRCIPLCGKEKRLFRFPSEEQQLLLWLVNTQQQPRVVDPRKLFVCMSHFMYFMYIHANKEEEELLATPPKASAPPKVALSGAARKRLRFYLSKGHPIEEARRLAAPPMQKKETPQTASKRMRSVNTTPSSQSPKRAMTQSDTTSDISSGMAAPAPLITVEVTMAPVATTTTAASAKEPVNSEADVAQPTENGVGPSYRDAVSNTKLAVFPEGYPSTLYSVEQLKVLQSAILDAVIADRKGTTKPHFSGCSFRPGWLAVSCRDSNTADWLKTTLPKIKVQLGARPKVVIESEMPRPQIFVGYFPETKKDITSEEILALLEGQNEGLRVSDWRVLRRELKGNMVSLTLSVDAVSANMLKVSSNKVDFAWGTVSLRPKAKPTVAAQQADVAKTGVAPTATANVTSVQPGNSRQAATNNLPSTMAPQQPRMRPPLFGRLPRPAGRQGKAEYPEVSDGSEEFTICLEQFQLNSSTLHNVSQLGYSSPTPDQDQASITIIRNGQPQPQNSAFRKGSHFEPYAQSEEGTHTEWGAHIEEFVKEEPRIQEPSHLELENSVEIKEEASVSTNTESTGHPRLKKIRNCCISNCGQYESTQVPVYRFPTEAQMLALWLVNTQLKPRLVNPLKMCVCQFHFEPEPINKDQLTSWAVPTLKLGHEGPVVPSPKLKKYALDCRETRCVVTGCAEDESEPSDVYYRLPKSPALLDAWSCNLWISPSDLLMGDQRICGRHFEDYCKKTTKSLYTGSLPTLFLGHEENILSNPRSLMEREENESYDKYSASSCAVPGCWVTFKDVTKTFGLHKLPSSYEAAEMWLQNIHVEMDEGLWPRIRVCNHHFEQDCFQDFRIREGAMPTLCLGPRIPEFLHEKQPPSNQDLATSPQDQAKTKITSPTDIAQRKRNLQATPEKEGPVEKKKTPGSNGAPKKCCVKSCRNDDESKLQPMPEEPSLLKKWLHNLKQSKDGDTTLLRVCPQHFEPHLVRNGKPLEQMVPTLNLGHSSVNIYRNKEVCLMTDCPSARSSQCVYVDLPRNTAVKEAWLSHLQLPLAYKGRLCGTHFVELYKLDLPKVLSIQDLEELECTFIEFHCAVPGCHSKSAPETTLVRLPVDETLVSKWLRNTGITYDPLRHSSYRLCLLHFPPKCLEEGILQSWAIPTLHLGQRSTIFLNPEHPVEQKWPKKVPEINGVVSLERVCCIPLCGKEKRLFRFPSEEQQLLLWLVNTQQQPRVVDPRKLFVCMSHFEPEAIYRNQLRSWMVPTLNLG
HDGYVMRNVQHNGSIADSQDIEAALKFIRKRYCSVLSCFRIKGDGVRLFEYPKDKDIIRKWAAKCKHRSLHAISHGFRVCQDHFAPDCFDPDTENLIEGSVPTLHLSSGVAAETEQGTCMVPDQKNCLVPDQETCLVPDQETCLVPGCSNKETGQRIRYFSVPKRAYLVDQWCLNLRIHYADLIHGSQRICHRHFEPHCFKSATVLRPNSFPTLYLGPGVDDGNDNRCCVPGCGKDQTVGPLYSFPSVRSLFEQWMHNIRLDLDKDQRLILKVCSVHFEAFCLTAKKRLRLGSIPTLKLGHSYPDIYKTKMGILGKAHKFHKRPLMLEVDCCYPECSEQPKNLAYDLPRDNLLRLAWLHHMGIDEAAKDSQDSPQLCPLHYVMLYDHSAKSVPERVPDRILTDNYEDARRKRRVKLVSCAVKGCNTLKPRDGIRLHGLPTTRMILQLWVDNGQLELGDHLQYTRVCRKHFEPQCLATEQRLVSWGIPTLHLPGRVVHQIPTKEQWKEMKAQEEQPIELKQEEHIETHDCSTEPFVKMEHFDSDDEDKEKEDEEMQDLEVVLEVSRVEHLKSYERTNDFRVVNAVQAAPSFKRNKYNADHCAVKGCQVTVKDVDENIKLHAFPTTTETIQMWSHNTQVDMDVRCRWRYRICSNHFEQECFFGSRLRKGSIPTLFLGPQTPRKVYSSMFSSADANPSFEPNSDLAPEPSSNDFQIKLEKDENEIILGHQDVYLQ
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -