Fyps018532.1
Basic Information
- Insect
- Fissipunctia ypsillon
- Gene Symbol
- ZNF541
- Assembly
- GCA_947568875.1
- Location
- OX387661.1:14620281-14658355[+]
Transcription Factor Domain
- TF Family
- MYB
- Domain
- Myb_DNA-binding domain
- PFAM
- PF00249
- TF Group
- Helix-turn-helix
- Description
- This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 6.7e-08 9e-05 23.3 0.1 2 46 855 899 854 899 0.96
Sequence Information
- Coding Sequence
- ATGCACTTAGTAGGTACACAATCGCCATTTCTTCCACTTGAACAAAAAGAAGTCGGCGAACAAAAACAAAAGAACATCAGATCATTAGGGGTTTGCAACAAGATCTTCGGCAATGCGTCGGCGTTGGCGAAACACAAACTAACGCACAGCGATGAACGCAAGTACGTGTGCATCACATGCGCTAAGGCGTTCAAGAGACAGGATCATCTCAACGGGCACATGCTAACACATCGCAACAAGAAACCTTACGAGTGTAAAGCAGACGGGTGCGGCAAGTCATACTGTGACGCGAGGTCGCTGCGACGACATACTGAGAACCACCATCAACCGCCGGCACCTGAATCAAGTTCATCAAGCATACCAAGCAGCAGTGTCATTACGATTGTGGAGCGCGAGGTGACCGCGACCGCGAACGCAAATGCGAACGCGAATTCGAACAGTCATCATGCGCAGCGAGTCGCGTCCCCCATCTCACCTGCGCGTACTCAACCACCACATTCTGATTCAAGCAATGGATCAGACAAGTCAATCGCGACTTCAAGCAGTACCAACTGTGACTCGAGCCCCCCTTGCACCCCGCCCGCGCCGCCTACACCCCCTGCGAGGCCCAAGCCTAAGAGCAGTAGCAAGCCCAAGTCTTGTACGACGCAGCAAAGCAGTCAGAAAGCTTCGAGTGGTGCGAGTGCTGGCGGGGCGTCGAGCGGAGCGGGCAGCAGTGCCGCGTCCGCTGCCACGCGAGCCAGCGATGTCAAGCCGGTAGAATGCAACCTCTGTCATCGGAAGTTTAAAAATATACCTGCTCTAAACGGTCACATGCGATTGCATGGTGGATACTTCAAAAAGGATTCTGACAGCAAAAAACTTGATAAAAAGGAGTCAACGGGACCACCTTTACAAACGGCTTCGGTATCCGTTCGAGCGCTTATAGAAGAGAAAATCATAAGCCGACGAGGTGCAACTGTTGCTTCCACACCATCTACAGGTACAAGTACAGACTCAGTGGCATCCCGCACAGGGTTCGTAGCGCCGGCTCCTCCACCACTGTCTAACATCCGAACTTCAGTAGCATCGCCAGCGTCCCCGGTTGCCACGTCGTCGCAATTCGTCTCCCCTCGGGCACCACCCGTCGTCACTGTCGCCTCAAACGCCGCAACTAGTATCTGTGCCGCACAGAGAGATTCCACCCTTATAGAATTATTGAAGAAGGGGAACACTAAGGCTGTAAAGCGATCAGCATCAGATCCCGGTCATTCACCGCCACAACAAGACTTCACTTTCCGGCCAGAATTATTCGGAGTCTCCTTCAATTCAGATGATGGCTACTTTTCACCAGCATTGAACGAAGATACTTTTCAATTTACCACAACTCACGACCAATTAGAGGAACTCGCCTCACTGGAAGACTATGCTACTGTAGCAGCATCTATTCGGGAGCGATCGCCTGTAACGTTTCCTTCCAATCGACGGCTTGCTGCTGTCCTTAATTCACCGCTACCAGAATCCCTTGCAGACTTCGGCGCTTGTCATGGAGGGTCACCGGTTCCATCGCCAGGAATGGGGTATGCTGCGAACTCACCCGGACTGTCCTACTCTACTAATGGTTCTCCAGGGTTATCGTTCACAGCTGCATCGCCAAGTAGCTATTCGAACCACGCCGAGCCATCACCAGGAATGGCATATCCAACGCCTCCAGCTTCCCATGGCGCTCACTCACCAGGTCACAACGCCCCTCGCGCGTCGTCGCCACTCTCAGCTGCTTTTTTCACTGCCACAATGTCTAGTCAAGAAGAGGTGGAGGAAGCTCTAGAAGAAGTGCTGCCAGAAGAGTGTCGATCATTAGATGCGTACGCGCTGGAACCTTCTCCTACGCCACGCCGGATCATGCTTAACTCTGAAGATCCACTCTTATCGAGCAGTCCTCGAGACTTCTCACATCAACGGCCTTTTCGTCGACATAGTCGAATGGCATCGTCTACTATTTCGCCCCTACAACAATGGCAGCACGACTCACTTCAAGTATGTGTGGAGGGTCGAGATACAGTGCCAGCAGTATTTCTGAGTCCAAGCAGCGTGCCTGCGTCCCCTCAGAGACGCAAACGTCGAGCGTCCCCGGCCGGTCCGTTTCGCTCCCGCATCCGGCGCACCTCTAGTCACTACACGCCTTTTCCGATACTGCCCCCGGACCGAGAAGGATCAGGGCTACTTATAGAAAATACAGCAGGTATAGCAGCTACGCAAACCGATGTCCTAGCTCAGCTGGAAGAGCCTCGACATCCACAGATCAACATCGGGCGAGACTTCCAAGCCGACCTGCCGGCGCTGTGTAATGATCGTATAGACCTACATCGCGTGCCGGAACAGCTTCTATGGGACCCTGGCATCAACGATGCTCTAGACGATAATGAAGTGAGAATGTTCATGGAGATGTCCATGTGCGCCGCGATGCCCGTGGGCGGTCACACGCGCGAGTTCGCGCTGCAGACGCTGGGCGAGTGCGGCGGCGACATCCGCGCGGCCACGCTGCGCCTCATGACGCGGCCCGCGGCGCCTGCGCAGCACGAGTCGCGCTGGACCACCGACGAGGTCGAAGCCTTCCTAGCTGGACTAGGCCATTACGATAAGGATTTTTACCGAATTTCGCAGCTGGTAAGAACAAAAGACTCGAAACAATGTATACAATTCTACTACTTCTGGAAGAAACTCACGAAAGATTACAAACCGCTGTACTTACGGAGCTGGAGCATGGATCAGCAGGTTTCAACACAAGGTTCTGTAGGCCAGTATAGCGCGCTCAGCACCTCGGCGTGCGCGTCCTCCGCGCCCACCTTCGATACCGAGGAGTTCCCTTGCAAGATATGCGGGAAAGTATTTAACAAAGTAAAAAGTCGTAGCGCACACATGAAGTCGCACCGGCCACTCGACGCCGAGCCAAAACGGCCAAAACTCGAAAAGCCTTATGAAAAGGTCGAAAGATCCGATGACAGATCCCACGCGACAGCTGAATACCAAAGCAAACCGCCCGCTCAGTAA
- Protein Sequence
- MHLVGTQSPFLPLEQKEVGEQKQKNIRSLGVCNKIFGNASALAKHKLTHSDERKYVCITCAKAFKRQDHLNGHMLTHRNKKPYECKADGCGKSYCDARSLRRHTENHHQPPAPESSSSSIPSSSVITIVEREVTATANANANANSNSHHAQRVASPISPARTQPPHSDSSNGSDKSIATSSSTNCDSSPPCTPPAPPTPPARPKPKSSSKPKSCTTQQSSQKASSGASAGGASSGAGSSAASAATRASDVKPVECNLCHRKFKNIPALNGHMRLHGGYFKKDSDSKKLDKKESTGPPLQTASVSVRALIEEKIISRRGATVASTPSTGTSTDSVASRTGFVAPAPPPLSNIRTSVASPASPVATSSQFVSPRAPPVVTVASNAATSICAAQRDSTLIELLKKGNTKAVKRSASDPGHSPPQQDFTFRPELFGVSFNSDDGYFSPALNEDTFQFTTTHDQLEELASLEDYATVAASIRERSPVTFPSNRRLAAVLNSPLPESLADFGACHGGSPVPSPGMGYAANSPGLSYSTNGSPGLSFTAASPSSYSNHAEPSPGMAYPTPPASHGAHSPGHNAPRASSPLSAAFFTATMSSQEEVEEALEEVLPEECRSLDAYALEPSPTPRRIMLNSEDPLLSSSPRDFSHQRPFRRHSRMASSTISPLQQWQHDSLQVCVEGRDTVPAVFLSPSSVPASPQRRKRRASPAGPFRSRIRRTSSHYTPFPILPPDREGSGLLIENTAGIAATQTDVLAQLEEPRHPQINIGRDFQADLPALCNDRIDLHRVPEQLLWDPGINDALDDNEVRMFMEMSMCAAMPVGGHTREFALQTLGECGGDIRAATLRLMTRPAAPAQHESRWTTDEVEAFLAGLGHYDKDFYRISQLVRTKDSKQCIQFYYFWKKLTKDYKPLYLRSWSMDQQVSTQGSVGQYSALSTSACASSAPTFDTEEFPCKICGKVFNKVKSRSAHMKSHRPLDAEPKRPKLEKPYEKVERSDDRSHATAEYQSKPPAQ
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01342555;
- 90% Identity
- iTF_00908353; iTF_00907010; iTF_01094882; iTF_01095490; iTF_01534434; iTF_00122979; iTF_01093060; iTF_00907845; iTF_00121032; iTF_00071401; iTF_00124713; iTF_01094537; iTF_00924619; iTF_00036652; iTF_01093548; iTF_01246863; iTF_00122352; iTF_01094001; iTF_00071954; iTF_00906098; iTF_01440626; iTF_00925363; iTF_00685972; iTF_00124214; iTF_00745655; iTF_00121401; iTF_00746361; iTF_00120474; iTF_00726874; iTF_01525978; iTF_00121883; iTF_00123867; iTF_00237532; iTF_01439900; iTF_00907518; iTF_00906688; iTF_00148010; iTF_00147421; iTF_00667978; iTF_00667426; iTF_00449070; iTF_00449676; iTF_00363988; iTF_00364454; iTF_01441493; iTF_01441011; iTF_01527801; iTF_01527140; iTF_00758682; iTF_00758125; iTF_00111622; iTF_00112075; iTF_01119792; iTF_01119294; iTF_01118271; iTF_01030665; iTF_01535375; iTF_01533479; iTF_01063251; iTF_00929119; iTF_01084693; iTF_00810548; iTF_00809601; iTF_00274004; iTF_00274888;
- 80% Identity
- iTF_00726874;