Amei002037.1
Basic Information
- Insect
- Apis mellifera
- Gene Symbol
- Znf541
- Assembly
- GCA_003254395.2
- Location
- NC:1279705-1750482[+]
Transcription Factor Domain
- TF Family
- MYB
- Domain
- Myb_DNA-binding domain
- PFAM
- PF00249
- TF Group
- Helix-turn-helix
- Description
- This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 2 4 2.9e+03 -2.4 0.1 29 42 1160 1173 1158 1174 0.88 2 2 8.9e-12 6.5e-09 34.9 0.0 2 46 1326 1370 1325 1370 0.96
Sequence Information
- Coding Sequence
- ATGACAGCCATCAACTCgggtCCGGTCGGGGAAGTGGGCGGCGTCAGGCTGCCGTTGCAATCATTACATGGGTACGGATCTGTGTTCATGTACTCGGCGTCGACGAGTCCCGGGGGTGGAGGTGGCCAGGGTGGCGGAGGCGGAGGTGAAGTGACCCTCCGCCTCCGCGAACCTGAGGATATTCCCGAAGAAGTTCTTGCACGCCTCGAGGACACGGCCACCCTCTCCATCTCTCTTCAAAGCGACGCAGTGTTGAGGCTGGGGGCAGCTGGCACGGGAAACGGGAATTTAGAATTGTTCGATAGTGAGAGCGGGCCGGACCTCACACTATCACCTCAGACCTTCACGTCGACGGCGGAGGCCTTCATCAATGACAATAGTGATAGTCTTGGTGTTCCAGGAGACAACGATACGGGTAGCAGCGAGGAATCGAGACCTGGGGGTGGGGATCCTGGAAAGTTGGGGGTGAAGAAACCAAGACCGAAGGCTTCGTCGCCCAACCGACAGGGACCACAGCAGTGCCAGGTCTGCGGGAAGGTGTTCAACAACGCATCGGCGCTCACGAAGCACAAGCTGACGCACAGTGACGAGAGGAAATATGTGTGCACGATGTGCGGCAAGGCGTTCAAGCGGCAGGACCACTTgaacGGGCACATGCTAACGCATCGAAACAAGAAGCCGTACGAGTGCAAAGCGGAGGGATGCGGAAAATCGTATTGCGACGCGAGGAGTCTGAGGAGGCACACGGAGAATCATCATGCCGGTAGCAAGGTGAACGAAAGCGTTAGTCCTAGCAGTCCAACCACCGGTCCTCACACACCAAACACACCGGGAAGTAATCCCAGCACGCCAAGCACACCGGGAGCGGCGTCGACGCCCAACGGCCATGGACATCCGGCCTTAAAACAATTGCTCGCCACGGAACCGTCACAGGCGCAGCAGCAAAAGaGTGCCACGTCTCCTGGTGCTAGCAACGATGGTCTCACGAAACAGCAACTGGAGCTCATTCAACAGATTATGCAACAAAcacagcagcaacagcaacaacagcaacaaacATCCGCGCAACAGCCACAGCAACAACGCAGCAATTCATCAAGCACAGTCACAGCGCAAATTCAAAAAACGCAAAAACCGTCCGTCAAGTCCACAACTTCATCTGGCAgcgtttcgaataattctgGAAGTTCAAATGCAACAAATAACGCGAGATCCAGCCCCAAACCTAAAGTCTGGAGTCACGTACAGcaacagcaacagcaagCACAGCAGCAAGcacagcaacagcaacaacagcaaccGCAACAACAGCATCAGCAACAACAGCCACAGGCTAATTCGCCAGGAGGTGGAAATGTGGGGAAAAGCAGCCCGTCGCCGGGAAATTCGGCATCCCCTGGCACTGTGACAGCGTCCAACAAGTCGCAGACCGAACCGAAACCGGTCGAGTGCAATTTGTGCcatcgaaaattcaaaaatataccgGCGCTAAATGGCCACATGAGACTCCATGGCGGTTACTTCAAAAAGgATGCTGAGAACAAGAAAAGCGAGAAGAAAGAGGTTGTCGGTCCGCCTTTACAAACCGCGTCTGTCAGCGTTAGAGCGctcattgaagaaaaaattatacagaaaAGAACGACTGCAGCGGACACAAATTCAAATCAACAGGCACGCACAAACGCAACAGTTTTTACCACCCAAGCCGACAATGCGTCGCAAATTTCCGGTAAAACGAGCTTCGCGGTACCAGCACCGCCTCCATTGGCCTCCGCGGAAAAGGTCCGCAGACTTTCGGATGGCGAGCACTTCCGGCAGCCCAATGTCACCGTTTCCGGCCACACGACCGTGCAAAAGCAGCTGCAGGAAGCGCAGACCCAGGCTCAGGCGCAGGCGCTGGCCGACATGATCTTGAAGgGAACTACTAAAATGGCTGTGAAAAGAACAACTTCGGATCCCGGGCAAAGATTGCATCTGACTTATCAGACATCGGTTCAATCGACGACTAgtcagcagcaacagcagcagcaacaacagcagcagcagcagcagcaacagcagcagcagcagcaacagcagcaaaaTAATCAGGCTCAAGCTCAGCCCACGGTTACTGAAAGTTTCTCAATGACAGGTTACTCTGAAGACGGTGGTTATTTTAGCCCAAGCCTTCAAGATGAGGTGTTTCAACAAGTCACTGCAGTTCCTGACAGCGTTCTTCTTCATGCTACTCAATTGGATTCACTTCAATTTCAGACAGCGAGCCTGTTACAAGAACAGCCCGCGGAGCAATTGCAAGACATCACTCTGGAGGACCGATATCCATCGCAGCACACGGTGAACCCAGACCTGCAGGCGTTGGTGAACTCCCCGTTGCCAGACTCGTTAGCCGAGTTCAGCACGTACGGTGCCCAATACACGGAGAGCCCGCACACGTTACCCACCCAGTCCCCCCTGCCATCGCCGTTGACGAGACACGACAGCCCCAGCTTCACATATCCAACGCCACCAGCGAGCCAGGAGGGCCTTCTCACGGGCAGCTTCCCCCTGCTCAGCCCTCAAGAGGATCCGGAGGCCGTGAGACAAGTCTCGTCACCGTTGTCCGCTGCCTTCTACGCGAGCACCATGTCCTCGGCGGCTGCCGTCGAGGAAGCTCTGAGCGAGGTGTTGCCAGGCGAATCCGAAGCAGATACCGACCTGTACGGGTCCGGTTCACCTAATCCTCACTCGCCACTGCCAAGCCCGTTATCGGCGACCCCGGCACCGTCGCCGTTGTCCTCGTTGCCGCCCTCCTCGGTCTCGTCACCCGGCCCCGTGTCGTTCACGGGCACGAGTTTCCCCGTGTCGCCACACCACGCCCTCCAGAATCAGATGATGCCCAACTCGGAGGATCCCCTGCTCTCCTCGACCCCGAAAGACTTCACCACGGCCAGCCGGAGACGGTTCGAGTTCCAATCGTACAAATTTATCACCAGCCAGAACTTGGTGGACTTTGGGCTGGGTAACGGCAGCCTGGCCGGGATCGTGGTGGACAACAACGGCGAGTTCAAGCTCATCCAGACGGCGGGGTTGCAGAAGACTAATGTGCTGGTGCAAACTGGCTCCCTGAGCCCGGCCCTCGCGTTCAAGAAGGAGATGAGGCTGGCGACACCCAAGATGGAGCCCGTCACCGTCAACCTTGGCCTGAACGTGCAGACGAACGTGCATCAGTACAACGCCGGCCAGTTCAATCAGCAACAGGTGGTGAACCCGACCAAGTTTCTCATACAGACGAACGTGGCCAAGGGTGGCAGTCTGAGGCAGAAGGTGGCCGCCTCCGGTGGAGGGAACCTGCCTGTCCCGGTCGAGCAGATCAAGGAGGAGCTGTTGGACGAGGACGTGTTCCTTAATCCCAGCAACGTGCCGGCTGGCAGCCCTGTCAGGCAGTGCAGGAAAAGGCCACGCATGGACGGGGGGGGAGGCCTCTACGCGAGCGCCTCGTATCCGTCCAGATTGAGGAAAGCCTGCGACAGGCATTGGGACAGCAGCTACACGCCGCCACCGATCTTGGATCCATCCCGCCCCGGGCCGGGCCTCTACGCCAGGCTGCATCAGTACGAGAGGGATTCTGACTTCAGCTCCGACGATTCCCAAGGGAACGACGGCCCTCCACCTAGAATTAATATCGGCACGAGGTATCAGGCTACGATACCACCGGTCGGCAGCGACGGGGACAGAGGAAAGGGGGAGCCGGAGGCCGACCATCTGCTCTGGGATCCTGGTATAAATAACGTGCTGACGGACAATGAATTGGATATGTACCTACAATTTGCATGCTGCGCCGCGGTACCAGGTGGAGGAAGAAACAAAGAATACGCGCTTCATCTGCTGCACATGTGCAAAGGGAATATTCATGAGGCCATGCTAAAGTTAATGAGACCAACACCTCCGTTGCCAGCGGAACACCCGTTGCTCAGTTACGAGTGTCACGAGTCCGACCGATGGACGTCGCAAGAGATGGACGCATTCTACCAGGGCCTGTTGAAGTACAACAAGGATTTCTCTGCGATCTCGCGGGACGTGGGTGGCAAGTCCGCGAAACAATGCGTGCAATTCTATTATCTGTGGAAGAGGCTCTGCCCGGACGAATACAGCAGACTTCGTGTTCGCCACGGTAAACCAAAGATTAAGACAGAGAGCAAGGATTGCAAGGATACCAGGGAGCTTCGCGACGCCATTGCATCGGTCACGGAGATGGACTTTAGCGAAGACAAATCCATTCTTCATCGCACTCTGTTACAGACAAATGAGAGAGAAAACTCGGCTACTCTGACCACCAGCGACGGAGAGCTGAGATTATTTGCATACGACTGCCCCGATAGCTTTAGCGGAAGTGTAACAGTAACGATGGCCGGGAGCACCCAAACCGCCCAAACCGGCAATACCGGCCACGCCACAGTCAACACCAACTCACAAACTCACACTAGTTCTGAGCAACAACTACCCGTGCCCACCCCACTTCACTACCCCTGCAAGATTTGCGGGAAAGTTTTCAACAAAGTGAAAAGCAGAAGCGCGCACATGAAATCCCACAGGCCGGTACCCTCGCCCGATTCAACGGGGCAGGAATCTAAGAGGCCTGTCCAACAGCAATCCAAGACGCAACAATCCCAACAATCCATCCAGCAACAGCGtcaacaacagcagcagcaaccgcagcaacaacagcagcagcaacagcaagcGAGCAGCGTTTCATCACAGGCCCAAGATTTCAATCAGCAGCAACTGCCTCCCAGCAACGGCGACACCGGCGCTCAGAACCCAGCGCATTTATGGCACAATCCGACACGGTTCAGGCCACCATAG
- Protein Sequence
- MTAINSGPVGEVGGVRLPLQSLHGYGSVFMYSASTSPGGGGGQGGGGGGEVTLRLREPEDIPEEVLARLEDTATLSISLQSDAVLRLGAAGTGNGNLELFDSESGPDLTLSPQTFTSTAEAFINDNSDSLGVPGDNDTGSSEESRPGGGDPGKLGVKKPRPKASSPNRQGPQQCQVCGKVFNNASALTKHKLTHSDERKYVCTMCGKAFKRQDHLNGHMLTHRNKKPYECKAEGCGKSYCDARSLRRHTENHHAGSKVNESVSPSSPTTGPHTPNTPGSNPSTPSTPGAASTPNGHGHPALKQLLATEPSQAQQQKSATSPGASNDGLTKQQLELIQQIMQQTQQQQQQQQQTSAQQPQQQRSNSSSTVTAQIQKTQKPSVKSTTSSGSVSNNSGSSNATNNARSSPKPKVWSHVQQQQQQAQQQAQQQQQQQPQQQHQQQQPQANSPGGGNVGKSSPSPGNSASPGTVTASNKSQTEPKPVECNLCHRKFKNIPALNGHMRLHGGYFKKDAENKKSEKKEVVGPPLQTASVSVRALIEEKIIQKRTTAADTNSNQQARTNATVFTTQADNASQISGKTSFAVPAPPPLASAEKVRRLSDGEHFRQPNVTVSGHTTVQKQLQEAQTQAQAQALADMILKGTTKMAVKRTTSDPGQRLHLTYQTSVQSTTSQQQQQQQQQQQQQQQQQQQQQQQQNNQAQAQPTVTESFSMTGYSEDGGYFSPSLQDEVFQQVTAVPDSVLLHATQLDSLQFQTASLLQEQPAEQLQDITLEDRYPSQHTVNPDLQALVNSPLPDSLAEFSTYGAQYTESPHTLPTQSPLPSPLTRHDSPSFTYPTPPASQEGLLTGSFPLLSPQEDPEAVRQVSSPLSAAFYASTMSSAAAVEEALSEVLPGESEADTDLYGSGSPNPHSPLPSPLSATPAPSPLSSLPPSSVSSPGPVSFTGTSFPVSPHHALQNQMMPNSEDPLLSSTPKDFTTASRRRFEFQSYKFITSQNLVDFGLGNGSLAGIVVDNNGEFKLIQTAGLQKTNVLVQTGSLSPALAFKKEMRLATPKMEPVTVNLGLNVQTNVHQYNAGQFNQQQVVNPTKFLIQTNVAKGGSLRQKVAASGGGNLPVPVEQIKEELLDEDVFLNPSNVPAGSPVRQCRKRPRMDGGGGLYASASYPSRLRKACDRHWDSSYTPPPILDPSRPGPGLYARLHQYERDSDFSSDDSQGNDGPPPRINIGTRYQATIPPVGSDGDRGKGEPEADHLLWDPGINNVLTDNELDMYLQFACCAAVPGGGRNKEYALHLLHMCKGNIHEAMLKLMRPTPPLPAEHPLLSYECHESDRWTSQEMDAFYQGLLKYNKDFSAISRDVGGKSAKQCVQFYYLWKRLCPDEYSRLRVRHGKPKIKTESKDCKDTRELRDAIASVTEMDFSEDKSILHRTLLQTNERENSATLTTSDGELRLFAYDCPDSFSGSVTVTMAGSTQTAQTGNTGHATVNTNSQTHTSSEQQLPVPTPLHYPCKICGKVFNKVKSRSAHMKSHRPVPSPDSTGQESKRPVQQQSKTQQSQQSIQQQRQQQQQQPQQQQQQQQQASSVSSQAQDFNQQQLPPSNGDTGAQNPAHLWHNPTRFRPP
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01122371;
- 90% Identity
- iTF_00142660; iTF_00230629; iTF_00230890; iTF_00214593; iTF_00214331; iTF_00684984; iTF_00684732; iTF_00760922; iTF_00761187; iTF_00232190; iTF_00231920; iTF_00232544; iTF_00232802;
- 80% Identity
- iTF_00142660;