Obic010749.1
Basic Information
- Insect
- Osmia bicornis
- Gene Symbol
- Trerf1
- Assembly
- GCA_907164935.1
- Location
- NC:3208107-3449775[+]
Transcription Factor Domain
- TF Family
- MYB
- Domain
- Myb_DNA-binding domain
- PFAM
- PF00249
- TF Group
- Helix-turn-helix
- Description
- This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 6.1e-09 2.8e-06 27.8 0.0 2 46 1327 1371 1326 1371 0.96
Sequence Information
- Coding Sequence
- ATGGCAGCCACGAGTGACAACGGGCTAGTGGACACAGATCCAGGTCGAGTTAGCAGCATGACAGCCATCAACTCGGGTCCGGTCGGGGAAGTGGGCGGCGTCAGGCTGCCGTTGCAATCATTACATGGGTACGGATCTGTGTTCATGTACTCGGCGTCGACGAGTCCCGGGGGTGGAGGTGGTCAGGGTGGCGGAGGCGGAGGCGAAGTAACCCTCCGCCTCCGCGAACCCGAGGACATTCCCGAAGAAGTTCTTGCACGCCTCGAGGACACGGCCACCCTCTCCATCTCTCTTCAAGGCGACGCTGTTCTCAGGCTGGGGGCGGCTGGCACAGGCAACGGGAATTTAGAGTTATTTGATAGTGAGAGTGGGCCGGACCTCACACTATCACCTCAGACCTTCACGTCGACGGCGGAGGCCTTCATCAATGACAATAGTGATAGTCTTGGTGTTCCAGGAGACAACGATACGGGTAGCAGTGAAGAATCGAGACCCGGTGGCGGGGATCCTGGAAAGTTGGGGGTGAGGAAACCAAGACCGAAGGCTTCGTCGCCCAACCGACAGGGACCACAGCAATGCCAGGTCTGCGGAAAGGTGTTCAACAACGCATCGGCGCTCACGAAGCACAAGCTGACGCACAGCGACGAGAGGAAATACGTGTGCACGATGTGCGGCAAGGCGTTCAAGCGGCAGGACCACTTGAATGGCCACATGTTGACGCATCGAAATAAAAAGCCGTACGAATGTAAAGCGGAGGGATGTGGTAAATCGTACTGTGACGCGAGAAGTTTGAGAAGGCATACAGAGAATCATCACGCAGGTAGCAAAGTAAATGAAAGTGTTAGTCCTAGCAGTCCTACGACCGGTCCTCACACTCCTAACACACCAGGAAGTAATCCTAGCACACCCAGTACACCAGGTGCAACATCGACGCCGAATGGCCATGGACATCCGGCTTTGAAACAATTGCTTGCCACAGAACCTACACAAGCACAACAGCAAAAGAGTGCCACGTCTCCTGGTGCCAGCAACGATGGCCTCACAAAACAACAACTGGAGCTCATCCAACAGATCATGCAACAAacgcaacaacaacagcaacagcagcaacaatCTCAACAGCAGCGTAGCAATTCGTCAACAACAGTCACAGCACAGATTCAAAAAACGCAAAAACCGTCCGTCAAATCCACAACCTCGTCTGGAAATGTTTCGAATAATTCCGGAAGTTCGAATGCAACTAGCAACGCGAGATCCAGCCCGAAACCAAAAGTCTGGAGTCACGTTCAGcaacaacagcaacaagcCCAGCAGCAGGtccagcagcagcaacaacaacagcctCAACAGCAACatcagcagcaacaacaacaggcTAACTCACCAGGAGGTGGAAATGTGGGGAAAAGTAGTCCTTCACCAGGAAATTCGGCCTCCCCTAGCGCTGTAACAGCAGCAAATAAATCACAGACCGAACCTAAACCGGTCGAGTGCAATTTGTGCCATcggaaattcaaaaatataccAGCTCTCAATGGTCACATGAGATTACATGGCGGTTACTTTAAAAAGGACGCTGAGAACAAAAAGAGTGAAAAGAAAGAGGTTGTTGGGCCACCATTACAAACTGCTTCTGTCAGCGTGAGAGCACTCATCGAGGAGAAGATTATACAGAAAAGAACGACTGcaGCAGACACGAGCACAAATCAACAGGCCCGCGCGAATATGACGGTTTTTACCACGCAAGCAGACAATGCGTCGCAGATTTCCGGTAAAACAAGTTTCGCGGTGCCAGCACCACCGCCGTTGGCTTCTGCGGACAAAGTTCGCAGACTTTCGGACGGCGAACACTTCCGGCAGCCCAACGTCACCGTTTCCGGCCACGCGACCGTGCAGAAACAACTACAAGAAGCGCAGACCCAGGCACAGGCGCAGGCGCTGGCCGACATCATTCTCAAGAGCACTAAAATGGCTGTGAAAAGAACAACTTCTGATCCTGGGCAGAGATTACACTTGACCTATCAGCCATCAGTTCAATCAACAAATAATAatcagcagcagcagcaacaacaacaacagcagcaacagcagcagcaacaaaaTAATCAGCCACAAGCTCAACCTACAGTTACAGACAGTTTCTCGATGACTGGTTATTCTGAAGACGGTGGTTACTTTAGTCCAAGTCTTCAAGACGAAGTGTTTCAGCAAGTCACCGCTGTTCCTGACAGTGTTCTGCTTCATGCTACTCAGTTGGATTCTCTTCAGTTTCAGACAGCGAGTCTGCTGCAAGAACAGTCGACAGAGCAGCTGCAGGACATCACTCTGGAGGATCGATACTCGTCCCAGCACACGGTCAACCCGGACCTCCAGGCGTTGGTCAACTCTCCATTACCCGATTCGTTGGCTGAATTCAGTACTTATGGTGCCCAGTACACGGAAAGTCCGCACACCTTGCCCGCACAATCCCCGTTGCCGTCGCCATTGACCCGACACGACAGTCCCAGTTTCACATATCCAACACCTCCGGCCAGCCAGGAGGGTCTCCTCACGGGTAGTTTTCCACTGTTGAGCCCGCAAGAGGATCCAGAAAGCGTACGACAAGTCTCGTCACCATTGTCGGCAGCTTTCTACACGACAACCATGTCTTCTGCCGCTGCTGTCGAAGAAGCTCTCAGCGAAGTGTTGCCAGCCGAATCCGAAGCGGACGCGGATCTCTATGGTTCCGGTTCACCCAATCCGCATTCTCCACTGCCAAGTCCTCTGTCCGCCACCCCAGCACCCTCGCCATTGTCATCCTTACCACCATCCTCGGTCTCGTCACCAGGTCCGGTGTCGTTCACCGGTACCAGCTTCCCTGTGTCGCCTCATCACGCTCTTCAGAGTCAGATGATGCCCAACTCGGAAGACCCTCTGCTATCCTCCACCCCGAAGGACTTCACCACCGCCAGTCGAAGACGATTCGAGTTCCAGTCGTACAAGTTCATCGCCAGCCAGAACTTGGTGGACTTTGGTCTGGGCAATGGTAGTTTGGCAGGGATCGTGGTGGACAACAACGGCGAGTTCAAGCTGATCCAAACAGCTGGCCTGCAGAAGACGAACGTGTTGGTTCAGACGGGTTCTCTGAGCCCGGCTCTCACTTTCAAGAAGGAAATGAGACTGGCGACACCCAAAGTGGAGCCAGTCACCGTGAACCTGGGCCTAAATGTGCACACCAACCATCAGTACAATGCTGGGCAATTTAATCAACAACAACTGGTGAACCCGGCTAAGTTTCTCATACAGACGAACCCTGCCAAGAACTGTAACTTCAGACAGAAGACGCCTAGCGGCAGTAGTTTACCGATTCCAGTGGAGCAGATTAAAGAAGAACTTCTCGATGAAGATGTGTTTCTTAATCCGAGCAACGTGCCTAACGGCAGCCCGGTCAGACAGTGTAGGAAAAGGCCGCGTATGGAGGGGAACTTGTACGCGAATCACTCGTATCCTTCGAGGTTACGAAAGGCCTGCGATCGACACTGGGATAGTAGTTACACACCGCCACCCATATTGGACCCATCTCGTCCTGGACCTGGATTGTACGCGAGGTTACATCAATACGAAAGGGATTCCGACTTTAGCTCGGATGATTCTCAAGGAAATGATGGACCTCCGCCTAGGATTAACATCGGAACGAGATACCAGGCAACGATACCACCTGTGGGTAGCGATGGGGATAGAGGAAAGGGAGAACCGGAAGCTGATCATCTGTTATGGGATCCTGGTATAAACAACGTGCTGACGGATAACGAACTGGAGATGTATCTGCAATTTGCATGCTGCGCTGCGGTGCCTGGTGGGGGAAGAAACAAAGAGTACGCTCTTCATCTGTTGCACATGTGCAGAGGAAACATTCATGAAGCGATGCTGAAGTTGATGAGACCAACGCCCTTATTGCCAGCTGAACATCCGTTGCTCAGTTACGAGTGTCACGAATCGGACCGATGGACGTCGCAGGAAATGGACGCTTTCTACCAGGGTCTGTTGAAATATAACAAAGACTTCTTAGCGATTTCACGAGACGTGGCTGGCAAGTCAGCGAAACAATGCGTGCAATTCTATTACCTGTGGAAGAGGCTCTGTCCGGACGAATACAAGAGGCTGCGTGTTCGCCATGGTAAACCAAAGATTAAGACAGAGAGCAAGGATTGCAAGGACACGAGGGAACTTCGCGACGCCATCGCGTCTGTCACGGAGATGGACTTCAGCGAGGACAAATCGATTCTCCACCGCACCCTGTTACAGACAAACGAGAGAGAAAACTCGGCCACTCTGACCACCAGTGATGGAGAGCTGAGATTATTTGCATACGATTGCCCTGATAGCTTTAGCGGAAGCGTAACAGTAACGATGGCCGGGAGCACCCAAACCGCCCAAACCGGCAATACCGGCCACGCCACAGTCAACACCAACTCACAAACTCACACTAGTTCTGAGCAACAACTACCCGTGCCCACCCCACTTCACTACCCCTGCAAGATTTGCGGGAAAGTTTTCAACAAAGTGAAAAGCAGAAGCGCGCACATGAAATCGCACAGGCCGGTGCCCTCGCCCGATTCAACGGGTCAGGAATCTAAAAGGCCGGTTCAGCAACAATCAAAAATGCAACAATCTCAGCAGTCGAATCAACgccaacaacagcagcaacaagctcagcagcagcagcaacaattGCAGCAACAGCctcaacagcaacaacaacaatccAGCAGTGTATCACCGCAGGCTCAAGACTATAATCAGCAGCAACTGCCTCCTAGCAATGGCGACACCGGTGCTCAAAAGCCATCCCATTTATGGCACAACCCGACACGGCTCAGGCCACCATAG
- Protein Sequence
- MAATSDNGLVDTDPGRVSSMTAINSGPVGEVGGVRLPLQSLHGYGSVFMYSASTSPGGGGGQGGGGGGEVTLRLREPEDIPEEVLARLEDTATLSISLQGDAVLRLGAAGTGNGNLELFDSESGPDLTLSPQTFTSTAEAFINDNSDSLGVPGDNDTGSSEESRPGGGDPGKLGVRKPRPKASSPNRQGPQQCQVCGKVFNNASALTKHKLTHSDERKYVCTMCGKAFKRQDHLNGHMLTHRNKKPYECKAEGCGKSYCDARSLRRHTENHHAGSKVNESVSPSSPTTGPHTPNTPGSNPSTPSTPGATSTPNGHGHPALKQLLATEPTQAQQQKSATSPGASNDGLTKQQLELIQQIMQQTQQQQQQQQQSQQQRSNSSTTVTAQIQKTQKPSVKSTTSSGNVSNNSGSSNATSNARSSPKPKVWSHVQQQQQQAQQQVQQQQQQQPQQQHQQQQQQANSPGGGNVGKSSPSPGNSASPSAVTAANKSQTEPKPVECNLCHRKFKNIPALNGHMRLHGGYFKKDAENKKSEKKEVVGPPLQTASVSVRALIEEKIIQKRTTAADTSTNQQARANMTVFTTQADNASQISGKTSFAVPAPPPLASADKVRRLSDGEHFRQPNVTVSGHATVQKQLQEAQTQAQAQALADIILKSTKMAVKRTTSDPGQRLHLTYQPSVQSTNNNQQQQQQQQQQQQQQQQNNQPQAQPTVTDSFSMTGYSEDGGYFSPSLQDEVFQQVTAVPDSVLLHATQLDSLQFQTASLLQEQSTEQLQDITLEDRYSSQHTVNPDLQALVNSPLPDSLAEFSTYGAQYTESPHTLPAQSPLPSPLTRHDSPSFTYPTPPASQEGLLTGSFPLLSPQEDPESVRQVSSPLSAAFYTTTMSSAAAVEEALSEVLPAESEADADLYGSGSPNPHSPLPSPLSATPAPSPLSSLPPSSVSSPGPVSFTGTSFPVSPHHALQSQMMPNSEDPLLSSTPKDFTTASRRRFEFQSYKFIASQNLVDFGLGNGSLAGIVVDNNGEFKLIQTAGLQKTNVLVQTGSLSPALTFKKEMRLATPKVEPVTVNLGLNVHTNHQYNAGQFNQQQLVNPAKFLIQTNPAKNCNFRQKTPSGSSLPIPVEQIKEELLDEDVFLNPSNVPNGSPVRQCRKRPRMEGNLYANHSYPSRLRKACDRHWDSSYTPPPILDPSRPGPGLYARLHQYERDSDFSSDDSQGNDGPPPRINIGTRYQATIPPVGSDGDRGKGEPEADHLLWDPGINNVLTDNELEMYLQFACCAAVPGGGRNKEYALHLLHMCRGNIHEAMLKLMRPTPLLPAEHPLLSYECHESDRWTSQEMDAFYQGLLKYNKDFLAISRDVAGKSAKQCVQFYYLWKRLCPDEYKRLRVRHGKPKIKTESKDCKDTRELRDAIASVTEMDFSEDKSILHRTLLQTNERENSATLTTSDGELRLFAYDCPDSFSGSVTVTMAGSTQTAQTGNTGHATVNTNSQTHTSSEQQLPVPTPLHYPCKICGKVFNKVKSRSAHMKSHRPVPSPDSTGQESKRPVQQQSKMQQSQQSNQRQQQQQQAQQQQQQLQQQPQQQQQQSSSVSPQAQDYNQQQLPPSNGDTGAQKPSHLWHNPTRLRPP
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01122661; iTF_01123004; iTF_01123261; iTF_00141443; iTF_00141173; iTF_00142660; iTF_00142409; iTF_00230629; iTF_00230890; iTF_00214593; iTF_00214331; iTF_00385253; iTF_00385826; iTF_01071207; iTF_01070965; iTF_01087440; iTF_01087143; iTF_00684984; iTF_00684732; iTF_00760922; iTF_00761187; iTF_00232190; iTF_00231920; iTF_00232544; iTF_00232802; iTF_01228584; iTF_01228313; iTF_01228956; iTF_00754569; iTF_00755030; iTF_00264939; iTF_00265247; iTF_01077733; iTF_01077401; iTF_01099128; iTF_01099460; iTF_01270445; iTF_01270749; iTF_00182194; iTF_00181908; iTF_01477508; iTF_01477856; iTF_01355686; iTF_01355251; iTF_00360845; iTF_00361150; iTF_01476493; iTF_01476117; iTF_00181244; iTF_00181540; iTF_00417843; iTF_00417499; iTF_01120437; iTF_01120150; iTF_00898686; iTF_00899079; iTF_00016895; iTF_00016622;
- 90% Identity
- iTF_01122661; iTF_01123004; iTF_01123261;
- 80% Identity
- iTF_01122661;