Dlim011697.1
Basic Information
- Insect
- Drosophila limbata
- Gene Symbol
- Trerf1
- Assembly
- GCA_963924055.1
- Location
- OZ001352.1:8301994-8319640[-]
Transcription Factor Domain
- TF Family
- MYB
- Domain
- Myb_DNA-binding domain
- PFAM
- PF00249
- TF Group
- Helix-turn-helix
- Description
- This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 2.3e-09 4.6e-06 27.3 0.1 2 45 1744 1787 1743 1788 0.93
Sequence Information
- Coding Sequence
- ATGGCTGCCATAAATACGGTACCCAAAAGCATCCACCTGCCTTTGACAAGTCTCAGCACGCCACGTGTGTTGATGTGCAGCGCCTCCGTTGGCACTGAGGGTTCCGTGACACTGCAATTGAGAGATGCCAATGCCTTGGagacagcaacatcaacagcaacagcagcagcaactaagacaacagcgacagcaacaacaacagcaatagcgtcatcatcatcatcgagCAATCCAACAAATGGCGTCACAGTGCTTGAGGCATCATCAGCGTCAGCATTAGAGAATGCACTGACCATTGGCTATCCGGATCCGGATATGTTAGCAGATGTTTTGGGCACCATACAGACAGCCTCTCTCAAAAAcaagcacagcagcaacagcaacagcaacagcaaaagaagcagcaatagcaatagtagaaacaacagcagcaacagcgcaCCAACAATTTTGAATACAAGCTCTGGGATTCATTTGAATAATCCGAATAAGATTACGATAACGCGTCGAAGTTCGACGAACAAGATCAGTGTACAGACGGTGAGCACGTCGACGAACACGACCATTAGCAGTATTAGCGGCAACAAGACCAACTACATTAAGAACTGTCTAAtacgcagcagcagctgcagcaacactGCCACCAATGTCACGACGCTGGCACAGATtatgagcaacagcagcaccagcagcgcTGCCAGTTTACTCAGTCAGcataacaacaatagcaattgcagcaacagcagcaactttaGCAACGCCTCCTCCGTTGGCAGCAGCATTAGCATTGGCAGCAGCACCAACTGCAGCAAtagtagcagcaacagcagcaccaacagcaccagcagcaacagcaacagcaacgacagcagcaacagcagcagcggcggtcACAGTTTGAGTTTCAAGAGCAACGGTCGCAATGTTCCTGGCGTAgtcacaacaaccacaacaacaacagcagcaaccacaaaCACCACAACAACCGGCATTGGCATCGTCACCGTCGACACTGCGCCCCGAAAGAGTCGTCCCAAGCTATCGTCGCCAACACGCCACGGACCACAGCAGTGTCAGATTTGTAGCAAGATCTTTGGAAATGCCTCGGCGCTGGCCAAGCACAAACTGACGCACAGCGACGAACGGAAATATATTTGTGCGCTCTGCTCGAAGGCATTCAAGCGGCAAGATCACTTAAACGGACACATGATGACGCATCGGAACAAGAAGCCTTACGAGTGTAAGGCCGACGGCTGTGGCAAATCGTATTGTGACGCACGCTCTCTGCGCCGACACTCGGAGAATCACCATGGCGGCGTGGCCACGCCCACCAATAACTCACTCTCGCCCACGGCCagcacaagcagcagcagcagcagcaacggcagcaacattGGTGTTGGTGTCGGGGTTGCCACCAATTCGCTGAGTCTGTCGCCGGCAACGGCTAGCGGGGATGCCAGCTCACCGGATGGGGCCACCTGCATACGCACCTACATCTCCACGGGCAGCTCTGTGGTGGATGCGGCCACGGGAATCGCGCTGTCCGAGGAGCAAATCAAGGCCATGAATTTGCCCATTAAGACGGGAGTCACATTGCTGTCGCCAACGACATCGATATCCTCGAATGCCTCATCCACGTCGTGCTCCTCGTCATCCGCTGGTGGCTCCAACAATGGCAACATGGCCAGCTCACCGACCATAACGCTCAGCGATGGGGCCATGTTGGAGGGCGAGGGATTGACACGGGAGCAGCTCGATCTCATCAGCAAGATCATGCAGCAGACGAAGCAGACGAGCGCCCAGGTCACAGTCTCGTCTCCCAACAGCGTCAGCTCCTACAAGATCAACACGGAGACTGCTCCGGTGCAATCCCGACCACGTACTTGGAACATGCAACTGCTCAATAATGCACAGAATGTGACCGTCACCGTGGAGGATGGCACCGATCTGGTTGCCTCCGCCAGCAGCTCACCCGACGAGGTCAAGGAGGATGAGCTGAGCCAGCAAATTGTGGCTGCCATCAATCCCCACCTGCTCAACATCGTGAAGATCGACAAGCCGGTCGAGTGCAATCTCTGTCACCGCAAGTTCAAAAACATACCTGCCCTCAATGGGCACATGCGCCTCCATGGCGGCTACTTCAAGAAGGATCCGGAGGCCAAGCGCAGCGAGAAGAAGGACGCAAGTGGACCGCCCTTGCAGACGGCCAGCATCGGGGTCCGTGCCCTCATCGAGGAGAAGATCATCAGCAAACGCAAGGACATCAGCAAgGGAGCGTTTGTGGTGCCTGCACCACCGGGCAGCTTAGTGAACAGCGGTGGTGCTGGCGGCGGTGGCGTTGCCAGTGGAGGCGGCAGCCTGCGACGCTCCATCAGCGACCTGGAGAGTTTCCTCAATCCGAAGAGCGGCTTAAGTCAGGCATTggccaccagcagcagcagttgcaccACGAGCACCACCACAGCGGTGCTGCCAGCGGCCACCACCATCAAGAGCAGCAACGGGCTGAGCATCCAGCAAATCGGGCTGCCCCAGAGCATTGAGATCTTCAGCGGGGGCCAGAGGCAGCCAAAGACGCTGAGTCTGGGCAGTGGTGCCAACACGATCACCATAACCACCAACAATGTGCCCACGACAACCACCATGAGCGCGTTGACGGCACTGAAGGGCaacggcggcggcggtggcagcTCGAACCACGTCGGTGGCGTCTCGAACCACGCGGATCCCAAGGACTCCACACTGATAGAGCTGCTGAAACGCGGCACGCGCATTGCCGTCACCTCCAAGAAGACGCAGCTGCAGGCGCAGACGAGCACGAACCTGTTGCTGAGCAGCATGACGGGCACGGATCTGAGCACCATCGGTGGCAGTCTCCAGACGGGGCGTCAGATCATCACCAACAACAATCGCACTGTCATCATACCCTCGGATGTGCACGTCGTGTCCACCAAGAGCAAGTTGATctccagcagcaacagcagcaacaacagcaacaatatgaACAGCATCTCCCTGCCGGATGGGACACCGCTCTCCCTGACCATAGCGCCGAGTCAGGAGAGCCTTGGCGTCATTGGGGATTCAGGCGGCATCACGAGCGGCGGAGGTGGCGTCTATACGGTGACCTATACCAGCGATGCGGATGCCTCTGATCTCTTTGACGATGCCGAGGTCTACAATGTCTCGGACACGGAGATGCTGCTGCAGACGGTGGACACCATGGAGCTGCTGAACGATGACGAGGACGAGGCGCACAAAAGCGAACATTCCGAACATGACTTTGCCCTGGGTGGCGATCACAGCCAGCTGATCAAGTTGGAGCCAGATTCGGggcagagcaacaacagcaacaacaacaacattaaccaGACTACGCCGTTGCCCACCTTCCAGCAATTCCATTCCAAGGAGCTCATCATGCAGAACAGCTCACAGATCCAGGCGATTGCCAACATGCGGGCAGGCAGCAGTTTGGGAGTCCTCGCCTCCCCGCTGCACTCGCCGCTCGCCTATCCGACGCCGCCGTCCAGTCACGAGAACGTGGCCCAATCCTCGCCCTTCATCGAGGACGCCGCCGCTCAGTTTGTGGACGCGAATCACACGTTCTTCGGCGACAAGACGGACTTCTCCCACGTCTACTTCAAGACGGACGAGGGCAATGCGATGCAACAGCTGAGCGACAATGACAACGAGAAGATACTCAAGCTGAAGTCCGTGCTGGAGGAGAGCAGCTTCGATCCGTCGATCAAGGTGGAGGATCTGCTCAACAGCAACGAGGATGATGCCGAGTGCGATCTGCGCGAATTTGCCGAGACGAATCTGTCGTTTCTCGACGAGGATCAGGAGTTCCTCAACGATTCACGGAACGCCACCTCGCCGCTCTCCGAGTCCTTCTTCACCAGCGGCATCGGGTCCGCCGAGGACGTGAAGCAGGTGCTGCGCGAGGTGCTGCCCGACGAGAacatgcagctgcagctgagCAGCGAGCAGCAGGGCGAGAATATCATAGATCTCTACTATCTGCCCGGACTGGGACTGCAGTCCCAAATGATGCCCAACTCCGACGATCCATTGTTGTCGTCCTCGCCACGCGAATTTGGCCAACAGCGTCAagtgttgcatgcaacaacaacagcggcaacaccaacaacagttCAGGTCCTGGAGCCAACAGTGATTTacgagcagcaacaaccacagcagcagcagcaacagcagcagcagcaacagcagcaacaacaacagttgctgccTCAATTACAGCAACCCGAGcagcttcaacaacaacagcagcagcaacaacagcagtcgGAATTCATGTTGCCCAGCTACACACCGGTGTCGTGTCAGACACAGTATCTGGACACCAGTCAGCAGTCGATGACGTTGCAGCCACTGAACAGCCTGTTGCAGCCATTGCTTTATGCCACggccaacaccaacaacatcaacagcaatggcagcaacaacacatcCAGTGAGAAACACGAGTTTGCCACACTGCTCAGCACTAATGTCGGCGGTCATGCCACAGTGCTTGATGCAAGTCTGCTCTTCGCCTGCAGCGACAGTGGCAACAAGCCTGTGATTGCCGCCGCGCTgcccgcagcaacagcagcagttgcagtgCCTCCGCCAGTGGCTAGTGTGTCCAATCTGCAGCCATTGTCGAACCAAACCAATTCCATACTGAAGCGTCGCCTGCGCACGAATGGAACGCAGGATGTGCACAAGTTCTCCAAATTCCACACACTCTCCCCGCATCGCTCCAAGCTGCGTAAGCCATCGCGCACTCATTACACCCCGGCGCCCATTCTGAATCCGGACCGCAAGGGCACCGGACTCTACTGCAATGTGCGCAAGCAACTCGGCCAGGGCATCTTCGATGTCTTCGACGACGACTTCGGCGATCCCGTGGGTCTGGTCGACTTCTCCGACGAGTCCAAGGTGAACCTGGGCTCGACTTATCAGGCCCAGATACCAGCCTGCAAGTCGGCGGACGATTACTCCAAAGATACTCCGATGGGCGCCGATCTGATGTGGGATCCCAGCGTGCAGCTGGACGAGAAGATACTGATGCGCTACATAGATCTCAGCAAGTCATCTGCCGTCCCCATGGGCAGCCACTCGGAGGAGGTCGCTCTCCAAACGCTTCTCGACGCCAAGGGCAACTCGGCAGCCGCCGTCTTGACCCTGCTGCAAACACAGTCCAGCGCATTTCAGATGAAGTGGACTGCCTTCGAGCTGGAGCAGTTTCTGCGAGGTCTGGAGAAGCATGGCAAGAACTTTGGCAAAATCGCTAGCGAgCTGCTCACAAAGACATCGGGCGAGTGTGTACAAATGTATTACTTCTGGAAGAAGCTCTGTGTGGACTACAAGGTGTCGCATCTGAAAATGGAGCCAGTGATGCCCAGCACTCCGACGGTGGAGCAGAAACCCTATGTCTGTGAGATTGCCGACTGTTCGGCGAGCTTCAGTTCGAAGGCGGCATTGCATGGCCATGTTCGCATACACGCTTATGGTAGAaatgccagcagcaacaatagcaacaacaacagcaacaacaacaaccagcaTGCCACGGCAGTtagtgccaacaacaacagcaaccataGTTCCAGCAACAACGCATGCGCGACCCCAACTAGCAGCAATaatgcaataacaacaacaacaatcataaCGAACACCAATGGCAATAACACCACCGTGAACAACAATagtaacaacaaccacaaccacaacaacaattccaTGTTAGCTGCCAACTCAACCTTAAACTCAACGCTGTTgctgacagcagcagcagcagcagcaacagcaacaaccaaagacaacagcagcaacacaaacaacaacggcaacggcaataACTTGAATGTTGCCAAGGATAGCGAATTCCCCTGCAAGGTGTGCGGCAAGGTCTTCAATAAGGTTAAGAGCCGCAGCGCCCATATGAAAACTCACCGAGTGCAGGAGGCGGAGCAATCCTCCAAttccaaacaacaacaatcgtcCTCCGCAGGAAATCTGTCTGTAACTGTAGCTGTGTCCTCTGCAGTATCGGTCATGGCCTCGACATCCTCATAG
- Protein Sequence
- MAAINTVPKSIHLPLTSLSTPRVLMCSASVGTEGSVTLQLRDANALETATSTATAAATKTTATATTTAIASSSSSSNPTNGVTVLEASSASALENALTIGYPDPDMLADVLGTIQTASLKNKHSSNSNSNSKRSSNSNSRNNSSNSAPTILNTSSGIHLNNPNKITITRRSSTNKISVQTVSTSTNTTISSISGNKTNYIKNCLIRSSSCSNTATNVTTLAQIMSNSSTSSAASLLSQHNNNSNCSNSSNFSNASSVGSSISIGSSTNCSNSSSNSSTNSTSSNSNSNDSSNSSSGGHSLSFKSNGRNVPGVVTTTTTTTAATTNTTTTGIGIVTVDTAPRKSRPKLSSPTRHGPQQCQICSKIFGNASALAKHKLTHSDERKYICALCSKAFKRQDHLNGHMMTHRNKKPYECKADGCGKSYCDARSLRRHSENHHGGVATPTNNSLSPTASTSSSSSSNGSNIGVGVGVATNSLSLSPATASGDASSPDGATCIRTYISTGSSVVDAATGIALSEEQIKAMNLPIKTGVTLLSPTTSISSNASSTSCSSSSAGGSNNGNMASSPTITLSDGAMLEGEGLTREQLDLISKIMQQTKQTSAQVTVSSPNSVSSYKINTETAPVQSRPRTWNMQLLNNAQNVTVTVEDGTDLVASASSSPDEVKEDELSQQIVAAINPHLLNIVKIDKPVECNLCHRKFKNIPALNGHMRLHGGYFKKDPEAKRSEKKDASGPPLQTASIGVRALIEEKIISKRKDISKGAFVVPAPPGSLVNSGGAGGGGVASGGGSLRRSISDLESFLNPKSGLSQALATSSSSCTTSTTTAVLPAATTIKSSNGLSIQQIGLPQSIEIFSGGQRQPKTLSLGSGANTITITTNNVPTTTTMSALTALKGNGGGGGSSNHVGGVSNHADPKDSTLIELLKRGTRIAVTSKKTQLQAQTSTNLLLSSMTGTDLSTIGGSLQTGRQIITNNNRTVIIPSDVHVVSTKSKLISSSNSSNNSNNMNSISLPDGTPLSLTIAPSQESLGVIGDSGGITSGGGGVYTVTYTSDADASDLFDDAEVYNVSDTEMLLQTVDTMELLNDDEDEAHKSEHSEHDFALGGDHSQLIKLEPDSGQSNNSNNNNINQTTPLPTFQQFHSKELIMQNSSQIQAIANMRAGSSLGVLASPLHSPLAYPTPPSSHENVAQSSPFIEDAAAQFVDANHTFFGDKTDFSHVYFKTDEGNAMQQLSDNDNEKILKLKSVLEESSFDPSIKVEDLLNSNEDDAECDLREFAETNLSFLDEDQEFLNDSRNATSPLSESFFTSGIGSAEDVKQVLREVLPDENMQLQLSSEQQGENIIDLYYLPGLGLQSQMMPNSDDPLLSSSPREFGQQRQVLHATTTAATPTTVQVLEPTVIYEQQQPQQQQQQQQQQQQQQQQLLPQLQQPEQLQQQQQQQQQQSEFMLPSYTPVSCQTQYLDTSQQSMTLQPLNSLLQPLLYATANTNNINSNGSNNTSSEKHEFATLLSTNVGGHATVLDASLLFACSDSGNKPVIAAALPAATAAVAVPPPVASVSNLQPLSNQTNSILKRRLRTNGTQDVHKFSKFHTLSPHRSKLRKPSRTHYTPAPILNPDRKGTGLYCNVRKQLGQGIFDVFDDDFGDPVGLVDFSDESKVNLGSTYQAQIPACKSADDYSKDTPMGADLMWDPSVQLDEKILMRYIDLSKSSAVPMGSHSEEVALQTLLDAKGNSAAAVLTLLQTQSSAFQMKWTAFELEQFLRGLEKHGKNFGKIASELLTKTSGECVQMYYFWKKLCVDYKVSHLKMEPVMPSTPTVEQKPYVCEIADCSASFSSKAALHGHVRIHAYGRNASSNNSNNNSNNNNQHATAVSANNNSNHSSSNNACATPTSSNNAITTTTIITNTNGNNTTVNNNSNNNHNHNNNSMLAANSTLNSTLLLTAAAAAATATTKDNSSNTNNNGNGNNLNVAKDSEFPCKVCGKVFNKVKSRSAHMKTHRVQEAEQSSNSKQQQSSSAGNLSVTVAVSSAVSVMASTSS
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00534086; iTF_00569218; iTF_00568914; iTF_00585127; iTF_00585461; iTF_00471114; iTF_00470803; iTF_00519466; iTF_00519098; iTF_00537743; iTF_00537445; iTF_00529072; iTF_00528738; iTF_00601715; iTF_00602076; iTF_00557846; iTF_00558166; iTF_00608683; iTF_00608378; iTF_00583615; iTF_00583936; iTF_00586199; iTF_00585905; iTF_00523257; iTF_00522893; iTF_00610588; iTF_00610894; iTF_00575517; iTF_00575859;
- 90% Identity
- iTF_00534086;
- 80% Identity
- iTF_00534086;