Deng011386.1
Basic Information
- Insect
- Drosophila engyochracea
- Gene Symbol
- onecut
- Assembly
- GCA_035042385.1
- Location
- JAWNLK010000108.1:195241-201046[-]
Transcription Factor Domain
- TF Family
- CUT
- Domain
- Homeobox|CUT
- PFAM
- PF02376
- TF Group
- Helix-turn-helix
- Description
- The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein (eg Swiss:P10180).
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 1.5e-37 1.3e-33 114.7 0.3 3 78 724 799 722 800 0.97
Sequence Information
- Coding Sequence
- ATGGAATCGATGAACGAGATTATTGATGCACAAACTTTCGGCCACCAGCTGGTTGTGGAAACGAGCGATTTCATTGGGCAACCCGGCAATCCTGAAAGCGAATCCGAATGCGAGCCGAGGGAAGTGCACGAAACGGGCATTAAATACTTGAAAGCGTCTGCGATGCAAACACTTATTGGGAACACAAATCGCAGTCATCGCAGTAATCGATTAATCGAAGATGatgaggagcaggaggagctggaagaggaggaggaagagcaGGAGGAGCTGGAAGAGGAGGAGCAagagctggaggaggaggagcaagaGTCTCGATCTGTGGTGATGGTCATCGATGAACTGGGTCACAGCAATTATCAATTGATGCCACAACAGTCAGGGAGCAGAACAATACCAACAATGTCCTCGCTGCTACCCTCCATGCATCAGCGCAACAGTCCCGTCGATTTTGTCGGTTCCGAGCTCAGTTTGGATGGCTTGACCAATACTCAGATCATAGCCGTCAAACAAGAGCACAAATTGGTCATTGTTGAGCGTGGAAATAATCGTAATTTGAATGTCATCGATGAGTTGTCCGAGGATGGAGGGGGCATCGCCAGGGGTGCAGGGGTGGGCGTTGAGGAGGTCACTCTgaatcagcatcatcagcagctgctgcaggaaCAGTCTGAAcatcaaaatcatcatcatcatcagcaacagcagcagcatgcgcaacatcaacagcagtaTGGAACGGAattacatcatcatcatcagagcCAGGCTCATCATAATCCAAGCACTACCGCTACGCACTCGCGTCGCTTGCAGCTGCTCAGTCAACGCGAGGAATTGTCGGTTATTGTGCCCACGCTGCCCGATGACGaggataacgataacgatatcgatgacgatgatgaggaaGATGAGCGCGATGCCGGCGCACATTTACTTAGTCCCACAGAGCAGAGCACCTATCAAACGCTAACAgccgtcaacaacaacaataacaataataataacaataacaacagcagcacccaCAGCAGCACTACCAGCAGCAATACCAATAACAATCGCATATCAGCACCGATTTATAGTCCCACATCGTATGCGACCTTGACCCCAATACAACCACTACCGCCCATCTCAACAATGTCCGATAAGTTTGCGTACACGGGTCATATTTCCAACTCGAGCTCAAATGTAGCCggtaatagcagcagcaacagtgtgGTAGCCGCAACGGGTCGTACCAATAATTCAACGGATATTACTGGCAACGATTGTGGTACCTTTCCTAGTTTACCGCTACCCATTGAAACGGgacagcatcatcagcatcagcatcaacatcatcatcaacagcatcatcaacatcacCACGGTTTGAGTTTGGGCGGTCTGCCTTATGCCAGCTATGATAAGCTACCTTCACTCATCTCAACGCCGCCGCATAACTATGCCAGCAGTACCTCGCCGACCCATGGATTGTCTGGCCTGGTTGTGGGAGCTTGTGATTTGCATGGACACAATTCAACGGTCGTCGGCACGTCGTCGTCACCGGTTGCCGGTACGGCAATGTCACCGCACAAATCGGTTGGGCAGCTTGTGCCGGTAGTTCTGCAGAAACAGGTGCTTTGTCTTTCGCCAGGTAGTGGCCTACCCGATGGCGTTGTCGTCAGTGACTATGAGTCCTCTTACGGCCCGCCTCAACACGACCACGAGCTAATTAATGCGACAAGCGGTCGCAGTAGTTCGCAGTTGCGTTTGCAACATAGCCCTACACTGAGTCCACATTCAGCAGGCTCGGTAGTGTCAATGTCCCTGCATTCACCGGCTTCTGTCGTCACATTGCCGCACATGAATGGTTCGGTGACCACTTTAGCTGTGGATTTACCAGTTGTGGTCTCATTAACGCCGacacctcctcctccacctctgCCACAGTCCGGAGAGGGTGCCAGTAGCTCGTTGGGTGTCAGCTTGGATGAGATGTgtcagcgacaacaacatcagtcAAACGATTGCATGCCCACAAATAGTAATCACAATCTGTGCAATCTAAATCAGTCCCAGCATCATCAACAAAATGGATTTAATCAGGAGCAACCCAAACTGTCACCCAAGTCAGCAGCACCTCTAAACTCAACAGGTTCCGCTTCAAATCGTTCCTCTGACCTCGAGGAGATCAACACAAAGGAACTGGCTCAACGCATATCGGCGGAACTGAAGCGCTATAGCATTCCGCAGGCCATCTTTGCACAGCGGGTACTCTGTCGGTCACAGGGCACCCTCTCAGATCTGTTGCGCAACCCCAAGCCCTGGTCGAAGCTCAAGTCCGGCCGCGAAACGTTCCGGCGCATGTTCAAATGGCTTCAGGAACCTGAATTTCAACGCATGTCAGCGCTGCGCATGGCTGCCGCTCAGATTCCACAACGGCCAGCGAGTGGCATCAGTGTTGCTGGTAGTGCTGCTGGTAATGCTGCTATTACCAGTGTTGGTTATGCAATCACTGCTGCAACGGGTGCGGTGGCCTCTGCTGCGACACTGTTGGGCAACAGCGCAACAAATGCGGATACTGCCCACGTCTCGACTCTTGCTGAGAATGAAATGTTGAGCGGCCCTGGAGTTGTGGCCGGCTCCAATTGTCGTCGCAAGGAGGAGCCGCACATGGAACAAATGCCGCAACCAAAGAAGCCACGGTTAGTTTTTACCGATCTGCAGCGAAGGACATTACAGGCTATCTTTAAGgAGACAAAACGACCCTCAAAGGAGATGCAGGTGACAATTGCCCGCCAGTTGGGCCTCGAACCAACCACAGTGGGCAACTTCTTTATGAATGCTCGTCGACGATCGATGGACAAGTGGCGCGATGATGACACCAAGAATGCCCAGCATAATGCACACAATCGCCAGGAGCACCCGCAGGACGAACAGGATAATAGGGAAAGAGATATAGATAGGGATAGAGAAAGCAATAGTGGACAACATGGACAGCATTATGGAAGCAGTTTGCATACGACGGCAATGTCTCCATTGGGcaattttgatgatgatggcgataTGGATTTAGAACTGGAGCATCATGATTTCCTCGTAGATGGCGATGAGCACGATGATATGTTGTGA
- Protein Sequence
- MESMNEIIDAQTFGHQLVVETSDFIGQPGNPESESECEPREVHETGIKYLKASAMQTLIGNTNRSHRSNRLIEDDEEQEELEEEEEEQEELEEEEQELEEEEQESRSVVMVIDELGHSNYQLMPQQSGSRTIPTMSSLLPSMHQRNSPVDFVGSELSLDGLTNTQIIAVKQEHKLVIVERGNNRNLNVIDELSEDGGGIARGAGVGVEEVTLNQHHQQLLQEQSEHQNHHHHQQQQQHAQHQQQYGTELHHHHQSQAHHNPSTTATHSRRLQLLSQREELSVIVPTLPDDEDNDNDIDDDDEEDERDAGAHLLSPTEQSTYQTLTAVNNNNNNNNNNNNSSTHSSTTSSNTNNNRISAPIYSPTSYATLTPIQPLPPISTMSDKFAYTGHISNSSSNVAGNSSSNSVVAATGRTNNSTDITGNDCGTFPSLPLPIETGQHHQHQHQHHHQQHHQHHHGLSLGGLPYASYDKLPSLISTPPHNYASSTSPTHGLSGLVVGACDLHGHNSTVVGTSSSPVAGTAMSPHKSVGQLVPVVLQKQVLCLSPGSGLPDGVVVSDYESSYGPPQHDHELINATSGRSSSQLRLQHSPTLSPHSAGSVVSMSLHSPASVVTLPHMNGSVTTLAVDLPVVVSLTPTPPPPPLPQSGEGASSSLGVSLDEMCQRQQHQSNDCMPTNSNHNLCNLNQSQHHQQNGFNQEQPKLSPKSAAPLNSTGSASNRSSDLEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMFKWLQEPEFQRMSALRMAAAQIPQRPASGISVAGSAAGNAAITSVGYAITAATGAVASAATLLGNSATNADTAHVSTLAENEMLSGPGVVAGSNCRRKEEPHMEQMPQPKKPRLVFTDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPTTVGNFFMNARRRSMDKWRDDDTKNAQHNAHNRQEHPQDEQDNRERDIDRDRESNSGQHGQHYGSSLHTTAMSPLGNFDDDGDMDLELEHHDFLVDGDEHDDML
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00501709; iTF_00498658; iTF_00552212; iTF_00498775; iTF_00552096; iTF_00498054; iTF_00557001; iTF_00497940; iTF_00557112; iTF_00499377; iTF_00499492; iTF_00542229; iTF_00542344; iTF_00576853; iTF_00576969; iTF_00518232; iTF_00518348; iTF_00564442; iTF_00564558; iTF_00595739; iTF_00595856; iTF_00513248; iTF_00513362; iTF_00619629; iTF_00606856; iTF_00619743; iTF_00558447; iTF_00558563; iTF_00485649; iTF_00485766; iTF_00548593; iTF_00548709; iTF_00566577; iTF_00566695; iTF_00592881; iTF_00592997; iTF_00494898; iTF_00495013; iTF_00521138; iTF_00521253; iTF_00524914; iTF_00525030; iTF_00497172; iTF_00497288; iTF_00615988; iTF_00616103; iTF_00609782; iTF_00609668; iTF_00482135; iTF_00482020; iTF_00511063; iTF_00511180; iTF_00552824; iTF_00552941; iTF_00522073; iTF_00521956; iTF_00527834; iTF_00527950; iTF_00560026; iTF_00560161; iTF_00576250; iTF_00576134; iTF_00597147; iTF_00597264; iTF_00535101; iTF_00535217; iTF_00582749; iTF_00582866; iTF_00496382; iTF_00496499; iTF_00573950; iTF_00573835; iTF_00500095; iTF_00500212; iTF_00516895; iTF_00516782; iTF_00543647; iTF_00543531; iTF_00570303; iTF_00570187;
- 90% Identity
- iTF_00501709; iTF_00498054; iTF_00557001; iTF_00497940; iTF_00557112; iTF_00595856; iTF_00518348; iTF_00518232; iTF_00595739; iTF_00499377; iTF_00499492; iTF_00542229; iTF_00542344; iTF_00576853; iTF_00576969; iTF_00576250; iTF_00576134; iTF_00582749; iTF_00582866; iTF_00516895; iTF_00516782; iTF_00513248; iTF_00513362; iTF_00485649; iTF_00485766; iTF_00615988; iTF_00616103; iTF_00552824; iTF_00552941; iTF_00564442; iTF_00564558; iTF_00570303; iTF_00570187;
- 80% Identity
- iTF_00501709;