Dsub009334.1
Basic Information
- Insect
- Drosophila subsilvestris
- Gene Symbol
- Kdm5
- Assembly
- GCA_035043285.1
- Location
- JAWNNB010001726.1:1672301-1678241[-]
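The Location value packs scaffold accession, start, end, and strand into one string. A minimal sketch of splitting it apart follows; the names `Location`, `parse_location`, and `span_length` are hypothetical and not part of the database, and the length calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'scaffold:start-end[strand]' string."""
    scaffold: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like 'JAWNNB010001726.1:1672301-1678241[-]'.
_LOCATION_RE = re.compile(
    r"^(?P<scaffold>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into scaffold, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        scaffold=match["scaffold"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("JAWNNB010001726.1:1672301-1678241[-]")
print(loc.scaffold, loc.strand, span_length(loc))
```

The span covers the whole locus on the scaffold, so it can be longer than the coding sequence if the gene contains introns.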
Transcription Factor Domain
- TF Family
- ARID
- Domain
- ARID domain
- PFAM
- PF01388
- TF Group
- Helix-turn-helix
- Description
- This domain is known as ARID, for AT-Rich Interaction Domain [2], and is also known as the BRIGHT domain [1].
- Hmmscan Out
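| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | 1.7e-26 | 4.1e-23 | 81.7 | 0.1 | 2 | 89 | 224 | 307 | 223 | 307 | 0.95 |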
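The alignment coordinates in the hmmscan row locate the ARID domain within the protein sequence. The sketch below parses such a row and slices out the domain; `DomainHit`, `parse_domain_row`, and `slice_domain` are hypothetical names, and the field order is assumed to match the column order shown in this record. HMMER reports these coordinates as 1-based and inclusive, so the slice subtracts one from the start.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One per-domain hmmscan row (field order assumed from this record)."""
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(row: str) -> DomainHit:
    """Parse a whitespace-separated row of 13 values into a DomainHit."""
    parts = row.split()
    if len(parts) != 13:
        raise ValueError(f"expected 13 fields, got {len(parts)}")
    return DomainHit(
        int(parts[0]), int(parts[1]),
        float(parts[2]), float(parts[3]), float(parts[4]), float(parts[5]),
        int(parts[6]), int(parts[7]),
        int(parts[8]), int(parts[9]),
        int(parts[10]), int(parts[11]),
        float(parts[12]),
    )


def slice_domain(protein: str, hit: DomainHit, use_envelope: bool = False) -> str:
    """Return the domain residues using 1-based inclusive coordinates."""
    start, end = (hit.env_from, hit.env_to) if use_envelope else (hit.ali_from, hit.ali_to)
    if not 1 <= start <= end <= len(protein):
        raise ValueError("domain coordinates fall outside the protein")
    return protein[start - 1:end]


hit = parse_domain_row("1 1 1.7e-26 4.1e-23 81.7 0.1 2 89 224 307 223 307 0.95")
print(hit.ali_to - hit.ali_from + 1)  # aligned residues in the protein
```

Passing the Protein Sequence from this record as `protein` would return the aligned ARID region; `use_envelope=True` switches to the slightly wider envelope coordinates.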
Sequence Information
- Coding Sequence
- ATGTCCGCCAAAACCGAGGCGGACAATACAACAGCCGCCAGCtccggcggcggtggcagcagtagcagcagtggCGGCGGGGGAACGGGCAACACATCCTCCTCGAATGGAACCGCCACACCGGCGCGTCGTCTACGCACACGCAACTCCACAGGCAATGGTCCGGCAAGCGGCAGTGAATCCGGCAAGAAATCCAATCCAAACGATGAACCCTCaacaccagctgctgctgcttcggccGCCAGTGGTCATGGCCCGTCGCCGGCTGCGGCCACAGAGCGTCCCATGCCGAGCGTACCCATGAACCATGCCAGCAGCAGTGTGTCGGCGAGCAGGAAGTACCACAACAGCTGCCCCCATCCCACGCCCATAGTGCACAAGAAGACGGTGCACACGCAGCCGCACAGCAGCAATAAGTTTGATCAGGGCAAGAACGAGGAGTTCCACTTTGATACGCCGCCCGAGTGCCCCGTGTTCCGGCCCACGGCAGAGGAATTCAAGAATCCGCTGGCCTACATCAGCAAGATACGTTCGATAGCAGAGAAGTGTGGCATTGCCAAGATCCAGCCGCCAGCGTCATGGTCGCCACCGTTTGCCGTGGATGTGGACAAGCTGCGTTTTGTGCCGCGTGTCCAGCGGCTCAACGAGCTGGAGGCCAAGACCAGGGTGAAGTTGAATTTTCTGGATCAAATAGCCAAGTTTTGGGAGCTGCAAGGCTCCTCCCTCAAGATCCCCATGGTGGAGCGCAAGGCCCTCGATCTGTACACCCTCCATCGCATCGTACACGAGGAGGGTGGCATGGAGCAGACCACTAAGGACCGCAAGTGGGCAAAGGTGGCCAATCGTATGCAGTATCCATCCAGCAAGAGTGTTGGCGCCACCCTCAAGAGCCACTACGAGCGCATTCTCCATCCCTTCGAGGTGTACACCTCGGGCAAGGTTCTGGGtccagctgcagccgctgccgcagccACGGCCAGTGGCACACCCACGCCCGTGAAGCTCGAAGATGTGGGCACCGAGTACAAGGCCCACGAGATAGCCACCCGTCAGCAGATCGCACCGCCCAATGAGACGAATACACGCCGCTCCAAGCGCTTTGGCAACTCCACTGCCAGCTGCGGCCTAGCAGGCAGCGCATCCTGTGGCAAGGATGCGGGTGCGGGTGGTGTGGTCATCAAAACGGAGACAAAGGAGGATTTCAAACGCGATCTGCTAAGCAGTTTCAATGCAGTCCAAGGCATCGGTGGCACGCCAGCTGGCGGCACACCTGCTGGCCGTGGCGGCCCCAAGAAGACCAACAACGAACCAACGCCAACTGCGCCGCCACTCATCGATCCACTGATGAAGTACATTTGCCACATCTGCAATCGCGGCGATGTCGAGGAATCGATGCTGCTATGCGACGGCTGCGACGACAGCTACCACACGTTCTGTCTGCTGCCGCCCTTGACCAGCATACCCAAAGGTGAATGGCTTTGTCCGCGCTGCGTTGTTGAGGAGGTGAGCAAGCCGCAGGAGGCCTTTGGCTTTGAGCAGGCCGAAAGAGAGTACACGCTGCAGCAGTTTGGCCAGATGGCGGATCAATTCAAGCAGGAGTACTTCCGCAAGCCAGTGCATTTGGTGCCCACCGGTTCGGTGGAGCGTGAATTCTGGCGCATTGTCTCCTCAATCGATGAGGATGTGACCGTTGAGTATGGAGCCGATCTGCATACAATGGATCATGGTTCGGGTTTCCCCACCAAGAGCTCGCTGTATCTGCTGCCCGGTGATCAGGAGTATGCCGAGTCCAGCTGGAATCTGAACAATCTGCCGCTGCTCGAGGACTCGATTTTGGGGCACATCAATGCCGATATTAGCGGTATGAATGCCCCCTGGATGTATGTGGGCATGTGCTTTGCCGCCTTCTGCTGGCACAACGAGGACCACTGGAGTTATTCCATCAACTATTTGCACTGGGGCGAACCCAAGACCTGGTACGGTGTGCCCGGCTCCTGT
GCCGAGCAATTTGAGGAGACCATGAAGCGGGCAGCACCCGAACTGTTCTCCTCGCAGCCCGATCTGCTGCATCAGCTGGTGACGATTATGAATCCCAATATATTGATGAACAATCGCGTGCCTGTGTTTCGCACCGATCAGCATGCCGGCGAGTTTGTCATCACCTTTCCGCGCGCCTATCATGCGGGCTTCAATCAGGGCTATAACTTTGCCGAGGCCGTTAACTTTGCGCCCGCCGATTGGCTAAAAATGGGCCGTGAATGCGTCAATCATTACTCCATGCTGCGTCGCTTCTGTGTCTTCTCCCACGACGAGTTGGTGTGCAAAATGGCCTTGGAGCCGGCCAAACTGACCTTTGGGATAGCCACCGCCTGTTACATTGACATGGCCGAAATGGTGGACACCGAAAAGAAGCTGCGCAAGTCCCTACTGGAGTGGGGTGTAACGCGCGCGGAGCGTCGCGCTTTCGAGTTGGTCAACGATGACGAGCGGCACTGTCAGGAGTGCAATACGACGTGTTTCCTGTCGGCAGTGGCCTGCGAGTGCAACGACAAGCTGATTGTCTGTCTGCGGCACTACACCGtcctctgtggctgtgccccAGAGAAGCATACGCTCATCTATCGCTACACGCTCGACGAGATGCCGCTGATGCTGCAGAAACTGAAGGTCAAAGCGCACAGCTTTGAGCGTTGGCTGTCGCGTTGCCGGGACATTGTGGATGCGCATACACCCTCCTCGGTGACACTCACCGAGCTGCAGGAGCTGTGCAAGGAGGCGGAGACCAAAAAGTTTCCATCATCGTTGCTGATCGATCGCCTCAATGCGGCGGCCATCGAAGCGGAGAAATGCGTCACAGTCATCCAGCAATTGGGCATCAATAAGGTACGCACCCGTTCGGATCACAACCAGGAGGCGGCCCAATACAAGCTGACGATGGAGGAGCTCGAGCTGTTTGTCCAGGAAATCGATAATCTTTGCTGCATCATCGATGAGGGCACCTCGGTGCGGGAGCTGCTCGTTTTGGGCAAACAATTTGTGGACCGTGCGGAGGGGCAGCTGCAGCTAACGCTCGAGGCACTGGAGGAGAACGAACTGGAGACGCTGATCAATGAGGGGGGCTCCCTGAGGATTGAATTGCAGCAGTTGGATCAACTCCAGAAGCGTCTGAAGCAGTGCAAGTGGTACAAACGCTCGCAGGGATTGCGGGAGACCAGCTCCAAGCTGACATATCAGGATGTGAAGAATCTCCTGCATATGGCAGCGGCCGATTTGGATCCCACCGATCCCTATGTGGACAAGGAGATGCGCAAGCTGCAGCAGATTGGCGCCGACATTGAGTCATGGGAATCGCAGGCGGCCAAGTATTTCCGCCGCCTAACGCAACAGCACGAACTGGCGGAAATTGAGCAGTTCTTGAAGTCGGCGGGCGAGATAAGCGGCCAGGTACCATCGCATACGCTGCTCAAGGATGCCCTGCGCAAGGCGAGGGAGTGGCTGCGTGCcgtggagcagctgcagcagaacaATCATGTCACCTACTGTCACACGCTCGAGAATATGATTGAACGGGGCCTGAATATACCCATCCAGCTGGAGGAGCTCAGTCGCATGCAGGGCCATCTGTATAGCGCCAATCAGTGGAAGGATAATACGGCGTGCGCTTTCCTCAAGAAGGGCACCTTCTATACGCTGCTCGAGGTGCTGATGCCTCGGGCAGACGCCATCAACATTGACTCGGATCTCAAGCCGCGCTTCCAGGATGACTTTCTGAAGGAGAAGAATCCGGCGGAGATTGTCGACAGTTTCAAGCATgccgaggagcaggagctgctcgATATGCGCGACCTACGGCGCCAGAATATGACCAAGAACCCCCTGCGGGATATGTTCTGTCTGTGCAAGGCCGAGTTCCGTGGCCTCATGTACAATTGTCAGTTGTGCCGCGATTGGTTCCACGAGGACTGTGTGCCACCGCCACCCTCGGCATCGAGTCAGAA
TGGTTTGCtgaatggaggaggaggaggaatcGGAGGCGGCGGCATGGTGCCGCCCACGAATCGGCCCAAGTGGCTGTGCCCCAGCTGTGTGCGCTCGAAGCGGCCTCGCCTGGAGACGATCCTTCCACTGCttgtccagctgcagcagctgccaatACGTCTGCCCGAGGATGAGGCTTTGCGTTGCTTGGCGGAGCGTGCCATGAACTGGCAGGACAGGGCCCGCAAGGCCCTCAGCAGTCCCGATGTGAGTGCGGCCCAGGAGGCGATTCtcgcccagcagcagaagcgtcGCTCGGAGGGCGGTGGCatcggtggcggcggcggtgttGTGGGCAACATAAACAGTCCCCGCAAGCCCCGACGTCGTGGCAGCATGGCCAAGGAGGCAAGCGGCTCCACAGAGTCGGAtgccgatgatgatgacgacgatgacgagtGCCGGCTACGGATCGTCGAGGATGGCTTCAGCAATGACGAGGATGAGCTGCCCCGCCCCGCAACGAACACCGAATTGAATTCCGATCTCCTCAAGCTGCTGTCCGATAGCGAAATTGAGAATCTACTAGATCTAATGATGGAGGGTGATCTCCTCGAGGTCTCACTGGACGAGACGCTCGAACTGTGGCGCATACTAGAGACGATGCCCCCCACAATGCTTCAAGCGGAAGCCAATAGCCGTGTGGCCCAACACATGCAGCGCCAGCGACAGCAACCGACATCTCTGCCGCCCACATCCTCCAGCATTCACTCAGGCGCCGAGGACTCCAACGACAGCCTCCTGGTGCAGAACAGTcccaatagcaacagcaataatAGCAGCATACCGGCCACGATTACACCCAGCAATGGACGCAACAAGAAGCGGCGTTCGAACGATGCCAGCGGCGGCAATGCGGCTGTGCCACGCAAGAAACAGAATACGCCCAAGCAGACGCCCAGCAAGAAGAAtgccggcggcggcggtggagcGGCTGCCAATCGGAAGAGTGAGGCCAAAACGGcgaccgcagcagcagcagcagcggcggcggcaacgacACCTGGAGCGGATGCCGATGCGGAGAACAAGCAGGCGAATGGCGGCAATACCAACTCAAATATATCTCCGGCTACAGTGGCGGGGGCGGGAGTGGGTAGCGGTTCGACCACACCCACGCCTACGGCCACGCCTGGTTCGGCGCATAAGAAACGTAAGCGCACCTCCACCAACAATAatagcagcaatagcaacaacaacaacaacaatagcaacaccaatagtaacagcagcagcagcaacaacaacaacagtccGAGCACTCCCAGTGGAACGCAGGCAACGACAacaccggcagcagcagcagcagcagcagcagcaccaccaccaacaggtgcgggtggcggcggtggcggtcaGAAGAAGCACGCCCAGCGCAATCAACAGGCAGCCCAGGAGGATGACGAGGAGGAATGCAGGGCGGAGAACTGCCATAAGCCGACCGGGCGTGAGGTGGACTGGGTGCAGTGCGATGGCGGCTGCAACGAGTGGTTTCACATGTACTGTGTGGGTCTAAATCGCAGCCAGATCAAGGCCGATGACGATTACATTTGTATACGGTGCACGAAGACCGTCACCGTGGCGGTGagcagtagtagtagtagtagtaatCATGGCATGggagcgacaacgacaacgatggCGACGCCGGGCAAGCAGCGGGCAGTGCAATCGGCGCGATAG
- Protein Sequence
- MSAKTEADNTTAASSGGGGSSSSSGGGGTGNTSSSNGTATPARRLRTRNSTGNGPASGSESGKKSNPNDEPSTPAAAASAASGHGPSPAAATERPMPSVPMNHASSSVSASRKYHNSCPHPTPIVHKKTVHTQPHSSNKFDQGKNEEFHFDTPPECPVFRPTAEEFKNPLAYISKIRSIAEKCGIAKIQPPASWSPPFAVDVDKLRFVPRVQRLNELEAKTRVKLNFLDQIAKFWELQGSSLKIPMVERKALDLYTLHRIVHEEGGMEQTTKDRKWAKVANRMQYPSSKSVGATLKSHYERILHPFEVYTSGKVLGPAAAAAAATASGTPTPVKLEDVGTEYKAHEIATRQQIAPPNETNTRRSKRFGNSTASCGLAGSASCGKDAGAGGVVIKTETKEDFKRDLLSSFNAVQGIGGTPAGGTPAGRGGPKKTNNEPTPTAPPLIDPLMKYICHICNRGDVEESMLLCDGCDDSYHTFCLLPPLTSIPKGEWLCPRCVVEEVSKPQEAFGFEQAEREYTLQQFGQMADQFKQEYFRKPVHLVPTGSVEREFWRIVSSIDEDVTVEYGADLHTMDHGSGFPTKSSLYLLPGDQEYAESSWNLNNLPLLEDSILGHINADISGMNAPWMYVGMCFAAFCWHNEDHWSYSINYLHWGEPKTWYGVPGSCAEQFEETMKRAAPELFSSQPDLLHQLVTIMNPNILMNNRVPVFRTDQHAGEFVITFPRAYHAGFNQGYNFAEAVNFAPADWLKMGRECVNHYSMLRRFCVFSHDELVCKMALEPAKLTFGIATACYIDMAEMVDTEKKLRKSLLEWGVTRAERRAFELVNDDERHCQECNTTCFLSAVACECNDKLIVCLRHYTVLCGCAPEKHTLIYRYTLDEMPLMLQKLKVKAHSFERWLSRCRDIVDAHTPSSVTLTELQELCKEAETKKFPSSLLIDRLNAAAIEAEKCVTVIQQLGINKVRTRSDHNQEAAQYKLTMEELELFVQEIDNLCCIIDEGTSVRELLVLGKQFVDRAEGQLQLTLEALEENELETLINEGGSLRIELQQLDQLQKRLKQCKWYKRSQGLRETSSKLTYQDVKNLLHMAAADLDPTDPYVDKEMRKLQQIGADIESWESQAAKYFRRLTQQHELAEIEQFLKSAGEISGQVPSHTLLKDALRKAREWLRAVEQLQQNNHVTYCHTLENMIERGLNIPIQLEELSRMQGHLYSANQWKDNTACAFLKKGTFYTLLEVLMPRADAINIDSDLKPRFQDDFLKEKNPAEIVDSFKHAEEQELLDMRDLRRQNMTKNPLRDMFCLCKAEFRGLMYNCQLCRDWFHEDCVPPPPSASSQNGLLNGGGGGIGGGGMVPPTNRPKWLCPSCVRSKRPRLETILPLLVQLQQLPIRLPEDEALRCLAERAMNWQDRARKALSSPDVSAAQEAILAQQQKRRSEGGGIGGGGGVVGNINSPRKPRRRGSMAKEASGSTESDADDDDDDDECRLRIVEDGFSNDEDELPRPATNTELNSDLLKLLSDSEIENLLDLMMEGDLLEVSLDETLELWRILETMPPTMLQAEANSRVAQHMQRQRQQPTSLPPTSSSIHSGAEDSNDSLLVQNSPNSNSNNSSIPATITPSNGRNKKRRSNDASGGNAAVPRKKQNTPKQTPSKKNAGGGGGAAANRKSEAKTATAAAAAAAAATTPGADADAENKQANGGNTNSNISPATVAGAGVGSGSTTPTPTATPGSAHKKRKRTSTNNNSSNSNNNNNNSNTNSNSSSSNNNNSPSTPSGTQATTTPAAAAAAAAAPPPTGAGGGGGGQKKHAQRNQQAAQEDDEEECRAENCHKPTGREVDWVQCDGGCNEWFHMYCVGLNRSQIKADDDYICIRCTKTVTVAVSSSSSSSNHGMGATTTTMATPGKQRAVQSAR
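The Protein Sequence should be the translation of the Coding Sequence. A minimal sketch for checking that correspondence follows, assuming the standard genetic code (NCBI translation table 1); the record does not say which table was used, and `translate` is a hypothetical helper rather than the database's own method. Lowercase bases are treated the same as uppercase, and whitespace inside the sequence is ignored.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, dropping one trailing stop codon.

    Case is ignored and whitespace (including line breaks) is removed.
    Codons with ambiguous bases are translated as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )
    # A complete CDS ends in a stop codon, rendered here as '*'.
    return protein[:-1] if protein.endswith("*") else protein


# First ten codons of the Coding Sequence in this record.
print(translate("ATGTCCGCCAAAACCGAGGCGGACAATACA"))
```

Comparing `translate(cds)` against the listed protein string is a quick way to confirm that a copied sequence has not been truncated or altered.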
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00503779; iTF_00804168; iTF_00477129; iTF_00475727; iTF_00481335; iTF_00550044; iTF_00914281; iTF_00916009; iTF_00915115; iTF_00611180; iTF_00487790; iTF_00565871; iTF_00538747; iTF_00579771; iTF_00916975; iTF_00918833; iTF_00487090; iTF_00545754; iTF_00614694; iTF_00563678; iTF_00474292; iTF_00505185; iTF_00580481; iTF_00581313; iTF_00529347; iTF_00511788; iTF_00536547; iTF_00578308; iTF_00484229; iTF_00507397; iTF_00547845; iTF_00600837; iTF_00514672; iTF_00615368; iTF_00565162; iTF_00604618; iTF_00602365; iTF_00605356; iTF_00587921; iTF_00515379; iTF_00486356; iTF_00504468; iTF_00591490; iTF_00607594; iTF_00592174; iTF_00539442; iTF_00596438; iTF_00577557; iTF_00483463; iTF_00555625; iTF_00500820; iTF_00590744; iTF_00517494; iTF_00480663; iTF_00473563; iTF_00471392; iTF_00494167; iTF_00618937; iTF_00535838; iTF_00574592; iTF_00919647;
- 90% Identity
- iTF_00535838;
- 80% Identity
- -