Cnup042776.2
Basic Information
- Insect
- Catocala nupta
- Gene Symbol
- osa
- Assembly
- GCA_963675205.1
- Location
- OY776100.1:10175141-10207493[-]
Transcription Factor Domain
- TF Family
- ARID
- Domain
- ARID domain
- PFAM
- PF01388
- TF Group
- Helix-turn-helix
- Description
- This domain is know as ARID for AT-Rich Interaction Domain [2], and also known as the BRIGHT domain [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 15 9.3e-13 5.4e-09 38.1 0.0 2 51 329 378 328 384 0.91 2 15 3.7e-11 2.2e-07 32.9 0.0 10 51 385 426 383 432 0.92 3 15 3.6e-11 2.1e-07 33.0 0.0 10 51 433 474 429 480 0.92 4 15 3.6e-11 2.1e-07 33.0 0.0 10 51 481 522 478 528 0.92 5 15 3.6e-11 2.1e-07 33.0 0.0 10 51 529 570 526 576 0.92 6 15 3.6e-11 2.1e-07 33.0 0.0 10 51 577 618 574 624 0.92 7 15 3.6e-11 2.1e-07 33.0 0.0 10 51 625 666 622 672 0.92 8 15 3.6e-11 2.1e-07 33.0 0.0 10 51 673 714 669 720 0.92 9 15 3.6e-11 2.1e-07 33.0 0.0 10 51 721 762 718 768 0.92 10 15 3.6e-11 2.1e-07 33.0 0.0 10 51 769 810 765 816 0.92 11 15 3.9e-11 2.3e-07 32.9 0.0 11 51 818 858 816 864 0.92 12 15 3.6e-11 2.1e-07 33.0 0.0 10 51 865 906 861 912 0.92 13 15 3.4e-11 2e-07 33.1 0.0 10 51 913 954 909 962 0.92 14 15 3.9e-11 2.3e-07 32.9 0.0 11 51 962 1002 960 1008 0.92 15 15 8.9e-24 5.2e-20 73.4 0.0 10 89 1009 1085 1006 1085 0.95
Sequence Information
- Coding Sequence
- ATGCGTCTGAAAGATGATCTGACCGGTCAGAACAGCAATGACTcgggcggcagcggcggcgcaGGGCGCGCTACCACGCCGCACCTGAGGCCGACGCCCAGCCCCACCGGCTCAAGCGGCTCTCGCTCCATGTCACCGGCTGTTGGCACGCAGAACGTAGCGATGCCGCCgcgaacgtcgtcgtcgctatCAGACGGCAGCGGGCCGACGGCGCGTcccggcgcgggcggcgcgcccGGGGCGCCGACGCCCGGAGCCGGCCCGCCGCCCGGCGCCATGGTGGCGCAGCCCTACGGCCACCACCCCGCGTACAAGGCGCCGCACTACCCGCCGCCGCCCCAGCCCTACGGCTATCCGCCGAGGAACCATCACCCGTACCCCTACGGGTATCGTCCACCACCACCGCCGCATCCACCGCAGCACTACCCGCCACTCAAGCAACAACCGCGGCATATGGGTCCGCCGGGGCCGGGCGGGCCGGGCGGGCCGGGCGGGCCGGGCGGGCCGGGCGCGCCGGGAGCGCcggagggcgccatgccgccgcCCACCGCGCCCGCCGAGCCGCACGACAACGGGCCCGCTGCGCCCGCCACCGCGCTCGTCACCACCGGCCCCGACGGCGCGCCGCTAGACGAAGGTAGCCAGCAGAGCACACTAAGTAATGCCTCAGCAGCGTCCGGCGAGGAGGCGTGCGGCGGCGGCTCGAGCTCGAGCAAGGGCGGCTCCCGCAAGGAGTACGGCGCGGGCAGCGCGGCGCCGTCACCGTCACCCGGTGGCGGCTCGCACTCCTCGCTGCACGACGACTACGACGCGTCGCCCTCCTCGTGGCCGCGCCCGCCCTCTAGTCCCGTATTTAACAGTCACATACCGCCGGAGTCCTACAGATCAAAGGCTGAAAATTCAAGGCGACGAATAGGCGAGTCTTCGTCAGGCCCGAAGTCGGACTCGCTGGGCAAGCTGTACGAGATGGACGACGCCCCGGAGCGGCGCGGCTGGGTGGAGCGCCTGCTCGCCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTGACGAAGAACAAGACGTGGAAGGACATCGCGGGCCTGCTGGGCATCGGCGCATCGTCGTCGGCGGCCTACACGCTCCGCAAGCATTATACCAAGAATCTGTTGGCGTACGAGTGCCACTTCGACCGCGGCGGTATCGACCCACAGCCGATCATCAACCAGGTTGAGGCATCCACTAAGAAAAAGAGTGGGAAATCTAATAGTACGTCCAGTGCAGGTTCATCAAATTCGCAAGAGTCATTCCCAGTCGGTGGCAGCACCGCAAGTGGTGCTCCTATGGACGGCTACGGCGGTCAGTATGCTGGGTATCCGCCGCAGCCCAACCATTCGCAAGGCGGCGGGCCGGGCGGGGACAACCTCGCCGCCTCCAACCCGTTCGACGAGCCGCCGGGGCCCAGGAGACCCCCAGGTTACCAACAAGGATATGGTTACGAATATGGTACCCCCTACCAGCCAAATAGGCCAGTTTATCCACCGTACGGCCCCGATGGAGAGAGGGGTTACCGTGGCGGCGCGGGCGGAGAGTACCAGTACGGCGGGTACGGCGGCGCGTACCGCGGCGGCGCGCCGGGCGCCGGCACGCCGCCGCCCGCGGCGCCCTACCCCGAGTACTACCGCGCGCCGCACCCGCCGCagcacgcgccgcacgcgccgcacgcgccgcacCCGCACCCGCCGCTGCAGCagcacgcgccgcacgcgccgccgcacgcgccgccgcaccaggcgccgcacgcgccgcacgcgccgcacgcgccgcacgcgcccCACGCGCCccacgcgccgcacgcgccgcacgcgccgcacCCGCCACACCCGCCGCATCCGCCACATCCGCCGCACCCGCCCATCTCGCCGCAGCAGCCGCCCATCGAGATGTCAGGCGGCGCCGGGGCGAGTGTGGGTAACTTATTGAACTCGCAGCTGGCGCGACAGCTGGTGGCGCCGCTGCCGCCCAGCCCCAGACCCTACTACGGCGGCGCCAAGCAGctgggcgcgggcgcgggcgcgggcggcggcgcgccgcGGCGGCACCCCGACTTCGCCAAGGGCGACGCGCAGtacggcggcgcgggcgcgggcggcgcgggcgcgggcgcggcgggcgcggcgccgggcgcgggcgcggcgccgcgCTTCGGCGGCGCGTGGGGCGGCGCCTTCCCGCGCGCGCCGcagccgcccgcgcccgccgcgccctgGCGCCCGCCGCCCGGCGCGCAGCAGTGGGCGCACCCGCCCTACCAGCCGCCGCCTGGCGGAGCGCCAGCGTGGGGCGGAGCACCTCGACCACCTACACAGGACGGCACGTATTCCACCTCTGCGGCATCCTCACCTAATTCGGCTCCGACGTTAAACGCTGCAGGGCAGTTAAAACGTGAGCTGACATTCCCCGCCGAATGCGTGGAGGCGGCCGTGCCCACGGGCGAGAAGCGGCGCCGCCTCACTAAGGCTGACGTGGCGCCGGTGGACGCCTGGCGGATCATGATGGCGCTCAAGTCTGGCCTGCTGGCCGAGACGTGTTGGGCGTTAGATATACTCAACATTCTCCTCTTCGATGACAACTGTATAGGCTATTTTGGGCTCCAGCACATGCCGGGCCTCCTCGACTTATTATTAGAGCACTTCCACAAGAGTCTCAGCGACGTATTCGACTCGCCCTCGAACGACGAAGGGCCCTGGTATGCGCCGCCACCGAGCCCCGAGCCCGTAGTTAAGCCTGTAAAAACGAAGAAAGAACCCCGGGAACCGCCCGACCCCGCAGACCGCGTGCGAGTCCTATCCGGTGAGAACTTCACGCTCGAGTCGCGTCGGAGGCACCCCGTAATGTTCAAGTCCGACGAGGAGCTGTTCGCGCCCGAGGATGAGGGCAAGGACAACGAGAACGTGGAGGACGTGCTGGAGCCCTGGCAGTTCGGCGGCGACTCCGGCGGCGCCGCGCACGTCGTGCCGTGCTTCCGCGGCGAGTTCCTGCACCTGCCCTTCGTGCGCGTCAtggccggcgcgcgcgcgccctcgccgcccgccgccgAGCCGCGCCTCAAGCCCGAGCCcgacgcgccgccgcccgacccgcccgccgcgcccgcgcccgcgcccgccgccggcgAGCGCGACAACCTGGCGCCCGAGCCCATGGAGCTGGAGCCCGAGCGCCGCCCCGCGCTGGCCGTGCGCGACCCCGCCGGCGTGCTCAAGCGCCGCCGCCTCGAGGACTACGAGGACGAGTGTTACACGCGCGACGAGCCCAGCCTCAACCTCGTGGACGAGTCGCGCGACGCGCTCGCGCGCCGCTGCGTGGCGCTGTCGAACATCCTGCGCGGGCTCACGTTCGTGCCCGGCAACGAGGCCGAGTTCTCGCGCTCCGGCGCGTTCCTGGCGCTCGCCGGCAAGCTGCTGCTGCTGCACCACGAGCACGCGCCGCGggccgcgcgcgcgcgcgcctaCGAGCGCGCGGCGCGCGACGACGCCGACGCGTCCGCCTGCTGCTCGAGCCTGCGCGGCGGCGCCGAGTGGTGGTGGGACGcgctggcgcagctgcgcgagGACGCGCTCGTGTGCTGCGCCAACATCGCGGGCGGCGTGGAGCTGGCGGGCCAGCCCGAGGCCGTGGCGCGCCCGCTGCTGGACGGGCTGCTGCACTGGAGCGTGTGCCCCGCCGCCGTGGCGGGcgacgcgccgcccgccgccggcgCCGCGTCGCCGCTGTCGCCGCGCCGCCTGGCGCTGGAGGCGCTGTGCAAGCTGTGCGTGACGGACGCCAACGTGGACCTGGTGCTGGCCACGCCGCCGCGCACGCGCATCGCGGCGCTGTGCGCGGGGCTGGCGCGCGACCTGTGCCGGCCGGAGCGGCCCGTGGTGCGCGAGTTCGCCGTGAACCTGCTGCACTACctggcgggcgcgggcggcggcgcggcgcgcgacGTGGCGCTGCACGCGCCCGCCGTGGCGCAGCTCGTGGCTTTCATCGAGCGCGCCGAGCAGACGGCGCTGGGCGTGGCCAACCAGCACGGCGTGGCGGCGCTGCGCGACAACCCCGACGCCATGGGCACGTCGCTGGACATgctgcgccgcgccgccgccacgcTGCTGCGCCTGGCCGAGCACGCGGAGAACCGCCCGCTCATCCGGCGCCACGAGCGCCGCCTGCTGTCGCTCGTCATGTCGCAGATCCTGGACCAGAAGGTGGCGCACGAGCTCGCCGACGTGCTGTTCCACTGCAGCCAGCCGGCGCGGGAGgacgcgcccgccgcgctcgcCGCCGAGTAG
- Protein Sequence
- MRLKDDLTGQNSNDSGGSGGAGRATTPHLRPTPSPTGSSGSRSMSPAVGTQNVAMPPRTSSSLSDGSGPTARPGAGGAPGAPTPGAGPPPGAMVAQPYGHHPAYKAPHYPPPPQPYGYPPRNHHPYPYGYRPPPPPHPPQHYPPLKQQPRHMGPPGPGGPGGPGGPGGPGAPGAPEGAMPPPTAPAEPHDNGPAAPATALVTTGPDGAPLDEGSQQSTLSNASAASGEEACGGGSSSSKGGSRKEYGAGSAAPSPSPGGGSHSSLHDDYDASPSSWPRPPSSPVFNSHIPPESYRSKAENSRRRIGESSSGPKSDSLGKLYEMDDAPERRGWVERLLAFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVTKNKTWKDIAGLLGIGASSSAAYTLRKHYTKNLLAYECHFDRGGIDPQPIINQVEASTKKKSGKSNSTSSAGSSNSQESFPVGGSTASGAPMDGYGGQYAGYPPQPNHSQGGGPGGDNLAASNPFDEPPGPRRPPGYQQGYGYEYGTPYQPNRPVYPPYGPDGERGYRGGAGGEYQYGGYGGAYRGGAPGAGTPPPAAPYPEYYRAPHPPQHAPHAPHAPHPHPPLQQHAPHAPPHAPPHQAPHAPHAPHAPHAPHAPHAPHAPHAPHPPHPPHPPHPPHPPISPQQPPIEMSGGAGASVGNLLNSQLARQLVAPLPPSPRPYYGGAKQLGAGAGAGGGAPRRHPDFAKGDAQYGGAGAGGAGAGAAGAAPGAGAAPRFGGAWGGAFPRAPQPPAPAAPWRPPPGAQQWAHPPYQPPPGGAPAWGGAPRPPTQDGTYSTSAASSPNSAPTLNAAGQLKRELTFPAECVEAAVPTGEKRRRLTKADVAPVDAWRIMMALKSGLLAETCWALDILNILLFDDNCIGYFGLQHMPGLLDLLLEHFHKSLSDVFDSPSNDEGPWYAPPPSPEPVVKPVKTKKEPREPPDPADRVRVLSGENFTLESRRRHPVMFKSDEELFAPEDEGKDNENVEDVLEPWQFGGDSGGAAHVVPCFRGEFLHLPFVRVMAGARAPSPPAAEPRLKPEPDAPPPDPPAAPAPAPAAGERDNLAPEPMELEPERRPALAVRDPAGVLKRRRLEDYEDECYTRDEPSLNLVDESRDALARRCVALSNILRGLTFVPGNEAEFSRSGAFLALAGKLLLLHHEHAPRAARARAYERAARDDADASACCSSLRGGAEWWWDALAQLREDALVCCANIAGGVELAGQPEAVARPLLDGLLHWSVCPAAVAGDAPPAAGAASPLSPRRLALEALCKLCVTDANVDLVLATPPRTRIAALCAGLARDLCRPERPVVREFAVNLLHYLAGAGGGAARDVALHAPAVAQLVAFIERAEQTALGVANQHGVAALRDNPDAMGTSLDMLRRAAATLLRLAEHAENRPLIRRHERRLLSLVMSQILDQKVAHELADVLFHCSQPAREDAPAALAAE
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -