Basic Information

Gene Symbol
osa
Assembly
GCA_963675205.1
Location
OY776100.1:10175141-10207493[-]

Transcription Factor Domain

TF Family
ARID
Domain
ARID domain
PFAM
PF01388
TF Group
Helix-turn-helix
Description
This domain is know as ARID for AT-Rich Interaction Domain [2], and also known as the BRIGHT domain [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 15 9.3e-13 5.4e-09 38.1 0.0 2 51 329 378 328 384 0.91
2 15 3.7e-11 2.2e-07 32.9 0.0 10 51 385 426 383 432 0.92
3 15 3.6e-11 2.1e-07 33.0 0.0 10 51 433 474 429 480 0.92
4 15 3.6e-11 2.1e-07 33.0 0.0 10 51 481 522 478 528 0.92
5 15 3.6e-11 2.1e-07 33.0 0.0 10 51 529 570 526 576 0.92
6 15 3.6e-11 2.1e-07 33.0 0.0 10 51 577 618 574 624 0.92
7 15 3.6e-11 2.1e-07 33.0 0.0 10 51 625 666 622 672 0.92
8 15 3.6e-11 2.1e-07 33.0 0.0 10 51 673 714 669 720 0.92
9 15 3.6e-11 2.1e-07 33.0 0.0 10 51 721 762 718 768 0.92
10 15 3.6e-11 2.1e-07 33.0 0.0 10 51 769 810 765 816 0.92
11 15 3.9e-11 2.3e-07 32.9 0.0 11 51 818 858 816 864 0.92
12 15 3.6e-11 2.1e-07 33.0 0.0 10 51 865 906 861 912 0.92
13 15 3.4e-11 2e-07 33.1 0.0 10 51 913 954 909 962 0.92
14 15 3.9e-11 2.3e-07 32.9 0.0 11 51 962 1002 960 1008 0.92
15 15 8.9e-24 5.2e-20 73.4 0.0 10 89 1009 1085 1006 1085 0.95

Sequence Information

Coding Sequence
ATGCGTCTGAAAGATGATCTGACCGGTCAGAACAGCAATGACTcgggcggcagcggcggcgcaGGGCGCGCTACCACGCCGCACCTGAGGCCGACGCCCAGCCCCACCGGCTCAAGCGGCTCTCGCTCCATGTCACCGGCTGTTGGCACGCAGAACGTAGCGATGCCGCCgcgaacgtcgtcgtcgctatCAGACGGCAGCGGGCCGACGGCGCGTcccggcgcgggcggcgcgcccGGGGCGCCGACGCCCGGAGCCGGCCCGCCGCCCGGCGCCATGGTGGCGCAGCCCTACGGCCACCACCCCGCGTACAAGGCGCCGCACTACCCGCCGCCGCCCCAGCCCTACGGCTATCCGCCGAGGAACCATCACCCGTACCCCTACGGGTATCGTCCACCACCACCGCCGCATCCACCGCAGCACTACCCGCCACTCAAGCAACAACCGCGGCATATGGGTCCGCCGGGGCCGGGCGGGCCGGGCGGGCCGGGCGGGCCGGGCGGGCCGGGCGCGCCGGGAGCGCcggagggcgccatgccgccgcCCACCGCGCCCGCCGAGCCGCACGACAACGGGCCCGCTGCGCCCGCCACCGCGCTCGTCACCACCGGCCCCGACGGCGCGCCGCTAGACGAAGGTAGCCAGCAGAGCACACTAAGTAATGCCTCAGCAGCGTCCGGCGAGGAGGCGTGCGGCGGCGGCTCGAGCTCGAGCAAGGGCGGCTCCCGCAAGGAGTACGGCGCGGGCAGCGCGGCGCCGTCACCGTCACCCGGTGGCGGCTCGCACTCCTCGCTGCACGACGACTACGACGCGTCGCCCTCCTCGTGGCCGCGCCCGCCCTCTAGTCCCGTATTTAACAGTCACATACCGCCGGAGTCCTACAGATCAAAGGCTGAAAATTCAAGGCGACGAATAGGCGAGTCTTCGTCAGGCCCGAAGTCGGACTCGCTGGGCAAGCTGTACGAGATGGACGACGCCCCGGAGCGGCGCGGCTGGGTGGAGCGCCTGCTCGCCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTACGTGCCACCTCCGCTACCATATCGTATACCTTCATGGAGGAGCGGCGCACGCCCATCGCCGCCTGCCCCACCATCTCCAAGCAGCCGCTCGACCTCTACCGCCTCTACCTGCTCGTGCGCGACCGTGGCGGCTTCGTCGAGGTGACGAAGAACAAGACGTGGAAGGACATCGCGGGCCTGCTGGGCATCGGCGCATCGTCGTCGGCGGCCTACACGCTCCGCAAGCATTATACCAAGAATCTGTTGGCGTACGAGTGCCACTTCGACCGCGGCGGTATCGACCCACAGCCGATCATCAACCAGGTTGAGGCATCCACTAAGAAAAAGAGTGGGAAATCTAATAGTACGTCCAGTGCAGGTTCATCAAATTCGCAAGAGTCATTCCCAGTCGGTGGCAGCACCGCAAGTGGTGCTCCTATGGACGGCTACGGCGGTCAGTATGCTGGGTATCCGCCGCAGCCCAACCATTCGCAAGGCGGCGGGCCGGGCGGGGACAACCTCGCCGCCTCCAACCCGTTCGACGAGCCGCCGGGGCCCAGGAGACCCCCAGGTTACCAACAAGGATATGGTTACGAATATGGTACCCCCTACCAGCCAAATAGGCCAGTTTATCCACCGTACGGCCCCGATGGAGAGAGGGGTTACCGTGGCGGCGCGGGCGGAGAGTACCAGTACGGCGGGTACGGCGGCGCGTACCGCGGCGGCGCGCCGGGCGCCGGCACGCCGCCGCCCGCGGCGCCCTACCCCGAGTACTACCGCGCGCCGCACCCGCCGCagcacgcgccgcacgcgccgcacgcgccgcacCCGCACCCGCCGCTGCAGCagcacgcgccgcacgcgccgccgcacgcgccgccgcaccaggcgccgcacgcgccgcacgcgccgcacgcgccgcacgcgcccCACGCGCCccacgcgccgcacgcgccgcacgcgccgcacCCGCCACACCCGCCGCATCCGCCACATCCGCCGCACCCGCCCATCTCGCCGCAGCAGCCGCCCATCGAGATGTCAGGCGGCGCCGGGGCGAGTGTGGGTAACTTATTGAACTCGCAGCTGGCGCGACAGCTGGTGGCGCCGCTGCCGCCCAGCCCCAGACCCTACTACGGCGGCGCCAAGCAGctgggcgcgggcgcgggcgcgggcggcggcgcgccgcGGCGGCACCCCGACTTCGCCAAGGGCGACGCGCAGtacggcggcgcgggcgcgggcggcgcgggcgcgggcgcggcgggcgcggcgccgggcgcgggcgcggcgccgcgCTTCGGCGGCGCGTGGGGCGGCGCCTTCCCGCGCGCGCCGcagccgcccgcgcccgccgcgccctgGCGCCCGCCGCCCGGCGCGCAGCAGTGGGCGCACCCGCCCTACCAGCCGCCGCCTGGCGGAGCGCCAGCGTGGGGCGGAGCACCTCGACCACCTACACAGGACGGCACGTATTCCACCTCTGCGGCATCCTCACCTAATTCGGCTCCGACGTTAAACGCTGCAGGGCAGTTAAAACGTGAGCTGACATTCCCCGCCGAATGCGTGGAGGCGGCCGTGCCCACGGGCGAGAAGCGGCGCCGCCTCACTAAGGCTGACGTGGCGCCGGTGGACGCCTGGCGGATCATGATGGCGCTCAAGTCTGGCCTGCTGGCCGAGACGTGTTGGGCGTTAGATATACTCAACATTCTCCTCTTCGATGACAACTGTATAGGCTATTTTGGGCTCCAGCACATGCCGGGCCTCCTCGACTTATTATTAGAGCACTTCCACAAGAGTCTCAGCGACGTATTCGACTCGCCCTCGAACGACGAAGGGCCCTGGTATGCGCCGCCACCGAGCCCCGAGCCCGTAGTTAAGCCTGTAAAAACGAAGAAAGAACCCCGGGAACCGCCCGACCCCGCAGACCGCGTGCGAGTCCTATCCGGTGAGAACTTCACGCTCGAGTCGCGTCGGAGGCACCCCGTAATGTTCAAGTCCGACGAGGAGCTGTTCGCGCCCGAGGATGAGGGCAAGGACAACGAGAACGTGGAGGACGTGCTGGAGCCCTGGCAGTTCGGCGGCGACTCCGGCGGCGCCGCGCACGTCGTGCCGTGCTTCCGCGGCGAGTTCCTGCACCTGCCCTTCGTGCGCGTCAtggccggcgcgcgcgcgccctcgccgcccgccgccgAGCCGCGCCTCAAGCCCGAGCCcgacgcgccgccgcccgacccgcccgccgcgcccgcgcccgcgcccgccgccggcgAGCGCGACAACCTGGCGCCCGAGCCCATGGAGCTGGAGCCCGAGCGCCGCCCCGCGCTGGCCGTGCGCGACCCCGCCGGCGTGCTCAAGCGCCGCCGCCTCGAGGACTACGAGGACGAGTGTTACACGCGCGACGAGCCCAGCCTCAACCTCGTGGACGAGTCGCGCGACGCGCTCGCGCGCCGCTGCGTGGCGCTGTCGAACATCCTGCGCGGGCTCACGTTCGTGCCCGGCAACGAGGCCGAGTTCTCGCGCTCCGGCGCGTTCCTGGCGCTCGCCGGCAAGCTGCTGCTGCTGCACCACGAGCACGCGCCGCGggccgcgcgcgcgcgcgcctaCGAGCGCGCGGCGCGCGACGACGCCGACGCGTCCGCCTGCTGCTCGAGCCTGCGCGGCGGCGCCGAGTGGTGGTGGGACGcgctggcgcagctgcgcgagGACGCGCTCGTGTGCTGCGCCAACATCGCGGGCGGCGTGGAGCTGGCGGGCCAGCCCGAGGCCGTGGCGCGCCCGCTGCTGGACGGGCTGCTGCACTGGAGCGTGTGCCCCGCCGCCGTGGCGGGcgacgcgccgcccgccgccggcgCCGCGTCGCCGCTGTCGCCGCGCCGCCTGGCGCTGGAGGCGCTGTGCAAGCTGTGCGTGACGGACGCCAACGTGGACCTGGTGCTGGCCACGCCGCCGCGCACGCGCATCGCGGCGCTGTGCGCGGGGCTGGCGCGCGACCTGTGCCGGCCGGAGCGGCCCGTGGTGCGCGAGTTCGCCGTGAACCTGCTGCACTACctggcgggcgcgggcggcggcgcggcgcgcgacGTGGCGCTGCACGCGCCCGCCGTGGCGCAGCTCGTGGCTTTCATCGAGCGCGCCGAGCAGACGGCGCTGGGCGTGGCCAACCAGCACGGCGTGGCGGCGCTGCGCGACAACCCCGACGCCATGGGCACGTCGCTGGACATgctgcgccgcgccgccgccacgcTGCTGCGCCTGGCCGAGCACGCGGAGAACCGCCCGCTCATCCGGCGCCACGAGCGCCGCCTGCTGTCGCTCGTCATGTCGCAGATCCTGGACCAGAAGGTGGCGCACGAGCTCGCCGACGTGCTGTTCCACTGCAGCCAGCCGGCGCGGGAGgacgcgcccgccgcgctcgcCGCCGAGTAG
Protein Sequence
MRLKDDLTGQNSNDSGGSGGAGRATTPHLRPTPSPTGSSGSRSMSPAVGTQNVAMPPRTSSSLSDGSGPTARPGAGGAPGAPTPGAGPPPGAMVAQPYGHHPAYKAPHYPPPPQPYGYPPRNHHPYPYGYRPPPPPHPPQHYPPLKQQPRHMGPPGPGGPGGPGGPGGPGAPGAPEGAMPPPTAPAEPHDNGPAAPATALVTTGPDGAPLDEGSQQSTLSNASAASGEEACGGGSSSSKGGSRKEYGAGSAAPSPSPGGGSHSSLHDDYDASPSSWPRPPSSPVFNSHIPPESYRSKAENSRRRIGESSSGPKSDSLGKLYEMDDAPERRGWVERLLAFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVRATSATISYTFMEERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVTKNKTWKDIAGLLGIGASSSAAYTLRKHYTKNLLAYECHFDRGGIDPQPIINQVEASTKKKSGKSNSTSSAGSSNSQESFPVGGSTASGAPMDGYGGQYAGYPPQPNHSQGGGPGGDNLAASNPFDEPPGPRRPPGYQQGYGYEYGTPYQPNRPVYPPYGPDGERGYRGGAGGEYQYGGYGGAYRGGAPGAGTPPPAAPYPEYYRAPHPPQHAPHAPHAPHPHPPLQQHAPHAPPHAPPHQAPHAPHAPHAPHAPHAPHAPHAPHAPHPPHPPHPPHPPHPPISPQQPPIEMSGGAGASVGNLLNSQLARQLVAPLPPSPRPYYGGAKQLGAGAGAGGGAPRRHPDFAKGDAQYGGAGAGGAGAGAAGAAPGAGAAPRFGGAWGGAFPRAPQPPAPAAPWRPPPGAQQWAHPPYQPPPGGAPAWGGAPRPPTQDGTYSTSAASSPNSAPTLNAAGQLKRELTFPAECVEAAVPTGEKRRRLTKADVAPVDAWRIMMALKSGLLAETCWALDILNILLFDDNCIGYFGLQHMPGLLDLLLEHFHKSLSDVFDSPSNDEGPWYAPPPSPEPVVKPVKTKKEPREPPDPADRVRVLSGENFTLESRRRHPVMFKSDEELFAPEDEGKDNENVEDVLEPWQFGGDSGGAAHVVPCFRGEFLHLPFVRVMAGARAPSPPAAEPRLKPEPDAPPPDPPAAPAPAPAAGERDNLAPEPMELEPERRPALAVRDPAGVLKRRRLEDYEDECYTRDEPSLNLVDESRDALARRCVALSNILRGLTFVPGNEAEFSRSGAFLALAGKLLLLHHEHAPRAARARAYERAARDDADASACCSSLRGGAEWWWDALAQLREDALVCCANIAGGVELAGQPEAVARPLLDGLLHWSVCPAAVAGDAPPAAGAASPLSPRRLALEALCKLCVTDANVDLVLATPPRTRIAALCAGLARDLCRPERPVVREFAVNLLHYLAGAGGGAARDVALHAPAVAQLVAFIERAEQTALGVANQHGVAALRDNPDAMGTSLDMLRRAAATLLRLAEHAENRPLIRRHERRLLSLVMSQILDQKVAHELADVLFHCSQPAREDAPAALAAE

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-