Ctra044915.1
Basic Information
- Insect
- Cosmia trapezina
- Gene Symbol
- arid5b
- Assembly
- GCA_905163495.1
- Location
- LR991047.1:15970101-15989019[-]
Transcription Factor Domain
- TF Family
- ARID
- Domain
- ARID domain
- PFAM
- PF01388
- TF Group
- Helix-turn-helix
- Description
- This domain is know as ARID for AT-Rich Interaction Domain [2], and also known as the BRIGHT domain [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 3.2e-14 1.4e-10 42.9 0.0 3 89 232 305 230 305 0.90
Sequence Information
- Coding Sequence
- atgctcgactatgcTCGAGCGTGCGACGAAGTACTAGCCATCAACGATAAGGTAGTCGTCCGCGCAGACGACCTTCTCTCCTGGATGTGTGTGGGCACCGAGTGGCGCTGGGGGCTGCGGGCTGTGTGGCGCGGCGCCTGCGCCCCTCCGGCCGACCTGCCCCGCCACGCACCTCTCACTCACACCAAGTTAGATTTCAGTGATGTGGATACCGAGAAGAGTACTATTGCGGTAGACGCAGACGCTCCAGGGGTGGTAGTCTTCTCCTACCCCAGGTACTGCCGGTACCGGGCGCTGATAGCCAGGCTCGAAGGCATCCAGGCCTCATGGCTGCGGGACTCCTTGGTGGCAGCGCTAGGAGGGTACGCCGCCCCCACTAGTAACACCAGGATACTGTACTGCAAGGACACCTTCGAGTACCCCGAGCTGGAGGGCCACGAGTTCGTGTGCAACCACCTGGCGCCGCGGCTGAAGGGCCGGCCGCGCGGCCGCAGGCGCCGTGCGCACACGCCGGACCACGCGCACGCCGGCGAGCGCGACGACAGGTCCGACTCCGACGACCCGCCCGCCACTGGACCCGCCCATACCAGTGAGCCGCCGACCCCCCGCCGCCTCTCCCTCCGCAACGGCGCCCCCCCTCCCAGCGACGAAGACGAGGAGGAGTACAAGAGGATAGAACAGACTCCCGAAGACAGAGCCTTCCTGCAGACTCTGAAGATGTTCTACAAGAGCAGGAGTGAACCCTTCAAGAGTGGTCACACTTTGAAGCACTTATCTCTCCGCGCGTTGTACCTGTTGGTGGTGTCCCGCGGCGGCTACGAGGCCGTGTGTCGGCACAAGCTGTGGCGCGCGCTCGCTCGCGACCAGCCCGCCAGGACCAGGAGACACTACGAGAGATTCCTCCTCCCCTTCGAGAACCACGAGCGACGCAACGGCTTCTCCCTAAAGCTCAACGGCAAACTGGATGCTGAACCACGCACCATCGCCACCATCGACGTCACCGACTCCCCCCGACGAGACGACGACAACGTCCTCCGcaccccctccccctcccccaaACTCGACAACTACGACAAACTTGACACTTTCGACAAACTTGACAAACTTGGAATCGACAGCGAAACCGGAGAAATAACAACCAAAGATGAAATTATAGTCAAAAATGCCGAAGAGCTGAACAGAGAATTTCTAGACTCTCTACCGAAAGAGGAGAAAACGGTTAAGATATCGGTGAAACCGGTTGAGAAGCTGATCGAACCGGTTAAGGTTTTTGAGGGTTTGGATGTGACGAAGTCTGAGGGGCAGAAGGATGAGGGGAGGGATTTTAGTCAAAACTATTTGAGTGATATACAGAAATTTAGTATCGGAGCGGAGTCCCTACCGGCGCACGCGCCGCCTCTCAACGGACACGCTGCGCCTGCGCCCACCGACCTTAAGACGTGCCGGCCGGCCGGGCGCAGCTCGCTGCGCGCCGTGCGCGTGAAGCCGGCGCGTGCGCCCGCGCAGCAGCGCACGCCCGCCAGCACGCCCCCTCACCCGCTGCGGCCCGAGTCCGTGCCGGTGTCTCCTCCGGTCACGAACTTCGGCATCCACCACCCGCCGCCGCACGCGCACCACGACGACGACATCGTCGAGGTCCCCTACAAACCGAAAACCCCAGAAATTATCGACTTAGACGAATACCCAGAGAGCCCCCAAGCAGTAAAAAAGAAGAAACTAGACATTCTAAAAGAGCGAGGTCTCGAAGTGACCGCCCTGCCGGCTCCCTGGCAGGGCGTGGGCCCCCACATCATGGGACCCCCCATGCTCCTCAACCCGGCAGTCCAGCACCAGATCATGACTCAAGCGCAAATCTTCCAAATGTACAACATCATCCCACAGAACTATGCAAACGGTATACAGCCTCCAAGAGTCATCCAAGCATCTTCTATCTTCGGTAACACAGGACCTGAAAAGACTGTCTACGGCAACCCCAAAGATCCCTTCATGCCACCTCCTCATGTTCTTCACGGAGTACCAGTTAAACCTTTGAAAACCCCCCCTTCTACCATTCCTCAGGACATACTAGACCTTACCTGCAAGTCTTCAAGCCCCCCTCCACAGAAACCAGCAGTAGAAATAGTAAGAGTacctccctccccctctccaaCGGCGCAGAATTTAACAAAGAATTACACGTTAGTCGATGGAAAAGCAGTCGTCGGATCGAATTTAGAAATAACTTTAGTCAATAAGTCTTCGAGTCCTGGCAAACATCGACCGCCACAGAAGAGATCCAGCAATGGGAAGTTCATGTCGGCTAAAACTCCTACTCCTCCCAAGGAACCCTACCCTAAATACACTTCTCCTACCACAACCCAGAAAAAACCACCTATAAACATCCCCAACTACCAAATCAGAGACGATTCCTCTCCCACCCAAAGTCAGGTTCTGCAAAACGCAATGAAAAATCAGAATCTAGCTCAAATAATGGATTTGCAAAAGAACTCAGTTCCTATGACGTCTTTCATGGACCCTTATGTGGCCCTATACAGCAGTCTGGCTGGTCAGATGGACCAGAGACAACTGGCCATGTACCGAGACCTCATGACCAATCAATTTAGATACCCAGGACTTCTAAACCTTGGCGTAGCAACACCGACAACGAAAAATTAG
- Protein Sequence
- MLDYARACDEVLAINDKVVVRADDLLSWMCVGTEWRWGLRAVWRGACAPPADLPRHAPLTHTKLDFSDVDTEKSTIAVDADAPGVVVFSYPRYCRYRALIARLEGIQASWLRDSLVAALGGYAAPTSNTRILYCKDTFEYPELEGHEFVCNHLAPRLKGRPRGRRRRAHTPDHAHAGERDDRSDSDDPPATGPAHTSEPPTPRRLSLRNGAPPPSDEDEEEYKRIEQTPEDRAFLQTLKMFYKSRSEPFKSGHTLKHLSLRALYLLVVSRGGYEAVCRHKLWRALARDQPARTRRHYERFLLPFENHERRNGFSLKLNGKLDAEPRTIATIDVTDSPRRDDDNVLRTPSPSPKLDNYDKLDTFDKLDKLGIDSETGEITTKDEIIVKNAEELNREFLDSLPKEEKTVKISVKPVEKLIEPVKVFEGLDVTKSEGQKDEGRDFSQNYLSDIQKFSIGAESLPAHAPPLNGHAAPAPTDLKTCRPAGRSSLRAVRVKPARAPAQQRTPASTPPHPLRPESVPVSPPVTNFGIHHPPPHAHHDDDIVEVPYKPKTPEIIDLDEYPESPQAVKKKKLDILKERGLEVTALPAPWQGVGPHIMGPPMLLNPAVQHQIMTQAQIFQMYNIIPQNYANGIQPPRVIQASSIFGNTGPEKTVYGNPKDPFMPPPHVLHGVPVKPLKTPPSTIPQDILDLTCKSSSPPPQKPAVEIVRVPPSPSPTAQNLTKNYTLVDGKAVVGSNLEITLVNKSSSPGKHRPPQKRSSNGKFMSAKTPTPPKEPYPKYTSPTTTQKKPPINIPNYQIRDDSSPTQSQVLQNAMKNQNLAQIMDLQKNSVPMTSFMDPYVALYSSLAGQMDQRQLAMYRDLMTNQFRYPGLLNLGVATPTTKN*
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00448802; iTF_00373691; iTF_00043265; iTF_00039579; iTF_00040579; iTF_00237251; iTF_00425155; iTF_00745411; iTF_00363813; iTF_01440773; iTF_00905897; iTF_00907664; iTF_01526801; iTF_00120274; iTF_01525715; iTF_00124021; iTF_00726069; iTF_01246613; iTF_00036407; iTF_00924375; iTF_00122138; iTF_00906843; iTF_01439679; iTF_00757923; iTF_00622597; iTF_00147086; iTF_00711598; iTF_01533722; iTF_01063534; iTF_01064417; iTF_01030918; iTF_01062539; iTF_01061666; iTF_00446901; iTF_00445894; iTF_00928443; iTF_00444893; iTF_00685215; iTF_00042417; iTF_00809842; iTF_00273396; iTF_00808887; iTF_01084920; iTF_01084032; iTF_00111385; iTF_00274233; iTF_01119065; iTF_00300954; iTF_00301885; iTF_01116954; iTF_01118043; iTF_00041564; iTF_00121180; iTF_00123136; iTF_01339742; iTF_01338271; iTF_01424832;
- 90% Identity
- iTF_00905897;
- 80% Identity
- -