Hnig006485.1
Basic Information
- Insect
- Hartigia nigra
- Gene Symbol
- ARID5B
- Assembly
- GCA_031001745.1
- Location
- JARPRG010000322.1:367473-380917[-]
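The Location value packs a contig accession, a coordinate range, and a strand into a single string. The following is a minimal Python sketch for splitting it into its parts. The names `Location` and `parse_location` are hypothetical, and reading the coordinates as 1-based and inclusive is an assumption that the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Matches strings shaped like "contig:start-end[strand]".
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'contig:start-end[strand]' string into typed fields."""
    m = _LOCATION_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m["contig"], int(m["start"]), int(m["end"]), m["strand"])


loc = parse_location("JARPRG010000322.1:367473-380917[-]")
# Genomic span in bp, ASSUMING 1-based inclusive coordinates.
span_bp = loc.end - loc.start + 1
print(loc, span_bp)
```

The span covers the whole gene locus on the contig, so it is expected to be longer than the coding sequence whenever introns are present.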
Transcription Factor Domain
- TF Family
- ARID
- Domain
- ARID domain
- PFAM
- PF01388
- TF Group
- Helix-turn-helix
- Description
- This domain is known as ARID, for AT-Rich Interaction Domain [2], and is also known as the BRIGHT domain [1].
- Hmmscan Out
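-
| # | of | c-Evalue | i-Evalue | score | bias | hmm from | hmm to | ali from | ali to | env from | env to | acc |
|---|----|----------|----------|-------|------|----------|--------|----------|--------|----------|--------|-----|
| 1 | 1 | 6.9e-24 | 1.2e-20 | 73.6 | 0.0 | 1 | 89 | 272 | 358 | 272 | 358 | 0.95 |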
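The hmmscan hit above is a single whitespace-separated row of 13 values. As a rough sketch, it can be read into named fields in Python as follows. The class `DomainHit` and the function `parse_domain_row` are hypothetical names, and the field order is assumed to follow the header shown in this record.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int       # "#"  : number of this domain hit
    of: int          # "of" : total domain hits for this model
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(row: str) -> DomainHit:
    """Parse one 13-column domain row (column order assumed from the header)."""
    f = row.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


hit = parse_domain_row("1 1 6.9e-24 1.2e-20 73.6 0.0 1 89 272 358 272 358 0.95")
# Number of protein residues covered by the alignment (inclusive coordinates).
aligned_residues = hit.ali_to - hit.ali_from + 1
print(hit.ali_from, hit.ali_to, aligned_residues)
```

Under this reading, the ARID domain aligns to residues 272 to 358 of the protein and covers positions 1 to 89 of the profile.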
Sequence Information
- Coding Sequence
- ATGTTGGAGGAAAAGCAATATGGCCGACCATTCAAATATTATGAAGCGACGGTTATGCAGCTGGTGTCCGTAGACATAACTAAATTCCTTGGGGATGAAATATCCTTTTACAAGAATACTCGTAGAAATTGGgATGAGGTTCTCACAATCTCGGAAAAGGTGGTCGTTCGCATTCACGATCTGGTCACATGGCTGTCACCGTCCCTGGAGTGGTCCTGCGGCAGGGAGGTATCCTGTCCCCCTATACCAACGTCGTCTCCAGAGAACAGTCCCCTAAAACCGGAGATTCCACAGCCACAAGTACTGACCGATCCCGGTATCGATTTTTCGGACGTGgagaaacaacaaaaagaatacGAGAGCAATTGTAGCTCGtcgaaaaacaatgaaagcAACAATCAGCCAAGAGTCGTGGTGCTATCGTATCCCAGGTACTGCCGGTATCGGGCTCTTCTACGTAGACTAGAGGGTGCCGAGCCATCCTGGTTATGTTCCTCCGTGGCAGCTGCACTTGGTGGTTTCACGGCATTACCCGGTACCAGGATACTCTTCTGCAGAGACACATTCGATTATCCTGATCTGGAAACGCACGAGTTACTTTGCAATCATTTAGcGCCAAAATTGAAAGGCCGGCCTCGACGAAAGCGTAAAAAGCGCAGTGCTTCTCCCGGTGAATCGAGCAACGAAAGTGAAGCGTCAGTGGCATCGGTGGTCGCCTCTACGTCACGGGGCGCCTCGACGTCCTCGAGTCTCGTCCAAGCCCTCCGACCACCAGCATCCGGGGTACGTCGTAGCGAACGAAAGACGAGTGCCGAGGAGAAGAGGTTCATGGCCGATGTACAGACCTTCATGATCTCCCGAGGCACGCCGGTCGGCAAAATGCCTTTATTAGGATACAGACAAATTGATCTGTATCTATTCTACACGAAAGTGCAGAGTCTGGGTGGTTACGATTCCGTTAGTGCCGGACGCCTATGGAAATCTATATACGATGACATTGGCGGCAACACCGGATCCACTAGTGCTGCTACCATCACGCGAAGGCATTACGAACGGTTGTTGCTGCCTTATGAAAGACATCagaaaggagaagaaacaaaagttaGGCCAACGCACGGTAGGCGTACCAAATCCAGCAGTATGTCCGAGGACACTGTAGAAATAAAACAGGAACCTGGGGCATCGTATCCCTTGCAGACACCTCCGCCACAACCGGGGACACCTACTTCAACGGCGGTAACACCACCACCATTGCAGTCTATCCTTTCATCGtccTTCCTACCTGGAGATAAACCCAGGACAGAAAGTGGAAAGACGTCCTCGCTCCGTAGCGTCAGGGTGAAACCGGAACGCCTTAAATCACTGAACACGATAATAGCAAACAGTAGTACACCACCAGCAACGCAGAGTAATCAGTTGCCGAGTCCTCCTCCGTCGGTAAATACACCGACATTGACACCGTGTAGTACGCCGTCGACAACCCCGACGGAAGGTCAGAGCGTGTTAGAACGGCAACTTAACAGTCCGTTGATATCGCAACAAGTGGCCCCAGAAAGTCCTCCGCCGACTAGTCTGCCAGTAGTAACGACGGTACCAAACCCAACTAATGCGTCAACAGTCGTGACGTTAACCCCACCACCGGAAAAGGACATAAAGGAGATGAAACTGGACCCGAAGCAGACGTCCCTGCTGGCTCAGGGCAAAGAGAACATACCTCTCTTCGGTGAGAAACCAATAGTCCCACCCAGGTCACCGGAAGTTATCGACTTGGAGACCGAAAGCGATACCAGTAGAGACAAGATCATCATTCCCAGCTTTAAGAAGCGAAAACTCGAGATTCTGCGTGAAGGTGGACTCGAGGTCACTGCCGTGGAATTAGACGCAAGGCCGAGCGTCATTCAGGCGACTATAGCACCGGTTTCCATGGCACCGACGTTTAATGCTAAAGCCGAAGAAAAGACTGCCTCACCGTTTCCCACGCCGGTCACGACGAACTCC
ATACCTAAACTCATTTCCGTCACAGTCACACCGGACATCAGTCATATGCTACCGTCGCCACATGAGAAGAGTCCCAGTCCGAATCTTCAGAATCGACTGCCAACGCCTACGAAGCAACCAGCTATTAACAACAATAGTCATGCCCAtcacaataacaataacaccTCGTTGAACAACAACAACGTGGCGAATAGTAGAGTCATCAATTTAGGCAGTTCGAACAACGCGACGCTTCTACAGCTCTATGCCAATGCAAATGCCAATGCAGTGGCGACGTCTCCAAGTAATAGTAATCATCGGTTCGTTTCTCCGGGTTTACCTAACGGAAGGATAGTACCTCCAAAGGTAACACAGTCCAGGTCCATCTTTGCACACAACGAGAAGACGGTCTACGGGAATCCAAAGGATATTTTAATACCGACAAAGTATCCTCATCAGCATCCACAGCCATCCCCTCATCCACCAAGGAGCAATAATCAGAGCAGTGGGGTGTTAGACCTCACTCAAAGACCAGGGGAGAAGCAAACGTTTTCCAGACCTAGTCTCGAGATCGTAAGGGTGCCGATAGTGCCGAGACCTAATCCATTAAATTTAGAAATGAAAAGTAGCGTCAAAGACAAGCAACAGGAATCCCAAAAGAAGTCATTCACGCCCTATCCAAATATGCTCGACAGTAGAACAATGATATCCAATAATCTGGAGATCACCTTAGTTAATCCTGCTAAGCAAAAGTCTCATCCTGGTACACCACCCCAGATATCTCCGCAGCAGCAAGTGCCGCCAACCCCAGCAGCGCGAAACAACGTAATGCCATCTCAGAGGAGACAACAAGCTAACGGGAAGTATCCACCGAGGAGCGAACCAGTCTCGCCGTACACACCAAGAAAACCAAATTATCCAGTAATACCAAACGTGCCGAATCTAAATCAGCTGAACAGCGTAGCCAATGCATCCTACGGTCGTGCGACTCAACAACAAATAGCCatcgagagaagaaaagcaGAGGCAAGCAAGGAACATCGCAGGATAAGCGAAGGCGATAGATCTGTGACTAtccaacaacagcaacaacatcagcagcaacagcaagaAGCTGCAAATCACAGACAATCATCAGACTCCTCGCGTCGACACAGCGTACCAAATGTATCCACGGGCTTGATGCTGGCTCTGCCCCAAAATCCTGGCTTCCTCTCGCAGCTACCGAACCCAGCTGGAAAGTTTATGCCGATTTTGGATCCCGTTTATTATCCAGCTTTCTACAATGGTCTCTTTCCACCACCAATGGCGCCTGCATCGACTCCATTCCTCGTACCGGAACTCAGCACTTATTACAAGGAGTTATTAGCCTCCTCGCAACCAAGACTGGCGATGGCCGGACAGCATCAACCTGCTGTCCCTACGTCCAAGTAG
- Protein Sequence
- MLEEKQYGRPFKYYEATVMQLVSVDITKFLGDEISFYKNTRRNWDEVLTISEKVVVRIHDLVTWLSPSLEWSCGREVSCPPIPTSSPENSPLKPEIPQPQVLTDPGIDFSDVEKQQKEYESNCSSSKNNESNNQPRVVVLSYPRYCRYRALLRRLEGAEPSWLCSSVAAALGGFTALPGTRILFCRDTFDYPDLETHELLCNHLAPKLKGRPRRKRKKRSASPGESSNESEASVASVVASTSRGASTSSSLVQALRPPASGVRRSERKTSAEEKRFMADVQTFMISRGTPVGKMPLLGYRQIDLYLFYTKVQSLGGYDSVSAGRLWKSIYDDIGGNTGSTSAATITRRHYERLLLPYERHQKGEETKVRPTHGRRTKSSSMSEDTVEIKQEPGASYPLQTPPPQPGTPTSTAVTPPPLQSILSSSFLPGDKPRTESGKTSSLRSVRVKPERLKSLNTIIANSSTPPATQSNQLPSPPPSVNTPTLTPCSTPSTTPTEGQSVLERQLNSPLISQQVAPESPPPTSLPVVTTVPNPTNASTVVTLTPPPEKDIKEMKLDPKQTSLLAQGKENIPLFGEKPIVPPRSPEVIDLETESDTSRDKIIIPSFKKRKLEILREGGLEVTAVELDARPSVIQATIAPVSMAPTFNAKAEEKTASPFPTPVTTNSIPKLISVTVTPDISHMLPSPHEKSPSPNLQNRLPTPTKQPAINNNSHAHHNNNNTSLNNNNVANSRVINLGSSNNATLLQLYANANANAVATSPSNSNHRFVSPGLPNGRIVPPKVTQSRSIFAHNEKTVYGNPKDILIPTKYPHQHPQPSPHPPRSNNQSSGVLDLTQRPGEKQTFSRPSLEIVRVPIVPRPNPLNLEMKSSVKDKQQESQKKSFTPYPNMLDSRTMISNNLEITLVNPAKQKSHPGTPPQISPQQQVPPTPAARNNVMPSQRRQQANGKYPPRSEPVSPYTPRKPNYPVIPNVPNLNQLNSVANASYGRATQQQIAIERRKAEASKEHRRISEGDRSVTIQQQQQHQQQQQEAANHRQSSDSSRRHSVPNVSTGLMLALPQNPGFLSQLPNPAGKFMPILDPVYYPAFYNGLFPPPMAPASTPFLVPELSTYYKELLASSQPRLAMAGQHQPAVPTSK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01198447; iTF_01199186; iTF_00296551; iTF_00251847; iTF_01473697; iTF_00253346; iTF_00298086; iTF_00292071; iTF_00295022; iTF_00294247; iTF_00295786; iTF_00297324; iTF_00254862; iTF_00252596; iTF_00292721; iTF_00254098; iTF_00291314; iTF_00293497; iTF_01394908; iTF_01475229; iTF_01474464; iTF_01472959; iTF_01130592;
- 90% Identity
- iTF_01198447; iTF_01199186; iTF_00251847; iTF_01473697; iTF_00253346; iTF_00298086; iTF_00292071; iTF_00296551; iTF_00295786; iTF_00297324; iTF_00254862; iTF_00252596; iTF_00295022; iTF_00292721; iTF_00294247; iTF_00254098; iTF_00291314; iTF_00293497; iTF_01475229; iTF_01472959; iTF_01130592; iTF_01474464; iTF_01394908;
- 80% Identity
- -