Basic Information

Insect
Apis dorsata
Gene Symbol
CSRNP3
Assembly
GCA_000469605.1
Location
NW:1057663-1066431[+]

Transcription Factor Domain

TF Family
CSRNP_N
Domain
CSRNP_N domain
PFAM
PF16019
TF Group
Unclassified Structure
Description
This presumed domain is found at the N-terminus of cysteine/serine-rich nuclear proteins. These proteins act as transcriptional activators [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 3 0.23 2.1e+03 -2.5 1.3 45 103 8 63 4 84 0.52
2 3 0.087 8.1e+02 -1.2 1.0 78 103 532 557 436 600 0.57
3 3 2e-101 1.9e-97 324.5 8.6 1 218 844 1054 844 1054 0.92

Sequence Information

Coding Sequence
ATGATGCTCGATAATCGTCGACTGTCTGTGAGCAAGTATCAAACGCTCCAGATGGAATCACCTTCGGCCCAGGACCGTCTTATCGAGGCCCAAGAAGATTTCACCTCGGCGGAATCAACATCGGAAGCAACATCCGCGTCGACTGAGGAACCGTCGACGATCACGGAGACCGAATCAACGGAACGCGACNNNNNNNNNNNNNNNNNNNNNNNNNNNACCGCCGCCGAAGAGGTGGACAATGTCATCGTTCGTGAATCCATCGATAACGAGGAAACGTTCGTCAGTGTAGAATCGTGCGAGGCTTCGCTCGCTTCTCTCGAGACCTCGAACGACTCTTCGACCGACGAAGGTCGACTAGATTTCTCCGACATGGGTCTTAAATTGGGGAAGAACGAGAAACGATCCGAGGATGAAGATGCGGATACGGTGAACGTAGAAGTgaaatcgaacgaattaaTCGTTGAACGAGATTGTTTAGAGGCGGTATCGACATCGACGTCGACATCGACGACGATATTAACTATATGCACGACATCGCAGACAACGGTATCTACGAGCATGGCAACATCGTTAATTTCATCATCAatgtcatcgtcgtcgtcgccatCGCCATCGCCATCGCCGTCGCCCTCACCATCGCCGTCGCCATCTCCGTCTCCATCGCCATCGCCATCGCCGTCTTCGTCGATGATTCAATGCGTCGAAAAACAGACGGAGTATTGCAAAACCAGTGACGGCGAATTCGTCACCTCGTCGCCGATATGTAGAAAGAGGCCGGCTAGCGATTTTCTGCCCATCAATGCGGAGATCAAGCGAATCGGCGTCGAGATGCCGGAGAACGAATCTAATCAAGTAAGGAGGATCTCCCCCGTGTTGGTCTCTCTTCGAGAGCGCACCCTCGGTGAGATATCGTTGTCCTCGGACTCGTGTCTGTTCGATAACGACGTCGGAAGTCGATGTGTGCCGAGAAACAACAGGATTCTCGATGATTTGTTAACGAGCACTAACTGCAGATTGACGACGAGCGCGACGGGTCTTGACGATTCTTTTCAAACGGATCACGATACCGAGGCAATAGATTGCACGGACACTGCGGAACGCAGGATCTGCAGAGCAGCGACATCGTTGGAAGGTGAATGTACCAATGGCAGCGTCGAGGAGGCGCTGGCAGAGCAGAGCGAAAAGCTCGAGTACCCGGACCCGTTCATCGACGAGGATTCCTGTTGTTCCCTGTCTCGCGACAGCCCCGAGAGGAATCTTAAAAAGTGTCAGGAGCCGAAACAGTGCGAGGAGCCGATCGACGACGAGTGCTCGTGCAACGACACCATGCCTGCGAACGATGCTTGCACTATTGCGCACGACGAGACATCTAATGAActgataaaaaattcgttaCCCAATTTAATAGTTAAGGTGGAACAGATCGATTTGTCCCGTTACTTAGCGGCCGAGCGGAAAGATAAGAAGCGTATGGTTATCATACCTAAGAGGAAGCAAAAGAAAAGTAGCGAATGCAAGGATGATGCCGGTAACGTAGACGTAAGGACTGATCCCAAAGAATCGTTGCAGTGTTTCGTCCCATTGTGCACTTATACGAACGAAAGTGATGAAAAGACTGAGTCACAGGACCAGGAGAAACAAGTAGACTCCGTTGTAGTATCCCCGTTGCAAACGGggcaagaaaaattagaatcatcTTTCACCACCCCCATTAAGGAATGCAAGGTGCTGTTACAGAGGATAACGTTGCCGAAAGCGGTAAGAACGATGACAACGGTACAGACATTGGACAAGGAAGAAACGGCACCGAGAGCGATAATAGAAAAGTTGGCCGAGGACGAGGAGGTCGTGTTTTCCTTCCCTCTACCTTCCTCCACGAATCTGCAAACGTTGAACAATTTAGATTCGATCGAGACGGCGAGACCCAGCTCCCCGGAGGAGTTGTTGGAATCTACTACTACGGATGTTCCAGAAGCTATCGATACGGAGACGGAAACCGAGACCGGTTCCGATAGTTCTGAAATGTCGAGCATGACGAACGTACGCCTCGGTGGATGCGACGATGACGCTGCTTCCGACCAGATATCCTGTCAAGAGAGCGAGTCCATTTGTTGCGTCGATATTAATCCGGAGATCATATCGAGGTTGGAGCCGGAGAGACCGGAAGCTTTCACCGAAGATTCAGCGGAAAGTTTGGCCCTTGCCACCGGTGCACGAGACGAGGTTAGATCGGATGGAAGCGATTCTGGCCTGGGAAGCGAGATTCCTGGTGATCCAGGGCCCGCACCAGTACCGGAAAGCGATTCCGAGACTTCTTTCTTAGATAGGATACCAGATGATATACTTTCCGACAAAGAAAAAGCAGTGAATCAATTGGATAGCTTTGCGCCGAACGTGGATGTATCCGACACGCCACAGACGCCATTGACGAATTTTCGTAGTCCTTCGAAGAGTAATCTGAAACGAAGATTGATGGATTGTATGGAAGGGGTTCCTAGCCCAAAGAGAAGCAATACGGACGAATCCATGAAAAAGAAACGCAATATTCAGTTCGACGCAGTCACGGTTTATTATTTTCCCAGGGCACAAGGTTTTACTTGTGTACCTTCTCAGGGTGGCAGCACTCTTGGCATGAGCGCAACGCACACTCATGCTGAACGATTCTCGTTATCGGAGCATGCCGCTGAACAGAGGCGAATTCATCGTGCTAGACTAGCTCAATTACGCTCCGAGCGTGCTGCAAATTGCGTATCGGAAGCAGCCTCCAGTTCTGAGGATCCTAGCGATGATACAGACGAGGAACAAAGTGATAACGAGGATCTGGATATCGATAGTTATTACTTTTTGCAACCAGTGCCTACATGGCAGAGACGAGCTTTACTTAGAGCAGCAGGAGTACGTAGAATAGACGCGGTCGAGAAGGACGAGTGCCGCGATATTAGAGCTAGCAGAGAACATTGTGGTTGCGGGTGTAAAGGATACTGCGATCCTGAGAGTTGTCCTTGTAGTCGAGCTAATGTAAAGTGCCAGGTTGATAGAGCGGGTTTCCCTTGCGGATGTACTCGCGATGGTTGTGCAAATAGTTCAGGCAGGATCGAGTTTAATCCAGTACGAGTACGAACGCATTTCATTCACACGTTAATGCGGTTAGAGTTAGAAAAAAAgcaacgagaagaagaagaaggcgcGGATCATGACGCTTCCGACAATCAAAACGGTAGAAGTCCGTTAAGAGAAATCAATTTAGGATCCGTGATGGAGAATAGGACCACGGAATCATGTTTGAATGGTGGTGGATTTACGACGTTGCATTATGAGAATCACGATGCCAGAGACGGCGGGACAAATTGTCAGCCAGAAGTATCTGGTACTAGAGAGGATAGCCTGGATTTGTATGCGATTAGAGATGATTGTTATCCTAGCGAAGATACTGTTGATGGTACGCAGGGACCTCAAAGGAAACTTCATCCTGAATTTAGTCAAGCTTTTCAAACATTCTCAGGCCAAACAAGTGCCGGAGTAAACTTTCAACAACCTGCTTATCAGGATTATCAGCCTTACGCTAACCTCCCTTCCACATCTAGGGTGCAATTTCAGCCGCAATTCCAAACGGTGCCAGGAAATGCAGGGTTCTCACATTATGCGCCTTACGGACAAGATACCGGATCAATTCAGGGGAACTGCCAGGTCCATCCCGGACAACATTCTTCCAGCTATGAAACTAGTTTTGCGCAAGATGAAACAACCGGATcacaatatacaaatttaaattcggtACAACCAATGAATACTGTGGTTCAACAGATGGGTAAACTAGAACCATTCTCTGAACTTTTGTCTGGTAGATATTCGTATTACGGTGAAATGGAGCCTCAGCCGCACAGTACTTATCACGGTAATGGAACCAAGGTTGAGGTAGAAAAGAACCAAGCTAACGAACAGCAATCGGAAAGTACAGAGGAGTGCGATGAAAATTTTGgggaaatcataaaaaaatcaatggtTGAGACTGTATCCGCTTAG
Protein Sequence
MMLDNRRLSVSKYQTLQMESPSAQDRLIEAQEDFTSAESTSEATSASTEEPSTITETESTERDXXXXXXXXXTAAEEVDNVIVRESIDNEETFVSVESCEASLASLETSNDSSTDEGRLDFSDMGLKLGKNEKRSEDEDADTVNVEVKSNELIVERDCLEAVSTSTSTSTTILTICTTSQTTVSTSMATSLISSSMSSSSSPSPSPSPSPSPSPSPSPSPSPSPSPSSSMIQCVEKQTEYCKTSDGEFVTSSPICRKRPASDFLPINAEIKRIGVEMPENESNQVRRISPVLVSLRERTLGEISLSSDSCLFDNDVGSRCVPRNNRILDDLLTSTNCRLTTSATGLDDSFQTDHDTEAIDCTDTAERRICRAATSLEGECTNGSVEEALAEQSEKLEYPDPFIDEDSCCSLSRDSPERNLKKCQEPKQCEEPIDDECSCNDTMPANDACTIAHDETSNELIKNSLPNLIVKVEQIDLSRYLAAERKDKKRMVIIPKRKQKKSSECKDDAGNVDVRTDPKESLQCFVPLCTYTNESDEKTESQDQEKQVDSVVVSPLQTGQEKLESSFTTPIKECKVLLQRITLPKAVRTMTTVQTLDKEETAPRAIIEKLAEDEEVVFSFPLPSSTNLQTLNNLDSIETARPSSPEELLESTTTDVPEAIDTETETETGSDSSEMSSMTNVRLGGCDDDAASDQISCQESESICCVDINPEIISRLEPERPEAFTEDSAESLALATGARDEVRSDGSDSGLGSEIPGDPGPAPVPESDSETSFLDRIPDDILSDKEKAVNQLDSFAPNVDVSDTPQTPLTNFRSPSKSNLKRRLMDCMEGVPSPKRSNTDESMKKKRNIQFDAVTVYYFPRAQGFTCVPSQGGSTLGMSATHTHAERFSLSEHAAEQRRIHRARLAQLRSERAANCVSEAASSSEDPSDDTDEEQSDNEDLDIDSYYFLQPVPTWQRRALLRAAGVRRIDAVEKDECRDIRASREHCGCGCKGYCDPESCPCSRANVKCQVDRAGFPCGCTRDGCANSSGRIEFNPVRVRTHFIHTLMRLELEKKQREEEEGADHDASDNQNGRSPLREINLGSVMENRTTESCLNGGGFTTLHYENHDARDGGTNCQPEVSGTREDSLDLYAIRDDCYPSEDTVDGTQGPQRKLHPEFSQAFQTFSGQTSAGVNFQQPAYQDYQPYANLPSTSRVQFQPQFQTVPGNAGFSHYAPYGQDTGSIQGNCQVHPGQHSSSYETSFAQDETTGSQYTNLNSVQPMNTVVQQMGKLEPFSELLSGRYSYYGEMEPQPHSTYHGNGTKVEVEKNQANEQQSESTEECDENFGEIIKKSMVETVSA

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

90% Identity
iTF_00141819;
80% Identity
-