Basic Information

Insect
Dione juno
Gene Symbol
CSRNP3_1
Assembly
GCA_037127375.1
Location
JAXRJS010000007.1:1008259-1010454[-]
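
The Location field packs contig, coordinates, and strand into one string. The following is a minimal sketch of how it could be split apart. The `Location` type and `parse_location` function are hypothetical names introduced here, and the span calculation assumes 1-based inclusive coordinates, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for the parts of a Location string."""
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Split a 'contig:start-end[strand]' string into its parts."""
    match = re.fullmatch(r"([^:\s]+):(\d+)-(\d+)\[([+-])\]", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


loc = parse_location("JAXRJS010000007.1:1008259-1010454[-]")
# Assumption: coordinates are 1-based and inclusive.
span = loc.end - loc.start + 1
print(loc.contig, loc.start, loc.end, loc.strand, span)
```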

Transcription Factor Domain

TF Family
CSRNP_N
Domain
CSRNP_N domain
PFAM
PF16019
TF Group
Unclassified Structure
Description
This presumed domain is found at the N-terminus of cysteine/serine-rich nuclear proteins. These proteins act as transcriptional activators [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  5   0.22      6.3e+03   -2.5   0.9   94        94      68        68      9         120     0.57
2  5   0.18      5.3e+03   -2.3   1.0   81        108     159       182     138       205     0.47
3  5   1.7e-101  4.9e-97   324.7  5.4   3         218     214       421     212       421     0.93
4  5   0.2       5.7e+03   -2.4   0.1   68        101     424       456     420       469     0.31
5  5   0.34      9.7e+03   -3.1   0.8   53        53      648       648     579       715     0.55
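
Of the five reported domain hits, only the third has a low i-Evalue (4.9e-97). The following is a minimal sketch of how the rows above could be parsed and filtered to isolate it. The field names and the `parse_rows` helper are hypothetical labels chosen for this example, and the 1e-5 cutoff is an assumed threshold, not one stated in the record.

```python
# Hypothetical field names, one per column of the table above.
FIELDS = [
    "number", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_FIELDS = {
    "number", "of", "hmm_from", "hmm_to",
    "ali_from", "ali_to", "env_from", "env_to",
}

ROWS = """\
1 5 0.22 6.3e+03 -2.5 0.9 94 94 68 68 9 120 0.57
2 5 0.18 5.3e+03 -2.3 1.0 81 108 159 182 138 205 0.47
3 5 1.7e-101 4.9e-97 324.7 5.4 3 218 214 421 212 421 0.93
4 5 0.2 5.7e+03 -2.4 0.1 68 101 424 456 420 469 0.31
5 5 0.34 9.7e+03 -3.1 0.8 53 53 648 648 579 715 0.55
"""


def parse_rows(text):
    """Turn whitespace-separated domain rows into a list of dicts."""
    hits = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != len(FIELDS):
            continue  # skip blank or malformed lines
        hits.append({
            name: int(value) if name in INT_FIELDS else float(value)
            for name, value in zip(FIELDS, parts)
        })
    return hits


I_EVALUE_CUTOFF = 1e-5  # assumed threshold for this sketch

hits = parse_rows(ROWS)
significant = [h for h in hits if h["i_evalue"] < I_EVALUE_CUTOFF]
for h in significant:
    print(f"domain {h['number']}: residues {h['ali_from']}-{h['ali_to']}, "
          f"score {h['score']}")
```

With the rows above, the filter keeps a single hit, the one aligned to protein residues 214 to 421.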

Sequence Information

Coding Sequence
ATGGACAAGAACGCTGTAGAAATGACTGAAAAATCAGAAGAACAACAAAGTTTACCTTGTAATATAGAGGTGCCCATAATGGAGCAAGTAGAGGATAAGGAATGCTTACCTGTGATAAAAGATGAAAAAGCAAGTGACAGTGATGGTCATAGTGACGATCTTAATTCTAATGatgaacaaaataatgttaatagcTCTCCGTTTTACGAGAGCAGTGATAGTAAGATAGAATTTGCTGTTAAATTAGAGGACAATGACACCTTAAGTGATATAGATGATATAAAAACAGATACAGCATCAGCAGATAGTGAAGACTCTGCTCTGGGTAGTCTGCCACCTGACAGTACCATTAATGATAGAGAAGAAGATGCACAAGATCGTTCTGATGGCAGTGACTCAGGGCTTGGATCAGAAACTCCAGATGATGCAAAGATACCCAGTTTTAATTTGCAATTACCTTCTGAAAATGATGAGAGCAATACACAATCAGCATCTGAAGAGAATAACTCAAGTTCTTTAGATACTAATGATAATGTTAGTGAGGACAATTCTGCAGATGTAATAAACAAACCCCTCAAAAGCAGCCTCAAAAGAAAATGTGATAAAGATATGAATGAAGGACCCCAGCAAAAAAGATTTAAGcacaatatacaatttgatAATGTTACTGTTTTCTATTTTCCACGAGCTCAAGGCTTTACTTGTGTTCCTTCTCAAGGAGGCTCAACATTAGGTATGGAATGGACCCACAGTCACACACAGAAGTTCACTTTACCTGAGCATGCCCTAGAACAGCGACGATTACATAGACAAATTCTACAACAATTGAAAAGTGAGAGGCATACATTACAAGGAGTATCCTTGTCATCCAGTGAAGATAGCGATACAGAAGAAGAAACAAGTGACATATCTGAATCAGAACTAGATTTGGAcagttattactttttacaacCAGTACCAACAAAACAGAGAAGAGCTTTATTGAGAGCAGCAGGTGTGAGAAAAATAGAAGGATATGAGAAAGATGAATGTAGAGACATAAGAACTTCTCGTGAATTTTGTGGCTGTGCCTGTAAAGGTGTTTGTAATCCCATTAATTGTTCATGTAGCTTGGGTGGCATTAAATGCCAAGTTGATAGATTAAACTTTCCGTGTGGGTGCACTAGAGATGGGTGTGGAAACACATCTGGCAGAATAGAGTTTAACCCTATGAGAGTACGAACACATTTCATTAATACTCTTATGAGGATAGGtctagaaaagaaaaatgaagaGGCTCAGGAAGCTGTCAAAAGAAAATGGGCAGAAGCACATAGTGCTAACACCCAAAGCACTACTGCCTTCACTCAGGAAATTCACCATAACCATGAAGATACATCTTTATCTAATATGAACATCCCTCACAGCCTTGGCATTGAATCTTGtttaaacacaaattttaGCAACACACTTCACAATTTAAGCAATCCTAGCACTAATCAAAGTGACACATTTAGTTTTAGAAATGAATCTCTAGAccatagaaatttaaaatgtaacatgCTGAACTTTGAAGATAAATCTGATCACTGCAATCATGATCCTTACACATCTAACATATTACAAGGTAAAGGGCCTCCTTACTCTGCCACAAATGCTATTGAATTTAACACAATGACAAACAATATGCAAAGATACCAATGTGATCTCAACTATTCTTATGACCAGCATTCAGACCAACATTTTAAAGGCTTACAGAGTTATTCAGCAGCTAGTTTTGAAGAGTTTGCACATAACTCTCAGATGTCTATGCTCAATCATTATGGCCACATGTACATGTCGGATTACATGCAGAAGCCGAGCACCAGTGTCCCTGAACACAATTCAATGCATTATCAGTCAATGTCacataatcattataatatgtataaaaatacagaatGCATCACAGAAAACAAAACAGACCCAATGCATTATACAACATTGATGACAATGCCATATCAACAAAGCAATAAATTGCA
AGCCGTAGACAATGATGAGAATTGGTTTAgtcataatacattattaaatttggacCATTCTGTTCATGTCACGCAAGAAACAAATGTAGTTCAGTCTGAACCGGCTAAAGTTGAAGCTGAAAGTGACAATTGTGAAACAACTGAAAACTTTGgtgaacttattaaaaaaactatggtAGAGTCTGTTATAGTGTAG
Protein Sequence
MDKNAVEMTEKSEEQQSLPCNIEVPIMEQVEDKECLPVIKDEKASDSDGHSDDLNSNDEQNNVNSSPFYESSDSKIEFAVKLEDNDTLSDIDDIKTDTASADSEDSALGSLPPDSTINDREEDAQDRSDGSDSGLGSETPDDAKIPSFNLQLPSENDESNTQSASEENNSSSLDTNDNVSEDNSADVINKPLKSSLKRKCDKDMNEGPQQKRFKHNIQFDNVTVFYFPRAQGFTCVPSQGGSTLGMEWTHSHTQKFTLPEHALEQRRLHRQILQQLKSERHTLQGVSLSSSEDSDTEEETSDISESELDLDSYYFLQPVPTKQRRALLRAAGVRKIEGYEKDECRDIRTSREFCGCACKGVCNPINCSCSLGGIKCQVDRLNFPCGCTRDGCGNTSGRIEFNPMRVRTHFINTLMRIGLEKKNEEAQEAVKRKWAEAHSANTQSTTAFTQEIHHNHEDTSLSNMNIPHSLGIESCLNTNFSNTLHNLSNPSTNQSDTFSFRNESLDHRNLKCNMLNFEDKSDHCNHDPYTSNILQGKGPPYSATNAIEFNTMTNNMQRYQCDLNYSYDQHSDQHFKGLQSYSAASFEEFAHNSQMSMLNHYGHMYMSDYMQKPSTSVPEHNSMHYQSMSHNHYNMYKNTECITENKTDPMHYTTLMTMPYQQSNKLQAVDNDENWFSHNTLLNLDHSVHVTQETNVVQSEPAKVEAESDNCETTENFGELIKKTMVESVIV
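
The relationship between the coding and protein sequences can be checked by translation. The following is a minimal sketch, assuming the standard genetic code applies; the `translate` function is a hypothetical helper written for this example. It uppercases its input because the coding sequence mixes upper- and lowercase letters, and it is shown here on only the first and last few codons of the coding sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop.

    Codons containing characters other than A, C, G, T become 'X'.
    Any trailing partial codon is ignored.
    """
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First seven codons and last four codons of the coding sequence above.
head = translate("ATGGACAAGAACGCTGTAGAA")
tail = translate("GTTATAGTGTAG")
print(head, tail)
```

The two fragments translate to `MDKNAVE` and `VIV*`, matching the start and end of the protein sequence, with the final `TAG` read as a stop codon.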

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00775262;
90% Identity
iTF_00458060;
80% Identity
-