Basic Information

Gene Symbol
CSRNP3_1
Assembly
GCA_018230605.1
Location
DWAO01014501.1:3398-5596[-]
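
The Location value packs the contig accession, the coordinates, and the strand into one string. The following is a minimal sketch of how it could be split apart. It assumes the layout `contig:start-end[strand]` and 1-based inclusive coordinates; the record does not state its coordinate convention, so treat both as assumptions. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: <contig>:<start>-<end>[<strand>], strand being "+" or "-".
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("DWAO01014501.1:3398-5596[-]")
print(loc.contig, loc.start, loc.end, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption, the span for this record comes out at 2199 bases on the minus strand.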

Transcription Factor Domain

TF Family
CSRNP_N
Domain
CSRNP_N domain
PFAM
PF16019
TF Group
Unclassified Structure
Description
This presumed domain is found at the N-terminus of cysteine/serine-rich nuclear proteins. These proteins act as transcriptional activators [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  4   0.2       7.6e+03   -2.4   2.6   89        109     41        61      6         108     0.49
2  4   0.72      2.7e+04   -4.2   0.2   82        97      160       175     150       192     0.45
3  4   1e-100    3.9e-96   322.2  6.4   3         218     214       421     212       421     0.92
4  4   0.21      7.7e+03   -2.4   1.6   39        70      632       652     568       714     0.55
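
Each hmmscan row above carries thirteen whitespace-separated fields, and only one of the four domain hits has a small independent E-value. The sketch below shows one way to read the rows and keep the significant hit. The `DomainHit` class, the `parse_row` helper, and the `1e-3` cutoff are illustrative assumptions, not settings taken from this record.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# Rows copied from the Hmmscan Out table of this record.
ROWS = """\
1 4 0.2 7.6e+03 -2.4 2.6 89 109 41 61 6 108 0.49
2 4 0.72 2.7e+04 -4.2 0.2 82 97 160 175 150 192 0.45
3 4 1e-100 3.9e-96 322.2 6.4 3 218 214 421 212 421 0.92
4 4 0.21 7.7e+03 -2.4 1.6 39 70 632 652 568 714 0.55
"""


def parse_row(line: str) -> DomainHit:
    """Convert one whitespace-separated table row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


hits = [parse_row(line) for line in ROWS.splitlines() if line.strip()]

I_EVALUE_CUTOFF = 1e-3  # assumed threshold, chosen only for illustration
significant = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

for h in significant:
    print(f"hit {h.index}/{h.total}: protein {h.ali_from}-{h.ali_to}, score {h.score}")
```

With this cutoff only the third hit survives, covering protein positions 214 to 421 with a score of 322.2.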

Sequence Information

Coding Sequence
ATGGACAAGATACCTAGAGAAATTACGGAAAAAGCTGAAGAGCGACCAAATTCACCTAGGAAAATAGAGGTGCTCACAATGGAGCAAGTAGATGATAAGGAACATCTACCTGTGGTAAAGGATGAAAAGGCAAACGACAGTGATGCTCACAGTGAAGATCTTAATTCTAATGatgaacaaaataatgttaatagctCTCCGTTTTATGAGAGCAGTGATAGTAAAATAGAATATGCTATTAAGTTAGACGACAATGACACCTTAAGTGACATAGATGACATAAAAACAGACACAGCATCAGCAGATAGTGAAGACTCTGCTCTTGGTAGTCTGCCGCCTGATACTAACCTTCTTGATAGAGAAGAAGATGCTCAAGATCGATCTGATGGCAGTGACTCAGGGCTTGGCTCAGAGACACCAGATGATACGAAGGTACCCAGTTTTAATTTGCAACTACCTTGTGAAAACGATGAAAGCAATACACAATCAGCATGTGAAGAAAACAATTCAAGTACTTCTGATAGTAATGACCATATTGGTAATGACAATACTGtggaagtaataaataaacccCTCAAAAGCAgccttaaaagaaaatgtgatGATGAAATAATTGAAGAACCCCAACAAAAGAgaattaaacacaatatacAATTTGATAATGTTACAGTTTTCTATTTCCCAAGATCTCAAGGTTTTTCTTGTGTCCCATCTCAAGGAGGTTCAACATTAGGTATGGAATGGTCACACAGTCACACCCAAAAATTCACTCTACCTGAGCATGCCCTAGAACAGCGACGGTTGCACAGACAAGTTCTGCAACAATTAAAAAGTGAGAGGCACACATTACATGGTGTATCTCTATCATCAAGTGAAGATAGTGATACAGAAGAAGAGACAAGTGATATATCTGAATCAGAGCTAGATTTGGAtagttattactttttacaaCCAGTACCAACAAGGCAGAGACGAGCTTTATTAAGAGCAGCAGGGGTCAGAAAAATTGAAGGCTATGAAAAAGATGAATGTAGAGACATAAGAACTTCACGTGAATTTTGTGGCTGTGCATGTAAAGGTGTATGTAACCCTGCTAATTGCTCGTGTAGTTTAGGTGGCATTAAATGCCAAGTTGATAGATTAAACTTTCCGTGTGGGTGTACcagagatggatgtggaaataCTACTGGCAGAATAGAGTTTAACCCCATGAGAGTTCGAACTCACTTTATTAATACCCTTATGAGGGTAGgtcttgaaaagaaaaatgaagagGCTCAAGAAGCTGTTAAAAGAAAATGGGCTGAAGCACATGGTGTCAATGCCCAATGTGGCATTAGTacttttgaaaaagaaaatcaccTTAACCGTGAGGACATATCTTTGTCTAATATGAACATGAACCCTCACAGGCTTGGTATGGAATCTTGtataaacacaaattttactaatatacatCACAATTTAAGCAATCCCGGTACTAATCAAAGTGGCACATTTAGTTTTAGAAATGAATCTCTAGACCAcagaaatctaaaaaataacatgcTAAACTTCGAAGATAAATCTGATCACCGCAATCACGACCCTTACTCATCAAACATATTACAAGGCAAAGGACCTCCTTATACCACCACTAATACTATGGAATTTAACACAATGACAAACAATGTGCAACGATACCAATGTGATCTCAACTATTCTTATGATCAACACTCAGACCATCATTTCAAGGGTCTGCAGAGTTTTTCTGCAGCTAGTTTTGAAGAGTTCGCCCATAACTCTCAGATGTCGATGCTCAATCATTACGGCCACATGTACATGTCAGATTATATGCAGAAGCCGAGCTCTTGTGTTCCCGAGCATAATTCAATGCAATATCAATCAATGTCtcataatcattataatatgtataaaaacacaGAATGCATCACagaaaacaaaacagaaaCACATTATACGACATTGATGACAATGCCATATCAACAAAGCAATAAATT
GCAAGCTGTAGATAACGACGAGAATTGGTTTAGCCATAATACACTTTTAAATTTGGACCATTCAGTTCAAGTCACGCAAGAAGCAGCTGTAGTTCAGTCGGAACCAGCCCAAGtaacggcggaaagtgacaATTGTGAAACAACTGAAAACTTTGgtgaacttattaaaaaaactatggtAGAGTCTGTTACTGTgtag
Protein Sequence
MDKIPREITEKAEERPNSPRKIEVLTMEQVDDKEHLPVVKDEKANDSDAHSEDLNSNDEQNNVNSSPFYESSDSKIEYAIKLDDNDTLSDIDDIKTDTASADSEDSALGSLPPDTNLLDREEDAQDRSDGSDSGLGSETPDDTKVPSFNLQLPCENDESNTQSACEENNSSTSDSNDHIGNDNTVEVINKPLKSSLKRKCDDEIIEEPQQKRIKHNIQFDNVTVFYFPRSQGFSCVPSQGGSTLGMEWSHSHTQKFTLPEHALEQRRLHRQVLQQLKSERHTLHGVSLSSSEDSDTEEETSDISESELDLDSYYFLQPVPTRQRRALLRAAGVRKIEGYEKDECRDIRTSREFCGCACKGVCNPANCSCSLGGIKCQVDRLNFPCGCTRDGCGNTTGRIEFNPMRVRTHFINTLMRVGLEKKNEEAQEAVKRKWAEAHGVNAQCGISTFEKENHLNREDISLSNMNMNPHRLGMESCINTNFTNIHHNLSNPGTNQSGTFSFRNESLDHRNLKNNMLNFEDKSDHRNHDPYSSNILQGKGPPYTTTNTMEFNTMTNNVQRYQCDLNYSYDQHSDHHFKGLQSFSAASFEEFAHNSQMSMLNHYGHMYMSDYMQKPSSCVPEHNSMQYQSMSHNHYNMYKNTECITENKTETHYTTLMTMPYQQSNKLQAVDNDENWFSHNTLLNLDHSVQVTQEAAVVQSEPAQVTAESDNCETTENFGELIKKTMVESVTV
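
The protein sequence should follow from the coding sequence by codon-wise translation. Here is a small sketch of that check, assuming the standard genetic code (NCBI translation table 1); the record does not say which table was used. The `translate` function is a hypothetical helper. Lower-case bases are simply upper-cased before lookup, since the record does not explain what the mixed case denotes.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    seq = "".join(cds.split()).upper()  # drop line breaks, ignore case
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last three codons of the coding sequence above.
print(translate("ATGGACAAGATACCTAGAGAA"))
print(translate("ACTGTgtag"))
```

The two fragments translate to `MDKIPRE` and `TV`, matching the start and end of the protein sequence. The coding sequence is wrapped over two lines in this record; `translate` removes whitespace, so the full text can be passed in as copied.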

Similar Transcription Factors

Sequences clustered by similarity using MMseqs2