Basic Information

Gene Symbol
CSRNP3
Assembly
GCA_034770305.1
Location
JAPMIB010000005.1:3055462-3064819[+]
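
The Location field packs contig, start, end and strand into one string. A minimal sketch of how it could be split into its parts is shown here; the helper name `parse_location` is hypothetical, and the span length assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "contig:start-end[strand]".
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("JAPMIB010000005.1:3055462-3064819[+]")

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span_length = loc.end - loc.start + 1
print(loc, span_length)
```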

Transcription Factor Domain

TF Family
CSRNP_N
Domain
CSRNP_N domain
PFAM
PF16019
TF Group
Unclassified Structure
Description
This presumed domain is found at the N-terminus of cysteine/serine-rich nuclear proteins. These proteins act as transcriptional activators [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  8   0.4       6.3e+03   -2.4   1.9   37        91      21        76      14        102     0.52
2  8   2         3.1e+04   -4.7   0.9   87        91      176       180     136       214     0.49
3  8   9.3e-32   1.4e-27   97.8   0.1   1         76      839       914     839       938     0.91
4  8   8.3e-67   1.3e-62   212.4  8.6   83        217     997       1126    985       1127    0.87
5  8   0.84      1.3e+04   -3.4   1.3   49        82      1140      1173    1128      1197    0.43
6  8   1.2       1.9e+04   -3.9   0.3   43        56      1333      1346    1314      1379    0.46
7  8   2         3.1e+04   -7.3   9.7   58        86      1415      1441    1394      1463    0.32
8  8   1.4       2.2e+04   -4.2   13.8  43        107     1450      1514    1443      1532    0.54
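
Only some of the eight per-domain rows above have a low independent E-value. As a rough sketch, the rows can be parsed and filtered as follows. The 13-column layout follows the header line of the table; the cutoff of 1e-5 on the i-Evalue and the names `DomainHit`, `parse_rows` and `significant` are illustrative assumptions, not part of this record or of HMMER.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# Rows copied from the Hmmscan Out table (header line omitted).
ROWS = """\
1 8 0.4 6.3e+03 -2.4 1.9 37 91 21 76 14 102 0.52
2 8 2 3.1e+04 -4.7 0.9 87 91 176 180 136 214 0.49
3 8 9.3e-32 1.4e-27 97.8 0.1 1 76 839 914 839 938 0.91
4 8 8.3e-67 1.3e-62 212.4 8.6 83 217 997 1126 985 1127 0.87
5 8 0.84 1.3e+04 -3.4 1.3 49 82 1140 1173 1128 1197 0.43
6 8 1.2 1.9e+04 -3.9 0.3 43 56 1333 1346 1314 1379 0.46
7 8 2 3.1e+04 -7.3 9.7 58 86 1415 1441 1394 1463 0.32
8 8 1.4 2.2e+04 -4.2 13.8 43 107 1450 1514 1443 1532 0.54
"""


def parse_rows(text: str) -> List[DomainHit]:
    """Parse whitespace-separated rows; column 2 ("of") is skipped."""
    hits = []
    for line in text.splitlines():
        f = line.split()
        if len(f) != 13:
            continue  # ignore blank or malformed lines
        hits.append(
            DomainHit(
                int(f[0]),
                float(f[2]), float(f[3]), float(f[4]), float(f[5]),
                *map(int, f[6:12]),
                float(f[12]),
            )
        )
    return hits


def significant(hits: List[DomainHit], max_i_evalue: float = 1e-5) -> List[DomainHit]:
    """Keep hits whose independent E-value is at or below the cutoff.

    The default cutoff is an illustrative assumption.
    """
    return [h for h in hits if h.i_evalue <= max_i_evalue]


hits = parse_rows(ROWS)
sig = significant(hits)
for h in sig:
    print(f"domain {h.index}: hmm {h.hmm_from}-{h.hmm_to}, "
          f"protein {h.ali_from}-{h.ali_to}, score {h.score}")
```

Under this cutoff only rows 3 and 4 remain. They align to HMM positions 1-76 and 83-217 and to protein positions 839-914 and 997-1126, so the CSRNP_N match appears as two adjacent segments rather than one continuous alignment.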

Sequence Information

Coding Sequence
ATGGAACCACCATTGGTCCAGGATCCAATAGTGGATGATTCCGCCTTGGCCCACCACCATCACAATCATAATCACCTCGCGGACTTTGAATTTTCACACCACGACGATGATGCaccagagcagcagcagcagcagccaccgcaGCTACCAAGCGACGAATCATCCTGCTCAGACAAAGAATCCGAcccgacagcagcagcagcaacagaaacgacagaagaagcagcagcaacagtaacGACCGAAGCCGTAGCCAGCAGCTTGGTCCAGGAAGTGCCGCTCGATAACAATACCGACTACTCGTCCTCGTTGCTCGACCAAGTGGATTTGGGCCcgaaggcagcagcagcagtagcgaccACGACGACGCTCCTCGTCGACGCGTCCACaactgtagcagcagcgacagaaCAAAcggaaaaagaagaggaagaaaaagcaCCACTGCCGAGTCCTTCCGCGTTGACGCTCGACACGAGCGAAGATAACGGTTTGCGGCTATCCGAGAGCAACAgcaacgacagcagcagcgaggatCGTCACACGTGCGATGACACGCTCCAGCAACAAACTACTATGGGTAGTCTGTCTACGAAGAGCTATTCCGAGGAAGACACGAGGCTAGATTATTCGGAGAGCGTCGAGAATCCCTTGGACGCCGCAGTACCGGACGTAGTCCAGCCAGCTCCGTCGTCGCCCAGCAGTCCACTGGAGATGGCCCGAAGACGGACGAACGAGGTCGGCTCGCCGACGCCGCAACAGACGACCACGActgcgacgacaacgacgacgacgaccacgcCGAAGAAGGACTTTCAACCGTTCGGCGCCGAGGTCTCCGAGGAGAAGGCCCAGCAGCTGAAGAACTACGTGAAGCGTCTGTCGCCGATACACGTGAGCCAGCGCGAGCGAACACTGGGTGAAATATCCCTCGGCAGtgcgggcagcagcagcaacagtagcgAGGCCGACAGTGCGGAAATAATCGCCCAGGACGATGCCagcagtagtggtagtggtgCGCGATCGATGCAGGACgagtgcggcggcggcggcggcgtcgtcgtcgctgtagccggtggtagtggtggtgttAGCAGCAGTAATAgtggtacgacgacgacgacgacaatcgacggtagcagcagcagcagcagcagcagcagcagcaagcccaGCGATCCGAGGATGAGAGATCACCATCAACTGCGCCAGGTGATGATCAGGCTGACGCGGATCGACCGATATCCGCCTTCGCCCCCGCCGCAAGCCAGCGGCGGCAGTGTCGGAGGCAGCGATCAGGGCCACGGCAGCAGGCCGCGCTCGCCCTTCTCCGAGACGCCCATGTCGCCCGTCTGCCCGTACGGCTCGAGCCTGATCGTCGAGCGCTGCGACTCGCCGGGATACATCGAGAATCGCGTCCTCGATCAGACCGGCCGGACGGTGAGCCACCTCGATAGTGGCAGCGGccaccagcaccagcagcagcagacacacCAACACAGCCCGCGCTCGGTCGTCGAGAAGCTCAGCGTCGGCGACGAGATCACCTTCTCCAGTCCGGCTTCGCCCGCCACCGCCGCGTTATTGGCGGCGGCGGGTACAAGCGGACTAGTGGCGCGCGTCTCGTCGCCACCCGCgaccagcagcggcagcaacaatTTCACCCTGGTGGGTGGCACCCCGCTCGAGGTGGCCTCCACCACGGCGGACTCGACGGCCTGCGTGCTGGCGTGTATGGCCGCGGCCGGTGCCGATCTGTCCTGCCAGCATCAGGGCGGCCTCACCCAGGCGATCATCCAGCGACTGGACCAACCCATGCTGCCAGTCGTCAGTAGCCatcaccatcaccatcaccatcatcaccatcaGACAACGACTTACACGAGCCACGTAGGCAGCGATGATTCGTCGTCGCTGGTCttgtcgtcttcgtcggccgTGGGCGTGGGCGCCGCCGGTGCGCTGCCCGACGAGGTGCGCTCCGACGGCAGCGACTC
CGGCCTCGGCAACGagatcgcgcgcgagtgcgagcTCGCCGAGGCCTGCCCGACGTCGATGCCGTCCTCGGCGGTCACCACGCCGGACTTCGACATAGCCGAGACCTCGTTCCTCGATCGGATACCCGACGACATTCTCAACGTCAAGGATAAGCCGATGAGTCAGCTgtacggcggcagcagcagcagtagcagcgaggTGTTGAAGTCACCGACGTTGCCAATCacaagcggcagcagcagtaccagCGGCGCCATCAACATAAGTCAAAAAGTACCGACGAAGAGCAGCTTGAAGAGGCGGCGCTTGGAGGTGGTCGACGACTGCATCGTCGGCGGACTGGACGACGCCAGCAGTCCCAAGAGGCCCAACGTCCAGctgtcgccgtcgtcgtcgtcttcgtcggcgtccgtcgcgtcgtcgtcggtcgcGGCCGCAACGGCATCCACttcgactacgacgacgacgacgacgacgcagcagcagcagcagcaacagcagcaacagcagcccaAGAAGAAGCGCAACATACAGTTCGACGCGGTCACGGTGTATTACTTCCCCAGGGCGCAAGGTTTCACCTGCGTACCGTCTCAGGGAGGCAGCACACTGGGCATGCACGCGAGACACGAGCACATGGAGAGGTTCTCCCTGTCGGAGCACGCAGCGGAGCAGCGCCGCAatcatcgcgcgcgcctggCCCAGCTGCGCTCGGAACGTCGCTCGGCGGCCATCGTTGCGGCCGGCGTCGTGGGCCAGGATGCGAGCGTCAGCGGCATCATCAACGCCGACGGCGTCAACGACCTCGACGTCGGCGGCGCGCCTGCTCTACTGGACGTAGTGAGCTGTGGCGTCGACGACGTCGGCGTCAACGTTAACGTCAACGACGCGCCCGTCGGCAACAatagcaacagcaacagcagcagtatgctcggcggcggcggtggcggcaatGTCGGTGGCGACGCAGCCTCGGGCTCGGAGAGCCACAGCGACGACACGGACGACGAGCCGAGCGACGAGCACGAAGAGGAGGACCTCGACATGGACAGCTACTACTTCCTGCAGCCGGTGCCCACCTGGCAGAGGCGAGCTCTGCTCAGGGCCTCGGGGGTGCGCAAGATCGACTCCGTGGAGAAGGACGAGTGCCGCGAGATCAGGGCCAGCCGGGAGCACTGCGGCTGCGCCTGCAAGGGCTACTGCGACCCGGAGAGCTGCCCCTGCAGCCGCGCCAACGTCAAATGTCAAGTCGATCGGCAGGGCTTCCCGTGCGGCTGCTCGCGCGACGGCTGCGCCAATCGCTCCGGCCGCATCGAGTTCAATCCCATCCGCGTGCGCACCCACTTCATACACACGATAATGCGCCTGAAGCTGGACACGAACGAGGCGCAGCAGCGCGAGGAGAAGCTGCGCCAGCAGATGATGCACGAGCAGCATCACGCCAGactggcggcggtggcggtttCTCGatcgccgcagcagcagcagcagcttctcgcGTCTTCCTCGTCCTCCGGCTCCACGGGGACGGCCGCCAACTGCAGCGCGACGACCAGCGCCTCGTCGCTGATGGACTCGTCGAGCGACTGCCTCGGCGCGGGCTTCACCAGTCTGCACTACGACTCGTCGGACAGCGTCTCGTCGCGCCCCGACGGCCTGGACCTGTACGCGATGCGCGACGACTGCTATCCCACGGCCGCAGGCGGGgctggtggcggtggcggcgacgcGGACGGCCTCGGCGACACGGGACAGCGCAGCAAGCTGCAATCCGACTTTTCCCAGGGTTTTCAGCATTACGGCGTGACGGGACAGGGATCGAGTCTGGGCTTTCAATCCAACATCTATACCGATTACCATAGTTATCAGAATTTACCCTCGATGTCCAGATCGCCCTTTCAGATGCAGTTCCCGGCCAGCCCGGGATTTTCGCATTACAGCACCACCTATAGCCAGGACGCGACGTCCGTTACGAGCTCCGCCAACAACTGCCAACAGA
CGCACTCGCTCATGCAACACCAGCAACACCAGCACCAGCACAACAACGTCATCTACGACGCGCCTTTTGGCCAGGACGAGATGACGGGGTCGCAATACACAAATCTCAACACGCTCCAGTCGATGAATTCGGTCGTGCCACAGATGGGCAAGCTAGATTCGTTCTCCGAACTCCTATCTTCCAGATACTCTTACTACGGTAACTGCCTCGAAGCTCAACAACAGCCACAGCagccgcaacagcagcagcagcagccacagcaacAGCATCTGCAGTCTCAGGAGGAACAGCAAGAacaccatcatcatcagccGCAGTCGCAGCAGCCACCGCAGCCACAATCGCACGAGCAAATGCAGATACAACACGAGCAAACgccacagcaacagcaacactTGGAACAACTtcagcagcatcaacaacaacaacaacaacagctgcATCAAGTTTATCACCAAGTCAACGGGACTAAACTAGATCTAGCAGATAAGCTGCAGCATAATAATTCGCTCGagcagcaagagcagcagcagcagcagcagcacaacggGGACGATTGCGATGACAATTTCGGTGAGATCATTAAAAAGTCCATGGTCGAAACCGTCTCAGCCTAG
Protein Sequence
MEPPLVQDPIVDDSALAHHHHNHNHLADFEFSHHDDDAPEQQQQQPPQLPSDESSCSDKESDPTAAAATETTEEAAATVTTEAVASSLVQEVPLDNNTDYSSSLLDQVDLGPKAAAAVATTTTLLVDASTTVAAATEQTEKEEEEKAPLPSPSALTLDTSEDNGLRLSESNSNDSSSEDRHTCDDTLQQQTTMGSLSTKSYSEEDTRLDYSESVENPLDAAVPDVVQPAPSSPSSPLEMARRRTNEVGSPTPQQTTTTATTTTTTTTPKKDFQPFGAEVSEEKAQQLKNYVKRLSPIHVSQRERTLGEISLGSAGSSSNSSEADSAEIIAQDDASSSGSGARSMQDECGGGGGVVVAVAGGSGGVSSSNSGTTTTTTIDGSSSSSSSSSSKPSDPRMRDHHQLRQVMIRLTRIDRYPPSPPPQASGGSVGGSDQGHGSRPRSPFSETPMSPVCPYGSSLIVERCDSPGYIENRVLDQTGRTVSHLDSGSGHQHQQQQTHQHSPRSVVEKLSVGDEITFSSPASPATAALLAAAGTSGLVARVSSPPATSSGSNNFTLVGGTPLEVASTTADSTACVLACMAAAGADLSCQHQGGLTQAIIQRLDQPMLPVVSSHHHHHHHHHHQTTTYTSHVGSDDSSSLVLSSSSAVGVGAAGALPDEVRSDGSDSGLGNEIARECELAEACPTSMPSSAVTTPDFDIAETSFLDRIPDDILNVKDKPMSQLYGGSSSSSSEVLKSPTLPITSGSSSTSGAINISQKVPTKSSLKRRRLEVVDDCIVGGLDDASSPKRPNVQLSPSSSSSSASVASSSVAAATASTSTTTTTTTTQQQQQQQQQQQPKKKRNIQFDAVTVYYFPRAQGFTCVPSQGGSTLGMHARHEHMERFSLSEHAAEQRRNHRARLAQLRSERRSAAIVAAGVVGQDASVSGIINADGVNDLDVGGAPALLDVVSCGVDDVGVNVNVNDAPVGNNSNSNSSSMLGGGGGGNVGGDAASGSESHSDDTDDEPSDEHEEEDLDMDSYYFLQPVPTWQRRALLRASGVRKIDSVEKDECREIRASREHCGCACKGYCDPESCPCSRANVKCQVDRQGFPCGCSRDGCANRSGRIEFNPIRVRTHFIHTIMRLKLDTNEAQQREEKLRQQMMHEQHHARLAAVAVSRSPQQQQQLLASSSSSGSTGTAANCSATTSASSLMDSSSDCLGAGFTSLHYDSSDSVSSRPDGLDLYAMRDDCYPTAAGGAGGGGGDADGLGDTGQRSKLQSDFSQGFQHYGVTGQGSSLGFQSNIYTDYHSYQNLPSMSRSPFQMQFPASPGFSHYSTTYSQDATSVTSSANNCQQTHSLMQHQQHQHQHNNVIYDAPFGQDEMTGSQYTNLNTLQSMNSVVPQMGKLDSFSELLSSRYSYYGNCLEAQQQPQQPQQQQQQPQQQHLQSQEEQQEHHHHQPQSQQPPQPQSHEQMQIQHEQTPQQQQHLEQLQQHQQQQQQQLHQVYHQVNGTKLDLADKLQHNNSLEQQEQQQQQQHNGDDCDDNFGEIIKKSMVETVSA

Similar Transcription Factors

Sequences clustered by similarity using MMseqs2

100% Identity
iTF_01484629;
90% Identity
iTF_01483858;
80% Identity
-