Basic Information

Gene Symbol
Csrnp2
Assembly
GCA_012977825.2
Location
Scaffold:3992096-4010757[-]

Transcription Factor Domain

TF Family
CSRNP_N
Domain
CSRNP_N domain
PFAM
PF16019
TF Group
Unclassified Structure
Description
This presumed domain is found at the N-terminus of cysteine/serine-rich nuclear proteins. These proteins act as transcriptional activators [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1  6   1         8.8e+03   -3.7   0.1   68        104     28        64      10        73      0.51
2  6   0.49      4.3e+03   -2.6   1.0   39        82      168       212     148       239     0.48
3  6   1.6e-24   1.4e-20   74.2   0.2   1         50      539       588     539       588     0.98
4  6   2.5e-40   2.2e-36   125.8  13.2  148       218     586       656     584       656     0.98
5  6   0.099     8.7e+02   -0.4   0.2   16        52      803       839     793       874     0.80
6  6   0.012     1.1e+02   2.6    4.0   55        107     897       949     879       962     0.48
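
The domain rows above can be read programmatically to separate confident hits from background matches. The following is a minimal sketch, not the database's own pipeline: the field names in `DomainHit`, the helper names `parse_hits` and `significant`, and the i-Evalue cutoff of 1e-5 are all assumptions chosen for illustration. The column order is taken from the header of the table.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """One per-domain row; field names are illustrative, not an official schema."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# The six rows reported for this entry, copied from the record.
ROWS = """\
1 6 1 8.8e+03 -3.7 0.1 68 104 28 64 10 73 0.51
2 6 0.49 4.3e+03 -2.6 1.0 39 82 168 212 148 239 0.48
3 6 1.6e-24 1.4e-20 74.2 0.2 1 50 539 588 539 588 0.98
4 6 2.5e-40 2.2e-36 125.8 13.2 148 218 586 656 584 656 0.98
5 6 0.099 8.7e+02 -0.4 0.2 16 52 803 839 793 874 0.80
6 6 0.012 1.1e+02 2.6 4.0 55 107 897 949 879 962 0.48
"""


def parse_hits(text: str) -> List[DomainHit]:
    """Split whitespace-separated rows into DomainHit records, skipping malformed lines."""
    hits = []
    for line in text.splitlines():
        f = line.split()
        if len(f) != 13:
            continue
        hits.append(
            DomainHit(
                int(f[0]), int(f[1]),
                float(f[2]), float(f[3]), float(f[4]), float(f[5]),
                *(int(x) for x in f[6:12]),
                float(f[12]),
            )
        )
    return hits


def significant(hits: List[DomainHit], max_i_evalue: float = 1e-5) -> List[DomainHit]:
    """Keep hits whose independent E-value is at or below the (assumed) cutoff."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


if __name__ == "__main__":
    for h in significant(parse_hits(ROWS)):
        print(h.index, h.ali_from, h.ali_to, h.score)
```

Under this assumed cutoff only rows 3 and 4 pass. They cover protein positions 539-588 and 586-656 and align to the start (1-50) and end (148-218) of the profile.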

Sequence Information

Coding Sequence
ATGTTTGGTCGTCAGCAGGGAAGCAAGACTCAGCCCGAGATGGAATCACCTTCCGCCGAGCAACAGCTGACGGCAGAGGCCCAAGAGGATTCCAGCTCAGTAGAAGCTCTCCTACTGCAGTGTACGACGACGAGCTGCAACGACGACGACGGCCTCTTACTCGAAAGGTCAACCGAACAGGATGTTGACAATGGATCCGCTACGGGCTTTGCCTTCTTCGATGACCTGGGACAGCTAGCCAGCAACGACGACGTCGCGTCAGTAACTCTCTACAGCGAGCGTCTTCTTCAAGACAACGACATGGGCAGCTACGCGTCCAGGAACGAATCCACCGAGGAATTTAGGTTAGAGGACAGTCTCGAGGAGGTCTGCAACGAGGTCGCGAATCAGCCGAGTACCCCGAGGCGAAGGCCGACCGCCGACTTCCAGCCGATCGGCGTCGAGGTGTCCGAGGAGAAGGCCCAGCAGCTGAAGAACGACGTGAAGCGACTTTCGCCCGTTCACGTGAGTCTGCGCGAACGGACCCTCGGCGAAATATCGCTGACCAGCTCCGACTCCCGGAGCAGTGCTAGTTGTGCTAACGGTGCCGGTGAGGAGGAGGCCGACGAGAGGGACGATCAGGACAAAAGTGCCGAGTCGGCCGATCGATCGTGCGCGTCGAGACTGAGTAGTAGTAGCAGCAGCGCTAGTGATAGTGTAGGCAAGGGCCACCGGGAGCTGGTGATCCGACTGTCGCGCGTCGACGACGAACGACCGATCACCAGCCCGACGAGTTTGCTCGATTTGAGGAACGACGAGGACTTTGCCGCCGAGATGAGCGGCGTCGACATGAGCGCCGAAGCCAAGGCGGAGGAGGAGACGGCCACGGCGACCTGCCCCTTCGGCGCGAACCTCGTGCTGGAGCGCTGTAACGCGCCGGGCTTCGCCGAGAACCGCGTGCTCTCGAGCTGCGGCCGCGTCACGCTGCTCGACAAGTTCGAGGCCAAGAGCATCGTCGAGAAGCTGCACATCGGCGACGAGATCGAGTTCTCGAGCCCGACCTCACAGCCGCAGCTCCAGCAGGTGGGCTTCGCCGAGATGGAGGAGGGTGAGATCGAGGCCAGCTCCGAGGACGTGCTCGCCTCGATGGCGGCGGCCGCGGCGGGTGTCACCGCCGCGACGGCCTGTGAGGACGGCCTCACCCAGGAAATCATGACCAGGCTGGAGCAGGATCGGCCCGAGCCCTTCACCGAGGACTCGGCCGAGAGCCTCGCCCTGGCCGCCGGCGTGCGCGACGAGGTCAGGTCCGACGGCAGCGACTCCGGACTGGGCAGCGAGATGCCCGGCGACCCCTGCCCCGCACCCGCCCCCGAGAGCGACTCCGAGACTTCCTTCCTGGACAGGATACCCGACGACATTCTCTCCGACAAGGACAAACCCGCGAGTCAGCTGGACAGCTACGCGGTGGTCGAGCTGCCGAAGACGCTGCCGCTGCTCCGCGCGCCGGCCAAGAGCAGCCTCAAGCGCCGGCTGACCGACTGCCTGGAGGACGCCGAGCAGCCCGACAGCAAGCGGGCCAACGTCGAGGAGCAGGTGGCCGCCGGCAGCAGCTCCCTCGCCTCCTCGCCGGCCGGCAAGAAGAAGCGCAACATCCAGTTCGACGCCGTGACGGTCTACTATTTCCCGCGGGCCCAAGGCTTCACCTGCGTGCCTTCCCAGGGTGGCAGCACGCTCGGCATGTCGGCCACGCACACGCACGCCGAGCGCTTCTCGCTCCGGGAGCACTGCGGCTGTGGCTGCAAGGGCTACTGCGACCCCGAGAGCTGCCCCTGCAGCCGGGCCAACGTCAAGTGTCAGGTCGACCGGCAGGGCTTCCCCTGCGGCTGCTCGCGCGACGGCTGCGCCAACAGCTCCGGCCGCATCGAGTTCAACCCCGTCCGCGTGCGCACGCACTTTATCCACACGCTCATGCGCCTCGAGCTCGAGAAGAAGCCGGCGCACCGCGACGTCGAGGAAGCCCA
CCAGGAGAGCCACCACCACCACCATCACCACCAGAGCCGGCTCATGGAGACGGCCGGTGACTGTGGCCTGGCCGCCAGCGTCTCCGGCGCCGGCTTCACCGGCCTTCACTACTCCGACGGCCAGGACGGTGGCGCGCACACCGACAGCCTCGACCTCTACACCATCCGCGACGACTGCTACGTCGCCGACGATTGCCTGGTGGTAGGCGCCGAGCCCCAACAGCAGCAGAGGAAGTTGCACTCCGAGTTCGGTCAGGGCTTCCAGCACTACGGGCCAGCCCAGGGGCCCGGCATCAGCTTTCAGCAGCAGAACCCCTATGCCGATTACCAGAGCTACCAGTCTATGCCCTCGACGTCCAGGTCGCCGTTTCAGCCGCAGTTCCAGCCGGTCGCAACGAATACCGACTTCTCACACTACGGCTCGTACTCGCAGGAGCCAACCCCCGGTACGAGCTCCAACGGCTGCCTTCAGACTCACTCGCTCATGCAGCAACAGCAGCAGCAGCAGCACAGTAACATCATATACGACGCGCCATTCGCGCAGGATGAAGTCACCGGCTCCCAGTACACCAACCTCAACTCGATCCAGCCGATGAGTTCAGTAGGCCAGCAAATCGGAAAGCTCGAGCCCTTCTCGGAACTGCTCTCTGCCAGGTATTCTTACTACGGCGACGTCGTCGATCAACAACCGCAGAATCACCACCAACATCAACAACAGCAACCGCAGGAGCAACAGCAACAGCAGCACCACGGCGTTTATCACGCGAACGGCGACAAGATGGAGATGGAGAAGGGTGACGAGATGGTCGTGAACGAGCAGCAGGAGCAGCTGACCGAAGAGGACTGCGACGAGAATTTCGGCGAGATCATCAAAAAGTCCATGGTCGAGACTGTGTCCGCCTAG
Protein Sequence
MFGRQQGSKTQPEMESPSAEQQLTAEAQEDSSSVEALLLQCTTTSCNDDDGLLLERSTEQDVDNGSATGFAFFDDLGQLASNDDVASVTLYSERLLQDNDMGSYASRNESTEEFRLEDSLEEVCNEVANQPSTPRRRPTADFQPIGVEVSEEKAQQLKNDVKRLSPVHVSLRERTLGEISLTSSDSRSSASCANGAGEEEADERDDQDKSAESADRSCASRLSSSSSSASDSVGKGHRELVIRLSRVDDERPITSPTSLLDLRNDEDFAAEMSGVDMSAEAKAEEETATATCPFGANLVLERCNAPGFAENRVLSSCGRVTLLDKFEAKSIVEKLHIGDEIEFSSPTSQPQLQQVGFAEMEEGEIEASSEDVLASMAAAAAGVTAATACEDGLTQEIMTRLEQDRPEPFTEDSAESLALAAGVRDEVRSDGSDSGLGSEMPGDPCPAPAPESDSETSFLDRIPDDILSDKDKPASQLDSYAVVELPKTLPLLRAPAKSSLKRRLTDCLEDAEQPDSKRANVEEQVAAGSSSLASSPAGKKKRNIQFDAVTVYYFPRAQGFTCVPSQGGSTLGMSATHTHAERFSLREHCGCGCKGYCDPESCPCSRANVKCQVDRQGFPCGCSRDGCANSSGRIEFNPVRVRTHFIHTLMRLELEKKPAHRDVEEAHQESHHHHHHHQSRLMETAGDCGLAASVSGAGFTGLHYSDGQDGGAHTDSLDLYTIRDDCYVADDCLVVGAEPQQQQRKLHSEFGQGFQHYGPAQGPGISFQQQNPYADYQSYQSMPSTSRSPFQPQFQPVATNTDFSHYGSYSQEPTPGTSSNGCLQTHSLMQQQQQQQHSNIIYDAPFAQDEVTGSQYTNLNSIQPMSSVGQQIGKLEPFSELLSARYSYYGDVVDQQPQNHHQHQQQQPQEQQQQQHHGVYHANGDKMEMEKGDEMVVNEQQEQLTEEDCDENFGEIIKKSMVETVSA
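
The coding and protein sequences above can be cross-checked by translating the coding sequence. The sketch below assumes the standard genetic code (the record does not state which translation table was used) and uses a hypothetical helper named `translate`. For brevity it checks only the first ten and last four codons of the coding sequence against the corresponding ends of the protein.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    cds_start = "ATGTTTGGTCGTCAGCAGGGAAGCAAGACT"  # first 30 nt of the coding sequence
    cds_end = "GTGTCCGCCTAG"                      # last 12 nt, including the stop codon
    print(translate(cds_start))
    print(translate(cds_end))
```

Running the same function over the full coding sequence, with the line break inside it removed, should reproduce the listed protein if the standard-code assumption holds.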

Similar Transcription Factors

Sequences clustered by similarity using MMseqs2

100% Identity
iTF_01037004
90% Identity
-
80% Identity
-