Basic Information

Gene Symbol
CSRNP3
Assembly
GCA_933228635.1
Location
CAKOFY010000082.1:233765-242061[-]
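
The Location value packs the contig accession, coordinates, and strand into one string. A minimal sketch of splitting it into fields follows. The `Location` type and `parse_location` helper are hypothetical names introduced here, and the span length assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for one Location value."""
    contig: str
    start: int
    end: int
    strand: str


# Expected shape: <contig>:<start>-<end>[<strand>]
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


def span_length(loc: Location) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CAKOFY010000082.1:233765-242061[-]")
print(loc.contig, loc.strand, span_length(loc))
```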

Transcription Factor Domain

TF Family
CSRNP_N
Domain
CSRNP_N domain
PFAM
PF16019
TF Group
Unclassified Structure
Description
This presumed domain is found at the N-terminus of cysteine/serine-rich nuclear proteins. These proteins act as transcriptional activators [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  4   0.016     4.1e+02   1.2    6.0   42        114     76        146     28        164     0.78
2  4   0.022     5.6e+02   0.8    2.2   73        120     242       292     213       301     0.47
3  4   0.28      7.2e+03   -2.9   0.6   80        103     345       364     305       397     0.47
4  4   2.4e-95   6.3e-91   304.7  7.1   2         218     510       712     509       712     0.92
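
Of the four domain hits reported above, only the last has a small independent E-value. A minimal sketch of separating significant hits from weak ones follows. The column names, the `parse_rows` and `significant` helpers, and the `1e-5` cutoff are assumptions chosen for illustration, not settings taken from this record.

```python
# Column order as listed in the Hmmscan Out header of this record.
COLUMNS = [
    "num", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]

# The four domain rows, copied from the record.
ROWS = """
1 4 0.016 4.1e+02 1.2 6.0 42 114 76 146 28 164 0.78
2 4 0.022 5.6e+02 0.8 2.2 73 120 242 292 213 301 0.47
3 4 0.28 7.2e+03 -2.9 0.6 80 103 345 364 305 397 0.47
4 4 2.4e-95 6.3e-91 304.7 7.1 2 218 510 712 509 712 0.92
"""


def parse_rows(text: str) -> list[dict[str, float]]:
    """Turn whitespace-separated domain rows into dicts keyed by column name."""
    hits = []
    for line in text.strip().splitlines():
        values = [float(field) for field in line.split()]
        if len(values) != len(COLUMNS):
            raise ValueError(f"expected {len(COLUMNS)} fields: {line!r}")
        hits.append(dict(zip(COLUMNS, values)))
    return hits


def significant(hits: list[dict[str, float]], max_i_evalue: float = 1e-5):
    """Keep hits whose independent E-value is at or below the (assumed) cutoff."""
    return [hit for hit in hits if hit["i_evalue"] <= max_i_evalue]


for hit in significant(parse_rows(ROWS)):
    print(int(hit["num"]), int(hit["ali_from"]), int(hit["ali_to"]), hit["score"])
```

With this cutoff, only hit 4 is kept, covering protein positions 510 to 712.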

Sequence Information

Coding Sequence
ATGTTTACACCCATTAAAAACTTAATAACATACATACAAAAGAATGCAAGTGGTAATTCGATATTTGGAGGAAATAGTACAGCAGAAGAATCAGCAAAAATGGAAACAGAAAATATGAACATAGAAATAGAGACTAAAGTCGAAAGTGAAACTGCAATTGTGAAGGCTGAAAGTGAGATTTTAAAAGAAAAAGGTGAAAAAGAAGCAATGGCGGTTGATTTAGAAGAAAAAGAAGCAAGCATCGAAGAATTTCAAGATTCAGAAAGTAAAGAAGTTCAAGACTCTGAAAGCGAAGAAGCTAAAGAAACGGAAACAAAGTCCAACACCATCAACACTAACATCAACAGCACTGTTACAACAACGGTATCCGCAGAAAATTCATGTGATTTTGAGGATGATGACATTGAGGCACAAATCGAAAGGAGTATTATGTTGAAAAATCCTGTCAAAGTGGCGGCCTCAAAAACCACGGAAATACTAGAAGTTAAAAAGTCTTCAACCACTAATTCCAGTATTGTTATAAGTCATGATTATAATCGTAATCAGGCCCTAACGCCCATTGATGATGAATTGGAAGATTTTAGGCATGATCCCACTGAAATAATGGGCACCACATCATCGCCATGCTCTAATCAACCGGCCGATGAGTTTTGTTCGGTGGGAAGTGCAGAAGAAGTGTTGGAAAGAGATGCAACTACGGCGGATGAGTTTACTAACGATGAAGAAGAAGAAGTTGAAATGGAAGATAATGAGACAGAAGAGGATGAAACAAAAGCTCTTAATGATGTAGAAACTGATTACGATGAAGATAGGGATATCGTTGAAGTCGACGATGATGATCATGAGGCATTTTTAGATTTGGAGAGTTATAAAAACTCCAATGTAATCGTCTTGGATGAAGTCGATCTCAATGGCTCTACATTATCCATTGACACAAATAATACAGCAGACGATCCTCTGGCTATAGCTTGTGAAGAACAGGAAGCATTTGCTGCCGATAAGACGGCAGCATTAGAGGAACTAAATTTACTCAACAACTCACGAACATCTCAAGACGAAGACACAAGAACAACAGCCGACGATGATCTGCAGTGGTTGAAAACAGATCTAGACGATAATCAACCGTCTTGCAGCAATAAAGTGCTGCTTAAAGAAAACACCACAGAAGTCACCCCAGAAAAGGAGGAACGTAATGGTGAGGGTTCCGATTCCGGTTTAGGCAGTGAAACATCTGCTTTGCACATAACCACTACAAGTATTAACGATACTAGTCATTTGAATATGACCACAGCGACACCCACGCAAACGCCTACACAAACAAAAACGAAAGACGGTGAAAAATTACTGAGCACAAATGTGCTGCAGCAAACAAAATCTCCAGCTAAAGAATCTCCCAAACCCTATAGATCAAATTTAAAGCGACGCCTAGAAGTTGACGATGATGTTGTAGAAGGTTTGGGAGCTTTAGGCTCGACAGCTTCGGCCTTAAGTGAGCATAGTTCTTTAAGTGGTAGCATACAAAAGAAACCAAAGCGTTCCATAAATTTCGATAGTGTGCAAGTATTTTATTTTCCTCGGCAACAGGGCTTTAGTTGTGTACCCTCAGCAGGTGGTTGTACTTTAGGTATGGGAGCTAGACATGTGGCTTTCAAAACTCTAACATTAGCTGAACATGCCGCCGAGCTGAGGCGTGCCCATCGCTTGCAGTTACAGGAAATAAATCCCAGGGGTTCTTCTAGTGATGATAGCGAAGAGTCTGAGGAGGATTATTTAAGTGAAGGCAGTGGTTCGGATCTGGATGGTGAATCGAATGGTTTCCTACAGCCGGTTTCACCCAAACAGAGAAGAGCTTTGCTCAAAGCTGCAGGCATACGCAAAATCGATCCCAGTGAAAAGGCCGAATGCCGTAATATACGCAATAGTCGAGAGGTGTGTGGCTGTAGCTGTCGTGATTTTTGTGATCCCGAGACATGTGCTTGCTCTCAATCTGGCATTAAATG
TCAAGTTGATCGTGATATGTTCCCTTGCGGTTGCTCTCGAGATGCCTGTGGCAATACAATTGGTCGTGTTGAATTTAATCCAACTCGAGTGCGTACCCATTACATACACACCCTAATGCGTTTGGAAATGGAAAATCGTCAGCAACAAAATCCCTACTCTTCTGCCGTTGCCTCACCCATGCAACCAACACCCACTTCCTTCTATCAAAATCACTTGCAACCGCAATCAAACTACAGTTCGGGTTATGCATCTCCAGCCTATAATACTGCCTCCGAACTGCAACAGCAACAGACGTCTGCCAATACCTACTATCATCCCCAGAATCCATCACCATCAAATGGTCTGTATGGCCAGCAATCATCATCTTTGGAAATGTCACATAATGCTAGTGCAAATAATGCTACCACTTCAACGCAGTACGCTATTGATAGCTTGGATTCGAGTCTCTTTGGTGGCAGTTCGTCGGCAACACCATCTTACGGTGAACTAATGCCAGTTTCGTCGTATCATTATGGAAATGTGCAAGCACAGCCATCACCCTACAACTCCTACCACAACGCCGTTCCATACGTTAACCCAAACAGTTCCACAACTTCCCTTAATACACCACCAAACACCTACAGTTCGTGTGCTGTGCCCTCAATACCACCATACGGAACAGCCACAACAACCGAGGCAACGGGAGTCTATCACAGTGTAAGTAGTTTAACGAGCCTAGAGACAACCACTGCACCCAGCTGCAGTGTAAACGGTACGACATCGCTGGAAAGTGATGCTAGCGCTAGTTTTATAAGTCTATCCACTCCACTGGCCAGTTCGACAAGACTTTCACAAATTAACGATTTACTACAGCACAATCGCAATGCAACCAATGCTCTAGTGGCCGTTACACAAAATATTGATGCCACAGCAAATAGCACATCTTTAACAGGAGTTAGAACTCAAGTGCAAATATCCTCCACATCGTCAAATAGCTCATCGATTAATACACCACCTATAGAAGATGCTCACAAGAGTTGCATGGCTTATGAAGGACTAGCGCCACCTTTAAAACCGGCGCCAATAGTGGCAGTAATGGAAACTGAGAGCAGTAGTGGACCGGCAAAGGAAACAACACTCATAAAATTAGCAGAATCGCTAGAAACAGGCACAAATAAATGTTTATCGAAAGGATTAGAGAAAATAGAAACTCCAACTGAATCTTTAGTAAATACAGATATTTTGCTAGCAACTTCAGAACTAAATAAAAATGATCCAGATGCCACTAGAAGCAATAGCCTAACGGTGAAACCTACGGCGGTAAGTCCAACTATAGAACCTTCAACAATTGAAGTAGCAGCTGGCGATTAA
Protein Sequence
MFTPIKNLITYIQKNASGNSIFGGNSTAEESAKMETENMNIEIETKVESETAIVKAESEILKEKGEKEAMAVDLEEKEASIEEFQDSESKEVQDSESEEAKETETKSNTINTNINSTVTTTVSAENSCDFEDDDIEAQIERSIMLKNPVKVAASKTTEILEVKKSSTTNSSIVISHDYNRNQALTPIDDELEDFRHDPTEIMGTTSSPCSNQPADEFCSVGSAEEVLERDATTADEFTNDEEEEVEMEDNETEEDETKALNDVETDYDEDRDIVEVDDDDHEAFLDLESYKNSNVIVLDEVDLNGSTLSIDTNNTADDPLAIACEEQEAFAADKTAALEELNLLNNSRTSQDEDTRTTADDDLQWLKTDLDDNQPSCSNKVLLKENTTEVTPEKEERNGEGSDSGLGSETSALHITTTSINDTSHLNMTTATPTQTPTQTKTKDGEKLLSTNVLQQTKSPAKESPKPYRSNLKRRLEVDDDVVEGLGALGSTASALSEHSSLSGSIQKKPKRSINFDSVQVFYFPRQQGFSCVPSAGGCTLGMGARHVAFKTLTLAEHAAELRRAHRLQLQEINPRGSSSDDSEESEEDYLSEGSGSDLDGESNGFLQPVSPKQRRALLKAAGIRKIDPSEKAECRNIRNSREVCGCSCRDFCDPETCACSQSGIKCQVDRDMFPCGCSRDACGNTIGRVEFNPTRVRTHYIHTLMRLEMENRQQQNPYSSAVASPMQPTPTSFYQNHLQPQSNYSSGYASPAYNTASELQQQQTSANTYYHPQNPSPSNGLYGQQSSSLEMSHNASANNATTSTQYAIDSLDSSLFGGSSSATPSYGELMPVSSYHYGNVQAQPSPYNSYHNAVPYVNPNSSTTSLNTPPNTYSSCAVPSIPPYGTATTTEATGVYHSVSSLTSLETTTAPSCSVNGTTSLESDASASFISLSTPLASSTRLSQINDLLQHNRNATNALVAVTQNIDATANSTSLTGVRTQVQISSTSSNSSSINTPPIEDAHKSCMAYEGLAPPLKPAPIVAVMETESSSGPAKETTLIKLAESLETGTNKCLSKGLEKIETPTESLVNTDILLATSELNKNDPDATRSNSLTVKPTAVSPTIEPSTIEVAAGD
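
The Coding Sequence and Protein Sequence fields can be checked against each other by translating the former. A minimal sketch follows, assuming the standard genetic code (NCBI translation table 1), which this record does not state. The `translate` helper is a hypothetical name. It strips whitespace first, so a coding sequence wrapped over several lines is handled.

```python
# Standard genetic code (ASSUMED), indexed by base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Raises KeyError on codons with characters other than A, C, G, T.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons and last four codons of the coding sequence above.
print(translate("ATGTTTACACCCATTAAAAAC"))
print(translate("GCTGGCGATTAA"))
```

The two fragments correspond to the start (`MFTPIKN`) and the end (`AGD`) of the protein sequence listed above. Passing the full coding sequence allows a comparison against the whole protein field.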

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-