Basic Information

Insect
Eumaeus atala
Gene Symbol
CSRNP3
Assembly
GCA_017140195.1
Location
JAOYMN010049248.1:44827-47010[-]
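A minimal sketch of how the Location string above could be split into its parts. It assumes the layout is scaffold:start-end[strand] with 1-based inclusive coordinates; the record itself does not state the coordinate convention, and `parse_location` is a hypothetical helper name.

```python
import re

# Assumed layout: <scaffold>:<start>-<end>[<strand>], 1-based inclusive coordinates.
LOCATION_RE = re.compile(
    r"^(?P<scaffold>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)

def parse_location(text):
    """Split a location string into scaffold, start, end, strand, and span length."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "scaffold": match["scaffold"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        "length": end - start + 1,  # inclusive span, under the 1-based assumption
    }

loc = parse_location("JAOYMN010049248.1:44827-47010[-]")
print(loc)
```

Under that assumption the locus spans 2184 bases on the minus strand of the scaffold.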

Transcription Factor Domain

TF Family
CSRNP_N
Domain
CSRNP_N domain
PFAM
PF16019
TF Group
Unclassified Structure
Description
This presumed domain is found at the N-terminus of cysteine/serine-rich nuclear proteins. These proteins act as transcriptional activators [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  5   0.38      8.9e+03   -3.3   1.8   75        109     51        86      16        107     0.48
2  5   0.31      7.1e+03   -3.0   2.1   56        107     155       180     117       202     0.46
3  5   2.7e-101  6.3e-97   324.1  9.5   2         218     208       416     207       416     0.94
4  5   0.29      6.6e+03   -2.9   0.1   43        77      555       590     548       635     0.61
5  5   0.19      4.4e+03   -2.3   0.4   75        100     662       687     616       715     0.48
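Of the five domain hits reported above, only hit 3 has a small i-Evalue (6.3e-97); the other four have i-Evalues in the thousands and negative scores. The sketch below shows one way to parse the rows and keep only the significant hit. The field names and the cutoff `MAX_I_EVALUE = 1e-5` are illustrative assumptions, not values taken from this record.

```python
# Column names are hypothetical labels for the 13 fields of each row.
FIELDS = ["index", "of", "c_evalue", "i_evalue", "score", "bias",
          "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc"]
INT_FIELDS = {"index", "of", "hmm_from", "hmm_to",
              "ali_from", "ali_to", "env_from", "env_to"}

ROWS = """\
1 5 0.38 8.9e+03 -3.3 1.8 75 109 51 86 16 107 0.48
2 5 0.31 7.1e+03 -3.0 2.1 56 107 155 180 117 202 0.46
3 5 2.7e-101 6.3e-97 324.1 9.5 2 218 208 416 207 416 0.94
4 5 0.29 6.6e+03 -2.9 0.1 43 77 555 590 548 635 0.61
5 5 0.19 4.4e+03 -2.3 0.4 75 100 662 687 616 715 0.48
"""

def parse_rows(text):
    """Turn whitespace-separated domain rows into a list of dicts."""
    hits = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != len(FIELDS):
            continue  # skip blank or malformed lines
        hits.append({
            name: int(value) if name in INT_FIELDS else float(value)
            for name, value in zip(FIELDS, parts)
        })
    return hits

MAX_I_EVALUE = 1e-5  # assumed cutoff, chosen only for illustration

hits = parse_rows(ROWS)
significant = [h for h in hits if h["i_evalue"] <= MAX_I_EVALUE]
for h in significant:
    print(f"domain {h['index']}: protein residues {h['ali_from']}-{h['ali_to']}, "
          f"score {h['score']}, i-Evalue {h['i_evalue']:.1e}")
```

With this cutoff, the CSRNP_N domain is placed at protein residues 208 to 416 (alignment coordinates), covering positions 2 to 218 of the profile.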

Sequence Information

Coding Sequence
ATGATGGAAAGCTTTCCATGTGACTGGGGAGATGCTAATGAGCTTATTTATACCAGTGATGAATCTTCTGTAACAATGGAAAATAACAGAGAAGAAACTTGTTCACCTGTGATAAAGAACGAGAAAAGTGACAGTGATGCTAATGGTGATAACTTTAGTTATAATCAAAGTCATGATCAAATTTATGTAAACGATAACGTTTCTCAGTTTTATGAGAGTAGTGATAGTAAAATAGATTATGCTGTGAAATTAGATGATAATGATACTTTAAGTGATATCGAGGACATTAAAACAGACACTGCTTCAGCTGACAGTGAGGATTCGGCACTTGGAAGTTTACCCCCTGACACTGTTACCCTACATGATAGAGAAGAAGATATGAACGATAGATCAGATGGGAGTGACTCTGGAACCGAAGCAACAAAATTTATAAAATTAACCTATTTTGAAAATACTTCTAAGGCAGAAGATAAAAAGCCACTTGATAAGGAAGAAAAACAAGATGAATTCAAAAGCAACGAAGTCGAAAATAAGCCAAGTGAAGAGATGAAAAGACCCCTAAGAAGCAGCCTAAAAAGAAAGTGTGATGACGATAACAACTGCGAACATCCGCAGAAAAGGAGAAAAGGAAACATTAAATTTGACAATGTTACGGTATACTATTTTCTAAGATCTCAGGGATTTTCATGTGTACCATCACAAGGCGGCTCTACATTAGGAATGGAAAGGGAGCATAGCCACTCTCAAAAATTTACACTCACCGAACATGCTCTGGAACAAAGAAGATTACACAGGCAAATATTACAACAACTCAAGAGTGAGCGACATTCCGGTCAAGGTGAGTCACTATCTTCAAGCGAAGAAAGCGACACCGAAGAAGAGACGAGTGATATTTCAGAATCAGAGCTAGATTTAGATAGCTACTATTTTTTACAACCTGTACCTACTAGACAAAGAAGAGCCCTTTTACGGACGGCAGGTGTTAGAAAAATTGAAGCTTACGAGAAAGACGAATGTAGAGACATAAGAACATCTCGTGAATTTTGTGGTTGTGCTTGTAAAGGTGTATGCAATCCAGAAAACTGTTCTTGTAGTTTGGCGGGCATAAAATGCCAAGTGGACAGACTGAACTTTCCATGTGGTTGTACGAGAGATGGATGTGGTAACACAACTGGTAGAATAGAATTTAATCCTTTAAGAGTAAGAACTCACTTCATCAATACTTTAATGAGAATAAGTTTAGAAAAAAAAAATGAAGAACACCAAGAAGCGAAAAGACAGTGGGCTGAAGCACACGATACAAATGCTATAGCTGCTTGTATGCCAAATATAAGTATGCCATATGACAAAACAGTATGCCATAATCAGGACGCTAATACTTTAAGAGACACCAATACTGGCTTGCCTGTTAAAAGTTGTAATAACAACTCAAATACCTATAATAAGATTCATTGTAATCTACACAATTCTAACATTTCTGGTAGTGTCCATAACGATGCCTCGTTTACTTTTCAAAGAAGGGACCAAAACGTTATAAATTATGACAGTAAATCAGATAACCCATTGCATGATCCGTACAATTCTAATATTCAGGGAAAAGGTCCACCTTACCCTTCTACTAGTACAGCCGAATATAGCACATTACCCAATAATATACAACGGTATCAATACGATCTGAACAATTATATGTATGAGCAACACTCAGAGAATCACCAGTATAAAGGAATACAATCATTTTCGGCTTCTAGCTTTGAAGAATTCGCACACAATTCGCAAATATCTATGTTTAATCACTACGGTAATATGTATACGCCATCTACCTACCATCAGAAACAAGACAGCAACATCCCAGAGCACCCTGCACAATACCCTATGCACCAAAATCATTATGGCGTATATAGAAATAATTTAGAATGTTATAGCAGTGATAATAAAATTGACTCTCAGTATACCACATTAATGGCAATGCCGTATCAAAATAATAAAATGCAAGCCTCAGA
AAGTGACGAAAATTGGTTTTGTCAAAATAAGCTGTTAAATTTAGACAGCATAGACCAAGAGGCACAAGACACGCCTCATGCCCAAACTGAAATAGTCCGAGAGTCCACTGAAGAAAATGTAAATGAAACTGAAAACTTAGGAGAACTTATCAAAAAAACTATGGTAGAGTCTGTCTCTGTGTAG
Protein Sequence
MMESFPCDWGDANELIYTSDESSVTMENNREETCSPVIKNEKSDSDANGDNFSYNQSHDQIYVNDNVSQFYESSDSKIDYAVKLDDNDTLSDIEDIKTDTASADSEDSALGSLPPDTVTLHDREEDMNDRSDGSDSGTEATKFIKLTYFENTSKAEDKKPLDKEEKQDEFKSNEVENKPSEEMKRPLRSSLKRKCDDDNNCEHPQKRRKGNIKFDNVTVYYFLRSQGFSCVPSQGGSTLGMEREHSHSQKFTLTEHALEQRRLHRQILQQLKSERHSGQGESLSSSEESDTEEETSDISESELDLDSYYFLQPVPTRQRRALLRTAGVRKIEAYEKDECRDIRTSREFCGCACKGVCNPENCSCSLAGIKCQVDRLNFPCGCTRDGCGNTTGRIEFNPLRVRTHFINTLMRISLEKKNEEHQEAKRQWAEAHDTNAIAACMPNISMPYDKTVCHNQDANTLRDTNTGLPVKSCNNNSNTYNKIHCNLHNSNISGSVHNDASFTFQRRDQNVINYDSKSDNPLHDPYNSNIQGKGPPYPSTSTAEYSTLPNNIQRYQYDLNNYMYEQHSENHQYKGIQSFSASSFEEFAHNSQISMFNHYGNMYTPSTYHQKQDSNIPEHPAQYPMHQNHYGVYRNNLECYSSDNKIDSQYTTLMAMPYQNNKMQASESDENWFCQNKLLNLDSIDQEAQDTPHAQTEIVRESTEENVNETENLGELIKKTMVESVSV
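The Coding Sequence and Protein Sequence fields can be checked against each other by translating the former. The sketch below assumes the standard genetic code applies and uses only the first 30 and last 18 bases of the coding sequence for brevity; `translate` is a hypothetical helper, not part of the database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_prefix = "ATGATGGAAAGCTTTCCATGTGACTGGGGA"  # first 30 bases of the record
cds_suffix = "GAGTCTGTCTCTGTGTAG"              # last 18 bases, ending in a stop codon

print(translate(cds_prefix))
print(translate(cds_suffix))
```

The two fragments translate to the start and end of the listed protein sequence. Passing the full coding sequence, with the line break removed, would extend the same check to the whole record.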

Similar Transcription Factors

Clustering by sequence similarity, computed with MMseqs2

100% Identity
iTF_00261875
90% Identity
-
80% Identity
-