Basic Information

Gene Symbol
-
Assembly
GCA_004329405.1
Location
SJPC01003121.1:937-4586[-]

Transcription Factor Domain

TF Family
CSRNP_N
Domain
CSRNP_N domain
PFAM
PF16019
TF Group
Unclassified Structure
Description
This presumed domain is found at the N-terminus of cysteine/serine-rich nuclear proteins. These proteins act as transcriptional activators [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  5   0.13      4.4e+03   -0.8   4.1   53        104     86        137     66        181     0.52
2  5   0.22      7.6e+03   -1.5   0.2   64        102     212       250     187       256     0.71
3  5   0.56      1.9e+04   -2.9   1.2   77        104     271       301     235       377     0.64
4  5   0.53      1.8e+04   -2.8   3.9   74        136     410       473     365       488     0.48
5  5   6.1e-13   2.1e-08   36.3   0.2   1         28      706       733     706       741     0.94
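
The per-domain rows above can be read programmatically. The following is a minimal sketch, not the database's own pipeline: it assumes the thirteen whitespace-separated columns appear in the order given in the header, and it uses an illustrative i-Evalue cutoff of 1e-5 that this record does not state. The names `DomainHit`, `parse_row`, and `I_EVALUE_CUTOFF` are hypothetical.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One per-domain row of the hmmscan output (hypothetical container)."""
    index: int       # "#": domain number
    total: int       # "of": number of domains reported for this model
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float     # bit score
    bias: float      # bias composition correction
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float       # mean posterior probability of aligned residues


# Rows copied verbatim from the record above.
ROWS = """\
1 5 0.13 4.4e+03 -0.8 4.1 53 104 86 137 66 181 0.52
2 5 0.22 7.6e+03 -1.5 0.2 64 102 212 250 187 256 0.71
3 5 0.56 1.9e+04 -2.9 1.2 77 104 271 301 235 377 0.64
4 5 0.53 1.8e+04 -2.8 3.9 74 136 410 473 365 488 0.48
5 5 6.1e-13 2.1e-08 36.3 0.2 1 28 706 733 706 741 0.94
"""

# Assumed cutoff for illustration only; the record does not state one.
I_EVALUE_CUTOFF = 1e-5


def parse_row(line: str) -> DomainHit:
    """Convert one whitespace-separated row into a DomainHit."""
    f = line.split()
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *map(int, f[6:12]),
        float(f[12]),
    )


hits = [parse_row(line) for line in ROWS.splitlines() if line.strip()]
significant = [h for h in hits if h.i_evalue < I_EVALUE_CUTOFF]

for h in significant:
    print(f"domain {h.index}/{h.total}: residues {h.ali_from}-{h.ali_to}, "
          f"i-Evalue {h.i_evalue:g}, score {h.score}")
```

Under that assumed cutoff, only the fifth domain passes. It aligns protein residues 706 to 733 to positions 1 to 28 of the CSRNP_N model.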

Sequence Information

Coding Sequence
ATGGAATCGCCTTCGGCCCGTCAGGACTGTCTCGTTGAGGCCCAAGAGGATTCCACCTCAGTGGAGCCTGAACCGGAAAGCGTATCATCTCCAAGCaagacgatgacgacgacaacgacgccAGCATCAGAAGCTGAAGATGACGCGCGAGATTCTCGTGCGGACTTGGTGTGCGAGAGACTGGATGAAGGATCGGAAAATCAAGGATCTCGGGAAGGACAATTAAGTGATTATCCTTGCCTCCGGAAATTAGAGGAGGAGAAAGACATGGACCGAAAGTTGGAAAAGGAAACCAGTGAAAAcggaaaaaatttagatattgattttaacgTGGAATCGACGACAGATTCTCTTCAGGAATCCGAGGAGTCTTCGAAGAAGTCCGATTCTCAAAATTGTAATGTCGAGGATGAGGAGGATAAGATCGAAAAGAAGAGGTCGATCGGGAGTGACTTTCTGCCAATCGGTGCTCAGGAAATCAGGAGAGTTGGCGTCGAGATCTCGGAGGAGGAGGCGACCCAAATGAGAAACGACGTCAGAAGGTTGTCGCCGGTATTAGTGAGCTTGCGAGAGAGAACCTTAGGCGAAGTTTTCTTGTCTTCTCAGTCCTGTCTCTTCGGGGACGAGGTTGACACTGGCAAGGCAGTGTCGCGAAGTAATGCCATCTCGCAAGAATTATTCGCGAAGGACGATTCTAGAATTCAGGATGCCCATCAGAGTGAAAAGCGGGATGTCAACGAGGAGGAAATGATCGTCAAGATCGACGCAGGTCTTCCAGATTGCGCAGACAGGAAACTGAATTATAAAGAAGAGGTGTTTAGCAGCGACGTAGCTTCTTCGAGTCCCTCTAGGCTCGAGAATTCAGATGAGACGAGTCAAAATGATGTGGACGTGACTGCGAATATAAAGAGCGCATCCTGCACGCTGATCGAGAAGAAAACGACATTGAGTCCTTCACCATCGAGTCCCGTTCGTCGACATTTAGCGATAGTGCTGAACCGCGTCAATATGGGAGACGCAAACATCGACAAACTGACAAAAGAAAATGACAACGAGAGCATATGGAACATCGACGAAAGTTCTCCATGTcctaaaagaagaagaaattccgTCGAAGAAGAGTGCGATTCGTCTTCGGAGATTTCGAAAATCAATGAGAGAAGCCCGAAGAAGCAACGGTTGCAAAGTAACTCCTCCCCGACGACAAGATCGAAGAAGTGCGAACTGATAGAGAATAATCAAAGTCCTGATCGCAATCTGATACTAAAAAAGTGCAAAGTGGTGCTGGAGCGAATCATGAAGAGCGAGATGGAATCATCTCAAGAGGAGGACTCCCAAATTGAGGAACTCGAAGCGTTGAAAGAATCCGAAAGAGAAGAACTCGGACCCGTGGCGGTGATCGGCAAGCTGGAGGAGGACGAGGAGGTGGTGCTGACGAGCTCATCGTCCTCGGGTGAGAATAGCCGGGCGTTTAACGAACTGGATTCATTGGAGACGACGCCGAGTTCGCCGGAAGAAGGTGAGACGACGCCGGAAGAGACATCGGTCGACGTCGGCGCCGAAGGTGTCGACACCGAGACGGAAACTGAGACCGGCTCCGACGTCAGCTCGGAGGTGTCGCCCATGACAAACATCCGGGAACTACGCGACATGGCGTCCGATCATCAGCTACCCTGCCCCGAAGAAGGACCGCTGTGCTGCGTTGAGGTGATGGCGCCGATCATGACGAGATTGGAGGCTGAAGGGCCGGAAGCTTATACGGAAGACTCCGCCGAGAGTCTGACATTAGCTACCGGTGCTAGAGACGAAGTTAGATCCGATGGGAGCGATTCCGGTCTAGGGAATGAGATTCCTGGTGATCCTGGACCGGCGCCCGCACCGGAAAGCGACTCGGAGACCTCTTTTCTTGACAGACTGCCGGACGATATCCTCTCTGACAAGGAGAAAGgCGTGAATCAACTAGATGGCTTCGCGCCGTCCTCGGGCACGCCTGAGACTCC
GGGCCAGGCACCTCTGACGAGTTTCCGGGCACTTCCGGCTAAAAGCAATTTGAAGCGCAGGCTAACCGACTGCATGGAGGGCGATGAGTCGCGGAGCAACCCCGATGAGCCCGTGAAAAAAAAGCGCAACATCCAGTTCGATGCAGTGACAGTCTATTACTTCCCCAGGACGCAAGGCTTCACCTGCGTACCTTCCCAGGTAAGTCCTAAAActgctctttctttctctctctctctctcttttttaacttttaagtAG
Protein Sequence
MESPSARQDCLVEAQEDSTSVEPEPESVSSPSKTMTTTTTPASEAEDDARDSRADLVCERLDEGSENQGSREGQLSDYPCLRKLEEEKDMDRKLEKETSENGKNLDIDFNVESTTDSLQESEESSKKSDSQNCNVEDEEDKIEKKRSIGSDFLPIGAQEIRRVGVEISEEEATQMRNDVRRLSPVLVSLRERTLGEVFLSSQSCLFGDEVDTGKAVSRSNAISQELFAKDDSRIQDAHQSEKRDVNEEEMIVKIDAGLPDCADRKLNYKEEVFSSDVASSSPSRLENSDETSQNDVDVTANIKSASCTLIEKKTTLSPSPSSPVRRHLAIVLNRVNMGDANIDKLTKENDNESIWNIDESSPCPKRRRNSVEEECDSSSEISKINERSPKKQRLQSNSSPTTRSKKCELIENNQSPDRNLILKKCKVVLERIMKSEMESSQEEDSQIEELEALKESEREELGPVAVIGKLEEDEEVVLTSSSSSGENSRAFNELDSLETTPSSPEEGETTPEETSVDVGAEGVDTETETETGSDVSSEVSPMTNIRELRDMASDHQLPCPEEGPLCCVEVMAPIMTRLEAEGPEAYTEDSAESLTLATGARDEVRSDGSDSGLGNEIPGDPGPAPAPESDSETSFLDRLPDDILSDKEKGVNQLDGFAPSSGTPETPGQAPLTSFRALPAKSNLKRRLTDCMEGDESRSNPDEPVKKKRNIQFDAVTVYYFPRTQGFTCVPSQVSPKTALSFSLSLSFLTFK

Similar Transcription Factors

Sequences clustered by similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-