Basic Information

Insect
Tachina fera
Gene Symbol
CSRNP3
Assembly
GCA_905220375.1
Location
LR999965.1:42317275-42321392[+]

Transcription Factor Domain

TF Family
CSRNP_N
Domain
CSRNP_N domain
PFAM
PF16019
TF Group
Unclassified Structure
Description
This presumed domain is found at the N-terminus of cysteine/serine-rich nuclear proteins. These proteins act as transcriptional activators [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 5 0.21 9.7e+03 -2.4 2.5 64 103 67 93 32 138 0.47
2 5 0.018 8.3e+02 1.1 1.3 78 124 239 285 209 306 0.71
3 5 0.5 2.4e+04 -3.7 0.5 81 109 334 351 312 372 0.42
4 5 4.9e-95 2.3e-90 303.7 7.7 2 218 496 698 495 698 0.92
5 5 0.31 1.5e+04 -3.0 0.2 69 116 946 991 922 995 0.44

Sequence Information

Coding Sequence
ATGTTTTCacccattaaaaattttattacatacatacaaaagaaTTCAGGCAGTAATTCAATATTCAGTGGAAACAGCACAATAGAAGGACCAGCAAAAATGGAAACAGAAGATATAAAAACAGAAGCCATTAACGCAAACATGCTAACTGAAAGTGAGATTAAAGAAGAAATCGAGATAAGAAACGATAATGTTGCAAAAACGGAAGAGGAAGACAACATACCACAATGcgaaaaaagtaaaagtttcaATTCAGACATAAATGATTCTGTAGCAGAAATAACTCATTCTGAACAAAATTCAAATACTTTGTCAACAACATCTATAGTATCCATAGGAAATTCTTCCGAAGTGGAAGATGATGACATCGAAGAACAAATTGAAAGTAGTATTATGTTAAAAACTCCTGTTAAAGTTACAGCCACTAATGGTAGTGAAATTCTAgaattgaaaaaatcgtcaactgCTAATGGCAATATTGTTATTAGCCACGACTTTAACCGCAATAGCGCCATGACTCCAATAGATGATGAGTTGGAAGATTTTAGGGATGACCCCATGGCTATACGGGCAACATCATCTCCTTGTCCTAACGAACCTGCCGATGAGTTTTGTTCAGTGGAAAGTGGGGAAGAAACCTTAGAAAAAGATTTCAAAAGCCTGGATGACCTCTCAGTCGTAGAAGAAGTGGAAGAAGTAGATAAAAGCGAGGCCTTGCATGATTCAAAAGTAATCGTTGATTCAGATCTAGAtgtagaagaagaaaaagatgTTGATCATAATAATCTGACCGATGTTGATGATGAAGAAGAgccatttttagatttaaagagtttCAAAAATAGAAACCTAATCGTTTTAGATGACGTGGACATGAATGCTTCTATGGCTTCCATTGATACAAATAATACATCTGATGATCCCTTAGCAATTGCGTGCGAAGAACGAGAAGCGTTCGCTTCAGACAAGGCTTCTGCTTTAGAAGAGTTAAATTTACTGAACAATTCCCAAACTTCTCAGGATGATGATACCAGAACAACCACGGATGAAGACTTACAGTGGCTAAAAACCGATACAGAGAATAACCTCCCATCATGTAGCAGAACAGATAAGGGTGATGCAATAAAAGATACGCCCGAAAAGGAAGAGCGTAACGGCGAAGGTTCGGATTCCGGTTTGGGCAGTGAAACATCAGGCTTGCATACCACCACAAGTGTAACAGATACTAGCCATTTGAACATAACTGCAGTAACACCAACGCAAACTCATACTCAGATAGGAGCAAAGGACAGTGAAGATATATTAGACTCGAATGAATTACGAGTCACAAAATCGCCACTAAAAAATCCTCCCAAGCCATATCGATCAAATTTAAAACGTCGCTTGGAAGTTGGAGATGAGGCCGTAGAAAGTTTGGCTTCTCTAAGATCGACATTATCGACACCCAACGATCATAGTTCGTTAAATGGCAGCATACAGAAGAAACCAAAGCgttcaataaattttgatacAGTACAAGTTTATTACTTTCCCCGACAACAGGGCTTTGGTTGTGTGCCTTCTGCTGGAGGCTGTACTTTAGGTATGGGAACTAGGCATATAGCCTTCAAAACTTTGACTTTAACCGAACATGCTGCCGAGCTCCGAAGAGCTCACCGCATGCAGTTACAAGAAATTAATCCAAGAGGTAGTTCTAGCGAGGATAGCGAAGAGTCGGAGGAAGATTATTTAAGTGAGGGTAGTGGCTCCGATTTAGACGGAGAATCAAATGGTTTCCTGCAACCTGTGTCGCCCAAACAGCGAAGGGCTTTGCTTAAAGCAGCTGGTATACGTAAAATAGATCCCAGTGAAAAGGCTGAATGTCGAAATATACGGAATAGCCGAGAGGTTTGTGGTTGCACTTGTCGTAATTTTTGTGATCCTGAAACGTGTGCTTGTTCTCAATCAGGCATTAAATGCCAAGTTGATCGTGATATGTTTCCATGTGGCTGCTCCCGGGATGCCTGTGGAAATACAATTGGACGTGTCGAATTTAATCCGACACGTGTGCGTACCCATTACATACACACCTTAATGCGCCTAGAGATGGAGAACCGTCAACAACAAAATCCTTACTCCTCTGCCGTTGTCTCAACAATGCAACAAACAGCTTCACCTTACTACCAAACTCACTTGCAGCCACAATCGAATTATAGTTCAGGTTATGCATCTCCGTCTTACAATGCTGCCTCCGAAATACATCAACAATCTGCTGCCAGTTCATACTATCACGCGCAGAACCCTTCCACATCAAATACTCTATATGGACAACAAACCTCCTTAGAAATAACACATAGTGGCAGTGTGAATTCTACTGTAACATCGCATTACGGTATCGATACTTTAGATACAAGTCTGTTTGGTGCAGGAACTGCAGCATCTTCTTCATATGGAGAACTAATGCCGGTGTCCTCGTATCATTATGGAAATATGCAAGCACAAGCATCACCATATAATATGTATCACAATAATCCCTACATAAACCCAAACAGTAATACTACATCCCCAAGCACATCGACTGCATATAGCTCATGTGCTGTACCTTCTATACCACCCTTCGGCACAGCTACAACTACCGAAGCTACAGGAGTTTATCACAGCGTGAACAGTCTCACCAGCATGGAAACTACAACAGCGCCCTCCTGCAGTGTTAATGGTACTACATCATTAGAAATTGATACCAATGCAAATTTTATAAGTCTGTCAACACCGCTTGCGAGTTCCTCCAGATTATCGCAAATAAATGATTTACTACAACATAATCGCAATGCGACAACAGCCCTAGTGGCCGTTTCACAAAACATTGAAGCCACAACAGAAAACCAATCAGCCTCAGCGACATGCCATACACAAACCATGGGTACCAATTCACAGACATCTGGCACATCTCTACAAGATGTCCACAGAAGCTGTAGTGCGTTTGAAGAACCAGTACCATCTCTAAAACCGTTGCCTCAAGTGACCGTTAAAGAAGATTACTGTAACAGTGGACCAGTTAAACCCATAACACTAACATTACCAGCAGAAACAGGagctttaaataaaagtttagagGAATCATCGTCTCTAATAAATTCTGAATTTTCAGCACAAGTTTCTACTTTGCATGAAGATGACAATGACACCACTACAGTGACGGTAAGTGCAACTATAGAAACTTCAGCTATTGAAGTGGCAGCTGGAAATTAA
Protein Sequence
MFSPIKNFITYIQKNSGSNSIFSGNSTIEGPAKMETEDIKTEAINANMLTESEIKEEIEIRNDNVAKTEEEDNIPQCEKSKSFNSDINDSVAEITHSEQNSNTLSTTSIVSIGNSSEVEDDDIEEQIESSIMLKTPVKVTATNGSEILELKKSSTANGNIVISHDFNRNSAMTPIDDELEDFRDDPMAIRATSSPCPNEPADEFCSVESGEETLEKDFKSLDDLSVVEEVEEVDKSEALHDSKVIVDSDLDVEEEKDVDHNNLTDVDDEEEPFLDLKSFKNRNLIVLDDVDMNASMASIDTNNTSDDPLAIACEEREAFASDKASALEELNLLNNSQTSQDDDTRTTTDEDLQWLKTDTENNLPSCSRTDKGDAIKDTPEKEERNGEGSDSGLGSETSGLHTTTSVTDTSHLNITAVTPTQTHTQIGAKDSEDILDSNELRVTKSPLKNPPKPYRSNLKRRLEVGDEAVESLASLRSTLSTPNDHSSLNGSIQKKPKRSINFDTVQVYYFPRQQGFGCVPSAGGCTLGMGTRHIAFKTLTLTEHAAELRRAHRMQLQEINPRGSSSEDSEESEEDYLSEGSGSDLDGESNGFLQPVSPKQRRALLKAAGIRKIDPSEKAECRNIRNSREVCGCTCRNFCDPETCACSQSGIKCQVDRDMFPCGCSRDACGNTIGRVEFNPTRVRTHYIHTLMRLEMENRQQQNPYSSAVVSTMQQTASPYYQTHLQPQSNYSSGYASPSYNAASEIHQQSAASSYYHAQNPSTSNTLYGQQTSLEITHSGSVNSTVTSHYGIDTLDTSLFGAGTAASSSYGELMPVSSYHYGNMQAQASPYNMYHNNPYINPNSNTTSPSTSTAYSSCAVPSIPPFGTATTTEATGVYHSVNSLTSMETTTAPSCSVNGTTSLEIDTNANFISLSTPLASSSRLSQINDLLQHNRNATTALVAVSQNIEATTENQSASATCHTQTMGTNSQTSGTSLQDVHRSCSAFEEPVPSLKPLPQVTVKEDYCNSGPVKPITLTLPAETGALNKSLEESSSLINSEFSAQVSTLHEDDNDTTTVTVSATIETSAIEVAAGN*

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

90% Identity
iTF_01074468;
80% Identity
-