Basic Information

Gene Symbol
-
Assembly
GCA_958496245.1
Location
OY292445.1:608388-620665[+]

Transcription Factor Domain

TF Family
CSRNP_N
Domain
CSRNP_N domain
PFAM
PF16019
TF Group
Unclassified Structure
Description
This presumed domain is found at the N-terminus of cysteine/serine-rich nuclear proteins. These proteins act as transcriptional activators [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 10 1.9 2.3e+04 -4.6 3.3 78 86 558 566 500 631 0.44
2 10 0.023 2.6e+02 1.7 0.0 23 108 723 808 712 820 0.54
3 10 0.00039 4.6 7.5 0.0 27 95 929 997 907 1015 0.68
4 10 0.00039 4.6 7.5 0.0 27 95 1047 1115 1025 1133 0.68
5 10 0.00036 4.2 7.6 0.0 27 96 1165 1234 1143 1254 0.68
6 10 6.6e-05 0.78 10.0 0.0 11 95 1267 1351 1259 1369 0.72
7 10 0.00017 2 8.7 0.0 27 96 1401 1470 1379 1490 0.74
8 10 6.7e-05 0.79 10.0 0.0 11 95 1503 1587 1495 1605 0.72
9 10 0.0071 84 3.3 0.1 27 94 1637 1704 1615 1720 0.70
10 10 0.083 9.8e+02 -0.1 0.1 27 78 1755 1806 1746 1821 0.57

Sequence Information

Coding Sequence
ATGTACAAAGCAGTGATTCTCCTACTCATAGCTGCATCATCAGCTAATCAGCTGCAGAACGGCTGCCCTTCCGACTGGCAGGTACACCGTCTTCTGCCGCACGAGACAGACTGTGCCAAGTTCTACACGTGCAGCCATGGACAGAAGATCCTGATGCAGTGTGCCCCTGGGACGCTCTTCGACGCTAATTTACAGaCTTGCAACTGGCCATCGTTAGTGGCATGCTACGCCACGCAATCCACGCCAGGCGAGCCTCCAACACAACACACCACGACACACCCCGCCACGGCACACCCGCCCACGACACAACCGCCTGTGGCTTGTCCGAATAATGTTGTGGAACATCAGCTATTGCCGCATGAGACAAACTGTTCAAAGTTCTATCAGTGCAGTAATAGAGTGTTGGTGCTAAAGGAATGCCCGGATGGATTGCATTTTGATATAAACTTTCAGATTTGCAATTGGCCGTTTATGGCGACTTGTGGTACAGGAACTACCACGGTTCAACCTTCGTCAATACAACCGTCCACAATACAACCACCCATGACACAACCGCCCACGACACAgCCAACGACACAATCGCCCGCGACACACCCTCCCACGACACACCTGCCCACGACAGAAGTAGAGTTTCTGCCGAACGGTTGTCCTAAAGATCACCACATCCACCATCTattgcctcatgaagactgctCTAAGTACTACCAGTGCAACTTCGGCGGTTCCACGACGCACGAACCTATTGTGACAGAACCTACAACATGGGAACCCACAACTAATGAGCCAACCACTAAAGAATCTGCTTCCACTTATGAGCCAACAACAGAAGAGCCCGCTACAACGTATGAGCCAACAACTGAAGTTCCCGTAGCTACATATGGGCCTACAACAGAAGAGCCCGCTACAACGTATGAGCCAACAACTGAAGTGCCCGTAAATGAGCCAAGTGAATCAACAACAAACGAGCCTGCAACTTATGAGCCAACAACTACAGAGCCATCGACTACATGGCAACCAACAACTAGAGAGCCTTCGACCACTTGGGAACCAACATCTAGAGAGCCTTCGACCACTTGGGAACCAACAACAAAGGAGCCTCCAACCACTTGGCAACCAACAACAAAAAAGCCTGCTACCACTTGGGAACCAACAACAAAAGAGCCTTCTACCATTTGGGAACCAACAACAAAAGTGCCTTCATCAACTTGGGAACCAACGACTAAAGAGCCTTCAACCACTTGGGAACCAACAACTAAAGAACCTCTAACCACTTGGAAACCAACAACAAAAGAGCCTTCAACCACTTGGGAACCTACAACTAAAGAACCTCTAACCACTTGGGAACCAACAACAAAAAAGCCTCTAACCACTTGGGAACCAACAACCAAAGAGCCTTCAATTACTTGGAAACCACCGACAAAAGAGCCTCCAACCACTTGGGAACCAACAACAAAAGAGCCTCCAACCACTTGGGAACCAACAACAAAAGAGCCTTTAACTACGCCAGCTCAAGACACCACTACGCTAGCTCAAGACACCACTACGCTAGCTCAAGACACCACTACGCTAGCTCAAGACACCACTACGCTAGCTCAAGACACCACTACGCTAGATCAAGACACCACTACGCTAGCTCAAGACACCACTACGCTAGCTCAAGACACCACTACACTAGCTCAAGACACCACTACGCTAGCTCAAGACACCACTACACTAGCTCAAGACACCACTACGCTAGCTCAAGACACCACTACGCTAGCTCAAGATACCACTACGCTAGCTCAAGACACCACTACACTAGCTCAAGACACCACTACGCTAGCTCAAGACACCACTACGCTAGCTCAAGACGCCACTACACTAGCTCAAGATGCCACTACGCTAGCTCAAGACAATACTGAAGGGAATTTACCGACCGACTTAACAACCATAACTGATTTGTCAACAACTCCAAGTGGCTTAACAACCACAGAAATCTTGTCGACAGTGACATACTCGACGTCAGCGGTGGAGTCGACGACAACTGTTGCGCCACTTTGCCCTGTTGGTACTATATTGTTAGTGCCTAACCCAGAGAGATGTGATGCGTATTACATGTGTACGGTTGCTATGGCCGTACCAATGTACTGTGAGGAGGGTTATGAATTTGACCTTGACGAACAACAATGCGTGGTTATAGCCGAAGGAGGCTGCACTCTTGGATCAAGCACCGATATCAATCTTCCACCAACTACCATCTCGCCAACAACTTACACAACTGAAAAGCTCACTGAAGTAACCGAAGAAGAATACACAATAACCACTGAACACAAAGACAAATTCTCCACTGAGTACACAACTGAGGATGGAATCAAAACGACTGAAACGACACTAAATCCAATAGAAACAACTATAATCACTGAACAGCAGACAGCAGATTCGGAATCACTGATCACGGTGGATCCAATAATAGTTACGGAAGCATCCACTCCACCAGAAGCCACTTCAACGGAAATAGAAACACCTACCGTAACAGAAACACCAACACTTACTGAAACACCAGTTACCACAGATAAACCTATAACAACGACAGAGGCAGCAGAATTAACGACACAAAACTTAGCGACGACGACAGAAGTAGACTCAACTACGCCAGCTAAATTATGCCCTCTTGGTGTAGTTGGTAATGTTGCGAACTCGGAGCGATGTGACGCCTATTACATGTGTGCTGGAGGTATGGCAATACCCTTTTATTGTGCTACAGGGTTCGAATTTGACCCAGAACTCGAAACCTGTGTAGAAATCGCTGAAGGAGGTTGCACGCTTGGTAGCAGCACAAACGCTGAAATACAATCAACTATCGTAAACAATTTAGAAACAACAACTGCCAATGCTGCTACTACAAATAGAGTTGAATCAACTCAAAGTGCCGAATCGACTACTTCAGAGAAAATCGAATTGACGACTGAGGATGACGCAACGACGACAGAAATTGAATCGACTACGCAAACGCAGTTATGTCCTCTTGGTGTGATGGGTAATGTTGCGAACTCGGAGCGATGTGACGCCTATTACATGTGTGCTGGAGGTATGGCAATACCCTTTTATTGTGCTACAGGGTTCGAATTTGACCCAGAACTCGAAACCTGTGTAGAAATCGCTGAAGGAGGTTGCACGCTTGGTAGCAGCACAAACGCTGAAATACAATCAACTATCGTAAACAATTTAGAAACAACAACTGCCAATGCTGCTACTACAAATAGAGTTGAATCAACTCAAAGTGCCGAATCGACTACTTCAGAGAAAATCGAATTGACGACTGAGGATGACGCAACGACGACAGAAATTGAATCGACTACGCAAACGCAGTTATGTCCTCTTGGTGTGATGGGTAATGTTGCGAACTCGGAGCGATGTGACGCCTATTACATGTGTGCTGGAGGTATGGCAATACCCTTTTATTGTGCTACAGGGTTCGAATTTGACCCAGAACTCGAAACCTGTGTAGAAATCGCTGAAGGAGGTTGCACGCTTGGTAGCAGCACAAACGCTGAAATACAATCAACTATCGTAAACAATTTAGAAACAACAACTGCCAATGCTGCTACTACAAATAGAGTTGAATCAACTCAAAGTGCCGAATCGACTACTTCAGAGAAAATCGAATTGACGACTGAGGATGACGCAACGACGACAGAAATTGAATCGACTACGCAAACGCAGTTATGTCCTCTTGGTGTGACGGGTAATGTTGCGAACTCGGAGCGATGTGACGCCTATTACATGTGTGCTGGAGGTATGGCAATACCCTTTTATTGTGCTACAGGGTTCGAATTTGACCCAGAACTCGGAACCTGTGCCGAAATCGCTGGAGGAGGTTGCACGCTTGGTAGCAGCACAAATGCTGAAATACAATCAACTACCGTAAACAATTTAGAAACAACAACCGCCAATGCTGCCACTACAAATAGAGTTGAATCAACTCAAAGTGCCGAATCGACTACTTCAGAGAAAGTCCAATTGACGACTGAGGATGACGCAACGACGACAGAAATTGAATCGACTACGCAAACGCAGTTATGTCCTCTTGGTGTGATGGGTAATGTTGCGAACTCGGAGCGATGTGACGCCTATTACATGTGTGCTGGAGGTATGGCAATACCCTTTTATTGTGCTACAGGGTTCGAATTTGACCCAGAACTCGAAACCTGTGTAGAAATCGCTGAAGGAGGTTGCACGCTTGGTAGCAGCACAAACGCTGAAATACAATCAACTACAGTAAACAATTTAGAAACAACAACCGCCAATGCTGCCACTACAAATAGAGTTGAATCAACTCAAAGTGCCGAATGGACTACTTCAGAGAAAATCGAATTGACGACTGAGGATGACGCAACGACGACAGAAATTGAATCGACTACGCAAACGCAGTTATGTCCTCTTGGTGTGACGGGTAATGTTGCGAACTCGGAGCGATGTGACGCCTATTACATGTGTGCTGGAGGTATGGCAATACCCTTTTATTGTGCTACAGGGTTCGAATTTGACCCAGAACTCGGAACCTGTGCCGAAATCGCTGGAGGAGGTTGCACGCTTGGTAGCAGCACAAATGCTGAAATACAATCAACTACCGTAAACAATTTAGAAACAACAACCGCCAATGCTGCCACTACAAATAGAGTTGAATCAACTCAAAGTGCCGAATCGACTACTTCAGAGAAAGTCCAATTGACGACTGAGGATGACGCAACGACGACAGAAATTGAATCGACTACGCAAACGCAGTTATGTCCTCTTGGTGTGATGGGTAATATTGCGAACTCGGAGCGATGTGACGCCTATTACATGTGTGCTGGAGGTATGGCAATACCCTTTTATTGTGCTACAGGGTTCGAATTTGACCCAGAACTCGAAACCTGTGTAGAAATTGCTGAAGGCGGTTGCGCGCTTAGTAGCAGCACAAATGCTGAAATACAATCAAGTACCGTAAACAATTTAGAAACAACAACCGCTAATGCTGCCACTACAAATAGAGTTGAATCAACTCAAAGTGCCGAATCGACTACTTCAGAGAAAATCGAATTGACGACTGAGGATGACGCAACGACGACAGAAATTGAATCGACTACGCAAACGAAGTTATGTCCTCTTGGTGTGATGGGTAATGTTGCGAACTCGGAACGATGTGACGCCTATTACATGTGTGCTGGAGGTATGGCAATACCCTTTTATTGTGCATCAGGATTTGAATTCGACCCAGATCTAAAAACCTGTGTGGAAATCACGGAAGGAGGTTGCACTATTGGTAGTAATACTATCACCACCATAATACCAACTACTGCTGGACGTGAAACGACTACCGAAAATACTAAAACAACAGAAGCATCAAAAACGACAGAAAAACTAGAATTAACTATAGAAAACGAAGCAACAACGAAACAAGAAGTAGAAACAACTACCGCAGCTCAAATCTGTCCTGTAGGCGTAATGGGCAACGTGGCTAGCCCAGACAGTTGTTCAACATACTTTATGTGTGGAGGAGGGGCCCCTATTCCTCTATTCTGTGAAAGTGGATTCGAATTCGACCCGATGTCAAAGcaatgCGTAGCGATAGTTGAAGGTGGATGCAGTCTCGCCCAGAGCCAAGCCACTGCCGAACCTACACTCGGGTCAGTTCCCGAACACATTACCACTGTGTTGGGACTGTGTGAAGGAGACGGAAGTATGGCTGACCCAAATCAATGCGACTCTTTCTACGTGTGTGCGAATGGGAAAGCAATAAGGATGCATTGCGATGTCGGTTATGAATACAGTACTGAAGCTAAGAAATGCAAACCCTCCTCCGAAGGAGGCTGCACGGTAACAAGATACTGA
Protein Sequence
MYKAVILLLIAASSANQLQNGCPSDWQVHRLLPHETDCAKFYTCSHGQKILMQCAPGTLFDANLQTCNWPSLVACYATQSTPGEPPTQHTTTHPATAHPPTTQPPVACPNNVVEHQLLPHETNCSKFYQCSNRVLVLKECPDGLHFDINFQICNWPFMATCGTGTTTVQPSSIQPSTIQPPMTQPPTTQPTTQSPATHPPTTHLPTTEVEFLPNGCPKDHHIHHLLPHEDCSKYYQCNFGGSTTHEPIVTEPTTWEPTTNEPTTKESASTYEPTTEEPATTYEPTTEVPVATYGPTTEEPATTYEPTTEVPVNEPSESTTNEPATYEPTTTEPSTTWQPTTREPSTTWEPTSREPSTTWEPTTKEPPTTWQPTTKKPATTWEPTTKEPSTIWEPTTKVPSSTWEPTTKEPSTTWEPTTKEPLTTWKPTTKEPSTTWEPTTKEPLTTWEPTTKKPLTTWEPTTKEPSITWKPPTKEPPTTWEPTTKEPPTTWEPTTKEPLTTPAQDTTTLAQDTTTLAQDTTTLAQDTTTLAQDTTTLDQDTTTLAQDTTTLAQDTTTLAQDTTTLAQDTTTLAQDTTTLAQDTTTLAQDTTTLAQDTTTLAQDTTTLAQDTTTLAQDATTLAQDATTLAQDNTEGNLPTDLTTITDLSTTPSGLTTTEILSTVTYSTSAVESTTTVAPLCPVGTILLVPNPERCDAYYMCTVAMAVPMYCEEGYEFDLDEQQCVVIAEGGCTLGSSTDINLPPTTISPTTYTTEKLTEVTEEEYTITTEHKDKFSTEYTTEDGIKTTETTLNPIETTIITEQQTADSESLITVDPIIVTEASTPPEATSTEIETPTVTETPTLTETPVTTDKPITTTEAAELTTQNLATTTEVDSTTPAKLCPLGVVGNVANSERCDAYYMCAGGMAIPFYCATGFEFDPELETCVEIAEGGCTLGSSTNAEIQSTIVNNLETTTANAATTNRVESTQSAESTTSEKIELTTEDDATTTEIESTTQTQLCPLGVMGNVANSERCDAYYMCAGGMAIPFYCATGFEFDPELETCVEIAEGGCTLGSSTNAEIQSTIVNNLETTTANAATTNRVESTQSAESTTSEKIELTTEDDATTTEIESTTQTQLCPLGVMGNVANSERCDAYYMCAGGMAIPFYCATGFEFDPELETCVEIAEGGCTLGSSTNAEIQSTIVNNLETTTANAATTNRVESTQSAESTTSEKIELTTEDDATTTEIESTTQTQLCPLGVTGNVANSERCDAYYMCAGGMAIPFYCATGFEFDPELGTCAEIAGGGCTLGSSTNAEIQSTTVNNLETTTANAATTNRVESTQSAESTTSEKVQLTTEDDATTTEIESTTQTQLCPLGVMGNVANSERCDAYYMCAGGMAIPFYCATGFEFDPELETCVEIAEGGCTLGSSTNAEIQSTTVNNLETTTANAATTNRVESTQSAEWTTSEKIELTTEDDATTTEIESTTQTQLCPLGVTGNVANSERCDAYYMCAGGMAIPFYCATGFEFDPELGTCAEIAGGGCTLGSSTNAEIQSTTVNNLETTTANAATTNRVESTQSAESTTSEKVQLTTEDDATTTEIESTTQTQLCPLGVMGNIANSERCDAYYMCAGGMAIPFYCATGFEFDPELETCVEIAEGGCALSSSTNAEIQSSTVNNLETTTANAATTNRVESTQSAESTTSEKIELTTEDDATTTEIESTTQTKLCPLGVMGNVANSERCDAYYMCAGGMAIPFYCASGFEFDPDLKTCVEITEGGCTIGSNTITTIIPTTAGRETTTENTKTTEASKTTEKLELTIENEATTKQEVETTTAAQICPVGVMGNVASPDSCSTYFMCGGGAPIPLFCESGFEFDPMSKQCVAIVEGGCSLAQSQATAEPTLGSVPEHITTVLGLCEGDGSMADPNQCDSFYVCANGKAIRMHCDVGYEYSTEAKKCKPSSEGGCTVTRY

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-