Adep007360.1
Basic Information
- Insect
- Aradus depressus
- Gene Symbol
- -
- Assembly
- GCA_963662175.1
- Location
- OY759214.1:103589094-103592168[+]
Transcription Factor Domain
- TF Family
- NCU-G1
- Domain
- NCU-G1 domain
- PFAM
- PF15065
- TF Group
- Unclassified Structure
- Description
- NCU-G1 is a set of highly conserved nuclear proteins rich in proline with a molecular weight of approximately 44 kDa. Especially high levels are detected in human prostate, liver and kidney. NCU-G1 is a dual-function family capable of functioning as a transcription factor as well as a nuclear receptor co-activator by stimulating the transcriptional activity of peroxisome proliferator-activated receptor-alpha (PPAR-alpha) [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 20 0.0066 1.1e+02 3.0 0.0 142 168 43 69 18 100 0.86 2 20 0.043 7.1e+02 0.4 0.0 142 166 93 117 75 120 0.89 3 20 0.044 7.2e+02 0.3 0.0 142 167 143 168 140 177 0.89 4 20 0.029 4.8e+02 0.9 0.0 142 167 193 218 190 239 0.88 5 20 0.029 4.8e+02 0.9 0.0 142 167 243 268 240 289 0.88 6 20 0.0095 1.6e+02 2.5 0.0 142 168 293 319 290 340 0.87 7 20 0.059 9.8e+02 -0.1 0.0 144 167 345 368 323 389 0.88 8 20 0.0089 1.5e+02 2.6 0.0 142 168 393 419 379 439 0.89 9 20 0.041 6.7e+02 0.4 0.0 142 167 443 468 440 484 0.89 10 20 0.035 5.8e+02 0.6 0.0 142 167 493 518 479 539 0.89 11 20 0.039 6.4e+02 0.5 0.0 142 166 543 567 540 569 0.87 12 20 0.0095 1.6e+02 2.5 0.0 142 168 593 619 588 639 0.88 13 20 0.01 1.7e+02 2.4 0.0 142 168 643 669 640 687 0.89 14 20 0.029 4.8e+02 0.9 0.0 142 167 693 718 690 739 0.88 15 20 0.031 5.2e+02 0.8 0.0 142 167 743 768 724 772 0.90 16 20 0.029 4.8e+02 0.9 0.0 142 167 793 818 790 839 0.88 17 20 0.0095 1.6e+02 2.5 0.0 142 168 843 869 840 890 0.87 18 20 0.028 4.6e+02 1.0 0.0 142 167 893 918 890 950 0.80 19 20 0.27 4.5e+03 -2.3 0.0 147 165 948 966 922 969 0.84 20 20 0.039 6.4e+02 0.5 0.0 142 166 993 1017 990 1019 0.87
Sequence Information
- Coding Sequence
- ATGCCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACAAACAACTGAACAGGGAGGGGTGGGGAGCTCGAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGCATGCCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACACACGACTGAAAGGGGAGGGGTGGGGAGCCTCAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGTATACCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACAAACAACTGAACAGGGAGGGGTGGGGAGCCTCAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGCATACCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACAAACAACTGAACAGGGAGGGGTGGGGAGCCTCAAGAAGGCTTagtgtttatttgaaaataccTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGCATGCCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACAAACAACTGAACAGGGAGGGGTGGGGAGCCTCAAGAAGGCTTAGTGTTCATTTGAAAATACCCGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGCATGCCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACAAACAACTGAACAGGGAGGGGTGGGGAGCTCGAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGCATGCCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACACACGACTGAAAGGGGAGGGGTGGGGAGCCTCAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGCCTCAGGGTATTGTCCAGAGCATGCCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACAAACAACTGAACAGGGAGGGGTGGGGAGCTCGAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAAGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGCATGCCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACACACGACTGAAAGGGGAGGGGTGGGGAGCCTCAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGCATACCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACAAACAACTGAACAGGGAGGGGTGGGGAGCCTCAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGCATACCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACAAACAACTGAACAGGGAGGGGTGGGGAGCCTCAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGCATGCCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACAAACAACTGAACAGGGAGGGGTGGGGAGCTCGAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGCATGCCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACACACGACTGAAAGGGGAGGGGTGGGGAGCTCGAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGCATGCCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACACACGACTGAAAGGGGAGGGGTGGGGAGCCTCAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGCATGCCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACAAACAACTGAACAGGGAGGGGTGGGGAGCCTCAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGCATGCCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACAAACAACTGAACAGGGAGGGGTGGGGAGCTCGAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGCATGCCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACAAACAACTGAACAGGGAGGGGTGGGGAGCTCGAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGCATGCCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACACACGACTGAAAGGGGAGGGGTGGGGAGCCTCAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGTATACCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACAAACAGCTGAACAGGGAGGGGTGGGGAGCCTCAAGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGTTATTGTCCAGAGCATGCCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACAAACAACTGATGAGGGAGGGGTGGGGAGCATCACGAAGGCTTAGTGTTCATTTGAAAATACCTGGTAAGAGGGACTATCTAGAGAAGTCTCAGGGTATTGTCCAGAGCATGCCTAACAACTATCACTCCGCAGAAAGCTCTCGGGTGGACAAACAACTGAACAGGGAGGGGTGGGGAGCCTGA
- Protein Sequence
- MPNNYHSAESSRVDKQLNREGWGARRRLSVHLKIPGKRDYLEKSQGIVQSMPNNYHSAESSRVDTRLKGEGWGASRRLSVHLKIPGKRDYLEKSQGIVQSIPNNYHSAESSRVDKQLNREGWGASRRLSVHLKIPGKRDYLEKSQGIVQSIPNNYHSAESSRVDKQLNREGWGASRRLSVYLKIPGKRDYLEKSQGIVQSMPNNYHSAESSRVDKQLNREGWGASRRLSVHLKIPGKRDYLEKSQGIVQSMPNNYHSAESSRVDKQLNREGWGARRRLSVHLKIPGKRDYLEKSQGIVQSMPNNYHSAESSRVDTRLKGEGWGASRRLSVHLKIPGKRDYLEKPQGIVQSMPNNYHSAESSRVDKQLNREGWGARRRLSVHLKIPGKKDYLEKSQGIVQSMPNNYHSAESSRVDTRLKGEGWGASRRLSVHLKIPGKRDYLEKSQGIVQSIPNNYHSAESSRVDKQLNREGWGASRRLSVHLKIPGKRDYLEKSQGIVQSIPNNYHSAESSRVDKQLNREGWGASRRLSVHLKIPGKRDYLEKSQGIVQSMPNNYHSAESSRVDKQLNREGWGARRRLSVHLKIPGKRDYLEKSQGIVQSMPNNYHSAESSRVDTRLKGEGWGARRRLSVHLKIPGKRDYLEKSQGIVQSMPNNYHSAESSRVDTRLKGEGWGASRRLSVHLKIPGKRDYLEKSQGIVQSMPNNYHSAESSRVDKQLNREGWGASRRLSVHLKIPGKRDYLEKSQGIVQSMPNNYHSAESSRVDKQLNREGWGARRRLSVHLKIPGKRDYLEKSQGIVQSMPNNYHSAESSRVDKQLNREGWGARRRLSVHLKIPGKRDYLEKSQGIVQSMPNNYHSAESSRVDTRLKGEGWGASRRLSVHLKIPGKRDYLEKSQGIVQSIPNNYHSAESSRVDKQLNREGWGASRRLSVHLKIPGKRDYLEKSQVIVQSMPNNYHSAESSRVDKQLMREGWGASRRLSVHLKIPGKRDYLEKSQGIVQSMPNNYHSAESSRVDKQLNREGWGA
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -