Carc005353.1
Basic Information
- Insect
- Coenonympha arcania
- Gene Symbol
- NR2E3
- Assembly
- GCA_036785405.1
- Location
- CM072059.1:12434697-12467756[+]
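The location string above packs the sequence accession, start, end, and strand into one field. A minimal sketch of splitting it apart is shown below; the function name `parse_location` and the `Location` type are hypothetical, and the span length assumes 1-based inclusive coordinates, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "CM072059.1:12434697-12467756[+]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'seqid:start-end[strand]' string into its four parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


def span_length(loc: Location) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CM072059.1:12434697-12467756[+]")
print(loc, span_length(loc))
```

The span covers the whole locus, including any introns, so it is expected to be longer than the coding sequence listed further down.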
Transcription Factor Domain
- TF Family
- SF-like
- Domain
- zf-C4|SF-like
- PFAM
- AnimalTFDB
- TF Group
- Zinc-Coordinating Group
- Description
- The ligand binding domain of nuclear receptor steroidogenic factor 1 (SF-1): SF-1, a member of the nuclear hormone receptor superfamily, is an essential regulator of endocrine development and function and is considered a master regulator of reproduction. Most nuclear receptors function as homodimers or heterodimers; SF-1, however, binds to its target genes as a monomer, recognizing variations of the DNA sequence motif T/CCA AGGTCA. SF-1 functions cooperatively with other transcription factors to modulate gene expression. Phospholipids have been identified as potential ligands of SF-1. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, SF-1 has a central, well-conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge, and a C-terminal ligand binding domain (LBD). [1, 8, 3, 11, 6, 5, 12, 10, 9, 2, 4, 7]
- Hmmscan Out
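| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 24 | 9.7e-17 | 4e-13 | 50.4 | 0.1 | 217 | 326 | 254 | 355 | 249 | 358 | 0.90 |
| 2 | 24 | 5.9e-05 | 0.24 | 11.6 | 0.0 | 290 | 326 | 368 | 404 | 359 | 410 | 0.91 |
| 3 | 24 | 6.2e-05 | 0.26 | 11.5 | 0.0 | 290 | 326 | 417 | 453 | 408 | 456 | 0.90 |
| 4 | 24 | 5.4e-05 | 0.22 | 11.7 | 0.0 | 290 | 326 | 466 | 502 | 458 | 514 | 0.90 |
| 5 | 24 | 4.3e-05 | 0.18 | 12.1 | 0.0 | 289 | 326 | 514 | 551 | 505 | 564 | 0.89 |
| 6 | 24 | 5.8e-05 | 0.24 | 11.6 | 0.0 | 290 | 326 | 564 | 600 | 559 | 613 | 0.90 |
| 7 | 24 | 5.8e-05 | 0.24 | 11.6 | 0.0 | 290 | 326 | 613 | 649 | 604 | 657 | 0.91 |
| 8 | 24 | 0.0001 | 0.43 | 10.8 | 0.0 | 290 | 326 | 662 | 698 | 653 | 701 | 0.90 |
| 9 | 24 | 5.2e-05 | 0.21 | 11.8 | 0.0 | 290 | 326 | 711 | 747 | 701 | 760 | 0.90 |
| 10 | 24 | 5.2e-05 | 0.22 | 11.8 | 0.0 | 290 | 326 | 760 | 796 | 752 | 810 | 0.90 |
| 11 | 24 | 5.6e-05 | 0.23 | 11.7 | 0.0 | 290 | 326 | 809 | 845 | 803 | 858 | 0.90 |
| 12 | 24 | 6.1e-05 | 0.25 | 11.6 | 0.0 | 290 | 326 | 858 | 894 | 851 | 900 | 0.91 |
| 13 | 24 | 0.00033 | 1.3 | 9.2 | 0.0 | 290 | 326 | 907 | 943 | 899 | 956 | 0.90 |
| 14 | 24 | 0.00035 | 1.5 | 9.0 | 0.0 | 290 | 326 | 956 | 992 | 950 | 999 | 0.90 |
| 15 | 24 | 5.2e-05 | 0.21 | 11.8 | 0.0 | 289 | 326 | 1004 | 1041 | 995 | 1045 | 0.89 |
| 16 | 24 | 6.4e-05 | 0.26 | 11.5 | 0.0 | 290 | 326 | 1054 | 1090 | 1047 | 1093 | 0.90 |
| 17 | 24 | 0.00075 | 3.1 | 8.0 | 0.0 | 290 | 326 | 1103 | 1139 | 1095 | 1152 | 0.89 |
| 18 | 24 | 0.0099 | 41 | 4.3 | 0.0 | 289 | 326 | 1151 | 1188 | 1143 | 1195 | 0.90 |
| 19 | 24 | 1.3e-05 | 0.055 | 13.7 | 0.0 | 290 | 326 | 1201 | 1237 | 1192 | 1249 | 0.90 |
| 20 | 24 | 0.01 | 41 | 4.3 | 0.0 | 289 | 326 | 1249 | 1286 | 1241 | 1292 | 0.90 |
| 21 | 24 | 1.5e-05 | 0.06 | 13.6 | 0.0 | 290 | 326 | 1299 | 1335 | 1289 | 1341 | 0.91 |
| 22 | 24 | 4.1e-05 | 0.17 | 12.1 | 0.0 | 289 | 326 | 1347 | 1384 | 1338 | 1397 | 0.89 |
| 23 | 24 | 0.01 | 43 | 4.2 | 0.0 | 289 | 326 | 1396 | 1433 | 1389 | 1437 | 0.89 |
| 24 | 24 | 1.3e-18 | 5.2e-15 | 56.6 | 0.0 | 290 | 404 | 1446 | 1560 | 1435 | 1562 | 0.94 |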
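Each domain hit in the hmmscan output above has thirteen whitespace-separated fields. The following is a minimal sketch of parsing such rows and keeping only hits below an independent E-value cutoff. The `DomainHit` class, the helper names, and the `1e-5` cutoff are illustrative assumptions, not part of the record; only four of the 24 rows are embedded as sample data.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DomainHit:
    index: int       # "#"  : hit number
    total: int       # "of" : total number of hits
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float

    @property
    def ali_length(self) -> int:
        """Number of residues covered by the alignment (inclusive)."""
        return self.ali_to - self.ali_from + 1


def parse_hit(line: str) -> DomainHit:
    """Parse one 13-field domain row."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


def significant(hits: List[DomainHit], max_i_evalue: float) -> List[DomainHit]:
    """Keep hits whose independent E-value is below the cutoff."""
    return [h for h in hits if h.i_evalue < max_i_evalue]


# Rows 1, 2, 18 and 24 copied from the record above.
SAMPLE_ROWS = """\
1 24 9.7e-17 4e-13 50.4 0.1 217 326 254 355 249 358 0.90
2 24 5.9e-05 0.24 11.6 0.0 290 326 368 404 359 410 0.91
18 24 0.0099 41 4.3 0.0 289 326 1151 1188 1143 1195 0.90
24 24 1.3e-18 5.2e-15 56.6 0.0 290 404 1446 1560 1435 1562 0.94
"""

hits = [parse_hit(row) for row in SAMPLE_ROWS.splitlines()]
# The 1e-5 cutoff is a hypothetical choice for illustration.
strong = significant(hits, max_i_evalue=1e-5)
for h in strong:
    print(h.index, h.i_evalue, h.ali_length)
```

With this cutoff only the first and last hits in the sample pass; the intermediate hits cover short, tandemly repeated stretches of the protein with much weaker scores.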
Sequence Information
- Coding Sequence
- atgcataggcctcctccagGGGAAGACAACACGTCTCCAATACGATGGTCGCCTCCAAGTTCTGCCGCCCTGGTAGCTGCTGCGCTAGCGCCACCTGCGCTAAGACCCTGGCTCCCGGACCCGCCCATCAAGAAGACCAACCATTTGGGTCTGGTGTGCGTAGTGTGTGGAGACACAAGCTCAGGCAAGCACTACGGCATTCTGGCTTGCAACGGATGCAGCGGTTTCTTCAAAAGATCTGTGAGGAGAAAACTTATATACAGATGCCAGGCCGGTACTGGAAGGTGCGTGGTGGACAAAGCACACCGGAATCAGTGCCAGGCTTGTCGGCTGAAGAAATGTCTGGCTATGGGCATGAACAAGGATGCTGTGCAGAACGAGCGTCAGCCCCGGAACACCGCCACCATAAGACCGGAAGCTCTGCGAGACATGGACCAGGAGAGGGCGCTACGAGAAGCTGCGGTAGCTGTTGGAGTATTTGGGCCCCCAGTGTCCCTCGCGATGGCGCTGTCCCCGGCACGGTACCCGCTGCTGTCGCCGCACtacgcgccgctgccgccgcagccgccgccgccgcagcagccaGACTCCTCGCACGACCACCACGACGGCAGCCACGCGCACTCCGACAGCGATGATGACAGTATTGATGTGACCAATGAAGAAGACTCCACATCGTATTCCCCTCCAGCCGTGTCCAGCTATTCCCAAAGTTGTATACCATATGGACCACTTGGCGTAGAAAGTGCGGCGGAAACTGCAGCCCGCTTGCTGTTCATGGCGGTGAAGTGGGCCAAAAACTTACCTTCCTTCGCCAGCCTTGCCTTTCGGGATCAGGTGATTCTCTTGGAAGAAGCGTGGTCGGAACTGTTTCTGCTGAACGCGATACAGTGGTGCGCGCCGCTGGACGCCGCGTGCGCAGCGCTCTTCGGCACCGAGCAGACCGATCAAGagaATGGCGCGTCGTCGGTGCTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACATTGTAGAGAATGGCACGTCTTCGGAGTTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACATTGTAGAGAATGGCACGTCTTCGGAGTTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACATTGTAGAGAATGGCACGTCTTCGGAGTTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACACTGTAGAGAATGGCACGTCTTCGGAGTTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACATTGTAGAGAATGGCACGTCTTCGGAGTTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACATTGTAGAGAATGGCACGTCTTCGGAGTTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACATTGTAGAGAATGGCACGTCTTCG
GAGTTGCGCAGGCTCCGCTCCATGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACATTGTAGAGAATGGCACGTCTTCGGAGTTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACATTGTAGAGAATGGCACGTCTTCGGAGTTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACATTGTAGAGAATGGCACGTCTTCGGAGTTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTTGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAATATTGTAGAGAATGGCACGTCTTCGGAGTTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACATTGTAGAGAATGGCACGTCTTCGGAGTTACGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGGCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACATTGTAGAGAATGGCACGTCTTCGGAGTTACGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGGCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATTCACTATCCATCAACATTGTAGAGAATGGCACGTCTTCGGAGCTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATTGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACATTGTAGAGAATGGCACGTCTTCGGAGTTACGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACATTGTAGAGAATGGCACGTCTTCGGAGTTACGCAGGCTCCGCTCCGTGGTGTCTGGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACACTGTAGAGAATGGCACGTCTTCGGAGTTGCGCAGGCTCCGCTCCGTGGTGTCACGCTACCGCAGCGTGCTGGTGGATCCCGCGGGATTCACTTGCATGAAGGCCGTCACGCTGTTCAAGTGCAGTAACTATCCAATATCCATCAACACTGTAGAGAAAGGCGCGTCGTCGGTGCTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACACTGTAGAGAATGGCACGTCTTCGGAGTTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGGATTCACTTGCATGAAGGCCGTCACGCTGTTCAAGTGCAGTAACTATCCAATATCCATCAACACTGTAGAGAAAGGCGCGTCGTCGGTGCTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCT
GTTCAAGTGCGGTAACTATCCACTATCCATCAACACTGTAGAGAATGGCACGTCTTCGGAGTTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGGTAACTATCCACTATCCATCAACACTGTAGAGAATGGCACGTCTTCGGAGTTGCGCAGGCTCCGCTCCGTGGTGTCGCGCTACCGCAGCGTGCTGGTGGATCCCGCGGGATTCACTTGCATGAAGGCCGTCACGCTGTTCAAGTGTAGTAACTATCCAATATCCATCAACACTGTAGAGAAAGGCGCGTCGTCGGTGCTGCGCAGGCTCCGCTCCGTGGTGTCCCGCTACCGCAGCGTGCTGGTGGATCCCGCGGAGTTCGCCTGCATGAAGGCCATCGCGCTGTTCAAGTGCGAAACCCGCGGCCTCAAGGAGCCGCTGCAGATCGAGAACCTGCAGGACCAGGCGCAGGTGATGCTGATGTCGCACGCGCGCGCGGCGCAcggcgcggcgcccgcgcgctTCGGCcgcctgctgctgctgctgccgctgctgcGCCTCGTGCCGCCGCACCAGCTCGAGCGAGAGTTCTTCTCCAAGACCATCGGACACACGCCCATGGAGAAGGTCTTGGCTGACATGTACAAGAATTGA
- Protein Sequence
- MHRPPPGEDNTSPIRWSPPSSAALVAAALAPPALRPWLPDPPIKKTNHLGLVCVVCGDTSSGKHYGILACNGCSGFFKRSVRRKLIYRCQAGTGRCVVDKAHRNQCQACRLKKCLAMGMNKDAVQNERQPRNTATIRPEALRDMDQERALREAAVAVGVFGPPVSLAMALSPARYPLLSPHYAPLPPQPPPPQQPDSSHDHHDGSHAHSDSDDDSIDVTNEEDSTSYSPPAVSSYSQSCIPYGPLGVESAAETAARLLFMAVKWAKNLPSFASLAFRDQVILLEEAWSELFLLNAIQWCAPLDAACAALFGTEQTDQENGASSVLRRLRSVVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINIVENGTSSELRRLRSVVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINIVENGTSSELRRLRSVVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINIVENGTSSELRRLRSVVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINTVENGTSSELRRLRSVVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINIVENGTSSELRRLRSVVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINIVENGTSSELRRLRSVVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINIVENGTSSELRRLRSMVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINIVENGTSSELRRLRSVVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINIVENGTSSELRRLRSVVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINIVENGTSSELRRLRSVVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINIVENGTSSELRRLRSVVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINIVENGTSSELRRLRSVVSRYRSVLVDPAEFGCMKAIALFKCGNYPLSINIVENGTSSELRRLRSVVSRYRSVLVDPAEFGCMKAIALFKCGNYSLSINIVENGTSSELRRLRSVVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINIVENGTSSELRRLRSVVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINIVENGTSSELRRLRSVVSGYRSVLVDPAEFACMKAIALFKCGNYPLSINTVENGTSSELRRLRSVVSRYRSVLVDPAGFTCMKAVTLFKCSNYPISINTVEKGASSVLRRLRSVVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINTVENGTSSELRRLRSVVSRYRSVLVDPAGFTCMKAVTLFKCSNYPISINTVEKGASSVLRRLRSVVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINTVENGTSSELRRLRSVVSRYRSVLVDPAEFACMKAIALFKCGNYPLSINTVENGTSSELRRLRSVVSRYRSVLVDPAGFTCMKAVTLFKCSNYPISINTVEKGASSVLRRLRSVVSRYRSVLVDPAEFACMKAIALFKCETRGLKEPLQIENLQDQAQVMLMSHARAAHGAAPARFGRLLLLLPLLRLVPPHQLEREFFSKTIGHTPMEKVLADMYKN
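The protein sequence above should correspond to a translation of the coding sequence. As a rough consistency check, the sketch below translates short excerpts copied from the start and end of the coding sequence, assuming the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The `translate` helper is hypothetical and handles only unambiguous A/C/G/T codons; anything else becomes `X`.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Mixed-case input is accepted; a trailing partial codon is ignored.
    """
    seq = cds.upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 and last 18 nucleotides of the coding sequence in this record.
cds_start = "atgcataggcctcctccagGGGAAGACAACACGTCTCCA"
cds_end = "GACATGTACAAGAATTGA"

print(translate(cds_start))
print(translate(cds_end))
```

Running the same function over the full coding sequence, with its line breaks removed, and comparing the result to the listed protein would be the complete version of this check.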
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00353121
- 90% Identity
- iTF_00353121
- 80% Identity
- iTF_00353121