Acir015676.1
Basic Information
- Insect
- Athalia circularis
- Gene Symbol
- Hr39
- Assembly
- GCA_963978495.1
- Location
- OZ021700.1:24128992-24134326[+]
Transcription Factor Domain
- TF Family
- SF-like
- Domain
- zf-C4|SF-like
- PFAM
- AnimalTFDB
- TF Group
- Zinc-Coordinating Group
- Description
- The ligand binding domain of nuclear receptor steroidogenic factor 1 (SF-1): SF-1, a member of the nuclear hormone receptor superfamily, is an essential regulator of endocrine development and function and is considered a master regulator of reproduction. Most nuclear receptors function as homodimer or heterodimers, however SF-1 binds to its target genes as a monomer, recognizing the variations of the DNA sequence motif, T/CCA AGGTCA. SF-1 functions cooperatively with other transcription factors to modulate gene expression. Phospholipids have been determined as potential ligands of SF-1. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, SF-1 has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD). [1, 8, 3, 11, 6, 5, 12, 10, 9, 2, 4, 7]
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 4 5 2.5e+04 -13.3 25.7 63 164 209 311 183 396 0.52 2 4 5 2.5e+04 -5.7 6.5 85 132 342 398 314 442 0.40 3 4 6.2e-08 0.00031 20.7 0.0 1 21 509 529 509 533 0.92 4 4 1.9e-68 9.3e-65 220.1 7.5 191 408 632 839 551 839 0.89
Sequence Information
- Coding Sequence
- ATGTCGGAGCAGAACGGGCCCCCGGAGGGCCCAGGCTGGTGGCCCCCGGCTCAATCCTGGAAGTCGTCATCGGCCCCTGCAAGACCAGAGGCGGGAAGAAACGTATCTGTGACTACCATCAACGTTCCACCAGCCCACGAAATGCATGACGGTAAAAATGTGAAGTATTTGGGTGGTCGGAACGGTGTGACGGTGTCGGTCGTGTCCGGATGCGCTACCGGAAATGAACCGGAGAACGACGGAGCGGATAGCGACGGTGAAGTAAGCAAGATCGATTTCCGGGGAGTAAATCTCCgcacgaaaaagaaaagggacgCGCCAGAAGGACAGGAAAGCGACGCTAGCACCATCACCGAGGCGATGCTGCAACAGCCGGAACGCCCCATGTCCTGGGAGGGTGAACTTTCCGATCAAGAAATGTCCTCCAATACAATCACCAATCAAGACCATGAAGAAACATCAATGGAAGGTGTTCAAGTCTGCAGCGCAAGCCCAGGACCAACACATGCACCACAGGAGCCTAAATTCCCTATAAAACTCGAGCCAGAATTTCGCTCGAGTCCTGGGCCACCTTTAGGTACCCCTAACGAGACAATACTATCCAATTCTCAAcaacaccaacaacaacatcaacaccaccaacagcaacaacaacaacatcaacatcaacatcaacatcaTCAACAGCAACAGCTCAATATTGACGGACTTGAGCAAACTCAACAGAGTGATTTACCATTACTTGTTGGGAAACTGCTTGGAGGATATAACAATGCAACTCCTAGCCACAGTCCAGTCCTTAATCCAAGACATCATTTGACTAAACATAGTCACACAAGATCACAGGTCCCATCGCCAGATTCGGCGATACATTCAGCATACAGCGTATTTAGCTCCCCGACACAAAGTCCTCATGCCGCTCGTCACTCGGCGCTCGGAGCTGGAAGTCCTGTGCCTTCTTCATCTTTGTCACTTTCCCGCCACAGTTTCAACAATTCTACATCCTCACTATCTTTGTCTCTCTCGCATTCGCTTTCGCGAAATAACTCTGATGCATCCAGCAGCTGCTATAGTTACGGATCTCTCAGCCCACCTACACATTCGCCTGTGCAACAACCGAGACACTCCCACTCTCATACTTCACATCCACATCATCAAGTAGCACCACAAAGCAGTCCTCTACATTTGCCAGTATCGCATCATTATTCAGCTGCAAATTCGGAACTTTCCGAAGGGTTGAGTGAGGATCAGGAAGACTGTAAAATAGCACCAGCTACGGCAGGCATTTCAACTAGGCAACAGTTAATCAATAGTCCCTGCCCGATCTGCGGTGATAAAATTAGTGGGTTTCACTACGGTATATTCTCTTGCGAATCTTGTAAGGGATTCTTCAAAAGGACAGTGCAGAATCGAAAGAATTATGTATGCCTTAGAGGAGCGGGATGTTCGGTTACTGTGGCTAcccgaaaaaaatgtccagcTTGTCGATTTGACAAATGTTTAAATATGGGAATGAAGTTGGAAGCAATTAGGGAAGATCGGACCCGAGGTGGAAGAAGTACATATCAGTGTAGTTACACGCTCCCAGCAAACTTGGTGGGTTGTACTTCCGGTGGATTACCTGGGGATAAAATGGTCGGAGGAACATGCAGTCCTGCACCACCTGGGTATGAACATCACCACTCGGCAAGGCACCACTCAAATCACTCGCATAAACTGCACGTGGTGCCGCAGTTGTTGCAAGAAATTATGGATGTGGAACATTTGTGGCATTATAATGACAATGATCGAACAACAAGTGCGCAAAGTAGTAGCAGCAGTAGCAAATCAGAGCAGCCCAGGCCCAGTTCTGTAGGGCCACAAAACACCGAACATGATCCGCACTCACCCAGCAATGCTAATACAAATGCTAATCCATCACAGCATCCAGACTTTTTGTCAAACCTGTGCAATATTGCAGATCATAGGCTTTATAAAATTGTTAAGTGGTGCAAGAGTTTGCCTCTCTTTAAAAACATTTCAATCGATGATCAAATTTGTCTCCTTATCAATTCATGGTGTGagttgttgttattttcatgCTGCTTCAGGAGCGTGAATACACCAGGAGAGATTAGAGTCTCTCTTGGGAAATCAATCACACTCGAACAAGCCAGACAGCTGGGTCTCGCTAGCTGCATTGAAAGAATGCTTGCTTTTACCAACAACCTTAGACGATTGCGGGTGGACCAATACGAGTACGTCGCAATGAAGGTCATTGTCCTGCTGACATCTGACACGAGCGAGTTGAAAGAGCCAGAAAAGGTTCGAGCATCACAGGAGAAAGCATTGCAGGCGTTGCAACAGTACACCATTGCCATGTATCCGGAAATGCCAGCCAAGTTTGGTGAACTGCTCCTTCGAATCCCAGATCTGCAGCGAACTTGTCAAGCAGGGAAGGAGCTGCTCAGTGCAAAACGAGCAGAAGGAGAGGGTAGTTCCTTCAACCTATTGATGGAACTGCTACGAGGGGACCACTGA
- Protein Sequence
- MSEQNGPPEGPGWWPPAQSWKSSSAPARPEAGRNVSVTTINVPPAHEMHDGKNVKYLGGRNGVTVSVVSGCATGNEPENDGADSDGEVSKIDFRGVNLRTKKKRDAPEGQESDASTITEAMLQQPERPMSWEGELSDQEMSSNTITNQDHEETSMEGVQVCSASPGPTHAPQEPKFPIKLEPEFRSSPGPPLGTPNETILSNSQQHQQQHQHHQQQQQQHQHQHQHHQQQQLNIDGLEQTQQSDLPLLVGKLLGGYNNATPSHSPVLNPRHHLTKHSHTRSQVPSPDSAIHSAYSVFSSPTQSPHAARHSALGAGSPVPSSSLSLSRHSFNNSTSSLSLSLSHSLSRNNSDASSSCYSYGSLSPPTHSPVQQPRHSHSHTSHPHHQVAPQSSPLHLPVSHHYSAANSELSEGLSEDQEDCKIAPATAGISTRQQLINSPCPICGDKISGFHYGIFSCESCKGFFKRTVQNRKNYVCLRGAGCSVTVATRKKCPACRFDKCLNMGMKLEAIREDRTRGGRSTYQCSYTLPANLVGCTSGGLPGDKMVGGTCSPAPPGYEHHHSARHHSNHSHKLHVVPQLLQEIMDVEHLWHYNDNDRTTSAQSSSSSSKSEQPRPSSVGPQNTEHDPHSPSNANTNANPSQHPDFLSNLCNIADHRLYKIVKWCKSLPLFKNISIDDQICLLINSWCELLLFSCCFRSVNTPGEIRVSLGKSITLEQARQLGLASCIERMLAFTNNLRRLRVDQYEYVAMKVIVLLTSDTSELKEPEKVRASQEKALQALQQYTIAMYPEMPAKFGELLLRIPDLQRTCQAGKELLSAKRAEGEGSSFNLLMELLRGDH
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00174392; iTF_00174205; iTF_01413217; iTF_01413449; iTF_01413205; iTF_01048258; iTF_01048296; iTF_01048945; iTF_01049011; iTF_01048248; iTF_00718597; iTF_00718702; iTF_00718609; iTF_00719458; iTF_00719447; iTF_00719518; iTF_00175142; iTF_00175057; iTF_00175068; iTF_00175815; iTF_00175785; iTF_00175773; iTF_01303397; iTF_01303537; iTF_01303385; iTF_00060424; iTF_00060240; iTF_00060252; iTF_01412425; iTF_01412363; iTF_01411536; iTF_01411549; iTF_01411615; iTF_01412351; iTF_01414217; iTF_01414296; iTF_01414204; iTF_00159147; iTF_00159190; iTF_00159135; iTF_01200178; iTF_01200282; iTF_01200190; iTF_00938835; iTF_00938729; iTF_00939590; iTF_00939693; iTF_00939578;
- 90% Identity
- iTF_00938835;
- 80% Identity
- iTF_00174392; iTF_00174205;