Adis033571.1
Basic Information
- Insect
- Anastatus disparis
- Gene Symbol
- Hr39
- Assembly
- GCA_017163975.1
- Location
- JAFFSR010000221.1:8754137-8762816[+]
Transcription Factor Domain
- TF Family
- SF-like
- Domain
- zf-C4|SF-like
- PFAM
- AnimalTFDB
- TF Group
- Zinc-Coordinating Group
- Description
- The ligand binding domain of nuclear receptor steroidogenic factor 1 (SF-1): SF-1, a member of the nuclear hormone receptor superfamily, is an essential regulator of endocrine development and function and is considered a master regulator of reproduction. Most nuclear receptors function as homodimer or heterodimers, however SF-1 binds to its target genes as a monomer, recognizing the variations of the DNA sequence motif, T/CCA AGGTCA. SF-1 functions cooperatively with other transcription factors to modulate gene expression. Phospholipids have been determined as potential ligands of SF-1. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, SF-1 has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD). [1, 8, 3, 11, 6, 5, 12, 10, 9, 2, 4, 7]
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 3 3 1.1e+05 -6.5 10.9 51 147 182 276 115 304 0.55 2 3 0.7 2.7e+04 -3.2 3.6 72 132 292 362 276 378 0.50 3 3 7.3e-80 2.8e-75 256.9 1.3 1 408 470 861 470 861 0.79
Sequence Information
- Coding Sequence
- ATGGCAAGTGGCAAACATGGTAAAAGCATATGCAAATATCTTGGCGGTCAAAATGGTGTTACAGTATCAGTAGTATCAAGTTGTGGTAGTACGCAAAATGCAAGCAATTCAAATGTAACACAAGTAGTGGATTCAGGATCGAATGAATCAGCTGCAAATGTTTCAGTTGATGATGCTGATGGTGATAGTGATGGTGAAGTAAGTCGTATAGATTTTCGTGGCGTTAACTTAAGaactaaaaagaaaagagaTGGTAGTATAGGACCTCacgataataatgaaaattcatgTGATGCTATTTATCAACAACCTGAAAGACCAATGTCTTGGGAAGGTGAATTATCTGATCAGGAAATGTCTTCCAATACTATTACAAATCagGATCATGAAGAAACATCGATGGAAGGTGTTCAAATATGCAGTTCGAGTCCTGGTCCACAAGAACAAAAGTTTCCAATTAAACCTGAACCAGATTTTCGCTCAAGTCCTGGGCTTCAAAGCCTTGGCAGTTCACTTAATGAAACTATTGTTGCTGTTCCACATGTTCAACAATTGCATCATCAGCATCAACAAGcgcaacaacagcagcaacaaaATATCGAGCAAAATCAACAAAGTGATTTACCGTTACTCGTTGGCAAATTACTTGGCGGTTGTAATAGTTCAACGCCGAATCATAGTCCTGTACTCATTCCTCGGCATCATTTGACCAAACACAGCCACACTAGATCTCAAgTCCCATCACCAGATTCAGCGATCCATTCGGCGTACAGCGTATTTAGTTCGCCAACTCAATCACCACATGCTGGTCGTCATTCAGCTCTTGGTCCAGGTTCACCAGTACCATCGTCTTCACTTTCACTTTCGCGACATAGTTTTAACAATTCAACCTCATCATTATCACTATCTTTGTCACACTCATTATCGCGTAATAATTCAGATGCCTCAAGCAGTTGTTATAGTTATGGTTCTCTTAGTCCACCTACGCATTCGCCAGTTCAACAGCGTTATGCTCATCATCATCAACAGCAACATCAAGTTCAATCAAATCCATTACATTTACAACAAGGGCATCATTATAGTGGAACTACAGGTGTTGTTAATGATGTAAGTGAAACACATTTGGATGATCAAGATGATTGTAGAATACCTTCTGCACCATCGGGCATATCGACAAGGCAACAATTAATTAACAGTCCTTGTCCGATTTGTGGAGACAAAATTAGCGGATTTCATTATGGAATATTTTCATGTGAATCCTGTAAGgGCTTCTTTAAAAGGACAGTgcaaaatcgaaaaaattacGTGTGTCTTAGAGGTGCAGGTTGTCCGGTAACAGTGGCTACGCGCAAAAAATGTCCTGCTTGTCGCTTTGATAAATGCCTAAATATGGGTATGAAACTCGAAGCGATAAGAGAAGATAGAACTCGTGGTGGTCGTAGCACATATCAGTGCACTTATACTTTACCTGCAAATCTATTAGGTGGTTCATCTGCACAAGGCGGCTTACCTGGTCAAGATAAAATGAGTCAAGGTCCATGCAGTCCAGCGCCGCCAGGTTCCGAACATCATCATTCAATGAAACATCATCATTCTAATCATTCCCATAAAATGCAATTAGTACCACAGTTACTTCAAGATATTATGGATGTAGAACATCTTTGGCATTATAATGACAATGATAGAGTTGGAGCCAGTCAGGCAATGGGTTCGCTCAATCTTGCTCagTCCTTACCCAATTCCGGAATGGATAATTCTCCAAGTGGTAATAGTAGTAGCGGAGACTCATTGAGTGGTTTAGGTACATCAAATGGTCTCACTAGCAGTTCCAGCGCAACTAACAATGGCAATTTAAATTCTCGCTCAGATTTATCACGCCTGACCTCTTCCAATTCATCGACGCCCACTCCAATACATGAACAGCAACATTCTCCATCATCTAACAGTGCGGCAAGCTCAAGTAACTCGGCAAATGGCAATCCAACACAACATCCTGATTTCTtatcaaatttatgtaatatagcTGACCACCGActatataaaattgttaaatggTGCAAAAGCTTaccactttttaaaaatatatccatTGACGATCAAATATGTTTGCTCATCAATTCATGGTGCGAACTACTACTTTTTTCGTGCTGTTTTCGAAGCATGAGTACACCTGGAGAGATCAGAGTTTCATTAGGAAAAAGTATTACATTAGAACAGGCCAAACAGTTGGGTTTAGCTACTTGCATTGAAAGAATGTTAGCGTTTACTAATAATCTTAGGCGTCTCAGAGTCGATCAATATGAATATGTTGCCATGaagGTGATTGTGCTGTTAACATCTGACACGAGCGAATTGAAAGAACCAGAAAAGGTTCGCGCATCTCAAGAGAAGGCATTGCAAGCATTACAGCAATACACAATCGCAAGGTATCCGGAAATGCCTGCCAAATTTGGTGAATTGCTACTTAGAATTCCTGATCTTCAACGTACTTGCCAAGCAGGCAAAGAGCTATTAAGCGCTAAACGCGCTGAAGGTGAAGGTTGCTCTTTTAATTTACTCATGGAGCTACTTCGAGGGGACCATTAG
- Protein Sequence
- MASGKHGKSICKYLGGQNGVTVSVVSSCGSTQNASNSNVTQVVDSGSNESAANVSVDDADGDSDGEVSRIDFRGVNLRTKKKRDGSIGPHDNNENSCDAIYQQPERPMSWEGELSDQEMSSNTITNQDHEETSMEGVQICSSSPGPQEQKFPIKPEPDFRSSPGLQSLGSSLNETIVAVPHVQQLHHQHQQAQQQQQQNIEQNQQSDLPLLVGKLLGGCNSSTPNHSPVLIPRHHLTKHSHTRSQVPSPDSAIHSAYSVFSSPTQSPHAGRHSALGPGSPVPSSSLSLSRHSFNNSTSSLSLSLSHSLSRNNSDASSSCYSYGSLSPPTHSPVQQRYAHHHQQQHQVQSNPLHLQQGHHYSGTTGVVNDVSETHLDDQDDCRIPSAPSGISTRQQLINSPCPICGDKISGFHYGIFSCESCKGFFKRTVQNRKNYVCLRGAGCPVTVATRKKCPACRFDKCLNMGMKLEAIREDRTRGGRSTYQCTYTLPANLLGGSSAQGGLPGQDKMSQGPCSPAPPGSEHHHSMKHHHSNHSHKMQLVPQLLQDIMDVEHLWHYNDNDRVGASQAMGSLNLAQSLPNSGMDNSPSGNSSSGDSLSGLGTSNGLTSSSSATNNGNLNSRSDLSRLTSSNSSTPTPIHEQQHSPSSNSAASSSNSANGNPTQHPDFLSNLCNIADHRLYKIVKWCKSLPLFKNISIDDQICLLINSWCELLLFSCCFRSMSTPGEIRVSLGKSITLEQAKQLGLATCIERMLAFTNNLRRLRVDQYEYVAMKVIVLLTSDTSELKEPEKVRASQEKALQALQQYTIARYPEMPAKFGELLLRIPDLQRTCQAGKELLSAKRAEGEGCSFNLLMELLRGDH*
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00080147; iTF_01274502; iTF_01274568; iTF_01274509; iTF_00143937; iTF_00143912; iTF_00143901; iTF_00286370; iTF_00286419; iTF_00286361; iTF_01035495; iTF_01035445; iTF_01035454; iTF_01037031; iTF_01037041; iTF_01037088; iTF_01487048; iTF_01487054; iTF_01487099; iTF_01036233; iTF_01036241; iTF_01036284; iTF_00690215; iTF_00690250; iTF_00692939; iTF_00692947; iTF_00692991; iTF_00690208; iTF_00326441; iTF_00326412; iTF_00326421; iTF_00691119; iTF_00691124; iTF_00691156; iTF_00734610; iTF_00734668; iTF_01110995; iTF_01111051; iTF_01469751; iTF_01468899; iTF_01469723; iTF_01468924; iTF_01466307; iTF_01466248; iTF_01467121; iTF_01467187; iTF_01467127; iTF_01466239; iTF_01465222; iTF_01465377; iTF_01111802; iTF_01111853; iTF_01428316; iTF_01428406; iTF_01110205; iTF_01110157; iTF_01112702; iTF_01112634; iTF_00309568; iTF_00309545; iTF_01113504; iTF_01113464;
- 90% Identity
- iTF_00691119;
- 80% Identity
- iTF_00080147;