Avir026616.1
Basic Information
- Insect
- Agapostemon virescens
- Gene Symbol
- SGF1
- Assembly
- GCA_028453745.1
- Location
- CM052095.1:9824804-9826315[+]
Transcription Factor Domain
- TF Family
- Fork_head
- Domain
- Fork_head domain
- PFAM
- PF00250
- TF Group
- Helix-turn-helix
- Description
- The fork head domain is a conserved DNA-binding domain (also known as a winged helix) of about 100 amino-acid residues. Drosophila melanogaster fork head protein is a transcription factor that promotes terminal rather than segmental development, contains neither homeodomains nor zinc-fingers characteristic of other transcription factors [1]. Instead, it contains a distinct type of DNA-binding region, containing around 100 amino acids, which has since been identified in a number of transcription factors (including D. melanogaster FD1-5, mammalian HNF-3, human HTLF, Saccharomyces cerevisiae HCM1, etc.). This is referred to as the fork head domain but is also known as a 'winged helix' [1, 2, 3]. The fork head domain binds B-DNA as a monomer [2], but shows no similarity to previously identified DNA-binding motifs. Although the domain is found in several different transcription factors, a common function is their involvement in early developmental decisions of cell fates during embryogenesis [3].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 1e-40 1.7e-37 128.1 0.1 2 88 150 235 149 235 0.98
Sequence Information
- Coding Sequence
- ATGACCATGCTCCAGTCGCAGAAGCTTTACGGCGACGCTGGTAGCCTCGGCGGCGCGATGACCAGCGCCGCGATGTCCCATATGGGCAGCATCCCGCCGTCGTACACGTCGATCAACTCGATGGGTTGCGTGTCCATGGGAATGTCGATGGGCGTCGGTGTGGGCCCGAGCTGCAGCCCTCAGGGCGCCGGCGGTTTCAACATGAGCTCGGTCAGCTCCGCCATGGGGATGGCGACGATGGGCGGCGGTGGCATGAGCAGCTACGGCGGCGGCTCGATGGCGAACGGCAGCGCGTGCATGGGCGCCGTCGGTTACGGTCCGTTGACGCCGGCCGGGGGCACAGGGGTCACCAGAGACCCGTTGTCCCTGACCGAGCCGGACTCGCCGAACTCCGCTCTGCAGCGCGTCCGCGCCGACAAGTCCTACCGTCGGAGTTACAccgccgcgaaaccgccgtactcttacatcagcctgatcacgatggcgatacagaacgcgccgtcgaagatgctcaccctatccgagatctatcagttcatcatggacctgttcccctactacaggcagaaccagcaacgctggcagaactcgatcaggcactcgctcagcttcaacgactgtttcgtcaaagtggcgcgcacgcccgacaagccgggcaaaggttcgttctggacgttgcacccggagagcgggaacatgttcgagaacggttgctacctgcgccggcaGAAGAGGTTCAAGGACGAGAAGAAAGAGCTCACCAGGCAGTCGATCAAGCACCAGCAACACCAGCAACATCACACCAGCATCACGGCCACCGGCACCACGGCAGAACACAGCAGCCCGACGCATCGCCTGGCGCCGGTAGCCGGTCGGACGTCCACGTCCCTTCACCATGGCACGCAACAGCAAGAGGATAAAGATCAGCATTCTCTAGTATCCCCGCATCATCATCACCACGCAGCCAGCCTCCATCAGCACCACGCCGGCTTGAAAACCGACACGGCGGACATAGGCGGTCTCCTCGGGCCAGATTTAGGCACGGCGCACGACGAGCTCACCGCCATGGTCAGCCGTAGCCTTCATCCCCATCTGATCCCCGACACGTCCGCCCTCCACCATGGCATGGCCGGTAGCTTGAAGCAAGAACCCCCGTACACCGCTGCCAGCCATCCTTTCAGCATCACCAGGCTACTTCCGGGAGCCACGGCTGGCACGTCGCCCGGCGCTCAGGACACCAAGCCGCCCGAGATGAAGATGTACGAACAGCTGCACCAGAGTTACGCGAACTTCGGCTCCTCGCATCATCCCCACGCTCATTCCGCCCCGCCGAGCCATCATCATCACAACGGCATGCACACGAGCCCGAACACCGCCGGGCCCATGCACAACATGACCAACCACCATCATCAGGAGTACTACCAAAGCCCGCTTTATCATCACGCGACCAGCGTGGCCAGCAGTAGCGCTCCACCACCGCCCACAGTTGTTAGCGCTGCACCAGGATTGTGA
- Protein Sequence
- MTMLQSQKLYGDAGSLGGAMTSAAMSHMGSIPPSYTSINSMGCVSMGMSMGVGVGPSCSPQGAGGFNMSSVSSAMGMATMGGGGMSSYGGGSMANGSACMGAVGYGPLTPAGGTGVTRDPLSLTEPDSPNSALQRVRADKSYRRSYTAAKPPYSYISLITMAIQNAPSKMLTLSEIYQFIMDLFPYYRQNQQRWQNSIRHSLSFNDCFVKVARTPDKPGKGSFWTLHPESGNMFENGCYLRRQKRFKDEKKELTRQSIKHQQHQQHHTSITATGTTAEHSSPTHRLAPVAGRTSTSLHHGTQQQEDKDQHSLVSPHHHHHAASLHQHHAGLKTDTADIGGLLGPDLGTAHDELTAMVSRSLHPHLIPDTSALHHGMAGSLKQEPPYTAASHPFSITRLLPGATAGTSPGAQDTKPPEMKMYEQLHQSYANFGSSHHPHAHSAPPSHHHHNGMHTSPNTAGPMHNMTNHHHQEYYQSPLYHHATSVASSSAPPPPTVVSAAPGL
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00864450; iTF_00865133; iTF_00760803; iTF_01523177; iTF_00117815; iTF_00754396; iTF_00183108; iTF_01066052; iTF_01089806; iTF_01068785; iTF_01069460; iTF_01070166; iTF_01067420; iTF_01066756; iTF_01065363; iTF_00762364; iTF_01068094; iTF_00862377; iTF_00861697; iTF_00763049; iTF_00863769; iTF_00865821; iTF_00866519; iTF_00860938; iTF_00863086; iTF_00860194; iTF_00140399; iTF_00226407; iTF_00220395; iTF_00218924; iTF_00222349; iTF_00229166; iTF_00227089; iTF_00214838; iTF_00216886; iTF_00225723; iTF_00219614; iTF_00225054; iTF_00217567; iTF_00228479; iTF_00214218; iTF_00232430; iTF_00231806; iTF_00221743; iTF_00141650; iTF_00224372; iTF_00223687; iTF_00233045; iTF_00216201; iTF_00229843; iTF_00142294; iTF_00218231; iTF_00230520; iTF_00221071; iTF_00223003; iTF_00231131; iTF_00227772; iTF_00215520; iTF_00183763; iTF_00965862; iTF_00675833; iTF_00625312;
- 90% Identity
- iTF_00860194;
- 80% Identity
- -