Dsim005995.1
Basic Information
- Insect
- Drosophila simulans
- Gene Symbol
- foxp1
- Assembly
- GCA_016746395.1
- Location
- NC:16368528-16373360[+]
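The Location field packs a sequence identifier, a coordinate range, and a strand into one string. A minimal sketch of splitting it apart follows. It assumes the coordinates are 1-based and inclusive, which the record does not state, and `parse_location` is a hypothetical helper name rather than part of any database API.

```python
import re

# Pattern for 'SEQID:START-END[STRAND]' as it appears in the Location field.
LOCATION_RE = re.compile(
    r"(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)

def parse_location(loc: str) -> dict:
    """Split a location string into seqid, start, end, strand, and span length."""
    m = LOCATION_RE.fullmatch(loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    start, end = int(m["start"]), int(m["end"])
    return {
        "seqid": m["seqid"],
        "start": start,
        "end": end,
        "strand": m["strand"],
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("NC:16368528-16373360[+]")
print(loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 4,833 bp on the forward strand.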
Transcription Factor Domain
- TF Family
- Fork_head
- Domain
- Fork_head domain
- PFAM
- PF00250
- TF Group
- Helix-turn-helix
- Description
- The fork head domain is a conserved DNA-binding domain (also known as a winged helix) of about 100 amino-acid residues. The Drosophila melanogaster fork head protein is a transcription factor that promotes terminal rather than segmental development; it contains neither the homeodomains nor the zinc fingers characteristic of other transcription factors [1]. Instead, it contains a distinct type of DNA-binding region of around 100 amino acids, which has since been identified in a number of transcription factors (including D. melanogaster FD1-5, mammalian HNF-3, human HTLF, and Saccharomyces cerevisiae HCM1). This region is referred to as the fork head domain but is also known as a 'winged helix' [1, 2, 3]. The fork head domain binds B-DNA as a monomer [2], but shows no similarity to previously identified DNA-binding motifs. Although the domain is found in several different transcription factors, these factors share a common function: involvement in early developmental decisions of cell fate during embryogenesis [3].
- Hmmscan Out
-
  #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
  1   2  1.5       1.2e+03    -0.7   0.0         7      34       165     192       162     199  0.88
  2   2  6.9e-28   5.3e-25    87.0   0.1         2      82       339     413       338     419  0.94
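The two domain rows above differ sharply in significance. A short sketch of reading them into typed records and keeping only the hits below an independent E-value cutoff is given here. The field names follow the column headers of the record, while `DomainHit`, `parse_hit`, and the `1e-3` cutoff are illustrative assumptions and are not taken from the source or from HMMER itself.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    """One row of the per-domain hmmscan table shown in this record."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float

def parse_hit(row: str) -> DomainHit:
    """Parse one whitespace-separated domain row (13 fields expected)."""
    f = row.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {row!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *map(int, f[6:12]),
        float(f[12]),
    )

# Rows copied from the Hmmscan Out field of this record.
ROWS = [
    "1 2 1.5 1.2e+03 -0.7 0.0 7 34 165 192 162 199 0.88",
    "2 2 6.9e-28 5.3e-25 87.0 0.1 2 82 339 413 338 419 0.94",
]

hits = [parse_hit(r) for r in ROWS]

# Illustrative threshold only; choose a cutoff appropriate to the analysis.
I_EVALUE_CUTOFF = 1e-3
significant = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

for h in significant:
    ali_len = h.ali_to - h.ali_from + 1  # inclusive alignment span on the protein
    print(h.index, h.ali_from, h.ali_to, ali_len)
```

With this cutoff only the second domain is retained; its alignment covers protein residues 339 to 413, a span of 75 residues.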
Sequence Information
- Coding Sequence
- ATGGTAATGATTATAAATtacagttttaaaaatatgcatCGGATACATGACGACGAGTATTCCGATGATGCGAAGGAATCGGACTTCAAGGGTTCGATTCAGAAGGAGATATCAGTGAAGTCGAGACACCAAATTTCCATACCGGACATATGCTCAGATGCTGTTAAGAACAATTGCTTTCCGCCGACTGGCTTTCTCAATAGCTCGATCGCCTTTGCCTCACATGTTGTTAAGTGCAGCTCACCGGCCTCGAGCATCGAGGAATCCTCAACGGCGGCCCAGCAGCACGAATCCAATCCCCACATGCACATGCAGGGTCAGCATATGATGGCTCCCGTTCCCGATCTGGGCTTCTACAATGTGCCCGAGTTCATATCCGAGCAGGAGAAGCTCATGTTCTCCGACGCGGAGAGGTTCATGCGGTCGAAGGATAATGAGGTGTGCAACAATGATTTCAGCTATATGCACGATGAGTTCGCCATGCGCAAGTACTATCATCCGTTATTTGCTCATGGCATATGCCGCTGGCCTGGTTGCGAAATGGATTTGGAGGACATCACATCGTTTGTCAAGCATCTGAATACGGAACATGGTTTGGACGATCGATCAACTGCACAGGCTCGGGTTCAAATGCAAGTTGTCTCCCAGCTGGAATCGCACCTTCAAAAGGAAAGAGATCGCTTGCAGGCAATGATGCATCACTTGTATTTGTCCAAGCAACTTTTGTCACCCACCAAAATCGATAGGAAGgacGTGCCCGGAAGAGAGGGAAAGTTTTGCCGGAGTCCGCTTACCGTGAATAGCATAGGCCGACCCATCCGCCAAACAAACTCACCGAGTCCTCTGAATCTTCCAATGGTTAATTCCACCAACTTGTGCTCGATCAAAAAGAGAAACCACgacaaaaatacattttccataAATGGGGGATTACCCTATATGCTTGAAAGAGCCGGTCTTGATGTGCAACAGGaaATCCATCGAAACAGAGAGTTCTATAAGAATGCTGATGTACGACCGCCTTTTACTTATGCTTCCCTCATAAGAcagGCTATAATTGACTCGCCTGACAAGCAGTTAACCCTAAACGAAATCTACAACTGGTTCCAAAACACATTTTGCTACTTCCGACGCAACGCAGCTACGTGGAAGAATGCGATTCGTACGAACCTTTCCTTACACAAGTGCTTTGTACGTTATGAAGATGACTTTGGCTCGTTTTGGATGGTCGACGATAATGAGTTTGTCAAAAGGCGGCACTTGTCGAGAGGGAGACCCCGGAAATATGAACCGTCCTCCTCCCCAAATTCATGCCAATCCGGCAATGGTGTGCCCACTGATAAGAATCCCTGCGACAATTGTACGCAACATTGCACTAGTTTGCCACCGGGGGCTGATAATCCTTTAGATTCCAATAATCCAAATGATTTAGGCAGAATTGGTTGTCTTCCCTATTGCGGTAGTGATGGTTTAAGTAAGGCGTCCAAGGACTATAGCAACATGGATTCGGGTATGGTCGAAAGTAACAGCCATTTGGCAATCGATGAATATTCTACTAATATGTACGAGAGTAGTGCCAATGAGCACAATCGATAA
- Protein Sequence
- MVMIINYSFKNMHRIHDDEYSDDAKESDFKGSIQKEISVKSRHQISIPDICSDAVKNNCFPPTGFLNSSIAFASHVVKCSSPASSIEESSTAAQQHESNPHMHMQGQHMMAPVPDLGFYNVPEFISEQEKLMFSDAERFMRSKDNEVCNNDFSYMHDEFAMRKYYHPLFAHGICRWPGCEMDLEDITSFVKHLNTEHGLDDRSTAQARVQMQVVSQLESHLQKERDRLQAMMHHLYLSKQLLSPTKIDRKDVPGREGKFCRSPLTVNSIGRPIRQTNSPSPLNLPMVNSTNLCSIKKRNHDKNTFSINGGLPYMLERAGLDVQQEIHRNREFYKNADVRPPFTYASLIRQAIIDSPDKQLTLNEIYNWFQNTFCYFRRNAATWKNAIRTNLSLHKCFVRYEDDFGSFWMVDDNEFVKRRHLSRGRPRKYEPSSSPNSCQSGNGVPTDKNPCDNCTQHCTSLPPGADNPLDSNNPNDLGRIGCLPYCGSDGLSKASKDYSNMDSGMVESNSHLAIDEYSTNMYESSANEHNR
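The protein sequence should be recoverable from the coding sequence by codon-wise translation. The sketch below assumes the standard genetic code and treats the lowercase stretches in the coding sequence as ordinary bases; neither point is stated in the record. `translate` is a hypothetical helper, and only the first and last few codons of the sequence are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = cds.upper()  # assumption: lowercase letters are ordinary bases
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight and last five codons of the coding sequence in this record.
print(translate("ATGGTAATGATTATAAATtacagt"))  # MVMIINYS
print(translate("GAGCACAATCGATAA"))           # EHNR
```

Both fragments agree with the start (MVMIINYS...) and end (...EHNR) of the protein sequence listed above; passing the full coding sequence to `translate` would check the whole record the same way.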
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00539475; iTF_00542954; iTF_00515415; iTF_00594364; iTF_00491330; iTF_00503810; iTF_00593667; iTF_00608995; iTF_00477160; iTF_00569512; iTF_00615400; iTF_00571721; iTF_00477834; iTF_00489194; iTF_00617450; iTF_00525707; iTF_00487820; iTF_00524236; iTF_00531500; iTF_00540908; iTF_00475757; iTF_00482799; iTF_00479989; iTF_00488533; iTF_00481366; iTF_00606136; iTF_00612632; iTF_00538777; iTF_00579801; iTF_00484954; iTF_00550074; iTF_00561562; iTF_00591517; iTF_00581347; iTF_00536581; iTF_00578341; iTF_00614725; iTF_00529383; iTF_00547879; iTF_00607627; iTF_00565196; iTF_00511821; iTF_00605389; iTF_00504501; iTF_00494201; iTF_00545785; iTF_00618969;
- 90% Identity
- iTF_00539475; iTF_00591517; iTF_00607627; iTF_00511821; iTF_00618969;
- 80% Identity
- -