Eocc001149.1
Basic Information
- Insect
- Eurois occulta
- Gene Symbol
- FOXL2
- Assembly
- GCA_950022335.1
- Location
- OX465477.1:18757352-18772880[-]
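The Location field packs the sequence accession, start, end and strand into one string. A minimal Python sketch of unpacking it follows; the `Location` type and `parse_location` helper are hypothetical names, not part of the database, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed Location field."""
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string of the form 'contig:start-end[strand]'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)-(\d+)\[([+-])\]", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


loc = parse_location("OX465477.1:18757352-18772880[-]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc.contig, loc.strand, span)
```

Under that assumption the gene spans 15529 bp on the minus strand of OX465477.1.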
Transcription Factor Domain
- TF Family
- Fork_head
- Domain
- Fork_head domain
- PFAM
- PF00250
- TF Group
- Helix-turn-helix
- Description
- The fork head domain is a conserved DNA-binding domain of about 100 amino-acid residues, also known as a 'winged helix' [1, 2, 3]. The Drosophila melanogaster fork head protein is a transcription factor that promotes terminal rather than segmental development; it contains neither the homeodomains nor the zinc fingers characteristic of other transcription factors [1]. Instead, it contains a distinct type of DNA-binding region, which has since been identified in a number of transcription factors (including D. melanogaster FD1-5, mammalian HNF-3, human HTLF and Saccharomyces cerevisiae HCM1). The fork head domain binds B-DNA as a monomer [2], but shows no similarity to previously identified DNA-binding motifs. Although the domain is found in several different transcription factors, a common function is their involvement in early developmental decisions of cell fates during embryogenesis [3].
- Hmmscan Out
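| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | 6.1e-41 | 1.1e-37 | 129.2 | 0.1 | 2 | 88 | 87 | 172 | 86 | 172 | 0.97 |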
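The hmmscan result above is a single domain row whose alignment coordinates (87 to 172) locate the Fork_head domain within the protein. The sketch below shows one way to read such a row and cut the matching region out of a sequence. The field names and both helpers are hypothetical, and the slice assumes 1-based inclusive coordinates, as HMMER reports them.

```python
# Hypothetical field names for the 13 columns of the domain row above.
FIELDS = [
    "domain_index", "domain_count", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_FIELDS = {
    "domain_index", "domain_count",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to",
}


def parse_domain_row(row: str) -> dict:
    """Split a whitespace-separated domain row into typed fields."""
    values = row.split()
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} columns, got {len(values)}")
    return {
        name: int(value) if name in INT_FIELDS else float(value)
        for name, value in zip(FIELDS, values)
    }


def slice_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    return sequence[start - 1:end]


hit = parse_domain_row("1 1 6.1e-41 1.1e-37 129.2 0.1 2 88 87 172 86 172 0.97")
aligned_length = hit["ali_to"] - hit["ali_from"] + 1
print(hit["ali_from"], hit["ali_to"], aligned_length)

# Toy sequence to illustrate the slicing convention only.
print(slice_region("ABCDEFGH", 3, 5))
```

Passing the record's protein sequence with `hit["ali_from"]` and `hit["ali_to"]` to `slice_region` would return the aligned domain region.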
Sequence Information
- Coding Sequence
- ATGGGTTCACCGTACGACAGTCGAATATGCCTCCAGGAAAGCAGCGCGGATGCAGCGAATTCACAAAGCGTCAAAGGGGAAGAAGAGCTATCGCGAGTGTACCAAACCCTCTCACTACCATCACTTCCAAGAGACGATAATAATTCTAACAACAGCAACACTGAAACAAAGAGTAAATCTAAGTCAACGACCCCAGCAAGCCCAGGCAACTCTGAAATGAAGCCGCAGTCCACCACGCCGACGTCGCAAGCGTTAACGAAACCTCCATACTCTTACGTGGCCCTCATCACGATGGCAATACAAAACAGCCAGACGAAACGCGCTACTCTAAGTGAAATTTACGCTTACATCACCAAGGAATTTCCATTTTTCGAGAAAAATAAGAAAGGGTGGCAAAATTCAATTCGGCATAATTTAAGTCTAAACGAATGTTTTATAAAAGTACCAAGAGAAGGCGGTGGTGAGCGTAAGGGAAATTATTGGACCCTTGATCCTCAATGCGGAGAGATGTTTGAAAACGGGAACTTCAGACGGCGGCGAAGGATGAAGAGGCCATTTCGAGCTACCAATTACTCCAAAACGCTGTTTGGAGACGGGTATCACGTGGCACACGTCGGGCAACACGTACCACCTCACATGCAGCCGTTGCCCCTGGGGGCCAGGAATTATTTCGGATCCGGTTCACCTTACCATCCTCCTTCATACCCTCGATATGACACTACATGGCTAACGCAGTCGCCGAGCGGGCTGGGGTACGCGGGCGCGTGCGGAAGTATGGGCCGCAGTCCACCCGGGTGTTCGCCACACAGTGCGCTGCCCCACCAGTCGTCGCCTGTCGCCGTCAACCCGTTTGCCACTCATCAAATTCAAGGACAGCTTCAGAGTCCATTGCAGCCGATGCAGTCGATGTCGATGAATACCTACAATGCTATTGGAGTCAGTGCTATAGACGGTTCTCCTAGCCCCGGTCCTGGTTATGCCCCTTCGGGCGGCGGCTTTTCGCCAAACCGACATCACGACATCGTCACGTCGTCAGACGCGGCTGTAACTCGATTTCCATTCTGGCCTGAAGGTGGATCCCCAAGTCCAAATCCAAGTTACGTGCCGTCTAGCAGTTTCTCGCCAACTCGCCGCCACGAGGCCGTCACTTCATCGGATGCAACGAGTGCTCGCTTCTCGTTCTGGCCAGATGGTGAATAA
- Protein Sequence
- MGSPYDSRICLQESSADAANSQSVKGEEELSRVYQTLSLPSLPRDDNNSNNSNTETKSKSKSTTPASPGNSEMKPQSTTPTSQALTKPPYSYVALITMAIQNSQTKRATLSEIYAYITKEFPFFEKNKKGWQNSIRHNLSLNECFIKVPREGGGERKGNYWTLDPQCGEMFENGNFRRRRRMKRPFRATNYSKTLFGDGYHVAHVGQHVPPHMQPLPLGARNYFGSGSPYHPPSYPRYDTTWLTQSPSGLGYAGACGSMGRSPPGCSPHSALPHQSSPVAVNPFATHQIQGQLQSPLQPMQSMSMNTYNAIGVSAIDGSPSPGPGYAPSGGGFSPNRHHDIVTSSDAAVTRFPFWPEGGSPSPNPSYVPSSSFSPTRRHEAVTSSDATSARFSFWPDGE
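The protein sequence above corresponds to the translation of the coding sequence. A minimal Python sketch of that translation follows, useful as a consistency check between the two fields. It assumes the standard genetic code (NCBI translation table 1), and `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons containing ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last five codons of the coding sequence in this record.
print(translate("ATGGGTTCACCGTACGAC"))
print(translate("CCAGATGGTGAATAA"))
```

Running `translate` on the full coding sequence and comparing the result with the Protein Sequence field is a quick way to confirm the two are in sync.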
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00830972;
- 90% Identity
- iTF_00017985; iTF_01063555; iTF_01028068; iTF_01027105; iTF_00446922; iTF_00809864; iTF_01501893; iTF_00120295; iTF_00049705; iTF_00444914; iTF_00757945; iTF_00706751; iTF_01062559; iTF_00273417; iTF_00274253; iTF_00794126; iTF_00928464; iTF_00123156; iTF_01064440; iTF_00667162; iTF_00383434; iTF_00808909; iTF_01084942; iTF_00821583; iTF_00951599; iTF_00973526; iTF_00952510; iTF_01179311; iTF_00621828; iTF_01073413; iTF_00785008; iTF_00967856; iTF_01094705; iTF_01538545; iTF_00122162; iTF_01192436; iTF_00404501; iTF_01439701; iTF_01440794; iTF_00000124; iTF_00042437; iTF_00685237; iTF_01061688; iTF_00000989; iTF_00869430; iTF_00017150; iTF_01151870; iTF_00771680; iTF_00783232; iTF_00785823;
- 80% Identity
- -