Basic Information

Insect
Tipula unca
Gene Symbol
MYRF
Assembly
GCA_951394425.1
Location
OX596383.1:15436412-15440962[-]
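
The Location field packs the sequence accession, start, end, and strand into a single string. The following is a minimal Python sketch for splitting it apart. The helper name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re

# Matches strings shaped like "OX596383.1:15436412-15440962[-]".
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)

def parse_location(text):
    """Split a location field into accession, start, end, and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("OX596383.1:15436412-15440962[-]")
print(loc)
```

Under that coordinate assumption, the locus on the minus strand spans 4551 bases.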

Transcription Factor Domain

TF Family
NDT80_PhoG
Domain
NDT80_PhoG domain
PFAM
PF05224
TF Group
Unclassified Structure
Description
This family includes the DNA-binding region of NDT80 [2] as well as PhoG and its homologues. The family contains VIB-1 (Swiss:Q05534), which is thought to regulate conidiation in Neurospora crassa and shares a region of similarity with PhoG, a possible phosphate-nonrepressible acid phosphatase in Aspergillus nidulans. vib-1 has been found not to be the structural gene for nonrepressible acid phosphatase; rather, it may regulate nonrepressible acid phosphatase activity [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1  4   0.35      1.8e+04   -2.8   1.1   80        104     219       243     202       275     0.54
2  4   2.3e-35   1.1e-30   108.6  1.0   2         180     388       534     387       534     0.94
3  4   0.21      1e+04     -2.0   0.3   56        97      893       936     827       960     0.58
4  4   0.039     1.9e+03   0.3    2.0   36        101     951       1016    921       1039    0.73
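
Only one of the four reported domain hits has a low independent E-value. The sketch below shows one way to parse the rows and keep the confident hits. The field names, the helper names `parse_rows` and `significant`, and the `1e-5` cutoff are illustrative assumptions, not values taken from this record or from HMMER.

```python
# Domain rows copied from the Hmmscan Out table of this record.
ROWS = """
1 4 0.35 1.8e+04 -2.8 1.1 80 104 219 243 202 275 0.54
2 4 2.3e-35 1.1e-30 108.6 1.0 2 180 388 534 387 534 0.94
3 4 0.21 1e+04 -2.0 0.3 56 97 893 936 827 960 0.58
4 4 0.039 1.9e+03 0.3 2.0 36 101 951 1016 921 1039 0.73
"""

# Hypothetical field names, one per whitespace-separated column.
FIELDS = [
    "index", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_FIELDS = {
    "index", "of", "hmm_from", "hmm_to",
    "ali_from", "ali_to", "env_from", "env_to",
}

def parse_rows(text):
    """Turn whitespace-separated domain rows into typed dictionaries."""
    hits = []
    for line in text.strip().splitlines():
        values = line.split()
        if len(values) != len(FIELDS):
            raise ValueError(f"unexpected column count: {line!r}")
        hits.append({
            name: int(value) if name in INT_FIELDS else float(value)
            for name, value in zip(FIELDS, values)
        })
    return hits

def significant(hits, max_i_evalue=1e-5):
    """Keep hits whose independent E-value is at or below the cutoff."""
    return [hit for hit in hits if hit["i_evalue"] <= max_i_evalue]

hits = parse_rows(ROWS)
for hit in significant(hits):
    print(hit["index"], hit["ali_from"], hit["ali_to"], hit["score"])
```

With this cutoff, only domain 2 (alignment coordinates 388 to 534 on the protein) is retained; the other three rows have i-Evalues far above any usual significance threshold.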

Sequence Information

Coding Sequence
ATGACAAATATCGATAATGAAACAATAGATTTCGCTCAACTGGAGGCTCTCATACATGGTACACATGCAGTAGCAACGTCAACATCTGCATCGGGTAGCCATAACACAAATAACATTACTACAAATGCAAATCAAGTTGGAATAAAGTCATCGGCGACATCAGCCACATCATCTGGTGCTTCAcaaatttcaaatgataattCCGTTACACAATCGCATTTGCCGGAAAGTCCACCCGATTCGGGAtcagagccaccatatagtccAGCCGATTTACATAGCTTAAATGGCCAACAGCAATTACAAAATGGCAATAATATTCAAACATTATCTGAAATTGATATTAATAATGTGACTGAATTGCATGTGGCCCATCATCTATTGAATAAtccaaatacaatttatataacaGAAGTGGCAAATGGGGGAGGGGTACAACAAAATAGCGACCGTCAAAATGAGCAACAAATTGTGCTACAGTCACAAGCATCGTCTATACTTGATATAGATCATCATCGTAGTTATCGCAATGACCTATTAACGTTGGGacagcagcagcaacagcatCCAAGCGATAATGTTGTTGGAAATCAAGGAATTATACCCGATCTATATGGACACCATCAATTGCAACCACCACCGCCGCCGCCACCTCCTCCGCCTCAGCAACAAATTCATGACCAGGTCCATCATCATCAGCATCACACGcatatacaattaataaataataattcacggAAACGAAAATTATCCCAACAAAATAATGGACCAGATGCGTTAACCTGTCAGCAAAATGGTGGCCAACAATCTATAATAAAACCGGAACCGATACGAGATCTGTGTgacatacaatatatacaaaatcatggTGATATGCTATTTTGTCAAACcgaatcaaatcaaaatcaatctGCTCTCAGTCCCATATGTGGAAGTGGAAGTTCGCTCGAATCATCGTCACCGTGTTTGGCAAATGATGCCCAACAGCAAAACGGTAATCATGATGGTGGTGAACATTGCCCGCTACAGTGTATTAGATTCACACCATTCCAACAGCAAAATTGGCATATTTTGTGTGATCAAAGTCTACAAGAAATccCAATACCACACTATCGGGTCGATGCGGATAaaggttttaatttttccaattctgatgATGCGTTTGTATGTCAAAAAAAGAATCACTTTCAAATAACATGCCATGCCCAGCTTCAAGGTGCTGCCGTATTCATACGAACCGCCTCCGGTTTAGAAAAGGTTCGATCTTTCCATTTGCATTTCTATGGCGTAAAGCTAGAGGCACCAACACAAACGATAAGGGTCGAACAAAGCCAATCGGATCGATCAAAAAAACCATTCCATCCGGTGCCTATCGATTTACAGAATGGTCAGGTTAGCAAAGTAACGGTGGGCCGTTTACATTTCTCCGAGACAACAAGTAATAATATGCGAAAGAAGGGCCGTCCCAATCCTGAGCAACGATATTTTCAATTGGTTGTAGGCTTGCACGCACACACACACTCCGGCCATTTTCCAATAATCAGTCATGGAAGTGAACGGATTATAGTGCGCGCATCAAATCCTGGCCAATTCGAAAGTGATGTAGAATTAAGTTGGCAGAGGGGCGCAACCCAAGAATCAATATATCATGCCGGCCGTGTAGGGATAAATACGGATCGGCCGGATGAAAGTCTAGTGGTGCACGGTAATCTAAAAGTTTCCGGTCACATTGTGCATCCGAGCGATAGCCGAGCAAAGCAGGAAATTAACGAATTGGATACGTCACAGCAATTgataaatgtacaaaaaattcgAGTGGTACGCTATCGTTACGAACCGGAATTTGCGGCCCACAGCGGATTATCATCAGATTTAGATAATCATATTATTGATACGGGTGTTATTGCGCAAGAGGTGCAAAAAATTTTACCCGATGCTGTACAAGAAACCGGTAGCCTAATTCTACCGAACGGTcaaattatagataattttttagttgt
aaATAAAGAGAGAATATTTATGGAGAATATTGGTGCCGTTAAGGAACTATGCAAGGTGACTGGTTCGCTTGAAACCAGAATCGAGCAATTAGAAAGAATTAATTCGCGATTGACGCGcatacaaaaatttgataaaaaaacagCCCGAAAATATTGCTACAGCCGAAAATGTAAACACTATCATCATCACAATAATCCAGACGATGATGACTATGATTTATACGATGGCACCGAGGAAGAAAGTAACGAGCTGTGTTCAAATAAAATGATTcaagttataattattatattagtaGTTATAATGGCCGTATGCTTGGCCGCTATATCAAccctatattttattgaacatcATAAACAGAATTACGACACAAATCGTAATGCATTTATGGTAAATTTTGGTGGTATACCAAAGACAAATCACTATTCACCCATGCAGCAAGATCAATCGGCGGTTCACACGAAAACCGTTTACAATAGTAGCCAATAcaatgcaaatttaataaatgcacAAAATCGTAATAATTACGTAGACAATTATAATGTTAACACGATTGATTCAACACATTCCACAAATGTTGCCGGATTCCAGGCCTCAACCGTAGCGCCCATATCAACGGCGATAATtgcgaaaaaaattaaactcaatCAAAATCTATCATATCAAACGGACACTTCAAATTATGAGAatgatacaaaaaatcaaattatcacGAAAAATGGTAAACGGACATCGGCAACATCGGCCCAATTTATACCATCGGACATTGAAACAAATTCGATTGATGAAAATAATTCGCAATACACTAgtgtaattgaaaataaaaataaatcagaaacgatgaaatatttaatacagtCGGGTAATAAAATTACATCGGCATCTTTATTACCATCATCGGCAGCGCTATCGACCTCGTCAACAATCGGGACCACAACGATTTTACCAACCAATAGTGATGACTCGAAGCAAAATTCAAATGATCCTCAAAATATAAacATAAATTCGGATATGTTTGACCCTATGCCAGTAAATGAACCAGAAGAGACAGAAAATGGGCCAAAAATAATACGAGTTCCAGTAATTGGTCAACCAAATCAATGTACAAACGAAAATCATGAACATCCCAATTGCCATTCGGTATGCTGCAAGGACAATAAAAATTTAGCCGAAATTCCGGATGATCTTGAATTTCAAGGTGGTGGTGGCGGTGGTGGTGCCGCGGCTAACATAAATACCAATTCAAGTAATATAACCACAgtaacaaatgaaaataataatccaaATGATGAGATTAGCCCATCATCAATGGAACTTCAACATCAATACTACTATAGTACACCAAATGCAAATGTAACATCAATTGTGGCGCCTGAATCGTCTCAGATAATACGACAGCAGGCTGAAAGTTTTAAATCACAGGATTTAGGATGCTACaatattcaattgaaaataCGTTCGGAAACATTTAATCAAACATATAAACGGGAAACGATATGCAGACACGAAGAGCAATCACTTAATGTTACATACGAGTTACCATTTTCAAAGTATATGCAAGAAATTGgacaaattgaattacattttatgtCACCGGAACTAACCGAATGGTATACGTGTCCACggaaatataaacattttattgatgAATCAAATGCCGGCTATGATATAAATCGAACTGTAACCAGTGAACCGAATAGTAATAGTAACAGTTGgaataatattgattttgataCACAAGTTTATCagaaaaattcgacaaaatatattttctacatCGAAATTCCAGTGTTtggtgattttgaaaaattaatacaatttcgGTCATCGTTAATGCCAGCGAGCAAGAATGTATGCCAGCTGAATCGGATCGGGCAGTTTGTTCAATACAACATAAAACTATTtagaaattgtaattaa
Protein Sequence
MTNIDNETIDFAQLEALIHGTHAVATSTSASGSHNTNNITTNANQVGIKSSATSATSSGASQISNDNSVTQSHLPESPPDSGSEPPYSPADLHSLNGQQQLQNGNNIQTLSEIDINNVTELHVAHHLLNNPNTIYITEVANGGGVQQNSDRQNEQQIVLQSQASSILDIDHHRSYRNDLLTLGQQQQQHPSDNVVGNQGIIPDLYGHHQLQPPPPPPPPPPQQQIHDQVHHHQHHTHIQLINNNSRKRKLSQQNNGPDALTCQQNGGQQSIIKPEPIRDLCDIQYIQNHGDMLFCQTESNQNQSALSPICGSGSSLESSSPCLANDAQQQNGNHDGGEHCPLQCIRFTPFQQQNWHILCDQSLQEIPIPHYRVDADKGFNFSNSDDAFVCQKKNHFQITCHAQLQGAAVFIRTASGLEKVRSFHLHFYGVKLEAPTQTIRVEQSQSDRSKKPFHPVPIDLQNGQVSKVTVGRLHFSETTSNNMRKKGRPNPEQRYFQLVVGLHAHTHSGHFPIISHGSERIIVRASNPGQFESDVELSWQRGATQESIYHAGRVGINTDRPDESLVVHGNLKVSGHIVHPSDSRAKQEINELDTSQQLINVQKIRVVRYRYEPEFAAHSGLSSDLDNHIIDTGVIAQEVQKILPDAVQETGSLILPNGQIIDNFLVVNKERIFMENIGAVKELCKVTGSLETRIEQLERINSRLTRIQKFDKKTARKYCYSRKCKHYHHHNNPDDDDYDLYDGTEEESNELCSNKMIQVIIIILVVIMAVCLAAISTLYFIEHHKQNYDTNRNAFMVNFGGIPKTNHYSPMQQDQSAVHTKTVYNSSQYNANLINAQNRNNYVDNYNVNTIDSTHSTNVAGFQASTVAPISTAIIAKKIKLNQNLSYQTDTSNYENDTKNQIITKNGKRTSATSAQFIPSDIETNSIDENNSQYTSVIENKNKSETMKYLIQSGNKITSASLLPSSAALSTSSTIGTTTILPTNSDDSKQNSNDPQNININSDMFDPMPVNEPEETENGPKIIRVPVIGQPNQCTNENHEHPNCHSVCCKDNKNLAEIPDDLEFQGGGGGGGAAANINTNSSNITTVTNENNNPNDEISPSSMELQHQYYYSTPNANVTSIVAPESSQIIRQQAESFKSQDLGCYNIQLKIRSETFNQTYKRETICRHEEQSLNVTYELPFSKYMQEIGQIELHFMSPELTEWYTCPRKYKHFIDESNAGYDINRTVTSEPNSNSNSWNNIDFDTQVYQKNSTKYIFYIEIPVFGDFEKLIQFRSSLMPASKNVCQLNRIGQFVQYNIKLFRNCN
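
The protein sequence should correspond to a translation of the coding sequence. The sketch below is a minimal way to spot-check that with short fragments copied from the start and end of the coding sequence. It assumes the standard genetic code and treats lowercase bases as ordinary bases; the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, with codons ordered by T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()  # lowercase bases are treated like uppercase ones
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 21 and last 15 bases of the coding sequence in this record.
head = translate("ATGACAAATATCGATAATGAA")
tail = translate("agaaattgtaattaa")
print(head, tail)
```

The two fragments translate to `MTNIDNE` and `RNCN*`, which agree with the first and last residues of the protein sequence above. Codons containing ambiguous bases such as `N` are not handled and would raise a `KeyError`.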

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01454852
90% Identity
-
80% Identity
-