Basic Information

Insect
Amiota mariae
Gene Symbol
MYRF
Assembly
GCA_035041805.1
Location
JAWNKV010000425.1:587377-595633[+]

Transcription Factor Domain

TF Family
NDT80_PhoG
Domain
NDT80_PhoG domain
PFAM
PF05224
TF Group
Unclassified Structure
Description
This family includes the DNA-binding region of NDT80 [2] as well as PhoG and its homologues. The family contains Swiss:Q05534 or VIB-1. VIB-1 is thought to be a regulator of conidiation in Neurospora crassa and shares a region of similarity to PHOG, a possible phosphate nonrepressible acid phosphatase in Aspergillus nidulans. It has been found that vib-1 is not the structural gene for nonrepressible acid phosphatase, but rather may regulate nonrepressible acid phosphatase activity [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 3 1 3.6e+04 -5.4 5.7 108 137 217 273 175 296 0.59
2 3 1.2e-38 4.3e-34 119.3 1.1 2 180 530 676 529 676 0.96
3 3 0.4 1.4e+04 -3.0 0.3 49 85 1241 1275 1223 1305 0.48

Sequence Information

Coding Sequence
atggATATGGATTTTCTAGCCGATTTTACAGATCTAAcgCGTGCTGATTTCATTGGCGGCATTGATAACGATGCGCTGGATTTTGGAAATTTGGAACAGTTTATGCATGTAGAGACTGGCGTTGGCCAACTCGATGACGATGTTGTGCCTAATTCCGGTGTAACTGGTGTTGGACATATGCACAATGATATTAATGGTGGTGGTGCAAAGATAGAATCACCAACGACACCACCAATGCATGGCATGGATGGGAATCTTCCTTTAGCGACTGTTTCGGCACGTGTTAATGTGACAAGTACACCGATTGCCACGCCATCAGGCATGAATGGCGCACTTACGAGTAGTGGTGGAGGAGTCGGTGGAAGTGCTGGCATTTGCGGCGGTGGAGGTGGTGTAAGTCATTTACCTGAGAGTCCACCGGATTCGGGTTCAGAGCCTCCATATAGTCCCTTGCAAGATACACATGGCTTGGCTTTAACAGCACGTGATCTATACCATGGCATGTTACCATCACATGAACTGCATATGACATCACAGTACACcccgccgccaccaccaccatcatcacatcaacaacagcagcagcagcagcaaagtgGGCAACTGCCGCATCTTCATTATCAATCACAACATTTAAATCCACCACAAGATAACAATGGTATGCTAAATGGCAGTGTACGCATTAAACATGAGGCCGGTTTGATAATTAATCCAAATGGTTTAATGACatcacaacagcaacaggcTTTGGTAGAACATCAAGcactccaacaacaacaacaacatcaacatcaccaacaccaacagcagATACAGCAACAATCACATATTGTCCCTTCACAACATGCAGAACAGCAACAGTTCATATTCCAGAATGCAAACTCTGGTCTCATGCATTTCGACAATATGGCGTTAAATGGTAATAGTCCGATTGGTGGTCTGTATGCCTCTGCAagttatcaaaatatttctggTATGCTAAATGATAACGAACAAACGCCTACCTGCATGCTAACCTCGTCGTTGGGCGAATCGACTCGAGTTCAGGTAGTAGGCACTAGCCAAGCTTCCTTAGATCGCAGTTCTGTACCCACTACGCCAGTCCATTCGTCGTCACGCAAACGAAAAATGTCTACACAATTGGATTTCCCCGAATTTGGACATAAGCATGATTCGGGTCTTTCGATGAGTCCTCTGCGTGCCTCGCACCATTCGTTGGGTGCCACATCACCAATCAGAATAAATCTACCTGCGCCCAGTCTCAATGGCACGAAGCCAAACGAAATGTCTAAAACACCGGTGCACTCCTCTGCTTCAGTTTCGCCGGCGTTATCTACCGCCAATTCAAATGCGGACAATAGTCTTGATGGTCATGGCAGCTGTGCTGCATCAGCCAATGGTGGTCCATCAAGCGTTGCAGGCACAGAAAATGGTGATAGTGGCGCATTAACTCCTTGCATACGATTTAGTCCATTTCAATCGGAGAATTGGCACAAGTTGTGCGATCAGAGTCTGCAGGAATTGTCTGTGGTCTATTATCGTGTCGACGCCGATAAAGGCTTCAACTTCTCTGTCTCCGACGATGCTTTTGTGTGCCAGAAAAAGAATCATTTTCAAGTAACTTGCCATGCGCGCCTGCAAGGCGATGCCAAATTCGTTAAGACACCGTCCGGCCTTGAGAAAATCAAATCCTtccatttgcatttttatggCGTCAAATTAGAGGCGCCAAATCAGACGATACGCGTCGAACAGAGTCAATCGGATCGTTCAAAGAAACCGTTTCATCCCGTACCAATCGATCTCCAGAGCCACATTGTGAGCAAGGTCACAGTTGGGCGCTTACATTTTTCCGagacaaccaacaacaatatgcgAAAGAAAGGTCGCCCCAATCCGGAACAGCGCTATTTTCAACTTGTTGTTGGACTACACGTGCACACCACTTCCGGAAATTTCCCAGTTATAAGTCAATGTAGTGAACGCATCATTGTGCGCGCATCAAACCCCGGACAGTTTGAGTCAGATGTCGATCTTTGCTGGCAGCGTGGCATAACACAGGATTCAATATTTCATGCAGGTCGTGTCGGCATCAATACAGATCGACCTGACGAGAGCCTTGTTGTGCACGGCAATCTCAAGGTGTCCGGACACATTGTGCAGCCAAGCGATAGCCGTGCCAAGCAAGAAATCGGTGAACTGGATACATCAGTTCAACTTCGAAATCTGCAAAAGATTCGCATTGTACGGTACCGTTATGAGCCCGAGTTTGCTGTACACTCTGGGTTGAAACGTGCCAATGACAACGAGGAGATCGTTGACACCGGAGTGATAGCCCAAGAGGTACGTGAAGTCATACCGGATGCGGTAAAAGAGGCTGGCAGCATTGTGCTGCCCAATGGAAATGTTATTGAGAACTTTTTGCTGGTGAATAAAGATCGTATACTTATGGAAAATATCGGCGCTGTGAAAGAATTGTGCAAAGTGACTGGTTCGCTAGAGTTACGCATAGAGCATCTGGAACGGGAAAATACTCATGTTATGCGTGCCAAGGAATTAGAACAGCGTGGGATTATTGCGGAGCGTGCACgaaaaacagcaacaagacGTGAGGGTTATGAGATTTGTTCTAGCAGGACATTACAGATTATCATATTCCTATTAATTATTGTCATGGCTGCATGTCTTGCTGCCGTTTCCACACTCTACTTTGTGGAGCACAGCAAACAGCAACATAATTTGAAGCAACTCGAAAGTTACCAAATCTTTGGCAATGGACACATCTTCCGGCCTGCTGATGGGCCTTCTTATATAACCGACCAAGAACGACATTACATGCAGCACAATTTCCACACATTCCTTAACAAGAACAAAACTCATGGCCACTGGCCCAGTCTAATTTACGCCATGAGCACGACACGTCCACCGGCCAAGAATGCCACCAAATTACGTGAAGATTTCCTAACATACAACAGCAGTGGCGAGTCGTACACGACACACGGCGCAAACGAAGAATTGACTGTAGTCATGGAGAAGCCTTTTGTTAACCTACAGACGCTGTTACTGCCCAAGCAGCACTTCCGCATTACGACATCCGCGCCACAATTGCGCAACAAAACCATCAGCAACAAGAATAAATCCAAATGGCCATTGCCGCAAGAAGTGCTCAGAGCGGCGACCGCCAACCAAAATGCGCAGAAACTAATCTTGACTTCGAGAAACCTTAATACACAAAACTTTGTTGTAGTCGCTGGTACACCACCTACAGCATCGCCGCCATCAACAAATCTACCGCAACCTGTTGTTGTAGAAACGTATTCGAATAATGAAACGTCTTCGGAAAAGGTGCCAGAGGATTTcgaaaataattcaatagaTACAGATGCCCAGCACATTATCAAAAAGACGCTGAATGCTGCCAAGCTAATCAATGTTCAAAATGTTGTCTCGGCAGGAGTAGTAACAGGAAATGAACCTAAAGAAGAGGCAGAGGCATCACAAGAAGAGACTCTGATCAATGCTAATAGTGACACAGATATATCTCTGAAACTATCCGATTCAATTGCTGTTGAGACAATCGCCAGTACCTCAAGCAGTAACAGCATCAATGTGCCAGACTCTACGTCTGCAGCTGGACAAGCTATACGCAAGATAAATACTGGAGATGCTGTCATCTACAACGTGTATAAGACGGTTTCGCCACCTACGGCAAATCTGGCGTTGACCACCAATAAAGTAAACACTGAAAGCACACAGACAAATATATCCATTGAGTCACCTGCCagtacaacaaacaacaatagccGCCCAACAAACACACATCATGAAAGTCCTGATGCAACAGATTTACAAAATCtgagcaacaataataatgaatctGTGGACAACCCAATCACTGCACTATTTGGGTTTGAATACCCTGGATTGCGTGAGTCAAGTGTAGGCCGACGATCTGCTTCACACAGAAGCGTGGAATGGATGACACATAAAACTGTTAAAGCAGAGACTTTTGGTGATCCCTCCGAATGCGCAAACGTCGAGCAATCTAATGAACAATGCCAAACTGTCTGTTTTGATCCGGTAAAATCCAGCCAGAAGAAGGAGATCATAGATCCGATAAGCAATAGCGAGACCATGAAGCTTCAACGATCCATTGATGAAATGCATGAAACACTCGAAGATCGAGACCTAAGTGATTTGGGCTTTGACGACAGCGTGTCTAAActccaacaaaacaataaaactctAGGTATCAATATTGCACCAGGTAAAGCATCACACGCTACAGATGCCGCTGCAAAACAATTCTCAGACGAACAGGAGTCTTTGGAATCAGATGATGCTTTGGAATCGATGCTTAGTCAAGTTGCCTTATCTGCAGAGGCTTTGGAGGCGGCACCTAAAATTGTTGCCGACATAAATGCGCATATTATGTCAAGCATTGTCTCAAAGCATGATGCACAACAACATGGGTTGCCTTTGGATTGCTGGACAGTCACCACTCTTATACTCGCCGGTCAACTTAATCGCACCATAGGTGCCGAGCAATTCTATCCAAGTTTGGGCAAATCACTTAATATTACGTATTTAGTGCCTATGTCCCGATTTCTTAAAATTGACAATATTGAATTGCAATTAAGtTCAAACAAACCACTACGATGGTCTGTATGTAACAgcaatgataatgaaaaatcatCAAGTTCAGCTCATGCGGATCTAGGTGGCGATGatgatacaaatatacaagaaCCGCAAACACCAAATACAGTGACTATTAAACAAGATAACAGCGACAAACTCAGTCTGGGTCTCAAGATACCGAGCAATGGTTACTTTCTGAGAAATTTTATGCTACGCGCTAGCACCGATTTGGAGCAGCAAAAACTTTGCGATGATGACGCTCACTTAGCAAATACATTACTCCAGTACAACTTTAGAATCTTAAGAGATTGTGATTAG
Protein Sequence
MDMDFLADFTDLTRADFIGGIDNDALDFGNLEQFMHVETGVGQLDDDVVPNSGVTGVGHMHNDINGGGAKIESPTTPPMHGMDGNLPLATVSARVNVTSTPIATPSGMNGALTSSGGGVGGSAGICGGGGGVSHLPESPPDSGSEPPYSPLQDTHGLALTARDLYHGMLPSHELHMTSQYTPPPPPPSSHQQQQQQQQSGQLPHLHYQSQHLNPPQDNNGMLNGSVRIKHEAGLIINPNGLMTSQQQQALVEHQALQQQQQHQHHQHQQQIQQQSHIVPSQHAEQQQFIFQNANSGLMHFDNMALNGNSPIGGLYASASYQNISGMLNDNEQTPTCMLTSSLGESTRVQVVGTSQASLDRSSVPTTPVHSSSRKRKMSTQLDFPEFGHKHDSGLSMSPLRASHHSLGATSPIRINLPAPSLNGTKPNEMSKTPVHSSASVSPALSTANSNADNSLDGHGSCAASANGGPSSVAGTENGDSGALTPCIRFSPFQSENWHKLCDQSLQELSVVYYRVDADKGFNFSVSDDAFVCQKKNHFQVTCHARLQGDAKFVKTPSGLEKIKSFHLHFYGVKLEAPNQTIRVEQSQSDRSKKPFHPVPIDLQSHIVSKVTVGRLHFSETTNNNMRKKGRPNPEQRYFQLVVGLHVHTTSGNFPVISQCSERIIVRASNPGQFESDVDLCWQRGITQDSIFHAGRVGINTDRPDESLVVHGNLKVSGHIVQPSDSRAKQEIGELDTSVQLRNLQKIRIVRYRYEPEFAVHSGLKRANDNEEIVDTGVIAQEVREVIPDAVKEAGSIVLPNGNVIENFLLVNKDRILMENIGAVKELCKVTGSLELRIEHLERENTHVMRAKELEQRGIIAERARKTATRREGYEICSSRTLQIIIFLLIIVMAACLAAVSTLYFVEHSKQQHNLKQLESYQIFGNGHIFRPADGPSYITDQERHYMQHNFHTFLNKNKTHGHWPSLIYAMSTTRPPAKNATKLREDFLTYNSSGESYTTHGANEELTVVMEKPFVNLQTLLLPKQHFRITTSAPQLRNKTISNKNKSKWPLPQEVLRAATANQNAQKLILTSRNLNTQNFVVVAGTPPTASPPSTNLPQPVVVETYSNNETSSEKVPEDFENNSIDTDAQHIIKKTLNAAKLINVQNVVSAGVVTGNEPKEEAEASQEETLINANSDTDISLKLSDSIAVETIASTSSSNSINVPDSTSAAGQAIRKINTGDAVIYNVYKTVSPPTANLALTTNKVNTESTQTNISIESPASTTNNNSRPTNTHHESPDATDLQNLSNNNNESVDNPITALFGFEYPGLRESSVGRRSASHRSVEWMTHKTVKAETFGDPSECANVEQSNEQCQTVCFDPVKSSQKKEIIDPISNSETMKLQRSIDEMHETLEDRDLSDLGFDDSVSKLQQNNKTLGINIAPGKASHATDAAAKQFSDEQESLESDDALESMLSQVALSAEALEAAPKIVADINAHIMSSIVSKHDAQQHGLPLDCWTVTTLILAGQLNRTIGAEQFYPSLGKSLNITYLVPMSRFLKIDNIELQLSSNKPLRWSVCNSNDNEKSSSSAHADLGGDDDTNIQEPQTPNTVTIKQDNSDKLSLGLKIPSNGYFLRNFMLRASTDLEQQKLCDDDAHLANTLLQYNFRILRDCD

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00061133;
90% Identity
iTF_00061133;
80% Identity
-