Afus020435.1
Basic Information
- Insect
- Agapetus fuscipes
- Gene Symbol
- Myrf_1
- Assembly
- GCA_951799405.1
- Location
- OX637357.1:14969903-15006705[-]
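The Location value above packs a sequence accession, a coordinate range, and a strand into one string. A minimal sketch of splitting it into fields follows. The `Location` type and `parse_location` helper are hypothetical names introduced here for illustration, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'seqid:start-end[strand]' string."""
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings such as "OX637357.1:14969903-15006705[-]".
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end, and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("OX637357.1:14969903-15006705[-]")

# Assumption: coordinates are 1-based and inclusive, so the genomic span
# is end - start + 1. Adjust if the database uses another convention.
span = loc.end - loc.start + 1
print(loc, span)
```

The `[-]` suffix indicates the reverse strand, so any sequence extracted from the assembly at these coordinates would need to be reverse-complemented before comparison with the coding sequence.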
Transcription Factor Domain
- TF Family
- NDT80_PhoG
- Domain
- NDT80_PhoG domain
- PFAM
- PF05224
- TF Group
- Unclassified Structure
- Description
- This family includes the DNA-binding region of NDT80 [2] as well as PhoG and its homologues. The family contains VIB-1 (Swiss:Q05534). VIB-1 is thought to regulate conidiation in Neurospora crassa and shares a region of similarity with PhoG, a possible phosphate-nonrepressible acid phosphatase in Aspergillus nidulans. vib-1 has been found not to be the structural gene for nonrepressible acid phosphatase; rather, it may regulate nonrepressible acid phosphatase activity [1].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
 1  12  0.41      1e+04      -3.0   0.0        46      95       373     420       360     431  0.84
 2  12  1.1e-31   2.9e-27    96.6   1.8         2     170       647     783       646     787  0.93
 3  12  3e-06     0.076      13.7   0.0       135     169       787     818       783     822  0.80
 4  12  3e-07     0.0077     17.0   0.0       135     170       823     855       819     855  0.80
 5  12  7.1e-06   0.18       12.5   0.0       137     170       856     886       856     889  0.81
 6  12  3.2e-07   0.0083     16.9   0.0       135     170       890     922       887     922  0.80
 7  12  7.1e-06   0.18       12.5   0.0       137     170       923     953       923     956  0.81
 8  12  1.8e-07   0.0046     17.7   0.0       135     170       957     989       953     992  0.79
 9  12  3.2e-07   0.0083     16.9   0.0       135     170       993    1025       990    1025  0.80
10  12  7.1e-06   0.18       12.5   0.0       137     170      1026    1056      1026    1059  0.81
11  12  9.2e-13   2.4e-08    34.9   0.1       135     180      1060    1102      1056    1102  0.82
12  12  1         2.6e+04    -4.3   0.1       126     154      1999    2025      1998    2031  0.74
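The Hmmscan Out rows above list twelve candidate domain hits with very different levels of support. The sketch below shows one way to parse them and keep only the better-supported ones. The `DomainHit` type, the `parse_rows` helper, and the `MAX_I_EVALUE` cutoff of 0.01 are assumptions made for illustration and are not thresholds used by the database.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical subset of the columns in one domain-hit row."""
    index: int
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int


# Data rows copied from the Hmmscan Out table (13 whitespace-separated fields:
# #, of, c-Evalue, i-Evalue, score, bias, hmm from/to, ali from/to,
# env from/to, acc).
ROWS = """
1 12 0.41 1e+04 -3.0 0.0 46 95 373 420 360 431 0.84
2 12 1.1e-31 2.9e-27 96.6 1.8 2 170 647 783 646 787 0.93
3 12 3e-06 0.076 13.7 0.0 135 169 787 818 783 822 0.80
4 12 3e-07 0.0077 17.0 0.0 135 170 823 855 819 855 0.80
5 12 7.1e-06 0.18 12.5 0.0 137 170 856 886 856 889 0.81
6 12 3.2e-07 0.0083 16.9 0.0 135 170 890 922 887 922 0.80
7 12 7.1e-06 0.18 12.5 0.0 137 170 923 953 923 956 0.81
8 12 1.8e-07 0.0046 17.7 0.0 135 170 957 989 953 992 0.79
9 12 3.2e-07 0.0083 16.9 0.0 135 170 993 1025 990 1025 0.80
10 12 7.1e-06 0.18 12.5 0.0 137 170 1026 1056 1026 1059 0.81
11 12 9.2e-13 2.4e-08 34.9 0.1 135 180 1060 1102 1056 1102 0.82
12 12 1 2.6e+04 -4.3 0.1 126 154 1999 2025 1998 2031 0.74
"""


def parse_rows(text: str) -> list[DomainHit]:
    """Parse whitespace-separated domain rows into DomainHit records."""
    hits = []
    for line in text.strip().splitlines():
        fields = line.split()
        if len(fields) != 13:
            raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
        hits.append(
            DomainHit(
                index=int(fields[0]),
                i_evalue=float(fields[3]),
                score=float(fields[4]),
                ali_from=int(fields[8]),
                ali_to=int(fields[9]),
            )
        )
    return hits


hits = parse_rows(ROWS)

# Assumed cutoff for illustration only; choose one suited to the analysis.
MAX_I_EVALUE = 0.01
kept = [h for h in hits if h.i_evalue <= MAX_I_EVALUE]
best = max(hits, key=lambda h: h.score)

print([h.index for h in kept])
print(best)
```

Under this assumed cutoff, the highest-scoring hit is the near full-length match to the profile, while the remaining retained hits are short matches to the C-terminal end of the model (hmm positions 135 to 180) that tile the repeated region of the protein.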
Sequence Information
- Coding Sequence
- ATGGAGTACAGCGCGTGGGGCATGGACGACCCTGGGGCGGGCCAGCACAGGAGGCAGACCAGGAGCCTGACAGGTAGTGGAGTACATGTGTGCACTTCCatagcaactaagctgcacatatatcaGCTAGGAGGCATGGAGTACAGCGCGTGGGGCATGGACGACCCTGGGGCGGGCCAGCACAGGAGGCAGACCAGGAGCCTGACAGGTAGTGGAGTACATGTGTGCACTTCCatagcaactaagctgcacatatatcaGCTAGGAGGCATGGAGTACAGCGCGTGGGGCATGGACGACCCTGGGGCGGGCCAGCACAGGAGGCAGACCAGGAGCCTGACAGGTAGTGGAGTACATGTGTGCACTTCCatagcaactaagctgcacatatatcaGCTAGGAGGCATGGAGTACAGCGCGTGGGGCATGGACGACCCTGGGGCGGGCCAGCACAGGAGGCAGACCAGGAGCCTGACAGGTAGTGGAGTACATGTGTGCACTTCCatagcaactaagctgcacatatatcaGCTAGGAGGCATGGAGTACAGCGCGTGGGGCATGGACGACCCTGGGGCGGGCCAGCACAGGAGGCAGACCAGGAGCCTGACAGGTAGTGGAGTACATGTGTGCACTTCCatagcaactaagctgcacatatatcaGCTAGGAGGCATGGAGTACAGCGCGTGGGGCATGGACGACCCTGGGGCGGGCCAGCACAGGAGGCAGACCAGGAGCCTGACAGGTAGTGGAGTACATGTGTGCACTTCCatagcaactaagctgcacatatatcaGCTAGGAGGCATGGAGTACAGCGCGTGGGGCATGGACGACCCTGGGGCGGGCCAGCACAGGAGGCAGACCAGGAGCCTGACAGGTAGTGGAGTACATGTGTGCACTTCCatagcaactaagctgcacatatatcaGCTAGGAGGCATGGAGTACAGCGCGTGGGGCATGGACGACCCTGGGGCGGGCCAGCACAGGAGGCAGACCAGGAGCCTGACAGGTCGTTCAGACTTCGTCGGCGGCATCGACAATGAGGCGCTGGACTTCAGCCAGCTGGAGGCCTTCATCAACAGCGAAGGAGGAGCAGCTGCGGGAGGATACTTCGCGGCCACCCTGGCTGCCACAGACCAGCAACCTGCCAAGAGATGTCACCTACCAGAGAGTCCGCCGGATTCTGGCTCGGAGCATCCCTACAGCCCCAGTGATGGCCATGAGCTGCCCATGCTGCCCCAGCACCAGCACCAAGTCCTCCTGTCCCCCCAGCATCACCAGCCTGTCCTCATGTCCCCTCAGCAGCAGGTCATTCTGTCTCCGCAGCAAGTCCTGCTGTCTCCGCAGCACGCCATGCTGTCtccccagcagcagcagcagcaacagctCCTGACACCCATCTACACGGAGCTGAAGACCAGCCACCACCTGGTGGGTGGGCTGCTGGCCGAGGGGTACAACATACACATGCCCACCATGGAGGGGCACGACGGACTCTTGGACATCAATGAAGCCAGCCGACTGCTCTTGAGAGCATCCGACCTTCACCAGGAGGTGGGCAGGCCGCTGTCCAGGCAGGAGATGCCTCCACCAGACTGCCAGCCCTACCCTGCACCGACCAGCGGCTCCAAGAAGAGGAAGCTGTCGCAGGACGCTGCCACACACGTCAAGTGTGAACCAGCCATTGTTGAGGACAGTGCCCCGCGCACGCATCAGCCCTCCCCCCATGTCCTGGTGACCCTGGGGGGAGCGCTGCCTTCAGGAGAGTCTCCCGAGCAGGCCGACCCTGCCTATCAGTGCATCCGCTTCCAGCCCTTCCAGCCCAGCGCCTGGCACACCCTCTGCGACCAGGCCCTCGAAAACCTGCCTGCGCCACATTACCGCGTGGACGCAGACAAGGGCTTCAACTTCTCCAACTCCGACGAGGCTTTCGTCTGCCAGAAGAAGAACCACTTCCAGGTGACCTGCCACGCGCAGCTGAGGGGCAAC
GCGCAGTTCGTCAAGACACAAGACGGCATCAAGAAAATCACCAGCTTCCATCTGCACTTTTACGGAGTCAAGGTTGAAGCTCCTAATCAGATGATCAAGGTGGAGCAAAGCCAGTCGGACAGATCAAAGAAGCCCTTCCATCCCGTGCTGGTGGATCTCCAGAACGGGCAAGTTACCAAGGTCACAGTGGGCCGCCTGCACTTCTCGGAGACCACCTGCAACAACATGAGGAAGAAGGGCAAGCCCAACCCCGACCAGAGGCACTTCCAGCTGGTGGTGGGGCTGCACGCCCACACTCCAGACAACCACTACCCAGTGGCTAGTCAGGCCAGCGAAAGGATCATCGTCAGGCTGAACCCTGACCAGAGGCACTTCCAGCTGGTGGTGGGGCTGCACGCCCACACTCCTGACAACCACTACCCGGTGGCTAGTCAGGCCAGCGAGAGGATCATCGTCATGCTGAACCCTGACCAGAGGCACTTCCAGCTGGTGGTGGGGCTGCACGCCCACACTCCTGACAACCACTACCCGGTGGCTAGTCAGGCCAGCGAGAGGATCATCGTCAGGAGGCACTTCCAGCTGGTGGTGGGGCTGCACGCCCACACTCCTGACAACCACTACCCGGTGGCTAGTCAGGCCAGCGAGAGGATCATCGTCAGGCTGAACCCTGACCAGAGGCACTTCCAGCTGGTGGTGGGGCTGCACGCCCACACTCCTGACAACCACTACCCGGTGGCTAGTCAGGCCAGCGAGAGGATCATCGTCAGGAGGCACTTCCAGCTGGTGGTGGGGCTGCACGCCCACACTCCTGACAACCACTACCCGGTGGCTAGTCAGGCCAGCGAGAGGATCATCGTCAGGCTGAACCCTGACCAGAGGCACTTCCAGCTGGTGGTGGGGCTGCACGCCCACACTCCTGACAACCACTACCCGGTGGCTAGTCAGGCCAGCGAGAGGATCATCGTCAGGCTGAACCCTGACCAGAGGCACTTCCAGCTGGTGGTGGGGCTGCACGCCCACACTCCTGACAACCACTACCCGGTTGCTAGTCAGGCCAGCGAGAGGATCATCGTCAGGAGGCACTTCCAGCTGGTGGTGGGGCTGCACGCCCACACTCCTGACAACCACTACCCGGTGGCTAGTCAGGCCAGCGAGAGGATCATCGTCAGGCTGAACCCTGACCAGAGGCACTTCCAGCTGGTGGTGGGGCTGCACGCCCACACTCCTGACAACCACTACCCGGTGGCTAGTCAGGCCAGCGAGAGGATCATCGTCAGGGCGTCAAATCCAGGACAGTTTGAGAGTGACGTGGATCTGTGCTGGCAGCGCGGTGTAACTCCTGACAGCATTTTCCACGCCGGGAGAATCGGCATCAACACAGACAGACCTGACGAGACCTGCGTCATCAATGGTAACCTGAAAATCACCGGACACATCATGAACCCCAGCGATGCCCGGGCCAAGCAGAACATCCAAGAGTGCGACACGGCGGAGCAGCTGAGGAATGTGCAGAAGATCCGGGTAGTCAGGTTAGTAGAAAACTGCTTCCtcatcgacaccttaaaggtaaaactgatcaTCAAAAAGTGTGACACGGCGGAGCAGCTGAGGAATGTGCAGAAGATCCGGGTAGTCAGGTACAGCTATGATCCTCAGTACGCCCTGCACTCTGGACTGAGTGGACCATCATCCATGGGCCCCCAGCTCCACGTGGACACTGGAGTCATCGCCCAGGAGGTGAAGAAGGTGCTGCCAGAAGCCGTGAAGGAGAGCGGGGACGTGGTGCTGCCCAACGGCGACACCATAGACAACTTCCTCGTAGTCAATAAGGACCGCATCTACATGGAGAACCTGGGTGCGGTGAAGGAGCTGTGCAAGGTCACGGACTACCTGGAGACCAGGATAGACCAGCTGGAGAAGGTCAACAGGAGGCTCTGCAAGATCAACTCTCTGCAGAGGAGCGGCAGCACCAGCACGGTGACTTCCCGCTTTTCCACCTC
CACCAGCAACTCTTCAGGCAAATCCCACAACAGCGACAGCGTCAGCGTCGTCGGCCACTTCTGCGACGTCGCCAACATTGCCGTGCCGTCGTGCTGTCGCCGCGTCCGCCAGCCCAACGGCAAACACTCCAACTACCACGCCAACAAGATGAACGCGTGCGTCAAATACTACACGGCGCGACACCACACGCTTTCAAAGGAAATCTTCGATCACACGCACAGTCACCGAAATTACGCAGAAAAAACCCAAAAGAAAGCGTCGACGGATAAAAAGCTGCTAGCGCTAGTGCAATCGGAACCCAAGAAGGTCATCGAGAGTTTTGAATGCGGAATCGAGGCCGGGAAGGATTGCTGTGTATCGGTGATTGGCGGGAAGGATTGCTGCTTGTCTGTGATTGGTGAAAAGGACTGCGAGGGAGCAGAGCCCTGGCGGAGACACGCGCACTGCTGTCTGGGACGGGCGCAGAACCCGCCGATTTGCTCAAACAAGTTTATACAGACTGTCATCGGGATTTTGATTATTGTCATGGCCATATGTCTCATCGGCATGGCAATGCTCTACTTCAAAGAGCACCAAAACAGAATACAGGAGGAGCAGAGGCGGATCTACAGCCTGTCCGCGGACCGGCGTTACACGGACGAAAAGGAAAACGGCCGATGGCACGACTACCAACACAACGTCAAGAACCGCGGCAAGACACGACCGCCAGACTGGGCGAGCAATGCGCAGACCACCAAATACAGTAACAGGGTGCCGCAAGATGGTGGCACGGGACAAAATGGTGGAGCGGGACAAAATGGCGGACCGTTTGAAAACGAAGACAACCTGCATAAGAAGATTGCCTCGCGCGAAGAAGAGAACTTGGCGGATTCGGGAAAGAACACTCTCATGACGACGGAGATGTCAAACTTTATAGAGAGCTTGAAAAAGTTGAAGACGGAAATTCAGAATCTGATCCCAGGATCTGTCACAAGCGCCATATATCGCTCTACTCCTGCCTACACAATACAAGATGCCGAGATGACAACTCCTGACAATTTGGACGTGGTCCAGACCATGGAAGCCAGAAGAACCGTGTCCAAGAAGAGCCATATCATTCCAGTAAACACGCAAGACCCTGAGGCCAATGATTCCAACAAGGACATTATAGATCTGGATGACAAGGAGGAGCCTATTAACGGACTGGGGGGGAGCAGCGGCAACACACCTGAACTGGAGGGCGGGCGCTCCAAGGTCGTGGCTCCCATCGGGAAACCGTTTGGCTGTCCGGCCGGCGACGACAAACACCCCGGGCGGGAAAGCATCGCGACGGAAAACTTGCAGGAGTGCCAGgTGTTTTGCTGCGCCAGTGTACCTTCGATATCGGAAAACAATGAAGAATCTGAACTGTCCGAAGAGCTGGATTCTGGAAAGCCTCTCCAAATCGAAGAAAACTTTACATCATCGTTGCCAGAGTCTAAAACAGAAGTGAAAAAACCCGACGACGACAAAAAAATCCTTCACAAAAACCCTCCAGACGTCACCAagaaaaatctacaaaaataccATCTTTCGTCGCAGTTTGAACAGAGCGATTTAAACTTCGAGGAAATAATAAGCGATTCAAAAGAAATTAATAATGTCTTACCTAACGACGACAACCTAACGACTAAACCTTCAAAATTATCAACAATCAGTCAAAAGCAATTGGACGAACCGTCAAATTTAACAAAAGAAAACTTAAACGATACGGTAATCAAGAATTTGACAGAAATTACAAAGCTCGTCAACGCAAATCAAAAGAATATCAACGCTACGTCGATTGCTGAGAATTTGACGAGCGAAAATGAAAAGGATGCATCGAAAATCACGTCGGATTTGGAATTGGCGAGGAATATGCTGCCGAAATACAACTGGACGTATTTGAGTGAGAGTGGAGCCAAAAAGTTCACTCCCAACGTGTCCAAGAATGATCATCTGTCCAAAAAGAGTGCGGCTAAAGGTCCGGCTCTGACGCACG
AGGAAAACAAGACGAGCGAGGGAGAGAAAAGATTCAAGAGGCAGGTCGCTTTGGTCGTTAACCAATCAAAGAAAGAGTCGTTGGCCGATTTGCAGGATGAAGAATTCGGAGAAGACAAGTGTGACAGCGTAGAGATTACGATGTACGGAGCGGACTTCTCCGCAGTGTTGGGCAGAGACAAGTTCTGTCGGGGCTCCAACCTGAACGTCACATACCACGTGCCACTCTCCAAGTACATGCCGGACACCTCGCTGACTCTGTTCTTCAACTCGAGTAAAGCCTTCATCTACTGCAGTGCCGCCAACTCGCAGTATTACAAGTCCGAGGATTGCCAGCTGGACGAGGACAAGACGGCAGAGTCCTCGCTGGCGGGCAGTGTTACCAGCGTGGAGGGCTTTGCCACGGCCCACAACAAGAACGGATCAGTGTTTGACATTAATGTCGCCAACGTCATCACTGCAGAGTTCCACTTCAGAACAAACATGGCTGCCAACTCTGAAACGGACATTTGCAAGATGCAAGACAACAAAATGGGGGAATTCCTACAGTACACCATCCACATATACAGACACTGCGATCTCTGA
- Protein Sequence
- MEYSAWGMDDPGAGQHRRQTRSLTGSGVHVCTSIATKLHIYQLGGMEYSAWGMDDPGAGQHRRQTRSLTGSGVHVCTSIATKLHIYQLGGMEYSAWGMDDPGAGQHRRQTRSLTGSGVHVCTSIATKLHIYQLGGMEYSAWGMDDPGAGQHRRQTRSLTGSGVHVCTSIATKLHIYQLGGMEYSAWGMDDPGAGQHRRQTRSLTGSGVHVCTSIATKLHIYQLGGMEYSAWGMDDPGAGQHRRQTRSLTGSGVHVCTSIATKLHIYQLGGMEYSAWGMDDPGAGQHRRQTRSLTGSGVHVCTSIATKLHIYQLGGMEYSAWGMDDPGAGQHRRQTRSLTGRSDFVGGIDNEALDFSQLEAFINSEGGAAAGGYFAATLAATDQQPAKRCHLPESPPDSGSEHPYSPSDGHELPMLPQHQHQVLLSPQHHQPVLMSPQQQVILSPQQVLLSPQHAMLSPQQQQQQQLLTPIYTELKTSHHLVGGLLAEGYNIHMPTMEGHDGLLDINEASRLLLRASDLHQEVGRPLSRQEMPPPDCQPYPAPTSGSKKRKLSQDAATHVKCEPAIVEDSAPRTHQPSPHVLVTLGGALPSGESPEQADPAYQCIRFQPFQPSAWHTLCDQALENLPAPHYRVDADKGFNFSNSDEAFVCQKKNHFQVTCHAQLRGNAQFVKTQDGIKKITSFHLHFYGVKVEAPNQMIKVEQSQSDRSKKPFHPVLVDLQNGQVTKVTVGRLHFSETTCNNMRKKGKPNPDQRHFQLVVGLHAHTPDNHYPVASQASERIIVRLNPDQRHFQLVVGLHAHTPDNHYPVASQASERIIVMLNPDQRHFQLVVGLHAHTPDNHYPVASQASERIIVRRHFQLVVGLHAHTPDNHYPVASQASERIIVRLNPDQRHFQLVVGLHAHTPDNHYPVASQASERIIVRRHFQLVVGLHAHTPDNHYPVASQASERIIVRLNPDQRHFQLVVGLHAHTPDNHYPVASQASERIIVRLNPDQRHFQLVVGLHAHTPDNHYPVASQASERIIVRRHFQLVVGLHAHTPDNHYPVASQASERIIVRLNPDQRHFQLVVGLHAHTPDNHYPVASQASERIIVRASNPGQFESDVDLCWQRGVTPDSIFHAGRIGINTDRPDETCVINGNLKITGHIMNPSDARAKQNIQECDTAEQLRNVQKIRVVRLVENCFLIDTLKVKLIIKKCDTAEQLRNVQKIRVVRYSYDPQYALHSGLSGPSSMGPQLHVDTGVIAQEVKKVLPEAVKESGDVVLPNGDTIDNFLVVNKDRIYMENLGAVKELCKVTDYLETRIDQLEKVNRRLCKINSLQRSGSTSTVTSRFSTSTSNSSGKSHNSDSVSVVGHFCDVANIAVPSCCRRVRQPNGKHSNYHANKMNACVKYYTARHHTLSKEIFDHTHSHRNYAEKTQKKASTDKKLLALVQSEPKKVIESFECGIEAGKDCCVSVIGGKDCCLSVIGEKDCEGAEPWRRHAHCCLGRAQNPPICSNKFIQTVIGILIIVMAICLIGMAMLYFKEHQNRIQEEQRRIYSLSADRRYTDEKENGRWHDYQHNVKNRGKTRPPDWASNAQTTKYSNRVPQDGGTGQNGGAGQNGGPFENEDNLHKKIASREEENLADSGKNTLMTTEMSNFIESLKKLKTEIQNLIPGSVTSAIYRSTPAYTIQDAEMTTPDNLDVVQTMEARRTVSKKSHIIPVNTQDPEANDSNKDIIDLDDKEEPINGLGGSSGNTPELEGGRSKVVAPIGKPFGCPAGDDKHPGRESIATENLQECQVFCCASVPSISENNEESELSEELDSGKPLQIEENFTSSLPESKTEVKKPDDDKKILHKNPPDVTKKNLQKYHLSSQFEQSDLNFEEIISDSKEINNVLPNDDNLTTKPSKLSTISQKQLDEPSNLTKENLNDTVIKNLTEITKLVNANQKNINATSIAENLTSENEKDASKITSDLELARNMLPKYNWTYLSESGAKKFTPNVSKNDHLSKKSAAKGPALT
HEENKTSEGEKRFKRQVALVVNQSKKESLADLQDEEFGEDKCDSVEITMYGADFSAVLGRDKFCRGSNLNVTYHVPLSKYMPDTSLTLFFNSSKAFIYCSAANSQYYKSEDCQLDEDKTAESSLAGSVTSVEGFATAHNKNGSVFDINVANVITAEFHFRTNMAANSETDICKMQDNKMGEFLQYTIHIYRHCDL
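The coding and protein sequences above can be checked against each other by translating the nucleotides codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly, and the `translate` helper is a hypothetical name. It uppercases the input first, since the coding sequence mixes upper- and lower-case bases.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered by
# base1, base2, base3 over the alphabet "TCAG". Assumed, not stated in the record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon.

    Codons containing anything other than A, C, G, or T become 'X'.
    A trailing partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()  # drop line breaks, normalize case
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[pos:pos + 3], "X") for pos in range(0, usable, 3)
    )


# First ten codons and last three codons of the coding sequence above.
print(translate("ATGGAGTACAGCGCGTGGGGCATGGACGAC"))
print(translate("GATCTCTGA"))
```

Translating the full coding sequence this way and stripping the terminal `*` should reproduce the protein sequence if the standard code applies. The two fragments shown correspond to the start (`MEYSAWGMDD`) and end (`DL` followed by a stop) of the listed protein.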
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -