Esim021895.3
Basic Information
- Insect
- Euproctis similis
- Gene Symbol
- myt1_1
- Assembly
- GCA_905147225.1
- Location
- LR990116.1:16107060-16126592[+]
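The location string above packs the sequence accession, coordinates, and strand into one token. A minimal sketch of how it could be split into fields follows. The `Locus` type and `parse_location` helper are hypothetical names introduced here for illustration, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for a 'seqid:start-end[strand]' location."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Locus:
    """Split a location string such as 'LR990116.1:16107060-16126592[+]'."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


locus = parse_location("LR990116.1:16107060-16126592[+]")
print(locus.seqid, locus.start, locus.end, locus.strand, locus.span)
```

Under the inclusive-coordinate assumption the locus covers 19,533 bases on the forward strand, which is much longer than the coding sequence and so presumably includes introns.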
Transcription Factor Domain
- TF Family
- zf-C2HC
- Domain
- zf-C2HC domain
- PFAM
- PF01530
- TF Group
- Zinc-Coordinating Group
- Description
- This is a DNA-binding zinc finger domain.
- Hmmscan Out
  #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
  1  6   1.1e-16   9.3e-13   48.9   8.9   1         28      305       332     305       333     0.97
  2  6   7.8e-17   6.8e-13   49.3   7.8   1         29      349       377     349       377     0.97
  3  6   9e-18     7.9e-14   52.3   2.3   1         29      797       825     797       825     0.97
  4  6   9.7e-19   8.5e-15   55.4   8.3   1         29      838       866     838       866     0.98
  5  6   5.3e-17   4.6e-13   49.9   5.2   1         29      883       911     883       911     0.97
  6  6   4.7e-18   4.1e-14   53.3   5.2   1         29      945       973     945       973     0.97
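The six zf-C2HC hits above are easier to work with once each row is turned into a record. The sketch below reads the 13 whitespace-separated fields of each row, keeps the alignment coordinates and scores, and reports the length of each aligned region and the best-scoring hit. The `DomainHit` class and `parse_hits` function are illustrative names, not part of HMMER or of this database.

```python
from dataclasses import dataclass

# Domain rows copied from the hmmscan output of this record.
ROWS = """\
1 6 1.1e-16 9.3e-13 48.9 8.9 1 28 305 332 305 333 0.97
2 6 7.8e-17 6.8e-13 49.3 7.8 1 29 349 377 349 377 0.97
3 6 9e-18 7.9e-14 52.3 2.3 1 29 797 825 797 825 0.97
4 6 9.7e-19 8.5e-15 55.4 8.3 1 29 838 866 838 866 0.98
5 6 5.3e-17 4.6e-13 49.9 5.2 1 29 883 911 883 911 0.97
6 6 4.7e-18 4.1e-14 53.3 5.2 1 29 945 973 945 973 0.97
"""


@dataclass(frozen=True)
class DomainHit:
    """Hypothetical record holding a subset of one domain row."""
    index: int
    c_evalue: float
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int

    @property
    def ali_length(self) -> int:
        # Alignment coordinates are treated as inclusive protein positions.
        return self.ali_to - self.ali_from + 1


def parse_hits(text: str) -> list[DomainHit]:
    """Parse rows with 13 whitespace-separated fields; skip anything else."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13:
            continue
        hits.append(
            DomainHit(
                index=int(fields[0]),
                c_evalue=float(fields[2]),
                i_evalue=float(fields[3]),
                score=float(fields[4]),
                ali_from=int(fields[8]),
                ali_to=int(fields[9]),
            )
        )
    return hits


hits = parse_hits(ROWS)
best = max(hits, key=lambda hit: hit.score)
for hit in hits:
    print(hit.index, hit.ali_from, hit.ali_to, hit.ali_length, hit.score)
print("best-scoring domain:", best.index)
```

The hits fall into two groups along the protein: two domains at positions 305-377 and four at positions 797-973.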
Sequence Information
- Coding Sequence
- ATGGAGGAAGAGGAGATTGTAGGCAAAAGGAGACGACGACTTTTAGACGATGAGAGACGAGATGAAGTCGACCGACGTCGAGCTATACCAATGCATAGACCTCAAGCCGCAAAACACATAGTCGACGAAGATGAAACTCTGCTTCGTGATACGCAGGCTGCTTTCAAAAATCTCTCTAGTAACTGGAACCGAGATCCCGGAGCTCAATGTGCTGATGAAAACGATGAACATAGTGGATTTGAAAATCTTTTCGAGGATAAACAGTCTGATAAATTATTACAATCATCTCCATCGCCATCGTCAGAGAGTGGTACTTCTCAAAAAAGTTACATCATGGGCAATCACATAGACCATAACGTAAATGGTAATTCTCAAATGGAGAACAAACGGATCCGATCAGAGAGAGAAATAAGTGCCGATTACGAAAGAAATAGCAGACATTCAAGTCACTATGAAACGccaaattttgatgaactgaTAGACTCGTCTTCCAATGATTTAGAAATAGACATATCTGATAGATGCGACGATAAATATGATGACAGAGATAATAAGCAAAAACGTAAAAACGAAATCAATTCTGTCAACAAACAATCTTTGTACAACGCTTACAAAGCGGCAACTGCTGGACTACCGTATTCTACACAATCTGCGTTCAAACCTCCTGCTGAAGTTAAACATAGACTTCACAATACTTCCCTTCCTAGTGAACCTTTTGGAGGGTATTCAAATGATCATGAGAAGAGTTCCTTTCATAAAGGAACGAAACAGTACACTGTATTACAACCGGCGGGAGTTGGGTCTCGGGCAGCCACGGCCCTGCAAGAGGCCAGGACTGTACCTTCAGCACAACCATCGCGCGACTCTCGTCCTACCAACCCGCTGTCTCCGCCGGCTTCTCGAGAAGGCAACAAGTGTCCTACACCCGGGTGCAATGGGCAGGGCCACGTCACCGGCCTATACACTCACCATAGGAGTCTGTCAGGATGTCCAAGAAAGGATAAAGTCACACCTGAGATATTGGCGTTGCACGAGACGATATTGAAGTGCCCCACACCGGGGTGCAACGGGCGCGGTCACGTCAGCTCCAATCGAAGCACTCACCGATCCCTGTCAGGTTGCCCCACTGCTGCTGCACGGAAAGCAGCTGCCAGATCACAACGCTCACGACCAGTCAACGTAACCCACTCGTCACTGCCGGCTCCCTCGACCGCGGCATCCGAGAGTGCTGCTAGTTCGACGACCAGCGTGTCAGTGCGCGAGGCATCGCCTCGCAGTCCAGGTGTGAAACGCGAGTCGAACGAGCTGCTGGTGCCGAAGCGAGAGGCGGCCGAGCCCGAGCGCGACTCCCCCACCATGGAGACGCGCCACGCCGGCTACGGCGCGCCGCCAGACCAACGGTCCCCATACGAGAGACCACCAGACGATCATGTGCGATCATACAGCCAAATGAACGAGACGCGCTACGGCTACGAGGCGCGGTGCTACGAAGGTGCACCTGCATTTGAACGTTACGACCCGCAATGTCCACAGCGGCCTTATGGCTGGGACGAGGAGCGCTACCACGACCCCCACCTACCGACACCAATGAAAACTGATCAATCCGAACAGGAGACCAGTCCTGGACCTATATATCCTAGACCAATGTACCACTACGAAGCTGGTAGCGGAGTAGGTGCCGTCAGCGCGATGGGTCCCGGCGTCCCACCCGGGTTCTCTGCGATCAACCTTTCGGTGAAGATAGCCGCGGCGCAGGCTCAACGACCGCGCACTCCGACCCCAAGAGACCCGCGCGACCCTCGCCCCGCCATAGATCTTTCCACTTCCAGTGGTAGTCCACAGGGTCCATATGCATCGCCGGTGTACACAAGCGCCGGTGGTGGTGGCGGGGGCGGAGCCCGAGGTAGTCCGCAGCCGGGAGCTTCGCCCCAGCTGACAGCCAGCCCCCAGGTGCCAAGTCCTCAGGGACAAACCCTCGACCTTAGCGTGTCCCGTTTA
CCACATAGTCGCAGTTTTCCTGGGGGAGTATCGTACAGTCGAGAATCAACTCCTGACAGCGGCGGCAGCCATCCTTACCTCGAAGCCTATCACCGGGACACAGCAAGCTATGGCGGCGTGAGTCCACATCCTGTAGCAGGGTATGGATTAGGACAACCAGATTATGCGGCAGCAGCTGCAGCAGCCGGTTATGGCGGTTACCAGTACCAGTGTGGCGCGTACCCCCCACCACCAGCATACCCACCCCACGCGCCACCTTACTCACCACCCTGTTATATGCCGCCACCTCACGCACCACATGACAAGCCCAAGGACAGcagCTATCATCGCGAACGCGACGACTTTTATGGGAAACTGTGTTACCGGATACGAGAATCGAGGGAGCTGATCCAGTGTCCGGTGGAGAACTGCGACGGCACCGGTCACATATCTGGAAACTTTGCTACACACCGCAGTCTGTCCGGATGTCCGCGCGCTGATCgatcgcagctccaaccacacTCGCAAGAGCTCAAGTGTCCTACACCAGGTTGTGACGGATCCGGTCACGTAACTGGAAACTACTCATCGCACCGTTCACTGTCCGGATGTCCAAGAGCCAACAAGCCCAAGAGCAAACCCAGAGACGGACAGGACTCTGAACCACTGAGATGCCCTATACCGGGCTGTGATGGATCTGGACATGCTACAGGGAAATTCTTGTCACATCGAAGTGCCTCGGGATGCCCGATAGCTAATCGAAATAAAATGCGAGTGCTAGAAAGTGGTGGTACCGTGGAACAGCATAAAGCAGCGGTAGCGGCAGCAGCTTCAGCAATTAAATTCGACGGTGTGAACTGTCCAACACCTGGATGCGATGGATCTGGACATCTGAATGGATCTTTTTTAACCCACCGGTCACTTTCCGGCTGTCCAATAGCTGGTGCGGCAACACCGACGCCACAACCGAAAAAACCAAAATACCCGGATGATATCACACCTTTATACCCTAAACCCGGTTATACAGGTATGGAAATGAACATGCAGACTGGCAACAGCGAAGATTTAATGACCTTGGAACAGGAGATCACTGAGCTGCAGCGCGAGAACGCTCGCGTCGAATCACAGATGATACGACTCAAATCCGACATTAACGCAATGGAGACACACCTTTCACATGGAGAAcggGAAACTCAGACCATGATCCAGCGTAACAACAATTTGAATGAATATTATGAGAGTCTACGTAATAATGTGATCACATTGTTGGAGCATGTAAGAATTCCGGGAGGTGGAGAGAAACCGGCCCACGACAACTTCGACTCATACCTGACCAAGTTACAGACGCTTTGCTCCCCAGATGGCTACTGCCCAGACGAGAATCGGCCAATATACGAAACAGTCAAAAACGCGCTTCAGGATTTCACAGTGCTACCTACACCCATTTAA
- Protein Sequence
- MEEEEIVGKRRRRLLDDERRDEVDRRRAIPMHRPQAAKHIVDEDETLLRDTQAAFKNLSSNWNRDPGAQCADENDEHSGFENLFEDKQSDKLLQSSPSPSSESGTSQKSYIMGNHIDHNVNGNSQMENKRIRSEREISADYERNSRHSSHYETPNFDELIDSSSNDLEIDISDRCDDKYDDRDNKQKRKNEINSVNKQSLYNAYKAATAGLPYSTQSAFKPPAEVKHRLHNTSLPSEPFGGYSNDHEKSSFHKGTKQYTVLQPAGVGSRAATALQEARTVPSAQPSRDSRPTNPLSPPASREGNKCPTPGCNGQGHVTGLYTHHRSLSGCPRKDKVTPEILALHETILKCPTPGCNGRGHVSSNRSTHRSLSGCPTAAARKAAARSQRSRPVNVTHSSLPAPSTAASESAASSTTSVSVREASPRSPGVKRESNELLVPKREAAEPERDSPTMETRHAGYGAPPDQRSPYERPPDDHVRSYSQMNETRYGYEARCYEGAPAFERYDPQCPQRPYGWDEERYHDPHLPTPMKTDQSEQETSPGPIYPRPMYHYEAGSGVGAVSAMGPGVPPGFSAINLSVKIAAAQAQRPRTPTPRDPRDPRPAIDLSTSSGSPQGPYASPVYTSAGGGGGGGARGSPQPGASPQLTASPQVPSPQGQTLDLSVSRLPHSRSFPGGVSYSRESTPDSGGSHPYLEAYHRDTASYGGVSPHPVAGYGLGQPDYAAAAAAAGYGGYQYQCGAYPPPPAYPPHAPPYSPPCYMPPPHAPHDKPKDSSYHRERDDFYGKLCYRIRESRELIQCPVENCDGTGHISGNFATHRSLSGCPRADRSQLQPHSQELKCPTPGCDGSGHVTGNYSSHRSLSGCPRANKPKSKPRDGQDSEPLRCPIPGCDGSGHATGKFLSHRSASGCPIANRNKMRVLESGGTVEQHKAAVAAAASAIKFDGVNCPTPGCDGSGHLNGSFLTHRSLSGCPIAGAATPTPQPKKPKYPDDITPLYPKPGYTGMEMNMQTGNSEDLMTLEQEITELQRENARVESQMIRLKSDINAMETHLSHGERETQTMIQRNNNLNEYYESLRNNVITLLEHVRIPGGGEKPAHDNFDSYLTKLQTLCSPDGYCPDENRPIYETVKNALQDFTVLPTPI*
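The protein sequence above should be the direct translation of the coding sequence. A minimal sketch for checking that relationship is given below. It assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly, and it upper-cases the input because the coding sequence contains lowercase stretches. The `translate` function is an illustrative helper, shown here on the first and last codons of the coding sequence only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
# Assumption: this record uses the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    sequence = "".join(cds.split()).upper()  # drop line breaks, ignore case
    if len(sequence) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    # Codons with ambiguous bases (e.g. N) are reported as 'X'.
    return "".join(
        CODON_TABLE.get(sequence[i:i + 3], "X")
        for i in range(0, len(sequence), 3)
    )


# First ten codons and last three codons of the coding sequence above.
print(translate("ATGGAGGAAGAGGAGATTGTAGGCAAAAGG"))
print(translate("CCCATTTAA"))
```

The two fragments translate to `MEEEEIVGKR` and `PI*`, matching the start and end of the listed protein sequence. Passing the full coding sequence to the same function would allow the whole protein to be compared.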
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00831772
- 90% Identity
- -
- 80% Identity
- -