Msca017745.1
Basic Information
- Insect
- Melanostoma scalare
- Gene Symbol
- ct
- Assembly
- GCA_949752695.1
- Location
- OX456984.1:230714392-230766117[+]
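The Location field packs the sequence accession, coordinates, and strand into a single string. A minimal sketch of splitting it apart follows. It assumes the layout `accession:start-end[strand]` and 1-based inclusive coordinates, neither of which is stated in the record. The function name `parse_location` is hypothetical.

```python
import re

# Assumed layout: <accession>:<start>-<end>[<strand>]
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text):
    """Split a location string into its parts (hypothetical helper)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    return {
        "seqid": match.group("seqid"),
        "start": start,
        "end": end,
        "strand": match.group("strand"),
        # Span length under the assumption of 1-based inclusive coordinates.
        "length": end - start + 1,
    }


loc = parse_location("OX456984.1:230714392-230766117[+]")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under those assumptions the locus spans 51,726 bp on the forward strand of OX456984.1.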
Transcription Factor Domain
- TF Family
- CUT
- Domain
- Homeobox|CUT
- PFAM
- PF02376
- TF Group
- Helix-turn-helix
- Description
- The CUT domain is a DNA-binding motif that can bind DNA independently or in cooperation with a homeodomain, which is often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein (e.g. Swiss:P10180).
- Hmmscan Out
-
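| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|----|----------|----------|-------|------|----------------|--------------|----------------|--------------|----------------|--------------|------|
| 1 | 3 | 1.3e-26 | 4.5e-22 | 79.7 | 0.0 | 3 | 71 | 776 | 845 | 774 | 851 | 0.94 |
| 2 | 3 | 5.2e-32 | 1.8e-27 | 97.0 | 0.0 | 3 | 77 | 1213 | 1288 | 1211 | 1290 | 0.96 |
| 3 | 3 | 6.9e-31 | 2.4e-26 | 93.4 | 0.0 | 3 | 77 | 1433 | 1508 | 1431 | 1510 | 0.96 |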
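The three CUT domain hits reported above can be loaded into records for further checks, such as the aligned length of each domain on the protein. The sketch below assumes each hit is a whitespace-separated row of 13 fields in the column order shown in the hmmscan output. The `DomainHit` type and `parse_hit` function are hypothetical names, not part of HMMER.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One per-domain row of the hmmscan output (hypothetical record type)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(row):
    """Parse one whitespace-separated domain row into a DomainHit."""
    fields = row.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}")
    # Columns 1-2 and 7-12 are integers; the rest are floats.
    int_cols = {0, 1, 6, 7, 8, 9, 10, 11}
    values = [int(v) if i in int_cols else float(v) for i, v in enumerate(fields)]
    return DomainHit(*values)


rows = [
    "1 3 1.3e-26 4.5e-22 79.7 0.0 3 71 776 845 774 851 0.94",
    "2 3 5.2e-32 1.8e-27 97.0 0.0 3 77 1213 1288 1211 1290 0.96",
    "3 3 6.9e-31 2.4e-26 93.4 0.0 3 77 1433 1508 1431 1510 0.96",
]
hits = [parse_hit(r) for r in rows]

# Aligned length on the protein, treating coordinates as inclusive.
ali_lengths = [h.ali_to - h.ali_from + 1 for h in hits]
best = max(hits, key=lambda h: h.score)
print(ali_lengths, best.index)
```

With inclusive coordinates the aligned regions cover 70, 76, and 76 residues, and the second domain has the highest bit score.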
Sequence Information
- Coding Sequence
- ATGGAAATCAACCGGAATCGATCGAGAGCTACATTAGCTGAAAAGGAAGTAAATACATTGAAGGAACAATTGTCAACAACTTCGACTGTAACCACAACATCGACGTCTGCGACAACAACAACTGCGTCTTCGGTACATTCGTCGCCATCATCTCCATCGTCGGTGGTGGTGAGCACAACGCCTATACCAAATATCAAATTGTCACCAAcgataaacaacaacaatagtttcaataataataacaaattgaaCAGTGTTGCCATATCACAGCATTTGCATCAGAACGGAAATTTATTGGCAGCCACACCGTCACCATCGCCGCCATTATCAGAAAACGGTAGCACACTCAACGGATGCCTTAGTTCGGCAAAGCTACTATTCAATCATAATCAAATAATGGACACAACTGAATCGGGTACAGCAACTGATCTATCTGCTGttgtaacaacaacaacaacacccacaaacacaaacacaaacacatcAATAACAACGACAGATGCAAGTAAACTTATAACGGCAATGACCGCAGAAATGAATCTCCACCAAGACTCTAGCGAATCGGCAACGCTGGCATGTAATCAGATGTCAAATCGATCGCCACACAGTCCAATTCTACTGCAGTCGGCAGAGGGCAGCAGTGGCAATACAGAGTCAACacgaaacaacaataataacaataacaatagcaCTAGTAACAATAATAATACCAATACAAATAGTAATAGCAGTAGTAATTGTAATAATAATTTAGCTGATGAATTGGCTGCCAAAGAGAAAGAGATTAAAGCACTTATCGACGAGGTTAACCGACTGCAAGCCCTCCACGAAACCCACATTGTACAAATACAAAGACTTGAAGAGAATCTAGAGGCGAAACGTCAGCACATTGTGAGACTTGAGGCTCGTCTACAGCAACAGCAATTTAAAAGTCCTGACATCAATATCAACACCAATAATAGCACCAGCATCTCTACTGCTGTCGTACCACCAAAACAAAGTCCACATGAGGATCAGCAGGAGGAGGAGGCGGAAGTGGCGATCGTATCGGAGGAGCCATCCCCGGTGGTGGTGCCCGATGAgattattcaaataaaaaaggaaaaccaaGCACCAACAACTCTTGACATCACAGGCCAGTCGGTGGCAGTTGTGGCTGCGGCGGCTGCGGCTGCAGCGGCCGTTGCCAACGACCCCAAACAGTTCCAAGAACTTTTGCTTGAGAGAACTAAAGCACTGGCGGCTGCAGCTCATCATGCCTCTACTGAGGCACACAAGACCGAAGGAATTTCCAATGACACGACCCCGGTGGGTGACTTAGACACAAACAAAGCTTCTGTCAGCGAGCCTGCAGCAGCATGTGAGGTCCCTACTGCTGAGTGTAACGATGCGAGCCTGCATCAACAGCAGCAAGAAGTTCGACCGCCATCGGAGTCGTCGTCGTCGAATTTCCAAGGTGTCGATCAATTCGGTTCGTTGCTTGGCGACGAGCTGGTGAATTCGTATTGGCGTCGTGGCTTCGATCACCTGCCATTAGCTCCACCACTCCCACCATTGCCACCTCAACAGCCGTGTCCAGCTGCAACAACAGCTGCCACCGTTTCGATAGCACACCCACCACTACCATCAATGgcatcagcaacaacaaatcCAACTTCTAAAACGCCAACATCgattttccaaaatttcccaTCAATTCTTTCGTGTCGATCGCCTCCACCACTCCCACTATCGGTCACTGGGCCATCAACCACAACTCTTTTGAGTCAGCTAGTCAATTCAACCTTCTCATCGAATACAACTCCTTCTCTTCGAACAGATTCCCATTCCTCGTCGACTACGACTACAGGCGCCCACCTACAACAGACTTCGCATCATCACCACcagcatcatcatcagcaacagcaTCAGCAGCAGCATTCGCAGCATCCTGCCCTTCAGCACCAccatcagcagcagcaacaacagcagcatcaCCAGGACTTCGAAAAAGCCAAT
AGCCATCACGATTCCTCGTCAACGGCCAGTACGCCCAATTTACATGACCAGCGTTCCAACGACGGTCATATGAACGGGTCAAAGTCAGTCAAAGCTGAGGATAACAACAACTCTTTGGGCTCACTAGCTCATCTTCCACCTGCATTCCTGCCGGGATTGCCATTCCAATTCCAAGATCGTGGCCACTTCCGTTTTGCCGAAGACATCCAGCTGCCGCCTGGAGCTTCTATGGCCGGCCGACTTGGAGAGTCTCTCATTCCCAAGGGCGATCCTATGGAGGCCAAATTGCAGGAAATGCTACGCTATAACATGGACAAATATGCCAATCAAAGTTTAGACACACTCCACATATCACGACGCGTTCGCGAACTCCTCTCCGTGCACAACATCGGACAGAGGCTGTTCGCCAAATACATCCTTGGACTGTCACAAGGCACCGTCTCCGAGTTGCTGTCCAAACCAAAACCATGGGATAAACTCACTGAAAAAGGCCGTGACAGTTACCGCAAAATGCACGCCTGGGGATGTGATGATAACGCTGTGATGTTGCTTAAGTCTCTGATACCAAAAAAAGATTCGGGCATTCCACAGTATGGTCGCAATGAAGACTCCATGTCGGATGACCGCATTGCCCATATTTTGAATGAAGCTTCCAGCCTTATGAAGACACCGCAAGGACCAGGTGCACAGTTGCAGCAGCAATTGCAGCATGAGGCTCAGCGTCAGCATCACCAGCAATCTTCCTCCCATGAGGACGAGCAGAGTAATGACGACTCCAAGTCACCGCACGCATCATGTACTTCGCCATTCTACAAAGAGTCTCCTgtgcaacagcagcaacagcaacaacaacagcaacaacagcagcaacagcaacaacaacaacagcagcagcaacaacaacgcCAAGAGGCACAGCGTCGAAGTTTGAAAACAGATGAATTGTCCCAGGACAAGATGGCGAGAATATACCAAGAAGAGCTTACTAAGCTCATGACGCGCACTCCACGCGATGCGTTCCCAAgTTTTTTGTTTCCACACTTTTTCAGTGCGGCTGGTGGCCTTCCGCCCACCAATAACGCAATGCCACCAGATGACAACATTCGAATGACTTTGGAAGCCTACCACCGTGAGATGGCTAAGTTACAGCAAAACAGCAGCGGAAATATTCCACACTTTCCGAACTTCTCGAGCTTGATGGCGTTGCAGCAGCAAGTGCTTAATGGCGCTCAGGATCTATCCGTCCCGAAGGAAAATGAAATTAAGGTTAACGGCCAACGACACTCCATGGAAAGCTGCGATTCTAATCAAACCAAAGAAGCCAACGACGAGGCGGGACATCGTCACTCAAGTGCATTTAGCTTAGTTCGTCTAAAGCCCGATCCGGTGCTGGGACCAAACTCGAACTCAAATCCAAATACCAATTCTGCTGCTCCCAGTCCACTTAGCAATTCCATCCTTCCACCAGCTATGACGCCAACTGAGGACTTTTCCGCAGCTGCCAGCCCTCTACAGCGAATGGCATCGATCACAAACTCATTGATCACACAGCCTCCAGTAACGCCTCATCATACACAACCACAAAGACCCACTAAAGCGGTATTGCCGCCCATCACTCAGCAGCAGTTCGACATGTTTAACAATTTAAACACCGAAGACATTGTTCGCCGGGTGAAGGAAGCTTTGTCACAATACTCCATAAGCCAGAGGCTCTTTGGTGAGTCCGTGCTCGGCCTATCACAAGGATCTGTGTCGGACCTGCTGGCGCGACCCAAGCCATGGCATATGCTCACTCAAAAGGGCCGTGAACCTTTCATTCGTATGAAAATGTTTCTGGAGGACGAGAATGCTGTGCACAAGCTCGTAGCCAGCCAATACAAAATCGCCCCCGAAAAGTTAATGCGCACTGGCAGCTATAGCGGCAGTCCGCAAATACCACAAGGACTCGCGAACAAAATGAACAGCATTCCAATGCAAAAGATGATGAACGAACTGAAAAT
GCAGGAACCCCAAGCCCAGCACATCATGCAGCAAATGCAAGCTGCATCCGCAATGGCTGCCGCTGCTGCTGCGGGACACATGCAAAATGCCCAACACCAGGGTATGTTGCTGACTACTCCAGGACTTCCACCGCAGCATGCGATTAGTTTGCAGCAAGCTGCCGCcaatgctgctgctgctgctaacGAGAAAAAACCCATGATGATGCCAATTCACTCGCCACATCAAGCCAACGCTATGCGAAGCAGCATGCATCAGCATATGTCGCCCACGGTGTACGAAATGGCGGCTCTTACTCAAGACTTGGATACCCAAGTCATCACGACAAAGATTAAGGAAGCCCTTCTTGCGAACAACATTGGCCAAAAGATCTTCGGCGAAGCAGTTTTAGGTCTGTCGCAGGGTTCTGTTTCGGAACTGCTTTCGAAGCCAAAGCCTTGGCATATGCTCAGCATTAAAGGCCGAGAGCCTTTCATTCGCATGCAATTGTGGCTTTCTGATGCAAACAATGTCGAACGCTTGCAGGTACTTAAGAACGAGCGTCGTGAAGCCAGCAAACGTCGTCGATCAACCGGACCCAATCAACAAGATAACAGTTCCGACACAAGTAGCAATGATACGGCTGACTTCTACACCAGCAGTCCCGGACCAGGTTCCGTAGGGTCTGTGGGTGCACCCCCAAACAAAAAACAGCGTGTTCTCTTCTCCGAGGAACAAAAAGAAGCTCTTCGTCTGGCTTTTGCACTTGATCCATACCCAAACGTTGGAACTATTGAGTTTCTAGCCAACGAACTCGGCCTGGCAACGCGAACAATCACCAACTGGTTCCACAACCATCGAATGCGCCTTAAGCAGCAGGTGCCGCATGGCCAGCCAACCGAACCGGTACCAAGCCGGGAGAATACCAATGCAACGCCCTTCGACCCAGTCCAATTCCGTATACTCCTCAATCAAAGACTGATGGAATTGCAAAAGGAGCGCATGGGTCTAAGTGGCGCACCTATTCCTTATCCACCGTACTTTGCCGCCAATCCAAATCTAGCGGCGCTTATAGGCCGTGGCCTTCTTCCCGGTGGAGCTGAACCCGACTTGCAAGCACTCAACAATGCCTTCAAGGAACAAATGAGTGGCCTCGATCTGTCTTTGAAGAGAGAAAGAGGTGATGAGGATTACGAAGATGATGGCGAGGGCGATAGTCACTTATCAGACAACGAGTCAATCAACGGAAGCGACGAAAAGTCAGAAATGAGTGAGACCCCAAGATCCTCTCTATACGGATCGAGCTTGCAATCTCGAACATCGCGACGTAAGCCTGCTGCACCGCAATGGGTTAACCCCGACTGGCAAGGAGATGGCAAATCCGGCCTAATGCCACCACTAGGGCCAGGTGGACCCACTGATAAGGAGGTAATCATAAATGGAGTTTGTGTGATGCAGCAACAAGCTGCTGATTACAGCCGCAAGTCGGGAACCGCCGATGACGACGAGGAAGGCGCCTCCGACGCCAACGACGAAGAATCATCCACTCACCATCGTACCCAATCCTCTGATTGTGATGACACTCCCGGAGAAATCGATCAGCGGTTCATGGAACCCGAAGTTCGAATCAAGCAAGAAGAAGACTGTGACAGCAACGATAGCCATCAAGAGGATTCCGAATCAAGAAAACCCACACACCACCGGGACGACGAGGACGACACTCCCGGTGCATGGAATTATTAA
- Protein Sequence
- MEINRNRSRATLAEKEVNTLKEQLSTTSTVTTTSTSATTTTASSVHSSPSSPSSVVVSTTPIPNIKLSPTINNNNSFNNNNKLNSVAISQHLHQNGNLLAATPSPSPPLSENGSTLNGCLSSAKLLFNHNQIMDTTESGTATDLSAVVTTTTTPTNTNTNTSITTTDASKLITAMTAEMNLHQDSSESATLACNQMSNRSPHSPILLQSAEGSSGNTESTRNNNNNNNNSTSNNNNTNTNSNSSSNCNNNLADELAAKEKEIKALIDEVNRLQALHETHIVQIQRLEENLEAKRQHIVRLEARLQQQQFKSPDININTNNSTSISTAVVPPKQSPHEDQQEEEAEVAIVSEEPSPVVVPDEIIQIKKENQAPTTLDITGQSVAVVAAAAAAAAAVANDPKQFQELLLERTKALAAAAHHASTEAHKTEGISNDTTPVGDLDTNKASVSEPAAACEVPTAECNDASLHQQQQEVRPPSESSSSNFQGVDQFGSLLGDELVNSYWRRGFDHLPLAPPLPPLPPQQPCPAATTAATVSIAHPPLPSMASATTNPTSKTPTSIFQNFPSILSCRSPPPLPLSVTGPSTTTLLSQLVNSTFSSNTTPSLRTDSHSSSTTTTGAHLQQTSHHHHQHHHQQQHQQQHSQHPALQHHHQQQQQQQHHQDFEKANSHHDSSSTASTPNLHDQRSNDGHMNGSKSVKAEDNNNSLGSLAHLPPAFLPGLPFQFQDRGHFRFAEDIQLPPGASMAGRLGESLIPKGDPMEAKLQEMLRYNMDKYANQSLDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYRKMHAWGCDDNAVMLLKSLIPKKDSGIPQYGRNEDSMSDDRIAHILNEASSLMKTPQGPGAQLQQQLQHEAQRQHHQQSSSHEDEQSNDDSKSPHASCTSPFYKESPVQQQQQQQQQQQQQQQQQQQQQQQQQRQEAQRRSLKTDELSQDKMARIYQEELTKLMTRTPRDAFPSFLFPHFFSAAGGLPPTNNAMPPDDNIRMTLEAYHREMAKLQQNSSGNIPHFPNFSSLMALQQQVLNGAQDLSVPKENEIKVNGQRHSMESCDSNQTKEANDEAGHRHSSAFSLVRLKPDPVLGPNSNSNPNTNSAAPSPLSNSILPPAMTPTEDFSAAASPLQRMASITNSLITQPPVTPHHTQPQRPTKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVSDLLARPKPWHMLTQKGREPFIRMKMFLEDENAVHKLVASQYKIAPEKLMRTGSYSGSPQIPQGLANKMNSIPMQKMMNELKMQEPQAQHIMQQMQAASAMAAAAAAGHMQNAQHQGMLLTTPGLPPQHAISLQQAAANAAAAANEKKPMMMPIHSPHQANAMRSSMHQHMSPTVYEMAALTQDLDTQVITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFIRMQLWLSDANNVERLQVLKNERREASKRRRSTGPNQQDNSSDTSSNDTADFYTSSPGPGSVGSVGAPPNKKQRVLFSEEQKEALRLAFALDPYPNVGTIEFLANELGLATRTITNWFHNHRMRLKQQVPHGQPTEPVPSRENTNATPFDPVQFRILLNQRLMELQKERMGLSGAPIPYPPYFAANPNLAALIGRGLLPGGAEPDLQALNNAFKEQMSGLDLSLKRERGDEDYEDDGEGDSHLSDNESINGSDEKSEMSETPRSSLYGSSLQSRTSRRKPAAPQWVNPDWQGDGKSGLMPPLGPGGPTDKEVIINGVCVMQQQAADYSRKSGTADDDEEGASDANDEESSTHHRTQSSDCDDTPGEIDQRFMEPEVRIKQEEDCDSNDSHQEDSESRKPTHHRDDEDDTPGAWNY
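A quick consistency check between the coding and protein sequences is to translate the first and last few codons. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state, and upper-cases the input because the coding sequence mixes upper- and lower-case letters. The `translate` function is a hypothetical helper.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    seq = cds.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons with characters outside T/C/A/G are reported as 'X'.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First six codons and last five codons of the coding sequence above.
print(translate("ATGGAAATCAACCGGAAT"))
print(translate("GCATGGAATTATTAA"))
```

The two fragments translate to `MEINRN` and `AWNY*`, which agree with the start and end of the protein sequence listed in the record.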
Similar Transcription Factors
Sequences clustered by similarity using MMseqs2
- 100% Identity
- iTF_00976344; iTF_01541217; iTF_01541334; iTF_00693572; iTF_00693660; iTF_00663949; iTF_00663842; iTF_01396384; iTF_01396276; iTF_00315045; iTF_00314935; iTF_00314056; iTF_00314160; iTF_00427017; iTF_00426907; iTF_00694476; iTF_00694382; iTF_00426140; iTF_00426232; iTF_01211689; iTF_01211800; iTF_00724554; iTF_00724665; iTF_00240451; iTF_00240553; iTF_01116235; iTF_01116341; iTF_00688500; iTF_00688392; iTF_00991548; iTF_00991646; iTF_01356746; iTF_01356652; iTF_01223275; iTF_01223176; iTF_01044525; iTF_01044612; iTF_00310798; iTF_00310916; iTF_00315726; iTF_00315833; iTF_00187951; iTF_00187837; iTF_00334863; iTF_00334757; iTF_00984016; iTF_00984118; iTF_00893649; iTF_00893748; iTF_00665523; iTF_00665422; iTF_00311701; iTF_00311591; iTF_01520758; iTF_01520847; iTF_01521666; iTF_01521769; iTF_01300728; iTF_01300601; iTF_01541948; iTF_01542097; iTF_00310141; iTF_00310009; iTF_01299807; iTF_01299898;
- 90% Identity
- iTF_00976344;
- 80% Identity
- iTF_00976344;