Sfus003252.1
Basic Information
- Insect
- Sylvicola fuscatus
- Gene Symbol
- ct
- Assembly
- GCA_026546915.1
- Location
- JANSTS010011584.1:48590-101601[+]
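The Location value packs a contig accession, a coordinate pair, and a strand into one string. Below is a minimal sketch of splitting it, assuming the `contig:start-end[strand]` layout seen in this record holds for other records and that both coordinates are inclusive (the record does not state this). The `Location` tuple and `parse_location` helper are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: <contig>:<start>-<end>[<strand>]
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("JANSTS010011584.1:48590-101601[+]")
# Genomic span, assuming both coordinates are inclusive.
span = loc.end - loc.start + 1
print(loc, span)
```

Under the inclusive-coordinate assumption, the gene model spans 53012 bases on the plus strand of the contig.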
Transcription Factor Domain
- TF Family
- CUT
- Domain
- Homeobox|CUT
- PFAM
- PF02376
- TF Group
- Helix-turn-helix
- Description
- The CUT domain is a DNA-binding motif that can bind DNA independently or in cooperation with the homeodomain, which is often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein (e.g. Swiss:P10180).
- Hmmscan Out
-
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  5   7.9e-27   3.7e-23   81.0   0.0   3         71      699       768     697       774     0.94
2  5   0.32      1.5e+03   -1.1   0.5   41        76      815       850     812       852     0.87
3  5   0.53      2.5e+03   -1.8   0.0   8         30      880       902     878       916     0.80
4  5   7.5e-32   3.5e-28   97.1   0.0   3         77      1134      1209    1132      1211    0.96
5  5   1.8e-30   8.4e-27   92.7   0.0   3         75      1353      1426    1351      1430    0.95
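The five rows of the Hmmscan Out table can be read as 13 whitespace-separated fields each (#, of, c-Evalue, i-Evalue, score, bias, then hmm, ali, and env coordinate pairs, then acc). The sketch below parses them and keeps the hits that pass an independent E-value cutoff. The cutoff of 1e-5 is an assumption chosen for illustration, since the record does not state one, and `DomainHit` and `parse_hits` are hypothetical names.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int


# Rows copied from the Hmmscan Out table (13 whitespace-separated fields each).
ROWS = """\
1 5 7.9e-27 3.7e-23 81.0 0.0 3 71 699 768 697 774 0.94
2 5 0.32 1.5e+03 -1.1 0.5 41 76 815 850 812 852 0.87
3 5 0.53 2.5e+03 -1.8 0.0 8 30 880 902 878 916 0.80
4 5 7.5e-32 3.5e-28 97.1 0.0 3 77 1134 1209 1132 1211 0.96
5 5 1.8e-30 8.4e-27 92.7 0.0 3 75 1353 1426 1351 1430 0.95
"""

# Hypothetical cutoff for illustration; the source does not state one.
I_EVALUE_CUTOFF = 1e-5


def parse_hits(text: str) -> List[DomainHit]:
    """Parse domain rows, skipping any line without exactly 13 fields."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13:
            continue
        hits.append(
            DomainHit(
                index=int(fields[0]),
                i_evalue=float(fields[3]),
                score=float(fields[4]),
                ali_from=int(fields[8]),
                ali_to=int(fields[9]),
            )
        )
    return hits


hits = parse_hits(ROWS)
confident = [h for h in hits if h.i_evalue < I_EVALUE_CUTOFF]
for h in confident:
    print(f"domain {h.index}: residues {h.ali_from}-{h.ali_to}, score {h.score}")
```

Under this assumed cutoff, hits 1, 4, and 5 remain (alignment coordinates 699-768, 1134-1209, and 1353-1426), while hits 2 and 3, which have negative bit scores and i-Evalues above 1e+03, are dropped.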
Sequence Information
- Coding Sequence
- ATGAAAAACGAATGGCGTGCAACTCTTGCCGAAAAAGAAGTAAACACATTGAAGGAGCAATTGTCGGCCACAAGTACGACAACACCACCCACCAGTACCATCCCACAGTCCAAAGTCACATCGTCATCCGGCACAACCGCAGCGGTAACACAAAACGGTGTGTCACAATCACTGGGATCGCCAACATTGCCCTCAAATCAGCCACAACAGCAACCCGCGCCAACAACAGCAACCACATCCGACGCCGAAGCATCGGACAGTAGTCTAACAGCAGCAGAGTCTACGACGGCTGCAGCGACGACCGCTGTTAGTGCGAAATTTTTAATCAACAAAATCAATAACAATAATTACAACAACAAAAGTGATAATAAGTCCATGCTGTCTGACGACGACGAAGAGAATCATCTGCAAGACATGGATGTTGATCTTAAGATGGAGCAAATCGATCAACTTAATAAGAATAGCATTAACAACAATAATAGCCTAAGCAGTAACAATAATCATAGTAATGCTAGCAATATTAGTAATAGCGGGAATGTCCCGGATGTGGTGGCGGCAACCGTAACGGAGACTGTCGATTTGGATGATACAACGGACATGGAGGTCACTACGGCGACGACAACAATTAATAATAACAACCACGTCAGCGCTAAGGTGTTAGCTGAGGAGATGGCCGCCAAAGATAAAGAGATAAACGCATTGGTGGAGGAGGTATCTCGATTGCGTACAGCATTGACGGCACTGCAAGAGGCCCACAACACACAAATCCAGCGGTTAGAGGATCGATTGGAGGCTAAGCGGCTACATATCACAAGACTTGAAGCACGTTTAGAGAAAAACACGGATTTGGAGGAGAAAAAGGAAAATAGTGGTCTCAGGTctgtcgatttgtcaacagcagcagcagctgcagccgccgctgcagtggcggctgattctaagcaattccaggagttgATTCTCGAGCGGACAAAAGCATTAGCCGCCCAAAGTGAATTAGCACTAAAATCATCATCTGATGGATCTTCAGAGCACCAACACGGGGATCAAACTCGTATCAATTCTTCTCAATCTCCACCAATAACAGCACAGCAACAACAGCTACCACCAATACCAGCACAATTAATTCATTCTCGCACCCAAACGCCCACTACCATTCAGCAACAGCAGCCTCCACTGTTGCCGCCGTCCCAACATCAACCAAAGCCAAACGCACTAACTAACAATGACAGTAATCAGAAATCGATGGCAAACTATAATAGCACATCGCCCCTACATAATGTGGACATGTTCAGCTCATTTCTGGGCGAAGAAATAGTCAACCATTGGCGACGTGGTCTCCAATTCGCTCCACTCATTGCGCCCAATCCATCAGCTATCGATTGCCAATTATTGGCGGCCAATACCGGCTTCAATTTAATACCATCCGCCAACTCGAACATCACCACAAACACCTTATCAACGACCACATCAACACCAATCATTAATCTAACCAATCAACATTCGCCTCCCCCAACAGATGGCGGTCTACTGCATGTCGCCAATAATTCTCAACAAACAGCCGCCTCCCTGCTCTTGCAACAACAGCAATCGCAACAGCACGCTCACCTGCTGTCGAACGCTGTCCAACCGCCGACATCGCTGACGTCGCTGCCAACTACACCAGCTTCGGCAGCTCCACAACACCAGCAACAACATTTACTCGACGACAAATCGTGCAGTCATCTGGACGCCTCCTCAACGGCCAGCACTCCCGTCTTACAAGATTGCAGTCGCAGCGAAGACAATAACCACCTTATGAACGGTGGCGGTCTCACACCGCTGTCGATGTCCATGCGATCAAAAAGTCCCGCCGATCACGACAATAGCAATGCACTGAACGCATTGACGCCACAACAACAGAGTACACCCACATTTCCGCTCGGCTTGCCATTCCAATTCCAAGAGCGGCCGCACTTCCGATTCGCTGAAGATCTTCAGATGCCGCCTGGCACAATG
GCGGGTCGCCTCGGTGACTCACTGATACCTAAGGGTGATCCCATGGAGGCCAAACTGCAGGAGATGTTGCGCTACAACATGGACAAATATGCCAATCAGAATCTTGACACGTTGCACATATCGCGACGTGTGCGTGAACTGTTGTCAGTGCACAATATTGGACAGCGACTGTTTGCCAAATACGTGCTGGGACTGTCGCAGGGCACCGTATCAGAGTTGCTGTCGAAACCGAAGCCGTGGGACAAACTGACGGAGAAGGGACGTGACAGCTACAGGAAAATGCATGCATGGGGTTGTGATGATAACGCCGTACTGTTGCTGAAGTCGTTGATCCCGAAAAAAGATATGCCTACAATTCCAGGTACAGACTCAGGCCTACCCACCTATGGCCGCCCAGACGACAACATGAACGATGACCGGATAGCGCACATCCTCAccgaagcgtctaatctcatgaagacacccgtacagcatcagcaccaagcgcaaatgcaacaacaccaccaacaaatgcagcagcatcaacagcagcagcaacaacaacagcagcaacagcagcacaatcaaatgcatctggacgactcgcacagcaacgatgactccaagtcaccgcacaatcaatcaccgtcacCTTTCTCCAAAGATCTGCAGCGTCGCATGCAAAAGAAATACGAAAACGATGACATATCACAGGAGAAGGTCGCTCGCATCTACCAGGAGGAGCTGGCCAAACTGATGAGTCGGGCACCGCGAGACACTTTTCCTGGTCTCCTTTTTCCACATTTctttagtgccggcggtatgccaccaaatgcgcaacagcaacagcagcagcagtcacagcaacaacagcagACACCTCAAATGCCTGAAGAAAACATTCGCCTCGCTTTGGCTGCATATCATCGTGAACTTGCGAAACTGCAGCCGAGTGGAGCCGCCAATTTCCCCAGCCTACAAAATATCCTGGCGCTGCAGCAGCACGCTGCTCTCAACGGCTCCGTGCAGGACTTATCTGTAAAAAAAGACCAGCCGCACATCAACATGAACGGCCAACACCAGCAATCGCACACACCGTCCATTGACGGCGACACCAAAGACCCCGACGATGCGCGCCACTCGGGCAGCGCCTTTAGTTTGGTACGACCGAAACAAGAGCCCGGCACCACACCAACACCCAGTATCTCGTCGGCCGCTCCCAGCCCACTCAGTAACTCGATACTGCAGCCGGCAATGACACCCACCGATGACTTTTCATCAGCGGCCAGTCCCCTGCAACGCATGGCCTCAATCACAAACTCATTAATTACTCAATCACCTGTCAACACACACCACTCAACGCCACAGCGTCCACTGAAGGCCGTCCTGCCGCCAATCACACAGCAACAGTTTGACATGTTCAACAACTTGAACACCGAAGACATTGTGCGACGCGTCAAAGAAGCCTTGTCCCAATATTCCATAAGCCAGCGTCTGTTCGGTGAGTCTGTACTCGGCCTGTCTCAGGGTTCCGTCAGTGATTTGCTGGCTCGTCCCAAGCCATGGCACATGCTTACACAAAAGGGACGCGAACCCTTTATTCGAATGAAAATGTTTTTGGAAGATGAAAATGCTGTTCACAAGCTGGTCGCCAGCCAATATAAAATTGCGCCAGAGAAGTTGATGCGCACTGGCAACTACAGTGGAACACCCCAACTATCGTCGGGTATGGCTAAACAAATGCCGCCCGGTATGCCCAAGATGATGAGTGAAATGTCCAAGTTGCAACAGGAACAACATCTGATGCAGATCCAGTCGCAAATGTCACACTTGCAGGCACAGGCCCAACAGGCGGCTGCCGCTCAACAACAAGCACAACAACAGGGTATGTTGTTGACACCACCAGGAATGCCACCACACCATGCCATCACGTTGCAGACGCAACAAGAGAAAAAGGCGGCGGCGGCGGCTGCTGCTGCCATGCAAATGCCGGCAGGTCATTCACCGCAGCAAAACAATGC
CATGCGAGGAATGCACCAACATATGTCGCCAACAGTGTACGAGATGGCGGCACTTACACAAGATCTAGATACCCAGGTCATAACGACTAAGATTAAGGAAGCGCTTCTTGCCAACAATATTGGTCAGAAGATTTTTGGAGAAGCCGTTCTTGGTCTTTCCCAAGGATCCGTGTCAGAATTGCTATCGAAACCAAAGCCATGGCACATGTTGAGCATTAAAGGACGCGAACCATTCATTCGCATGCAATTGTGGTTGTCCGATGCCAATAACGTAGATCGTCTACAAGCGTTGAAAAACGAGCGTCGCGAAGCAAACAAGCGACGTCGTTCGACTGGTCCTGGAGCTCAAGATAACAGCTCAGACACTAGTAGCAACGATACGTCTGAATTTTACACATCAAACTCACCTGGACCCGGTTCGGTTGGATCCGGTGCCGGTGGTCAACCACCAAACAAGAAGCAGCGTGTCCTTTTCTCTGAAGAACAAAAGGAGGCCTTGCGATTGGCATTTGCCCTGGATCCCTATCCCAATGTGGCCACAATAGAATTCCTTGCATGTGAGTTAGGTCTGACAACGCGCACAATAACTAACTGGTTCCACAACCATCGAATGCGCTTGAAGCAACAGCATCCACATGGACCACCCAGCGATCCAGTGCCGTCACGCGAAAGTCAGTCCGGTGCCCAATTTGATCCCGTGCAATTCCGAATTCTACTTAATCAGCGATTGGTGGAATTGCAAAAGGAGCGCATGGGCTTGAGCGGCGTACCACTGCCATACCCGCCATACTTTGCAGCCAATCCAAATTTGGCGGCGCTCATCAGCCGTGGTCTGCTGCCAGGCAGTGAGGCTGAGATGAACGCCATCAATCAAGCACTAAAAGAGCAAATGAGTGGCTTGGACCTGTCAATGAGTTCGTTAAAACGTGAACGTgatgacgactttgatgatgtcgttgacggtgataatgacgacggtggttccgacaacgaATCCATGGATGGCATGGATTTGAAGTTGGATTCCAGCAAGGAACAGCCTACTGCCGCACAAATTCAAGCCGCCTCACGAACATCTCGCCGAAAGCCGGCTGCCCCACAATGGGTCAATCCAGACTGGCAGGATGATAACAAAAAGGCGGGAGCGGCGGCAAAAGGTGGCCCAGAAGTAATTATCAACGGTGTGTGCGTGATGCAACAGGCGGCCGATTACAGTCGTTTGGGTGAGGAGACAGTACGCGTAGAGCCAACTGCCGTCCAGGAGGCGCGTCTcaatgcggctgcaacagctgcggctgacgacaatgaagatgacgaggctgacgaagacgatgatgatgacgccgagagcgGTGGTGCACTTCCTGTTATGCCAACCGTCTCATTGGGCCACCAATCCGATGATGAGGATGGTGAGGATGACGATGTGGTGGTGGCACACAGTCCATTGGCGCTTTCTGAGAAAAGTCGGTCTGTGCATGAGGTGGAGGACTGCACCGACAAAGACCAACCGACGGCTTTGGTAAAAGAAGAGAAAAACGACGTCGATCGGTGGGAATATTAA
- Protein Sequence
- MKNEWRATLAEKEVNTLKEQLSATSTTTPPTSTIPQSKVTSSSGTTAAVTQNGVSQSLGSPTLPSNQPQQQPAPTTATTSDAEASDSSLTAAESTTAAATTAVSAKFLINKINNNNYNNKSDNKSMLSDDDEENHLQDMDVDLKMEQIDQLNKNSINNNNSLSSNNNHSNASNISNSGNVPDVVAATVTETVDLDDTTDMEVTTATTTINNNNHVSAKVLAEEMAAKDKEINALVEEVSRLRTALTALQEAHNTQIQRLEDRLEAKRLHITRLEARLEKNTDLEEKKENSGLRSVDLSTAAAAAAAAAVAADSKQFQELILERTKALAAQSELALKSSSDGSSEHQHGDQTRINSSQSPPITAQQQQLPPIPAQLIHSRTQTPTTIQQQQPPLLPPSQHQPKPNALTNNDSNQKSMANYNSTSPLHNVDMFSSFLGEEIVNHWRRGLQFAPLIAPNPSAIDCQLLAANTGFNLIPSANSNITTNTLSTTTSTPIINLTNQHSPPPTDGGLLHVANNSQQTAASLLLQQQQSQQHAHLLSNAVQPPTSLTSLPTTPASAAPQHQQQHLLDDKSCSHLDASSTASTPVLQDCSRSEDNNHLMNGGGLTPLSMSMRSKSPADHDNSNALNALTPQQQSTPTFPLGLPFQFQERPHFRFAEDLQMPPGTMAGRLGDSLIPKGDPMEAKLQEMLRYNMDKYANQNLDTLHISRRVRELLSVHNIGQRLFAKYVLGLSQGTVSELLSKPKPWDKLTEKGRDSYRKMHAWGCDDNAVLLLKSLIPKKDMPTIPGTDSGLPTYGRPDDNMNDDRIAHILTEASNLMKTPVQHQHQAQMQQHHQQMQQHQQQQQQQQQQQQHNQMHLDDSHSNDDSKSPHNQSPSPFSKDLQRRMQKKYENDDISQEKVARIYQEELAKLMSRAPRDTFPGLLFPHFFSAGGMPPNAQQQQQQQSQQQQQTPQMPEENIRLALAAYHRELAKLQPSGAANFPSLQNILALQQHAALNGSVQDLSVKKDQPHINMNGQHQQSHTPSIDGDTKDPDDARHSGSAFSLVRPKQEPGTTPTPSISSAAPSPLSNSILQPAMTPTDDFSSAASPLQRMASITNSLITQSPVNTHHSTPQRPLKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVSDLLARPKPWHMLTQKGREPFIRMKMFLEDENAVHKLVASQYKIAPEKLMRTGNYSGTPQLSSGMAKQMPPGMPKMMSEMSKLQQEQHLMQIQSQMSHLQAQAQQAAAAQQQAQQQGMLLTPPGMPPHHAITLQTQQEKKAAAAAAAAMQMPAGHSPQQNNAMRGMHQHMSPTVYEMAALTQDLDTQVITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFIRMQLWLSDANNVDRLQALKNERREANKRRRSTGPGAQDNSSDTSSNDTSEFYTSNSPGPGSVGSGAGGQPPNKKQRVLFSEEQKEALRLAFALDPYPNVATIEFLACELGLTTRTITNWFHNHRMRLKQQHPHGPPSDPVPSRESQSGAQFDPVQFRILLNQRLVELQKERMGLSGVPLPYPPYFAANPNLAALISRGLLPGSEAEMNAINQALKEQMSGLDLSMSSLKRERDDDFDDVVDGDNDDGGSDNESMDGMDLKLDSSKEQPTAAQIQAASRTSRRKPAAPQWVNPDWQDDNKKAGAAAKGGPEVIINGVCVMQQAADYSRLGEETVRVEPTAVQEARLNAAATAAADDNEDDEADEDDDDDAESGGALPVMPTVSLGHQSDDEDGEDDDVVVAHSPLALSEKSRSVHEVEDCTDKDQPTALVKEEKNDVDRWEY
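The Coding Sequence and Protein Sequence entries can be cross-checked by translating the former. The sketch below assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene and treats lowercase bases, which occur in the coding sequence above, the same as uppercase. The `translate` helper is a hypothetical name, and only the first 30 and last 12 bases are embedded to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()  # lowercase bases are translated like uppercase ones
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases of the Coding Sequence in this record.
cds_start = "ATGAAAAACGAATGGCGTGCAACTCTTGCC"
cds_end = "TGGGAATATTAA"
print(translate(cds_start))  # MKNEWRATLA
print(translate(cds_end))    # WEY
```

The two outputs match the first ten residues (MKNEWRATLA) and the last three residues (WEY) of the Protein Sequence, with the final TAA acting as the stop codon.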
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01381985
- 90% Identity
- iTF_01382812
- 80% Identity
- iTF_01382812