Basic Information

Insect
Arma custos
Gene Symbol
kn
Assembly
GCA_037127475.1
Location
CM073760.1:73000322-73028489[+]
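The Location value packs contig, coordinates, and strand into one string. Below is a minimal sketch of splitting it into fields. The names `Locus`, `parse_locus`, and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption that this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "CONTIG:START-END[STRAND]".
LOCUS_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a location string into contig, start, end, and strand."""
    match = LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(locus: Locus) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return locus.end - locus.start + 1


locus = parse_locus("CM073760.1:73000322-73028489[+]")
print(locus.contig, locus.strand, span_length(locus))
```

Under that assumption the span covers the whole locus on the contig, including any introns, so it need not equal the length of the coding sequence.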

Transcription Factor Domain

TF Family
COE
Domain
COE domain
PFAM
AnimalTFDB
TF Group
Helix-turn-helix
Description
This is the helix-loop-helix domain of the COE transcription factor; it is responsible for dimerisation [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  2   1.5e-47   5.8e-43   149.1  2.0   204       295     63        210     53        225     0.94
2  2   4.8e-64   1.9e-59   203.6  18.3  293       512     484       686     481       704     0.80
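The two domain hits above can be read into typed records for filtering or for computing alignment lengths. This is a minimal sketch: the column names in `COLUMNS` and the helper `parse_domain_row` are hypothetical, and the rows are copied from this record rather than read from an hmmscan output file. It assumes the thirteen fields follow the per-domain layout shown in the header line and that "ali" coordinates are 1-based inclusive positions on the protein.

```python
# (name, type) for each whitespace-separated field, in header order.
COLUMNS = [
    ("index", int), ("of", int),
    ("c_evalue", float), ("i_evalue", float),
    ("score", float), ("bias", float),
    ("hmm_from", int), ("hmm_to", int),
    ("ali_from", int), ("ali_to", int),
    ("env_from", int), ("env_to", int),
    ("acc", float),
]

# Rows copied from the record above.
ROWS = """
1 2 1.5e-47 5.8e-43 149.1 2.0 204 295 63 210 53 225 0.94
2 2 4.8e-64 1.9e-59 203.6 18.3 293 512 484 686 481 704 0.80
"""


def parse_domain_row(line: str) -> dict:
    """Convert one whitespace-separated domain row into a typed dict."""
    fields = line.split()
    if len(fields) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} fields, got {len(fields)}")
    return {name: cast(value) for (name, cast), value in zip(COLUMNS, fields)}


domains = [parse_domain_row(line) for line in ROWS.strip().splitlines()]

for dom in domains:
    # Aligned length on the protein, assuming inclusive coordinates.
    ali_len = dom["ali_to"] - dom["ali_from"] + 1
    print(dom["index"], dom["i_evalue"], dom["score"], ali_len)
```

A common follow-up is to keep only hits below a chosen i-Evalue cutoff; both hits here would pass any conventional threshold.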

Sequence Information

Coding Sequence
ATGGCGCCCAGGAGGTTCTCAATGGACGGCTTTGCGTACGATCAAGAAAGAATAGTAGAGTACGCTAAGGATCGCCTGTCACAAGAACTTCTCATCCAGTATCCTCGATCAGTATTACGGAATACGAATACCCATACACGAATACTACTGGATACGCTGCTACTTCGTCGTTCCACAGGCAGACAACCGGTACGAACTGACCATGTCGTGATCTCGACGATGGTATCTGTTGAGGGTCCACTCCTTGCCATCTCGGACAACATGTTCGTCCACAACAACTCTAAACACGGGAGGCGGGCGAAAAGGATAGACCCTTCGGAAGGTGAATCAGCTTGCGCCGCATTAGCAGCGCAGCGCACGTTTGCCACTGATCAGCTGATTAACCTCAAGCCGCCTCCTTGCATATCCTATCGCGGGGGAATACGTGCTGATAAGGAGACCGTCTCGAGGGGCCCAGAAGCTACTGTCGGTTTATACCCACCTCTACCCGTTGCAACACCCTGCATCAAAGCTATCTCACCGAGTGAAGGCTGGACGAGCGGTGGTTCGACGGTCATCATAATCGGAGACAACTTCTTCGACGGACTTCAAGTAGTCTTCGGCACGATGCTTGTTTGGAGCGAGTCTTATCATATTAAGTGTGCGAAAATTGATTCTATGGACTCAGAAATCGCTAAATTATGGAAGTGCCCTGAGTGTTTATTAGCGACCAATACAATGCCGGTATCCGAACCTAAtctaatgttgaatatgctAAGATCACTCACAGAGGAGTTCAAAGAGATGAAAGCTAAAATGAAATCCTTAGAAGAGCTGAAAGAGATAAAAGAAACAATGCAAAAACAGTCTGAGCTATCTTttgaaaatagtgacagacttcataaaattgaaaacctgCTTGAAGAGCAGAACAAACAACTGGAAAGAGTAATGctagagaacaaatttttaaaagaaaaagtatcaCTACTAGAGGGTCGCCTTAACCACGCCGAGCAGtaccaattaaaaaacactattgAAATTAGGGGCATCCCTGAAAAGCAAGGAGAGACCACTGAAGGTCTAATACTAAGCGTCGGAGCAGCCCTGGGCGTTAAACTGTGCCCCGAAGATCTGGACCACGCCGTACGGCTCAGGGCAAGACAAGAAGGAACTCCTGGCCCCATCATCGCCAGATTCGTGCGACAAGCCCTAAGGGACGAATTGGTTcgtcagaggaaaataaaaagggactttTCCACTCGACACCTAGGCTGGGGAGAGAATGAGGCGCACCGCGTCTACGTCAGTGAAGCTATGACTCCTACCAACAAGCATCTTTACTGGCTAGCCAGGCAGAAACAGATATctgcaaaaattaagtatgtttggTTCAGTGGAGGCCGAGTCAGTTGTAGACAGTCGGACGGTTACCCAGCAGTGACAATTAGCAAGCCTAGTGATCTGGACCTTATAACCTCCCATGCGATACGGGTGCAAACACCACCAAGGCACATACCGGGGGTTGTAGAAGTCACCTTGTCCTACAAGAGCAAGCAGTTCTGTAAGGGTGCGCCTGGTCGTTTCGTCTATGTTTctTTGAATGAGCCCACGATAGACTATGGATTTCAGCGCTTGCAAAAGCTGATACCAAGGCATCCAGGCGACCCTGAAAAACTTCCcaaggaaataaTACTGAAAAGAGCAGCGGACTTGGCAGAAGCACTTTACACAATGCAGCCAAGGAACCAACAACTTGCAGCTCCGAGATCCCCTACCAACAACAACAGCACTACCACCTTCAACTCCTACACCGGCCAGCTGGCTGTCACAGTCCATGAGAACAACGGCCAATGGACGGAAGAAGAGTACACGAGGGGAGGTGCTGGAAGCGTTTCACCTAGAGGATACGGCTCCAGCACAAGCACACCTCACTCCAGCAACGGAGGATACAACAACTCGACGCCGACCTACCCCACGTCTTCCCAGGCTACATCCACCACACCCGTCATCTTTAA
CTCCTCGGCACCCAGAGTTGGAAGTCTTGTCTCCTCACCTTTCACTGGAATGAACCCATTCGCTCTTCCGACATGTAATTCACAGAGTTACAGCCCTCTGGTATCAACACCGAAATGA
Protein Sequence
MAPRRFSMDGFAYDQERIVEYAKDRLSQELLIQYPRSVLRNTNTHTRILLDTLLLRRSTGRQPVRTDHVVISTMVSVEGPLLAISDNMFVHNNSKHGRRAKRIDPSEGESACAALAAQRTFATDQLINLKPPPCISYRGGIRADKETVSRGPEATVGLYPPLPVATPCIKAISPSEGWTSGGSTVIIIGDNFFDGLQVVFGTMLVWSESYHIKCAKIDSMDSEIAKLWKCPECLLATNTMPVSEPNLMLNMLRSLTEEFKEMKAKMKSLEELKEIKETMQKQSELSFENSDRLHKIENLLEEQNKQLERVMLENKFLKEKVSLLEGRLNHAEQYQLKNTIEIRGIPEKQGETTEGLILSVGAALGVKLCPEDLDHAVRLRARQEGTPGPIIARFVRQALRDELVRQRKIKRDFSTRHLGWGENEAHRVYVSEAMTPTNKHLYWLARQKQISAKIKYVWFSGGRVSCRQSDGYPAVTISKPSDLDLITSHAIRVQTPPRHIPGVVEVTLSYKSKQFCKGAPGRFVYVSLNEPTIDYGFQRLQKLIPRHPGDPEKLPKEIILKRAADLAEALYTMQPRNQQLAAPRSPTNNNSTTTFNSYTGQLAVTVHENNGQWTEEEYTRGGAGSVSPRGYGSSTSTPHSSNGGYNNSTPTYPTSSQATSTTPVIFNSSAPRVGSLVSSPFTGMNPFALPTCNSQSYSPLVSTPK
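The coding sequence and the protein sequence can be cross-checked by translating one into the other. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The function `translate` is a hypothetical helper. It upper-cases the input, so the lower-case stretches in the coding sequence are handled, and it strips whitespace, so a sequence wrapped over several lines can be pasted in directly. Only the first ten and last four codons of the coding sequence are used here to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks, normalise case
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the coding sequence above.
start_of_cds = "ATGGCGCCCAGGAGGTTCTCAATGGACGGC"
# Last four codons, including the terminal stop.
end_of_cds = "ACACCGAAATGA"

print(translate(start_of_cds))
print(translate(end_of_cds))
```

Passing the full coding sequence to `translate` and comparing the result with the protein sequence is a quick way to confirm the two fields belong to the same transcript.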

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-