Basic Information

Gene Symbol
kn
Assembly
GCA_011634795.1
Location
PHTE01000448.1:73619-175408[-]
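
The location string above packs contig, coordinates and strand into one token. A minimal sketch of splitting it apart follows, assuming the layout is `contig:start-end[strand]` with 1-based inclusive coordinates (the record itself does not state the coordinate convention). The names `Location` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str

    @property
    def span(self) -> int:
        # Assumes 1-based inclusive coordinates; this is a guess about the record format.
        return self.end - self.start + 1


# contig:start-end[strand], e.g. PHTE01000448.1:73619-175408[-]
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("PHTE01000448.1:73619-175408[-]")
print(loc.contig, loc.start, loc.end, loc.strand, loc.span)
```

Under that assumption the locus spans 101,790 bp on the minus strand of the contig.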

Transcription Factor Domain

TF Family
COE
Domain
COE domain
PFAM
AnimalTFDB
TF Group
Helix-turn-helix
Description
This is the helix-loop-helix domain of transcription factor COE. It is responsible for dimerisation [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1  2   1.7e-237  4e-233    775.4  0.0   39        444     20        427     15        441     0.94
2  2   0.027     6.4e+02   -1.3   9.8   449       511     456       519     445       540     0.67
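
The two domain rows above can be read programmatically. The sketch below assumes each row holds exactly 13 whitespace-separated values in the order given by the header. The column names, the helper `parse_domain_row`, and the i-Evalue cutoff of 1e-5 are illustrative choices, not values taken from the record.

```python
# Column order follows the header of the Hmmscan Out table; names are illustrative.
COLUMNS = [
    "index", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_COLUMNS = {
    "index", "of", "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to",
}


def parse_domain_row(line: str) -> dict:
    """Turn one whitespace-separated domain row into a dict of typed values."""
    values = line.split()
    if len(values) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} fields, got {len(values)}")
    return {
        name: int(value) if name in INT_COLUMNS else float(value)
        for name, value in zip(COLUMNS, values)
    }


rows = [
    "1 2 1.7e-237 4e-233 775.4 0.0 39 444 20 427 15 441 0.94",
    "2 2 0.027 6.4e+02 -1.3 9.8 449 511 456 519 445 540 0.67",
]
hits = [parse_domain_row(row) for row in rows]

# Hypothetical significance cutoff on the independent E-value.
I_EVALUE_CUTOFF = 1e-5
significant = [hit for hit in hits if hit["i_evalue"] <= I_EVALUE_CUTOFF]

for hit in significant:
    print(hit["index"], hit["ali_from"], hit["ali_to"], hit["score"])
```

With this cutoff only the first domain (protein residues 20 to 427, score 775.4) is retained. The second hit has an i-Evalue of 6.4e+02 and a negative score, so it would be dropped.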

Sequence Information

Coding Sequence
ATGcggcaaattttaaaagaggaACCCGTACCACGTGCCTGGCCACAACCTGCCCTCGCCGATAATGGACCGGTTGGAGTGGGTCGAGCCCATTTCGAGAAACAGCCTCCGAGCAATCTAAGGAAGAGTAACTTTTTCCACTTTGTTATTGCTCTCTACGATCGAGGGGGACAGCCGATTGAAATCGAAAGGACGGCTTTCATTGGATTCGTCGAGAAGGACCAGGAAAGTGAGGGTCAAAAGACAAATAATGGAATTCAATATCGTCTTCAACTTCTTTATGCAAATggtgtCAGACAAGAACAAGATATCTTCGTGAGGCTGATAGATTCTGTAACAAAACAGgctATTATGTACGAGGGTCAGGATAAGAATCCTGAGATGTGTCGTGTTTTGCTGACTCACGAAGTCATGTGCAGTCGTTGCTGTGATAAAAAGAGCTGtggaaatagaaatgaaaCTCCAAGTGATCCAGTTATCATCGATCGATTCTTCCTCAAGTTCTTCCTCAAGTGCAACCAAAACTGTCTTAAGAACGCTGGAAATCCAAGGGACATGAGACGTTTTCAGgTTGTAATATCGACGCAAGTTGGTGTGGAAGGTCCTTTATTGGCTGTTTCGGACAATATGTTTGTGCATAATAACAGCAAACATGGACGTAGAGCAAAGAGGTTGGACCCAAGTGATCCTGGCGAATATAATAGTGAgtgcaTCTATCCACCAATTCCAGTTGCGACTCCCTGCATAAAAGCGATATCACCTAGCGAGGGGTGGACGCAGGGGGGTTCAACAGTGATAATAATCGGTGACAATTTCTTCGATGGGTTACAAGTCGTATTCGGCACAATGCTCGTATGGAGTGAGttaaTAACGGCCCATGCAATAAGAGTGCAAACACCACCACGACAAATACCCGGAGTTGTTGAAGTGACATTATCCTATAAAAGTAAACAATTCTGTAAAGGTTCTCCAGGAAGATTTGTTTATGTCtCATTGAACGAACCAACGATTGACTATGGATTCCAGaggttacaaaaattaataccaCGACATCCTGGCGATCCAGAAAAATTgccaaaagaaattattttaaaaagagctGCCGATCTTGCTGAGGCACTTTATAGTATGCCAAGGGGTAGTAATGCAAGTATCACAGGAGCGCCAAGAAGTCCTGTATCGAGTCATCCACCAGCGCCACCAACATCAAGTTCAGCGACAGCATTTAATTCATATACCGGTCAACTTGCTGTCACTGTCCAGGAAAATGGCAGCGCCGCCAAATGGACGGATGACGGTGGTTCAGTGACGTCAGGTGCCGGAGGCGGTGGAGGTGGCGGCGGTAACGGCGGCACTGTCGACGCCTATAGGCAAAGTAGTAGCGCGAGTCCACGGGGGGTTGTAACCAGTGGTGGTTATTGTGGAAGTTCCGCCAGTACGCCGCACAGTCACAGCACCAGCGGAAGCTACTCCGTCGCCAATCCCTACACCGGAAGTCCGACCCTCTACACTTCGCGATTGGGAGAAGTAGTTGGATCACCATTCAATATGAATCCATTTATGTTGCCTACCTGTAATCAAGGATATTCAACAACTGGTAGTCCACTTTTATCCTCGAATGGTAAATAG
Protein Sequence
MRQILKEEPVPRAWPQPALADNGPVGVGRAHFEKQPPSNLRKSNFFHFVIALYDRGGQPIEIERTAFIGFVEKDQESEGQKTNNGIQYRLQLLYANGVRQEQDIFVRLIDSVTKQAIMYEGQDKNPEMCRVLLTHEVMCSRCCDKKSCGNRNETPSDPVIIDRFFLKFFLKCNQNCLKNAGNPRDMRRFQVVISTQVGVEGPLLAVSDNMFVHNNSKHGRRAKRLDPSDPGEYNSECIYPPIPVATPCIKAISPSEGWTQGGSTVIIIGDNFFDGLQVVFGTMLVWSELITAHAIRVQTPPRQIPGVVEVTLSYKSKQFCKGSPGRFVYVSLNEPTIDYGFQRLQKLIPRHPGDPEKLPKEIILKRAADLAEALYSMPRGSNASITGAPRSPVSSHPPAPPTSSSATAFNSYTGQLAVTVQENGSAAKWTDDGGSVTSGAGGGGGGGGNGGTVDAYRQSSSASPRGVVTSGGYCGSSASTPHSHSTSGSYSVANPYTGSPTLYTSRLGEVVGSPFNMNPFMLPTCNQGYSTTGSPLLSSNGK
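
The coding sequence and protein sequence can be cross-checked by translating one into the other. The following is a minimal sketch that assumes the standard genetic code and treats lower-case bases the same as upper-case ones; `translate` is a hypothetical helper written for illustration, and only short excerpts from the start and end of the coding sequence are used.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    sequence = cds.upper()
    if len(sequence) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(sequence), 3):
        amino_acid = CODON_TABLE[sequence[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First nine codons and last four codons of the coding sequence in this record.
cds_start = "ATGcggcaaattttaaaagaggaACCC"
cds_end = "AATGGTAAATAG"

print(translate(cds_start))
print(translate(cds_end))
```

The two excerpts translate to `MRQILKEEP` and `NGK`, matching the first nine and last three residues of the protein sequence listed above.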