Basic Information

Gene Symbol
kn
Assembly
GCA_001276565.1
Location
KQ435694.1:1095608-1167762[-]
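
The Location value packs scaffold, coordinates and strand into one string. The sketch below splits it into its parts and reports the span of the locus. It assumes the coordinates are 1-based and inclusive, which this record does not state; `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(loc: str) -> dict:
    """Split 'scaffold:start-end[strand]' into named parts."""
    m = re.fullmatch(r"([^:\s]+):(\d+)-(\d+)\[([+-])\]", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    scaffold, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "scaffold": scaffold,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so span = end - start + 1.
        "span": end - start + 1,
    }

locus = parse_location("KQ435694.1:1095608-1167762[-]")
print(locus["scaffold"], locus["strand"], locus["span"])
```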

Transcription Factor Domain

TF Family
COE
Domain
COE domain
PFAM
AnimalTFDB
TF Group
Helix-turn-helix
Description
This is the helix-loop-helix domain of transcription factor COE. It is responsible for dimerisation [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  4   3.4e-45   4.8e-41   140.4  1.0   111       183     68        140     54        150     0.91
2  4   1.1e-53   1.6e-49   168.4  0.0   182       271     404       516     394       516     0.97
3  4   3.7e-40   5.2e-36   123.8  15.6  292       533     513       717     512       731     0.73
4  4   0.011     1.6e+02   -0.0   4.8   444       504     679       736     664       790     0.72
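
The four domain rows above can be read into typed records and filtered by independent E-value, as a minimal sketch. The 13-column layout is taken from the header of this record; the `DomainHit` type, the `parse_hits` helper and the `1e-5` cutoff are illustrative assumptions, not values prescribed by the source.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float

# One converter per column, in the same order as the fields above.
_CONVERTERS = (int, int, float, float, float, float,
               int, int, int, int, int, int, float)

ROWS = """\
1 4 3.4e-45 4.8e-41 140.4 1.0 111 183 68 140 54 150 0.91
2 4 1.1e-53 1.6e-49 168.4 0.0 182 271 404 516 394 516 0.97
3 4 3.7e-40 5.2e-36 123.8 15.6 292 533 513 717 512 731 0.73
4 4 0.011 1.6e+02 -0.0 4.8 444 504 679 736 664 790 0.72
"""

def parse_hits(text: str) -> list[DomainHit]:
    """Parse whitespace-separated domain rows; skip lines without 13 fields."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != len(_CONVERTERS):
            continue
        hits.append(DomainHit(*(conv(v) for conv, v in zip(_CONVERTERS, fields))))
    return hits

I_EVALUE_CUTOFF = 1e-5  # hypothetical threshold, chosen only for illustration

hits = parse_hits(ROWS)
significant = [h for h in hits if h.i_evalue < I_EVALUE_CUTOFF]
for h in significant:
    print(f"domain {h.index}: residues {h.ali_from}-{h.ali_to}, score {h.score}")
```

Under this cutoff the fourth row (i-Evalue 1.6e+02) is dropped, leaving three domain segments on the protein.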

Sequence Information

Coding Sequence
ATGTCGTGGAATCCACCAGAATATTATGCCATAGTATTACGTATCGAAGTTCTTATTTTAGAAACAATGGTACCCAGTTTTACTCTACTTTCCCCTACGAGTCTGAAGGCCAGAAGACAAACAATGGCATTCAGTATAGCCTGCATCTGCTGTATTCCAACGGTGAGTAAAAAACCAAAGAAACACCACATTGATGACGACTCGTTATCGTCAGTTTGTGTCAGGCAGGTGCAGGACATCTACGTGAGGCTCATCGACTCTGCAACGAAACAGgCCATCATGTACGAGGGTCAAGACAAAAATCCCGAAATGTGCAGGGTCCTTCTCACTCACGAGGTGATGTGCAGTCGGTGTTGCGACAAGAAGAGCTGCGGAAACAGGAACGAGACGCCGAGCGACCCTGTAATTATCGATCGGGGCCAGATAACTCCTCTTCTCGAAGGTAGCCACCTGCGAGATAGCACAGTCGCGACTAGAAACCCAGGATATTTAGGCTTGCAAAATACGTTTCCTACGAGGCGTGGAAAGAGCAAACGGGAGGACAGACACGTGGAATCGCGGATCGGTCAGCAACGCGACGAGGAGACGCATCGCGAGAATAGTCCTAGAGAAGCTGgttatgttacgATAAGTCAAACTACCGTCGCTAGATGGCAGTGTTTActaaatttgaacgaaacgaCGTTACTTATGGGTACCCCTAgAAGTTACCCTTTCTTCTATTCATCCTGTTCTATTCTGAAACTTGGATACGATACCAATTTGGCACGCGGTGTTGGAAAACGTTCGTTCGGCAAGAGGGAACGAAGAATCTGCAGAAGACGGGAAAAAATAGGGGTTAGCTTGATAACAATGAACGTAATGAAGACATCCCTAAGGGGTACCTACCACCGAGATACATTGTTAATGCGACTGCTGAGTAGAGGTTTGTCGTTTGGATGGGTGATTAGTACTCTATCGAACACGCCTCCTATTCCACCGAACAGACAGAATTTCTTACTTACCGAGCAGAGCTTTGCGACATTTATAATTGGTATCGACATGAAAAATAGATTTACGTTACTAGAGGCGAGTAGAACAGGGCTTACGATTGGCTTATGCAATATTCAGACTTACTTTCCAAGCCAAGTTCGCGTTCTAATTAAGTTCTCGGCGTTGAAGAGCGTTAAGCACCTTGTCAACCGTTTAACTCCGGTCCCCTCTGCTTTCAGATTCTTCCTCAAGTTCTTCCTCAAGTGCAACCAGAACTGCCTTAAGAATGCCGGCAATCCACGCGATATGAGACGCTTCCAAGTGGTCATTTCCACGCAAGTAGGAGTAGAGGGACCACTACTGGCCGTCTCGGACAACATGTTCGTTCACAACAACAGCAAACACGGGCGTAGAACGAAACGATTGGACCCCAGTGATCCCGGAGAATACAACActaAGCTTAAGCCAGAACCGACTCGTTTGTTAAGTCTTTACACGCCTGTACCTCTTCAGACGCCGTGTATAAAAGCGATATCGCCGAACGAAGGTTGGACATCTGGCGGATCCACGCTAATTACGCCAAACGCAATTAGAGTACAAACTCCACCACGGCAGATACCAGGCGTCGTCGAGGTCACGCTGTCCTATAAAACTAAACAATTCTGTAAAGGAGCACCCGGTAGATTTGTTTATAAACTCATTCCTCGGCATCCAGGAGATCCCGAGAAATTACCAAAAGAGATCATACTGAAAAGGGCGGCGGATCTTGCAGAAGCCTTGTACAGCATGCCGAGAAGCGGAAACGCCGGAATCACGGGAGCGCCAAGGAGTCCGGGATCGGGTCATCCACCTGCGCCACCGACCTCCAGTTCGGCAACAGCGTTCAACTCGTACACAGGACAACTGGCGGTGACGGTGCAAGAAAACGGTAGCGCGACCAAGTGGACAGACGGCAAGTATTGCAGTGACGGTAGTTCGGTGACGACAGGAGCCGGTGCTGgaggcggcggtggcggcggcggcggcgg
cggcggtggtagCGTCGGAGGAAGCGTCGGTGGCAGCGCCGACGCCTACAGGCAGAGCAGCAGCGCGAGTCCACGTGGGGTCGTGACCGGTGGTGGTTACTGCGGAAGTTCGGCTAGCACACCTCACAGTCACAGCACCAACGGAAGTTACTCGGTAGCCAATCCGTATACCGGTAGCCCAACGCTTTATACATCGCGTAAcGCACGAACAATATGGCATTTTCTATACGTCCGGGGACGGGAGTGCGTTCGCCCCCGTAGTACGACCACCGACCACCTCAGTCCCACCGCACTGGTCCACACAGCACCACCTCGCCACAGCCGCCCAGTAACCGCTAACCGTCCACCATCACAACCACCATCATCACCACCACAACCACCCTATGCACTGCGATTAAAGAATTATGAGAAACATGCACGAGAGACGAGACTCGAGGATattgtagcacgaatgaaaaatttaagagCATCAAGATACAAAGATCAACATGATCGGGAAAATGTTCTTCGTAAAGGAGatgcataa
Protein Sequence
MSWNPPEYYAIVLRIEVLILETMVPSFTLLSPTSLKARRQTMAFSIACICCIPTVSKKPKKHHIDDDSLSSVCVRQVQDIYVRLIDSATKQAIMYEGQDKNPEMCRVLLTHEVMCSRCCDKKSCGNRNETPSDPVIIDRGQITPLLEGSHLRDSTVATRNPGYLGLQNTFPTRRGKSKREDRHVESRIGQQRDEETHRENSPREAGYVTISQTTVARWQCLLNLNETTLLMGTPRSYPFFYSSCSILKLGYDTNLARGVGKRSFGKRERRICRRREKIGVSLITMNVMKTSLRGTYHRDTLLMRLLSRGLSFGWVISTLSNTPPIPPNRQNFLLTEQSFATFIIGIDMKNRFTLLEASRTGLTIGLCNIQTYFPSQVRVLIKFSALKSVKHLVNRLTPVPSAFRFFLKFFLKCNQNCLKNAGNPRDMRRFQVVISTQVGVEGPLLAVSDNMFVHNNSKHGRRTKRLDPSDPGEYNTKLKPEPTRLLSLYTPVPLQTPCIKAISPNEGWTSGGSTLITPNAIRVQTPPRQIPGVVEVTLSYKTKQFCKGAPGRFVYKLIPRHPGDPEKLPKEIILKRAADLAEALYSMPRSGNAGITGAPRSPGSGHPPAPPTSSSATAFNSYTGQLAVTVQENGSATKWTDGKYCSDGSSVTTGAGAGGGGGGGGGGGGGSVGGSVGGSADAYRQSSSASPRGVVTGGGYCGSSASTPHSHSTNGSYSVANPYTGSPTLYTSRNARTIWHFLYVRGRECVRPRSTTTDHLSPTALVHTAPPRHSRPVTANRPPSQPPSSPPQPPYALRLKNYEKHARETRLEDIVARMKNLRASRYKDQHDRENVLRKGDA
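
As a consistency check between the two sequence fields, the coding sequence can be translated and compared with the protein. The sketch below assumes the standard genetic code (NCBI translation table 1) and treats lowercase bases as ordinary bases; it is run here only on the first 30 and last 21 nucleotides copied from the record, and `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a coding sequence up to, and excluding, the first stop codon."""
    # Remove any whitespace or line breaks and normalise case first.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGTCGTGGAATCCACCAGAATATTATGCC"  # first 30 nt of the record
cds_end = "CTTCGTAAAGGAGatgcataa"             # last 21 nt of the record

print(translate(cds_start))
print(translate(cds_end))
```

Because whitespace is stripped before translation, the same function can be applied to the full coding sequence even when it is wrapped across several lines.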

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-