g4892.t2
Basic Information
- Insect
- Drosophila pseudoobscura
- Gene Symbol
- kn
- Assembly
- GCA_009870125.2
- Location
- CM020869.1:8615245-8638822[-]
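The Location field packs the sequence accession, start, end, and strand into one string. A minimal sketch of splitting it apart follows. It assumes the coordinates are 1-based and inclusive, which this record does not state, and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

# Pattern for strings shaped like "SEQID:START-END[STRAND]".
_LOCATION_RE = re.compile(
    r"(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)

def parse_location(loc: str) -> dict:
    """Split a location string into its parts (hypothetical helper)."""
    m = _LOCATION_RE.fullmatch(loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    start, end = int(m["start"]), int(m["end"])
    return {
        "seqid": m["seqid"],
        "start": start,
        "end": end,
        "strand": m["strand"],
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

locus = parse_location("CM020869.1:8615245-8638822[-]")
print(locus)
```

Under the inclusive-coordinate assumption the locus spans 23,578 bp on the minus strand.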
Transcription Factor Domain
- TF Family
- COE
- Domain
- COE domain
- PFAM
- AnimalTFDB
- TF Group
- Helix-turn-helix
- Description
- This is the helix-loop-helix domain of transcription factor COE. It is responsible for dimerisation [1].
- Hmmscan Out
-
  #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
  1  2   6.8e-233  2.7e-229  762.5  0.0   41        385     95        468     89        472     0.99
  2  2   1.5e-07   0.00061   18.4   16.0  422       543     471       591     467       593     0.73
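The two domain hits above can be read into typed records for filtering. The following is a rough sketch, not the database's own pipeline: the `DomainHit` class, the field names, and the `1e-5` i-Evalue cutoff are all illustrative assumptions, and the two rows are copied from this record.

```python
from dataclasses import dataclass

@dataclass
class DomainHit:
    """One per-domain row of hmmscan output (field names are illustrative)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float

    @property
    def ali_length(self) -> int:
        # Aligned span on the protein, treating coordinates as inclusive.
        return self.ali_to - self.ali_from + 1

def parse_hit(line: str) -> DomainHit:
    parts = line.split()
    if len(parts) != 13:
        raise ValueError(f"expected 13 columns, got {len(parts)}")
    # Columns 0-1 and 6-11 are integers; the remaining columns are floats.
    int_cols = {0, 1, 6, 7, 8, 9, 10, 11}
    values = [int(p) if i in int_cols else float(p) for i, p in enumerate(parts)]
    return DomainHit(*values)

rows = [
    "1 2 6.8e-233 2.7e-229 762.5 0.0 41 385 95 468 89 472 0.99",
    "2 2 1.5e-07 0.00061 18.4 16.0 422 543 471 591 467 593 0.73",
]
hits = [parse_hit(r) for r in rows]

# Hypothetical cutoff: keep hits with an independent E-value below 1e-5.
confident = [h for h in hits if h.i_evalue < 1e-5]
for h in confident:
    print(h.index, h.ali_from, h.ali_to, h.ali_length)
```

With this illustrative cutoff only the first hit (residues 95 to 468) is kept; the second hit has a much weaker score and a high bias value.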
Sequence Information
- Coding Sequence
- ATGTTCGCGCCGAGAAGAAGTGCGCGAGTTGCCCAAATCGAAAGCGAGATGACAGCGGGCTTTGGCCATTCAGCGGCGGCAAATCAACAatcgcacacgcacacacacacgcacatccAACACAATCACACGCAAAATCACAATCACACTCTCAATCACAATCAAAGTCAGAGTCTGAATCAGAATCAGACGCTGCAGCGAGCCGCAGACAATCTTAACCTCCATCTTAATCAGGATCATAATCACAATCAAACGCAAAACAATCCAAAAACAATGCGCTGTGAAACGAGCCTGGGCATTGGCCGCGCCCACTTTGAGAAGCAGCCGCCCAGCAATTTACGTAAATCGAACTTCTTTCACTTTGTGATCGCCCTGTACGATCGGGCCGGCCAGCCCATCGAGATCGAGCGCACGGCCTTCATTGGGTTCATCGAGAAGGACTCGGAATCGGATGCCACCAAAACCAACAACGGGATACAATATCGCCTGCAGTTGCTCTATGCGAATGGCGCCCGCCAGGAGCAGGACATATTCGTGAGGCTGATCGATTCCGTGACCAAGCAGGCCATCATCTATGAGGGTCAGGACAAGAATCCAGAGATGTGTCGGGTCCTGTTGACGCACGAAGTCATGTGCAGCCGCTGCTGTGATAAGAAGAGCTGTGGCAACCGCAACGAGACGCCATCGGACCCCGTCATTATTGATCGGTTCTTCCTGAAGTTCTTCTTGAAGTGCAACCAGAACTGCCTGAAAAACGCTGGCAACCCGCGGGACATGCGCAGATTCCAGGTGGTCATCTCCACACAGGTGGCCGTGGACGGACCCCTGCTGGCCATCTCCGACAACATGTTCGTGCACAACAACTCGAAGCACGGACGGAGGGCCAAGCGGCTGGACACCACGGAAGGTACAGGCAACACATCCCTGTCCATATCCGGTCACCCCCTAGCGCCCGACAGTACCTACGATGGTCTCTATCCGCCCCTGCCGGTGGCCACGCCCTGCATCAAGGCGATCTCCCCGAGCGAGGGCTGGACCACGGGCGGCGCCACAGTCATCATTGTGGGCGACAACTTCTTCGACGGCCTCCAGGTGGTCTTCGGGACGATGCTCGTGTGGAGCGAGCTGATTACCTCGCACGCCATCCGCGTCCAGACCCCGCCCAGGCATATACCCGGCGTCGTCGAGGTGACGCTCTCCTACAAGAGCAAGCAGTTCTGCAAGGGATCGCCCGGACGCTTCGTCTATGTCTCAGCTCTCAACGAACCCACAATCGACTACGGATTCCAGCGCCTGCAGAAGCTCATTCCGCGCCACCCCGGCGATCCGGAGAAGCTGCAGAAGGAGATCATATTGAAGCGGGCCGCCGACCTGGTCGAGGCCCTCTACTCAATGCCCAGATCTCCGGGCGGCTCCACCGGATTCAACTCCTACGCCGGACAGCTGGCCGTCAGCGTCCAAGACGGCACCGGCCAGTGGACCGAGGACGACTACCAGCGCGCCCAGTCGAGCAGCGTGAGTCCCCGCGGCGGCTACTGCAGCAGCGCCTCCACCCCCCACAGCTCGGGCGGCTCCTACGGCGCCTCGGCCACCACAGCGGCGGTAGCAGCCACGGCCAATGGCTATGCGCCCACACCCAACATGGGCACCCTCTCCTCCTCGCCGGGCAGCGTCTTCAACTCCACGTCAAGAGTGAGCAGCCTGAGTTTCAATCCCTTCGCCCTGCCCACCTGCAACACCCAGGGCTACAGCACCAGCCAGCTGGTGACTTCAACCAAATAA
- Protein Sequence
- MFAPRRSARVAQIESEMTAGFGHSAAANQQSHTHTHTHIQHNHTQNHNHTLNHNQSQSLNQNQTLQRAADNLNLHLNQDHNHNQTQNNPKTMRCETSLGIGRAHFEKQPPSNLRKSNFFHFVIALYDRAGQPIEIERTAFIGFIEKDSESDATKTNNGIQYRLQLLYANGARQEQDIFVRLIDSVTKQAIIYEGQDKNPEMCRVLLTHEVMCSRCCDKKSCGNRNETPSDPVIIDRFFLKFFLKCNQNCLKNAGNPRDMRRFQVVISTQVAVDGPLLAISDNMFVHNNSKHGRRAKRLDTTEGTGNTSLSISGHPLAPDSTYDGLYPPLPVATPCIKAISPSEGWTTGGATVIIVGDNFFDGLQVVFGTMLVWSELITSHAIRVQTPPRHIPGVVEVTLSYKSKQFCKGSPGRFVYVSALNEPTIDYGFQRLQKLIPRHPGDPEKLQKEIILKRAADLVEALYSMPRSPGGSTGFNSYAGQLAVSVQDGTGQWTEDDYQRAQSSSVSPRGGYCSSASTPHSSGGSYGASATTAAVAATANGYAPTPNMGTLSSSPGSVFNSTSRVSSLSFNPFALPTCNTQGYSTSQLVTSTK*
Similar Transcription Factors
Sequences clustered by similarity using MMseqs2
- 100% Identity
- iTF_00580420; iTF_01055013; iTF_00505190; iTF_00891654; iTF_00554183; iTF_00580489; iTF_00580422; iTF_01319539; iTF_00501590; iTF_00901275;
- 90% Identity
- iTF_00580420; iTF_00891654; iTF_00580489; iTF_00580422; iTF_00580492; iTF_00580421; iTF_00580493; iTF_00580423;
- 80% Identity
- iTF_00580420; iTF_00580492; iTF_00580421;