Ppro013801.1
Basic Information
- Insect
- Papilio protenor
- Gene Symbol
- kn
- Assembly
- GCA_029286645.1
- Location
- JAGSMZ010000026.1:2938359-2952294[-]
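The location field packs the scaffold accession, start, end, and strand into a single string. A minimal Python sketch for splitting it apart is given here. It assumes the coordinates are 1-based and inclusive, which the record does not state, and that `[-]` marks the reverse strand. The names `Location` and `parse_location` are hypothetical helpers introduced for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    scaffold: str
    start: int
    end: int
    strand: str


# Matches "<scaffold>:<start>-<end>[<strand>]" as written in this record.
_LOCATION_RE = re.compile(
    r"^(?P<scaffold>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into scaffold, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["scaffold"],
        int(match["start"]),
        int(match["end"]),
        match["strand"],
    )


loc = parse_location("JAGSMZ010000026.1:2938359-2952294[-]")
# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
span = loc.end - loc.start + 1
```

Under that assumption the gene occupies 13,936 bp on the scaffold.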
Transcription Factor Domain
- TF Family
- COE
- Domain
- COE domain
- PFAM
- AnimalTFDB
- TF Group
- Helix-turn-helix
- Description
- This is the helix-loop-helix domain of transcription factor COE. It is responsible for dimerisation [1].
- Hmmscan Out
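| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|----|----------|----------|-------|------|----------------|--------------|----------------|--------------|----------------|--------------|-----|
| 1 | 2 | 4.4e-143 | 3.8e-139 | 465.2 | 0.0 | 181 | 387 | 80 | 304 | 74 | 309 | 0.98 |
| 2 | 2 | 0.0031 | 27 | 3.4 | 3.1 | 442 | 540 | 327 | 423 | 313 | 466 | 0.65 |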
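Each domain hit in the hmmscan output carries thirteen whitespace-separated fields, in the column order listed above. The sketch below shows one way to read the two rows into named records and compare them. The `DomainHit` type, the `parse_domain_row` helper, and the `1e-3` i-Evalue cutoff are illustrative assumptions, not settings taken from this database.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# One converter per column, in the same order as the DomainHit fields.
_FIELD_TYPES = (int, int, float, float, float, float,
                int, int, int, int, int, int, float)


def parse_domain_row(row: str) -> DomainHit:
    """Convert one whitespace-separated domain row into a DomainHit."""
    fields = row.split()
    if len(fields) != len(_FIELD_TYPES):
        raise ValueError(f"expected 13 fields, got {len(fields)}: {row!r}")
    return DomainHit(*(convert(f) for convert, f in zip(_FIELD_TYPES, fields)))


rows = [
    "1 2 4.4e-143 3.8e-139 465.2 0.0 181 387 80 304 74 309 0.98",
    "2 2 0.0031 27 3.4 3.1 442 540 327 423 313 466 0.65",
]
hits = [parse_domain_row(r) for r in rows]

# Hypothetical cutoff, chosen only to illustrate filtering by i-Evalue.
I_EVALUE_CUTOFF = 1e-3
significant = [h for h in hits if h.i_evalue < I_EVALUE_CUTOFF]

# Number of protein residues covered by each alignment (inclusive coordinates).
aligned_lengths = [h.ali_to - h.ali_from + 1 for h in hits]
```

With this cutoff only the first hit is retained. It aligns to protein residues 80 to 304, a stretch of 225 residues, whereas the second hit has an i-Evalue of 27 and a lower accuracy of 0.65.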
Sequence Information
- Coding Sequence
- ATGCAATTTAATGAGTTGATAGTGTGCGTTAACCCGCGCAACGTTTGCGAATACTATTCTAAACCAGCATGTGAATTTGGACTTAATACTGTTAGAATTGAGTTGAAAGTTGTTTGTTTAAGTTGGTTGTCGTATCGGCGGGGCGGCGTGCGCGGGGAGTCGCGCCGGCCCTGGGGACCGCCTCCCGAGGGAGCGGACGACCGGGCCACCAGCCGGGGGAGCCGGGGGGAGGGGAACCAAAGattttttcttaaattcttCCTGAAATGCAATCAAAATTGCTTGAAAAACGCCGGCAACCCTCGAGATATGAGAAGATTTCAAGTGGTGATATCAACTCAAGTGATGGTGGACGGTCCGCTGCTCGCGATATCAGACAACATGTTCGTGCACAATAACAGCAAGCACGGCAGACGCGCTAAGCGCCTAGACCCGTCTGAAGGTACTGACGCCGCGCCAGACAATCATTCAGGTCTGTATCCTCCCCTGCCTGTCGCGACGCCGTGCATCAAGGCGATATCACCGAGTGAAGGCTGGACTTCAGGAGGCTCCACTGTCATTATAGTTGGAGACAACTTCTTCGATGGTCTTCAAGTTGTGTTCGGAACAATGTTAGTTTGGAGTGAGTTAATAACATCACACGCGATACGGGTGCAGACACCGCCTCGGCACATACCCGGCGTCGTCGAGGTGACCCTATCATACAAGAGCAAGCAGTTCTGCAAGGGAGCGCCTGGCAGATTTGTATATGTTTCAGCGCTCAACGAGCCCACCATCGACTACGGTTTCCAGCGACTGCAGAAATTGATACCGAGGCATCCTGGTGACCCTGAGAAATTACCAAAGGAGATAATCCTGAAGCGAGCGGCGGACTTAGCGGAGGCGCTGTACTCTATGCCGCGCAACAACCAGCTGCTGCCGCGAACACCGCCGCCCTCCGCGCCCTTCAACACCTACGCACAAGACGCCACACCCCACCAGTGGACTGAAGAGGAGTACGCGCGCAGCGGCGGCTCGGTGTCGCCGCGGTACTGCGCCGGCGCCGCCACGCCGCACTACGCGCAGCACTACGCGCCGCCCACCTCGCTCTTCAACTCCACCTCACTGTCATTAGGACCTTACCACCCCAGTGCAAACGGACAAATAGCTGACCATCAGAGCTATGACATTTATTACTCGAAAGATAGCGTCCATTACGATGAAAGGAATCAGAAATGCTCGGACGACCTCCAAGCAAACAGTCATACAAAATGCAGCCCCGGCCATATGAAAGGGCAACAGTGTAAAGAGGCCACGCTAAGAAGCGCGTTCGCGGCTGTAAAGCAGAGAATGGGTGGATTGGTGTCGTCTCCGTTCAGCGTCAATCCGTTCTCTTTGCCGACGTGCAGCGCGCAGCAGTATGCACAAACAGCACCTCTAGCCTCCAAGTAA
- Protein Sequence
- MQFNELIVCVNPRNVCEYYSKPACEFGLNTVRIELKVVCLSWLSYRRGGVRGESRRPWGPPPEGADDRATSRGSRGEGNQRFFLKFFLKCNQNCLKNAGNPRDMRRFQVVISTQVMVDGPLLAISDNMFVHNNSKHGRRAKRLDPSEGTDAAPDNHSGLYPPLPVATPCIKAISPSEGWTSGGSTVIIVGDNFFDGLQVVFGTMLVWSELITSHAIRVQTPPRHIPGVVEVTLSYKSKQFCKGAPGRFVYVSALNEPTIDYGFQRLQKLIPRHPGDPEKLPKEIILKRAADLAEALYSMPRNNQLLPRTPPPSAPFNTYAQDATPHQWTEEEYARSGGSVSPRYCAGAATPHYAQHYAPPTSLFNSTSLSLGPYHPSANGQIADHQSYDIYYSKDSVHYDERNQKCSDDLQANSHTKCSPGHMKGQQCKEATLRSAFAAVKQRMGGLVSSPFSVNPFSLPTCSAQQYAQTAPLASK
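The protein sequence can be checked against the coding sequence by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The `translate` helper is a hypothetical name, and only short excerpts from the start and end of the coding sequence are used as input.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Lowercase (soft-masked) bases are accepted; codons containing
    characters outside A, C, G, T are reported as 'X'.
    """
    seq = cds.upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last five codons of the coding sequence above.
n_terminus = translate("ATGCAATTTAATGAGTTG")
c_terminus = translate("CTAGCCTCCAAGTAA")
```

The two excerpts translate to `MQFNEL` and `LASK`, matching the first and last residues of the protein sequence listed above. Passing the full coding sequence to `translate` would give the complete comparison.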
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01145837;
- 90% Identity
- iTF_00180235; iTF_01495572; iTF_01494849; iTF_00249504;
- 80% Identity
- -