Amod021603.1
Basic Information
- Insect
- Allygus modestus
- Gene Symbol
- kn
- Assembly
- GCA_963675035.1
- Location
- OY776111.1:273895663-273905479[-]
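The Location field packs the sequence accession, coordinates, and strand into one string. A minimal sketch of splitting it apart follows. The `Location` type and `parse_location` helper are hypothetical names, not part of the database, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'seqid:start-end[strand]' string."""
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings such as "OY776111.1:273895663-273905479[-]".
_LOC_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, coordinates, and strand."""
    m = _LOC_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m["seqid"], int(m["start"]), int(m["end"]), m["strand"])


loc = parse_location("OY776111.1:273895663-273905479[-]")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = loc.end - loc.start + 1
print(loc, span)
```

The strand is kept as the literal `+` or `-` character so that downstream tools can map it to their own convention.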
Transcription Factor Domain
- TF Family
- COE
- Domain
- COE domain
- PFAM
- AnimalTFDB
- TF Group
- Helix-turn-helix
- Description
- This is the helix-loop-helix domain of transcription factor COE. It is responsible for dimerisation [1].
- Hmmscan Out
-
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  4   1.1e-05   0.43      11.5   0.0   207       231     9         33      3         58      0.89
2  4   0.0022    88        3.9    0.0   202       230     85        113     77        116     0.86
3  4   4.4e-05   1.7       9.5    0.0   202       231     292       321     278       324     0.87
4  4   5.7e-21   2.3e-16   62.0   0.0   202       252     844       894     840       905     0.90
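The Hmmscan output lists four domain hits with very different independent E-values. As a rough sketch, the rows can be parsed into records and filtered. The `DomainHit` type, the helper names, and the 0.01 i-Evalue cutoff are illustrative assumptions, not settings taken from this database.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """One row of the per-domain table (hypothetical field names)."""
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# The four data rows from the record above, one hit per line.
ROWS = """\
1 4 1.1e-05 0.43 11.5 0.0 207 231 9 33 3 58 0.89
2 4 0.0022 88 3.9 0.0 202 230 85 113 77 116 0.86
3 4 4.4e-05 1.7 9.5 0.0 202 231 292 321 278 324 0.87
4 4 5.7e-21 2.3e-16 62.0 0.0 202 252 844 894 840 905 0.90
"""


def parse_hits(text: str) -> List[DomainHit]:
    """Parse whitespace-separated rows; skip lines without 13 fields."""
    hits = []
    for line in text.splitlines():
        f = line.split()
        if len(f) != 13:
            continue
        hits.append(DomainHit(
            int(f[0]), int(f[1]),
            float(f[2]), float(f[3]), float(f[4]), float(f[5]),
            int(f[6]), int(f[7]), int(f[8]), int(f[9]),
            int(f[10]), int(f[11]), float(f[12]),
        ))
    return hits


def significant(hits: List[DomainHit], max_i_evalue: float = 0.01) -> List[DomainHit]:
    """Keep hits at or below an illustrative i-Evalue cutoff (assumed value)."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


hits = parse_hits(ROWS)
kept = significant(hits)
for h in kept:
    print(h.index, h.i_evalue, h.ali_from, h.ali_to)
```

With this assumed cutoff only the fourth hit is retained, the one aligned to protein positions 844 to 894.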
Sequence Information
- Coding Sequence
- ATGCTTCCTAACTCTGACAGGTCCTACCTCACAGTGGTCATCTCGACGCAGATCAGTGTGGAGGGACCCCTGCTTGCCGTCTCCGACAACATGTTACTTTCCCTCTGGCTGGTCCTACCTAACAGTAATGTTGAACCATCAGTGTGGAGGGACCCCAGCTCGCCGTCTCCGACAACATGTTACTTGCCCTCTGACTGGTCAGTGAGGGCATGTTACTTGCCCTCTGACTGGTCAGTGAGGGCATGTTACTTGCCCTCTGACTGGTCCTGCCCCACAGTGGTCATCTCGACGCAGATCAACATGGAGGGATCCATGCTCACCGTCTCCAACAGCATGTTACTTGCCCTCTGCCTGGTCCTACTTAACAGTTATGTTGTACCGTATGTTGACAGGTGGTCATCTCAACGCAGATCAGTGTGGAGGGACTCCTGCTTGCCGTCTCCAACAGCATGTTACTTGCCCTCTGACTGGTCCTACCTCACAGTAATGTTGTACCGTATGTTCACAGGTGGTCATCTCGACGCATATCAGTGTGCAGGGACCCCTGCTCGACGTTTCCGACAACATGTTCCTTGCCGTCTGACTGGTCTTACCTCACAGTGGTCATCTCAACGTAGATCAGTGTGGAGGGACCCCTGCTCGACGTTTCCGACAACATTTTCCTTGCCGTCTGACTGGTCTTACCTCACAGTAATGTTGTGCTGTATGTTGACAGGTGGTCATCTCAACGTAGATCAGTGTGCAGGGACCCCTGCTCGACGTTTCCGACAACATGTTCCTTGCCGTCTGACTGGTCTTACCTCACAGTGGTCATCTCAACGTAGATCAGTGTGCAGGGACCCCTGCTCGACGTTTCCGACAACATGTTCCTTGCCGTCTGACTGGTCTTACCTCACAGAGGTCATCTCGACGCAGATCAGTGTGGAGGGACCCCTGCTCGCCGTCTCCGACAACATGTTACTTTCCCTCTGGCTGGTCCTACCTCACAGTGGTCATCTCGACGCAGATCAGTGTGGAGGGATCCCGGCTCGCCGTCTCCGACAACATGTTACTTGCCCTCTGACTGGTCCTACATCACAGTGGTCATCTCGACGCATATCAGTGTGCAGGGACCCCTGCTCGACGTTTCCGACAACATGTTCCTTGCCGTCTGACTGGTCTTACCTCACAGTAATGTTGTGCTGTATGTTGACAGGTGGTCATCTCAACGTAGATCAGTGTAGAGGGACCCCTGCTCGACGTTTCCGACAACATTTTCCTTGCCGTCTGACTGGTCTTACCTCACAGTGGTCATCTCAACGTAGATCAGTGTGCAGGGACCCCTGCTCGACGTTTCCGACAACATGTTCCTTGCCGTCTGACTGGTCTTACCTCACAGTAATGTTGTGCTGTATGTTGACAGGTGGTCATCTCAACGTAGATCAGTGTGCAGGGACCCCTGCTCGACGTTTCCGACAACATGTTCCTTGCCGTCTGACTGGTCTTACCTCACAGTGGTCATCTCAACGTAGATCAGTGTGCAGGGACCCCTGCTCGACGTTTCCGACAACATGTTCCTTGCCGTCTGACTGGTCTTACCTCATAGTAATGTTGTGCTGTATGTTGACAGGTGGTCATCTCAACGTAGATCAGTGTGCAGGGACCCCTGCTCGACGTGGTCATCTCAACGTAGATCAGTGTGCAGGGACCCCTGCTCGACGTTTCCGACAACATGTTCCTTGCCGTCTGACTGGTCCTACCTCACAGTGGTCATATCAACGTAGATCAGTGTGCAGGGACCCCTGCTCGACGTTTCCGATAACATGTTCCTTGCTGTCTGACTGGTCTTACCTCACAGTAAAGTTGAACCGTATGTTGACAGGAGGTCATCTCGACGCAGATCAGTGTGGAGGGACCCCTGCTCGCCGTCTCCGACAACATGTTACTTTCCCTCTGGCTGGTCCTACCTCACAGTGGTCATCTCGACGCAGATCAGTGTGGAGGGATCCCGGCTCGCCG
TCTCCGACAACATGTTACTTGCCCTCTGACTGGTCCTACATCACAGTGGTCATCTCATCTCGACACAGATCAGTGTGGAGGGACCACTGCTCGCCGTCTCCGACAACATATTACTTGCCCTCTGACTGGTCCTTCCTCACAGTAATGTTGTACCGTATGTTGATAGGTGGTCATCTCGACACAGATCAGTGTGGAGGGACCCCTGCTCGCCGTCTCCGACAACATGTTACTTGCCCTCTGACTGGTCCTTGCTCACAGTGGTCATCTCGACGCAGATCAGTGTGGAGGGACCCCTGCTCGCCGTCTCCGACAACATATTACTTGCCCTCTGACTGGTCCTTCCTCACAGTAATGTTGTACCGTATGTTGATAGGTGGTCATCTCGACGCAGATCAGTGTGGAGGGACCCCTGCTCGCCGTCTCCGACAACATGTTACTTGCCCTCTGACTGGTCCTTGTTTACAGTGGTCATCTCGACGCAAATCAGTGTGGAGGGACCCCTGCTCGCCGTCTCCGACAACATATTACTTGCCCTCTGACTGGTCCTTGCTCACAGTGGTCATCTCGACGCAGATCAGTGTGGAGGGACCCCTGCTCGCCGTCTCCGACAATATGTTCGTCCACAACAACTCCAAACACGGCCGGAGAGCCAAGAGGCTCGACCCCTCGGACGCGCTACATCCCTCCCCCACCCCTTGGCTGGCAATCACTGATAGCGGCTTGGCTCTGCCCTTATCACTGATTACCGCTAATCAGTCTAATCTTCATCAGAGCATCTTCTTCCTTGGCTTGCCCACCATCACAAGCGTGTTCACGCAGGCTCCGGCAGAGAGCAGCACATTACAGCGTACGTTGCCGCCGGACGCTCCGCACCGCGACGCTAGCGTCTTGACTCCGTGA
- Protein Sequence
- MLPNSDRSYLTVVISTQISVEGPLLAVSDNMLLSLWLVLPNSNVEPSVWRDPSSPSPTTCYLPSDWSVRACYLPSDWSVRACYLPSDWSCPTVVISTQINMEGSMLTVSNSMLLALCLVLLNSYVVPYVDRWSSQRRSVWRDSCLPSPTACYLPSDWSYLTVMLYRMFTGGHLDAYQCAGTPARRFRQHVPCRLTGLTSQWSSQRRSVWRDPCSTFPTTFSLPSDWSYLTVMLCCMLTGGHLNVDQCAGTPARRFRQHVPCRLTGLTSQWSSQRRSVCRDPCSTFPTTCSLPSDWSYLTEVISTQISVEGPLLAVSDNMLLSLWLVLPHSGHLDADQCGGIPARRLRQHVTCPLTGPTSQWSSRRISVCRDPCSTFPTTCSLPSDWSYLTVMLCCMLTGGHLNVDQCRGTPARRFRQHFPCRLTGLTSQWSSQRRSVCRDPCSTFPTTCSLPSDWSYLTVMLCCMLTGGHLNVDQCAGTPARRFRQHVPCRLTGLTSQWSSQRRSVCRDPCSTFPTTCSLPSDWSYLIVMLCCMLTGGHLNVDQCAGTPARRGHLNVDQCAGTPARRFRQHVPCRLTGPTSQWSYQRRSVCRDPCSTFPITCSLLSDWSYLTVKLNRMLTGGHLDADQCGGTPARRLRQHVTFPLAGPTSQWSSRRRSVWRDPGSPSPTTCYLPSDWSYITVVISSRHRSVWRDHCSPSPTTYYLPSDWSFLTVMLYRMLIGGHLDTDQCGGTPARRLRQHVTCPLTGPCSQWSSRRRSVWRDPCSPSPTTYYLPSDWSFLTVMLYRMLIGGHLDADQCGGTPARRLRQHVTCPLTGPCLQWSSRRKSVWRDPCSPSPTTYYLPSDWSLLTVVISTQISVEGPLLAVSDNMFVHNNSKHGRRAKRLDPSDALHPSPTPWLAITDSGLALPLSLITANQSNLHQSIFFLGLPTITSVFTQAPAESSTLQRTLPPDAPHRDASVLTP
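The protein sequence can be checked against the coding sequence by translating it codon by codon. The sketch below assumes the standard genetic code, which this record does not name explicitly, and uses a hypothetical `translate` helper. It is applied only to the first 30 and last 18 bases of the coding sequence to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (assumption: the record uses the standard table).
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str, to_stop: bool = False) -> str:
    """Translate a DNA coding sequence in frame 0.

    Stop codons become '*'; codons with unexpected characters become 'X'.
    Trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the coding sequence above.
head = translate("ATGCTTCCTAACTCTGACAGGTCCTACCTC")
tail = translate("AGCGTCTTGACTCCGTGA", to_stop=True)
print(head, tail)
```

The two fragments translate to the first ten and last five residues of the listed protein, with the final `TGA` read as the stop codon.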
Similar Transcription Factors
Clustered by sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -