Clig004240.1
Basic Information
- Insect
- Craniophora ligustri
- Gene Symbol
- -
- Assembly
- GCA_905163465.1
- Location
- LR990959.1:12106921-12125141[-]
Transcription Factor Domain
- TF Family
- CUT
- Domain
- Homeobox|CUT
- PFAM
- PF02376
- TF Group
- Helix-turn-helix
- Description
- The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein (eg Swiss:P10180).
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 10 0.78 7.7e+03 -2.3 0.0 7 30 247 270 243 272 0.87 2 10 0.78 7.7e+03 -2.3 0.0 11 27 285 301 280 312 0.68 3 10 0.0056 55 4.6 0.0 6 27 349 370 346 375 0.92 4 10 0.032 3.2e+02 2.1 0.0 6 29 509 532 505 560 0.83 5 10 0.18 1.8e+03 -0.3 0.0 8 38 579 608 575 621 0.81 6 10 1.2 1.2e+04 -2.9 0.0 7 31 612 636 606 639 0.84 7 10 0.052 5.1e+02 1.5 0.0 5 27 675 697 672 709 0.86 8 10 0.95 9.4e+03 -2.6 0.0 9 28 814 833 812 859 0.67 9 10 0.024 2.4e+02 2.5 0.0 8 30 886 908 880 916 0.87 10 10 0.32 3.1e+03 -1.1 0.0 7 27 953 973 950 978 0.85
Sequence Information
- Coding Sequence
- ATGCTAAAAGGTAAAAACGCAAAGGAAATCAATGAACTACTCGAGAGTTTACGCCCCAGTGAAGATGAAGATATTTTGATGAAGAAAGTACGTAAAGCTATAAAACGGGGTGGATTATCACCTTCTGTCGTGGATATACTCACTAACATGCATCCTAGCAAATCACACTCGAATATCGGAAAAGGCTCAGAAAAAAGTCCAACTTGCTCTTCACTAGATAATATAGAAGAGGCTATACTTATGAAAAAGATTCGTAAAGCGATAGGAAAGGGCCACCTGTCGCCTGCTGTCATGGAGGATATACTAGCCGATGTACGTCCCTCGGACAGCATCTGTCAAGTTAAGAAGAAGGTCCAGAAAGCGGTGAAAGAGGGTCTACTACCGCCAGCTGTTCTGGGTGAAGTACGTACTGATATACTCCCGACTGATAATAAGTTTGAAGTTATAGAGAAGGTTCGTAAAGTAATGCTAAAAAGACGTGTGCCACCCCATGTCTTAGATGACATCCTCCCGTGTCAACAGCCTAGTGCTACCAAGTCAGAAGTCATAAAGAATGTACAAAGAGCGGTAGAAAAGGGCCTATTATCACCCGCTGTCATCGATGACATATTTGAAAACTTACATCCTGGTGATAGAAAATGTCAATTTTTAAAAACAGTACGCAAAGCGATAGCAAACAATCATGTCACACATTCTGCGATGAACGAAATACTTGAAGATGTACGTGATAGTGACAATAAGTCTGAAATCGTAGAGAACGTCCGTAAAGCGGTTGAAAGGGGCTCACTACCGCCCGCTGTTATGGGTGAAGTACTCAAAGAAATACGCCCAAATGACGATAAACATCAAGCCATTGAGAAAGTTCGTGAAGCGGTAGAAAAGAGGCACGTGCCACAAAAAGTCATTGAAGATATATTCAAAAAACCAAAACCTACCGGTAGTCCAAAGTCAGCAATTATGTGGAAGATTCGAAAAGCTGTACAGAACGGAAGAATACCGCTCGGTGCTTTAAATGCAATACTCACAGATTTAAACCCAAGTGATGATACGTATCAAATTGTGGATAAGGTCCGTAGAGCTTTAGAAAAGAATCACGTGTCGCCACACATTATCGACGATATTCTTTCTAAACAATTGCCGCGTGAACTTGAACTTTTGGACACTATCCGCAAAGCGGCGGAGCAGGGCCATCTGTCTACCTACGTTTTAGATGATCTACTTGATAGCATACAACCTGGCGATACTGAGTCACAAGTTATGAAGAAAGTACGCAAAGCTATAGCAAAGAGGCATGTTACGCCATCGGTCATTGAAGATGTACTCGCAAATGTACACGCAAGTGATGATAAATCTGAAATTTTGGACAAAGTCCGCAGTGGTATGTCTACAGACAAATTTAATTTAATATTCTTCACTCATATTGATTCTATATCTAATATCATCATTTCTTTTAAAGCTGTAGAAAAAGGACACCTACCGCCCGCAGTTTTGGGCGAAGTACTTAGCAATTTAAAGCCAACTGACGATAAGTATGAAGTCATAGATAAAGTACGTAATGCTATGAAAAAGCGGCAAGTGCCTCCAAATGTCATGAATGACATCCTAGCTTTCCAAAAGCCCGGTAGAAGTGCGTCAGAAATTTTAAGCAGTATACGTAAAGCGGTAAAGCGGGGGCATGTACCACCTAGTGTCCTGGATGACATACTCGGCAGTGTAAGTCCTGAAGATCGAAAGTCCCAAGTTTTGAAGAAAGTCCGTAAAGCCTTTACAAAAAGGAATATATCGCCGTCTGTAGTGGATAAATTACTGGCTAATGTACAAGGTAGCGACAATAAATCTGAAGTTATAAACAATGTTCGTAAAGCTATAGAGAAGGGTCAATTGCCTCCAGCTATTCTGGGCGAAATACTCGCTAACATACGTCCCAGTGATGATCAGCCAAAAGTTACAGAAAAAGTTCGTCAAGCGTTGCTAAAATGGCATGTGCCACCACATATCATGAACAATGTACTGGGTAGTCCTGTTGATAAGTCCGAAATCATTAGGAATATCAAAAAAGCCATGGGTAAAGCAAAACTGTCACCTTCCCTTATGGATGATATTATCGCTAGTGTACAGCCTGGTGATAATAAATCACAAGTTTTAAAGAAGATTCGCAAAGTGGTCGCCAAGAGGCATATGCCACCAGCATTGATGAATGATCTACTTACTGATATCAGTGACAGTGAAGATTCTTCCCAAGTCATGAAGAAAATCCGCTCAGCGGCAGATAAAGGTAAATTGCCACCTGCTCTCATGGATGATATACTGGCACATGTACTACCCGACGACGATAAGTCTGATGTCCTGAAAAAAGTCCGCACGGCGGTTGAAAAAAGCCAAGTTCCACCACACCTTGTTGACAGTATAGCCGAGAAAATACGTCCAGAGGAAGGAGATAAGATTATGAAAGAAGTCCAAACAGCGGTAGACAAGAGAAAAATGACACCATCGCTTTTTAAGGCGATAAAAGATTTAGGAAAAGGTGATCCAGATGATAGTATGGATAGAATGTCGGAACTCTTAAAAAACATCCGTAAAGCTGTAGCGGAGGAAAACCTCTCACCAGCTGTTTTGGAAGATATTCTGGCCAACATACAACCAAATGACAAAAAGTCTGATTTGGTTAAGAAGGTCCGTCAAGCCATAGCAAAGAGGCACGTATCCCCGTCCGTTTTTGAGGAAATGCTTGACAGTGCCAACCAAAGTGAAGATGGTCCAGCAATTGTGAACAAGATTCGGAAAGCTGTAGATAAAGGGCAACTACCACCAGATGTAATGCATGATATTCTTGCAAGTGTTCAGCCCAGTGATGACAAGGACGCAGTCATTAAAAAAGTACGGGAAGCTCTAGAAAAAAGAAAGGTATCACCATATGTAATGGATAATGTACTCGGAAATATATATCCTCATGATAATAAGGCTGAAATTATAATGAATGTTAGAAGAGCGGTGGAAAAGGGCCACTTATCACCCAAAGTTTTAGATGAAGTTCTTGCTAAAGTACATCCTAGTGACAATAAATCTCAAGTTATGAAGAAGATCCGCAAAGCCGTGCAAAATGGACAACTGCCACGCGATGTCATAAGCGAAGTACTCGACAATATCCGCCCCAGTGACACAAAGTCCGCCATTGTGGAGAAAATCCGAGAAGGTGAATTATATGCATGA
- Protein Sequence
- MLKGKNAKEINELLESLRPSEDEDILMKKVRKAIKRGGLSPSVVDILTNMHPSKSHSNIGKGSEKSPTCSSLDNIEEAILMKKIRKAIGKGHLSPAVMEDILADVRPSDSICQVKKKVQKAVKEGLLPPAVLGEVRTDILPTDNKFEVIEKVRKVMLKRRVPPHVLDDILPCQQPSATKSEVIKNVQRAVEKGLLSPAVIDDIFENLHPGDRKCQFLKTVRKAIANNHVTHSAMNEILEDVRDSDNKSEIVENVRKAVERGSLPPAVMGEVLKEIRPNDDKHQAIEKVREAVEKRHVPQKVIEDIFKKPKPTGSPKSAIMWKIRKAVQNGRIPLGALNAILTDLNPSDDTYQIVDKVRRALEKNHVSPHIIDDILSKQLPRELELLDTIRKAAEQGHLSTYVLDDLLDSIQPGDTESQVMKKVRKAIAKRHVTPSVIEDVLANVHASDDKSEILDKVRSGMSTDKFNLIFFTHIDSISNIIISFKAVEKGHLPPAVLGEVLSNLKPTDDKYEVIDKVRNAMKKRQVPPNVMNDILAFQKPGRSASEILSSIRKAVKRGHVPPSVLDDILGSVSPEDRKSQVLKKVRKAFTKRNISPSVVDKLLANVQGSDNKSEVINNVRKAIEKGQLPPAILGEILANIRPSDDQPKVTEKVRQALLKWHVPPHIMNNVLGSPVDKSEIIRNIKKAMGKAKLSPSLMDDIIASVQPGDNKSQVLKKIRKVVAKRHMPPALMNDLLTDISDSEDSSQVMKKIRSAADKGKLPPALMDDILAHVLPDDDKSDVLKKVRTAVEKSQVPPHLVDSIAEKIRPEEGDKIMKEVQTAVDKRKMTPSLFKAIKDLGKGDPDDSMDRMSELLKNIRKAVAEENLSPAVLEDILANIQPNDKKSDLVKKVRQAIAKRHVSPSVFEEMLDSANQSEDGPAIVNKIRKAVDKGQLPPDVMHDILASVQPSDDKDAVIKKVREALEKRKVSPYVMDNVLGNIYPHDNKAEIIMNVRRAVEKGHLSPKVLDEVLAKVHPSDNKSQVMKKIRKAVQNGQLPRDVISEVLDNIRPSDTKSAIVEKIREGELYA
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -