Rped017858.1
Basic Information
- Insect
- Riptortus pedestris
- Gene Symbol
- lilli
- Assembly
- GCA_019009955.1
- Location
- GWHBAZH00000001:77828365-77922376[+]
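The location string above packs the scaffold, coordinates, and strand into one field. The following is a minimal sketch of how it could be split into parts. The `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for one parsed location field."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not stated in the record).
        return self.end - self.start + 1


# Pattern for strings of the form "<seqid>:<start>-<end>[<strand>]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into scaffold, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("GWHBAZH00000001:77828365-77922376[+]")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 94,012 bp on the plus strand.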
Transcription Factor Domain
- TF Family
- AF-4
- Domain
- AF-4 domain
- PFAM
- PF05110
- TF Group
- Unclassified Structure
- Description
- This family consists of the nuclear proteins AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome). These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and intellectual disability [1]. The family also contains Lilliputian, a Drosophila homologue of AF4 that contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
- Hmmscan Out
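| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 7 | 6.8e-10 | 6.4e-06 | 25.8 | 0.0 | 4 | 90 | 10 | 98 | 9 | 193 | 0.70 |
| 2 | 7 | 9.8e-07 | 0.0093 | 15.3 | 0.5 | 346 | 386 | 195 | 235 | 179 | 257 | 0.76 |
| 3 | 7 | 0.00054 | 5.1 | 6.3 | 12.4 | 415 | 513 | 294 | 403 | 260 | 404 | 0.51 |
| 4 | 7 | 0.15 | 1.5e+03 | -1.8 | 3.1 | 203 | 264 | 442 | 505 | 413 | 529 | 0.58 |
| 5 | 7 | 1.7 | 1.6e+04 | -5.2 | 11.4 | 150 | 255 | 454 | 556 | 440 | 567 | 0.47 |
| 6 | 7 | 2 | 1.9e+04 | -8.6 | 23.8 | 80 | 240 | 539 | 687 | 517 | 710 | 0.35 |
| 7 | 7 | 2 | 1.9e+04 | -9.0 | 27.0 | 76 | 236 | 860 | 1012 | 853 | 1058 | 0.41 |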
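The seven per-domain hits in the hmmscan output differ widely in significance. As a rough sketch, the rows could be filtered by independent E-value as shown below. The `DomainHit` type, the `significant_hits` helper, and the 0.01 cutoff are assumptions chosen for illustration, not thresholds used by the database; only a subset of the columns is carried over.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record holding a subset of the hmmscan columns."""
    index: int
    c_evalue: float
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int


# Values copied from the hmmscan output of this record.
HITS = [
    DomainHit(1, 6.8e-10, 6.4e-06, 25.8, 10, 98),
    DomainHit(2, 9.8e-07, 0.0093, 15.3, 195, 235),
    DomainHit(3, 0.00054, 5.1, 6.3, 294, 403),
    DomainHit(4, 0.15, 1.5e03, -1.8, 442, 505),
    DomainHit(5, 1.7, 1.6e04, -5.2, 454, 556),
    DomainHit(6, 2.0, 1.9e04, -8.6, 539, 687),
    DomainHit(7, 2.0, 1.9e04, -9.0, 860, 1012),
]


def significant_hits(hits: List[DomainHit], max_i_evalue: float = 0.01) -> List[DomainHit]:
    """Keep hits whose independent E-value is at or below the cutoff.

    The default cutoff is an illustrative assumption.
    """
    return [hit for hit in hits if hit.i_evalue <= max_i_evalue]


kept = significant_hits(HITS)
for hit in kept:
    print(hit.index, hit.i_evalue, hit.score, f"{hit.ali_from}-{hit.ali_to}")
```

With this cutoff only the first two hits remain, covering protein residues 10-98 and 195-235.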
Sequence Information
- Coding Sequence
- ATGGACCTGTCGACAAGCGATTGCGTTGAGCGGGACCGACTTCGCGAGCGGGATCGGCAGGCCCGAGCGCTGATGAACTCTGAGCCGAGCCCTACATCAGGAGGCGCCCCTTTATTCGGCGCTCCAGTCAGGGTTACACCCGCACCCGGGGATAGAATCAGGCAGAATATTCAGAGTAAGCTCGGGGAATATGATAAAGTTAAGCACCTGCTTGAAGAGCCAAAGCAGTTGCTGGGCTTTGACGGTATTCCTCCGGCATCTCCGGCCCCTGGGCCCACGCCACCACCTCCCAGGCCCCATGAGTTCAAGAAGCCACCTCACCACTCCATTGGCCAGGGACGGCAGAATCACCATCCCACCAATAGAGCTGGTGGGTTCGTGAAGCCTGCCGATGGGAAGCCAGCCTATCACGGACGTGGCTTCTACCCTGGCCAATCTGTGAAGCACACTGAACCGAGGGCAAATGGTGTCGTAAACTCATCAAAAGGGCCGCCTTTGCACATGCCTAGGCCTCCCAACAGGAACACACATCGACCTCATCTTGATCTGCATCACAAGGACATGGGGCGGTCTACTGCTTCAACAAGTGACGTTGACCACATTTTAAAAGAAATGACGGAAGTGACACCTCCCCTGACTGCTATTGCTTCAACCCCGAGGAAGGAAGTGGATTCAAAATTCCATTTCACTCCAGATATATTTAGTCAGTATTCAGAGGCAGTGGAACACAAAAGTTCAGTGATATCACCGAGGATGGAATCACCACCTCCGATGCCGCTCTCACCGCCGCCAGCTACACCCATTCCTCAGAAACAATACAAGCCACTTAATGAACTTGGAAGCAGAAATGGGCATGCTCAAAAATCATACAACGATGATTTTATCAGTGACTTGGCACTTTCTGAGGATAGTGAAGATGATGTAAAGATAAGTGAATGTACCAAACCTCCAGTTCCTGAGCAACAACCCAGGATTGAACCAGAACCTTCACCAATTCCAACTATAGTTTTGACCTCTCCTCCTGCAAGTGTAGTCCCTGAACCTGAGAGACCTATGCCTACATCTCCCGAAGGAAGTTCATCTAGTGAATATTCAGGGACTGATTCCTCAGAAAGTGATTCAAGTACATCCCCTCCAGCTGCCACTCAAGACTACCCAAACCCCATTCCAGAGCAACCTCAACTTAGTTGGGATTTAAAATCATTTCTCCAGCCACAAAAGGATATTTTGGAAAGAAAGCAACCAACTCCAGAACCCCAGCCAGCTGTAGAAAAACCTAAACTTGAGGATGATTCAGAAATTATTCTCGGTGAGGTTTTGACTAAACCACCCCAACCTTTACTATCAAGTTTGTCAGATAGTGATGCAGAGAAAAAAGGAAAACCTAAGAGTTCAGGTAGAACAAAGAAAAGGAGGAGGGTTGTAAGGAAATCTCCTGATGTAAGGACTATAGCATCAGACAGTGAGGAAGAAGCAGAGGTTGTGCTATCAACTCCACCTTCTCCTAGACCTAGCAGAATCCCAGCTAAAGTTCCATCTGGAAGTGAAGGGGAAGTAAAATCCACTGCTAAACGAGAAAAGTCCACAAGAAAAAATCAGCTCACTCCTTCTAGGAAACCTTCTGTTGAAAGAGGTCGAAAAAGGACTCCTGCCACCTCAAGTTCAACTCCTAAAAGTCCTGAATCTGAATCCCCTACAAAGAGAGGTCGTAAAAGGCCAATGAAGCCTCCAGTAACACCAAATGATACTGATGAAGATTCTAAAAGAAAAAGGATTCCAAGTAATTCTGAATCAGATGATGGGCCCTCCTGGAATCATGCTCCTGTTTCAAGAAAGAGTAATAGACGAAAGTCTGCCAAAGAAAGTTCAAGTGAGCGAACTAAGAGAGAATCCCCTTTCATCAGGATATCAAGTAGTGATGAAGAAGAAACCACTGTTGCAACATCTCCTTTGAAACCTGTCTCAAGAGTAGCCCCTGCCCCCTCTGGAAAAAGGCCT
CCAAAGGGGTCAAGTGAAAGTGATGCTGAAAAAGAGGAAAGCCCTCCAAAACTTGATTCTGATGGTCAAACTATTCAAGACAAAAAGAAAAATGATACATTAAGAAAACTCTTTTCAAAGAGAGATAATGAAGGAGGTGGTGGAAAAGGCGGTGGCAAAGGAGGAGGTAAAGGAGGAGGTAAGGGTGGAGGCAAAGGAGGAGGTAAATCAGGTGGAAAAGGAGGAAAATGTGGAATTCAAATAATTGAATGTGACACTGAAAGAGCGTCCCCTCATGTTACAAAACCTGAAGTTATTTCACCTGTTTTGCCTCAAAATGCCAGGCTGCAGGAGCCATTGAATGAAGAAGAACCAAAGCCAAAAACTCCTGTATACCCACAAGTTTCCTACATCAATAACAAAATATCTATTATGTGTAGGATTGACTTGTCGAGAATTACCCATGTACCCAGTAAGCGGAGGTCTGAGGAAATTCGAACCAGAACAGAACTACCTGATACCAGGCAAGCAAAAGTTTCTAAAACAGAGCCTGAGGCCCCTCCACCACCTCCTCCTCCACCTCCCCCTCCTCCTCCTCAGCCAAAATCTGAGAAGCCTGAAAAAACAGAAAAACCTGAAAAGCCTGAAAAACGAGATCGTAAATCTTCTGATAAGCATCGTTCAAGGGATGATAAACTAAAGAAACGTAAGAAGTCACCAATTGTAGAAACTAAGGAGGAGCAACCCCCAGTGACTAATTCAATAGAAGCGTTTAAACGTTTGACTCCAGATGTACATGTGGAACCTGAAAGCAGTACTTCAAGTTCCACTGTTAACAATAAAAGGGATAGAAGATCAAGTACAAGTTCTACTTCCTCCCATGCAAGTAACTTGTATCATAACAATTCACATTCAAGTGCAAAAAAGAAAAAGTATGGGGAAAGGCTTGGCTCAAAGAGTGAAGGAGCATTGGTGGATGCCCCACCTACTAACCATGAGCGGGATCGAAAGGAACTTGACAGGGAGAAGGAAAAAGAGAAAGAAAAGGAGAAAAAGAAAAACTTGCAGTATTTTTCATACTTTGAGCGTGAAGAAGAAGAACCAAATGATAGTGATAAAAAGGACTTTTATTTGGCGGAAGCAAAACGCCTAAAAAGTAATGCAGAAAAAGAAAAAGATGAAGCTGCCCAGGGCATGCAGTACCTTGAAGCTGTAATGTATTTCCTACTCACTGGAAATGCAATGGAGCGTGAAAGTGACTCTGAGAAGTCAGTTCACAAAGTCTATACCGACACACTGAAACTCATCATGACAGCTGGCTATCAGCTTCGTAGGCCAACAAGTTTTAATTTAAAGGACATTGCATCTAAATTCAGGAGTCAACAGTATCAGCAGTCTAGCATTCACAGCAAACTGGCAATTCTGGGTTTAAGAAGCCAGTCACTTATTCATTTAAAATTATACAAGATGAAGAAGGGCGAATTGAATGAAGCTGTGAAGGTAGTGACTGACTACATGCAAAAGGCCCCTACAACAATAGATCAAAGAGAAATTGGAATTCCTGGCCAAGGAACTCCTTCACCACTTTCTCCTACTCCTTCTCCTGCAGGGAGTGTTGGCTCAGTGGGATCTCAGTCTTCAGGGTATAGCAGTGGAGGTGGACCTATCCCAATTCAGCCGCATATGCCTCCTGCACCTTTGGGACAGGTGGTTGCTACTCCTCTTCATGTACATTCTTATCTTCATAAAGAACACACACATTTTAATTCCTTAATGGCTTGTCATGACCTCTGGGAACAAGCTGACACACTTGTGCTTAAAGGAAAACACAAAGATTTCTTCATTGAGCTTGATCGAAGCTGTGGACCGTTGACCCTACACAGCACCTTGAAAGAATTGGTGAAGTATGTAAGGGTTGGAATTGCTCGATTGAAGGAAATGAGAAATGAATCAACGTGA
- Protein Sequence
- MDLSTSDCVERDRLRERDRQARALMNSEPSPTSGGAPLFGAPVRVTPAPGDRIRQNIQSKLGEYDKVKHLLEEPKQLLGFDGIPPASPAPGPTPPPPRPHEFKKPPHHSIGQGRQNHHPTNRAGGFVKPADGKPAYHGRGFYPGQSVKHTEPRANGVVNSSKGPPLHMPRPPNRNTHRPHLDLHHKDMGRSTASTSDVDHILKEMTEVTPPLTAIASTPRKEVDSKFHFTPDIFSQYSEAVEHKSSVISPRMESPPPMPLSPPPATPIPQKQYKPLNELGSRNGHAQKSYNDDFISDLALSEDSEDDVKISECTKPPVPEQQPRIEPEPSPIPTIVLTSPPASVVPEPERPMPTSPEGSSSSEYSGTDSSESDSSTSPPAATQDYPNPIPEQPQLSWDLKSFLQPQKDILERKQPTPEPQPAVEKPKLEDDSEIILGEVLTKPPQPLLSSLSDSDAEKKGKPKSSGRTKKRRRVVRKSPDVRTIASDSEEEAEVVLSTPPSPRPSRIPAKVPSGSEGEVKSTAKREKSTRKNQLTPSRKPSVERGRKRTPATSSSTPKSPESESPTKRGRKRPMKPPVTPNDTDEDSKRKRIPSNSESDDGPSWNHAPVSRKSNRRKSAKESSSERTKRESPFIRISSSDEEETTVATSPLKPVSRVAPAPSGKRPPKGSSESDAEKEESPPKLDSDGQTIQDKKKNDTLRKLFSKRDNEGGGGKGGGKGGGKGGGKGGGKGGGKSGGKGGKCGIQIIECDTERASPHVTKPEVISPVLPQNARLQEPLNEEEPKPKTPVYPQVSYINNKISIMCRIDLSRITHVPSKRRSEEIRTRTELPDTRQAKVSKTEPEAPPPPPPPPPPPPPQPKSEKPEKTEKPEKPEKRDRKSSDKHRSRDDKLKKRKKSPIVETKEEQPPVTNSIEAFKRLTPDVHVEPESSTSSSTVNNKRDRRSSTSSTSSHASNLYHNNSHSSAKKKKYGERLGSKSEGALVDAPPTNHERDRKELDREKEKEKEKEKKKNLQYFSYFEREEEEPNDSDKKDFYLAEAKRLKSNAEKEKDEAAQGMQYLEAVMYFLLTGNAMERESDSEKSVHKVYTDTLKLIMTAGYQLRRPTSFNLKDIASKFRSQQYQQSSIHSKLAILGLRSQSLIHLKLYKMKKGELNEAVKVVTDYMQKAPTTIDQREIGIPGQGTPSPLSPTPSPAGSVGSVGSQSSGYSSGGGPIPIQPHMPPAPLGQVVATPLHVHSYLHKEHTHFNSLMACHDLWEQADTLVLKGKHKDFFIELDRSCGPLTLHSTLKELVKYVRVGIARLKEMRNEST
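The coding and protein sequences above can be checked against each other by translation. The sketch below assumes the standard genetic code and uses only the first and last 24 nucleotides of the coding sequence to stay short. The `translate` helper and the codon-table construction are illustrative, not the pipeline that produced this record.

```python
from itertools import product

# Standard genetic code, laid out in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; stop codons are written as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last 24 nucleotides of the coding sequence in this record.
cds_head = "ATGGACCTGTCGACAAGCGATTGC"
cds_tail = "GAAATGAGAAATGAATCAACGTGA"

print(translate(cds_head))  # start of the protein
print(translate(cds_tail))  # end of the protein, followed by the stop codon
```

The translated fragments match the start (`MDLSTSDC`) and end (`EMRNEST`) of the listed protein sequence, with the final `TGA` codon acting as the stop.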
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -
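The three identity tiers above correspond to clustering runs at different minimum sequence identities. As a hedged sketch, the commands for such runs could be assembled as shown below. `mmseqs easy-cluster` and its `--min-seq-id` option belong to MMseqs2, but the file names, output prefixes, and the choice to leave every other parameter at its default are assumptions. The record does not state the settings actually used.

```python
import shlex
from typing import List


def easy_cluster_command(fasta: str, out_prefix: str, tmp_dir: str, min_seq_id: float) -> List[str]:
    """Build an `mmseqs easy-cluster` argument list for one identity threshold."""
    return [
        "mmseqs", "easy-cluster",
        fasta, out_prefix, tmp_dir,
        "--min-seq-id", f"{min_seq_id:.1f}",
    ]


# Hypothetical input file and output prefixes, one run per identity tier.
commands = [
    easy_cluster_command("tf_proteins.fasta", f"clusters_{round(t * 100)}", "tmp", t)
    for t in (1.0, 0.9, 0.8)
]

for cmd in commands:
    print(shlex.join(cmd))
```

The sketch only prints the commands. Running them would require an MMseqs2 installation and a FASTA file of the proteins to be clustered.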