Basic Information

Gene Symbol
lilli
Assembly
GCA_019009955.1
Location
GWHBAZH00000001:77828365-77922376[+]

Transcription Factor Domain

TF Family
AF-4
Domain
AF-4 domain
PFAM
PF05110
TF Group
Unclassified Structure
Description
This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 7 6.8e-10 6.4e-06 25.8 0.0 4 90 10 98 9 193 0.70
2 7 9.8e-07 0.0093 15.3 0.5 346 386 195 235 179 257 0.76
3 7 0.00054 5.1 6.3 12.4 415 513 294 403 260 404 0.51
4 7 0.15 1.5e+03 -1.8 3.1 203 264 442 505 413 529 0.58
5 7 1.7 1.6e+04 -5.2 11.4 150 255 454 556 440 567 0.47
6 7 2 1.9e+04 -8.6 23.8 80 240 539 687 517 710 0.35
7 7 2 1.9e+04 -9.0 27.0 76 236 860 1012 853 1058 0.41

Sequence Information

Coding Sequence
ATGGACCTGTCGACAAGCGATTGCGTTGAGCGGGACCGACTTCGCGAGCGGGATCGGCAGGCCCGAGCGCTGATGAACTCTGAGCCGAGCCCTACATCAGGAGGCGCCCCTTTATTCGGCGCTCCAGTCAGGGTTACACCCGCACCCGGGGATAGAATCAGGCAGAATATTCAGAGTAAGCTCGGGGAATATGATAAAGTTAAGCACCTGCTTGAAGAGCCAAAGCAGTTGCTGGGCTTTGACGGTATTCCTCCGGCATCTCCGGCCCCTGGGCCCACGCCACCACCTCCCAGGCCCCATGAGTTCAAGAAGCCACCTCACCACTCCATTGGCCAGGGACGGCAGAATCACCATCCCACCAATAGAGCTGGTGGGTTCGTGAAGCCTGCCGATGGGAAGCCAGCCTATCACGGACGTGGCTTCTACCCTGGCCAATCTGTGAAGCACACTGAACCGAGGGCAAATGGTGTCGTAAACTCATCAAAAGGGCCGCCTTTGCACATGCCTAGGCCTCCCAACAGGAACACACATCGACCTCATCTTGATCTGCATCACAAGGACATGGGGCGGTCTACTGCTTCAACAAGTGACGTTGACCACATTTTAAAAGAAATGACGGAAGTGACACCTCCCCTGACTGCTATTGCTTCAACCCCGAGGAAGGAAGTGGATTCAAAATTCCATTTCACTCCAGATATATTTAGTCAGTATTCAGAGGCAGTGGAACACAAAAGTTCAGTGATATCACCGAGGATGGAATCACCACCTCCGATGCCGCTCTCACCGCCGCCAGCTACACCCATTCCTCAGAAACAATACAAGCCACTTAATGAACTTGGAAGCAGAAATGGGCATGCTCAAAAATCATACAACGATGATTTTATCAGTGACTTGGCACTTTCTGAGGATAGTGAAGATGATGTAAAGATAAGTGAATGTACCAAACCTCCAGTTCCTGAGCAACAACCCAGGATTGAACCAGAACCTTCACCAATTCCAACTATAGTTTTGACCTCTCCTCCTGCAAGTGTAGTCCCTGAACCTGAGAGACCTATGCCTACATCTCCCGAAGGAAGTTCATCTAGTGAATATTCAGGGACTGATTCCTCAGAAAGTGATTCAAGTACATCCCCTCCAGCTGCCACTCAAGACTACCCAAACCCCATTCCAGAGCAACCTCAACTTAGTTGGGATTTAAAATCATTTCTCCAGCCACAAAAGGATATTTTGGAAAGAAAGCAACCAACTCCAGAACCCCAGCCAGCTGTAGAAAAACCTAAACTTGAGGATGATTCAGAAATTATTCTCGGTGAGGTTTTGACTAAACCACCCCAACCTTTACTATCAAGTTTGTCAGATAGTGATGCAGAGAAAAAAGGAAAACCTAAGAGTTCAGGTAGAACAAAGAAAAGGAGGAGGGTTGTAAGGAAATCTCCTGATGTAAGGACTATAGCATCAGACAGTGAGGAAGAAGCAGAGGTTGTGCTATCAACTCCACCTTCTCCTAGACCTAGCAGAATCCCAGCTAAAGTTCCATCTGGAAGTGAAGGGGAAGTAAAATCCACTGCTAAACGAGAAAAGTCCACAAGAAAAAATCAGCTCACTCCTTCTAGGAAACCTTCTGTTGAAAGAGGTCGAAAAAGGACTCCTGCCACCTCAAGTTCAACTCCTAAAAGTCCTGAATCTGAATCCCCTACAAAGAGAGGTCGTAAAAGGCCAATGAAGCCTCCAGTAACACCAAATGATACTGATGAAGATTCTAAAAGAAAAAGGATTCCAAGTAATTCTGAATCAGATGATGGGCCCTCCTGGAATCATGCTCCTGTTTCAAGAAAGAGTAATAGACGAAAGTCTGCCAAAGAAAGTTCAAGTGAGCGAACTAAGAGAGAATCCCCTTTCATCAGGATATCAAGTAGTGATGAAGAAGAAACCACTGTTGCAACATCTCCTTTGAAACCTGTCTCAAGAGTAGCCCCTGCCCCCTCTGGAAAAAGGCCTCCAAAGGGGTCAAGTGAAAGTGATGCTGAAAAAGAGGAAAGCCCTCCAAAACTTGATTCTGATGGTCAAACTATTCAAGACAAAAAGAAAAATGATACATTAAGAAAACTCTTTTCAAAGAGAGATAATGAAGGAGGTGGTGGAAAAGGCGGTGGCAAAGGAGGAGGTAAAGGAGGAGGTAAGGGTGGAGGCAAAGGAGGAGGTAAATCAGGTGGAAAAGGAGGAAAATGTGGAATTCAAATAATTGAATGTGACACTGAAAGAGCGTCCCCTCATGTTACAAAACCTGAAGTTATTTCACCTGTTTTGCCTCAAAATGCCAGGCTGCAGGAGCCATTGAATGAAGAAGAACCAAAGCCAAAAACTCCTGTATACCCACAAGTTTCCTACATCAATAACAAAATATCTATTATGTGTAGGATTGACTTGTCGAGAATTACCCATGTACCCAGTAAGCGGAGGTCTGAGGAAATTCGAACCAGAACAGAACTACCTGATACCAGGCAAGCAAAAGTTTCTAAAACAGAGCCTGAGGCCCCTCCACCACCTCCTCCTCCACCTCCCCCTCCTCCTCCTCAGCCAAAATCTGAGAAGCCTGAAAAAACAGAAAAACCTGAAAAGCCTGAAAAACGAGATCGTAAATCTTCTGATAAGCATCGTTCAAGGGATGATAAACTAAAGAAACGTAAGAAGTCACCAATTGTAGAAACTAAGGAGGAGCAACCCCCAGTGACTAATTCAATAGAAGCGTTTAAACGTTTGACTCCAGATGTACATGTGGAACCTGAAAGCAGTACTTCAAGTTCCACTGTTAACAATAAAAGGGATAGAAGATCAAGTACAAGTTCTACTTCCTCCCATGCAAGTAACTTGTATCATAACAATTCACATTCAAGTGCAAAAAAGAAAAAGTATGGGGAAAGGCTTGGCTCAAAGAGTGAAGGAGCATTGGTGGATGCCCCACCTACTAACCATGAGCGGGATCGAAAGGAACTTGACAGGGAGAAGGAAAAAGAGAAAGAAAAGGAGAAAAAGAAAAACTTGCAGTATTTTTCATACTTTGAGCGTGAAGAAGAAGAACCAAATGATAGTGATAAAAAGGACTTTTATTTGGCGGAAGCAAAACGCCTAAAAAGTAATGCAGAAAAAGAAAAAGATGAAGCTGCCCAGGGCATGCAGTACCTTGAAGCTGTAATGTATTTCCTACTCACTGGAAATGCAATGGAGCGTGAAAGTGACTCTGAGAAGTCAGTTCACAAAGTCTATACCGACACACTGAAACTCATCATGACAGCTGGCTATCAGCTTCGTAGGCCAACAAGTTTTAATTTAAAGGACATTGCATCTAAATTCAGGAGTCAACAGTATCAGCAGTCTAGCATTCACAGCAAACTGGCAATTCTGGGTTTAAGAAGCCAGTCACTTATTCATTTAAAATTATACAAGATGAAGAAGGGCGAATTGAATGAAGCTGTGAAGGTAGTGACTGACTACATGCAAAAGGCCCCTACAACAATAGATCAAAGAGAAATTGGAATTCCTGGCCAAGGAACTCCTTCACCACTTTCTCCTACTCCTTCTCCTGCAGGGAGTGTTGGCTCAGTGGGATCTCAGTCTTCAGGGTATAGCAGTGGAGGTGGACCTATCCCAATTCAGCCGCATATGCCTCCTGCACCTTTGGGACAGGTGGTTGCTACTCCTCTTCATGTACATTCTTATCTTCATAAAGAACACACACATTTTAATTCCTTAATGGCTTGTCATGACCTCTGGGAACAAGCTGACACACTTGTGCTTAAAGGAAAACACAAAGATTTCTTCATTGAGCTTGATCGAAGCTGTGGACCGTTGACCCTACACAGCACCTTGAAAGAATTGGTGAAGTATGTAAGGGTTGGAATTGCTCGATTGAAGGAAATGAGAAATGAATCAACGTGA
Protein Sequence
MDLSTSDCVERDRLRERDRQARALMNSEPSPTSGGAPLFGAPVRVTPAPGDRIRQNIQSKLGEYDKVKHLLEEPKQLLGFDGIPPASPAPGPTPPPPRPHEFKKPPHHSIGQGRQNHHPTNRAGGFVKPADGKPAYHGRGFYPGQSVKHTEPRANGVVNSSKGPPLHMPRPPNRNTHRPHLDLHHKDMGRSTASTSDVDHILKEMTEVTPPLTAIASTPRKEVDSKFHFTPDIFSQYSEAVEHKSSVISPRMESPPPMPLSPPPATPIPQKQYKPLNELGSRNGHAQKSYNDDFISDLALSEDSEDDVKISECTKPPVPEQQPRIEPEPSPIPTIVLTSPPASVVPEPERPMPTSPEGSSSSEYSGTDSSESDSSTSPPAATQDYPNPIPEQPQLSWDLKSFLQPQKDILERKQPTPEPQPAVEKPKLEDDSEIILGEVLTKPPQPLLSSLSDSDAEKKGKPKSSGRTKKRRRVVRKSPDVRTIASDSEEEAEVVLSTPPSPRPSRIPAKVPSGSEGEVKSTAKREKSTRKNQLTPSRKPSVERGRKRTPATSSSTPKSPESESPTKRGRKRPMKPPVTPNDTDEDSKRKRIPSNSESDDGPSWNHAPVSRKSNRRKSAKESSSERTKRESPFIRISSSDEEETTVATSPLKPVSRVAPAPSGKRPPKGSSESDAEKEESPPKLDSDGQTIQDKKKNDTLRKLFSKRDNEGGGGKGGGKGGGKGGGKGGGKGGGKSGGKGGKCGIQIIECDTERASPHVTKPEVISPVLPQNARLQEPLNEEEPKPKTPVYPQVSYINNKISIMCRIDLSRITHVPSKRRSEEIRTRTELPDTRQAKVSKTEPEAPPPPPPPPPPPPPQPKSEKPEKTEKPEKPEKRDRKSSDKHRSRDDKLKKRKKSPIVETKEEQPPVTNSIEAFKRLTPDVHVEPESSTSSSTVNNKRDRRSSTSSTSSHASNLYHNNSHSSAKKKKYGERLGSKSEGALVDAPPTNHERDRKELDREKEKEKEKEKKKNLQYFSYFEREEEEPNDSDKKDFYLAEAKRLKSNAEKEKDEAAQGMQYLEAVMYFLLTGNAMERESDSEKSVHKVYTDTLKLIMTAGYQLRRPTSFNLKDIASKFRSQQYQQSSIHSKLAILGLRSQSLIHLKLYKMKKGELNEAVKVVTDYMQKAPTTIDQREIGIPGQGTPSPLSPTPSPAGSVGSVGSQSSGYSSGGGPIPIQPHMPPAPLGQVVATPLHVHSYLHKEHTHFNSLMACHDLWEQADTLVLKGKHKDFFIELDRSCGPLTLHSTLKELVKYVRVGIARLKEMRNEST

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-