Basic Information

Gene Symbol
lilli
Assembly
GCA_951394055.1
Location
OX596018.1:35583026-35590486[+]
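The location string packs the sequence accession, start, end and strand into one field. A minimal sketch of how it could be split apart follows; the `Locus` type and `parse_location` function are hypothetical names introduced here, and the span calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for one 'seqid:start-end[strand]' field."""
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings such as "OX596018.1:35583026-35590486[+]".
_LOCATION = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Locus:
    """Split a location string into accession, coordinates and strand."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


locus = parse_location("OX596018.1:35583026-35590486[+]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = locus.end - locus.start + 1
print(locus, span)
```

The span covers the whole locus on the assembly, so it is expected to be longer than the coding sequence whenever introns are present.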

Transcription Factor Domain

TF Family
AF-4
Domain
AF-4 domain
PFAM
PF05110
TF Group
Unclassified Structure
Description
This family consists of the nuclear proteins AF4 (proto-oncogene AF4) and FMR2 (associated with fragile X E syndrome, FRAXE). These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and intellectual disability [1]. The family also contains Lilliputian, a Drosophila homologue of AF4 that carries an AT-hook domain. Lilliputian is a pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
Hmmscan Out
#   of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1   10  7e-09     0.00015   22.4   22.5  6         176     5         165     1         188     0.62
2   10  1.8e-07   0.0039    17.7   10.3  349       461     280       393     255       404     0.66
3   10  2         4.3e+04   -7.4   22.8  109       253     417       554     398       567     0.33
4   10  0.055     1.2e+03   -0.3   12.2  422       491     574       645     559       660     0.72
5   10  0.46      9.8e+03   -3.4   2.3   442       502     675       737     663       742     0.36
6   10  2         4.3e+04   -11.5  46.7  392       487     747       853     719       869     0.53
7   10  1.3       2.9e+04   -4.9   19.0  87        217     902       1027    894       1046    0.48
8   10  2         4.3e+04   -6.7   13.4  68        257     990       1176    987       1221    0.52
9   10  0.19      4e+03     -2.1   2.3   67        146     1158      1243    1143      1365    0.45
10  10  0.42      8.9e+03   -3.2   2.3   138       169     1424      1455    1398      1527    0.36
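Each row of the domain table has 13 whitespace-separated fields: domain index, domain count, conditional and independent E-values, score, bias, the HMM, alignment and envelope coordinate pairs, and the alignment accuracy. The sketch below shows one way to read such rows and keep only the hits with a low independent E-value. The `DomainHit` type, the `parse_row` function and the 0.01 cutoff are illustrative assumptions and are not part of the record; the four rows are copied from the table above.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record type for one row of the per-domain table."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Convert one whitespace-separated data row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *map(int, fields[6:12]),
        float(fields[12]),
    )


# First four data rows of the table, header line excluded.
ROWS = """
1 10 7e-09 0.00015 22.4 22.5 6 176 5 165 1 188 0.62
2 10 1.8e-07 0.0039 17.7 10.3 349 461 280 393 255 404 0.66
3 10 2 4.3e+04 -7.4 22.8 109 253 417 554 398 567 0.33
4 10 0.055 1.2e+03 -0.3 12.2 422 491 574 645 559 660 0.72
"""

hits = [parse_row(line) for line in ROWS.strip().splitlines()]

# Assumption: 0.01 is an arbitrary illustrative cutoff, not one used by the database.
I_EVALUE_CUTOFF = 0.01
significant = [hit for hit in hits if hit.i_evalue <= I_EVALUE_CUTOFF]

for hit in significant:
    print(hit.index, hit.i_evalue, hit.ali_from, hit.ali_to)
```

With this cutoff only the first two domains pass, which agrees with their positive bit scores; the remaining rows have negative scores and large E-values.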

Sequence Information

Coding Sequence
ATGGAAGATTATGAACGTATGGAAAGAAGAGAAAGAGATAAGCTCACAAGACAGCAAGTACAAAATCCAGATCCAGAAATACCACTTTTTCCAGCACCAGTTCGTTTAAGTCCATCAGAAGGTGATCATAGAATACAATCGAAACTAGGAGATTTTTCATTAGTTAAGGGTTTAATTGAACAAAAAAGATGCATTGGTATTTCAACTAGTGCTCAACGCCAATTACAAATTGGTTCAAGCTCAtctgcatcatcatcatcgtcatcgtcatcgtcatcatctgCTATACAACAATCTTCATCTACATCAgcgtcattatcatcatcaaactGTTCGGTAATACCTCCAATACAGCACAATCGTAATCTTCAACTGCAGCAGGCGTTATCAtctcatcattatcatcaatcacaacagcaacaacagcaactgCAACAActgcaacaacagcagcaacagcagcaacagcaacaacagcaacaacaatcgCAACAACCGCAACATCCACCACCGTCACGAGTTCCTTACATGAAGCAAATAACTGATAATAAACCAGCATATAATGGTCGTGGAGGTTATCCAGGACAATCTATGAAACATGATATTCATAGTGTTGGAGGCGGTGGTGGTATAATAGCACCTCCGACAAAAGGTTCAATGTTACCACCACCACAATCACAACCAATGTTGTCAAATGGTGGAAGATTGAATGACAAATTGTCATCAATCCCACCACTTCCAAATGGTAGATTTCAACCACCGTTGCCTAAACCGCCTAAATTTAACGCAGagTCTGAAAGCTCAAATCCATTATCATCATTGGCTGTAAATGgtgatattgaaaatattttgaaagaaatGGTTGTACCCAGTTACACACCATTGACTGAAATTGCAGCAACTCCACGAAAAGAAGTGGAATGTAAATTTACATTTGGACAACCAAATCCTCATAAGTATGCCGATTTTCCCCCACCATTTAAAGCACCATTAAGTAATAAATtaaaagaaaaaatagtgCCTACAAAACCAACTGATCTCGAGAAAGATTTAGATTTATCAGATAGCGATgaagaaagaaagaaacaaaCGTCTGTCGATCAgACATCACCAGCAAATGATGCTTCCAGTGAAagttcagaatcagtctcaagTGAATCTGGTAGCGAAGAATCAAATGGAACCAAAGGAGAAACCTCTAATCATACAACTAAATGGACATTAGGTAGATTTTTTAAACAACCTCAAGCACAAAATGAAATAACACAACCATCACCAAATGCAAATTCACAAAGTATACCTGaacaaacaattaaaaatgaacCAATACAGAATATTGAAGATGATAATTTATCATCACCGTctacatcatcattatcatatcgtaatacaaataatatatcatCACAGCATttacagcagcaacagcaacagaaaACGACATCAGCAAAAATAATTTCACCAGCCGATGTGAAAAAGGAGAATATGCAATCGGATAGTCTGTTTCCAATTTCTAATAAAACATCATCAGTTACTGGTAATAATAATTCACAACAACAGAAACCgggacaaattaaaagtgaggCATTTGataaaagTCCATCAACAAGTCCTGAAAACATTCAACAATCATCGTGTACTGTTGAATCAGATCAATCTGCAACAATATTTGCTGCAGCCGAAGAATATTCGTGCATAAAAGGTTTATCAGAATTAAGCAGTAGTTCTGGGGAGGAAGAAGatgttgaagaaaataaaaaacctaAAAAGAAACGAACGCCTAGTAAAAAGCcacagaaaagaaaaagaaaattgccTAAAGATAAGAAAACAGTAAACACTGCAATAGCATCAAGTGATGAAGAAGAATTTTCAAGTGCATTATCATCAGCAGGAAGCAGATCAAAATCAATTGATACTGAAAAGAAACCGAGAAGTGGACGAGGTCGTCCAAGAAAAAATCCAATAGCTGTAACAGCAGCATTAGCTACAGTTACACC
AGCTCCAACCAAAGCGACTGCTCCATCAACTAAAAAAGAACCAGTCAAAAAACAGGCTGTACGAAGAAGAGTTTCACGACAAAATTCGAACGTTAAAAGTCGTGAATTGTTAGATACTACGAATTCAACGTCAGACGATGATGACGACGGAAGTCCACCGTCTGTTGCAGCACCAGTACGTCGAAGTCCTCTTCCTCCGCCTAAACCACCTGAACCGACAGCTTCTAGTCGAATATTACTTCAAATCCAAAAACGTTCATCACCATCATGTGCCGTAAAGTCAAACGAAGATGACgaagacgatgatgatgatgacgacgacgacgacgacgatgatgatgacgatgaaaaTTCTTCTACCTCTTCACCACCATCACAAGAAGATCGACTATCAGTTTCACCTCGTATAACGCCAGCAAGTcgacaacagcagcaacaacaacatatTTCTCCTGGTCGTCGTTCCCAACCAAGAAGCaaactatcatcatcatcatcctcatccTCATCCTCTTCATCTAGTGAAAGCAACTCAGACAGTGATAGTGAGGATTCAATAGAAGATAGAACGCATTCTAGAGACGATAAATTGACTTCAAGGGAAGATAAGaaaattcgtGATAAGACGAAAAGTGATAAAAACAAAAGTGATACCTTACGTAAATTATTTTATGGATTGAATAAGGGAGAAGGTGGAGCAAAGAAACAAGGTCAAGTACTTATTGTGGACCAATCTGAAGAATCACAAAATCAATCGTCTTATcatcatgaaaaaattttatcaccaatAGGCTTTAAGTGCAGTAGTTCGTCGTCAGTTGAAACTACTCAAACAACATCTATAAATAGCAATAATAACACCAACAGCAATACAAATAACGCCAATATACTTACAACCAAATCCCCACGACGTACTCCAATTCGCAATagtaacaacagcaacaacaataataatcataataataatagcaattgTAATAATATGATTAGAACACCAGACAGATCTTCGAATATTACACCAACATCAATCACATCGAATAAACATCAAATGTTGAATAATAACAGCAGTAATAGCAGCAGTAATTTTAATTTAGGAAATCTAAATGAAAATCGTTGTAATACACCAATAATGCCTCCAATTATTTGTAGAATTGAATTATCTCGCCTAAAGCATATTCCTCCAGAATGGTATCATAATTACACAGTTCGAAATGAACGAAGTCCTCGCAATAGTAGCAACATAACCTGTAGTAGTAATTTAAGTCAAAATAATGCAAATAATAACAATTGCAACATAATGGGTCCTAGTGATGATGAAAGactgatgatgataaaaaattcCAGAAATATGAATATGTCTTCAAATTCTGCTAATTCAGCAGCTGTTTCAAAAATGAATCGACAGCAATTAATGGATAATTGTTGCTCAAGGGATAATTATCTGAATGTTGATAATGCAGGACTAAACTCTAGACAAACTTCTGTTGCAGTAACAATGCCTACAACATCAATTAACAATAATTTAAGTAACAATAATACAAGACTGTTTGACGATAACCCAATGGTTAGCATACCAATATCACAACACCAGCATAGCAATCGAAGTATGTTGATGAATCCAAGTTCTGGTATGGTGGGTACAAATAAGCCAATCGCAATAATTGCAGCCACGAGAAATTTAGATTCGTCCGCAATGCATGACGGTGGAAAGATGTTGATGAGTAATAGTGTCGCTAATAATGCGTACGGTAGTTTCAACACAAACAGTGGTGGTATGATGATGGTAGAAAATCGAAATAGTCGAGTTCCTACACCTCATCAagaacaaatacaaaatttacgttTAACACCGAATCGATCAAATAGTGGTGGCGGTCATTTGCAATCACCAAAATTAGATGATCGTACATCGATGGCTGCATCACCAGTTAAAAATGTGATAAAACATGAAACATCAGCTATAAAATCTGAATTTATGAATGATGGTGATTATTATTTGG
CAACAGGAACAAAATCATCTGCTTCTGGATTACcagatgaaaaatataatatttacggTAAAATGAGTAACAATGCTGCCTTAAAATCTGAACAAATGATGatcaaaaatgaatacaaaacaCTTGGTATACCGCCACTTGGCGGTAATGATTTAGATTCAAATAAAATTGGTGGCAATGTTGGTTGTGGTGGCGATGAAATGTCACCACCATCGGGATTAGGGATTATAACGAGTGGCTCATCGAAATCAATGAATAGTGACGACTATATTCCAAAGCAACGTCGTAAACGTAGTTGTAGTGCAAACAGTAGTCCATTCAAAGAGAAGAAACGAAAAAAAGGTGATCAGCCACAGcTTCAAAAAGATGTTATGGAACAATTACCGCCAACAAATCATGATCGGTTAACATTAGAAAAACCGATTGTACAGTCACAACAATCAAAAGTATACAAGTCATACTTTGAAAGAAGTACAGACGATATTTTGAGAGAAGATTATAGgGACCAAACAAGATTTTTGGAAGAAGCTAAAAGATTAAAACATGCTGCCGATAGGGAAACACATAATTTTGATCAAGTTATGTTATATTTAGAAgctgtattatattttttgttatctgGTGCTGCTATGGAAGAGGAGAAAGTAACAGAGAATGCAGCTTTCACAATGTACAAGGACACATTAATGTTAATCAAATTTATATCATCAAAATTTCGTAATCAACAAGCAGATAATCATAATAAGGTCGCTATATTAAgttTACGTTGTCAATCATTAATTtcatcaaaattatataaaatgaaacgtCATGAATGTAAAGAAGTTCAAAAAATTGttcatgatttttttaaaactgGTGGTAAAGGTTCTACAGAATTGATAAATGGTAATACACCATCATCAATATCACCAACGAATTCTGTATGCTCTcaggGGTCCGGTTCTAATACTCCACCATGTCAAATGGTTCCGATTCAGATATATGCAGCtcttcaaaaacaaaatcaatactTTAATTATTTAACGAGTTGTCATGATTTATGGGAACAAGCTGATAGTTTAGTAAGACGAGGCAATCATacagattTCTTTATTGCATTAGATCATGAGAATGGACCTCTAACGTTACATAGCTCCATATAcgatgtatttaaatatgttcAGGCTggtttacaaaaattaaaagaaatgtaa
Protein Sequence
MEDYERMERRERDKLTRQQVQNPDPEIPLFPAPVRLSPSEGDHRIQSKLGDFSLVKGLIEQKRCIGISTSAQRQLQIGSSSSASSSSSSSSSSSAIQQSSSTSASLSSSNCSVIPPIQHNRNLQLQQALSSHHYHQSQQQQQQLQQLQQQQQQQQQQQQQQQSQQPQHPPPSRVPYMKQITDNKPAYNGRGGYPGQSMKHDIHSVGGGGGIIAPPTKGSMLPPPQSQPMLSNGGRLNDKLSSIPPLPNGRFQPPLPKPPKFNAESESSNPLSSLAVNGDIENILKEMVVPSYTPLTEIAATPRKEVECKFTFGQPNPHKYADFPPPFKAPLSNKLKEKIVPTKPTDLEKDLDLSDSDEERKKQTSVDQTSPANDASSESSESVSSESGSEESNGTKGETSNHTTKWTLGRFFKQPQAQNEITQPSPNANSQSIPEQTIKNEPIQNIEDDNLSSPSTSSLSYRNTNNISSQHLQQQQQQKTTSAKIISPADVKKENMQSDSLFPISNKTSSVTGNNNSQQQKPGQIKSEAFDKSPSTSPENIQQSSCTVESDQSATIFAAAEEYSCIKGLSELSSSSGEEEDVEENKKPKKKRTPSKKPQKRKRKLPKDKKTVNTAIASSDEEEFSSALSSAGSRSKSIDTEKKPRSGRGRPRKNPIAVTAALATVTPAPTKATAPSTKKEPVKKQAVRRRVSRQNSNVKSRELLDTTNSTSDDDDDGSPPSVAAPVRRSPLPPPKPPEPTASSRILLQIQKRSSPSCAVKSNEDDEDDDDDDDDDDDDDDDDENSSTSSPPSQEDRLSVSPRITPASRQQQQQQHISPGRRSQPRSKLSSSSSSSSSSSSSESNSDSDSEDSIEDRTHSRDDKLTSREDKKIRDKTKSDKNKSDTLRKLFYGLNKGEGGAKKQGQVLIVDQSEESQNQSSYHHEKILSPIGFKCSSSSSVETTQTTSINSNNNTNSNTNNANILTTKSPRRTPIRNSNNSNNNNNHNNNSNCNNMIRTPDRSSNITPTSITSNKHQMLNNNSSNSSSNFNLGNLNENRCNTPIMPPIICRIELSRLKHIPPEWYHNYTVRNERSPRNSSNITCSSNLSQNNANNNNCNIMGPSDDERLMMIKNSRNMNMSSNSANSAAVSKMNRQQLMDNCCSRDNYLNVDNAGLNSRQTSVAVTMPTTSINNNLSNNNTRLFDDNPMVSIPISQHQHSNRSMLMNPSSGMVGTNKPIAIIAATRNLDSSAMHDGGKMLMSNSVANNAYGSFNTNSGGMMMVENRNSRVPTPHQEQIQNLRLTPNRSNSGGGHLQSPKLDDRTSMAASPVKNVIKHETSAIKSEFMNDGDYYLATGTKSSASGLPDEKYNIYGKMSNNAALKSEQMMIKNEYKTLGIPPLGGNDLDSNKIGGNVGCGGDEMSPPSGLGIITSGSSKSMNSDDYIPKQRRKRSCSANSSPFKEKKRKKGDQPQLQKDVMEQLPPTNHDRLTLEKPIVQSQQSKVYKSYFERSTDDILREDYRDQTRFLEEAKRLKHAADRETHNFDQVMLYLEAVLYFLLSGAAMEEEKVTENAAFTMYKDTLMLIKFISSKFRNQQADNHNKVAILSLRCQSLISSKLYKMKRHECKEVQKIVHDFFKTGGKGSTELINGNTPSSISPTNSVCSQGSGSNTPPCQMVPIQIYAALQKQNQYFNYLTSCHDLWEQADSLVRRGNHTDFFIALDHENGPLTLHSSIYDVFKYVQAGLQKLKEM
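The protein sequence should correspond to a codon-by-codon translation of the coding sequence. The sketch below checks this for the first and last codons of the record. It assumes the standard genetic code (NCBI translation table 1), treats lowercase letters in the coding sequence as ordinary bases, and ignores line breaks inside the sequence; `translate` is a hypothetical helper written for this example.

```python
# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop, 'X' an unknown codon."""
    # Remove whitespace and line breaks, and fold lowercase bases to uppercase.
    sequence = "".join(cds.split()).upper()
    residues = []
    # Step through complete codons only; trailing bases are ignored.
    for position in range(0, len(sequence) - 2, 3):
        codon = sequence[position:position + 3]
        residues.append(CODON_TABLE.get(codon, "X"))
    return "".join(residues)


# First 27 bases and last 9 bases of the coding sequence in this record.
cds_start = "ATGGAAGATTATGAACGTATGGAAAGA"
cds_end = "gaaatgtaa"

print(translate(cds_start))  # MEDYERMER
print(translate(cds_end))    # EM*
```

The first nine residues match the start of the protein sequence, and the final codon is a stop, consistent with the protein ending in `EM`.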

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-