Basic Information

Gene Symbol
lilli
Assembly
GCA_949768715.1
Location
OX458321.1:18229376-18248202[-]

Transcription Factor Domain

TF Family
AF-4
Domain
AF-4 domain
PFAM
PF05110
TF Group
Unclassified Structure
Description
This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 6 8.5e-10 3.4e-05 24.4 6.3 4 173 13 193 11 228 0.55
2 6 1.3e-09 5.1e-05 23.8 26.9 319 513 193 416 169 417 0.61
3 6 1 4e+04 -14.2 23.6 76 270 449 641 435 653 0.51
4 6 1 4e+04 -9.7 17.2 397 478 593 683 578 694 0.60
5 6 1 4e+04 -7.8 27.3 372 495 691 820 687 834 0.66
6 6 0.024 9.6e+02 -0.1 16.2 88 222 1015 1153 984 1187 0.40

Sequence Information

Coding Sequence
ATGCCTCATTACGCGAATTATACAACTCTCACCGTGGAGAGAGACCGCCTGCGAGAGCGGGAGAGGCAGGCAAGGGCTCAGATGTCCTCGCAAGTGGCCGAATGGGACAACGAACCGGAGCAGCCCCTCTTCGGAGTTCCAGTGAGGGTGAACCCGTCCTCCACTGACCGGGTGACGCAGCAGATCCAGAGTAAGCTGGGGGACTACAGGCGACTGAAGCCCTATTTGGACACATCGAAGACGCTGATCGGGGTCGACGGCGTGCCGCCTCCGAACCCCGGCGTGGCGCCTTCGTCCTCGCGGCTCCAGTCACCTAGCGGTTCCGGCCAAGCGGCCAGACCGGAGTTTAAAAAGCCCCACCAGAACAACGGAAGGCCGCCGCACCATCCTCACCAGAGAGGAAATTTTGTAAAGCCGGGCGGCGCCAAACCGCCCTATGAGAATAACCGTTACGCGGGGCAGACGGTCAAGCACGGTGGCGTTAACCATCGATCAAATGGTATTGTGGCGAGCAAGGGGCCGCCTCTTCCTGCATCCAGTTCATCGTCCCAGCAGCTACCGTCTTCTTCCAGTTCGAACCGCATACACAATGCGGCGCGGAATTTGCCTCGTCTTCCCATTGAACAACAGAACCCAGTGAGCGGGTTGGGTTCCCGAGATGCCTTATCGGCAAGCAGTAATCCAGAAGAGTTTGTCGACAACATTCTGAAGgaGATGATAGACGTGCGGACGCCTTTGACGGCCATCGCGGCAACTCCCCGGAAAGAGCCCGATAACAAGTTCACCTTCAATCCCGTTCTGTCCAAGCTGACGGAGGCGACTCCGACTGCACCGAAAAaaCGCGAAAGGCAGCCTCTCCATAGGCCCTCGACCGATTTGCGAGATGATCTCAATATATCCGAGTCGGACGACAGTGACGACGCCGACAAACGGGAAATCCCACAGTCATCAAAGTTACCAATAGAAAAATTGTTATCTCCAATCGGAGCGACGCCTGCTTCGAACAGCGGCAATCCGAAGATGGAAAGGCCGCCTGAGCCGCCCGCTGCTTTATCACCAGTGGCCAGTTCTTCCAGTGATTCAGGTTCGGATTCCGGCACGGACACCGATTCGAGCAGCGACGATTCAGGAGACGATCGCCCGACGCCGGTCGTTACCCACGTGTCGCCGTTGCCGGCTGAACCGCCCCACGTATCGCCAAAGCCCGAGGAAGAAGCGAAACCGCGCTGGAACCTCGCCTCCTACCTCGATCCTAACGGCGCCAAAAGTGAAACAATCCAATCGCCTCGCGTACAATCACCCCCCGTATTATCCTCGTTCCAAAACTTAACGTCCCACATCCTGCCTCTGAACTCGAAGATGTCGTCCGATAGCCGGAAGAAGTCCACGGAGGAATCGGACGCCAGCGATTCGACTAAAGACTTGGATAGTGTCGTGGCCGAGGCGTTCGCTTCGAAACCGTGCGCGCTCCTGTCTAGTTTTTCCGATTCGGACGGCAGTCCCAAACGGAAGAGTCCCAAGAGGCGGAAGCGCGCTTCGCAACCAGTTACCAATGTGTCGGAAAGTGATAGTGACGAGGAACGAACTATTAAAGCTCCGAAACCAAAAAATCGTGTCAGTCCAAGACAGAAATGTGATTTTTACAGTGATTCGGACGCGGAACCTACGGTTGTTGGTGCCAGCGAAGAGTTTAACAAACCGCTAGTGCCGGTCGTGAAACCCTCCGAGGCTAACAAGCCGAAATCGAACCGAGGCAGGCCGAGGAAAAATAAACGGCCGCACAGCGGGTCGGACAATGAAGTGCAGGTTAAAAAGCGCGGTCGACCGCCTCTCAACAAAAACCTCAATTCGCAGAGTGGATCTGATAACGAGGCGCGAGTTAAAAGGAGAGGACGTCCGCCCTTGAGACCCAAACCTCAACCTTCCCCCAGTAGTTCGGACGACGAACACAAACCGACGTTTGAAAAACCGCCCCCTCGAAGACGCACCGTCTCCAAGCACGTTTCCAGCAATTCAGATTCTGACACGTCCAACAGGAGCAAATCGCGCAAGAGGGACATCAACGTCCAAGAGTCCGAGGCCATCAAGTTCGCAACGTCCAAAAAATCTCACAAACACAAATCGGACGACAAGGAGAAAGCGAAGAAGAGCAAAGACGATTCCGATAATGACGAATGGGGCAAGcagaacaaaaacaaattgagGAACCACTTGTTGGAGAGGAACGGCGACGCTTCGTCCGCGAAAGATGGCAAGCAGAAGAGGCCATCAGACGAACCCAAAAAGAAGGAAACTGCGATCACgtttaaaagaaaagaaaagaaaagcgTCGAACGAGTCCCAACCAGCGATTCTGATAGCGATTCGGACCACAAACCCGCTGTTAAAAGTTCGCCTTATCGCATGGTTAGGCACTCTTCGAGTTCGGACAGTGACGCTAATGTTAGAACCAAGTCTTCGGCCGATCGTTCGTCTCACGTGCGCTTCCCTTCGAGCAATAAGGTCGAAAGTCCGGTTAAAGTCGACCCTGATGGGAAGACGATTCAAGACAAGAAGAAGAGTGACACGTTGAGGAAGCTGTTTACGCCTAAACGCGACTCTGAAGGTGGAAAGGGCGGCGGTAAAGGCGGAGGAAAGGGCGGGAAAGGCGGTAAAGGGAAAGGTGGCGTTAACGTGATTATAGTCGATGGAGATTATGAGAGGAGCAGTTCTTCGGTTGAGGACGAGGCGATGCCCATGATATCAAATCCGGCCATGCTTTCGCCGATTCCGAATCAAGAAATCAAGTCGCCATACACAGCGCCAATCAAACAAGCAGTTAAGACTGGTTATGGCCCGGGTAGGGTTTTGGTTAGGATCGATTTGGATAGGATCAAGCACTGGCCGAAACCTGTGAAGAGGAGGTTGGAGGAGGTCAGGCAGAGGACAGAATTAGCCGACCCCAGGCCGGAAAAGTCAGAAATTATCAAAACAAAGACTGACATATTTGATATTAATTCGCCTATTAGTGTTAGTGACAAACATTCGCCTTATATGAACGTTGAGGACAGTGAAATGATGAAAAGTGATAGGACTAATGTTATGCTTGAATGTGACGCTGATTTCAAGAAGAGGCACTCCGATCCCGCAAAAGACCAATCAAAACTAAAAGGGCACAAAAGAAAGCGACTCAACAGTTGCAGTTCCATATCATCTCTGTCGACTGTAAGCAGCATGTCGCACAGTAGTCGGCGAAACGAGCGCAGGAAGGAAAAGAACAGTCACAAATcaaaaagaagaaaagatGACGACTGCTCTTCACAAAGATCTCACGTTGATAACGATAATTTAACCGATGTTCCTCCTACGAATCACGAGAGGGACCGTCCAAGGACGCCGCCGGTGAGTTCCAGAAATTCGAGTTGGTGTCCACCTACCAGAGAATATCACTCATATTTCGAGAGAAGCGACGAACCCgcagaagaagaagaaagggatcaaaataaatatttgagtGAAGCTAAAAGATTGAAACATTTGGCTGATAAAGAGACTgacgcaataaagcaatgtaTGCTGTACCTGGAAGCTGTGCTATACTTCTTACTTACAGGAAATGCCATGGAATACGAAAGTGTCACTGAAAAGGCTGCCTTTACTATGTACAAAGATACTTTGAGTCTTATCAAATATATTTCATCTAAATTTAGAAACCAGCAAAACTCATCCCCAGTGCACACAAAACTTGCTGTTTTaagTTACCGGTGCCAAGCACTTTTGTATTATAAGCTGTTCAAAATGAGGAAGCATGAGGTTAAAGATTACCAAAAACTCATAAATGAATATTGCACAAAGtctgCAAACATGGCACCAATACAATCGGACCAAATAAACCAAGCGTTGGGAGGTCAGGGCACGCCGTCACCCTTATCTCCGACGCCATCCCCTGCAGGCTCGGTCGGTTCAGTGGGAAGCCAATCCTCGGGTTACAGCAGTGGAGAATTGGCTGGTAGAAACAACAATAATGTTGCGCAATCGACCCCCGCACCAACTCCCACGCCCTGTATGTTAGTTCCAGTTCCTGTTTACAATGCCATGTCAAAACAAAACCAGCACTTCTCATATCTACTTTCTTATCAAGATCTGTGGGACCAGGCAGACAATCTTGTGATAAAGGGCAGGCATAAAGAATTTTTCATTGAACTCGACAGGCATTGTAAGCCCTTAACTCTCCACAGTTCACTGATAGACCTGGTGAGGTATGTCAGAGCCGGCATATGCAGACTGAAGGAGAGTTAA
Protein Sequence
MPHYANYTTLTVERDRLRERERQARAQMSSQVAEWDNEPEQPLFGVPVRVNPSSTDRVTQQIQSKLGDYRRLKPYLDTSKTLIGVDGVPPPNPGVAPSSSRLQSPSGSGQAARPEFKKPHQNNGRPPHHPHQRGNFVKPGGAKPPYENNRYAGQTVKHGGVNHRSNGIVASKGPPLPASSSSSQQLPSSSSSNRIHNAARNLPRLPIEQQNPVSGLGSRDALSASSNPEEFVDNILKEMIDVRTPLTAIAATPRKEPDNKFTFNPVLSKLTEATPTAPKKRERQPLHRPSTDLRDDLNISESDDSDDADKREIPQSSKLPIEKLLSPIGATPASNSGNPKMERPPEPPAALSPVASSSSDSGSDSGTDTDSSSDDSGDDRPTPVVTHVSPLPAEPPHVSPKPEEEAKPRWNLASYLDPNGAKSETIQSPRVQSPPVLSSFQNLTSHILPLNSKMSSDSRKKSTEESDASDSTKDLDSVVAEAFASKPCALLSSFSDSDGSPKRKSPKRRKRASQPVTNVSESDSDEERTIKAPKPKNRVSPRQKCDFYSDSDAEPTVVGASEEFNKPLVPVVKPSEANKPKSNRGRPRKNKRPHSGSDNEVQVKKRGRPPLNKNLNSQSGSDNEARVKRRGRPPLRPKPQPSPSSSDDEHKPTFEKPPPRRRTVSKHVSSNSDSDTSNRSKSRKRDINVQESEAIKFATSKKSHKHKSDDKEKAKKSKDDSDNDEWGKQNKNKLRNHLLERNGDASSAKDGKQKRPSDEPKKKETAITFKRKEKKSVERVPTSDSDSDSDHKPAVKSSPYRMVRHSSSSDSDANVRTKSSADRSSHVRFPSSNKVESPVKVDPDGKTIQDKKKSDTLRKLFTPKRDSEGGKGGGKGGGKGGKGGKGKGGVNVIIVDGDYERSSSSVEDEAMPMISNPAMLSPIPNQEIKSPYTAPIKQAVKTGYGPGRVLVRIDLDRIKHWPKPVKRRLEEVRQRTELADPRPEKSEIIKTKTDIFDINSPISVSDKHSPYMNVEDSEMMKSDRTNVMLECDADFKKRHSDPAKDQSKLKGHKRKRLNSCSSISSLSTVSSMSHSSRRNERRKEKNSHKSKRRKDDDCSSQRSHVDNDNLTDVPPTNHERDRPRTPPVSSRNSSWCPPTREYHSYFERSDEPAEEEERDQNKYLSEAKRLKHLADKETDAIKQCMLYLEAVLYFLLTGNAMEYESVTEKAAFTMYKDTLSLIKYISSKFRNQQNSSPVHTKLAVLSYRCQALLYYKLFKMRKHEVKDYQKLINEYCTKSANMAPIQSDQINQALGGQGTPSPLSPTPSPAGSVGSVGSQSSGYSSGELAGRNNNNVAQSTPAPTPTPCMLVPVPVYNAMSKQNQHFSYLLSYQDLWDQADNLVIKGRHKEFFIELDRHCKPLTLHSSLIDLVRYVRAGICRLKES

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-