Basic Information

Gene Symbol
lilli
Assembly
GCA_030762935.1
Location
CM060881.1:24301284-24405279[+]

Transcription Factor Domain

TF Family
AF-4
Domain
AF-4 domain
PFAM
PF05110
TF Group
Unclassified Structure
Description
This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 5 0.00027 33 6.3 0.0 43 120 105 185 102 206 0.55
2 5 9.4e-17 1.2e-11 47.4 14.3 341 502 275 464 258 492 0.75
3 5 1 1.2e+05 -7.0 12.5 84 244 575 723 546 753 0.50
4 5 1 1.2e+05 -21.1 30.0 105 218 769 887 742 967 0.36
5 5 0.00094 1.2e+02 4.5 12.6 151 236 1117 1195 1076 1233 0.49

Sequence Information

Coding Sequence
ATGGTTAATTGGGGTTTAAATTCTATATCCCTCTGTTACGCTTTGCGTCTTACGTCCCGCAGTCCCACGGCTACGTTTAGCGGCTGGGGGTCCTCTATCCTCGGGTATGTTTTGCGGCTGGGGGCCCTCTATCCTCGTCTATGTTTTGCGGCAGGGGGCCCTCTATCCTCGGCTACGATTTGCGCCACGAGTTCCTCCCCCACTGAAGCAAGTCCTCGTACTTTCACTCGCTCGAGCAATCGGCTAAACTTGACAACGTCGCACACGTCTGCAATACCAAGTGAAAACGGAAGGACGGTTGTGCCATCTGCTGAAGACACAGTCACTCAGCAGATTCAGAGGAAACTGGGGGACTATGCCCTCGTGCAGCAGTTCCTCGATGATGAGCCCAAGAGACTGATTGGTATCGATGGCTTGCATCCTCCAGCAAGTCCAGCTCCTGGGGGCGTAACTACATTCACATCCACTTCTCTGGCTGCTGCAGCACGCCTACAGCCGCCACCAGAGTTCAAGAAGCCACAACATGCTGGCACCCGAACCAATAGTAGCAATaacaaccatcaccatcaccaccatcaccatcatggACCGACTAGAGGCGGCTTTGTCAAACCTGCTGATGGCAAACCACCATATGGTGGAAGAGGGGGTTATCCTGGACAGCCTGTCAAGCATGGTGCAGGCAGCAATGATCATCGCTCCAATGGCATCGTTCCTCCTAAAGGTCCTCCTCAAGGAGGGGGGAGTCGAGTTCATCAGGCTGGCAGAAACCTTCCCAGAATACATgctCCAGTGCTGGGACCTCGTGAATCATCATCTCAACTTGGCAGTGCAGGTCATAATGAAGTGGAAAATATTCTCAAGGAGATGACTGAAGTGATGGCACCAATATCAGGGATTGCCACACCAAGGAAAGAACTGGAATCTAAGTTCACCTTCAGTCCCATGGTTCCCAAGGTCTCAGAAGTGCAGGCTCCGCGACCCGCCGCCGATTACAACAGGAATATCAACAGGCATGGAAATCAGGTGCTAGCGACGACACAAGTATCGGGCAACTTGGAAAAGGATCTCAGTCTGTCAGACAGTAGCGACGATGACAAGGATGTGCTCAAGTCCAATGGAAGACCctcctctgctgctgctgctgctgcagctaAAATGGACCAGTCCAGACGAGTTCCAGCTTTACTGACTGCTAGTCCTACTCCAAGCATCCTTAGTGTGGGGAGCAGACCACCAGAGCCAGCACCGCTAATGTCCCCGGTGGGACCTGCTTCATCTTCTAGTGAAGCAGACTCTGACTCTGGCTCTGGGTCAGACAGCGAGTCGAGTAGCGAGGACTCTGGGGATGAAGCCCCGCAGGAGTCGACTGAACCAGCAGCAATGCCTACAGCAACGGCAGCATCGCCACCCCAGATGGAGCAGTTGGAAGAGGCAAAGCCACAAGATCGATGGAACTTGGGAAGGTTTCTCAAGCCCATTGGAGAGCCCCCAGAAGAGAATGAACAAACTAGAACTGGTCCAGAGTTGGCCAAGCGAGAATCATCGCCAGCTGTTGCAGCAGTACCTGCCGTAACAGCAGCGAAGTCAGTTCCTGCAAGTGGCACTCCGTCCAAGGAGAAGCAGGAAGAGGGTGAGATTTTGTCAGATGACAGCGATTCCAACAAGGAGCTGGACAATGTCGTGGAGGACTTGCCACATCCAGTACCACTTCTGTCAGGCATCTCGGATGAGTCTGATTCTCCACCAGTCGTGGAAACTAGGACTGCCCCCATTACCAAGAGGAAACGTCCCTCCATCAATACCAGGCCACCCGTGCTCTCGGAGAGTGACAGTGATGGTGAAGTGAAAGTGAGGGAAAAACCAAGGACTCTTAAGAGGACTCAGCCCCGTCCCAAGAAGCAAACACGCTCTCCCAGTTCCTCGGATGATGATGTTGAGGACCCTGCTAGAGTAGTAAAGTTGCGAAATGGTGGCACAACCAGATCTCTAGAAGCCGACAACCATCGCAAGAAGCCAAGGAGTCGTaagccactaccaccaccgcctccaccaccaccaccaccaccacctgtaTCATCTAGTGAGTCAGAACCAGAGGAGGATCTGAAGAAAAAAGTAACCAAAGTCGTATTATCATCTTCCAGTTGTTCGGAAGGTGAGATCAAGGAGCCAAAGCCAAAAGAAAAGACAGTTGTTAATGTTCCATGCGCGTCTGGAGAAGAGTCAGAAGAAGAAGCGAAGCCAAAAGTTCATAGTGCAAGCAGCAGAGACAGTTCTAAAGTGAGGACAAAGGTTGAACTAAGTGACTCAGATGAGGAGCCGAGTGTAAAACCGTCAGTTGTGATAGAAGTGAGTGACTCTGACGATCCAAACTGGGGCAAGACCAAGACCAAAGTACAAAGTGCTAAGAAAGGTAGAGGTAGGAGCAAGCCGACGTCAAGTGACAGGTCGTCCAAGCGAGAGGAAAAGAAATCCAGCGGTCCAAAGAAAGAGAGGACACCCAAACGTCCGAACAACAGGAAGAGGCCTAGAGACGACAGTTCTCTGGGCCCAAAGAAATCCTCAGGACCCGGAAGGCCAATGCGAAAGAGTGGTGTCTTCAAGAGCATGGAGCATTGCCCCCCAGACACTCCCTCGAACAGTGAAGGCGAGATAGACATTATAAGTACTTCCCCAGTGAAGCCCATTGTGAGACCTAAACTGGTGAACAACTCCTGTCGAGACTCCAGTGCATCCGACTCTGATAGGTCTCGCACCAGGGTCAAACCAGACGTCCCAAAACGTAAACATTCATCAGACAGCGATGGATCTCCCGTGAGTCGTCCACAAGCCTCTAAGGTAGAGGAGAGTCCCCCCAAGCTGGATAGCGAGGACAAAACCATCCAGGACAAGAAGAAAAGTGACACATTACGGAAACTGTTCACATTGAAGGGTGGGGCTAAAGGAGGGGGTAAAGGGGGAGCCAAAGGGGGTGGCAAGGGTAAGGGGGGCTCTAAGACACCGGGGGTGCTGGTGATTGAGAGCGGCGAGTGCGAGGACGAGCACTTGCCTACCCCAATCAGTCCTCAGCTGTTGTCACCACTGCCAAGCAGCGAGGTTGGTAGCACTGCAGCAACCAAAAAAAAGCCCAGCAAGAAGGACTCGGCAGTGGAACTTCCTCCGTTAGTTTATAGTGACAGTGGTACTCCAAGTTTACTGTGTCAAGTGGATCTCTCGAGACTGTCGCATGTGCCCGCAGGTAAGGGGGAGGACATAAGGGTGCGCACGGAACTGCCTGACACCAGGCAACTCCCTAGCGACACCGATGACGTAGCTATACCAAAGTGTACAGGTTGTGACAACGCAGATCAGAAGGCTTCCAAGAAGCAcaagaagaagcagcagcagcaggctTCTGTTACTGAGGGTGAGGAGAAGTCAGGAGGCAAACATCATCACAAGCGAGAACGTAGACACAGCAACAGTTCAGTTTCATCGCTATCAACAGAATGTAGCCGTGTCTCATCACAGGGTGCTAGACCACACAAACGCCACAGGCACAGTCACGAACACAGTTCCAGCAAACGGCGGAAAGGTTCACAACCTCCTCCACCACCTGCAGAAGAGGATCTCCTAGAACCAAGAATATCTAGTCAAAGTACCATCACGACAGTGCTAAATGATGTGCCTCCGACAAATCATGAACGAGAGGAGGTTCATGAAGATGTGATGGACCTTGTTCCAGCTGTTGGTGTGCTAGACCCGTGTGTTGACTGGGGACCTCAGCCCCAGAAGGTTTACTACTCTTACTTTGAGCGAGAGGAAGAGGACACATCTGACCATGAAGAGAGGGACGAGAACCAGCAGAACAACTATCTGAAGGAAGCAAAGAGGCTGAAACATGGAGCAGATAAGGAGATGGACCACTTGGCACAGGGAATGCAGTACTTAGAGGCGGTGTTGTTCTTCATTCTCACTGGCAATGCTATGGAGCGAGAGAGAGTCACGGAGAAGGCAGCGTTCACCATGTACAAGGATACACTCAGTCTCATCAAGTTCATTTCCTCTAAATTTCGAACACTCCAGAATCATTCTCCAGTCAGCATAGAcaaaattgcaattttaagTTTGAGATGTCAGTCGTTGCTGTATCTGAAGCTGTTTAAGATGAGGCGACATGAAGTCAAGGAGTATCAGAAAGTATTGCAGGATTATCATCAAAAGGCCACCCAGCCCACACCATTGCAGCCGGAGCAGTGTGGCGGCCAAGGGACACCTTCTCCACTGTCACCAACCCCTTCACCTGCTGGCTCTGTTGGTTCGGTCGGCAGTCAGTCTTCTGGATACAGCAGCGGAGAACTGGCAGCAGGAAGAACCATGGGAAACGGCGTGGGAGTGGGCAATGCCATGGGACAGCAGCCCCCGGCAGCCAACATGGGACCTTGCGTAGCAGTCCCGTTAGCAGTGCACTCTGCCATGCAGAAACAGAACCAACACTTCAGTTATTTGCTGTCATGTCACGAATTGTGGGAGCAAGCTGATGATTTGGTGTACAAAGGGAAACACACAGATTTCTTCATCGAGCTGGACCAATACTATGGACCCCTAACTCTGCACAGCTCTCTTCAGGATCTTGTGAGATACGTGAGGGTCGGCATCACACGTCTCAAAGAGCTTCTCAAGTGA
Protein Sequence
MVNWGLNSISLCYALRLTSRSPTATFSGWGSSILGYVLRLGALYPRLCFAAGGPLSSATICATSSSPTEASPRTFTRSSNRLNLTTSHTSAIPSENGRTVVPSAEDTVTQQIQRKLGDYALVQQFLDDEPKRLIGIDGLHPPASPAPGGVTTFTSTSLAAAARLQPPPEFKKPQHAGTRTNSSNNNHHHHHHHHHGPTRGGFVKPADGKPPYGGRGGYPGQPVKHGAGSNDHRSNGIVPPKGPPQGGGSRVHQAGRNLPRIHAPVLGPRESSSQLGSAGHNEVENILKEMTEVMAPISGIATPRKELESKFTFSPMVPKVSEVQAPRPAADYNRNINRHGNQVLATTQVSGNLEKDLSLSDSSDDDKDVLKSNGRPSSAAAAAAAKMDQSRRVPALLTASPTPSILSVGSRPPEPAPLMSPVGPASSSSEADSDSGSGSDSESSSEDSGDEAPQESTEPAAMPTATAASPPQMEQLEEAKPQDRWNLGRFLKPIGEPPEENEQTRTGPELAKRESSPAVAAVPAVTAAKSVPASGTPSKEKQEEGEILSDDSDSNKELDNVVEDLPHPVPLLSGISDESDSPPVVETRTAPITKRKRPSINTRPPVLSESDSDGEVKVREKPRTLKRTQPRPKKQTRSPSSSDDDVEDPARVVKLRNGGTTRSLEADNHRKKPRSRKPLPPPPPPPPPPPPVSSSESEPEEDLKKKVTKVVLSSSSCSEGEIKEPKPKEKTVVNVPCASGEESEEEAKPKVHSASSRDSSKVRTKVELSDSDEEPSVKPSVVIEVSDSDDPNWGKTKTKVQSAKKGRGRSKPTSSDRSSKREEKKSSGPKKERTPKRPNNRKRPRDDSSLGPKKSSGPGRPMRKSGVFKSMEHCPPDTPSNSEGEIDIISTSPVKPIVRPKLVNNSCRDSSASDSDRSRTRVKPDVPKRKHSSDSDGSPVSRPQASKVEESPPKLDSEDKTIQDKKKSDTLRKLFTLKGGAKGGGKGGAKGGGKGKGGSKTPGVLVIESGECEDEHLPTPISPQLLSPLPSSEVGSTAATKKKPSKKDSAVELPPLVYSDSGTPSLLCQVDLSRLSHVPAGKGEDIRVRTELPDTRQLPSDTDDVAIPKCTGCDNADQKASKKHKKKQQQQASVTEGEEKSGGKHHHKRERRHSNSSVSSLSTECSRVSSQGARPHKRHRHSHEHSSSKRRKGSQPPPPPAEEDLLEPRISSQSTITTVLNDVPPTNHEREEVHEDVMDLVPAVGVLDPCVDWGPQPQKVYYSYFEREEEDTSDHEERDENQQNNYLKEAKRLKHGADKEMDHLAQGMQYLEAVLFFILTGNAMERERVTEKAAFTMYKDTLSLIKFISSKFRTLQNHSPVSIDKIAILSLRCQSLLYLKLFKMRRHEVKEYQKVLQDYHQKATQPTPLQPEQCGGQGTPSPLSPTPSPAGSVGSVGSQSSGYSSGELAAGRTMGNGVGVGNAMGQQPPAANMGPCVAVPLAVHSAMQKQNQHFSYLLSCHELWEQADDLVYKGKHTDFFIELDQYYGPLTLHSSLQDLVRYVRVGITRLKELLK

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01410792;
90% Identity
-
80% Identity
-