Basic Information

Gene Symbol
lilli_1
Assembly
GCA_963930825.1
Location
OZ005760.1:2232407-2249145[+]

Transcription Factor Domain

TF Family
AF-4
Domain
AF-4 domain
PFAM
PF05110
TF Group
Unclassified Structure
Description
This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 8 9.8e-09 0.0002 20.9 0.1 1 81 6 90 6 153 0.66
2 8 1 2.1e+04 -13.0 21.5 444 472 188 215 167 219 0.50
3 8 1.5e-11 3.1e-07 30.2 18.0 340 501 240 409 230 413 0.76
4 8 1 2.1e+04 -9.7 22.9 145 259 499 611 415 625 0.57
5 8 0.68 1.4e+04 -4.9 10.5 431 476 632 682 596 699 0.49
6 8 1 2.1e+04 -12.4 36.2 84 251 669 839 657 852 0.50
7 8 0.00094 19 4.5 17.8 74 269 932 1124 893 1130 0.52
8 8 0.11 2.2e+03 -2.3 1.8 177 215 1299 1341 1226 1369 0.48

Sequence Information

Coding Sequence
ATGTGTGATTTATTTTACAGTGTGGAAAGGGACCGCCTTCGTGAGCGGGAACGGCTGGCGAGGGCCCAAATGTCTTCACAAGCGGTCGAACAAGAAAGTCCAGGTGCCTCGGGTGATTTTCTATTTAGTGGTCCGATAAAAGTGAACCCGTCCTCTGCTGATCCCGTCACACAACAAATACAAAGCAAACTCGGCGACTTTCAACGTGTTCGCCATTATTTGGACCAGAAAGATTCCGCTTTAATTGGAGTGGATGGTGGTGCACCGCCTCCAAGTCCAGGTGTACCATCCTCCAGGCACCATTCGATCTTAAGTCTACCACCCAATTCCGGACCATGCACCTCTGCAAGACTCCAGCCGTCACCTGAATCTCGTACTGAGTTTAAGAAGCCTCACCACCATCTCCATCACACGTCACATCAGCGTAGCGGTTACGTGAAACCGGCGGACGGGAAGCCGCCGTATGGAGGCCGGGGCGGCTATCCGGGCCAGCCAGTCAAGCATGGCAGCATTAATAATCATCGACCAAATGGCATATTGCCGGCAAAGGGTCCACCTTTATCCTCGCCCACATCGTCAATGTCCAGCACCACAACGGCATCTTCCTCTTCTACTAGTTCCTCATCGTCTTCATCAGCATCGTCAAATCGTTTACATGGTGGACAAAGAAATCAACGCATTCCATTTGAGAATCAAGGTCCTACGGCCGGCTCCAGGGAATCCTTGCCTTCCGCAAATCCAGAcgtaaataatatatttaagGAAATGAGAGAGGTTCCATCGCCTTTGACTGCAATGGCTGCGACGCCGAGGATAGAACTGGACAATAAATATACAACTTGTACATTTAATCCAGTTCTTGCCAAGTTGACTGAAGCATCACCAGCACCATCCACACCAAAAAAACGCGATCGTCTGCCAGCCCCACGGCCCTCCACtGATCTAAAAGATGATCTCGATTTGTCCGAGGAGAGTGACGATGAACAAAAGCGAGAGGCCTTACCTTCTTCCAAGTTATCCGTTGAAAAGATGTTGTCGCCATTAGGTGGCCTAACACCAGCTAACCAAAAAATGGAACGTCCTCCTGATCCTGCCCATTCGCCCATTGGTAGTTCATCAAGTGAGTCTGGCTCTGATTCCGGTTCGGAAAGTGATTCAACAAGTGACGATTCCGGTGACGAGAATGTCACATCAATGAGTCGAACAGCAGCACCGCCGCCCGTCGTCTCCACGTCAGTATTACCAGCCGAAGCGCAACCATCCTCCCCGAAAATGGAGGAGGAGAGTAAGCAACTCCGTTGGAATTTGGGCTCGTTTGTGGTGTCACCCAAGCCGCAAACGTCGCCGCTCTTGTCGCCGATTAAATCAGCAACGGGCGCCACCAGTCCGGATAGCAGAAAACGCGACAAAGAAGAGTCGGATACGAGCGATTCAACGCGGGATTTAGGGAGAGTCGTTGCGGAGGCGTTCGCCTCCAATGCCGTGCCACTGTTAAGTGATTTCTCCGATTCCAATTCAGAAAAGGAGTCCCAGAAACGGTCGAAAAGACGCAAAAGGACAAACAATCCACCGCCGCGTAGTGATGACGATGATAGTGAAGACAGTGAGGATGATGATGAACGGACAAAAATCACAAAACCAGTGCCGCGTGTTAGTCCGCGTACGAAATCAGTTGACTCACTTAGTGAATCTGATGATTCGGAATTTAGCGTTGTTAATAATAGTGCCAAAAAAtccaaaacaattaaatcAGCTAATAGTGATAAGCCGAAATCGAATCGTGGCAGACCGCGAAagtacaataataataaaacgtCGAATATAGCGGACAAGAAGCAACGCGGTCGGCCACCAATCAAAAGTCAACAAAGTGATACGGACACTGAAGTGCGGAAGAAGCGTGGAAGGCCGCCGAAAACGGCCCGACCATCTTCACCTCTGAGCAGTTCGGACGAGGATGGACCGCCGCCAGCGAAGAAGTTTGAGAAACCGTTAAGACGTCGGACCAAGTCGAAGAGGGATTCGAACAGTAGTTCGGATTCGGATGTGAGTCCGCCCAGACGACATGAGTCCGTTGAGATTGAAACGCCAAAGAAACCGCACAAACACAAATCACCTCGTTCTGATGAGAAAAGTAGAAAGAATAAGAAAGAATCAGACGATGAATGGGGTGAGAAGAATAAGAATAAATTAAGACACCATTTTGAGCCTGAAAAACGAACGAAACCAGAATCGCCTCGTAAGAAGGATCCACAAAGACGGAAAGGACGAATACCACAACCAAAACAACAGAAGAGTATTTCGACATTACCGACAACTACGAGTGATTCGAGTGATAGTGATGTTAAACATAAATCGCACACATCTCCCATACGAACTGTAAATAATCGATCATCTAGTGGATCTGACTCTGATAGACGATTAAGTCGATCATCGGAAAGTGAGTGTAGTGATAATGGCAGAAGGAAAAAGTCACCAACGAAAATTGATTCGTCAATTAAAGTGGAAGAAACGAAACCGGTACAAGATAAAAAGAAGAGTGATACATTGAGGAAATTGTTCACATTGAAAAGGGACTCGGAAGGTGGAAAAGGAGGAGGCAAAGGTGGTGGCAAAGGAGGGAAAGGAGGAAAAGGAAAAGGTGGAGTGAGTGTGATAATGGTAGATGAAAACTATGAACGTAGCAGTTCATCTGTGGAAGATGAGACAATGCCAACAATATCATCAAATCCGGCACTATTATCACCATTGTCCACCGTCGAGAAACGTAGCCCTTTGATACCGGCCCGTGTTAATTCACCCTACAATATGTACAATGAATCAATCAAAACAGAAAAGACTGATTTATCAATTAATACTGATCATAATCATGTAACAAATGAGAAAAGCATAATGGTGAGAATTGATTTGAATCGTTTACAAGACATTCCAGCATTACGAAGATATTTGGATGCGAAACGTACATCATGGCGTAATAATAATGCACACGATAATATGATGAAATTCAATGAGAAGAATCGTTTATTATACGACaatgaaatggagaataataggatGACGATAGAAAAAATAAAGGAAAGTGACAACGAAACGAAAAAGTTGAATAAAATTGACCAACCCAACACCGATCGTAAACATAAGAAACGAAAACGTCGTAATAGTAGCAGTTCCGTATCATCACATTCGACTATAAGCAGCATGTCACATAGTAGTAGTAAAAAGAATAAAGagaattttgtaaataataataaaactaaGGAAAACCGTAAAGCGAAACGACGAAAAGAGGACGGTGCGACCACTTCAAGTGACTTAATCATGGCTAACAATAGATCTCATACTGATAATCTCACATTAACCAATGTACCACCGACCAACCACGAACGGGAAGGACTTGGTGGCGGATGTAGTGGTGTTAGTAATAATAATGGTGGATCTGGTAGCCGTTTATCGCCCGAACGGACATTTCCAATACGGACATCTGGTCATAGGGAATACCATTCGTATTTTGAAGCGCCCGAAGAACCTTCCGAATACGAAGAAAGAGATCAAAATCAGTACCTGAACGATGCGAAACGATTGAAACATTTGGCCGACAAAGAAGTGGACCCAATAAAACAATGCATGCTGTACCTGGAAGCTGTGCTATTCTTTTTGTTGACTGGAAATGCCATGGAACATGAGAGTGTCACCGAGAAGGCCGCTTTCACAATGTACAAAGATACacttaatttaattaaATTTATTTCATCAAAGTTTCGCAATCAACAAACTGCATCTTCAGTGCATAATAAACTAGCTGTTCTAAGTTACCGGTGCCAAGCCCTCTTATactataaattgtttaaaatgaGGAAACAAGAAAGTAGAGAGATACAGAAGACTATTGGCGAGTTTTGCAGCAATAAAaatgcTACGATGCCTCAGGAGCAACAGAATCACCAACAAGGCCAGGGAACGCCTTCGCCTTTGTCACCGACACCTTCGCCTGCGGGCTCTGTTGGTTCGGTCGGAAGCCAATCGTCCGGCTATCTCAGTGGTGAATTACGAGGCAATAATAGTAACAATAACGCTCCTGTACCATCAACACATGCACAAACGCCCGGTGTTTGGGTACCATTACCAGTTTTCAATGCCATTTCCAAACAGAATCTTCAATTTAGTTACTTGCTGAGTTATCAGGAGCTTTGGGATACGGCAGATTCGCTTGTGATGAAAGGAAAACACACCGACTTCTTTATCGAATTGGACCGACAATGCAGACCATTAACAATGCATAGTTCATTAATAGACTTGGTCAGGCATGTTCGTGAAGGGATCAATCGACTTAAAAGTCAAAGTTGA
Protein Sequence
MCDLFYSVERDRLRERERLARAQMSSQAVEQESPGASGDFLFSGPIKVNPSSADPVTQQIQSKLGDFQRVRHYLDQKDSALIGVDGGAPPPSPGVPSSRHHSILSLPPNSGPCTSARLQPSPESRTEFKKPHHHLHHTSHQRSGYVKPADGKPPYGGRGGYPGQPVKHGSINNHRPNGILPAKGPPLSSPTSSMSSTTTASSSSTSSSSSSSASSNRLHGGQRNQRIPFENQGPTAGSRESLPSANPDVNNIFKEMREVPSPLTAMAATPRIELDNKYTTCTFNPVLAKLTEASPAPSTPKKRDRLPAPRPSTDLKDDLDLSEESDDEQKREALPSSKLSVEKMLSPLGGLTPANQKMERPPDPAHSPIGSSSSESGSDSGSESDSTSDDSGDENVTSMSRTAAPPPVVSTSVLPAEAQPSSPKMEEESKQLRWNLGSFVVSPKPQTSPLLSPIKSATGATSPDSRKRDKEESDTSDSTRDLGRVVAEAFASNAVPLLSDFSDSNSEKESQKRSKRRKRTNNPPPRSDDDDSEDSEDDDERTKITKPVPRVSPRTKSVDSLSESDDSEFSVVNNSAKKSKTIKSANSDKPKSNRGRPRKYNNNKTSNIADKKQRGRPPIKSQQSDTDTEVRKKRGRPPKTARPSSPLSSSDEDGPPPAKKFEKPLRRRTKSKRDSNSSSDSDVSPPRRHESVEIETPKKPHKHKSPRSDEKSRKNKKESDDEWGEKNKNKLRHHFEPEKRTKPESPRKKDPQRRKGRIPQPKQQKSISTLPTTTSDSSDSDVKHKSHTSPIRTVNNRSSSGSDSDRRLSRSSESECSDNGRRKKSPTKIDSSIKVEETKPVQDKKKSDTLRKLFTLKRDSEGGKGGGKGGGKGGKGGKGKGGVSVIMVDENYERSSSSVEDETMPTISSNPALLSPLSTVEKRSPLIPARVNSPYNMYNESIKTEKTDLSINTDHNHVTNEKSIMVRIDLNRLQDIPALRRYLDAKRTSWRNNNAHDNMMKFNEKNRLLYDNEMENNRMTIEKIKESDNETKKLNKIDQPNTDRKHKKRKRRNSSSSVSSHSTISSMSHSSSKKNKENFVNNNKTKENRKAKRRKEDGATTSSDLIMANNRSHTDNLTLTNVPPTNHEREGLGGGCSGVSNNNGGSGSRLSPERTFPIRTSGHREYHSYFEAPEEPSEYEERDQNQYLNDAKRLKHLADKEVDPIKQCMLYLEAVLFFLLTGNAMEHESVTEKAAFTMYKDTLNLIKFISSKFRNQQTASSVHNKLAVLSYRCQALLYYKLFKMRKQESREIQKTIGEFCSNKNATMPQEQQNHQQGQGTPSPLSPTPSPAGSVGSVGSQSSGYLSGELRGNNSNNNAPVPSTHAQTPGVWVPLPVFNAISKQNLQFSYLLSYQELWDTADSLVMKGKHTDFFIELDRQCRPLTMHSSLIDLVRHVREGINRLKSQS

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01121607;
90% Identity
-
80% Identity
-