Basic Information

Gene Symbol
lilli
Assembly
GCA_000211455.3
Location
scaffold:228667-281109[+]

Transcription Factor Domain

TF Family
AF-4
Domain
AF-4 domain
PFAM
PF05110
TF Group
Unclassified Structure
Description
This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 10 1 1e+04 -11.0 9.3 470 486 58 74 17 92 0.35
2 10 4.2e-11 4.4e-07 28.8 2.5 3 204 115 312 112 334 0.51
3 10 2.5e-05 0.26 9.7 0.0 345 383 376 415 354 425 0.78
4 10 1 1e+04 -6.7 12.3 457 490 476 507 447 514 0.60
5 10 0.31 3.3e+03 -3.8 5.0 110 211 668 762 609 785 0.37
6 10 1 1e+04 -7.7 20.4 107 250 762 901 718 915 0.49
7 10 1 1e+04 -5.7 15.0 442 491 951 999 935 1005 0.51
8 10 1 1e+04 -7.5 11.7 447 489 1118 1157 1100 1170 0.40
9 10 0.016 1.7e+02 0.4 17.3 117 207 1341 1427 1320 1514 0.50
10 10 0.032 3.4e+02 -0.6 1.2 143 192 1637 1670 1585 1732 0.55

Sequence Information

Coding Sequence
ATGCCGGCGAACAAGGACCTGCTAAACAAGGCAGAGCTTAGAGTGTGTGGGCCGGagcaactccgagcaatgagaagcagtaacagcaacagcagcaacaacaacagcaaaagcaaTCCTTACGCGCCAACCGGAAGCGGAACAGGAGCACCGCCGGCAGCATTATTATTTTCTTCGTCCTCCTACTCCACCGCACCGTCATTGACATCATCCTCATCTAGCGGACGACACTCTTATTCGTTGCATACAACGGCACTGCCCGCGCCCGCGTCGTCGTTTGGGACGACCGGCGGAGTCGGCGAGTTGGACATGAAAAAATCGCAAAAACCAAGGCACGATGAGCATGATAGGATGGAGCGTCGCGAACGGGATAAACAGGCCCGTGCTCAGCTGCAGGCTGAACGGGAACCGGAACCTACTGGCCCATTATTTACAGCACCGTTCAAATTGCCAAGTACATCGGACACGGATATGCACATCGCCCAGCGGCTTGGCAATTATGAGGCGGTCAAGAAGTGCTTTATGGAGGCGTCGACGTCGTATCATGTGATCGGTATCGCGACCAGTCCGGCCCCGACAACGCCTCGGGCCGGCAGCAGCGGCAACAGTGGCGTTCTGTCGAATTCATCGTCGTCCACCGGCAGTCACCATCGGTCGCTTCCACCAAAAACCAATAGCAATAGTAATACCATTAGCAATAGTAACGTCAACTTCGTCAAGCCAGCCGACAATCGCCCCCTCTATAATGGTGGTCGAATGTCCTCTTCTTCTTCGGCGGCGCCAACAGCCGGCGGATCGACGGGTCGTACTGGAGCCCCAAACCATTATTCGTCCTCATCGTCGTTGTCATCATCGGGTACGGTAGTCGGTGGTTCCTTCAACAAGCATGAGATGCATGGAGTTCCGTCGAAGGGACCGTCATCTACGTCAATACCAACAGCGCTGATGAATGGCCGTTCATCATCATCTGGTTCTATGATGGGACCCGGTGTTAGTAGTGGAACTGGAAGTGGTCTTAGTGGAGGGATCCCCGGACTTTCCTCTACCGCTGGTGGTGATAAGTTACCCTCGCAATTGCCCAATGGACGACTGCCCCAGGTTGCAAACGCAAAAGTGCCTAGAGTACAGGGCAACAATGTGGACAAAATCCTGAACGAGATGAAGTCCAGCCTGATGACGCCACTGACGGAGATCGGTGCCACGCCGCGTAAGGAGCTGGAATCTAAGTTTAGCTTCAGTAACCCGAACCCGAAGTCTTTCGTGTATGCTATGACACCACTACTAGCACCCATGACACCCCTGTCAGCCGGTGGTGGTAGTAACGGTATCGGCAGTGGAAACGGATCCGGTAGTACTTCAAGACCAATCGGTTCCTTAGCGCTTTCTAGTATGGAAATGCATCCTCCCCTTGTGTCTGGCATTGAGGATGACAATCAGAACGGATCAGAGCGCAACTCTAGTGAATCCTCTTCAAATGAGAGTGTAGATGAATCGTCTAGCGAAGACTCAAACGTGGGTAATAAGCTAGCATCGAATGGTGGTGGTGCTGCTACTACCCTTGGAGATGGTAATTTAACCAGTGCCAGTGGTGGTGGCAATGGAGGAAGTGCCGGCGCTGCTGGATGTGTAAACGGCGGTCAAGTCGGGGATGGTGGAACCCTGGGTAGCCCGATCAGGCGTAAGGATTGGTCGCTCCTTAACTTTATGCAACCCACCATCCAACAAGTGCCAAGCGAAAGTACCCATCACCACCATACTCATCATCATGAAGAGAATTCGGTATCGTCGCCAATTCGAACGCTGAAGATGGGCAGTGGTGATACTTTGGGAGTAGCACTGTCTCCTCCTTCCCTAGTAGGAAACGTTGCACCCATCAAAAATGAACCGCTTGCCCCGCTAGACGATGACCATCTGTCGAATGCTAGCAGCAACAGTGAACCTCCTGTTAGTGCGATTGCTAGCGGGCCAGGAGCAACTGCGTCGTCCTCTTCCTATGTTAAACAGGAACCGTATCCTAGCGAAAACAGTGCCTCATCGCCTCCGGCTTCGATGGGTGTAGGGGTTAAGAGTGAGCGGAAGGACGATGCTCTCGATCGGTTATCATTATCGAGTCCGATTAAGAGTCCCGTCGAACATCATCATGGTGGCTACAACAATCATCAACAACAGCAGCAGCAACATCTTTTCGGCAGTGAAAATCTCGAGCCGGACGTGGATGTGATAAGTGCCCTACAGGAAGCCAAAGAGTTTAGCCTCATCAAGCCGATATCGAGTATGTCCGGTAGTGACTCCGACGATGCGCTCGATGCGCCGTCCTCGGCATGCGGCGTGGACGTGGATCAATCATCGGCAACACACACTCGACTGTTACCAGCCGCCCAGCAGCATGATGCAGTCGGTAGTAACGGCGTAGAAACAAGTGCTGCGGCAGCaaaaaagaaaaaGCGAAAGCGAAAGCTTGCTGGAGCAAATGATCGAGAACAGCGCGATGCATCGACGAGCAGTAGCGAGGATGAACGATACAACGCTATCTCACATCGAAGCCGCTCACAATCGTTCGAGAAGGATAAATCTTTGCTGAAAGGGCGAGGTCGGCAACGGAATGCAAGCAATACTCATGGTGGGGGTACGACAACGAGTGCGGCTTCCAGCGCCAGGTACTCCGATGCGGACTCTGTGGCCAGTGGTAGTGGCCGGCGATCGAGCAAAACACCTTCACACGGATCAACTCCCACGAAAAAGTTAGGCCTGGGTGTATTGAGTGCCAGCGCGGCATACGTGGATCCTGGAAGCATTATGAGTCCACCACTCAGCATACCTTCAGTAGATGGCATTCCGCCCACAGCGAAAACCTCTACCTCCCGTAAGTCACGTACACTGATCTCACGAACTAGTACCTCGTCGTCGGAGGAAGCTTCTTCTGGTGGATCGTCTGGATCTTCGGTGGAATCAGACTTTGGTCACGAGAGTCCGTCAGAGCAGATCGAAGCAGAACCTCCCGTGATACCAGCAGTGGTAGTGTGCAAAGCGAAAAAGGTGAAATCGAAGAAAAAATATGATAAGGATGCTAGCGTCATGTCATCGGCTGCTGTCGTTCTATCCGCGAAAACCTCTTCCGTTGAGCAGTTGACGAAGAACGGATCAACGGAAGGAGGAAGAAGTAGTCGTGGTAGCAGCACCAATCGTAACCTATATCATCTTTCTTCAGACAGCAATGACGATCGACCGTCATTGCTCTCTTCACCCGTTCCTACCAATGGTGGCCGTTCAGAAGTCGCTGGAGATAGTTCGATGATAGCTGGAAAACTGTCGAAGAAAGTGCGCAAACGATCGACTGCATCGGTGGCTTCGGTAGACGATGACGACAATACTCGCAACCGGCAACAAACAAGCTGTGTTGCcgatgacgatgacgatgatgatgatcgatCCGATAGTAACAGTGAAAGTGATAGCGATGCACCCGCCAAAAAGGAGAAGAAGCAGAGTaaaaacaaaaaaGCGGCTGTGTTTGCCCGGGTATTCAACAATAATGCCTCGGCATCTAGTggtggcaaaggtaaaggtggcaaaggtaaaggcggtaaaggtaaaggGCAGGTTTACATCGATCACGTAGACGATCTGCATGTTCCTGCGAAGAATCCGGCTCAATTGGCACCAGCAAATGCCAGCAATATGTCTCGTCAATCTCCGATCACTAGCCATGTGGATCAAAAGAGGAGTTTGTCCACGCAGCATACTCTAAGCGTGCCTTCATCTTCGACTGCAGGCCTTTCCGCTTGCTCTCCACGTGCCGAAGGTGCTCGACCAAGTAGCAGAGGAGGAGGAGGTTTAACGCCAGGGCAGAGAGCAGAGCCTAACCGAACGTCGCCGTTGTCATTGTTCTCCCCTATAAAAGGTGCCTTACAAAACATCACGCTTATGTGTCGAATCGATTTGAGTCGCCTTCTAAAAATACCACCACCGCCGTCGTATCCGGGATCCGGCAGCACCAGATCGCTATCGGGTAAGGCAAATGAAAGCTACAGCGCTCAACGAAGTGCCTCCGCTCGACAAAAGAGCAAAAGCCCATACGATCAACAACAAGGGAAGCGAAGGAGAAATTCAGTTGGTcagcaacaacagccgccgcaaccgccgccgccgcagcaacagcaacagcaacaacaaaaacatcaccaccagcaacaacaacaccaataccatcagcaaGGGCAGGATCATGATCGAAACGGTAACGGTTCAGTACACAGTTCCTCATCTACTCCGAAGAGGCTCGAAGATCGGTCAGAATCCATTTACGATCGCAACCGTATTACATTAGACGGTGAATCGGTTGTTGAAAATGGAACCACTATAGTGGCCGGTCCTTTGAGACATCGCAGTAACTCCATCAACAGTGATTATAGTGGGGCTGCACAGAAAGCTCGGGAGTATCACCGTGGGAGCGATGCTGGTGCTTTTGGGAGCAATTCAACATCATCTTCTCCTTTGATACATCATGCCAATCACCGACATTCAGGTGCCACTATGATTGGAGGCTATCAGCCGCATCACAGTCTTGCCGGGGATCCGAAAGGATCAGCAATAAGAACAGGAAAATCTCCGGTTCTTCCAGGGTACGACGAGAAACTAGCAAAGACGGAAAAACTCAGCTACAGTGGGTTGAAAGAGGATAAGCACTCACTTATATACGCTGGCAGTAAATACTCTAGTTCAGGCTATGGTCTGATCAAACAGGAAGGAGGACAATCGATCAAGCAAGAATTCGCAGGCAATAATGAGTTTACAACCGACAGCACACTGCTTGGCGAGGGTAAATTGGGTTCAGCGACGGGTGCTAAAATGGGTGGAACCCTTACAACGAATGGTACCGCTGAAGGCGCAGCAGCCAGTGGGCGAATGAGAAAACGATCCGTGAGTTCGAGCAGCAACTCTAACAATACGTACAAAGAAAAACGTCGGAAAAAGGATAAAGCAAACACTTCACAGACTGATCAACTTGAACAACTTCCACCCACCAATCATGACCGATTAGTGGTCGACGACTGTCAAACAACAGTCGGTGTCGCTAGCGGGACATTGGAGCGAGCCCATGGTGCTGGAACTGGAGCGCTGCATCACACCAATCATCATCATTCTCAACAGGTGCCTGCTGGCGATCCCTTTGTCTCGGCAGGTACCGGTTCACCGGAAGCTCCAGTGCATATCAAGAAGGTGTACGTTTCCTACTTCGAACGGAACGATGAAGAGTTATCCGAGGTGCGTGACCAAAACAGATATCTGTCGGAGGCAAAACGGCTCAAACATGCGGCCGATCGCGAAGGAGATCATTTGGCGCAAGCGATGCTCTATTTGGAAGCCGTGCTGTTCTTTCTGCTCACTGGTGACACAATGGAACGAGATCCTATTACCGAAAAAGCGGCCTTTACGATGTATAAAGATACGCTTTGTTTGATTAAATTCATTTCGTCGAAATTCCGTAGCCAACTGCAACATCCGACAGTGCAGGGTAACATTCACACGAAGGTGGCCATTCTAAGTTTACGATGTCAATCGTTGATTTATCTGAAATTGTACAAAATGCGGCGACTAGAGATGAAGGAAACGGGTAAAACAATCGGAGAGTTTAACCATAAAACCAGCACTGTGCCGGCAGAACTAGCGAATGGAAACACACCATCACCTCTTTCTCCGACATCAGTTGGATCGCAGAGCTCTGGGTACAGTTCGGGGCAGAATAATCAGGTCGGATCAATCCCACCGATTAATTCATCACCAGCTCAATGCATCCTTATGCCCATCAATGTACATACCGCATACCAAAAACAAACGACACTCTTTACGAACCTTTCCACCTGTTTTGATCTTTGGGAACAAGCGGATAGTTTGGTAATACGAGGGAACCATAACGAATTTTTCATCGACCTGGACCACGAGAATGGACCGATGACTCTGCACAGTTCACTGTACAATGTCGTTAAGTATGTGCAAGCTGGTATTCAGAAATTGCGGCGTATGTAA
Protein Sequence
MPANKDLLNKAELRVCGPEQLRAMRSSNSNSSNNNSKSNPYAPTGSGTGAPPAALLFSSSSYSTAPSLTSSSSSGRHSYSLHTTALPAPASSFGTTGGVGELDMKKSQKPRHDEHDRMERRERDKQARAQLQAEREPEPTGPLFTAPFKLPSTSDTDMHIAQRLGNYEAVKKCFMEASTSYHVIGIATSPAPTTPRAGSSGNSGVLSNSSSSTGSHHRSLPPKTNSNSNTISNSNVNFVKPADNRPLYNGGRMSSSSSAAPTAGGSTGRTGAPNHYSSSSSLSSSGTVVGGSFNKHEMHGVPSKGPSSTSIPTALMNGRSSSSGSMMGPGVSSGTGSGLSGGIPGLSSTAGGDKLPSQLPNGRLPQVANAKVPRVQGNNVDKILNEMKSSLMTPLTEIGATPRKELESKFSFSNPNPKSFVYAMTPLLAPMTPLSAGGGSNGIGSGNGSGSTSRPIGSLALSSMEMHPPLVSGIEDDNQNGSERNSSESSSNESVDESSSEDSNVGNKLASNGGGAATTLGDGNLTSASGGGNGGSAGAAGCVNGGQVGDGGTLGSPIRRKDWSLLNFMQPTIQQVPSESTHHHHTHHHEENSVSSPIRTLKMGSGDTLGVALSPPSLVGNVAPIKNEPLAPLDDDHLSNASSNSEPPVSAIASGPGATASSSSYVKQEPYPSENSASSPPASMGVGVKSERKDDALDRLSLSSPIKSPVEHHHGGYNNHQQQQQQHLFGSENLEPDVDVISALQEAKEFSLIKPISSMSGSDSDDALDAPSSACGVDVDQSSATHTRLLPAAQQHDAVGSNGVETSAAAAKKKKRKRKLAGANDREQRDASTSSSEDERYNAISHRSRSQSFEKDKSLLKGRGRQRNASNTHGGGTTTSAASSARYSDADSVASGSGRRSSKTPSHGSTPTKKLGLGVLSASAAYVDPGSIMSPPLSIPSVDGIPPTAKTSTSRKSRTLISRTSTSSSEEASSGGSSGSSVESDFGHESPSEQIEAEPPVIPAVVVCKAKKVKSKKKYDKDASVMSSAAVVLSAKTSSVEQLTKNGSTEGGRSSRGSSTNRNLYHLSSDSNDDRPSLLSSPVPTNGGRSEVAGDSSMIAGKLSKKVRKRSTASVASVDDDDNTRNRQQTSCVADDDDDDDDRSDSNSESDSDAPAKKEKKQSKNKKAAVFARVFNNNASASSGGKGKGGKGKGGKGKGQVYIDHVDDLHVPAKNPAQLAPANASNMSRQSPITSHVDQKRSLSTQHTLSVPSSSTAGLSACSPRAEGARPSSRGGGGLTPGQRAEPNRTSPLSLFSPIKGALQNITLMCRIDLSRLLKIPPPPSYPGSGSTRSLSGKANESYSAQRSASARQKSKSPYDQQQGKRRRNSVGQQQQPPQPPPPQQQQQQQQKHHHQQQQHQYHQQGQDHDRNGNGSVHSSSSTPKRLEDRSESIYDRNRITLDGESVVENGTTIVAGPLRHRSNSINSDYSGAAQKAREYHRGSDAGAFGSNSTSSSPLIHHANHRHSGATMIGGYQPHHSLAGDPKGSAIRTGKSPVLPGYDEKLAKTEKLSYSGLKEDKHSLIYAGSKYSSSGYGLIKQEGGQSIKQEFAGNNEFTTDSTLLGEGKLGSATGAKMGGTLTTNGTAEGAAASGRMRKRSVSSSSNSNNTYKEKRRKKDKANTSQTDQLEQLPPTNHDRLVVDDCQTTVGVASGTLERAHGAGTGALHHTNHHHSQQVPAGDPFVSAGTGSPEAPVHIKKVYVSYFERNDEELSEVRDQNRYLSEAKRLKHAADREGDHLAQAMLYLEAVLFFLLTGDTMERDPITEKAAFTMYKDTLCLIKFISSKFRSQLQHPTVQGNIHTKVAILSLRCQSLIYLKLYKMRRLEMKETGKTIGEFNHKTSTVPAELANGNTPSPLSPTSVGSQSSGYSSGQNNQVGSIPPINSSPAQCILMPINVHTAYQKQTTLFTNLSTCFDLWEQADSLVIRGNHNEFFIDLDHENGPMTLHSSLYNVVKYVQAGIQKLRRM

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00092151;
90% Identity
-
80% Identity
-