Adar003962.1
Basic Information
- Insect
- Anopheles darlingi
- Gene Symbol
- lilli
- Assembly
- GCA_000211455.3
- Location
- scaffold:228667-281109[+]
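
The Location value packs the sequence name, coordinates and strand into one string. A minimal sketch of splitting it apart is shown below; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for strings such as "scaffold:228667-281109[+]".
LOCATION_RE = re.compile(
    r"(?P<seq>[^:]*):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)


def parse_location(text):
    """Split a location string into sequence name, start, end and strand."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    return {
        "seq": match.group("seq"),
        "start": start,
        "end": end,
        "strand": match.group("strand"),
        # Assumption: 1-based inclusive coordinates.
        "length": end - start + 1,
    }


location = parse_location("scaffold:228667-281109[+]")
print(location)
```

Under that assumption the gene spans 52,443 bp on the forward strand.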
Transcription Factor Domain
- TF Family
- AF-4
- Domain
- AF-4 domain
- PFAM
- PF05110
- TF Group
- Unclassified Structure
- Description
- This family consists of the nuclear proteins AF4 (proto-oncogene AF4) and FMR2 (associated with fragile X E syndrome, FRAXE). These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and intellectual disability [1]. The family also contains Lilliputian, a Drosophila homologue of AF4 that contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
- Hmmscan Out
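| # | of | c-Evalue | i-Evalue | score | bias | hmm from | hmm to | ali from | ali to | env from | env to | acc |
|---|----|----------|----------|-------|------|----------|--------|----------|--------|----------|--------|------|
| 1 | 10 | 1 | 1e+04 | -11.0 | 9.3 | 470 | 486 | 58 | 74 | 17 | 92 | 0.35 |
| 2 | 10 | 4.2e-11 | 4.4e-07 | 28.8 | 2.5 | 3 | 204 | 115 | 312 | 112 | 334 | 0.51 |
| 3 | 10 | 2.5e-05 | 0.26 | 9.7 | 0.0 | 345 | 383 | 376 | 415 | 354 | 425 | 0.78 |
| 4 | 10 | 1 | 1e+04 | -6.7 | 12.3 | 457 | 490 | 476 | 507 | 447 | 514 | 0.60 |
| 5 | 10 | 0.31 | 3.3e+03 | -3.8 | 5.0 | 110 | 211 | 668 | 762 | 609 | 785 | 0.37 |
| 6 | 10 | 1 | 1e+04 | -7.7 | 20.4 | 107 | 250 | 762 | 901 | 718 | 915 | 0.49 |
| 7 | 10 | 1 | 1e+04 | -5.7 | 15.0 | 442 | 491 | 951 | 999 | 935 | 1005 | 0.51 |
| 8 | 10 | 1 | 1e+04 | -7.5 | 11.7 | 447 | 489 | 1118 | 1157 | 1100 | 1170 | 0.40 |
| 9 | 10 | 0.016 | 1.7e+02 | 0.4 | 17.3 | 117 | 207 | 1341 | 1427 | 1320 | 1514 | 0.50 |
| 10 | 10 | 0.032 | 3.4e+02 | -0.6 | 1.2 | 143 | 192 | 1637 | 1670 | 1585 | 1732 | 0.55 |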
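
The ten hmmscan rows above differ widely in significance. The sketch below parses the thirteen per-domain fields and keeps only hits whose independent E-value falls under a cutoff. The rows are copied from this record; the field names, the helper `parse_rows`, and the cutoff of 0.01 are illustrative assumptions rather than settings reported by the database.

```python
# Per-domain rows copied from the hmmscan output in this record.
ROWS = """\
1 10 1 1e+04 -11.0 9.3 470 486 58 74 17 92 0.35
2 10 4.2e-11 4.4e-07 28.8 2.5 3 204 115 312 112 334 0.51
3 10 2.5e-05 0.26 9.7 0.0 345 383 376 415 354 425 0.78
4 10 1 1e+04 -6.7 12.3 457 490 476 507 447 514 0.60
5 10 0.31 3.3e+03 -3.8 5.0 110 211 668 762 609 785 0.37
6 10 1 1e+04 -7.7 20.4 107 250 762 901 718 915 0.49
7 10 1 1e+04 -5.7 15.0 442 491 951 999 935 1005 0.51
8 10 1 1e+04 -7.5 11.7 447 489 1118 1157 1100 1170 0.40
9 10 0.016 1.7e+02 0.4 17.3 117 207 1341 1427 1320 1514 0.50
10 10 0.032 3.4e+02 -0.6 1.2 143 192 1637 1670 1585 1732 0.55
"""

# Hypothetical field names, in the column order of the table header.
FIELDS = (
    "index", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
)
INT_FIELDS = {
    "index", "of", "hmm_from", "hmm_to",
    "ali_from", "ali_to", "env_from", "env_to",
}


def parse_rows(text):
    """Turn whitespace-separated domain rows into a list of dicts."""
    hits = []
    for line in text.strip().splitlines():
        values = line.split()
        if len(values) != len(FIELDS):
            raise ValueError(f"expected {len(FIELDS)} fields: {line!r}")
        hit = {}
        for name, value in zip(FIELDS, values):
            hit[name] = int(value) if name in INT_FIELDS else float(value)
        hits.append(hit)
    return hits


I_EVALUE_CUTOFF = 0.01  # assumed threshold, not taken from the record

hits = parse_rows(ROWS)
significant = [h for h in hits if h["i_evalue"] < I_EVALUE_CUTOFF]

for h in significant:
    print(h["index"], h["i_evalue"], h["ali_from"], h["ali_to"])
```

With this cutoff only domain 2 is retained, covering protein residues 115 to 312 and positions 3 to 204 of the AF-4 profile.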
Sequence Information
- Coding Sequence
- ATGCCGGCGAACAAGGACCTGCTAAACAAGGCAGAGCTTAGAGTGTGTGGGCCGGagcaactccgagcaatgagaagcagtaacagcaacagcagcaacaacaacagcaaaagcaaTCCTTACGCGCCAACCGGAAGCGGAACAGGAGCACCGCCGGCAGCATTATTATTTTCTTCGTCCTCCTACTCCACCGCACCGTCATTGACATCATCCTCATCTAGCGGACGACACTCTTATTCGTTGCATACAACGGCACTGCCCGCGCCCGCGTCGTCGTTTGGGACGACCGGCGGAGTCGGCGAGTTGGACATGAAAAAATCGCAAAAACCAAGGCACGATGAGCATGATAGGATGGAGCGTCGCGAACGGGATAAACAGGCCCGTGCTCAGCTGCAGGCTGAACGGGAACCGGAACCTACTGGCCCATTATTTACAGCACCGTTCAAATTGCCAAGTACATCGGACACGGATATGCACATCGCCCAGCGGCTTGGCAATTATGAGGCGGTCAAGAAGTGCTTTATGGAGGCGTCGACGTCGTATCATGTGATCGGTATCGCGACCAGTCCGGCCCCGACAACGCCTCGGGCCGGCAGCAGCGGCAACAGTGGCGTTCTGTCGAATTCATCGTCGTCCACCGGCAGTCACCATCGGTCGCTTCCACCAAAAACCAATAGCAATAGTAATACCATTAGCAATAGTAACGTCAACTTCGTCAAGCCAGCCGACAATCGCCCCCTCTATAATGGTGGTCGAATGTCCTCTTCTTCTTCGGCGGCGCCAACAGCCGGCGGATCGACGGGTCGTACTGGAGCCCCAAACCATTATTCGTCCTCATCGTCGTTGTCATCATCGGGTACGGTAGTCGGTGGTTCCTTCAACAAGCATGAGATGCATGGAGTTCCGTCGAAGGGACCGTCATCTACGTCAATACCAACAGCGCTGATGAATGGCCGTTCATCATCATCTGGTTCTATGATGGGACCCGGTGTTAGTAGTGGAACTGGAAGTGGTCTTAGTGGAGGGATCCCCGGACTTTCCTCTACCGCTGGTGGTGATAAGTTACCCTCGCAATTGCCCAATGGACGACTGCCCCAGGTTGCAAACGCAAAAGTGCCTAGAGTACAGGGCAACAATGTGGACAAAATCCTGAACGAGATGAAGTCCAGCCTGATGACGCCACTGACGGAGATCGGTGCCACGCCGCGTAAGGAGCTGGAATCTAAGTTTAGCTTCAGTAACCCGAACCCGAAGTCTTTCGTGTATGCTATGACACCACTACTAGCACCCATGACACCCCTGTCAGCCGGTGGTGGTAGTAACGGTATCGGCAGTGGAAACGGATCCGGTAGTACTTCAAGACCAATCGGTTCCTTAGCGCTTTCTAGTATGGAAATGCATCCTCCCCTTGTGTCTGGCATTGAGGATGACAATCAGAACGGATCAGAGCGCAACTCTAGTGAATCCTCTTCAAATGAGAGTGTAGATGAATCGTCTAGCGAAGACTCAAACGTGGGTAATAAGCTAGCATCGAATGGTGGTGGTGCTGCTACTACCCTTGGAGATGGTAATTTAACCAGTGCCAGTGGTGGTGGCAATGGAGGAAGTGCCGGCGCTGCTGGATGTGTAAACGGCGGTCAAGTCGGGGATGGTGGAACCCTGGGTAGCCCGATCAGGCGTAAGGATTGGTCGCTCCTTAACTTTATGCAACCCACCATCCAACAAGTGCCAAGCGAAAGTACCCATCACCACCATACTCATCATCATGAAGAGAATTCGGTATCGTCGCCAATTCGAACGCTGAAGATGGGCAGTGGTGATACTTTGGGAGTAGCACTGTCTCCTCCTTCCCTAGTAGGAAACGTTGCACCCATCAAAAATGAACCGCTTGCCCCGCTAGACGATGACCATCTGTCGAATGCTAGCAGCAACAGTGAACCTCCTGTTAGTGCGATTGCTAGCGGGCCAGGAGCAACTGCGTCGTCCTCTTCCTATGTT
AAACAGGAACCGTATCCTAGCGAAAACAGTGCCTCATCGCCTCCGGCTTCGATGGGTGTAGGGGTTAAGAGTGAGCGGAAGGACGATGCTCTCGATCGGTTATCATTATCGAGTCCGATTAAGAGTCCCGTCGAACATCATCATGGTGGCTACAACAATCATCAACAACAGCAGCAGCAACATCTTTTCGGCAGTGAAAATCTCGAGCCGGACGTGGATGTGATAAGTGCCCTACAGGAAGCCAAAGAGTTTAGCCTCATCAAGCCGATATCGAGTATGTCCGGTAGTGACTCCGACGATGCGCTCGATGCGCCGTCCTCGGCATGCGGCGTGGACGTGGATCAATCATCGGCAACACACACTCGACTGTTACCAGCCGCCCAGCAGCATGATGCAGTCGGTAGTAACGGCGTAGAAACAAGTGCTGCGGCAGCaaaaaagaaaaaGCGAAAGCGAAAGCTTGCTGGAGCAAATGATCGAGAACAGCGCGATGCATCGACGAGCAGTAGCGAGGATGAACGATACAACGCTATCTCACATCGAAGCCGCTCACAATCGTTCGAGAAGGATAAATCTTTGCTGAAAGGGCGAGGTCGGCAACGGAATGCAAGCAATACTCATGGTGGGGGTACGACAACGAGTGCGGCTTCCAGCGCCAGGTACTCCGATGCGGACTCTGTGGCCAGTGGTAGTGGCCGGCGATCGAGCAAAACACCTTCACACGGATCAACTCCCACGAAAAAGTTAGGCCTGGGTGTATTGAGTGCCAGCGCGGCATACGTGGATCCTGGAAGCATTATGAGTCCACCACTCAGCATACCTTCAGTAGATGGCATTCCGCCCACAGCGAAAACCTCTACCTCCCGTAAGTCACGTACACTGATCTCACGAACTAGTACCTCGTCGTCGGAGGAAGCTTCTTCTGGTGGATCGTCTGGATCTTCGGTGGAATCAGACTTTGGTCACGAGAGTCCGTCAGAGCAGATCGAAGCAGAACCTCCCGTGATACCAGCAGTGGTAGTGTGCAAAGCGAAAAAGGTGAAATCGAAGAAAAAATATGATAAGGATGCTAGCGTCATGTCATCGGCTGCTGTCGTTCTATCCGCGAAAACCTCTTCCGTTGAGCAGTTGACGAAGAACGGATCAACGGAAGGAGGAAGAAGTAGTCGTGGTAGCAGCACCAATCGTAACCTATATCATCTTTCTTCAGACAGCAATGACGATCGACCGTCATTGCTCTCTTCACCCGTTCCTACCAATGGTGGCCGTTCAGAAGTCGCTGGAGATAGTTCGATGATAGCTGGAAAACTGTCGAAGAAAGTGCGCAAACGATCGACTGCATCGGTGGCTTCGGTAGACGATGACGACAATACTCGCAACCGGCAACAAACAAGCTGTGTTGCcgatgacgatgacgatgatgatgatcgatCCGATAGTAACAGTGAAAGTGATAGCGATGCACCCGCCAAAAAGGAGAAGAAGCAGAGTaaaaacaaaaaaGCGGCTGTGTTTGCCCGGGTATTCAACAATAATGCCTCGGCATCTAGTggtggcaaaggtaaaggtggcaaaggtaaaggcggtaaaggtaaaggGCAGGTTTACATCGATCACGTAGACGATCTGCATGTTCCTGCGAAGAATCCGGCTCAATTGGCACCAGCAAATGCCAGCAATATGTCTCGTCAATCTCCGATCACTAGCCATGTGGATCAAAAGAGGAGTTTGTCCACGCAGCATACTCTAAGCGTGCCTTCATCTTCGACTGCAGGCCTTTCCGCTTGCTCTCCACGTGCCGAAGGTGCTCGACCAAGTAGCAGAGGAGGAGGAGGTTTAACGCCAGGGCAGAGAGCAGAGCCTAACCGAACGTCGCCGTTGTCATTGTTCTCCCCTATAAAAGGTGCCTTACAAAACATCACGCTTATGTGTCGAATCGATTTGAGTCGCCTTCTAAAAATACCACCACCGCCGTCGTATCCGGGATCCGGCAGCACCAG
ATCGCTATCGGGTAAGGCAAATGAAAGCTACAGCGCTCAACGAAGTGCCTCCGCTCGACAAAAGAGCAAAAGCCCATACGATCAACAACAAGGGAAGCGAAGGAGAAATTCAGTTGGTcagcaacaacagccgccgcaaccgccgccgccgcagcaacagcaacagcaacaacaaaaacatcaccaccagcaacaacaacaccaataccatcagcaaGGGCAGGATCATGATCGAAACGGTAACGGTTCAGTACACAGTTCCTCATCTACTCCGAAGAGGCTCGAAGATCGGTCAGAATCCATTTACGATCGCAACCGTATTACATTAGACGGTGAATCGGTTGTTGAAAATGGAACCACTATAGTGGCCGGTCCTTTGAGACATCGCAGTAACTCCATCAACAGTGATTATAGTGGGGCTGCACAGAAAGCTCGGGAGTATCACCGTGGGAGCGATGCTGGTGCTTTTGGGAGCAATTCAACATCATCTTCTCCTTTGATACATCATGCCAATCACCGACATTCAGGTGCCACTATGATTGGAGGCTATCAGCCGCATCACAGTCTTGCCGGGGATCCGAAAGGATCAGCAATAAGAACAGGAAAATCTCCGGTTCTTCCAGGGTACGACGAGAAACTAGCAAAGACGGAAAAACTCAGCTACAGTGGGTTGAAAGAGGATAAGCACTCACTTATATACGCTGGCAGTAAATACTCTAGTTCAGGCTATGGTCTGATCAAACAGGAAGGAGGACAATCGATCAAGCAAGAATTCGCAGGCAATAATGAGTTTACAACCGACAGCACACTGCTTGGCGAGGGTAAATTGGGTTCAGCGACGGGTGCTAAAATGGGTGGAACCCTTACAACGAATGGTACCGCTGAAGGCGCAGCAGCCAGTGGGCGAATGAGAAAACGATCCGTGAGTTCGAGCAGCAACTCTAACAATACGTACAAAGAAAAACGTCGGAAAAAGGATAAAGCAAACACTTCACAGACTGATCAACTTGAACAACTTCCACCCACCAATCATGACCGATTAGTGGTCGACGACTGTCAAACAACAGTCGGTGTCGCTAGCGGGACATTGGAGCGAGCCCATGGTGCTGGAACTGGAGCGCTGCATCACACCAATCATCATCATTCTCAACAGGTGCCTGCTGGCGATCCCTTTGTCTCGGCAGGTACCGGTTCACCGGAAGCTCCAGTGCATATCAAGAAGGTGTACGTTTCCTACTTCGAACGGAACGATGAAGAGTTATCCGAGGTGCGTGACCAAAACAGATATCTGTCGGAGGCAAAACGGCTCAAACATGCGGCCGATCGCGAAGGAGATCATTTGGCGCAAGCGATGCTCTATTTGGAAGCCGTGCTGTTCTTTCTGCTCACTGGTGACACAATGGAACGAGATCCTATTACCGAAAAAGCGGCCTTTACGATGTATAAAGATACGCTTTGTTTGATTAAATTCATTTCGTCGAAATTCCGTAGCCAACTGCAACATCCGACAGTGCAGGGTAACATTCACACGAAGGTGGCCATTCTAAGTTTACGATGTCAATCGTTGATTTATCTGAAATTGTACAAAATGCGGCGACTAGAGATGAAGGAAACGGGTAAAACAATCGGAGAGTTTAACCATAAAACCAGCACTGTGCCGGCAGAACTAGCGAATGGAAACACACCATCACCTCTTTCTCCGACATCAGTTGGATCGCAGAGCTCTGGGTACAGTTCGGGGCAGAATAATCAGGTCGGATCAATCCCACCGATTAATTCATCACCAGCTCAATGCATCCTTATGCCCATCAATGTACATACCGCATACCAAAAACAAACGACACTCTTTACGAACCTTTCCACCTGTTTTGATCTTTGGGAACAAGCGGATAGTTTGGTAATACGAGGGAACCATAACGAATTTTTCATCGACCTGGACCACGAGAATGGACCGATGACTCTGCACAGTTCACTGTACAATGTCGTTAAGTATGTGCAAG
CTGGTATTCAGAAATTGCGGCGTATGTAA
- Protein Sequence
- MPANKDLLNKAELRVCGPEQLRAMRSSNSNSSNNNSKSNPYAPTGSGTGAPPAALLFSSSSYSTAPSLTSSSSSGRHSYSLHTTALPAPASSFGTTGGVGELDMKKSQKPRHDEHDRMERRERDKQARAQLQAEREPEPTGPLFTAPFKLPSTSDTDMHIAQRLGNYEAVKKCFMEASTSYHVIGIATSPAPTTPRAGSSGNSGVLSNSSSSTGSHHRSLPPKTNSNSNTISNSNVNFVKPADNRPLYNGGRMSSSSSAAPTAGGSTGRTGAPNHYSSSSSLSSSGTVVGGSFNKHEMHGVPSKGPSSTSIPTALMNGRSSSSGSMMGPGVSSGTGSGLSGGIPGLSSTAGGDKLPSQLPNGRLPQVANAKVPRVQGNNVDKILNEMKSSLMTPLTEIGATPRKELESKFSFSNPNPKSFVYAMTPLLAPMTPLSAGGGSNGIGSGNGSGSTSRPIGSLALSSMEMHPPLVSGIEDDNQNGSERNSSESSSNESVDESSSEDSNVGNKLASNGGGAATTLGDGNLTSASGGGNGGSAGAAGCVNGGQVGDGGTLGSPIRRKDWSLLNFMQPTIQQVPSESTHHHHTHHHEENSVSSPIRTLKMGSGDTLGVALSPPSLVGNVAPIKNEPLAPLDDDHLSNASSNSEPPVSAIASGPGATASSSSYVKQEPYPSENSASSPPASMGVGVKSERKDDALDRLSLSSPIKSPVEHHHGGYNNHQQQQQQHLFGSENLEPDVDVISALQEAKEFSLIKPISSMSGSDSDDALDAPSSACGVDVDQSSATHTRLLPAAQQHDAVGSNGVETSAAAAKKKKRKRKLAGANDREQRDASTSSSEDERYNAISHRSRSQSFEKDKSLLKGRGRQRNASNTHGGGTTTSAASSARYSDADSVASGSGRRSSKTPSHGSTPTKKLGLGVLSASAAYVDPGSIMSPPLSIPSVDGIPPTAKTSTSRKSRTLISRTSTSSSEEASSGGSSGSSVESDFGHESPSEQIEAEPPVIPAVVVCKAKKVKSKKKYDKDASVMSSAAVVLSAKTSSVEQLTKNGSTEGGRSSRGSSTNRNLYHLSSDSNDDRPSLLSSPVPTNGGRSEVAGDSSMIAGKLSKKVRKRSTASVASVDDDDNTRNRQQTSCVADDDDDDDDRSDSNSESDSDAPAKKEKKQSKNKKAAVFARVFNNNASASSGGKGKGGKGKGGKGKGQVYIDHVDDLHVPAKNPAQLAPANASNMSRQSPITSHVDQKRSLSTQHTLSVPSSSTAGLSACSPRAEGARPSSRGGGGLTPGQRAEPNRTSPLSLFSPIKGALQNITLMCRIDLSRLLKIPPPPSYPGSGSTRSLSGKANESYSAQRSASARQKSKSPYDQQQGKRRRNSVGQQQQPPQPPPPQQQQQQQQKHHHQQQQHQYHQQGQDHDRNGNGSVHSSSSTPKRLEDRSESIYDRNRITLDGESVVENGTTIVAGPLRHRSNSINSDYSGAAQKAREYHRGSDAGAFGSNSTSSSPLIHHANHRHSGATMIGGYQPHHSLAGDPKGSAIRTGKSPVLPGYDEKLAKTEKLSYSGLKEDKHSLIYAGSKYSSSGYGLIKQEGGQSIKQEFAGNNEFTTDSTLLGEGKLGSATGAKMGGTLTTNGTAEGAAASGRMRKRSVSSSSNSNNTYKEKRRKKDKANTSQTDQLEQLPPTNHDRLVVDDCQTTVGVASGTLERAHGAGTGALHHTNHHHSQQVPAGDPFVSAGTGSPEAPVHIKKVYVSYFERNDEELSEVRDQNRYLSEAKRLKHAADREGDHLAQAMLYLEAVLFFLLTGDTMERDPITEKAAFTMYKDTLCLIKFISSKFRSQLQHPTVQGNIHTKVAILSLRCQSLIYLKLYKMRRLEMKETGKTIGEFNHKTSTVPAELANGNTPSPLSPTSVGSQSSGYSSGQNNQVGSIPPINSSPAQCILMPINVHTAYQKQTTLFTNLSTCFDLWEQADSLVIRGNHNEFFIDLDHENGPMTLHSSLYNVVKYV
QAGIQKLRRM
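
The coding and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code; the `translate` helper is hypothetical, and only the first ten and last eleven codons of the record are used as examples. Input is upper-cased first because the coding sequence mixes upper- and lower-case letters.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence; stop codons are rendered as '*'."""
    # Drop whitespace so a sequence wrapped over several lines still works.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First ten codons and last eleven codons of the coding sequence above.
head = translate("ATGCCGGCGAACAAGGACCTGCTAAACAAG")
tail = translate("CAAGCTGGTATTCAGAAATTGCGGCGTATGTAA")
print(head, tail)
```

The two fragments translate to `MPANKDLLNK` and `QAGIQKLRRM*`, matching the start and end of the listed protein sequence, with `*` marking the terminal stop codon.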
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00092151;
- 90% Identity
- -
- 80% Identity
- -