Vcin010861.1
Basic Information
- Insect
- Villa cingulata
- Gene Symbol
- lilli
- Assembly
- GCA_951394055.1
- Location
- OX596018.1:35583026-35590486[+]
Transcription Factor Domain
- TF Family
- AF-4
- Domain
- AF-4 domain
- PFAM
- PF05110
- TF Group
- Unclassified Structure
- Description
- This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 10 7e-09 0.00015 22.4 22.5 6 176 5 165 1 188 0.62 2 10 1.8e-07 0.0039 17.7 10.3 349 461 280 393 255 404 0.66 3 10 2 4.3e+04 -7.4 22.8 109 253 417 554 398 567 0.33 4 10 0.055 1.2e+03 -0.3 12.2 422 491 574 645 559 660 0.72 5 10 0.46 9.8e+03 -3.4 2.3 442 502 675 737 663 742 0.36 6 10 2 4.3e+04 -11.5 46.7 392 487 747 853 719 869 0.53 7 10 1.3 2.9e+04 -4.9 19.0 87 217 902 1027 894 1046 0.48 8 10 2 4.3e+04 -6.7 13.4 68 257 990 1176 987 1221 0.52 9 10 0.19 4e+03 -2.1 2.3 67 146 1158 1243 1143 1365 0.45 10 10 0.42 8.9e+03 -3.2 2.3 138 169 1424 1455 1398 1527 0.36
Sequence Information
- Coding Sequence
- ATGGAAGATTATGAACGTATGGAAAGAAGAGAAAGAGATAAGCTCACAAGACAGCAAGTACAAAATCCAGATCCAGAAATACCACTTTTTCCAGCACCAGTTCGTTTAAGTCCATCAGAAGGTGATCATAGAATACAATCGAAACTAGGAGATTTTTCATTAGTTAAGGGTTTAATTGAACAAAAAAGATGCATTGGTATTTCAACTAGTGCTCAACGCCAATTACAAATTGGTTCAAGCTCAtctgcatcatcatcatcgtcatcgtcatcgtcatcatctgCTATACAACAATCTTCATCTACATCAgcgtcattatcatcatcaaactGTTCGGTAATACCTCCAATACAGCACAATCGTAATCTTCAACTGCAGCAGGCGTTATCAtctcatcattatcatcaatcacaacagcaacaacagcaactgCAACAActgcaacaacagcagcaacagcagcaacagcaacaacagcaacaacaatcgCAACAACCGCAACATCCACCACCGTCACGAGTTCCTTACATGAAGCAAATAACTGATAATAAACCAGCATATAATGGTCGTGGAGGTTATCCAGGACAATCTATGAAACATGATATTCATAGTGTTGGAGGCGGTGGTGGTATAATAGCACCTCCGACAAAAGGTTCAATGTTACCACCACCACAATCACAACCAATGTTGTCAAATGGTGGAAGATTGAATGACAAATTGTCATCAATCCCACCACTTCCAAATGGTAGATTTCAACCACCGTTGCCTAAACCGCCTAAATTTAACGCAGagTCTGAAAGCTCAAATCCATTATCATCATTGGCTGTAAATGgtgatattgaaaatattttgaaagaaatGGTTGTACCCAGTTACACACCATTGACTGAAATTGCAGCAACTCCACGAAAAGAAGTGGAATGTAAATTTACATTTGGACAACCAAATCCTCATAAGTATGCCGATTTTCCCCCACCATTTAAAGCACCATTAAGTAATAAATtaaaagaaaaaatagtgCCTACAAAACCAACTGATCTCGAGAAAGATTTAGATTTATCAGATAGCGATgaagaaagaaagaaacaaaCGTCTGTCGATCAgACATCACCAGCAAATGATGCTTCCAGTGAAagttcagaatcagtctcaagTGAATCTGGTAGCGAAGAATCAAATGGAACCAAAGGAGAAACCTCTAATCATACAACTAAATGGACATTAGGTAGATTTTTTAAACAACCTCAAGCACAAAATGAAATAACACAACCATCACCAAATGCAAATTCACAAAGTATACCTGaacaaacaattaaaaatgaacCAATACAGAATATTGAAGATGATAATTTATCATCACCGTctacatcatcattatcatatcgtaatacaaataatatatcatCACAGCATttacagcagcaacagcaacagaaaACGACATCAGCAAAAATAATTTCACCAGCCGATGTGAAAAAGGAGAATATGCAATCGGATAGTCTGTTTCCAATTTCTAATAAAACATCATCAGTTACTGGTAATAATAATTCACAACAACAGAAACCgggacaaattaaaagtgaggCATTTGataaaagTCCATCAACAAGTCCTGAAAACATTCAACAATCATCGTGTACTGTTGAATCAGATCAATCTGCAACAATATTTGCTGCAGCCGAAGAATATTCGTGCATAAAAGGTTTATCAGAATTAAGCAGTAGTTCTGGGGAGGAAGAAGatgttgaagaaaataaaaaacctaAAAAGAAACGAACGCCTAGTAAAAAGCcacagaaaagaaaaagaaaattgccTAAAGATAAGAAAACAGTAAACACTGCAATAGCATCAAGTGATGAAGAAGAATTTTCAAGTGCATTATCATCAGCAGGAAGCAGATCAAAATCAATTGATACTGAAAAGAAACCGAGAAGTGGACGAGGTCGTCCAAGAAAAAATCCAATAGCTGTAACAGCAGCATTAGCTACAGTTACACCAGCTCCAACCAAAGCGACTGCTCCATCAACTAAAAAAGAACCAGTCAAAAAACAGGCTGTACGAAGAAGAGTTTCACGACAAAATTCGAACGTTAAAAGTCGTGAATTGTTAGATACTACGAATTCAACGTCAGACGATGATGACGACGGAAGTCCACCGTCTGTTGCAGCACCAGTACGTCGAAGTCCTCTTCCTCCGCCTAAACCACCTGAACCGACAGCTTCTAGTCGAATATTACTTCAAATCCAAAAACGTTCATCACCATCATGTGCCGTAAAGTCAAACGAAGATGACgaagacgatgatgatgatgacgacgacgacgacgacgatgatgatgacgatgaaaaTTCTTCTACCTCTTCACCACCATCACAAGAAGATCGACTATCAGTTTCACCTCGTATAACGCCAGCAAGTcgacaacagcagcaacaacaacatatTTCTCCTGGTCGTCGTTCCCAACCAAGAAGCaaactatcatcatcatcatcctcatccTCATCCTCTTCATCTAGTGAAAGCAACTCAGACAGTGATAGTGAGGATTCAATAGAAGATAGAACGCATTCTAGAGACGATAAATTGACTTCAAGGGAAGATAAGaaaattcgtGATAAGACGAAAAGTGATAAAAACAAAAGTGATACCTTACGTAAATTATTTTATGGATTGAATAAGGGAGAAGGTGGAGCAAAGAAACAAGGTCAAGTACTTATTGTGGACCAATCTGAAGAATCACAAAATCAATCGTCTTATcatcatgaaaaaattttatcaccaatAGGCTTTAAGTGCAGTAGTTCGTCGTCAGTTGAAACTACTCAAACAACATCTATAAATAGCAATAATAACACCAACAGCAATACAAATAACGCCAATATACTTACAACCAAATCCCCACGACGTACTCCAATTCGCAATagtaacaacagcaacaacaataataatcataataataatagcaattgTAATAATATGATTAGAACACCAGACAGATCTTCGAATATTACACCAACATCAATCACATCGAATAAACATCAAATGTTGAATAATAACAGCAGTAATAGCAGCAGTAATTTTAATTTAGGAAATCTAAATGAAAATCGTTGTAATACACCAATAATGCCTCCAATTATTTGTAGAATTGAATTATCTCGCCTAAAGCATATTCCTCCAGAATGGTATCATAATTACACAGTTCGAAATGAACGAAGTCCTCGCAATAGTAGCAACATAACCTGTAGTAGTAATTTAAGTCAAAATAATGCAAATAATAACAATTGCAACATAATGGGTCCTAGTGATGATGAAAGactgatgatgataaaaaattcCAGAAATATGAATATGTCTTCAAATTCTGCTAATTCAGCAGCTGTTTCAAAAATGAATCGACAGCAATTAATGGATAATTGTTGCTCAAGGGATAATTATCTGAATGTTGATAATGCAGGACTAAACTCTAGACAAACTTCTGTTGCAGTAACAATGCCTACAACATCAATTAACAATAATTTAAGTAACAATAATACAAGACTGTTTGACGATAACCCAATGGTTAGCATACCAATATCACAACACCAGCATAGCAATCGAAGTATGTTGATGAATCCAAGTTCTGGTATGGTGGGTACAAATAAGCCAATCGCAATAATTGCAGCCACGAGAAATTTAGATTCGTCCGCAATGCATGACGGTGGAAAGATGTTGATGAGTAATAGTGTCGCTAATAATGCGTACGGTAGTTTCAACACAAACAGTGGTGGTATGATGATGGTAGAAAATCGAAATAGTCGAGTTCCTACACCTCATCAagaacaaatacaaaatttacgttTAACACCGAATCGATCAAATAGTGGTGGCGGTCATTTGCAATCACCAAAATTAGATGATCGTACATCGATGGCTGCATCACCAGTTAAAAATGTGATAAAACATGAAACATCAGCTATAAAATCTGAATTTATGAATGATGGTGATTATTATTTGGCAACAGGAACAAAATCATCTGCTTCTGGATTACcagatgaaaaatataatatttacggTAAAATGAGTAACAATGCTGCCTTAAAATCTGAACAAATGATGatcaaaaatgaatacaaaacaCTTGGTATACCGCCACTTGGCGGTAATGATTTAGATTCAAATAAAATTGGTGGCAATGTTGGTTGTGGTGGCGATGAAATGTCACCACCATCGGGATTAGGGATTATAACGAGTGGCTCATCGAAATCAATGAATAGTGACGACTATATTCCAAAGCAACGTCGTAAACGTAGTTGTAGTGCAAACAGTAGTCCATTCAAAGAGAAGAAACGAAAAAAAGGTGATCAGCCACAGcTTCAAAAAGATGTTATGGAACAATTACCGCCAACAAATCATGATCGGTTAACATTAGAAAAACCGATTGTACAGTCACAACAATCAAAAGTATACAAGTCATACTTTGAAAGAAGTACAGACGATATTTTGAGAGAAGATTATAGgGACCAAACAAGATTTTTGGAAGAAGCTAAAAGATTAAAACATGCTGCCGATAGGGAAACACATAATTTTGATCAAGTTATGTTATATTTAGAAgctgtattatattttttgttatctgGTGCTGCTATGGAAGAGGAGAAAGTAACAGAGAATGCAGCTTTCACAATGTACAAGGACACATTAATGTTAATCAAATTTATATCATCAAAATTTCGTAATCAACAAGCAGATAATCATAATAAGGTCGCTATATTAAgttTACGTTGTCAATCATTAATTtcatcaaaattatataaaatgaaacgtCATGAATGTAAAGAAGTTCAAAAAATTGttcatgatttttttaaaactgGTGGTAAAGGTTCTACAGAATTGATAAATGGTAATACACCATCATCAATATCACCAACGAATTCTGTATGCTCTcaggGGTCCGGTTCTAATACTCCACCATGTCAAATGGTTCCGATTCAGATATATGCAGCtcttcaaaaacaaaatcaatactTTAATTATTTAACGAGTTGTCATGATTTATGGGAACAAGCTGATAGTTTAGTAAGACGAGGCAATCATacagattTCTTTATTGCATTAGATCATGAGAATGGACCTCTAACGTTACATAGCTCCATATAcgatgtatttaaatatgttcAGGCTggtttacaaaaattaaaagaaatgtaa
- Protein Sequence
- MEDYERMERRERDKLTRQQVQNPDPEIPLFPAPVRLSPSEGDHRIQSKLGDFSLVKGLIEQKRCIGISTSAQRQLQIGSSSSASSSSSSSSSSSAIQQSSSTSASLSSSNCSVIPPIQHNRNLQLQQALSSHHYHQSQQQQQQLQQLQQQQQQQQQQQQQQQSQQPQHPPPSRVPYMKQITDNKPAYNGRGGYPGQSMKHDIHSVGGGGGIIAPPTKGSMLPPPQSQPMLSNGGRLNDKLSSIPPLPNGRFQPPLPKPPKFNAESESSNPLSSLAVNGDIENILKEMVVPSYTPLTEIAATPRKEVECKFTFGQPNPHKYADFPPPFKAPLSNKLKEKIVPTKPTDLEKDLDLSDSDEERKKQTSVDQTSPANDASSESSESVSSESGSEESNGTKGETSNHTTKWTLGRFFKQPQAQNEITQPSPNANSQSIPEQTIKNEPIQNIEDDNLSSPSTSSLSYRNTNNISSQHLQQQQQQKTTSAKIISPADVKKENMQSDSLFPISNKTSSVTGNNNSQQQKPGQIKSEAFDKSPSTSPENIQQSSCTVESDQSATIFAAAEEYSCIKGLSELSSSSGEEEDVEENKKPKKKRTPSKKPQKRKRKLPKDKKTVNTAIASSDEEEFSSALSSAGSRSKSIDTEKKPRSGRGRPRKNPIAVTAALATVTPAPTKATAPSTKKEPVKKQAVRRRVSRQNSNVKSRELLDTTNSTSDDDDDGSPPSVAAPVRRSPLPPPKPPEPTASSRILLQIQKRSSPSCAVKSNEDDEDDDDDDDDDDDDDDDDENSSTSSPPSQEDRLSVSPRITPASRQQQQQQHISPGRRSQPRSKLSSSSSSSSSSSSSESNSDSDSEDSIEDRTHSRDDKLTSREDKKIRDKTKSDKNKSDTLRKLFYGLNKGEGGAKKQGQVLIVDQSEESQNQSSYHHEKILSPIGFKCSSSSSVETTQTTSINSNNNTNSNTNNANILTTKSPRRTPIRNSNNSNNNNNHNNNSNCNNMIRTPDRSSNITPTSITSNKHQMLNNNSSNSSSNFNLGNLNENRCNTPIMPPIICRIELSRLKHIPPEWYHNYTVRNERSPRNSSNITCSSNLSQNNANNNNCNIMGPSDDERLMMIKNSRNMNMSSNSANSAAVSKMNRQQLMDNCCSRDNYLNVDNAGLNSRQTSVAVTMPTTSINNNLSNNNTRLFDDNPMVSIPISQHQHSNRSMLMNPSSGMVGTNKPIAIIAATRNLDSSAMHDGGKMLMSNSVANNAYGSFNTNSGGMMMVENRNSRVPTPHQEQIQNLRLTPNRSNSGGGHLQSPKLDDRTSMAASPVKNVIKHETSAIKSEFMNDGDYYLATGTKSSASGLPDEKYNIYGKMSNNAALKSEQMMIKNEYKTLGIPPLGGNDLDSNKIGGNVGCGGDEMSPPSGLGIITSGSSKSMNSDDYIPKQRRKRSCSANSSPFKEKKRKKGDQPQLQKDVMEQLPPTNHDRLTLEKPIVQSQQSKVYKSYFERSTDDILREDYRDQTRFLEEAKRLKHAADRETHNFDQVMLYLEAVLYFLLSGAAMEEEKVTENAAFTMYKDTLMLIKFISSKFRNQQADNHNKVAILSLRCQSLISSKLYKMKRHECKEVQKIVHDFFKTGGKGSTELINGNTPSSISPTNSVCSQGSGSNTPPCQMVPIQIYAALQKQNQYFNYLTSCHDLWEQADSLVRRGNHTDFFIALDHENGPLTLHSSIYDVFKYVQAGLQKLKEM
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -