Pvib018976.1
Basic Information
- Insect
- Pseudococcus viburni
- Gene Symbol
- lilli
- Assembly
- GCA_033439095.1
- Location
- JAVBJF010000401.1:297477-307394[+]
Transcription Factor Domain
- TF Family
- AF-4
- Domain
- AF-4 domain
- PFAM
- PF05110
- TF Group
- Unclassified Structure
- Description
- This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 9 1.5e-08 0.00037 20.3 17.2 4 235 21 251 20 281 0.45 2 9 9.5e-07 0.024 14.4 0.1 346 382 282 319 269 346 0.81 3 9 0.013 3.2e+02 0.8 36.0 430 513 491 604 405 605 0.70 4 9 1 2.5e+04 -10.0 11.3 136 481 695 719 613 741 0.45 5 9 0.26 6.6e+03 -3.6 12.7 120 240 901 1015 849 1049 0.29 6 9 0.089 2.2e+03 -2.0 1.7 435 474 1073 1110 1062 1151 0.63 7 9 0.061 1.5e+03 -1.5 0.6 189 216 1312 1339 1242 1384 0.63 8 9 1 2.5e+04 -8.2 13.3 100 243 1439 1581 1397 1615 0.54 9 9 1 2.5e+04 -14.4 18.4 111 184 1705 1779 1619 1810 0.55
Sequence Information
- Coding Sequence
- ATGAAAGGCGGCAATTATTCGATGAAAACGGCTGCGGATATAAGTGCGAAAGCGTGTTTTGAACGCGAACGTTTAAGGGAACGGGAAAGAGAAGCTCGAGCTCAAATGACTTACGAAGCAGATCAACAACGGATGTCTCAAGATGTAACGAATTCGGCGCCGTTGTTCGGGGAAATTGTTCGCGTGAATCCGAAATCCAACGATAAAGAAAGGCAGCAAATCGAACGTAAATTGGGCCGATTCGAAGACGTTAAACACTTGTTAGCGGATCAAGACGTAACAAATCTTTTCGGCGTCGATGGGCAGCCACCGCCCAGTCCTGCGCCTAATGTCACTTCTGGCTCGTCGTCGATCGCCAGTcacgaatttaaaaaacccAGCGCTTGCCTTCAATCCCATTCGACGAAATCGTCCCACCATCATCACCACCATCATCACCATTCTAGCAGCACTCAAAGGAACGCGTTTGTCAAACCTAACGATGGGAAATTGGCGTACGCGAGTCGCGGCAGTTTTTACGGCGGCCCGTCGTCCAAACATTGCAACGTAACTCCAACGTTGTCGGTCAAGGCAGGTGCGAGTTCGGCGACCAACTCGTTATTCGGCGTCGGCGGTAGTGTCGGCGCTAACAGTAGTAGCAGCGGCAGCAACAGCTCTTCCACGATGAGCGCGGCGGCGAATTTTCATTACGGTTCGACGTCGAATAAAGCTTCGTCTTCGACTCACGAGTACGCGGCTCGCATTCAGCAGTCGGCGGCcaaaaatttgcccaaaataAATTGCGAGCTGCGGGATTCTACGCGAGGTACGGATTGCAGCGACGATAGCGCGTCCTCCGTTTCCCTAACCAACGTGGAAAATATCCTGGATGAAATGACATCGGCCCTTAGAACGCCACTTACCGCTATAGCAGCTACGCCACGCAAAGAAGTAGAATCTAAATTCACGTTCAATCCGATATCTGGAAAGGTTGAGTCTTTACTGCCTTTGTCAAGAACGTATGCCGATAAAGGCAACAAAACAAACGCTATGAACAATGTGTTGGGATCAGGCGGTGGTGGCGGCGGCGGCCGTGCTCCTTTAATTTCTCCGGCTTTTCGAAGCGACGAATCGACTACTTTATTGCAGCCGTTCAACTCTACTCAGTTGTCGGCAGCTAGCGGTGGTAATGCAGTCCTCAGAAACGGAGCCAGCATCGGCGTCGGTTACGCTTACTCGGCGGGGTCTGTCGCCGAGAGGATTTCTTCATCTGTCGCGAACAAGGAGAAAAGCGATTCGACAACGTTGCCATGTTTGAAATCAGACCTCAGTTTGTCTGAAGATACGGACGATGGCGATGGAGTAGGGAATGATAATGACGAAGACGACGATTGCGATGACGACGAAGATGACGAaggcgacgacgacgatgacgaaaACAGTGATGATGGCAACGGTGATGGcgatgacgacgacgaagacgacgacgacgaagaagAAGAAACGCGAAGACGCCGTCAGCGTGCGAGAAGTAATATCGCCGACGAAAAGGATAGCAACAAGTACAAGATCACGTCGGGCACGGTTTTCGCATCGAAAAAAGAACCCGCTTCGTCGTCAGCTCCTCCCAACATGGCGGAAGATTTGCGAAAATGCGAAGCTGCGGCCACCAACGAGGTCAGAAATTCATCGTCTAGCGACGACAGCGACTCGGATTCTGGTTCGGAGAGCGAAGATTCGAGCAGCGATTCGCCGTCGCCGCCGTCGTTACCGCTGCCGGTGGTTACCGAACAGGAAGAAAGCAAACATCGctggaatttgaagaatttcctGCCGCCGTCCGCCCATAATTCGCAGCCGTGTCGTCGTCAGCAGAATTGCGATTCGGATAGTTCGACAGGTGACGGAGGCGGCCCGCTGAGAAGAGCAGCCAAGTCCAGAGTGACGTCGTGCAGTACGTCCGATTCGGATAACGACGGCGCCAAGTCGTCGCGCAAACTCGCCGTCGACAGTGGCAAGACGATGGCTGCCGCGGCTCCCGCTCTCGCGGCAGCCGCGTGTAGCGTCAACGTCGAATCCGTCGCAGGCGCCGACAATAATGGCGCGCGTCCGAAGTATCGATTCGGATCGTCGTCTTATTCGCAGTCGTCGGAAGAGGAGGACGATGAAGAAGAGGAAGATAACGATCGTCGCGATGACGACGATTCGGACCGAGCTTCGTCTTACGTCACTTCGAATTCCGCTCGAAACGGCGGCAGTATACGCGGCATTTCGGTCGGCGGTGGTGGCAACGCCGCGGTGGTGACCGCGTCTAATAACAACGGTTCCGGGTGTGCTACGTCCGTCGGTTGCGGCAGCATGGTAGTCACTTCGTCAATCGCCAATTCATCGAATTGCGGCAACAATTTCTCAACCAACGTCGCCGGCGCTTGCAACAACAGCGGCAGTGGCGGCGCTAGCGTGACAGTGGTGGGCGTGGTGAGACCGCACCAAAGGAAACCACCGTCGCCGCAGATTTTCCATCACAACGGCAGAAATCGCAACGTTACCGGTGGCTTACGCGATTTATCTCCCGCTACCAAAGACGAACGGTATACGGCGTTCGTTCAACAGCGGGCTTTGCATTTAGGCGCCGCTAGCCAGGCGCCGGAGCCGCAATGCGATTTATCCGCCAGCCAGGTGGCTGACAGCGGCATCGTATTAAAAAATACCAGCAGAGCGTCGTTGATTAACGATCACTATCAACAGACTCGTAACAACAGCAGTACCAGCAGCACCAGCAccagcaacaacaacaacaacaacaaccgCAACAACGCCAATAATAAATGTACCTCAACCGGCGGTAGCAGTCGCATCGTTACCTCCGCCGCCGTTAATAATAATATcaaagacgacgacgacgaacgcGATCGCGACAACCGTGACTACGATGTCAGATTGAGCGCGCTTGGATCAAGCGGCGGCAAGTCCGGTTTGGCGCCGAGTAAAACGAACGCTTCCTCTAAAATACGCCGCAAGTCGAAATCACCAGCGTTGGCATCCTCCAACGAATCTCCGGTCAAGAAAAAAAGAGGACGCAAGCGAACTATCAAAGTGCCCGAGTTATCGGAGAGCGACGAAGACACACCCAAGGTAAGCGAAAGTACATATCTTCCGCGTATTGCGGCAGTATTTGGCTATGTTGGCATTTCGTATTATTCACGTACGGTTCGTATGCGACCAGACGAAcgaACAAAAACGGCTCCGGAAAAGAAAAGGCCAGGTCGACCACCTCTGAAGCGTAACGATGCTTCCGCCGTCGAAGACATGGACTggaattcgaattcgaattcaaaTTCGAGTTCCAGGCGCAAAATGGCGGCAGATAAGTTTGCCAGTTTGACGGCTAACGTTTGGTGCGATCGTGAAAGACGTAGAAGCAGTCTACGAATGAGCAGTTTCACAACCGTGGATTCGGACAGCGATACGGAAATTTTAACGGCTACCGTTAAGCCTACAGCTCGCGTTATGCCGGCCGTCAAGAGAAAAGACGGAGGaagcggtggcggtggcggtggcagTGGCGGCGTGCGCGCGAAACGCGATTCCAGTTGCGAATCGATGAGAAACGACAACGATTTCAACGCTATTTCAGACGGTAGTATCAAGGGTGGTAGTGGTAGTGGTAGCGCCGGCGGCGGCGGCAATGGCGACGGCAACGGTGGTGGTAGTGGTGGTGTTGCCGCCATGATTAAAATGCAAGAAAGTCCGCCAAAATTGGATGTCGAATGTATCGCCGTTCAAGATAAGAAGAAAAGCGACACATTGAGGAAGCTGTTCTCGCGCCGCGAAGAGGGTGGCGGCAAAACAGGCGGCAAAGGAAAAGGTGGCAAAGGAAAGTGCGGCGTCATTGTGATGGAATCCGAAGTCGAACGTAAACTTTTACAACGTTCGTCGGTCATTAGTCCGACCACAGCGACGTCGGTGTCGGTGCCAGTGTCTGCGTCCGCGTCGTCCAATATCGCCTCCAGCTCGCCGGCCAAAAGTCAGGATCGCGTTCCGGATATGACGCACGTCGCCATCTCTTCGCCAAATTTACGATCAAACAACGGCGACAACGACGGTGGCAGATATTCCGAAATGTCGCGTATACCTTCCGCCATGAATAGCGGCGGCAGCGGTGGCGAACAGATTCCTAAACTGACTTACAACGAAGCCGGCAAACCATCGTTAATGTGTAAAATTGATTTGTCCAAGATTCCGTATATTGTAGCCAAGAAACGATCGGaagaaattcgaattaaaaGCGAATTATCGGATACTAGAcaatcgtcgtcgtcgtcgtcgtcgtcattgACCAACTCCATGCTGGACCACGTCGGTGGCGGCAGCGTAAGCGTCGAAGCCGTTTCGGCGACGAGCGAAGCTGCGTCCTCCGTTGATATGGGTAATGTGCCTTTGATCGAACCTCTATCGGATCGCCGTCGCAACAACTGCAATAGCAGCGGTGTCTCGGCGGGAGCGGTGACGATGAGCGTCGTCAGCCAGGGGAGTAACGGCGCGAACGCAAATACGGTACCGCCGATTGCCGTCGAACACGACGACTTGTATCGGGCACGCGATAACGTCGGCGGCAAGCAATACCAAAGGATAGCTGCGCCGCCAACGTCGCCGTTGTCGCGCAAGAAGCATTCGTCTACTAAGAAAACGAAGAATAGCAAACGCAAACATTTGGCCAACAAAACGCAAGCGATGATGACGACGCCGATGACACCGCCAACGTCGACGACAACGGCGGGCGTCGTTCCAGCCGTCGAAGAGGAAAATGCCGCCGTCGTCGCTTCGGTATTCGGAGCCGTTTCCATCGTCCATACGCCTTCCGTGGATACTACGGACATGGATTCTCCGTTATCTAGCGATTCGGAAACTAGAAAGTTATCTGCCGCCACCGCCGCTAGCCTCGCACATTCTACCGGTAAAGGCAAGAATAGATCGGCGAGCGCGGCGCCGATTGGCGTCCCACTCGCTTCTGCCAACGCTTCCGATTCCATTCGCAGCAATTTGATGACGACATGTTATCCTATCGGGAGTTTGAACGATGTCCGTAGTCAAAATGCGAACGCCATCGTCGATCAGAGTTCCGCCGACGACGAATCGTGGAATAGTTCGGAAAGCAGTTGCAGTAACTGTTCGGCGTCATCTGCTTCCTGTCGCGGCGGCGGTGGTGGTGGCGGCGTCAACGACAAATACTCGAAAGTTGGCAAACGTTCGTcgtcgcagtcgcagtcgcagtATCGTCGTAGTTGCGGCGGTaagaagagaaagaaaaaattaagaaacgagCATTCGCAACGTCATTCAGCTTTGAGTGATTGCGGCGGTGACGGCAGTGTCGGCAGCGCGTCCAATTTGATTCACATACCGCCCAACGCCGCCGGCGGTCACGAAAGAAGTAGCGTTTCGGATCCGCCGCCGGCGACAACGTTCAACGGTTATCACCAAGTCGCGATAAGCGATCTGGTCCGGCCTACAGTGCCTTCGCATCACGGTATTTACTATTCGTATCTGGAACATAAAGCCTCCGAAGAAATTAATTCCGAGTCGGACAACGTCAGTCCGAATTTATATTTGATGGAAGCGAAACGTCTGAAACACGCCGCCGATACGGAAAGGGATTCCGTTGCGCAAGGAATGCAATATTTGGAAGCGATTCTATCGTTCGTGTTAACCGGGCACGTTTTGGAACGTAAAAATCAAATGGATACAGCGTTCAACATGTACAGCGAAACTTTAAAGCTTATCGTCTACATCTCGAAGAAATTTCGCACTCTGTGCACCAACGTTACGCCATCCAGCATACCGAATAAAATCGCCATATTGAGtttgcGATGCGAATCCATTTTGAACTTGAAGCTGTATACGATGCGAGAAAATGAAGTTAGAGAAGTTCATAGCGCGGTCGCCGAATATTTCAGCAAGCCCGTTTCGATCGATGGATGCCACGGAATTCCTAGCCCGCATTCGCCAACTCCTTCGCCGGCCAGTTCTATCGGAAGTAATTCGTCCGGTTACAGTAGCGGCGATCGCAGACCAGTCGCTGCATCCCAGTGCGTTACCGTTCCTCTAAACCTTCACGTCGCCTTGCAGAAGCAGCATAATTTCTATTCAAACCTGATCGCTTCTTTCAGCTACTGGGAAGAAGCTGACAAACTTATATTTTCCGGTGGAAACAAAGATTTCTTCATCGAATTGGACAGGTTTTGCGGTCCACTCACTTTGCATAGTTCTTTGAAAGATTTGGTGCATTACGTGCGCGTCGGTATACAGAGATTGAAAGCCATTATAAGTCACAATCACGgcgtgtaa
- Protein Sequence
- MKGGNYSMKTAADISAKACFERERLREREREARAQMTYEADQQRMSQDVTNSAPLFGEIVRVNPKSNDKERQQIERKLGRFEDVKHLLADQDVTNLFGVDGQPPPSPAPNVTSGSSSIASHEFKKPSACLQSHSTKSSHHHHHHHHHSSSTQRNAFVKPNDGKLAYASRGSFYGGPSSKHCNVTPTLSVKAGASSATNSLFGVGGSVGANSSSSGSNSSSTMSAAANFHYGSTSNKASSSTHEYAARIQQSAAKNLPKINCELRDSTRGTDCSDDSASSVSLTNVENILDEMTSALRTPLTAIAATPRKEVESKFTFNPISGKVESLLPLSRTYADKGNKTNAMNNVLGSGGGGGGGRAPLISPAFRSDESTTLLQPFNSTQLSAASGGNAVLRNGASIGVGYAYSAGSVAERISSSVANKEKSDSTTLPCLKSDLSLSEDTDDGDGVGNDNDEDDDCDDDEDDEGDDDDDENSDDGNGDGDDDDEDDDDEEEETRRRRQRARSNIADEKDSNKYKITSGTVFASKKEPASSSAPPNMAEDLRKCEAAATNEVRNSSSSDDSDSDSGSESEDSSSDSPSPPSLPLPVVTEQEESKHRWNLKNFLPPSAHNSQPCRRQQNCDSDSSTGDGGGPLRRAAKSRVTSCSTSDSDNDGAKSSRKLAVDSGKTMAAAAPALAAAACSVNVESVAGADNNGARPKYRFGSSSYSQSSEEEDDEEEEDNDRRDDDDSDRASSYVTSNSARNGGSIRGISVGGGGNAAVVTASNNNGSGCATSVGCGSMVVTSSIANSSNCGNNFSTNVAGACNNSGSGGASVTVVGVVRPHQRKPPSPQIFHHNGRNRNVTGGLRDLSPATKDERYTAFVQQRALHLGAASQAPEPQCDLSASQVADSGIVLKNTSRASLINDHYQQTRNNSSTSSTSTSNNNNNNNRNNANNKCTSTGGSSRIVTSAAVNNNIKDDDDERDRDNRDYDVRLSALGSSGGKSGLAPSKTNASSKIRRKSKSPALASSNESPVKKKRGRKRTIKVPELSESDEDTPKVSESTYLPRIAAVFGYVGISYYSRTVRMRPDERTKTAPEKKRPGRPPLKRNDASAVEDMDWNSNSNSNSSSRRKMAADKFASLTANVWCDRERRRSSLRMSSFTTVDSDSDTEILTATVKPTARVMPAVKRKDGGSGGGGGGSGGVRAKRDSSCESMRNDNDFNAISDGSIKGGSGSGSAGGGGNGDGNGGGSGGVAAMIKMQESPPKLDVECIAVQDKKKSDTLRKLFSRREEGGGKTGGKGKGGKGKCGVIVMESEVERKLLQRSSVISPTTATSVSVPVSASASSNIASSSPAKSQDRVPDMTHVAISSPNLRSNNGDNDGGRYSEMSRIPSAMNSGGSGGEQIPKLTYNEAGKPSLMCKIDLSKIPYIVAKKRSEEIRIKSELSDTRQSSSSSSSSLTNSMLDHVGGGSVSVEAVSATSEAASSVDMGNVPLIEPLSDRRRNNCNSSGVSAGAVTMSVVSQGSNGANANTVPPIAVEHDDLYRARDNVGGKQYQRIAAPPTSPLSRKKHSSTKKTKNSKRKHLANKTQAMMTTPMTPPTSTTTAGVVPAVEEENAAVVASVFGAVSIVHTPSVDTTDMDSPLSSDSETRKLSAATAASLAHSTGKGKNRSASAAPIGVPLASANASDSIRSNLMTTCYPIGSLNDVRSQNANAIVDQSSADDESWNSSESSCSNCSASSASCRGGGGGGGVNDKYSKVGKRSSSQSQSQYRRSCGGKKRKKKLRNEHSQRHSALSDCGGDGSVGSASNLIHIPPNAAGGHERSSVSDPPPATTFNGYHQVAISDLVRPTVPSHHGIYYSYLEHKASEEINSESDNVSPNLYLMEAKRLKHAADTERDSVAQGMQYLEAILSFVLTGHVLERKNQMDTAFNMYSETLKLIVYISKKFRTLCTNVTPSSIPNKIAILSLRCESILNLKLYTMRENEVREVHSAVAEYFSKPVSIDGCHGIPSPHSPTPSPASSIGSNSSGYSSGDRRPVAASQCVTVPLNLHVALQKQHNFYSNLIASFSYWEEADKLIFSGGNKDFFIELDRFCGPLTLHSSLKDLVHYVRVGIQRLKAIISHNHGV
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -