Pmar006858.1
Basic Information
- Insect
- Paracoccus marginatus
- Gene Symbol
- lilli
- Assembly
- GCA_900065295.1
- Location
- FIZT01041768.1:3254-12809[+]
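
The Location string above packs contig, coordinates and strand into one field. A minimal sketch of splitting it apart follows. It assumes the layout `contig:start-end[strand]` seen in this record and treats the coordinates as 1-based and inclusive, which the record does not state. The function name `parse_location` is hypothetical.

```python
import re

# Pattern for 'contig:start-end[strand]'; the layout is inferred from this record only.
LOCATION_RE = re.compile(
    r"(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)

def parse_location(loc: str) -> dict:
    """Split a location string into contig, start, end, strand and span."""
    m = LOCATION_RE.fullmatch(loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    start, end = int(m["start"]), int(m["end"])
    return {
        "contig": m["contig"],
        "start": start,
        "end": end,
        "strand": m["strand"],
        # Assumption: 1-based inclusive coordinates, so both endpoints count.
        "span": end - start + 1,
    }

loc = parse_location("FIZT01041768.1:3254-12809[+]")
print(loc["contig"], loc["strand"], loc["span"])
```

Under that assumption the gene model spans 9556 bp on the plus strand of the contig.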
Transcription Factor Domain
- TF Family
- AF-4
- Domain
- AF-4 domain
- PFAM
- PF05110
- TF Group
- Unclassified Structure
- Description
- This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue, Lilliputian, which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
- Hmmscan Out
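| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|----|----------|----------|-------|------|----------------|--------------|----------------|--------------|----------------|--------------|-----|
| 1 | 10 | 3.5e-12 | 7.4e-08 | 32.3 | 3.5 | 4 | 133 | 21 | 153 | 19 | 189 | 0.62 |
| 2 | 10 | 1.2e-06 | 0.025 | 14.0 | 0.1 | 338 | 381 | 194 | 238 | 177 | 264 | 0.86 |
| 3 | 10 | 1 | 2.1e+04 | -7.4 | 12.2 | 163 | 246 | 357 | 438 | 283 | 450 | 0.39 |
| 4 | 10 | 2.6e-07 | 0.0055 | 16.2 | 21.2 | 436 | 513 | 439 | 515 | 430 | 516 | 0.77 |
| 5 | 10 | 1 | 2.1e+04 | -6.5 | 10.1 | 130 | 220 | 568 | 658 | 519 | 706 | 0.44 |
| 6 | 10 | 1 | 2.1e+04 | -7.9 | 15.3 | 117 | 244 | 808 | 935 | 775 | 945 | 0.38 |
| 7 | 10 | 0.0027 | 56 | 3.0 | 0.6 | 370 | 486 | 939 | 1052 | 934 | 1064 | 0.86 |
| 8 | 10 | 1 | 2.1e+04 | -6.3 | 15.1 | 114 | 265 | 1296 | 1442 | 1260 | 1457 | 0.38 |
| 9 | 10 | 1 | 2.1e+04 | -6.3 | 8.5 | 444 | 480 | 1459 | 1495 | 1452 | 1515 | 0.68 |
| 10 | 10 | 0.46 | 9.6e+03 | -4.4 | 0.5 | 447 | 478 | 1761 | 1792 | 1733 | 1804 | 0.63 |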
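
Most of the ten per-domain hits above have very large i-Evalues, so it can help to screen them before interpreting the domain architecture. The sketch below parses the rows exactly as listed in this record and keeps hits below an independent E-value cutoff. The cutoff of 0.01 and the names `DomainHit`, `parse_hits` and `I_EVALUE_CUTOFF` are illustrative assumptions, not values or identifiers used by the database.

```python
from typing import List, NamedTuple

class DomainHit(NamedTuple):
    index: int       # domain number (column '#')
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float     # bit score
    ali_from: int    # alignment start on the protein
    ali_to: int      # alignment end on the protein

# Rows copied from the Hmmscan Out field of this record (13 columns each).
ROWS = """\
1 10 3.5e-12 7.4e-08 32.3 3.5 4 133 21 153 19 189 0.62
2 10 1.2e-06 0.025 14.0 0.1 338 381 194 238 177 264 0.86
3 10 1 2.1e+04 -7.4 12.2 163 246 357 438 283 450 0.39
4 10 2.6e-07 0.0055 16.2 21.2 436 513 439 515 430 516 0.77
5 10 1 2.1e+04 -6.5 10.1 130 220 568 658 519 706 0.44
6 10 1 2.1e+04 -7.9 15.3 117 244 808 935 775 945 0.38
7 10 0.0027 56 3.0 0.6 370 486 939 1052 934 1064 0.86
8 10 1 2.1e+04 -6.3 15.1 114 265 1296 1442 1260 1457 0.38
9 10 1 2.1e+04 -6.3 8.5 444 480 1459 1495 1452 1515 0.68
10 10 0.46 9.6e+03 -4.4 0.5 447 478 1761 1792 1733 1804 0.63
"""

def parse_hits(text: str) -> List[DomainHit]:
    """Turn whitespace-separated domain rows into DomainHit records."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13:  # skip blank or malformed lines
            continue
        hits.append(DomainHit(
            index=int(fields[0]),
            c_evalue=float(fields[2]),
            i_evalue=float(fields[3]),
            score=float(fields[4]),
            ali_from=int(fields[8]),
            ali_to=int(fields[9]),
        ))
    return hits

I_EVALUE_CUTOFF = 0.01  # hypothetical threshold chosen for illustration

hits = parse_hits(ROWS)
significant = [h for h in hits if h.i_evalue < I_EVALUE_CUTOFF]
for h in significant:
    print(f"domain {h.index}: residues {h.ali_from}-{h.ali_to}, i-Evalue {h.i_evalue:g}")
```

With this cutoff only domains 1 and 4 pass. Relaxing it to 0.05 would also admit domain 2.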
Sequence Information
- Coding Sequence
- ATGAAAGGCGGTGGTTATTCGATGAAGGCAGTTGCGGATTTGAGTTCGAAATCTAGTGTCGATCGAGAACGTTTGAGAGAACGCGAAAGAGAAGCTCGAGCCCAGATGACTTTCGAAGCAGATCAGCGAGCGTCTCAAGATGTTATAAATTCGGCGCCGCTGTTCGGCGAAATAGTTCGCGTAAATCCAAAATCCAACGACAAAGAAAGACAGCAAATCGAACGTAAACTGGGCCGATTTGAAGACGTGAAACATCTATTGGCCGATCAAGATGTGACGAATCTTTTCGGCGTAGACGGCCAACCTCCGCCCAGTCCAGCTCCAAATATTTCTTCAggttcgtcgtcttcgtcgtcgactGCTGGCcacgaattcaaaaaacccAACGCCTGTCTTCAGTCTCATCCGACGAAATCTTCacatcatcatcaccatcaccatcaccatcatcaTTCCGGCGCTCAAAGgAACGCGTTCGCTAAGCCCAGTGACGGAAAATTAGGGTACGCTAATCGCGGCAGTTCGTACACGACTCCGTCGACGAAACACCTACGAGACTCTTTGCGAGGTATGAGTTTTACGGACGATAATACGTCGGTTTCTCTAACTAACGTTGAAAATATCTTAGACGAAATGACGTCGGGAGTCCGTACGCCGTTGACAGCAATCGCAGCTACGCCACGAAAAGAAGTCGAGTCAAAGTTTACTTTCAATCCGATCTCCGGAaagGTCGAATCGTTACTGCCCATGCCGAGAATGGCTATCGATAAAGGCAACAAATGCAATGCCACGAATAACGTACTGGGATCTACTGGCagtggcggtggcggtggccGTAATTCGTTCATTTCTCCCGTTTTTCGTAACGACGACACGATGCCCTTCTCTTCCACTTCGTTGTCTGTGGCTAGCGTATCAAAAAATGTTGGTGGAGGATACGCTTATTCGACATCTGCGAACGACATGATTTCGTCGCCAGCAAGCAAAGAGAAAATCGATTCGACCGCTCTGCCATGTTTAAAATCGGATCTGAGTTTGTCCGAAGATACAGACGAAGGAGATGGAGTAGAGAACGGAAATGATTGCGACGATGAAGACGATGACGaggaggaagaagaagaagaagacgaggACGACGGCAACGAAGAGGAGGAGGACGacgaaaacatgaaaaatggcTCGCGAGAACAACGTCAATTGCTGCCGCAGCGTTCCAGAGGCAATGCGAccgacgaaaaagaaaataataagTTACATGTGATGCAGTGCAAGGCGATGGGCGGCGCGTCTTTCGCTTCCAAAAAAGAactatcatcatcatcatcttcTGCGGACCCTGCTAAAACAGAAGAGTTACGAAAATGCGAAGCTGCCGTTAGCGAGGGTAGAAATACGTCTTCCAGTGACGACAGTGACTCCGACTCCGGTTCCGGTTCAGAAAGCGAAGATTCGAGCAGTGACTCGCCTTCGCCGCCGTCTCTGCGTTTGCCGGTGGTGGCCGAGCAGGAAGAGAGCAAGCATAggtggaatttgaaaaatttcttaccgCCGTCCGCACACGAAAACTCGACGCCCGCGTCGACGCATAATTCGCAGGTAATGTATTGTTGTGCGCGATGCGAAAATGTCGCTTTTACACGCGAGACTATCACTTGTACGTTTACGTCTAATTCTGCTTCTGTGTTGGATCGCGTGCAGGCGCGTCGTcgtcaaaattgtgaaaaaccaaaagccaACGACTCGGACAGTTCGGTGGGCAGCGCGAGAAGAAATACAAGATTGGCCTCTTGCAGTACATCAGACTCGGACAACGGCAACCGCCACCGCAACGGCAGCGTAAGATCGTCCGCTAGTTCGCGTAAGCGTGCCATCGACAGTAATACGAAGGCTGGCTCGAATAAGGGCCTCGGCTCCGGTAACGTAAACGCCGGCCCAGTGAGTGTCGGCGGCGGTAATGTATCCGCTACCGACGACGCAGTCGGATCCAACGACGTCGAT
AACCGTCGAAAGTGCCGGCTGCGATCGTCGTCCTACTCGCAATCTTCTGACGATGATTCGGATCGAGCTTCGCCTTACGCCGGCGCGAATACCATGCGAAACAACAGCGGAGGCAGCAATTTGAACGGCATCACTGTGGGTAGCGGAGGATGCAGCAGCGGTATTAGCGGTAATGGTAACAATAGTAGTTGTTCGACGTCTGTCGGCTGCGCAAGTTTAGTGGCCAATTCGTTGAGTTGCGCGAATAACTTTGCCAGCAACGGCGGAGGCGCCAGTGGTAACGGTTGCGCAGCGACAAGTGGAAATGTCACCGTAGTGGGTGTTGTCAGACCGCATCAAAGAAAACCTCCTTCGCcgcaaatttttcaccttggTGCCAGAGGTCGTTCCTCAGCCGGCATGCGCGATTCGTCTCCGACGGCGGTAACCAACGACGAACGCTATTCCGCGTTCGTgcagcagcaacagcagcagcaacaacgtTCGCTACATTTGACTAGCGCAAGCGGAAATGCGCCAGAATCTCAATGCGATTTATCTGCCGGCCAACCCGGCAGTACGCACGCCAGTGTGGTCGCTAGTGGCAGCCGTGTCGCCGGCGGCAACCTCAAAGTTGCCATGCTGCAGTCGAACGACAAGTCAACTttgcaaacaaacaaaaccaGCGTTTCTTCTAAAGTTCGTCGTAAGTCGAAATCGCCGGCGCTAGCTTCCTCCAACGAATCTccggcgaagaaaaaaagaggacGCAAACGTACCATCAAAGTGCCAGAATTATCGGACAGCGACGAAGATACGCCCAAGACGAAAGCGGCGCCGGAAAAGAAGAGACCAGGTCGACCACCTCTCAAACGCAACGATACCGATGACTTGGATTGGAATTACAATTCTAGGCGTAAAATGACGGATAAATTTCCAAGCTCGGCGAACGTTTGGTGTGATCGCGAACGACGACGCAGTAGTTTACGAATGAGCAGTTTTACTACGGTGGATTCGGATAGCGAAATAGAAATACCGTCGGCGACGGTAAAACCAACGGCTCGCGTCATTCCTGctgtgaaaagaaaagatggcggcggcggcggcggcggtggcagCGGCAGCAGTAGCGTACGCAACAATCGTGATTCGAGTTGCGAATCGATGAGAAACGATAGCGGCGATTTCAACGCGATATCGGACAGCATCAAAAATATGGGTGGCAGCTGCGGCATTGTTAAAGCGTTAGAAAGTCCGCCCAAACTGGACGTCGAATGTATAGCTGTTCAAGATAAAAAGAAAAGCGACACTTTGCGAAAGTTGTTTTCGCGTCGTGAAGAAGGCGGTGGCAAAACAGGCGGCAAAGGAAAAGGTGGTAAAGGAAAGTGCGGCGTCATCGTTATGGAATCCGAAGTCGAACGAAAATTGTTACAGCGTTCGTCCGTCGTCAGTCCAGCATCGCCTTCTTCCAATTTGCCGCAGATGCAAAATGGCGTTCTCGATATGACGCCGCACGCGGCTATTTCTTCGCCTAGCTTGCAATCCAACGACGACTTATCCGGTTACAACGACAATATCGGCGGTGCTGGTGctggtggcggcggcggcggcggcggcggcggcggcaacGGCAACGGCAACAACAGCTGTATAGATGGAATGAGTCAAGTTGACGCCGCGGTCGATATGTTTCCCAAGTTGACTTACAACGAAGCTGGTAAACCCTCTTTGATGTGTAAAATCGATCTCAGTAGAATACCTTATATTGTGGCAAAGAAACGCTCCGAAGAAATTCGAATCAAATCCGAATTAGCGGACACCAGGCAAACGACAAACAACGTGCAAGACGGTAGCGGCGGCGCTGTCGCCGTCATTACGCTTGACGCGTCTTCCACGAATACCACTTCCAATGTGCCTTTGATCGAACCGGTATCGGATCGTCGCCGTCACGATAATGTGAAAGACGTCGCTGCGTCGATGATGACCGCTGAAAACAGCGACCTTCATCGCGATCAAAATAG
ACAACTGCATGAAAAGTCGCAGCCACCGACGTTGACACGTCCGCGTAAAAAACACGCGTCCAAGAAGACGAGAAGCGGCAAACGTAAGCATTACGCGGACGAAGCGCGCGACGTAACGATGGCGGCGGCGTCACCGGCTGCGGCGGCCGAGGACGATGATGGTACGTCGATATTTGCTAACGCGTCTACCGTGCATACGCCTTCGGTAGACACCACGGACATGGATTCGGCGTTGTCGAGCGATTCGGATACGCGAAAGGTAGCACGTTGCGCTaatcaaaatgcaaaaaagtcGCTAGCTGTACACGACTCGTCGCACGCAACCGATTCGTCCATCCCCAGCAACATGATCTCGTCATGTTACGCTGTGACTTTGAACGATGTTCAGCATCATGAGAATGCAGGCGCGGTAGCCGGTCAGAGTTCGGCGGACGACGAATCGTGGGACAGTTCGGAAAGCAGTTGCAGTAATTGTTCGGCATCGTCCACGCCTTGTCACGACTCGGTAGacaaatactcgaaaaatggCAAGCGTTCCTCTCGGTACCGCAGttgcggcggcggtggcgttaaatcgaaaaagaaaaagaagaaaaaaaagaagcttcGAAATGAGCAGTCGCCAAGATGTTCCACATCGTTGGGTGATTGCGCAGGCAGTATTGGTGGCGGTGGCGAcggcggtggcggtggtgTTGGCGACCATCTGCCCAACACGAGCCACGAAAGAAGCATTATGACGGAGACGCCAACCACGACAACGTTCAACGGTTATCATCCAGTTGTCATCGGCGATCTAGTTCGACCCACAGTGCCTACGCATCACGGCATTTACTACTCGTACCTCGAACACAAAGCTTCCGAAGACTTGAATTCCGAATCGGACAACGTTAGTCCGAATTTGTACTTGATGGAAGCGAAACGTCTGAAACATGCCGCCGATACGGAAAGAGATTCGGTCGCGCAAGGAATGCAATACTTGGAGGCTATCTTATCGTTCGTGCTGACCGGCCACGTTTTAGAACGTAAAAATCAAATGGATACCGCATTTAATATGTACAGCGAAACGTTGAAACTTATCGTgtatatttcaaaaaaatttcgcacttTATGCACGAATGCGGCGCAATCCAGTATACCAAATAAAATCGCCATTTTGAGTTTGCGGTGCGAATCTATTCTCAACTTGAAGCTTTATACGATGCGAGAAAACGAAGTGAAAGAAGTACACAACGCAGTTGCCGAATATTTTAACAAGCATGTTGCGGCGGAAGCGTGCTTAATTTCTAACGCGGCTGGAGTACCCAGTCCGCATTCGCCGACGCCTTCGCCCGCCAGTTCGGTCGGAAGTAATTCATCCGGTTATAGTACCGGCGATCGTAGGTCTGTCGCCATGTCTCAGTGTATCGCTGTGCCCTTCAATTTGCACGTGGCGTTGCAAAAGCAGCATAATTTCTATTCCAACTTGATCGCGTCCTTCAGCTATTGGGAAGAAGCGGATAAACTTATATTTTCTGGCGGAAATAaggatTTCTTCGTCGAATTGGACAGATTTTGCGGCCCGCTAACGTTGCATAGTTCGTTGAAAGATTTGGTGCATTACGTGCGCGTTGGTATTCAACGATTGAAAGCAATCATTAGTCATAATCACGGCGTATGA
- Protein Sequence
- MKGGGYSMKAVADLSSKSSVDRERLREREREARAQMTFEADQRASQDVINSAPLFGEIVRVNPKSNDKERQQIERKLGRFEDVKHLLADQDVTNLFGVDGQPPPSPAPNISSGSSSSSSTAGHEFKKPNACLQSHPTKSSHHHHHHHHHHHSGAQRNAFAKPSDGKLGYANRGSSYTTPSTKHLRDSLRGMSFTDDNTSVSLTNVENILDEMTSGVRTPLTAIAATPRKEVESKFTFNPISGKVESLLPMPRMAIDKGNKCNATNNVLGSTGSGGGGGRNSFISPVFRNDDTMPFSSTSLSVASVSKNVGGGYAYSTSANDMISSPASKEKIDSTALPCLKSDLSLSEDTDEGDGVENGNDCDDEDDDEEEEEEEDEDDGNEEEEDDENMKNGSREQRQLLPQRSRGNATDEKENNKLHVMQCKAMGGASFASKKELSSSSSSADPAKTEELRKCEAAVSEGRNTSSSDDSDSDSGSGSESEDSSSDSPSPPSLRLPVVAEQEESKHRWNLKNFLPPSAHENSTPASTHNSQVMYCCARCENVAFTRETITCTFTSNSASVLDRVQARRRQNCEKPKANDSDSSVGSARRNTRLASCSTSDSDNGNRHRNGSVRSSASSRKRAIDSNTKAGSNKGLGSGNVNAGPVSVGGGNVSATDDAVGSNDVDNRRKCRLRSSSYSQSSDDDSDRASPYAGANTMRNNSGGSNLNGITVGSGGCSSGISGNGNNSSCSTSVGCASLVANSLSCANNFASNGGGASGNGCAATSGNVTVVGVVRPHQRKPPSPQIFHLGARGRSSAGMRDSSPTAVTNDERYSAFVQQQQQQQQRSLHLTSASGNAPESQCDLSAGQPGSTHASVVASGSRVAGGNLKVAMLQSNDKSTLQTNKTSVSSKVRRKSKSPALASSNESPAKKKRGRKRTIKVPELSDSDEDTPKTKAAPEKKRPGRPPLKRNDTDDLDWNYNSRRKMTDKFPSSANVWCDRERRRSSLRMSSFTTVDSDSEIEIPSATVKPTARVIPAVKRKDGGGGGGGGSGSSSVRNNRDSSCESMRNDSGDFNAISDSIKNMGGSCGIVKALESPPKLDVECIAVQDKKKSDTLRKLFSRREEGGGKTGGKGKGGKGKCGVIVMESEVERKLLQRSSVVSPASPSSNLPQMQNGVLDMTPHAAISSPSLQSNDDLSGYNDNIGGAGAGGGGGGGGGGGNGNGNNSCIDGMSQVDAAVDMFPKLTYNEAGKPSLMCKIDLSRIPYIVAKKRSEEIRIKSELADTRQTTNNVQDGSGGAVAVITLDASSTNTTSNVPLIEPVSDRRRHDNVKDVAASMMTAENSDLHRDQNRQLHEKSQPPTLTRPRKKHASKKTRSGKRKHYADEARDVTMAAASPAAAAEDDDGTSIFANASTVHTPSVDTTDMDSALSSDSDTRKVARCANQNAKKSLAVHDSSHATDSSIPSNMISSCYAVTLNDVQHHENAGAVAGQSSADDESWDSSESSCSNCSASSTPCHDSVDKYSKNGKRSSRYRSCGGGGVKSKKKKKKKKKLRNEQSPRCSTSLGDCAGSIGGGGDGGGGGVGDHLPNTSHERSIMTETPTTTTFNGYHPVVIGDLVRPTVPTHHGIYYSYLEHKASEDLNSESDNVSPNLYLMEAKRLKHAADTERDSVAQGMQYLEAILSFVLTGHVLERKNQMDTAFNMYSETLKLIVYISKKFRTLCTNAAQSSIPNKIAILSLRCESILNLKLYTMRENEVKEVHNAVAEYFNKHVAAEACLISNAAGVPSPHSPTPSPASSVGSNSSGYSTGDRRSVAMSQCIAVPFNLHVALQKQHNFYSNLIASFSYWEEADKLIFSGGNKDFFVELDRFCGPLTLHSSLKDLVHYVRVGIQRLKAIISHNHGV
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -