Osur009087.1
Basic Information
- Insect
- Oryzaephilus surinamensis
- Gene Symbol
- lilli
- Assembly
- GCA_004796505.1
- Location
- SSSI01007716.1:8696-28690[-]
Transcription Factor Domain
- TF Family
- AF-4
- Domain
- AF-4 domain
- PFAM
- PF05110
- TF Group
- Unclassified Structure
- Description
- This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 8 0.00035 6.4 5.9 24.2 4 215 7 213 5 231 0.57 2 8 6e-10 1.1e-05 24.9 20.5 341 499 237 405 226 413 0.73 3 8 1 1.9e+04 -8.7 14.3 165 231 523 591 438 624 0.47 4 8 1 1.9e+04 -11.1 19.7 78 237 496 659 491 673 0.49 5 8 1 1.9e+04 -6.0 10.7 434 476 648 696 631 709 0.60 6 8 1 1.9e+04 -20.5 41.5 93 259 692 841 649 877 0.31 7 8 0.00083 15 4.7 15.6 149 263 1024 1135 915 1150 0.58 8 8 0.069 1.3e+03 -1.7 2.0 162 215 1299 1348 1228 1377 0.50
Sequence Information
- Coding Sequence
- atgaagAGACCTCGtGTGGAAAGGGACCGCCTTCGTGAGCGGGAACGGCTGGCGCGGGCCCAAATGTCTTCACAAGCGGTCGAACAAGAAAGTCCAGGTGCCTCGGGTGATTTCCTATTTAGTGGTCCGATCAAAgTTAACCCGTCCTCCGCTGATCCCGTCACACAACAAATTCAAAGTAAACTTGGCGACTTTCAACGTGTTCGCCATTATTTGGACCAAAAGGATAGCGCTCTTGTGGGGGTGGATGGAGTGCCACCGCCAAGTCCAGGTGTGCCATCTTCCAGACATCATTCTATATTGAGTTTACCGCCAAATTCTGGACCATGTTCAGCAGCAAGaTTGCAACCATCACCGGAATCTCGTACAGAATTCAAAAAACCCCACCACCATCCACATCACCCATCATCATCGTCGTCGTCTTCGTCGTCGTCGCACCAACGTAGCGGCTACGTAAAACCGGCGGATGGTAAACCACCGTATGGCGGCCGTGGCGGTTATCCAGGCCAACCGGTAAAACATGGCAGCATCAATAATCATCGACCAAATGGTATATTACCAGCAAAGGGACCACCTCTGTCACCGGCTGCCTCCTCCTCATCTAGTTCCTCCACATCATCTTCCAGTAGTAGTCGGGGCGGCGTGCATGGACAAAGAAATCAACGCATTCCATACGAGaaTCAAGGTTCTACGGCCGGCCCTAGGGAATCCTTGCCTTCCTCAAATCCAGAcgtaaacaatatatttaagGAAATGAGAGAAGTTCCATCGCCTTTAGCTGCAATGGCTGCAACTCCACGGATAGAACTGgacaataaatatacaacatgTACATTTAATCCAGTTCTTGCCAAGTTGACTGAAGCATCACCAGCACCATCTACGCCAAAAAAACGCGATCGTCTGCCAGCCCCTAGGCCCTCCAcTAATTTAAAAGATGATCTCGATTTATCTGAAGAGAGTGACGATGAACAAAAACGTGAAGCACTTCATTCATCTTCCAAATTATCCAATGTTGAAAAaatgTTATCGCCATTAGGCGATTCAACACCAGTCAATCAAAAAATGGAACGACCACCCGATCCTGCCCATTCACCTATTGGCACATCATCAAGTGAATCTGGTTCCGATTCCGGTTCGGAAAGTGATTCAACTACTGACGATTCAGCTGAAGAGAATGTGACGTCGATGAGTCGTGTagcaccgccaccaccaccagtGGCGCCACCAGTTGTTTCCACATCTGTGGTGCCGGCAGAAGCGCAACCCTCTTCACCAAAACTGGAAGAAGAAAGCAAACAGTTGCGATGGAATTTGGGATCGTTTGTTGTATCACCAAAACCGCAAACTTCGCCACTTTTATCGCCAATCAAATCCACAACGGCTATTACTAGTCCAGATAGCCGAAAACGGGACAAAGAAGAGTCCGATACGAGTGATTCAACACGTGATTTGGGTCGAGTGGTGGCTGAAGCGTTTGCATCAAATTCGGTACCATTGTTAAGTGATTTCGGCGATACGGATTCGGAAAAAGAGGCTCAAAAGCGAACTAAACGTCGCAAAAggacaaattataataatgcacAAGTGATGCCGCCACCGCCGCGCAGTGATGACGATGACAGTGAAGAGGATAGTGATGAACGGACGAAAATTACGAAACCAGTGCCGCGTGTCAGTCCACGAACGAAATCAATTGACTCAATTAGTGATACTGATGATGATTCGGAATTTAGTAGTAGTGTTGTAAATAGTGccaaaaaatcaaaacaaactgTAAAATCGACCAGCGGCGGAGATAAACCAAAATCGAATCGTGGTAGACCaagaaaatacaataataagtTAGTGCCAGTGGTAGACAATAACAAGAAACGTGGTCGGCCGCCTATAAAGAGCCACCAACACAGTGACTCCGATACAGAGGTGCGGAAGAGACGTGGTCGGCCACCGAAATCGACACGTCCCTCATCACCACCAACTTCTtcagatgatgatgatgatattgATGAGAAATTTGACAAGCCGCCGCCGTCACGACGCCGTACCAAATCAAAAATGGATTCGAACACTAGTTCGGATTCGGATGTGAGTCCACCACGGGGTCGACGTGATTCGGATCGTGATTCTGTAAAATTGGAAACACCACGAAAACCACACAAACACAAATCACCACGTTCTGAtgataaaataagaaaaaataaaaaagaatccGATGATGAATGGGGTgaaaagaacaaaaataaattacgacATCATTTTGAATCGGAGAAACGAGCCAAATTGGATGGATCACCGCGTAAAAAGGATCCACAGAGACGGAAAGGTCGTCCATCACATACAAAACAGAAAAGTATTTCAACattaccaacaacaacaagtgATTCCAGTGACAGTGATGTTAAACAGCACAAAACGCAATCATCACCAATACGAACAGTGAACACAAATCGTCGATCATTGTCCATATCGGATTCAGATCGGCATTTAAGCCGTTCATCGGACAGTGAGTGTAGTGACAATAGTACAAGAAGACGGAAAAAATCACCAATGAAAATTGATTCATCAATTAAAGTAGAAGAAACGAAACCGATTCAAGATAAAAAGAAAAGCGACACGTTAAGGAAACTGTTCACAGTGAAACGTGATTCAGAGGGTGGTAAAGGTGGTGGCAAGGGTGGTGGAAAAGGTGGCAAAGGTGGAAAAGGAAAAGGTGGTGTTAGTGTAATAATGGTAGATGAGAATTATGAACGGAGTAGTTCATCTGTTGAAGATGAAACAATGCCAACAATATCATCGAATCCAGCACTATTATCGCCTATATCAAATGTACCGCCACAACCACCACCATCATATAATAATCATCATCTCTATAATGAATCAATCAAAACACAAAAGACTGACTTATCATCAATTAATAACGATCACAATCATATTAATAGTGAGAAAGGTGTTATAGTGAGAATCGATTTGAATCGTTTAGACATTCCAGCAATTCCACAATTAAAGAGATATTTAACAATGAAACGTGGATGGaataatatacaagaaaaaaatcgtttattataCGATAATGATATCGAATCGCATCATCAAAATAGTCGTGTTGTCGTAAAAGATGGTGGCGATAATAATgatgtgaaaaaattaaataaaatcgatcAACAACATACAACTGAtcgaaaacataaaaaacggAAACGGCGGAATAGTAGTAGTTCAGTGTCATCACATTCAACAATAAGCAGCATGTCGCATACTAGtagtaataagaaaaataaagaaaattttctaaacaacAATAAAGATAAGGAGAATCGTAAATCGAAACGACGAAAAGAGGacaacaacacaacaaatgacctaattttgaataataatagatCTCATGCTGATAATCTCAGCCTAACAAATGTACCAGCGACGAACCATGAAAGGGAAGGTTCAAGTAGTGGACGTTTAACGCCACCTGTTGTTGGAAATTATACGACGATGCGACCGTCCACGCACAGGGAGTATCATTCGTATTTTGAAGCACCTGAAGAACCTTCCGAATATGAGGAAAGgGATCAAAATCAGTACCTGAACGATGCGAAACGTCTTAAACATATGGCTGACAAAGAAACAGATCCAATAAAACAATGTATGTTATATTTGGAGGCGGTTTTATTCTTTCTATTAACAGGAAATGCCATGGAACATGAAAGTGTCACGGAAAAAGCTGCTTTTACAATGTACAAAGATACACTTAACTTAATAAAATTTATTTCTTCAAAGTTTCGTAATCAACAAACTGCATCTTCAGTACACAATAAATTAGCTGTATTAAGttaCCGGTGCCAAGccttattatattacaaactatttaaaatgagGAAACAAGAATCGaaagaaatacaaaaaacGATCAGCGAGTTCTGCAATAacaaaaaTGCTACAATGCCTCAGGACCAACAAAATCACCAACAGGGTCAAGGTACACCTTCGCCCTTATCACCAACACCGTCACCGGCCGGTTCAGTCGGTTCAGTGGGCAGCCAATCGTCTGGTTATCTTAGTGGCGAATTACGAGGCAACAACAGTAACAATAATGCCCCAGTTGTGCCATCAACGCATGCACAAACGCCAGGCGTTTGGGTACCATTGCCCGTTTTTAATGCCATCAATAAACAAAACCTACAGTTTACGTATTTATTGAGTTATCAAGAGTTGTGGGACACAGCCGACTCGTTGGTGATTAAAGGAAAACATACAGAGTTTTTTATCGAATTGGACCGCCAATGCAGACCATTAACAATGCACAGTTCATTAATAGACTTGGTCAGGCATGTACGTGAAGGGATAAATCGACTAAAGaatcaaagttga
- Protein Sequence
- MKRPRVERDRLRERERLARAQMSSQAVEQESPGASGDFLFSGPIKVNPSSADPVTQQIQSKLGDFQRVRHYLDQKDSALVGVDGVPPPSPGVPSSRHHSILSLPPNSGPCSAARLQPSPESRTEFKKPHHHPHHPSSSSSSSSSSHQRSGYVKPADGKPPYGGRGGYPGQPVKHGSINNHRPNGILPAKGPPLSPAASSSSSSSTSSSSSSRGGVHGQRNQRIPYENQGSTAGPRESLPSSNPDVNNIFKEMREVPSPLAAMAATPRIELDNKYTTCTFNPVLAKLTEASPAPSTPKKRDRLPAPRPSTNLKDDLDLSEESDDEQKREALHSSSKLSNVEKMLSPLGDSTPVNQKMERPPDPAHSPIGTSSSESGSDSGSESDSTTDDSAEENVTSMSRVAPPPPPVAPPVVSTSVVPAEAQPSSPKLEEESKQLRWNLGSFVVSPKPQTSPLLSPIKSTTAITSPDSRKRDKEESDTSDSTRDLGRVVAEAFASNSVPLLSDFGDTDSEKEAQKRTKRRKRTNYNNAQVMPPPPRSDDDDSEEDSDERTKITKPVPRVSPRTKSIDSISDTDDDSEFSSSVVNSAKKSKQTVKSTSGGDKPKSNRGRPRKYNNKLVPVVDNNKKRGRPPIKSHQHSDSDTEVRKRRGRPPKSTRPSSPPTSSDDDDDIDEKFDKPPPSRRRTKSKMDSNTSSDSDVSPPRGRRDSDRDSVKLETPRKPHKHKSPRSDDKIRKNKKESDDEWGEKNKNKLRHHFESEKRAKLDGSPRKKDPQRRKGRPSHTKQKSISTLPTTTSDSSDSDVKQHKTQSSPIRTVNTNRRSLSISDSDRHLSRSSDSECSDNSTRRRKKSPMKIDSSIKVEETKPIQDKKKSDTLRKLFTVKRDSEGGKGGGKGGGKGGKGGKGKGGVSVIMVDENYERSSSSVEDETMPTISSNPALLSPISNVPPQPPPSYNNHHLYNESIKTQKTDLSSINNDHNHINSEKGVIVRIDLNRLDIPAIPQLKRYLTMKRGWNNIQEKNRLLYDNDIESHHQNSRVVVKDGGDNNDVKKLNKIDQQHTTDRKHKKRKRRNSSSSVSSHSTISSMSHTSSNKKNKENFLNNNKDKENRKSKRRKEDNNTTNDLILNNNRSHADNLSLTNVPATNHEREGSSSGRLTPPVVGNYTTMRPSTHREYHSYFEAPEEPSEYEERDQNQYLNDAKRLKHMADKETDPIKQCMLYLEAVLFFLLTGNAMEHESVTEKAAFTMYKDTLNLIKFISSKFRNQQTASSVHNKLAVLSYRCQALLYYKLFKMRKQESKEIQKTISEFCNNKNATMPQDQQNHQQGQGTPSPLSPTPSPAGSVGSVGSQSSGYLSGELRGNNSNNNAPVVPSTHAQTPGVWVPLPVFNAINKQNLQFTYLLSYQELWDTADSLVIKGKHTEFFIELDRQCRPLTMHSSLIDLVRHVREGINRLKNQS
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01344908;
- 90% Identity
- -
- 80% Identity
- -