Hpla002322.1
Basic Information
- Insect
- Hololepta plana
- Gene Symbol
- lilli_1
- Assembly
- GCA_963695495.1
- Location
- OY968831.1:10124971-10148325[+]
Transcription Factor Domain
- TF Family
- AF-4
- Domain
- AF-4 domain
- PFAM
- PF05110
- TF Group
- Unclassified Structure
- Description
- This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 7 7.8e-07 0.011 14.7 25.4 29 216 9 193 3 223 0.64 2 7 9e-14 1.3e-09 37.6 19.6 341 500 221 393 208 414 0.68 3 7 0.43 6.1e+03 -4.3 4.1 119 130 486 498 430 596 0.44 4 7 1 1.4e+04 -8.5 21.9 129 250 607 730 588 755 0.42 5 7 0.04 5.7e+02 -0.9 4.2 93 184 754 845 741 853 0.57 6 7 1 1.4e+04 -5.8 24.4 147 257 1013 1121 939 1147 0.64 7 7 1 1.4e+04 -5.8 25.0 91 258 1017 1122 985 1199 0.48
Sequence Information
- Coding Sequence
- ATGTCCTCTCAGGTGGACGAGCAAGAGGAGCCTTCGCTCTTTGGAGATGGACCTGTTAAGGTTAATCCCGACCGGGTGACCAAACAGATACAGAACACGCTGGGCGACTTCCAAAGAGTGAACGGCATGTTCTTGGACGAACCGAAACGTCACATGTGGCAGGACGGACATAGACCGTTGTTTCCGTTACAATTGTCGACCAATCGGCCGCAACCGCCGCCCGAATTCAAGAAGCCGCACCAGAGCAACAGCGGCCGACCGTCTTCTTCCTCGACTTCCGCCACCGCGTCgtccacgtcgtcgtcgtcgttttcctcgtcgtcgcaTCATCGCAGCAATAACTTGATTAAACCGAACGATACCAAGTGCGCGTCACAACAACAATCACAATCTTATGATAACAATCGATCATATCAGAATCCGTCGCCGAATAAGCACGGTAGTACCAACATGAACAATCATCGAATCAACGGTATGCTatcgagcaacaacaataagggGCCAccgcagcaacagcaacaatctTCGACATCACCATTACCTTCGAATCCGTCGAGTCGATTACAAGCCAGCGCCCGTAATCTGGCTCGCATTCCTTTATACGTGcagGGTCAACCACCAACTGGCGCAAATTCTCGCGAAAATCTgtcatcttcgtcgtcgtcgtcgtctggCGGCAAAAGTGTCGAGCATATTCTAAAGGAAATGATTGCCGTTCGCACACCGTTGACTGGAATTGCTGCGACGCCTCGCACAGAACCGGAAaacaaattcgtttttaatccGCAATTATCAAAGTTAACCGAAGCACCAATTCCGAAACAGACAACACCAAAAAAACGGGAGAGACAGCCCTTTTCTCGTAACTCTAGTGATCTACAAGAAGATCTCAATTTGTCGGAAGACAGCGATGACGAGCAGAAGAGAGAACCGCCACCTTGCACAAAGTTACCAATAGACAAaatacTATCCCCTTTGGGCTCGGCAACGCCAATGATGGTCGCAACGAAAGCGGATCCGCCATCCGAACAAACGACCCGTTTATCGCCGACCGCTAGTTCGACGAGCGATTCCGGTTCGGATTCAGGTAGCGACAGCGACTCGACCAGCGAGGATTCCGGCGACGAGAACGTCGGCCACGTGAAAACGCGAAATCCGCTCGTGCCAGTCGAACCGCAGACCATATCGCCAAAGGCCGAGGAAGAGGCCAAACCCAGATGGAACTTGGCCTCGTACTTGGACCAGAACGGCGTCAAGTCCGAACCGTCGCCGCAGGTTCCGGCCGTGTCGCCCTTACAAAGCATCACGTCCAACATGTTGCCATTGAGCAAAAGGCGGACGACGCCGAACGAcatggacgacgacgacgacgattcggACTCGAACGATTCAACGAAGGACATACACAGTGTCTTAGCCGAGGTCAAGGCCCCAACCATCCTATTATCTAGTTTATCCGATTCGGATAGTAGTACAAAGAAGAAGTATcttaagaaacgaaaacgagtGCCAGTTAACGCTACCGCGTCCGTGAATTTGAACGGTAACGTGTCGGATGACAGTGATAGTGATGATTACGGACGGACATCTAAACTACACAAACCGGTGAACCGTGCTAGTCCGAAGCAGAAGAAATTGTTGGTGGACGATTCGAATAGCGATTCGGACATGGACAAGATCACGTTACACGGCGAGGACGGAGGCAAAACGTGGACCTCCGTCGCGCCGTCGAATGTGCCACCGGACAAACCGAAACTTGGTAGACGCGGGCGTCCGAGGAAACAGCCGGGCGAGAAACGGCCGGGCCGACCGCCGAATTCGAAGAAGAGCGCCACCCGAGGCAAAGTCCTTCCGAAATCGCACACCTCTGACGACGAATCGGCCACGACGCACGTTAAACATTCGAAGAAATCGTTGAAGAAGGACGTTCACAGCAGCAGCTCGGACTCTGACACTTCGGTGCACAGGAAACGcaattacaacaataacaactgcGTATCCGACCAGGATTCCACCAGGGTCTCGTCGAATAACAAGTATAGCAATTCGATGccttacaacaacaacaacaataatgataataagcaTCTCGATTCCGATCACGACGATTGGAGCGAGCGCAAGAAGAAGTGTTACAATCATTTGTTCGATAACCGTGAGAATGTTAATAAAGCGGAGTATAACAAGAAGAGACAGAAGCCGAACAGGTTGGCGAAGAGGAAGGGTAAACAAGATATCAGTCAGGAACTGTtaccgacgacgacggactCGGACAGCGATTCCGACGTCAACCATAAGAAAAGGCCGTCGGCCGCGAAGAAGCCGCCGCCACCGGTTAATCCGTACATCTCGGCATCGAATTCGGATAGCGACGACAAGCTGATCGGTTCGTCCGACAGCGACGACGGTCGGCATTGTCATAGTCGTTTGTCGATTGGCGGCAACGTTAGCAAAGAGAACAGGGGCAAAGTGGACAGCGACAGTAAGCCGCAGGACAAGAAGAAATCCGATACGCTCAAGAAGTTGTTCAAAAAGGATTCGGAGGGTGGAAAGGGCGGTGGCAAGGGCGGCGCGAAAGGTGGCAAAGGAAAAGGCggcgtcatcatcgtcgacggcGATTACGAACGCAGTTCGTCGTCCGTCGAAGAGGAAACGATGCCAACGACCATATCAAATGCTACAATGTTaccaatatcaaataatttacatgaaaCCAGGGGATCCACGCATTCGTTATCATACGAACCAATCAAATCTCTAAAGACTGAACCGACGGACGGCGGTGATAGCATTAGATCaaatattaatagtaataataatagtaatagcgTACCAAGTATTATGGTTCGAATCGATCTGAATCGATTGAGCTATGTGCCGAAACCGACGAAATCGTTGAGTAAAAGGCGATCCGAGGAGATGAGGCGATTGGGCGAGAAGGCCGACACGAGGCAAGAAATCGACGGGTCGATAGGTGGCGGCGGCAGTAGTGATGTGATGATTAAACCGTGTGATAATTtttacgaaacgaaacggaCGTCCTCGTCATCCGTGACGATCGAGAGTGATAGTGATGCGAACAATAAGAAAAGTAGTAAGTCAAATAAGAACAATTACGAACAATCGAAATCGAGCGGCGCTAAAGGGTCGAGCAAGAGGAAGCGACGCAACAGCAATAGTTCCTTATCGTCGCTGTCGACCGTCAGCAGTAACGTTTCGCTGAACGGCGGTAGCAATAGACGGCACGAGCATCACAGCGGTAGCGGCGGCAACAAGAAGGACAAGAGTCACAAGAGCAAGAGACGGAAAGAGGAGAAATCGTTGAGGGACGGCGACCAGAGTTTTGTTACCGTTCCGCCGACGAATCACGATCGGGAAGGTAGACCGCGTACGCCGCCATCGACTGCAACAATTGTAAATAGAACAAGtactagtaataataatgcacaaGTTGCCGCGGCCACTACGGTCGCAACCAGAGAGTATCATTCGTACTTCGAGAGGACCGACGATCAGCTGGACGAAGAAGGTGATCAAAATCATTATTTGAACGAGGCTAAAACGTTAAAACATTTGGCGGACAAAGAAACGGACACGATAAAACAGTGTATGTTGTATTTGGAGGCTGTCTTATACTTTTTACTGACCGGTAACGCCATGGAGCAAGAGTCGATGACGGAGAAGGCGGCATTCAAAATGTACAGAGACACGCTCGGGCTAATCAAATACATACCGTCTAAATTTCGAAGTCAGCAGAGCACGTCCTCTGTCCAAAACAAGCTGGCCGTTTTAAGTTTTAGATGTCAAGCGCTATTGTACTACAAACTATTGAAGATGCGTAGACATGACATGAGAGAGTACCAGAAAATCATCCACGATTACTTTCTAAAGTCTGCGAACACCGCAGCAATCCAGTGGGACCAAATGAACCAAGGCGTAGGCGGTCAAGGTACACCTTCTCTGTCGCCGACACCGTCACCAGCTGGATCGGTGGGCTCCGTTGGTAGTCAGTCGTCGGGTTACAGTAGCGGCGAATTGGCGAggggcaacaacaataacgcgcCCAGCGTTTCCGGTTCCGTTCCTACGCCTACACCGTGTATGCTGGTTCCCGTCGCGGTTTACAATGCCGTTCAGAAGCAAAATCAGTTGTTCCCGTATTTTCTCAGTTACCAGGACCTTTGGGAGCAAGCTGATAATCTAGTGATTAAAGGTAGTCATAAAGATTTCTTCATTGAATTGGATAGACAGTGCAAGCCATTAACTGTACATAGTTCGTTAATAGACTTGGTGAAATACGTTAGAGCCGGTATACAGAGATTGAAGAGagaaacttaa
- Protein Sequence
- MSSQVDEQEEPSLFGDGPVKVNPDRVTKQIQNTLGDFQRVNGMFLDEPKRHMWQDGHRPLFPLQLSTNRPQPPPEFKKPHQSNSGRPSSSSTSATASSTSSSSFSSSSHHRSNNLIKPNDTKCASQQQSQSYDNNRSYQNPSPNKHGSTNMNNHRINGMLSSNNNKGPPQQQQQSSTSPLPSNPSSRLQASARNLARIPLYVQGQPPTGANSRENLSSSSSSSSGGKSVEHILKEMIAVRTPLTGIAATPRTEPENKFVFNPQLSKLTEAPIPKQTTPKKRERQPFSRNSSDLQEDLNLSEDSDDEQKREPPPCTKLPIDKILSPLGSATPMMVATKADPPSEQTTRLSPTASSTSDSGSDSGSDSDSTSEDSGDENVGHVKTRNPLVPVEPQTISPKAEEEAKPRWNLASYLDQNGVKSEPSPQVPAVSPLQSITSNMLPLSKRRTTPNDMDDDDDDSDSNDSTKDIHSVLAEVKAPTILLSSLSDSDSSTKKKYLKKRKRVPVNATASVNLNGNVSDDSDSDDYGRTSKLHKPVNRASPKQKKLLVDDSNSDSDMDKITLHGEDGGKTWTSVAPSNVPPDKPKLGRRGRPRKQPGEKRPGRPPNSKKSATRGKVLPKSHTSDDESATTHVKHSKKSLKKDVHSSSSDSDTSVHRKRNYNNNNCVSDQDSTRVSSNNKYSNSMPYNNNNNNDNKHLDSDHDDWSERKKKCYNHLFDNRENVNKAEYNKKRQKPNRLAKRKGKQDISQELLPTTTDSDSDSDVNHKKRPSAAKKPPPPVNPYISASNSDSDDKLIGSSDSDDGRHCHSRLSIGGNVSKENRGKVDSDSKPQDKKKSDTLKKLFKKDSEGGKGGGKGGAKGGKGKGGVIIVDGDYERSSSSVEEETMPTTISNATMLPISNNLHETRGSTHSLSYEPIKSLKTEPTDGGDSIRSNINSNNNSNSVPSIMVRIDLNRLSYVPKPTKSLSKRRSEEMRRLGEKADTRQEIDGSIGGGGSSDVMIKPCDNFYETKRTSSSSVTIESDSDANNKKSSKSNKNNYEQSKSSGAKGSSKRKRRNSNSSLSSLSTVSSNVSLNGGSNRRHEHHSGSGGNKKDKSHKSKRRKEEKSLRDGDQSFVTVPPTNHDREGRPRTPPSTATIVNRTSTSNNNAQVAAATTVATREYHSYFERTDDQLDEEGDQNHYLNEAKTLKHLADKETDTIKQCMLYLEAVLYFLLTGNAMEQESMTEKAAFKMYRDTLGLIKYIPSKFRSQQSTSSVQNKLAVLSFRCQALLYYKLLKMRRHDMREYQKIIHDYFLKSANTAAIQWDQMNQGVGGQGTPSLSPTPSPAGSVGSVGSQSSGYSSGELARGNNNNAPSVSGSVPTPTPCMLVPVAVYNAVQKQNQLFPYFLSYQDLWEQADNLVIKGSHKDFFIELDRQCKPLTVHSSLIDLVKYVRAGIQRLKRET
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -