Ehay047561.1
Basic Information
- Insect
- Eretmocerus hayati
- Gene Symbol
- lilli
- Assembly
- GCA_029851415.1
- Location
- CM056744.1:25854309-25870282[+]
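The location string above packs a sequence accession, a coordinate range, and a strand into one token. A minimal sketch of how it could be split apart follows. The `Locus` type and the function names are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for one genomic location."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Locus:
    """Parse a string of the form 'SEQID:START-END[STRAND]'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)-(\d+)\[([+-])\]", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Locus(seqid, int(start), int(end), strand)


def span_length(locus: Locus) -> int:
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return locus.end - locus.start + 1


locus = parse_location("CM056744.1:25854309-25870282[+]")
print(locus.seqid, locus.start, locus.end, locus.strand, span_length(locus))
```

The span covers the whole gene locus, so it is expected to be much longer than the coding sequence listed further down.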
Transcription Factor Domain
- TF Family
- AF-4
- Domain
- AF-4 domain
- PFAM
- PF05110
- TF Group
- Unclassified Structure
- Description
- This family consists of the nuclear proteins AF4 (proto-oncogene AF4) and FMR2 (associated with fragile X E syndrome). These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and intellectual disability [1]. The family also contains Lilliputian, a Drosophila homologue of AF4 that contains an AT-hook domain. Lilliputian is a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1   4   8.7e-05       5.4    7.9   0.1        44      78         8      43         5      63  0.75
 2   4         1   6.1e+04   -8.2  32.1       459     493        65      99        37     111  0.69
 3   4         1   6.1e+04   -8.5  22.1       442     484       138     178       131     185  0.70
 4   4   4.6e-10   2.8e-05   25.3   2.4       324     382       196     255       185     273  0.87
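The per-domain rows above can be read programmatically to see which hits carry real support. The sketch below parses the four rows exactly as listed and keeps those whose independent E-value falls under a cutoff. The `DomainHit` type and the cutoff of 1e-3 are assumptions chosen for illustration, not values taken from this database.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record for one row of the per-domain hmmscan table."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# One converter per column, in table order.
CASTS = (int, int, float, float, float, float,
         int, int, int, int, int, int, float)

# Rows copied from the record above.
ROWS = """\
1 4 8.7e-05 5.4 7.9 0.1 44 78 8 43 5 63 0.75
2 4 1 6.1e+04 -8.2 32.1 459 493 65 99 37 111 0.69
3 4 1 6.1e+04 -8.5 22.1 442 484 138 178 131 185 0.70
4 4 4.6e-10 2.8e-05 25.3 2.4 324 382 196 255 185 273 0.87
"""


def parse_rows(text: str) -> List[DomainHit]:
    """Convert whitespace-separated rows into DomainHit records."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != len(CASTS):
            continue  # skip blank or malformed lines
        hits.append(DomainHit(*(cast(v) for cast, v in zip(CASTS, fields))))
    return hits


I_EVALUE_CUTOFF = 1e-3  # assumed threshold, for illustration only

hits = parse_rows(ROWS)
significant = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
for h in significant:
    print(f"domain {h.index}: residues {h.ali_from}-{h.ali_to}, score {h.score}")
```

Under that assumed cutoff only the fourth domain passes; the other three rows have independent E-values well above it.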
Sequence Information
- Coding Sequence
- ATGGTAAACCCAGACGCGCACGACCGGACGACGCAGCAGATCCAGTCGAAGCTCGGTAACTACTCGCTGGTGAAGCACCTCCTCGACGAGCCGAAGCGACTTATCGGCATCGAGGGGGTGCCCGCAAGCCCTGCCCCCGGTGTCAATCCCAATCCCATCACATCCGCCATCACAGCGCCAAATAGTGGTGGAGGAGGTAATGGAAgctcttcgtcgtcgtcgtcaacgtcatcttcatcgtcgtcgtcgtcgtccagGTCCAGCCAGTCGAGTAGCGAGTTCAAAAAGCCTGGCGGCAATCCACGATCGAGTTCGAGCAGTAGCCAACAGAGGGGTGGTTTTGTCAAGCCCGCTGACGGTAAACCACCATACGGAGGCCGGGGTGGTTACCCTGGCCAGCCGGTCAAACACGCGAGCAGCAATAATGAGCACAGGAGTCATGGTCTTGCCCCAGCTAAGGGGCCACCTTCgtcttcatcgtcgtcgtcacaGCAATCGGGTGTATCGAATTCgacgtcatcgtcgtcgtcgtcgtccggactgCTGGGAGGCAACAGCGCGAGCTTGCACGGCAGATTGCACACGGCCAGCGTTCGGCTACCCAAGCTGCCCATCGATGCCAGCTCGAGCTCAAGGCATCTCAGCGCCGACAGTACCGCAGAGGTGGAGACAATTCTTAAGgaAATGACAAAGCCGCCAACGCCTCTGACTGCAATTGCTCAAACGCCAAGGAAGGAATTAGAGTCGAAATTTACTTTTAATCCGGATCTAGCTAAGCTGACTGAAATAACGCCTCCGGAAACAAAGCCACGTGAGTACCTCCATATGAACTCATTCACCTATATAGTACCCCAAAAAATTCACACCCCATCAACTGGAACGAGTCGGCCGGATTTTAATCCCTCCTTGACATTCTCACCCACAGCAATTGCTATATCATGA
- Protein Sequence
- MVNPDAHDRTTQQIQSKLGNYSLVKHLLDEPKRLIGIEGVPASPAPGVNPNPITSAITAPNSGGGGNGSSSSSSSTSSSSSSSSSRSSQSSSEFKKPGGNPRSSSSSSQQRGGFVKPADGKPPYGGRGGYPGQPVKHASSNNEHRSHGLAPAKGPPSSSSSSSQQSGVSNSTSSSSSSSGLLGGNSASLHGRLHTASVRLPKLPIDASSSSRHLSADSTAEVETILKEMTKPPTPLTAIAQTPRKELESKFTFNPDLAKLTEITPPETKPREYLHMNSFTYIVPQKIHTPSTGTSRPDFNPSLTFSPTAIAIS
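The coding and protein sequences above can be checked against each other by translating the former. A minimal sketch using the standard genetic code follows. It assumes the standard code applies to this nuclear gene and that the lowercase stretches in the coding sequence are soft-masked bases to be treated like uppercase. The helper names are hypothetical, and only the first 24 and last 12 nucleotides are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = cds.upper()  # treat soft-masked (lowercase) bases like any other
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE.get(codon)
        if aa is None:
            raise ValueError(f"unrecognised codon: {codon}")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGTAAACCCAGACGCGCACGAC"  # first 24 nt of the coding sequence
cds_end = "GCTATATCATGA"                # last 12 nt, including the stop codon

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments should reproduce the first eight and last three residues of the protein sequence listed above; passing the full coding sequence to `translate` allows the same comparison over the whole record.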
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -
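The record does not state how the MMseqs2 clustering was run. As a rough sketch, one way to produce clusters at the three identity levels listed above is the `mmseqs easy-cluster` workflow with its `--min-seq-id` option. The file names, output prefixes, and the helper function below are hypothetical, and any other parameters the database may have used (coverage settings, for instance) are unknown. The code only builds the command lines and does not execute them.

```python
import shlex
from typing import List


def build_cluster_command(fasta: str, out_prefix: str, tmp_dir: str,
                          min_seq_id: float) -> List[str]:
    """Build (but do not run) an `mmseqs easy-cluster` command line."""
    if not 0.0 <= min_seq_id <= 1.0:
        raise ValueError("min_seq_id must lie between 0 and 1")
    return [
        "mmseqs", "easy-cluster",
        fasta, out_prefix, tmp_dir,
        "--min-seq-id", f"{min_seq_id:.2f}",
    ]


# Identity levels taken from the list above; paths are placeholders.
commands = [
    build_cluster_command("proteins.fasta",
                          f"clusters_{round(identity * 100)}",
                          "tmp",
                          identity)
    for identity in (1.0, 0.9, 0.8)
]

for cmd in commands:
    print(shlex.join(cmd))
```

A "-" under an identity level in the list above means that no other transcription factor shares a cluster with this sequence at that threshold.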