Zhep004559.1
Basic Information
- Insect
- Zelleria hepariella
- Gene Symbol
- -
- Assembly
- GCA_949319315.1
- Location
- OX439380.1:5828994-5853102[-]
Transcription Factor Domain
- TF Family
- AF-4
- Domain
- AF-4 domain
- PFAM
- PF05110
- TF Group
- Unclassified Structure
- Description
- This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 4 1.5e-09 1.8e-05 24.6 15.8 460 512 82 135 48 137 0.69 2 4 2 2.4e+04 -10.9 15.4 453 501 257 306 248 317 0.69 3 4 2 2.4e+04 -8.7 12.8 171 247 429 503 317 518 0.59 4 4 2 2.4e+04 -11.8 9.9 415 485 642 709 603 721 0.58
Sequence Information
- Coding Sequence
- ATGTTGGGATTTCTGACGTCGGATTTCGGATACCAGAATCGATTGACGCTGCTGCAACAGCTGATCGAGTGTGGTGATGTGCTTCTGGGACAGCCTTACACTTATGGTTTAGACAACCGGATAGCGAACCCCAGGCGCTCTGTATCGCCTGATGTGGTAGTCCCCAAAGAGGTGCCACTTTCGGAGAGCGACGATGAggcagccgccgccgccacccggCTACCCTCCATCCTCTCGCCGCTCGGCTCCGGCGGCTCCTCAAGCTCGGACAGCGAGTCCTCCCGCTCGGACTCCGAGTCCGAGTCGTCCTCGACCGAGTCCTcgactgcgcccgcgccgccgccggccgcgcccgcgcccgcgccgcagccCGCGCAGCGCCTCTCCTGGTCGCTGCTGAACTTCGCCCAGCCGCCCGCGCAGCCGCCCGAGCAGAGCTATGAGGCGGGGAAGGAGCTCGCGCATGCGCTGGCCGACGTCAAAGGGAAACCGGATATGGTCATGGACCAAAGCATGGAAGTCCAGACCCTAAACGACCTGATCCTCATACTCCACCGTGCGATAGACTGGTCTCGCCACAACGCCCAACAACGCTGGGTCCAACTGGCCCTCCTCAAGACCAGCCTACGAAGAGCCGTGACCGCGTTCGAGACCAGTATCCGACATATTGCACCTGGGGGTCACCGGGAGGGGATTCTGGATGAATTCGGCAAGTTCAAGGACTTGCTGTGGCAAGCTGTTGGCGCGGTGCATGGAGAGATTTGTTCTGTTGAGAGTCCAATCTCCGAGCTCTCTGACTCTGAAGCGTCATCACCTTCATCCGGCTCGGCGCGGCGACGTCGCTCGATAGCCAAGCAGCAGTCTTCGGCTTCCAGCGATGACGAACGCCCGAGGACGAATCAGAACACGCCATCGCCGCGCGTGCAAGCGCTAGCGCCCACCGCGCCCAAGCGCGGCCGCCCGCCCAAGCCGCGCGTTGAGGAGCGCCCCGCGAAGAAGAAGCGCGGCCGCCCGCCCAAGGTGCGGCCGGCGTCCCCGCCCGCGCCCCCGCCCTCGCCGCCGCCCTCGCCCAGCGAGAGCGAGGCGGAGCCCGaaccaccgccgccgccgccgccgccgcagccgcagccCCTACTCGACACTAAAAACCAGATCTTTAAGAGGGTATTCACGCCGCGCAAGGGCGACGAGGGGGGCGGCAAGGGCGGCAAGGGGGGCAAGGGGAAGGGCGGCAAGGGCAAGGGTCAGGTGACGATAATCGCGCCCGAGTCGTTGGACGACGACGAGCGCCGGCGCAGGGACCGCTCCGACGAGCGACGCCGGAACGACGAGGCGATCGCCGCGCGACTGTCGCCGCAGCCCGAGAGCGATTCTGCTAGGCGACGCGACCGCGCCTCGCATAATAGGATACAGGAGCGTAGATCGGCGGAGCGCACCGACCGCCCCACTAGCGAGAAGAGCCAAGAGCGAGTGGAGAGACCCACCGACCGGCCTTCCACTGAACGGACCTCTAATGACCGGATACCGGCCACGGACCGGATACCGGGCGAACGGCTGACTGCTGACCGGATACCGGGCGACCGATTGACTACTGACCGGTTACCGGTCGACCGGTTGACTACAGACCGGATACCAAGTGACCGGCTGACTGCAGACCGGATACCGAATGACCGGAGTGACCGGTTGATTCCTGACAGACTGTCAAGCGACCGGCTTACCGCTGACAGGTTGTCAAGTGACCGGTTGACGGCTGACCGCCTCCCGAGCGAGCGGATTTCCAGCGACCGGCTGACGAGTGATCGATTGACCTCTGACCGCCTGTCGAGTGATCGGTTGACTTCAGATCGCCTGTCGAGTGACCGGCTGGCCGCAGATAGACTGTCGAGCGACCGGCTGACCTCTGACCGACTGTCGAGCGACCGGCTGTCTTCGGATCGCCTGTCTAGCGACCGACTCTCTGGTGATAAGCTGTCCATTGAGAGGCTGTCAGTGGAGCGTCGGTCGGCGGACCGCGTAGACCGGTACATGAAGGCGGAGGCGGACGTGGGATCCGGACTGGTCAAGTCTGAACGGGGCTCGGAGCGGCTCTCCGAGGCCTCGTCTAATGtggAGCGTCGGTCGGCGGACCGCGTAGACCGGTACGTGAAGGCGGAGGCGGACGTGGGATCCGGACTGGTCAAGTCTGAACGGGGCTCGGAGCGGCTCTCTGAGGCCTCGTCTAATGGTGAGAACATTGTTTAG
- Protein Sequence
- MLGFLTSDFGYQNRLTLLQQLIECGDVLLGQPYTYGLDNRIANPRRSVSPDVVVPKEVPLSESDDEAAAAATRLPSILSPLGSGGSSSSDSESSRSDSESESSSTESSTAPAPPPAAPAPAPQPAQRLSWSLLNFAQPPAQPPEQSYEAGKELAHALADVKGKPDMVMDQSMEVQTLNDLILILHRAIDWSRHNAQQRWVQLALLKTSLRRAVTAFETSIRHIAPGGHREGILDEFGKFKDLLWQAVGAVHGEICSVESPISELSDSEASSPSSGSARRRRSIAKQQSSASSDDERPRTNQNTPSPRVQALAPTAPKRGRPPKPRVEERPAKKKRGRPPKVRPASPPAPPPSPPPSPSESEAEPEPPPPPPPPQPQPLLDTKNQIFKRVFTPRKGDEGGGKGGKGGKGKGGKGKGQVTIIAPESLDDDERRRRDRSDERRRNDEAIAARLSPQPESDSARRRDRASHNRIQERRSAERTDRPTSEKSQERVERPTDRPSTERTSNDRIPATDRIPGERLTADRIPGDRLTTDRLPVDRLTTDRIPSDRLTADRIPNDRSDRLIPDRLSSDRLTADRLSSDRLTADRLPSERISSDRLTSDRLTSDRLSSDRLTSDRLSSDRLAADRLSSDRLTSDRLSSDRLSSDRLSSDRLSGDKLSIERLSVERRSADRVDRYMKAEADVGSGLVKSERGSERLSEASSNVERRSADRVDRYVKAEADVGSGLVKSERGSERLSEASSNGENIV
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -