Lser000592.1
Basic Information
- Insect
- Lucilia sericata
- Gene Symbol
- lov
- Assembly
- GCA_015586225.1
- Location
- NW:17520-107927[+]
Transcription Factor Domain
- TF Family
- HTH
- Domain
- HTH_psq domain
- PFAM
- PF05225
- TF Group
- Helix-turn-helix
- Description
- This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster [1]. In pipsqueak this domain binds to GAGA sequence [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 2.9e-18 4.2e-15 54.9 0.0 1 44 827 870 827 871 0.96
Sequence Information
- Coding Sequence
- ATGTTAAAGGAATCAGAGGAAATCATTAAAAGGCGCAAACTGTCGTTAACGGCAGATACTGCTGCTGCATATTTGGAATTAAATCATAACACAACCActaccaacaacaacaccacCACCATTATACCACCTGGTACACCACCTACACCTACAGCTAACAGCATGGCTTTGGAAACAGAATCCGCAGAAGCTTTAGCTTTAACCACCACCACTTCGGCCACAGCACCCATTTCTCTTAGCACACCAAATATGACCAATAATAGCAACTCGACGACTACAACAACAGCTTTTAACTCGCTCACTGCTGCCGCCGCCAACAGTATGGTCAATACATTATTGGCCAATAATGTTACTACTAGACCAGCTGCCTCTATACCTGACCACTATAGCCTAAGATGGAATAATCATCAGAATCATTTAATGCGTGCTTTTGATGCTTTACTGCAGAGCAAAACTTTGGTGGATGTTACTCTCGTTTGTGCCGAGACCAGCATACGGGCCCATAAAATGGTTTTATCGGCCTGTTCACCCTTTTTCCAGCGTGTTTTTGCCGATACTCCCTGTAAACATCCTGTAATAGTGCTCAAAGATTTTCGTGGCTGGGTAGTGCAAGCTATCATAGATTTTATGTATCGTGGTGAGATTAGTGTGCCTCAGGAGAAGCTTCATACCTTAATAGATGCTGGGGAGTCTTTGCAAGTCCGTGGTTTGGTAGAAAGCTCTATACCCGAACATACTCCCACACCAGCGGCCTCTCCAGATGATTTTGGCATGCTGGAAACCTCTTTACTATCTTCAGAATTGGAAAGATGTTCCAGTTTTGATGAAGAATCCCCCTCTATGGTAACCTCAGGTTCCAAAATCATATTACCCTCTCGATTATTTGGTTCCTCCTACAGACGTGAAAAGGAGCAACGCAAACGAGAAAGAGAACGTGATTTGGAACATGAATTAGAAAGTGATCAAGAAATGTTAAATGATTCTTTAAGTCACAATGATTTATGTACTTCGCCCATGCCTCGCCGCAAACAGGCCAGACCCAGAAGAAGATCAGGAGAATTGACCCATGATATGATCAGCAAACCCACTACACCCATACCACAAGATCACAACAACGAACAGCCACAAGATTTATGCAACAAAGACTCAAGTATTACGGAAAATAAGTCCTCAACACCACAACCACAAACGGAGGCGCTCAGTGAAGATCCCTTAAAAACTGTGATTAAATCCGAATTAACTGAACCCCTAGAAGATGATGAGGAACAGGTGGGCGGTCGTGACAGTGGCAATGATAGTCATGAACAGCCCGAAGATCTAATTGTCAATGATAAGTCTTTGGATAATAATTCCAATTTTGACAATAACAATTCTCTGCCAGAGCCTTTAAACTCCACCGAGGAAATGGAGGAGGAAGATAGAGTGCACGAGGAAGAGCCTGAAGCAGAGGAGGAGGAGGAAGAGGATATTGAACAGTTAATACATCATACCAATGAGCTAAATAGAAGACGCAAATCCGCCTTGTCCATGAATTCCGATGGGCCCGAGGATTTGTGCACCACCAAAAAAGACAACGATACCAGTGGTCATCATTCGGCCTCCGATAATGAAGActgcagcaataacaacaatagcacAAAATTGAATAATAACAATCGTATTGTATTATCCTTAAAGGATATAAGACAATTGAATAAACCCTCCTCCTCTTCGACTCCCAATTCCTCCCAGAATAGCATGAATCCTTTTTCTTCAGCCAGCCTTAGAGATTTAAGATTAGATCGCGAACGTGCTATGGAAACACAATGTAAAATGGAAGTTTTGGAAGCACAAATGCAGGCAGCGGCCGAAGCTGCCGCCGCTGCTGCTGCCAGAGGTGAAAATCCCTTTCAACACATGGAACACCAAATGGATTTGTCATTAGCCGCTGCTGCAGCTGCTGCTGCCGTCTCACAACAGCACTCCAGAGAAAGAGATCAAGCCTTGCAACGCGAACACCGAGAAATGCAACAACACAGTGCTTATGCCAATAGTATACTAGGACAAATGGGTATGCCAGGCATGCCTCCTTTTGGACCCAACCCGGGGGCCCCCTCTGGCCCTGGAGGACCATCGGCCCATGAACGTTTAGAAGAATCAATGAATCGTTTAAGCAAAGAATTTACACCCACTTCACCCATGAGCCTGCCACCACATTACAATCCTCAAGATGGGCCACCACATCCACCCTCACCTTTGCCCTTTCCCGGTATGTCTTCATTGGCGCTAACTCCGCCGCACATGTTTGGGTTGGACTCACCCTTGGGCTTATTTCCACCTGGCATTGATCCCGGAAAACTTTATAATCCCTTAATGGAAATGTCCGATCCTAGAGGTATGCCAGGTGATGCGCCGCCCTTTCTCAAAAAGAAAAGTAAGTACTTGCCTAGACCAAAAGGTCAACATTCTGCACCACGAGGCGGTCCACCACGTTCTTGGACAAATGCCGAATTAACCGAGGCCTTACAGCATGTTTGGAATAAGAAAATGACCACTTCTCAGGCTTCACGCATCTTTGGCATACCCTACAATTCATTGCTAATGTATGTGCGTGGCAAGTATGGCAAATCTTTGAAATTGGAACAATTACGCAAAGATTGCATAAGTGGTCCTCCTTTGGAAATGCTACAAATGGGCATAAGCTCTGGTTCGGGCAAAAACTCGGGGGAGTCTAAAGAGCAGAGAGAAGCCCGCAAAGAGGCTCAACGTAATGCTGCTCAGCATCCTCCCGGTGATATGGAAATGGGAGGTCCTCTAACGGGCCCTGGACCACGACCCTCAAGCTCAGAACCAGACTTATTGGGTGGACCAAATGCCCTCTTTAATCCCTTTAATCCTCAAGGTTTTTACCCCGATTTTCCCGGAGGTTTTCCCGGTTTACCTTTAAGCATGCTAAACTTATTGCCTCCCTCTGAAAGACAACATGCTGCCGCTGCAGCAGCCGCCATGCATCACTCCATGGGCCTGGATGAGGACTGTAAATCCGATCGCTCTAAACAATCCTCCAATATGGATGATGAATACTCTCTGAGTCGTGAAGAACGTCGCCACTCGGATTTAATGAATAACTCCTCTACACCTTCATCTGCCTCGCAACAAAATGGCTCGATACAAGATTGA
- Protein Sequence
- MLKESEEIIKRRKLSLTADTAAAYLELNHNTTTTNNNTTTIIPPGTPPTPTANSMALETESAEALALTTTTSATAPISLSTPNMTNNSNSTTTTTAFNSLTAAAANSMVNTLLANNVTTRPAASIPDHYSLRWNNHQNHLMRAFDALLQSKTLVDVTLVCAETSIRAHKMVLSACSPFFQRVFADTPCKHPVIVLKDFRGWVVQAIIDFMYRGEISVPQEKLHTLIDAGESLQVRGLVESSIPEHTPTPAASPDDFGMLETSLLSSELERCSSFDEESPSMVTSGSKIILPSRLFGSSYRREKEQRKRERERDLEHELESDQEMLNDSLSHNDLCTSPMPRRKQARPRRRSGELTHDMISKPTTPIPQDHNNEQPQDLCNKDSSITENKSSTPQPQTEALSEDPLKTVIKSELTEPLEDDEEQVGGRDSGNDSHEQPEDLIVNDKSLDNNSNFDNNNSLPEPLNSTEEMEEEDRVHEEEPEAEEEEEEDIEQLIHHTNELNRRRKSALSMNSDGPEDLCTTKKDNDTSGHHSASDNEDCSNNNNSTKLNNNNRIVLSLKDIRQLNKPSSSSTPNSSQNSMNPFSSASLRDLRLDRERAMETQCKMEVLEAQMQAAAEAAAAAAARGENPFQHMEHQMDLSLAAAAAAAAVSQQHSRERDQALQREHREMQQHSAYANSILGQMGMPGMPPFGPNPGAPSGPGGPSAHERLEESMNRLSKEFTPTSPMSLPPHYNPQDGPPHPPSPLPFPGMSSLALTPPHMFGLDSPLGLFPPGIDPGKLYNPLMEMSDPRGMPGDAPPFLKKKSKYLPRPKGQHSAPRGGPPRSWTNAELTEALQHVWNKKMTTSQASRIFGIPYNSLLMYVRGKYGKSLKLEQLRKDCISGPPLEMLQMGISSGSGKNSGESKEQREARKEAQRNAAQHPPGDMEMGGPLTGPGPRPSSSEPDLLGGPNALFNPFNPQGFYPDFPGGFPGLPLSMLNLLPPSERQHAAAAAAAMHHSMGLDEDCKSDRSKQSSNMDDEYSLSREERRHSDLMNNSSTPSSASQQNGSIQD
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00922211; iTF_00045442; iTF_00258788; iTF_00045285; iTF_00259695; iTF_00259001; iTF_00259870; iTF_01174146; iTF_01174369; iTF_01236578; iTF_01237574; iTF_01237780; iTF_01236828; iTF_01313097; iTF_01312935; iTF_00892651; iTF_00892874; iTF_01376557; iTF_01376742; iTF_01315598; iTF_01315429; iTF_01374190; iTF_01373986; iTF_00655409; iTF_00655245; iTF_00716561; iTF_00716842; iTF_00997842; iTF_00997645; iTF_01201523; iTF_01201734; iTF_01399379; iTF_01399591; iTF_00199988; iTF_00199791; iTF_00435444; iTF_00435695; iTF_00899734; iTF_00899461; iTF_00921366; iTF_00921506; iTF_01427574; iTF_01427397; iTF_00350103; iTF_00350275; iTF_01074392; iTF_01074604; iTF_01261113; iTF_01260917; iTF_01235643; iTF_01235859; iTF_01398620; iTF_01398407; iTF_00900733; iTF_00900504; iTF_00742015; iTF_00741773; iTF_01137919; iTF_01138156; iTF_00975520; iTF_00975720; iTF_01165632; iTF_01165457; iTF_01259136; iTF_01259305; iTF_01313728; iTF_01313913; iTF_01397344; iTF_01397580;
- 90% Identity
- iTF_00921366;
- 80% Identity
- iTF_00922211;