Cpag018832.1
Basic Information
- Insect
- Cheilosia pagana
- Gene Symbol
- br
- Assembly
- GCA_936435595.1
- Location
- CAKZFE010000089.1:1189986-1191439[+]
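
The Location field above packs the contig, coordinates, and strand into a single string. A minimal sketch of splitting it into its parts follows. The `Location` type and `parse_location` function are hypothetical names introduced for illustration, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for one parsed location string."""
    contig: str
    start: int
    end: int
    strand: str


# Matches strings shaped like "contig:start-end[strand]".
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(
        match["contig"],
        int(match["start"]),
        int(match["end"]),
        match["strand"],
    )


loc = parse_location("CAKZFE010000089.1:1189986-1191439[+]")

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases on the contig.
span = loc.end - loc.start + 1
print(loc, span)
```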
Transcription Factor Domain
- TF Family
- HTH
- Domain
- HTH_psq domain
- PFAM
- PF05225
- TF Group
- Helix-turn-helix
- Description
- This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster [1]. In pipsqueak, this domain binds to the GAGA sequence [1].
- Hmmscan Out
-
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  3   1.6e-14   1.5e-11   44.1   0.0   3         43      287       327     285       329     0.91
2  3   0.014     13        5.8    0.0   27        37      358       368     349       373     0.87
3  3   1.3e-17   1.2e-14   54.0   0.0   3         42      396       435     394       437     0.93
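
Each Hmmscan Out row above carries thirteen whitespace-separated values: the domain number and domain count, conditional and independent E-values, score, bias, the HMM, alignment, and envelope coordinates, and the accuracy. A minimal sketch of reading those rows into typed records follows. `DomainHit`, `parse_domain_row`, and the `I_EVALUE_CUTOFF` threshold are hypothetical and chosen only for illustration; the record does not state which cutoff was used.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record for one per-domain row of the hmmscan output."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(row: str) -> DomainHit:
    """Convert one whitespace-separated row into a DomainHit."""
    fields = row.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {row!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        int(fields[6]), int(fields[7]),
        int(fields[8]), int(fields[9]),
        int(fields[10]), int(fields[11]),
        float(fields[12]),
    )


# Rows copied from the Hmmscan Out field of this record.
ROWS = [
    "1 3 1.6e-14 1.5e-11 44.1 0.0 3 43 287 327 285 329 0.91",
    "2 3 0.014 13 5.8 0.0 27 37 358 368 349 373 0.87",
    "3 3 1.3e-17 1.2e-14 54.0 0.0 3 42 396 435 394 437 0.93",
]

hits: List[DomainHit] = [parse_domain_row(row) for row in ROWS]

# Assumption: an illustrative independent E-value cutoff, not one
# taken from the record.
I_EVALUE_CUTOFF = 1e-5
confident = [hit for hit in hits if hit.i_evalue <= I_EVALUE_CUTOFF]

for hit in confident:
    print(hit.index, hit.ali_from, hit.ali_to, hit.score)
```

With this illustrative cutoff, the first and third domains pass and the second (i-Evalue 13) does not.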
Sequence Information
- Coding Sequence
- atgaaccACCTCAAATGGATGGGCCATACGGCAACCATCCTTGATATCCAAAAGACTCTCAAAAATGATCCTGTCACCTGTGATATTACCCTTTCATGTCGGGGTAAATCGGTGAAAGCACATCGCTTTATACTATCTTCATGTAGTGACCTACTGCGTGATATTCTAAGTGATGTTCCCGTGGGACAGGAGGCCACAATAATAGTGCCAGATGTGAAAGGCAATCTGTTGGACAGTGTATTGGGCTTTGTGTACATGGGGGAGACAAGTCTATCATCAACAAACTTGTCTGAATTTCTCGAAGCCATTAGCATATTAGGCATCAAAAGCGCTATTAGCTTCGAATGTGCCAACAGTAGTTCGGCAAATATCAAAAATGTTGCCGAAATGACAGAAATGGAAGAGGAACTAATGGAAGCCGAACCGGAGCAGGAAATTGTCGAAAAGGACGAAGAAGAGGATCAAACTGTTGCTGAAAATAGCAGCGTCCGCGAACTGGAATTTCTTGAAGTCTATAACGACCAAGAGAAAATTAGCTACACCATCGAAAACATCATACCGACCAATCCCAACGAATATATTCTTACCGAGAGTTCTGGCACCTTCACACTAACGCCAAACGCAAAAATCGAGGGCAGTGAAGCCGGAGCTCTTGTCGAGAAACAAGATGAAGATAGCCCATTGCTGGAACAATATGTGGGCGACCCAATAGGTGATAGCAGTCAGTGTTCAAGTACAGATatgaaagaagaaaagtgcTGCAAAAAACTTAAAGACGAGTTTAGTGAATTTCAAGCGCTTAGTCCGGAGGACAGCAAAGGATTTCAGAACTTTAATGAAACGCAATCGGCCCGCGTCGAAGCACTAGAAAATGCAGTTTTAGCTGTTGTCGATGAGGGAATGAGTCTACAAAAGGCCGCCATCAAGTATAACATTTCTAAGACGGTCCTTTGGCGTCGCGTAAAAAAGCATCCGCTCTACATGAAGACGACACGAGAGAATCCCGTTATAACAGCTGCCTGTGAACGTCTCAAAAGTGGCGACTCCCTTAAAAGTATTAGCCAAGCATTGGATATTCCCATGTCAACTTTGCATCGGCACAAGGTGAGGCTGGCACAGGAAGGGCGACTACCAGAATATGTTACTTTTAAGAAACGTGGTCCTATGTCAAAGGAAATGCTAAAGGAGAAGCTCACAAAAGCTGTGCAAGCATGCCTAGGCAATGGCATGTCGCAGAATCACGCAGCAAATGTGTTCGATGTGCCAAAGAGCACATTGTGGCGacatttacaaaagaaaatgagCCGCACGGCAGCTGACAGCGATATTAACATTAAAGAAGAAGTAGTTTTACCGTAG
- Protein Sequence
- MNHLKWMGHTATILDIQKTLKNDPVTCDITLSCRGKSVKAHRFILSSCSDLLRDILSDVPVGQEATIIVPDVKGNLLDSVLGFVYMGETSLSSTNLSEFLEAISILGIKSAISFECANSSSANIKNVAEMTEMEEELMEAEPEQEIVEKDEEEDQTVAENSSVRELEFLEVYNDQEKISYTIENIIPTNPNEYILTESSGTFTLTPNAKIEGSEAGALVEKQDEDSPLLEQYVGDPIGDSSQCSSTDMKEEKCCKKLKDEFSEFQALSPEDSKGFQNFNETQSARVEALENAVLAVVDEGMSLQKAAIKYNISKTVLWRRVKKHPLYMKTTRENPVITAACERLKSGDSLKSISQALDIPMSTLHRHKVRLAQEGRLPEYVTFKKRGPMSKEMLKEKLTKAVQACLGNGMSQNHAANVFDVPKSTLWRHLQKKMSRTAADSDINIKEEVVLP
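
The Protein Sequence above can be checked against the Coding Sequence by translating codon by codon. A minimal sketch follows, assuming the standard genetic code; the record does not state which translation table was used. `translate` is a hypothetical helper, and lowercase bases are treated the same as uppercase ones.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = cds.upper()
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        # Codons with ambiguous bases are reported as "X".
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the Coding Sequence field of this record.
prefix = "atgaaccACCTCAAATGGATGGGCCATACG"
print(translate(prefix))  # MNHLKWMGHT
```

The translated prefix matches the first ten residues of the Protein Sequence field.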
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00311982; iTF_00312802; iTF_00313641; iTF_00313368; iTF_00312507; iTF_01521797; iTF_01522467; iTF_01521997; iTF_01520878; iTF_01521233; iTF_01522778; iTF_00672321; iTF_00311201; iTF_00671689; iTF_00671058; iTF_00672146; iTF_00670872; iTF_00671520; iTF_00316154; iTF_00315858; iTF_01395937; iTF_00672774; iTF_00310418; iTF_00672949; iTF_00310168; iTF_01395772; iTF_01542128; iTF_01542344; iTF_00240924; iTF_00240583; iTF_01301026; iTF_01299926; iTF_01300186; iTF_01300763; iTF_00688521; iTF_00688813; iTF_00991672; iTF_00991936; iTF_00187979; iTF_01212135; iTF_01211831; iTF_00188348; iTF_00427043; iTF_01357185; iTF_00427428; iTF_01396682; iTF_00665008; iTF_00426260; iTF_00974900; iTF_00426521; iTF_01396409; iTF_00664763; iTF_00974557; iTF_01356765; iTF_00664236; iTF_00663977; iTF_00893774; iTF_00894113; iTF_01116561; iTF_01116363; iTF_00665554; iTF_00665862; iTF_00335222; iTF_00334897; iTF_00976669; iTF_00976374; iTF_00724687; iTF_00724934; iTF_00984399; iTF_00984143; iTF_01318152; iTF_00694507; iTF_00693683; iTF_00694835; iTF_01318394; iTF_00693980; iTF_01254095; iTF_01253443; iTF_00314519; iTF_00314184; iTF_01044634; iTF_01044851; iTF_01541562; iTF_01541359; iTF_00315069; iTF_00310945; iTF_00315340; iTF_00389534; iTF_00389772;
- 90% Identity
- iTF_00316154;
- 80% Identity
- iTF_00311982;