Hsal040930.1
Basic Information
- Insect
- Hedya salicella
- Gene Symbol
- -
- Assembly
- GCA_905404275.1
- Location
- FR990118.1:318761-331058[+]
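The Location field packs the sequence accession, coordinates, and strand into one string. A minimal sketch of splitting it apart follows; the function name `parse_location` and the regular expression are illustrative and not part of the database, and the span calculation assumes 1-based inclusive coordinates, which the record does not state.

```python
import re

# Hypothetical helper: the pattern is inferred from the single example
# "FR990118.1:318761-331058[+]" and may not cover every record.
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)

def parse_location(text):
    """Split 'seqid:start-end[strand]' into its parts."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    return {
        "seqid": match.group("seqid"),
        "start": start,
        "end": end,
        "strand": match.group("strand"),
        # Assumption: coordinates are 1-based and inclusive.
        "span": end - start + 1,
    }

loc = parse_location("FR990118.1:318761-331058[+]")
print(loc["seqid"], loc["start"], loc["end"], loc["strand"], loc["span"])
```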
Transcription Factor Domain
- TF Family
- HTH
- Domain
- HTH_psq domain
- PFAM
- PF05225
- TF Group
- Helix-turn-helix
- Description
- This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster [1]. In pipsqueak, this domain binds to the GAGA sequence [1].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  13      0.32   5.3e+02    1.8   0.0        21      37       105     122       102     126  0.78
 2  13      0.34   5.5e+02    1.7   0.0        21      37       170     187       168     191  0.78
 3  13      0.34   5.5e+02    1.7   0.0        21      37       235     252       233     256  0.78
 4  13      0.34   5.5e+02    1.7   0.0        21      37       300     317       298     321  0.78
 5  13      0.34   5.5e+02    1.7   0.0        21      37       382     399       380     403  0.78
 6  13      0.32   5.3e+02    1.8   0.0        21      37       441     458       438     462  0.78
 7  13      0.34   5.5e+02    1.7   0.0        21      37       506     523       504     527  0.78
 8  13      0.34   5.5e+02    1.7   0.0        21      37       571     588       569     592  0.78
 9  13      0.34   5.5e+02    1.7   0.0        21      37       636     653       634     657  0.78
10  13      0.34   5.5e+02    1.7   0.0        21      37       718     735       716     739  0.78
11  13      0.26   4.3e+02    2.1   0.0        21      39       777     796       774     799  0.80
12  13      0.34   5.5e+02    1.7   0.0        21      37       929     946       927     950  0.78
13  13      0.34   5.5e+02    1.7   0.0        21      37       994    1011       992    1015  0.78
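Each per-domain row in the Hmmscan output above carries 13 whitespace-separated values. A minimal sketch of turning one row into a typed record follows; the field names and the `parse_domain_row` helper are illustrative labels chosen here, not names defined by the database or by HMMER.

```python
# Hypothetical field names, in the column order shown in the record.
FIELDS = [
    "index", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_FIELDS = {
    "index", "of", "hmm_from", "hmm_to",
    "ali_from", "ali_to", "env_from", "env_to",
}

def parse_domain_row(row):
    """Convert one whitespace-separated domain row into a dict."""
    values = row.split()
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} fields, got {len(values)}")
    return {
        name: int(value) if name in INT_FIELDS else float(value)
        for name, value in zip(FIELDS, values)
    }

# Row 11 from the record, the only hit whose HMM match extends to position 39.
row = parse_domain_row("11 13 0.26 4.3e+02 2.1 0.0 21 39 777 796 774 799 0.80")
print(row["ali_from"], row["ali_to"], row["i_evalue"])
```

Keeping E-values as floats makes it straightforward to filter hits against whatever significance cutoff a downstream analysis chooses.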
Sequence Information
- Coding Sequence
- ATGGATGCATTTAAAAACTTCCGCATCGTGCATAGCCGCGGGCACTTCGATGATGACAATACAATCTCCGTTTTTCATGAGGAAGCGGGGCAGTACGCTCTGGTGCGGGGCGAGCAGGCGGGGGCGGGGCGAGCGGcggcgcgcctgcgcgcgctgctggcggagctggcggcgctgcgcgcgcgcgtgcgCGAGCCCGACCGCGCCGCCTTCGACGCGCGCAcgcagcgcgcacgcgacctcacgctgcaggccatcatggactacctagGTGGGGGCGGGGCGAGCAgcggcgcagcgcgcacgcgaccTCACGCTGCAGGCCATCATGGACTACCTAGGTCAGGCTCACTACACGCACTGGTGCGGGGCGAGCAGGTGGGGGCGGGGCGAGCAgcggcgcagcgcgcacgcgacctcacgctgcaggccatcatggactacctaggtcaggctcactacacgcactggtgcggggcgagcagcggcgcagcgcgcacgcgaccTCACGCTGCAGGCCATCATGGACTACCTAGGTCAGGCTCACTACACGCACTGGTGCGGGGCGAGCAGGTGGGGGCGGGGCGAGCAgcggcgcagcgcgcacgcgacctcacgctgcaggccatcatggactacctaggtcaggctcactacacgcactggtgcggggcgagcagcggcgcagcgcgcacgcgaccTCACGCTGCAGGCCATCATGGACTACCTAGGTCAGGCTCACTACACGCACTGGTGCGGGGCGAGCAGGTGGGGGCGGGGCGAGCAgcggcgcagcgcgcacgcgaccTCACGCTGCAGGCCATCATGGACTACCTAGGTCAGGCTCACTACACGCACTGGTGCGGGGCGAGCAGcggcgcagcgcgcacgcgaccTCACGCTGCAGGCCATCATGGACTACCTAGGTCAGGCTCACTACACGCACTGGTGCGGGGCGAGCAGGTGGGGGCGGGGCGAGCAgcggcgcagcgcgcacgcgacctcacgctgcaggccatcatggactacctagcggcgcagcgcgcacgcgacctcacgctgcaggccatcatggactacctaggtcaggctcactacacgcactggtgcggggcgagcagcggcgcagcgcgcacgcgaccTCACGCTGCAGGCCATCATGGACTACCTAGGTCAGGCTCACTACACGCACTGGTGCGGGGCGAGCAGGTGGGGGCGGGGCGAGCAgcggcgcagcgcgcacgcgacctcacgctgcaggccatcatggactacctagGTGGGGGCGGGGCGAGCAgcggcgcagcgcgcacgcgaccTCACGCTGCAGGCCATCATGGACTACCTAGGTCAGGCTCACTACACGCACTGGTGCGGGGCGAGCAGGTGGGGGCGGGGCGAGCAgcggcgcagcgcgcacgcgacctcacgctgcaggccatcatggactacctaggtcaggctcactacacgcactggtgcggggcgagcagcggcgcagcgcgcacgcgaccTCACGCTGCAGGCCATCATGGACTACCTAGGTCAGGCTCACTACACGCACTGGTGCGGGGCGAGCAGGTGGGGGCGGGGCGAGCAgcggcgcagcgcgcacgcgacctcacgctgcaggccatcatggactacctaggtcaggctcactacacgcactggtgcggggcgagcagcggcgcagcgcgcacgcgaccTCACGCTGCAGGCCATCATGGACTACCTAGGTCAGGCTCACTACACGCACTGGTGCGGGGCGAGCAGGTGGGGGCGGGGCGAGCAgcggcgcagcgcgcacgcgacctcacgctgcaggccatcatggactacctaggtcaggctcactacacgcactggtgcggggcgagcagcggcgcagcgcgcacgcgaccTCACGCTGCAGGCCATCATGGACTACCTAGGTCAGGCTCACTACACGCACTGGTGCGGGGCGAGCAGGTGGGGGCGGGGCGAGCAgcggcgcag
cgcgcacgcgacctcacgctgcaggccatcatggactacctagcgacgcagcgcgcacgcgacctcacgctgcaggccatcatggactacctaggtcaggctcactacacgcactggtgcggggcgagcagcggcgcagcgcgcacgcgaccTCACGCTGCAGGCCATCATGGACTACCTAGGTCAGGCTCACTACACGCACTGGTGCGGGGCGAGCAGGTGGGGGCGGGGCGAGCAgcggcgcagcgcgcacgcgacctcacgctgcaggccatcatggactacctagGTGGGGGCGGGGCGAGCAgcggcgcagcgcgcacgcgaccTCACGCTGCAGGCCATCATGGACTACCTAGGTCAGGCTCACTACACGCACTGGTGCGGGGCGGGCAGGTGGGGGCGGGGCGAGCAgcggcgcagcgcgcacgcgaccTCACGCTGCAGGCCATCATGGACTACCTAGGTCAGGCTCACTACACGCACTGGTGCGGGGCGAGCAGTGCGGGGCGAGCAgcggcgcagcgcgcacgcgaccTCACGCTGCAGGCCATCATGGACTACCTAGGTCAGGCTCACTACACGCACTGGTGCGGGGCGAGCAGGTGGGGGCGGGGGCGAGCAgcggcgcagcgcgcacgcgacctcacgctgcaggccatcatggcctacctagcggcgcagcgcgcacgcgacctcacgctgcaggccatcatggactacctaggtcaggctcactacacgcactggtgcggggcgagcagcggcgcagcgcgcacgcgaccTCACGCTGCAGGCCATCATGGACTACCTAGGTCAGGCTCACTACACGCACTGGTGCGGGGCGAGCAGGTGGGGGCGGGGCGAGCAgcggcgcagcgcgcacgcgacctcacgctgcaggccatcatggactacctaggtcaggctcactacacgcactggtgcggggcgagcagcggcgcagcgcgcacgcgaccTCACGCTGCGGGCCATCATGGACTACCTAGGTCAGGCTCACTACACGCACTGGTGCGGGGCGAGCAGGTGGGGGCGGGGCGAGCAgcggcgcagcgcgcacgcgaccTCACGCTGCAGGCCGTCATGGACTACCTAGAGTCGTCACCGGTGAACATGCAGCGCGGGGCGGAGCCTGCGTGCGCGGAGGTGGGCAGCGCGGTGTCCACGGAGTCGGTGGGCGTGGTCGCGCATGCGCGTGCGCGCCGCGGCGGCGACGACTGCGACGTGCCCGTGCTGCAGCTGCAGATCGACGACCGGGTGCGTGCAGTCTGA
- Protein Sequence
- MDAFKNFRIVHSRGHFDDDNTISVFHEEAGQYALVRGEQAGAGRAAARLRALLAELAALRARVREPDRAAFDARTQRARDLTLQAIMDYLGGGGASSGAARTRPHAAGHHGLPRSGSLHALVRGEQVGAGRAAAQRARDLTLQAIMDYLGQAHYTHWCGASSGAARTRPHAAGHHGLPRSGSLHALVRGEQVGAGRAAAQRARDLTLQAIMDYLGQAHYTHWCGASSGAARTRPHAAGHHGLPRSGSLHALVRGEQVGAGRAAAQRARDLTLQAIMDYLGQAHYTHWCGASSGAARTRPHAAGHHGLPRSGSLHALVRGEQVGAGRAAAQRARDLTLQAIMDYLAAQRARDLTLQAIMDYLGQAHYTHWCGASSGAARTRPHAAGHHGLPRSGSLHALVRGEQVGAGRAAAQRARDLTLQAIMDYLGGGGASSGAARTRPHAAGHHGLPRSGSLHALVRGEQVGAGRAAAQRARDLTLQAIMDYLGQAHYTHWCGASSGAARTRPHAAGHHGLPRSGSLHALVRGEQVGAGRAAAQRARDLTLQAIMDYLGQAHYTHWCGASSGAARTRPHAAGHHGLPRSGSLHALVRGEQVGAGRAAAQRARDLTLQAIMDYLGQAHYTHWCGASSGAARTRPHAAGHHGLPRSGSLHALVRGEQVGAGRAAAQRARDLTLQAIMDYLATQRARDLTLQAIMDYLGQAHYTHWCGASSGAARTRPHAAGHHGLPRSGSLHALVRGEQVGAGRAAAQRARDLTLQAIMDYLGGGGASSGAARTRPHAAGHHGLPRSGSLHALVRGGQVGAGRAAAQRARDLTLQAIMDYLGQAHYTHWCGASSAGRAAAQRARDLTLQAIMDYLGQAHYTHWCGASRWGRGRAAAQRARDLTLQAIMAYLAAQRARDLTLQAIMDYLGQAHYTHWCGASSGAARTRPHAAGHHGLPRSGSLHALVRGEQVGAGRAAAQRARDLTLQAIMDYLGQAHYTHWCGASSGAARTRPHAAGHHGLPRSGSLHALVRGEQVGAGRAAAQRARDLTLQAVMDYLESSPVNMQRGAEPACAEVGSAVSTESVGVVAHARARRGGDDCDVPVLQLQIDDRVRAV*
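The protein sequence above is the conceptual translation of the coding sequence. A minimal sketch of that relationship follows, assuming the standard genetic code (NCBI translation table 1), which the record does not state explicitly; the `translate` helper is illustrative. Lower-case bases are uppercased before lookup, since the coding sequence mixes cases.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence codon by codon; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First 36 bases of the coding sequence in this record.
prefix = translate("ATGGATGCATTTAAAAACTTCCGCATCGTGCATAGC")
print(prefix)
```

The printed prefix can be compared against the start of the protein sequence listed in the record.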
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -