Basic Information

Gene Symbol
-
Assembly
GCA_951394065.1
Location
OX596037.1:13661695-13669973[-]
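The location string above packs a sequence accession, a coordinate range, and a strand into one field. The sketch below shows one way to split it apart. It assumes the coordinates are 1-based and inclusive and that `[-]` marks the reverse strand; the record itself does not state either convention. `Location`, `parse_location`, and `span_length` are hypothetical helper names, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # sequence accession, e.g. "OX596037.1"
    start: int   # lower coordinate as written in the record
    end: int     # upper coordinate as written in the record
    strand: str  # "+" or "-"


# Pattern for strings of the form "<seqid>:<start>-<end>[<strand>]".
_LOC_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, coordinates, and strand."""
    m = _LOC_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m["seqid"], int(m["start"]), int(m["end"]), m["strand"])


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("OX596037.1:13661695-13669973[-]")
print(loc, span_length(loc))
```

Under the inclusive-coordinate assumption the locus spans 8279 bp. This is the genomic span and may include introns, so it need not equal the coding sequence length.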

Transcription Factor Domain

TF Family
HTH
Domain
HTH_psq domain
PFAM
PF05225
TF Group
Helix-turn-helix
Description
This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster [1]. In pipsqueak, this domain binds the GAGA sequence [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  6   4e-08     2.5e-05   24.6   0.0   2         40      17        56      16        60      0.87
2  6   2.5e-07   0.00016   22.0   0.0   2         40      150       189     149       193     0.87
3  6   4e-08     2.5e-05   24.6   0.0   2         40      283       322     282       326     0.87
4  6   2.5e-07   0.00016   22.0   0.0   2         40      416       455     415       459     0.87
5  6   2.5e-07   0.00016   22.0   0.0   2         40      549       588     548       592     0.87
6  6   2.5e-07   0.00016   22.0   0.0   2         40      682       721     681       725     0.87
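The domain rows above can be read programmatically to check how the six HTH_psq hits are spaced along the protein. The following is a minimal sketch, assuming each row has the 13 whitespace-separated fields listed in the header; `DomainHit` and `parse_row` are hypothetical names introduced for illustration and are not part of HMMER.

```python
from typing import NamedTuple

# Domain rows copied from the Hmmscan Out table of this record.
ROWS = """\
1 6 4e-08 2.5e-05 24.6 0.0 2 40 17 56 16 60 0.87
2 6 2.5e-07 0.00016 22.0 0.0 2 40 150 189 149 193 0.87
3 6 4e-08 2.5e-05 24.6 0.0 2 40 283 322 282 326 0.87
4 6 2.5e-07 0.00016 22.0 0.0 2 40 416 455 415 459 0.87
5 6 2.5e-07 0.00016 22.0 0.0 2 40 549 588 548 592 0.87
6 6 2.5e-07 0.00016 22.0 0.0 2 40 682 721 681 725 0.87
"""


class DomainHit(NamedTuple):
    number: int      # "#": rank of this domain hit
    total: int       # "of": total number of domain hits
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float     # bit score
    bias: float      # bias correction
    hmm_from: int    # start position on the profile HMM
    hmm_to: int      # end position on the profile HMM
    ali_from: int    # alignment start on the protein
    ali_to: int      # alignment end on the protein
    env_from: int    # envelope start on the protein
    env_to: int      # envelope end on the protein
    acc: float       # mean posterior accuracy of the alignment


def parse_row(line: str) -> DomainHit:
    """Convert one whitespace-separated table row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


hits = [parse_row(line) for line in ROWS.splitlines() if line.strip()]

# Distance between the alignment starts of consecutive domain hits.
spacing = [b.ali_from - a.ali_from for a, b in zip(hits, hits[1:])]
print(spacing)
```

For this record every consecutive pair of hits is 133 residues apart, which matches the visibly repeated blocks in the protein sequence below.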

Sequence Information

Coding Sequence
ATGCCGCGAAAATACGAGAAAAAGAATACTAGATGTTCTGATATAGATGAAGATGCGAAGACAGAAGCTATAAAAGAGGTCGTAAACAAAAAGTTGTCTATCAGAAAATCcgctgaaaaatataatatcaaaccaTCAACTCTTATGAGTCGCTTGGTGAAGTCTCGAGAAGATGGAAAAGAAGACCACATTCCTGCACGAGCATTCTGCAACAAGTTTGCCACCAAACAAGTATTTTCAAAGGATGAAGAGGCTCTTTTAGCCAAGTATATCAGTGACTGCTCAAAAATGCACTATGGACTAACACTTGTACAGGAGGATAAACGCGCGGGAGGGCTTCCGTCCCGCTGCTTCAGAGCCTCCAACCCGCCGCTTCAGAGCCTCCAACCCGCCGTAATCATGCCGCGAAAATACGAGAAAAAGAATACTAGATGTTCTGATATAGATGAAGATGCGAAGACAGAAGCTATAAAAGAGGTCGTAAACAAAACGTTGTCTATCAGAAAATCcgctgaaaaatataatatcaaaccaTCAACTCTTATGAGTCGCTTGGTGAAGTCTCGAGAAGATGGAAAAGAAGACCACATTCCTGCACGAGCATTCTGCAACAAGTTTGCCACCAAACAAGTATTTTCAAAGGATGAAGAGGCTCTTTTAGCCAAGTATATCAGTGACTGCTCAAAAATGCACTATGGACTAACACTTGTACAGGAGGATAAACGCGCGGGAGGGCTTCCGTCCCGCTGCTTCAGAGCCTCCAACCCGCCGCTTCAGAGCCTCCAACCCGCCGTAATCATGCCGCGAAAATACGAGAAAAAGAATACTAGATGTTCTGATATAGATGAAGATGCGAAGACAGAAGCTATAAAAGAGGTCGTAAACAAAAAGTTGTCTATCAGAAAATCcgctgaaaaatataatatcaaaccaTCAACTCTTATGAGTCGCTTGGTGAAGTCTCGAGAAGATGGAAAAGAAGACCACATTCCTGCACGAGCATTCTGCAACAAGTTTGCCACCAAACAAGTATTTTCAAAGGATGAAGAGGCTCTTTTAGCCAAGTATATCAGTGACTGCTCAAAAATGCACTATGGACTAACACTTGTACAGGAGGATAAACGCGCGGGAGGGCTTCCGTCCCGCTGCTTCAGAGCCTCCAACCCGCCGCTTCAGAGCCTCCAACCCGCCGTAATCATGCCGCGAAAATACGAGAAAAAGAATACTAGATGTTCTGATATAGATGAAGATGCGAAGACAGAAGCTATAAAAGAGGTCGTAAACAAAACGTTGTCTATCAGAAAATCcgctgaaaaatataatatcaaaccaTCAACTCTTATGAGTCGCTTGGTGAAGTCTCGAGAAGATGGAAAAGAAGACCACATTCCTGCACGAGCATTCTGCAACAAGTTTGCCACCAAACAAGTATTTTCAAAGGATGAAGAGGCTCTTTTAGCCAAGTATATCAGTGACTGCTCAAAAATGCACTATGGACTAACACTTGTACAGGAGGATAAACGCGCGGGAGGGCTTCCGTCCCGCTGCTTCAGAGCCTCCAACCCGCCGCTTCAGAGCCTCCAACCCGCCGTAATCATGCCGCGAAAATACGAGAAAAAGAATACTAGATGTTCTGATATAGATGAAGATGCGAAGACAGAAGCTATAAAAGAGGTCGTAAACAAAACGTTGTCTATCAGAAAATCcgctgaaaaatataatatcaaaccaTCAACTCTTATGAGTCGCTTGGTGAAGTCTCGAGAAGATGGAAAAGAAGACCACATTCCTGCACGAGCATTCTGCAACAAGTTTGCCACCAAACAAGTATTTTCAAAGGATGAAGAGGCTCTTTTAGCCAAGTATATCAGTGACTGCTCAAAAATGCACTATGGACTAACACTTGTACAGGAGGATAAACGCGCGGGAGGGCTTCCGTCCCGCTGCTTCAGAGCCTCCAACCCGCCGCTTCAGAGCCTCCAACCCGCCGTAATCATGCC
GCGAAAATACGAGAAAAAGAATACTAGATGTTCTGATATAGATGAAGATGCGAAGACAGAAGCTATAAAAGAGGTCGTAAACAAAACGTTGTCTATCAGAAAATCcgctgaaaaatataatatcaaaccaTCAACTCTTATGAGTCGCTTGGTGAAGTCTCGAGAAGATGGAAAAGAAGACCACATTCCTGCACGAGCATTCTGCAACAAGTTTGCCACCAAACAAGTATTTTCAAAGGATGAAGAGGCTCTTTTAGCCAAGTATATCAGTGACTGCTCAAAAATGCACTATGGACTAACACTTGTACAGGAGGATAAACGCGCGGGAGGGCTTCCGTCCCGCTGCTTCAGAGCCTCCAACCCGCCGCTTCAGAGCCTCCAACCCGCCGTGTCAGTTGCTCCGCGACACCAGATAGGGTAG
Protein Sequence
MPRKYEKKNTRCSDIDEDAKTEAIKEVVNKKLSIRKSAEKYNIKPSTLMSRLVKSREDGKEDHIPARAFCNKFATKQVFSKDEEALLAKYISDCSKMHYGLTLVQEDKRAGGLPSRCFRASNPPLQSLQPAVIMPRKYEKKNTRCSDIDEDAKTEAIKEVVNKTLSIRKSAEKYNIKPSTLMSRLVKSREDGKEDHIPARAFCNKFATKQVFSKDEEALLAKYISDCSKMHYGLTLVQEDKRAGGLPSRCFRASNPPLQSLQPAVIMPRKYEKKNTRCSDIDEDAKTEAIKEVVNKKLSIRKSAEKYNIKPSTLMSRLVKSREDGKEDHIPARAFCNKFATKQVFSKDEEALLAKYISDCSKMHYGLTLVQEDKRAGGLPSRCFRASNPPLQSLQPAVIMPRKYEKKNTRCSDIDEDAKTEAIKEVVNKTLSIRKSAEKYNIKPSTLMSRLVKSREDGKEDHIPARAFCNKFATKQVFSKDEEALLAKYISDCSKMHYGLTLVQEDKRAGGLPSRCFRASNPPLQSLQPAVIMPRKYEKKNTRCSDIDEDAKTEAIKEVVNKTLSIRKSAEKYNIKPSTLMSRLVKSREDGKEDHIPARAFCNKFATKQVFSKDEEALLAKYISDCSKMHYGLTLVQEDKRAGGLPSRCFRASNPPLQSLQPAVIMPRKYEKKNTRCSDIDEDAKTEAIKEVVNKTLSIRKSAEKYNIKPSTLMSRLVKSREDGKEDHIPARAFCNKFATKQVFSKDEEALLAKYISDCSKMHYGLTLVQEDKRAGGLPSRCFRASNPPLQSLQPAVSVAPRHQIG

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-