Basic Information

Gene Symbol
-
Assembly
GCA_035578135.1
Location
JAQJVK010000026.1:4333692-4337267[+]
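
The Location field packs the contig accession, start, end, and strand into one string. A minimal sketch of how it might be unpacked, assuming the coordinates are 1-based and inclusive (the record does not state this); `Location` and `parse_location` are hypothetical names introduced only for illustration:

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "contig:start-end[strand]".
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("JAQJVK010000026.1:4333692-4337267[+]")
# Genomic span, ASSUMING 1-based inclusive coordinates.
span = loc.end - loc.start + 1
print(loc.contig, loc.start, loc.end, loc.strand, span)
```

The span is the genomic footprint of the locus, so it can be longer than the coding sequence if the gene model contains introns.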

Transcription Factor Domain

TF Family
HTH
Domain
HTH_psq domain
PFAM
PF05225
TF Group
Helix-turn-helix
Description
This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster [1]. In pipsqueak, this domain binds GAGA sequences [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 19 0.38 4.1e+02 3.1 0.0 1 17 64 80 64 86 0.91
2 19 0.38 4.1e+02 3.1 0.0 1 17 98 114 98 120 0.91
3 19 0.38 4.1e+02 3.1 0.0 1 17 132 148 132 154 0.91
4 19 0.38 4.1e+02 3.1 0.0 1 17 166 182 166 188 0.91
5 19 0.38 4.1e+02 3.1 0.0 1 17 200 216 200 222 0.91
6 19 0.38 4.1e+02 3.1 0.0 1 17 234 250 234 256 0.91
7 19 0.83 9e+02 2.0 0.0 1 16 268 283 268 290 0.85
8 19 0.67 7.2e+02 2.3 0.0 1 17 302 318 302 324 0.91
9 19 0.38 4.1e+02 3.1 0.0 1 17 336 352 336 358 0.91
10 19 0.38 4.1e+02 3.1 0.0 1 17 370 386 370 392 0.91
11 19 0.38 4.1e+02 3.1 0.0 1 17 404 420 404 426 0.91
12 19 0.38 4.1e+02 3.1 0.0 1 17 438 454 438 460 0.91
13 19 0.38 4.1e+02 3.1 0.0 1 17 472 488 472 494 0.91
14 19 0.38 4.1e+02 3.1 0.0 1 17 506 522 506 528 0.91
15 19 0.38 4.1e+02 3.1 0.0 1 17 540 556 540 562 0.91
16 19 0.38 4.1e+02 3.1 0.0 1 17 613 629 613 635 0.91
17 19 0.38 4.1e+02 3.1 0.0 1 17 647 663 647 669 0.91
18 19 0.38 4.1e+02 3.1 0.0 1 17 681 697 681 703 0.91
19 19 0.38 4.1e+02 3.1 0.0 1 17 715 731 715 737 0.91
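
The rows above are whitespace-separated with 13 values each, so they can be read into typed records. The following is a rough sketch rather than a parser for full hmmscan output; the field names in `DomainHit` are hypothetical labels chosen to mirror the column header, and only two rows from the table are used as sample input:

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int       # "#": hit number
    total: int       # "of": total number of hits
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# One converter per column, in table order.
_CASTS = (int, int, float, float, float, float,
          int, int, int, int, int, int, float)


def parse_hit(line: str) -> DomainHit:
    """Convert one whitespace-separated table row into a DomainHit."""
    fields = line.split()
    if len(fields) != len(_CASTS):
        raise ValueError(f"expected {len(_CASTS)} columns, got {len(fields)}")
    return DomainHit(*(cast(value) for cast, value in zip(_CASTS, fields)))


rows = [
    "1 19 0.38 4.1e+02 3.1 0.0 1 17 64 80 64 86 0.91",
    "7 19 0.83 9e+02 2.0 0.0 1 16 268 283 268 290 0.85",
]
hits = [parse_hit(row) for row in rows]
for hit in hits:
    print(hit.index, hit.ali_from, hit.ali_to, hit.i_evalue)
```

Read this way, every hit in the table covers only HMM positions 1 to 16 or 17 and has an i-Evalue well above 1, which suggests weak, partial matches. Most hits start 34 residues after the previous one, in step with the repeated unit visible in the protein sequence.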

Sequence Information

Coding Sequence
ATGCTTTTGATTAAACGGGGAAGGCATCATCAAGGTGGCAACCAGCTTCTCAAGCAACCCAAGAACTTACCTGATTATCCTTTATCTTCGTCATTAAAGGACAGATTATCCTTCATCTCTGTCCAGGTGGGATCCTTGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCCCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCGCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCCCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCGCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCGCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCCCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCGCTGACCAATGTCCAGGTGGGATCCTTGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCGCTGACCAATATCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCGCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCGCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCGCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCGCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCGCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCCCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCCCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCGCTGACCAATGTCCAGGTACTACGTCAGCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCGCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCGCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGAC
CCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCGCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCCCTGACCAATGTCCAGGTGGGATCCATGCTGCCGCTGACCCAGGCCCTGCTGAGTCTGCAGGAGCAGAAACCCCCCGTCAGCCCGGAGGAGCTGCACCTAGCGCTGACCAATGTCCAGGTTCTACGTCAGCACGTGGCGTGCGCTGTGCCGTGGTTGCTACACTGA
Protein Sequence
MLLIKRGRHHQGGNQLLKQPKNLPDYPLSSSLKDRLSFISVQVGSLLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSLLPLTQALLSLQEQKPPVSPEELHLALTNIQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVLRQQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVGSMLPLTQALLSLQEQKPPVSPEELHLALTNVQVLRQHVACAVPWLLH
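
The coding and protein sequences can be cross-checked by translating one into the other. A minimal sketch, assuming the standard genetic code applies to this organism (the record does not say which table was used); `translate` is a hypothetical helper, and it strips whitespace first so that a sequence wrapped over several lines is handled:

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, dropping one terminal stop codon if present."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    # Raises KeyError on ambiguous bases such as N.
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 12 codons and last 11 codons of the coding sequence in this record.
head = translate("ATGCTTTTGATTAAACGGGGAAGGCATCATCAAGGT")
tail = translate("GTGGCGTGCGCTGTGCCGTGGTTGCTACACTGA")
print(head, tail)
```

Both fragments agree with the start and end of the protein sequence listed in this record; the same call on the full coding sequence would check the whole entry.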

Similar Transcription Factors

Sequences clustered by similarity using MMseqs2

100% Identity
iTF_00727253;
90% Identity
iTF_00727253;
80% Identity
-