Basic Information

Gene Symbol
-
Assembly
GCA_029891345.1
Location
CM057002.1:40269963-40271918[-]
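The location string above packs a sequence accession, a coordinate range, and a strand into one field. A minimal sketch of how it could be split apart in Python follows; the names `Location`, `parse_location`, and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # sequence accession, e.g. "CM057002.1"
    start: int   # smaller coordinate
    end: int     # larger coordinate
    strand: str  # "+" or "-"


# Pattern for strings shaped like "ACCESSION:START-END[STRAND]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into its accession, range, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CM057002.1:40269963-40271918[-]")
print(loc.seqid, loc.strand, span_length(loc))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.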

Transcription Factor Domain

TF Family
HTH
Domain
HTH_psq domain
PFAM
PF05225
TF Group
Helix-turn-helix
Description
This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster [1]. In pipsqueak, this domain binds the GAGA sequence [1].
Hmmscan Out
 #  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to   acc
 1  13      0.14   5.8e+02    3.8   0.0        18      40        51      74        50      78  0.90
 2  13     0.002       8.7    9.7   0.0        18      40        97     120        96     124  0.90
 3  13     0.094     4e+02    4.3   0.0        18      40       143     166       142     170  0.91
 4  13      0.45   1.9e+03    2.2   0.0        22      40       193     212       190     216  0.80
 5  13         2   8.8e+03    0.0   0.1        18      39       235     257       234     261  0.78
 6  13    0.0049        21    8.4   0.0        18      40       265     288       264     292  0.91
 7  13     0.042   1.8e+02    5.4   0.0        18      40       311     334       310     338  0.91
 8  13    0.0049        21    8.4   0.0        18      40       371     394       370     398  0.91
 9  13     0.094     4e+02    4.3   0.0        18      40       417     440       416     444  0.91
10  13    0.0013       5.6   10.3   0.0        18      40       463     486       462     490  0.91
11  13     0.025   1.1e+02    6.2   0.0        18      40       509     532       508     536  0.91
12  13     0.019        81    6.6   0.0        19      40       556     578       556     582  0.89
13  13      0.81   3.5e+03    1.3   0.1        28      40       611     624       602     628  0.80
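Each row of the per-domain table above has 13 whitespace-separated fields: hit number, total hits, conditional and independent E-values, score, bias, the HMM, alignment, and envelope coordinates, and the accuracy. As a rough sketch, the rows could be loaded into typed records and filtered in Python as shown here. `DomainHit`, `parse_domain_row`, and the `MAX_I_EVALUE` cutoff of 10.0 are hypothetical and are not part of this record or of HMMER; only four of the thirteen rows are copied in to keep the example short.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int        # "#": number of this domain hit
    total: int        # "of": total number of domain hits
    c_evalue: float   # conditional E-value
    i_evalue: float   # independent E-value
    score: float      # bit score
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# One converter per column, in table order.
_CASTS = (int, int, float, float, float, float,
          int, int, int, int, int, int, float)


def parse_domain_row(line: str) -> DomainHit:
    """Convert one whitespace-separated table row into a DomainHit."""
    fields = line.split()
    if len(fields) != len(_CASTS):
        raise ValueError(f"expected {len(_CASTS)} fields, got {len(fields)}")
    return DomainHit(*(cast(value) for cast, value in zip(_CASTS, fields)))


# Rows 1, 2, 10 and 13, copied from the table above.
ROWS = """\
1 13 0.14 5.8e+02 3.8 0.0 18 40 51 74 50 78 0.90
2 13 0.002 8.7 9.7 0.0 18 40 97 120 96 124 0.90
10 13 0.0013 5.6 10.3 0.0 18 40 463 486 462 490 0.91
13 13 0.81 3.5e+03 1.3 0.1 28 40 611 624 602 628 0.80
"""

hits = [parse_domain_row(line) for line in ROWS.splitlines() if line.strip()]

MAX_I_EVALUE = 10.0  # hypothetical cutoff, chosen only for illustration
kept = [hit for hit in hits if hit.i_evalue <= MAX_I_EVALUE]

for hit in kept:
    print(hit.index, hit.ali_from, hit.ali_to, hit.score)
```

Keeping the converters in one tuple makes the column order explicit, so a row with a missing or extra field fails loudly instead of shifting values into the wrong column.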

Sequence Information

Coding Sequence
ATGCAAACTTGCAGTGTGGCAAAACTCCCTCCACCAGGACGAGCCGGGGTTGCTGAGGACAAAGTCGAGACGATCGTGCAACTTTCCTTTGGAGTCCCAGAAaatccacacgccgtgcatcaaAGGAGCCgcagataccacacacgacaaTCCACATGCCGTGCATCAAAGGAGCCGCAGATACCACACGCGACAATCCACACGCTGTGCATCAAGGGAGCAgcagataccacacacgacaaTCCACACGCTGTGCATCAAGGGAGCAgcagataccacacacgacaatccacacgccgtgcatcaagggagcagcagataccacacacgacaatccacatgccgtgcatcaagggagcagcagataccacacacgacaaTTCACACGCCGTACATCAAGGGAGCCgcagataccacacacgacaatccacacgccgtgcatcaaAGGAGCCGCAGATACCACACGCgacaatccacacgccgtgcatcaagggagccgcagataccacacacgacaatccacacgccgtgcatcaagggagccgcagataccacacacgacaaTTCACACGCCCTGCATCAAGAGAGCCgcagataccacacacgacaatccacacgccgtgcatcaagAGAGCCgcagataccacacacgacaatccacacgccgtgcatcaagggagctgcagataccacacacgacaaTCCACACGCCGTACATCAAGGGAGCCGCAGATACCAACACCGACAATCCACAACGCCGTGCATCAAGGGAGCAGCAGATACCACACacaatccacacgccgtgcatcaagggaGCCGCAGATACCACATACGACAATTcacacgccgtgcatcaagggagccgcagataccacacacgacaaTTCACACGCCGTGCATCAAAGGAGCCgcagataccacacacgacaatccacacgccgtgcatcaagggaGCCGCAGATACCACACGCgacaatccacacgccgtgcatcaagggagccgcagataccacacacgacaaTTCACACGCCCTGCATCAAggagctgcagataccacacacgacaatccacacgccgtgcatcaagggaGCAGCAGATACCACACacaatccacacgccgtgcatcaagggaGCCGCAGATACCACATACGACAATTCACACGCCCTGCATCAAGGGAGCCgcagataccacacacgacaaTTCACACGCCGTGCATCAAATGAGCCgcagataccacacacgacaatccacacgccgtgcatcaaAGGAGCCGCAGATACCACACGCgacaatccacacgccgtgcatcaagggagctgcagataccacacacgacaaTCCACACGCCCTGCATCAAGGGAGCCgcagataccacacacgacaatccacacgccgtgcatcaagggagctgcagataccacacacgacaaTTCACACGCCCTGCATCAAGGGAGCCgcagataccacacacgacaaTTCACACGCCCTGCATCAAGGGACCCgcagataccacacacgacaatccacacgccgtgcatcaagggaGCCGCAGATACCACAGACGACAATTcacacgccgtgcatcaagggaGCAGCAGATACCACATACGACAATTcacacgccgtgcatcaagggaGCCGCAGATACCACATACGACAATTcacacgccgtgcatcaagggagcagcagataccacacacgacaaTCCACACGCCCTGCATCAAGGGAGCCgcagataccacacacgacaaTCCACACGCCCTGCATCAAGGGAGCAgcagataccacacacgacaaTTCACACGCCGTACATCAAGGGAGCCACAGATACCACATACGACAATTcacacgccgtgcatcaagggagcagcagataccacacacgacaaTCCACACGCCCTGCATCAAGGGAGCAGCAGATACCACATACGACAATTCTCACGCAGTGCATAA
Protein Sequence
MQTCSVAKLPPPGRAGVAEDKVETIVQLSFGVPENPHAVHQRSRRYHTRQSTCRASKEPQIPHATIHTLCIKGAADTTHDNPHAVHQGSSRYHTRQSTRRASREQQIPHTTIHMPCIKGAADTTHDNSHAVHQGSRRYHTRQSTRRASKEPQIPHATIHTPCIKGAADTTHDNPHAVHQGSRRYHTRQFTRPASREPQIPHTTIHTPCIKRAADTTHDNPHAVHQGSCRYHTRQSTRRTSREPQIPTPTIHNAVHQGSSRYHTQSTRRASREPQIPHTTIHTPCIKGAADTTHDNSHAVHQRSRRYHTRQSTRRASREPQIPHATIHTPCIKGAADTTHDNSHALHQGAADTTHDNPHAVHQGSSRYHTQSTRRASREPQIPHTTIHTPCIKGAADTTHDNSHAVHQMSRRYHTRQSTRRASKEPQIPHATIHTPCIKGAADTTHDNPHALHQGSRRYHTRQSTRRASRELQIPHTTIHTPCIKGAADTTHDNSHALHQGTRRYHTRQSTRRASREPQIPQTTIHTPCIKGAADTTYDNSHAVHQGSRRYHIRQFTRRASREQQIPHTTIHTPCIKGAADTTHDNPHALHQGSSRYHTRQFTRRTSREPQIPHTTIHTPCIKGAADTTHDNPHALHQGSSRYHIRQFSRSA

Similar Transcription Factors

Clustering by sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-