Daus156855.1
Basic Information
- Insect
- Dryococelus australis
- Gene Symbol
- -
- Assembly
- GCA_029891345.1
- Location
- CM057002.1:40269963-40271918[-]
Transcription Factor Domain
- TF Family
- HTH
- Domain
- HTH_psq domain
- PFAM
- PF05225
- TF Group
- Helix-turn-helix
- Description
- This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster [1]. In pipsqueak this domain binds to GAGA sequence [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 13 0.14 5.8e+02 3.8 0.0 18 40 51 74 50 78 0.90 2 13 0.002 8.7 9.7 0.0 18 40 97 120 96 124 0.90 3 13 0.094 4e+02 4.3 0.0 18 40 143 166 142 170 0.91 4 13 0.45 1.9e+03 2.2 0.0 22 40 193 212 190 216 0.80 5 13 2 8.8e+03 0.0 0.1 18 39 235 257 234 261 0.78 6 13 0.0049 21 8.4 0.0 18 40 265 288 264 292 0.91 7 13 0.042 1.8e+02 5.4 0.0 18 40 311 334 310 338 0.91 8 13 0.0049 21 8.4 0.0 18 40 371 394 370 398 0.91 9 13 0.094 4e+02 4.3 0.0 18 40 417 440 416 444 0.91 10 13 0.0013 5.6 10.3 0.0 18 40 463 486 462 490 0.91 11 13 0.025 1.1e+02 6.2 0.0 18 40 509 532 508 536 0.91 12 13 0.019 81 6.6 0.0 19 40 556 578 556 582 0.89 13 13 0.81 3.5e+03 1.3 0.1 28 40 611 624 602 628 0.80
Sequence Information
- Coding Sequence
- ATGCAAACTTGCAGTGTGGCAAAACTCCCTCCACCAGGACGAGCCGGGGTTGCTGAGGACAAAGTCGAGACGATCGTGCAACTTTCCTTTGGAGTCCCAGAAaatccacacgccgtgcatcaaAGGAGCCgcagataccacacacgacaaTCCACATGCCGTGCATCAAAGGAGCCGCAGATACCACACGCGACAATCCACACGCTGTGCATCAAGGGAGCAgcagataccacacacgacaaTCCACACGCTGTGCATCAAGGGAGCAgcagataccacacacgacaatccacacgccgtgcatcaagggagcagcagataccacacacgacaatccacatgccgtgcatcaagggagcagcagataccacacacgacaaTTCACACGCCGTACATCAAGGGAGCCgcagataccacacacgacaatccacacgccgtgcatcaaAGGAGCCGCAGATACCACACGCgacaatccacacgccgtgcatcaagggagccgcagataccacacacgacaatccacacgccgtgcatcaagggagccgcagataccacacacgacaaTTCACACGCCCTGCATCAAGAGAGCCgcagataccacacacgacaatccacacgccgtgcatcaagAGAGCCgcagataccacacacgacaatccacacgccgtgcatcaagggagctgcagataccacacacgacaaTCCACACGCCGTACATCAAGGGAGCCGCAGATACCAACACCGACAATCCACAACGCCGTGCATCAAGGGAGCAGCAGATACCACACacaatccacacgccgtgcatcaagggaGCCGCAGATACCACATACGACAATTcacacgccgtgcatcaagggagccgcagataccacacacgacaaTTCACACGCCGTGCATCAAAGGAGCCgcagataccacacacgacaatccacacgccgtgcatcaagggaGCCGCAGATACCACACGCgacaatccacacgccgtgcatcaagggagccgcagataccacacacgacaaTTCACACGCCCTGCATCAAggagctgcagataccacacacgacaatccacacgccgtgcatcaagggaGCAGCAGATACCACACacaatccacacgccgtgcatcaagggaGCCGCAGATACCACATACGACAATTCACACGCCCTGCATCAAGGGAGCCgcagataccacacacgacaaTTCACACGCCGTGCATCAAATGAGCCgcagataccacacacgacaatccacacgccgtgcatcaaAGGAGCCGCAGATACCACACGCgacaatccacacgccgtgcatcaagggagctgcagataccacacacgacaaTCCACACGCCCTGCATCAAGGGAGCCgcagataccacacacgacaatccacacgccgtgcatcaagggagctgcagataccacacacgacaaTTCACACGCCCTGCATCAAGGGAGCCgcagataccacacacgacaaTTCACACGCCCTGCATCAAGGGACCCgcagataccacacacgacaatccacacgccgtgcatcaagggaGCCGCAGATACCACAGACGACAATTcacacgccgtgcatcaagggaGCAGCAGATACCACATACGACAATTcacacgccgtgcatcaagggaGCCGCAGATACCACATACGACAATTcacacgccgtgcatcaagggagcagcagataccacacacgacaaTCCACACGCCCTGCATCAAGGGAGCCgcagataccacacacgacaaTCCACACGCCCTGCATCAAGGGAGCAgcagataccacacacgacaaTTCACACGCCGTACATCAAGGGAGCCACAGATACCACATACGACAATTcacacgccgtgcatcaagggagcagcagataccacacacgacaaTCCACACGCCCTGCATCAAGGGAGCAGCAGATACCACATACGACAATTCTCACGCAGTGCATAA
- Protein Sequence
- MQTCSVAKLPPPGRAGVAEDKVETIVQLSFGVPENPHAVHQRSRRYHTRQSTCRASKEPQIPHATIHTLCIKGAADTTHDNPHAVHQGSSRYHTRQSTRRASREQQIPHTTIHMPCIKGAADTTHDNSHAVHQGSRRYHTRQSTRRASKEPQIPHATIHTPCIKGAADTTHDNPHAVHQGSRRYHTRQFTRPASREPQIPHTTIHTPCIKRAADTTHDNPHAVHQGSCRYHTRQSTRRTSREPQIPTPTIHNAVHQGSSRYHTQSTRRASREPQIPHTTIHTPCIKGAADTTHDNSHAVHQRSRRYHTRQSTRRASREPQIPHATIHTPCIKGAADTTHDNSHALHQGAADTTHDNPHAVHQGSSRYHTQSTRRASREPQIPHTTIHTPCIKGAADTTHDNSHAVHQMSRRYHTRQSTRRASKEPQIPHATIHTPCIKGAADTTHDNPHALHQGSRRYHTRQSTRRASRELQIPHTTIHTPCIKGAADTTHDNSHALHQGTRRYHTRQSTRRASREPQIPQTTIHTPCIKGAADTTYDNSHAVHQGSRRYHIRQFTRRASREQQIPHTTIHTPCIKGAADTTHDNPHALHQGSSRYHTRQFTRRTSREPQIPHTTIHTPCIKGAADTTHDNPHALHQGSSRYHIRQFSRSA
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -