Carc013889.1
Basic Information
- Insect
- Coenonympha arcania
- Gene Symbol
- -
- Assembly
- GCA_036785405.1
- Location
- CM072068.1:711663-723511[-]
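
The Location field packs a contig accession, a coordinate range, and a strand into one string. A minimal sketch of how it might be split into its parts follows. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers, not part of any database API, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for the parts of a Location string."""
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "CM072068.1:711663-723511[-]".
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'contig:start-end[strand]' string into typed fields."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CM072068.1:711663-723511[-]")
print(loc.contig, loc.start, loc.end, loc.strand, span_length(loc))
```

The span covers the whole locus on the contig, so it can be longer than the coding sequence listed further down.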
Transcription Factor Domain
- TF Family
- HTH
- Domain
- HTH_psq domain
- PFAM
- PF05225
- TF Group
- Helix-turn-helix
- Description
- This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster [1]. In pipsqueak, this domain binds the GAGA sequence [1].
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1   31  1.6e-10   1.2e-07   32.1   0.0   4         39      177       213     174       217     0.89
2   31  2.5e-10   1.8e-07   31.6   0.3   2         39      229       266     228       269     0.92
3   31  7.5e-08   5.6e-05   23.6   0.0   2         30      284       312     283       317     0.93
4   31  7.5e-08   5.6e-05   23.6   0.0   2         30      326       354     325       359     0.93
5   31  7.5e-08   5.6e-05   23.6   0.0   2         30      368       396     367       401     0.93
6   31  7.5e-08   5.6e-05   23.6   0.0   2         30      410       438     409       443     0.93
7   31  7.5e-08   5.6e-05   23.6   0.0   2         30      452       480     451       485     0.93
8   31  7.5e-08   5.6e-05   23.6   0.0   2         30      494       522     493       527     0.93
9   31  7.5e-08   5.6e-05   23.6   0.0   2         30      536       564     535       569     0.93
10  31  7.5e-08   5.6e-05   23.6   0.0   2         30      578       606     577       611     0.93
11  31  7.5e-08   5.6e-05   23.6   0.0   2         30      620       648     619       653     0.93
12  31  7.5e-08   5.6e-05   23.6   0.0   2         30      662       690     661       695     0.93
13  31  7.5e-08   5.6e-05   23.6   0.0   2         30      704       732     703       737     0.93
14  31  7.5e-08   5.6e-05   23.6   0.0   2         30      746       774     745       779     0.93
15  31  7.5e-08   5.6e-05   23.6   0.0   2         30      788       816     787       821     0.93
16  31  7.5e-08   5.6e-05   23.6   0.0   2         30      830       858     829       863     0.93
17  31  7.5e-08   5.6e-05   23.6   0.0   2         30      872       900     871       905     0.93
18  31  7.5e-08   5.6e-05   23.6   0.0   2         30      914       942     913       947     0.93
19  31  7.5e-08   5.6e-05   23.6   0.0   2         30      956       984     955       989     0.93
20  31  7.5e-08   5.6e-05   23.6   0.0   2         30      998       1026    997       1031    0.93
21  31  7.5e-08   5.6e-05   23.6   0.0   2         30      1040      1068    1039      1073    0.93
22  31  7.5e-08   5.6e-05   23.6   0.0   2         30      1082      1110    1081      1115    0.93
23  31  7.5e-08   5.6e-05   23.6   0.0   2         30      1124      1152    1123      1157    0.93
24  31  7.5e-08   5.6e-05   23.6   0.0   2         30      1166      1194    1165      1199    0.93
25  31  7.5e-08   5.6e-05   23.6   0.0   2         30      1208      1236    1207      1241    0.93
26  31  7.5e-08   5.6e-05   23.6   0.0   2         30      1250      1278    1249      1283    0.93
27  31  7.5e-08   5.6e-05   23.6   0.0   2         30      1292      1320    1291      1325    0.93
28  31  7.5e-08   5.6e-05   23.6   0.0   2         30      1334      1362    1333      1367    0.93
29  31  7.5e-08   5.6e-05   23.6   0.0   2         30      1376      1404    1375      1409    0.93
30  31  7.5e-08   5.6e-05   23.6   0.0   2         30      1418      1446    1417      1451    0.93
31  31  0.00076   0.57      10.8   0.7   2         24      1460      1482    1459      1483    0.92
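
Each domain hit in the Hmmscan output carries 13 whitespace-separated values in the column order given by its header. The sketch below shows one way such rows might be read into typed records and filtered. `DomainHit` and `parse_hit` are hypothetical names, only three rows from this record are embedded for brevity, and the i-Evalue cutoff of 0.01 is an illustrative assumption, not a threshold used by the database.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One per-domain row, fields in the column order of the header."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(row: str) -> DomainHit:
    """Convert one whitespace-separated row into a DomainHit."""
    f = row.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {row!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# Three rows copied from this record (hits 1, 2 and 31).
ROWS = [
    "1 31 1.6e-10 1.2e-07 32.1 0.0 4 39 177 213 174 217 0.89",
    "2 31 2.5e-10 1.8e-07 31.6 0.3 2 39 229 266 228 269 0.92",
    "31 31 0.00076 0.57 10.8 0.7 2 24 1460 1482 1459 1483 0.92",
]

hits = [parse_hit(r) for r in ROWS]

# ASSUMPTION: 0.01 is an arbitrary illustrative cutoff on the i-Evalue.
significant = [h for h in hits if h.i_evalue < 0.01]

for h in significant:
    ali_len = h.ali_to - h.ali_from + 1  # aligned residues, inclusive
    print(h.index, h.i_evalue, h.score, ali_len)
```

With this cutoff, the last hit (i-Evalue 0.57) would be dropped while the first two are kept.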
Sequence Information
- Coding Sequence
- ATGCCCGGATCAGAAAGCCTGCAAGTGAAATGCGAATCGCCTTCTGAGGATGAAGAGAGTAATTCAGAATTCGTACATTCTTTTGCTCATCGGCTTCCAACAGAATACGTGCCTCCGAGTATAAAATTGGAACCGAACTCGCAAGTAGACGAAAATGCATCATCGAACCTTTTGCATCATTTGCAGTGCGTGGAAGGCGCCCTGGAGGAGCTGGTACCGGCGCCGGCGGGGCCCGCAGCCACGCCCGTGTGCGAGCAGCCCCCGGCCACCTATGAAGCCTCCTTCGCGACCCTCGAGCCTGCCTCCCGCGCTGACACTCTGGGTTCATCACTGATCGACAGGCATATAAAGATGATGACTGCGAACGAAGTGGCCGACTCTTCAGACTTCGTACCGCTCATGGCTATCAAGGATGAGCCATCGTCTGAAGGAGAGCAGCAACACCTGAGTGGAGACGACACTAGCGACTCCAACGGGGCGCCACAGGAGTCGGCGAGCCGCGGCAGTCCCAAGAGCTGGACCCAGCAGGACATGGAGAAGGCGCTGGAAGCCTTGAGGAAGCATCACATGAGCTTGACTAAGGCTTCGGCGACATACGGTATACCGTCAACGACGCTGTGGCAGCGAGCGCACCGGCTGGGCATCGACACTCCTAAGAAGGAAGGCGCCTCTAAGTCCTGGAGCGAGGCAGACCTGCGGGGAGCGCTGCACGCGTTGAGAGCCGGCGCCATTTCCGCCAACAAGGCCAGCAAGGCGTATGGTATCCCGAGCAGTACTCTGTACAAGATAGCTCGTCGCGAAGGCATCCGGCTGGCGGCGCCCTTCAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGActtggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGActtggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggag
cgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCG
GCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTCGGGATACCCACGGGTGAGTTGCTTGACAACGCCGCTCCCACGGCGTGGCGGCGAGCGGacctggagcgcgcgctgcaggcgatacgctgcggcgcggcctccgTGCAGCGAGCCGCCACGCAGTTACTTGCAGCGCTTGAAGCGGCCGTACAGCGACGAGTAGGGCAGGTTGTAGTGGATCGCCGCCTGGTTGATGGACATCTGACCCACCCTGCACATTAG
- Protein Sequence
- MPGSESLQVKCESPSEDEESNSEFVHSFAHRLPTEYVPPSIKLEPNSQVDENASSNLLHHLQCVEGALEELVPAPAGPAATPVCEQPPATYEASFATLEPASRADTLGSSLIDRHIKMMTANEVADSSDFVPLMAIKDEPSSEGEQQHLSGDDTSDSNGAPQESASRGSPKSWTQQDMEKALEALRKHHMSLTKASATYGIPSTTLWQRAHRLGIDTPKKEGASKSWSEADLRGALHALRAGAISANKASKAYGIPSSTLYKIARREGIRLAAPFNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQFGIPTGELLDNAAPTAWRRADLERALQAIRCGAASVQRAATQLLAALEAAVQRRVGQVVVDRRLVDGHLTHPAH
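
The protein sequence should correspond to a codon-by-codon translation of the coding sequence. A minimal sketch of that check follows, assuming the standard nuclear genetic code applies (the record does not say which table was used). `translate` is a hypothetical helper, and only the first 18 and last 12 nucleotides of the coding sequence are used here rather than the full entry.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Lowercase (soft-masked) bases are upper-cased first, so mixed-case
    sequences like the one in this record are handled.
    """
    seq = cds.upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 18 nt and last 12 nt of the coding sequence in this record.
print(translate("ATGCCCGGATCAGAAAGC"))
print(translate("CCTGCACATTAG"))
```

The two fragments translate to the first six and last three residues of the protein sequence listed above (`MPGSES` and `PAH`), with the final `TAG` read as a stop codon.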
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -