Basic Information

Insect
Nomada fucata
Gene Symbol
lola
Assembly
GCA_948146005.1
Location
OX411278.1:30260463-30262679[-]
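
The Location value above packs the sequence accession, coordinates, and strand into one string. Below is a minimal sketch of how it could be split into fields. The `Locus` type and `parse_locus` helper are hypothetical names introduced for illustration, and the layout `<accession>:<start>-<end>[<strand>]` with 1-based inclusive coordinates is an assumption inferred from this record, not a documented specification.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for a parsed Location value."""
    seqid: str
    start: int
    end: int
    strand: str


# Assumed layout: <accession>:<start>-<end>[<strand>]
LOCUS_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a Location string into accession, start, end, and strand."""
    match = LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


locus = parse_locus("OX411278.1:30260463-30262679[-]")
# Span length, assuming both coordinates are inclusive.
span = locus.end - locus.start + 1
print(locus.seqid, locus.strand, span)
```

If the coordinates are in fact half-open, the `+ 1` in the span calculation should be dropped.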

Transcription Factor Domain

TF Family
HTH
Domain
HTH_psq domain
PFAM
PF05225
TF Group
Helix-turn-helix
Description
This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster [1]. In pipsqueak, this domain binds to the GAGA sequence [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1  5   6.6e-07   0.00092   18.5   0.0   5         38      202       235     198       241     0.82
2  5   6.1e-11   8.6e-08   31.4   0.0   5         39      252       285     250       292     0.84
3  5   0.015     22        4.5    0.0   7         34      306       332     302       338     0.83
4  5   4.1e-13   5.8e-10   38.4   0.0   4         39      363       398     360       404     0.92
5  5   0.96      1.4e+03   -1.2   0.0   31        44      405       418     403       419     0.83
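
The five domain hits above differ widely in significance. Below is a minimal sketch of how the rows could be parsed and filtered by independent E-value. The `DomainHit` type, the `parse_hits` helper, and the cutoff of 0.01 are assumptions made for illustration; the record itself does not state which threshold was applied.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical subset of the per-domain columns."""
    index: int
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int


# Rows copied from the Hmmscan Out table of this record.
ROWS = """\
1 5 6.6e-07 0.00092 18.5 0.0 5 38 202 235 198 241 0.82
2 5 6.1e-11 8.6e-08 31.4 0.0 5 39 252 285 250 292 0.84
3 5 0.015 22 4.5 0.0 7 34 306 332 302 338 0.83
4 5 4.1e-13 5.8e-10 38.4 0.0 4 39 363 398 360 404 0.92
5 5 0.96 1.4e+03 -1.2 0.0 31 44 405 418 403 419 0.83
"""


def parse_hits(text: str) -> List[DomainHit]:
    """Parse whitespace-separated rows with the 13 columns shown above."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13:
            continue  # skip blank or malformed lines
        hits.append(
            DomainHit(
                index=int(fields[0]),
                i_evalue=float(fields[3]),
                score=float(fields[4]),
                ali_from=int(fields[8]),
                ali_to=int(fields[9]),
            )
        )
    return hits


I_EVALUE_CUTOFF = 0.01  # illustrative threshold, not taken from the record

hits = parse_hits(ROWS)
confident = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
for h in confident:
    print(f"domain {h.index}: residues {h.ali_from}-{h.ali_to}, score {h.score}")
```

With this cutoff, hits 1, 2, and 4 are retained, while hits 3 and 5 fall below the threshold.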

Sequence Information

Coding Sequence
ATGTCCGAGATTGATAACCAACATGAGATTAACCAGAGCTATTGGTTTAAATGGAACGACTACCAAAATCATTTGTCCGATGTAGTTAGACAACTTCTGGAAGAGGATTGTATGGTGGACGTTACCCTGGCTGCTGCCGGGGAACGTATTCATGCTCATCGAATAGTTCTTTGTGCTTGTAGCACTTTATTTCGAGAAATATTGAGTCAAGTAAATGAAGATCATCCGACAATAATATTAAGCGACATATCTGCTCAAGATATTAAATCTATTATAGAATTTACTTACCATGGGGAAGTACGAGTTCCAGTCAACAATATTAGCAGTTTATTAGATGCAGCACGTTCCTTAAAAATTTGTGGCCTGCTCGAGATCGATGGACTAGATGAGAGTGACTCTGTTGAGAACACTAAAGATTTTAACGATAACAATGAAGAGCCGATGTCTGTAATCAGTGAGACTGACGAAAGTGTTATTAACGAACTGCAACATGACTACGAGGATTTAGATGAACAAAAACAGGAGATTATCGAGGACTCCACTGTAAATAGAAAGAAGAAACGTAGGCGCGATACAATGAAACGAGATTACAGCGACAATATGTTAGCATCTGCAATCCATGATTTAAAGTCTGGTCAAACATTAATAGAAGCTTCTACTAAACATAATATACCACGTTCAACTTTATACATGCGTGCTAAAGCATTAGGTATACACTTGAACGCATCTAGAAACGAATATCCTGTAGAATGTATGAAAGCAGCTATAAACGCAGTAATAGATGGATCAAGTTTGCAGCATGCGTCAGAAATGTTTAGTATACCTAAGACTGTACTATGGAGGAGGATTCAAAAGGAAGGGTATCAAATGTTGCGTTCAGAAATGAAACGATCTTACGGTTCTGATAAACGGGAAGCAGCGGTTAAAGCTTTACAAAGGGGAGAGAATTTGACTAAAGTAGCACTTGAGTTTAAAATTCCAAAGACTACTTTATTTAGAGACAAAGCTCGACTGGTAGACGAAGGTAAATTACCTTTATCATTTTGGAAGAAACGCAAAGCGGAAAACGAACAATTAAAGAAGTCACGCTTGGAAGAAGCTGTTGCTGCTTGTAAAGGTGGAAAAATGTCGCAAGCAGCAGCATCGATGACTTATGATATTCCGAAGACGACGATATGGAGACGTCTTCAGCAGGATGGTAAAAAATCAGAGCGCTCATTAAGCTCGAGGAAGCAAAGAATAACAGACTCCAAACATAATGCAGAAACCAAAGTACAAGAAGGATCTAATTTCACGTATTGCGAGGTTTCATCCGAAATTCCTATAACTTACATAGACGAAAATAGTATACCTGAAGATTCTGTGATAATATTAACGGCTGAAGATATGGACGGACTTAATTTAGAAGAGGGACGGCAAATTATTGTTAATTCAGAATCTGGACAAGAATACGTTCCTTGTGCTATTAGTATCGAAGAAAACTCCAATTATTCGCAGACAGAAAGTTAG
Protein Sequence
MSEIDNQHEINQSYWFKWNDYQNHLSDVVRQLLEEDCMVDVTLAAAGERIHAHRIVLCACSTLFREILSQVNEDHPTIILSDISAQDIKSIIEFTYHGEVRVPVNNISSLLDAARSLKICGLLEIDGLDESDSVENTKDFNDNNEEPMSVISETDESVINELQHDYEDLDEQKQEIIEDSTVNRKKKRRRDTMKRDYSDNMLASAIHDLKSGQTLIEASTKHNIPRSTLYMRAKALGIHLNASRNEYPVECMKAAINAVIDGSSLQHASEMFSIPKTVLWRRIQKEGYQMLRSEMKRSYGSDKREAAVKALQRGENLTKVALEFKIPKTTLFRDKARLVDEGKLPLSFWKKRKAENEQLKKSRLEEAVAACKGGKMSQAAASMTYDIPKTTIWRRLQQDGKKSERSLSSRKQRITDSKHNAETKVQEGSNFTYCEVSSEIPITYIDENSIPEDSVIILTAEDMDGLNLEEGRQIIVNSESGQEYVPCAISIEENSNYSQTES
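
The protein sequence above should correspond to a translation of the coding sequence. Below is a minimal sketch of how that could be spot-checked, assuming the standard genetic code applies. The `translate` helper is a hypothetical name, and only short fragments copied from the start and end of the coding sequence are used to keep the example compact.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break  # stop codon terminates translation
        protein.append(aa)
    return "".join(protein)


# First 39 and last 15 nucleotides of the Coding Sequence in this record.
cds_start = "ATGTCCGAGATTGATAACCAACATGAGATTAACCAGAGC"
cds_end = "CAGACAGAAAGTTAG"

print(translate(cds_start))  # MSEIDNQHEINQS
print(translate(cds_end))    # QTES
```

Both fragments match the corresponding ends of the protein sequence listed above. Passing the full coding sequence to `translate` would allow a complete comparison.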

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00118108;
90% Identity
iTF_00963105;
80% Identity
iTF_01067045;