Nint019809.1
Basic Information
- Insect
- Neoascia interrupta
- Gene Symbol
- bab1_1
- Assembly
- GCA_947623515.1
- Location
- OX392463.1:217070802-217073900[-]
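The Location string above packs the sequence accession, start, end, and strand into one field. A minimal sketch of splitting it apart follows, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

# Pattern for strings shaped like "OX392463.1:217070802-217073900[-]".
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)

def parse_location(location):
    """Split a location string into its parts (hypothetical helper)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    return {
        "seqid": match.group("seqid"),
        "start": start,
        "end": end,
        "strand": match.group("strand"),
        # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
        "span": end - start + 1,
    }

loc = parse_location("OX392463.1:217070802-217073900[-]")
print(loc["seqid"], loc["strand"], loc["span"])
```

Under that assumption the locus spans 3099 bp on the minus strand.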
Transcription Factor Domain
- TF Family
- HTH
- Domain
- HTH_psq domain
- PFAM
- PF05225
- TF Group
- Helix-turn-helix
- Description
- This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster [1]. In pipsqueak, this domain binds to the GAGA sequence [1].
- Hmmscan Out
-
  #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
  1  1   1.9e-17   3.7e-14   53.1   0.0   1         43      80        123     80        125     0.95
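The hmmscan domain row can be read into named fields, and the alignment coordinates then used to cut the matched region out of a protein sequence. The sketch below assumes the thirteen columns appear in the order shown in this record; the field names and both helper functions are hypothetical. HMMER reports coordinates as 1-based and inclusive, so the slice subtracts one from the start.

```python
# Column names chosen for illustration; order follows the row in this record.
FIELDS = [
    "domain_index", "domain_count", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_FIELDS = {
    "domain_index", "domain_count",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to",
}

def parse_domain_row(row):
    """Turn one whitespace-separated domain row into a dict of numbers."""
    values = row.split()
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} columns, got {len(values)}")
    parsed = {}
    for name, raw in zip(FIELDS, values):
        parsed[name] = int(raw) if name in INT_FIELDS else float(raw)
    return parsed

def extract_domain(protein, ali_from, ali_to):
    """Return the aligned region using 1-based inclusive coordinates."""
    return protein[ali_from - 1:ali_to]

hit = parse_domain_row("1 1 1.9e-17 3.7e-14 53.1 0.0 1 43 80 123 80 125 0.95")
print(hit["ali_to"] - hit["ali_from"] + 1)  # residues covered by the alignment
```

For this record the alignment covers residues 80 to 123 of the protein, that is 44 residues; passing the full protein sequence to `extract_domain(protein, 80, 123)` would return that region.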
Sequence Information
- Coding Sequence
- ATGTTTATACTTAAAAAGTTTTCGTGCAAGCAGTTAAATAACCACAAATTTCTGTGTTTGTTGTATTTGCAGGCTAAACTACTGGAGAGCTCACATTCGTGGATGGGCGCCTCAACATCGTCTATCGCAGACAGCTACCAGTACCAATTGCAGTCCATGTGGCAAAAGTGCTGGAATACCAACCAAAATCTTATGCATCACCTGCGATTCCGCGAACGAGGTCCGTTGAAGTCTTGGCGTCCAGAAACAATGGCAGAAGCCATATTTAGCGTTTTAAAAGAAGGCTTATCCCTGTCTCAAGCGGCCCGCAAATACGACATTCCATATCCAACATTCGTACTCTACGCAAACCGAGTTCACAACATGCTCGGCCCATCGATAGATGGTGGCACTGACCTAAGGCCAAAAGGTCGAGGGCGTCCACAACGGATCCTGCTAGGCATTTGGCCCGATGAACATATAAAAGGTGTCATCAAGACAGTGGTATTTCGTGATGCTAAGGAGATCAAGGACGAGGGCATTCACTTGCACTACGGACGACATTCGCCAGTATTTCCTTTTCAAGACGCCGCGCTCAATTATCCGGGAGGGCCGGGCTGCCCCAACGGCTTACCCACAGGCCCAGTTGGTCCCACTGATTCCATGTCACAAGAAGCAACAGCAGCAGCTGTGGCCGGCGTCGCGCATGCATTGCGCCAGCAAATGCAGATGGCAGCAGCTCAGCAGCATCCACACCAGCATCCAACGCACCAGCACCACCATCCGGAATCAGCTTCCAATCTATTCAATCTGCCAGCGCACATGTCTCCACACGGTAGTGCTGGTGGGATTCCATTACCAAAGCAAAACTCTCCTATATCAAGTGGAACGGGGCAAAATCCAATTCAAAGACATTCATCGCCGTCTGCATCTGCGATACCTGGTCTTCCTAATCAATCCGTACTGAACTTAGCCAGTATATCAGGTCTTCATATGGTTTCCGGGATGGGATCCAATATGACCTCAACCGGAGGTTCCTCGACTTCATCAACTTCCACTGTACACCGAGGAGAAGGTCAGTCACAGACAAGCCATCAGTCCAACGTCTCACCTGCGCACATGTCATATAGTAGCAGTAGTTCTGTAGGGCATGCTCCACAACATGCGGCTGCGCCACCGTCCAGTAACACCAGCGCACGACTTCATCCGACGTCACCCTCCGAATTAGCTCACGATCTCCGGATGAACAGTCCTGAGGAGAGCCTGCTGAGCTCTCCCCTTCAAATGTCGCTTGAGCCGGCTGTCAATTTGGCTGTTGGGGTGAGCGGTATGGCCTACAAGCCGTCTCGAGGCTATTCATCACCTCGACCAGAACATCTCTTTCAGGAAGACATAGCGGAACTTGTAGGCTCCGCTAGTGACAGATCATCAGCATGTCCCCCGAACATGGGTACCTACAAGGACACTCCCACTAATATCAAAATGGAACCAATTACAGAATGTCGAAGCGAGTAA
- Protein Sequence
- MFILKKFSCKQLNNHKFLCLLYLQAKLLESSHSWMGASTSSIADSYQYQLQSMWQKCWNTNQNLMHHLRFRERGPLKSWRPETMAEAIFSVLKEGLSLSQAARKYDIPYPTFVLYANRVHNMLGPSIDGGTDLRPKGRGRPQRILLGIWPDEHIKGVIKTVVFRDAKEIKDEGIHLHYGRHSPVFPFQDAALNYPGGPGCPNGLPTGPVGPTDSMSQEATAAAVAGVAHALRQQMQMAAAQQHPHQHPTHQHHHPESASNLFNLPAHMSPHGSAGGIPLPKQNSPISSGTGQNPIQRHSSPSASAIPGLPNQSVLNLASISGLHMVSGMGSNMTSTGGSSTSSTSTVHRGEGQSQTSHQSNVSPAHMSYSSSSSVGHAPQHAAAPPSSNTSARLHPTSPSELAHDLRMNSPEESLLSSPLQMSLEPAVNLAVGVSGMAYKPSRGYSSPRPEHLFQEDIAELVGSASDRSSACPPNMGTYKDTPTNIKMEPITECRSE
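One way to check that the coding and protein sequences agree is to translate the coding sequence and compare the result. The sketch below assumes the standard genetic code applies to this nuclear gene; `translate` is a hypothetical helper, and only the first and last few codons of the record are shown for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First eight codons and last five codons of the coding sequence above.
print(translate("ATGTTTATACTTAAAAAGTTTTCG"))
print(translate("TGTCGAAGCGAGTAA"))
```

The two fragments translate to `MFILKKFS` and `CRSE`, matching the start and end of the protein sequence in this record. Running `translate` on the full coding sequence and comparing it with the full protein string would extend the check to every residue.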
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01395777; iTF_00315074; iTF_00310950; iTF_00240588; iTF_00314189; iTF_00311732; iTF_01300768; iTF_01522470; iTF_01521802; iTF_01520882; iTF_01223307; iTF_01299931; iTF_00688526; iTF_00671524; iTF_01116368; iTF_00991678; iTF_00426264; iTF_00974561; iTF_01356770; iTF_00389539; iTF_00670877; iTF_00427047; iTF_01253447; iTF_00663981; iTF_01318157; iTF_00187983; iTF_00694511; iTF_00187985; iTF_00893779; iTF_00984147; iTF_00724692; iTF_01541364; iTF_00334901; iTF_00665557; iTF_01542133; iTF_00672151; iTF_00693687; iTF_00976378; iTF_01211835; iTF_01396414; iTF_00664767; iTF_00310173; iTF_00241376; iTF_00312512; iTF_00313373;
- 90% Identity
- -
- 80% Identity
- -
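The cluster entries above are semicolon-separated identifier lists, with a lone `-` marking an empty cluster. A small sketch of turning such a field into a Python list follows; `parse_cluster_ids` is a hypothetical helper, and treating `-` as "no members" is an assumption based on how the 90% and 80% rows appear here.

```python
def parse_cluster_ids(field):
    """Split a semicolon-separated ID field; '-' or blank means no members."""
    field = field.strip()
    if field in ("", "-"):
        return []
    # A trailing semicolon leaves an empty last piece, which is dropped here.
    return [part.strip() for part in field.split(";") if part.strip()]

sample = "iTF_01395777; iTF_00315074; iTF_00310950;"
print(parse_cluster_ids(sample))
print(parse_cluster_ids("-"))
```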