Hpse009774.1
Basic Information
- Gene Symbol
- -
- Assembly
- GCA_947369225.1
- Location
- OX376335.1:4701753-4707811[+]
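The location string above packs the sequence accession, start, end, and strand into one field. A minimal sketch of splitting it apart follows; the `Locus` type and `parse_location` helper are hypothetical names, and the coordinates are assumed (not stated in this record) to be 1-based and inclusive.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for a parsed location field."""
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings of the form "SEQID:START-END[STRAND]".
_LOCATION = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Locus:
    """Split a location string into accession, start, end, and strand."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Locus(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(locus: Locus) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return locus.end - locus.start + 1


locus = parse_location("OX376335.1:4701753-4707811[+]")
print(locus, span_length(locus))
```

Under the inclusive-coordinate assumption, the span is the end minus the start plus one; with a 0-based half-open convention the `+ 1` would be dropped.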
Transcription Factor Domain
- TF Family
- MYB
- Domain
- Myb_DNA-binding domain
- PFAM
- PF00249
- TF Group
- Helix-turn-helix
- Description
- This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
- Hmmscan Out
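| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|----|----------|----------|-------|------|----------------|--------------|----------------|--------------|----------------|--------------|------|
| 1 | 6 | 0.33 | 2.9e+02 | 2.1 | 0.0 | 23 | 44 | 351 | 371 | 343 | 373 | 0.88 |
| 2 | 6 | 3.8 | 3.3e+03 | -1.3 | 1.1 | 4 | 15 | 422 | 433 | 420 | 472 | 0.63 |
| 3 | 6 | 0.0041 | 3.6 | 8.2 | 0.1 | 22 | 46 | 582 | 616 | 569 | 616 | 0.73 |
| 4 | 6 | 0.0003 | 0.27 | 11.8 | 0.1 | 3 | 44 | 667 | 717 | 666 | 719 | 0.88 |
| 5 | 6 | 5e-06 | 0.0044 | 17.5 | 0.1 | 2 | 45 | 835 | 889 | 834 | 890 | 0.85 |
| 6 | 6 | 0.063 | 55 | 4.4 | 0.1 | 3 | 15 | 1048 | 1060 | 1046 | 1078 | 0.85 |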
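The six per-domain hmmscan rows above can be read as whitespace-separated records of 13 fields each, in the column order given by the header. The sketch below parses them and keeps the hits under an independent E-value cutoff; the `DomainHit` type, the helper names, and the 0.01 default cutoff are illustrative assumptions, not thresholds taken from this record.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record holding a subset of the per-domain columns."""
    index: int
    c_evalue: float
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int


# The six rows reported for PF00249, copied from the record.
ROWS = """\
1 6 0.33 2.9e+02 2.1 0.0 23 44 351 371 343 373 0.88
2 6 3.8 3.3e+03 -1.3 1.1 4 15 422 433 420 472 0.63
3 6 0.0041 3.6 8.2 0.1 22 46 582 616 569 616 0.73
4 6 0.0003 0.27 11.8 0.1 3 44 667 717 666 719 0.88
5 6 5e-06 0.0044 17.5 0.1 2 45 835 889 834 890 0.85
6 6 0.063 55 4.4 0.1 3 15 1048 1060 1046 1078 0.85
"""


def parse_rows(text: str) -> List[DomainHit]:
    """Parse whitespace-separated rows; skip lines without 13 fields."""
    hits = []
    for line in text.splitlines():
        f = line.split()
        if len(f) != 13:
            continue
        hits.append(
            DomainHit(int(f[0]), float(f[2]), float(f[3]),
                      float(f[4]), int(f[8]), int(f[9]))
        )
    return hits


def filter_hits(hits: List[DomainHit], max_i_evalue: float = 0.01) -> List[DomainHit]:
    """Keep hits whose i-Evalue is at or below an ASSUMED cutoff."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


for hit in filter_hits(parse_rows(ROWS)):
    print(hit.index, hit.i_evalue, hit.ali_from, hit.ali_to)
```

With the assumed 0.01 cutoff only domain 5 (alignment coordinates 835 to 889) is retained; loosening the cutoff to 1.0 also admits domain 4.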
Sequence Information
- Coding Sequence
- ATGGAGCGCCATCAGATTGTGGTCAAGTCTGAGATGGAGACAGATGGAGAGTTACTAGTGTTTTATGTGGATGAAAATGGCACCACTGAAGAAGGCGTAGTAACTTCGATAGAAAACATTGTGGATGGCAATTTAGATCAGCTTCAGCAAGGAAGCTATATAATTGAAGATGTCGGCGAACAACCTGCCCAAAATGAACCAATGCAGATAGAAGAAGAAAACACCAGCAACGACAAATGGCTGGATGAGGAAACGAAGAGACTACTGATATTCTACATTGACAATAAGGTTACTTTCCTCAGTGGTGCTACAATGAAAAAGCACCTTTGGACCGTTGCTTGTAAGACTATGCTGACAGGAAAAACTCCAACATGTTGTGAAATTAAGCTACGTAATCTAAAGAAAAGTTATTCTGGAATACGTTTAGATCAAGCAAAGGGAAAAAACTTATTCCTCTGGCCTTATTACGAATTGTGTCACCAGGCCTTCCATGACGACGCGTATGTTAATATGCTGTTGAATGACGCTGGGAAAAAAGAGCAACTTGTTAAGTTAAATGAGCCTGCAACGAACCCTGAGAATGATGGCATATTAATAAAAAAAGTTAACAGCAACAGATCAGTGGATGAAGCAGTTGAGAAGATGCTAAATTTGTATCTCAAACATAAAAAGGCAAATAAAGATACTTGGCAGAGACACTTGTGGGACATCATAGCTATAGAGTTAGGGGAAGATGGGGAGTATTGGCACAAACGCTTCCTGAATTTCAAACAGCATTACATTAGAATGTTAGAAAAACGTACAAAGTCCGGTTCCAAAAGTGTCAATTGGCCTTACATGCACATATTTGATGAAATATTTGAAACCGACGTAGATTTTCAAAGAAAATATGGATCGCCAAAAAAAGAGCCGGTTACCATTATAAGTGATATAAGTCATACAGTATCACCAAAACTTGAGTGGGATGATACAGAAAAAACTGTCTTGGCCAAATATTGCTTTGACTGTTTTGATGAATTCCAAGATGCGACTATACCTAATAGCTTTCTGTGGAATGAAGTTGGAAGACTGCTGGATAAAACTGCCGATGTTTGTAAGAAAAAGTACGTAGAACTGAAGCAAAATCATTTGGAACAATACATCTCTGGCGGCTATGATATGCGGAATCGTATACCTATTGCGATATTGTTCGACAATATCATATCTCGAGAGATCGAAAGGCAGTTATCTAACATTCTTGTAAAACCAGGCAAAATGGATTTGTGGGAAACGGAAGAAATTGACATACTAGTGCAGTTTTTTTACGATAATTTGGAGATGTTCAAAGATCCGATATGCCATTATGTTTGTTGGGCAGCGGCAAGCAGGAAATTGAATCAAAATGTATTGAGTTGCAAATACCAGTGGCTAGATCTGACCACTTTGTATAAATCTATATTGGATGACAAAAAAGAGGATCCCGATATGGAGATCGATTGGAGATACATAGAGTTGTTCGATAGAATCTTTGATTATGGCATGGATACGGAATTACTGGATGGCTACGAGAAGTATGAGAAGCCATCTGAGAAGAAAGACTCTGGGAAGATTTGTGCCAAGAAACTGAACATCAAGCTAGAGGAAAACCTCGAAGTAAACGATGATGAAGAGGAATATGACGAAAGGGGTTTCATCAAACGATCCCGAGGCATTGGGGACTCGAAACAGATCAAAATACTAGAGTTTTTCCAAAAGAACAAGGATAAATTTGCTACATCCAAACGAAAGAAATTAGCCCTATGGGGAGTGCTAGCGGAGCAAATAGGAATAACCGCCGAGCAGTGCGCACACCGCTTCAGAAACTTAAAGCAAGTCTACACAGCCTACATCCAGAGAGAAATCAACAAACCCGAAATGCCAATCCTCTGGCCTTATTACACGCTATGCAAGAAAGTATTCGGTTACCGAGCGATCAAAAACAAACTCAGGAACAGCAAAGTTGACTCCGATGACATTGAA
GAATGGACTCCTAAGGATATAAAGCTCGTGATAAAACATTTCTCTAAAAACTTCAACGTGATCAATGGTAATGCTGAGGACAAAAGCGTTTGGTCTCCGCTAGCACAAGAGATTAGTAGAAGTGATACTTCGTGCAGAGATAAACTGCTCGAACTGAGGAAGTCTTACAAGAAAGTTAAGATGATGATCAGTCGAAACCCAGATTGTAAAATAAATTGGAAATATTTCAAGCTGTTCGAAGAGATTTATACGGGAGTTAAAGTTGAGGAGATGGAAGTGGATAGTGACGATGATAGAGCAGAAGTGTCTCAGGAGGTGCTGGGTTCTGACACGGACAAGAAAAAGCTTCAAGAAGATGAGGATTACCAATGCATCATAGTGATGTCCGAAGGTGGTGAAGAATTGACGGAAAGTGTTCATCAATCATCAGAAATCATCTTAAAAGTAACGCCGGTTGAAGCCCCAGTCACAGCAACCACCAACACTAATGAAAAACGAAAACCAGTACCTTGGACATCAAAAGAAAGGGAGTTACTACTCAACACGTATTTAAACTACATCAAAGCTAATAAGGGAACATCCATAAACACAGAAGATATGTGGAAAGAGATTGCAAAGAAATTCCCAAAGAAATCACCAATTGGTTGCAGACGGAAGTATATCAAAATGAGAAATGATCACAAAAAGAATGCTAACGATGAAAGGTACAAGACCAGTCTGTACTGTGAACTTTTCAAACAGATATTTGCTGTGAAACCCAAGTTTTCTGCAGCACCAATAGACGATTGTTTTCTGCGAGAATCACAAATATTCAAAGACGTCACTTTACCCATATGGAAAGTGGAGCAAGCTCTCAAGTATTATCTTGAACATTTAGAGGAATTCATAAGTCCTAGATACGAGAAAAAATACGTTTGGAAAGAACTATCGAATGCCACGTCTTTACCACTCTATAAAACATTTAAAAAGATCAATTATCTAAAACAGGTTTACAGAAGCGATACGAGTCAGGTGTTAGACAAAAATGCTGAAGCTCTGGGTTTAGATTTTAGTGGTTTACTAAAACAGATATTCGAGAAAGAGTTTGCTATTAAAATAGCTACATACGAACCCAAGCCCCAAGTTGAAGATGAGTCGGAAACATCTTGGTCGGATGAAGAAACGGAGCAACTTCTGAATTGGTATTTGGAAAACTTGAGTAAGTTTAAAAATCCGAAATTTGTGCCGAGTTACCTATGGATGGACGCCGCGGAGGCTCTAAACAAATCCGCTTTGGCTTGTTCCAAGAAAATGTCCGAGATTCGAACCCGGTATAGCGGTATGGTGAAGGAGACTCCAGAAGCGTTGGTTGAGTGGAAGTTCCACGAGTTGTGTCAGAAAATCTATGGAACTGGTAAGAAAAGCACACCAGTGAATAGTGATTAG
- Protein Sequence
- MERHQIVVKSEMETDGELLVFYVDENGTTEEGVVTSIENIVDGNLDQLQQGSYIIEDVGEQPAQNEPMQIEEENTSNDKWLDEETKRLLIFYIDNKVTFLSGATMKKHLWTVACKTMLTGKTPTCCEIKLRNLKKSYSGIRLDQAKGKNLFLWPYYELCHQAFHDDAYVNMLLNDAGKKEQLVKLNEPATNPENDGILIKKVNSNRSVDEAVEKMLNLYLKHKKANKDTWQRHLWDIIAIELGEDGEYWHKRFLNFKQHYIRMLEKRTKSGSKSVNWPYMHIFDEIFETDVDFQRKYGSPKKEPVTIISDISHTVSPKLEWDDTEKTVLAKYCFDCFDEFQDATIPNSFLWNEVGRLLDKTADVCKKKYVELKQNHLEQYISGGYDMRNRIPIAILFDNIISREIERQLSNILVKPGKMDLWETEEIDILVQFFYDNLEMFKDPICHYVCWAAASRKLNQNVLSCKYQWLDLTTLYKSILDDKKEDPDMEIDWRYIELFDRIFDYGMDTELLDGYEKYEKPSEKKDSGKICAKKLNIKLEENLEVNDDEEEYDERGFIKRSRGIGDSKQIKILEFFQKNKDKFATSKRKKLALWGVLAEQIGITAEQCAHRFRNLKQVYTAYIQREINKPEMPILWPYYTLCKKVFGYRAIKNKLRNSKVDSDDIEEWTPKDIKLVIKHFSKNFNVINGNAEDKSVWSPLAQEISRSDTSCRDKLLELRKSYKKVKMMISRNPDCKINWKYFKLFEEIYTGVKVEEMEVDSDDDRAEVSQEVLGSDTDKKKLQEDEDYQCIIVMSEGGEELTESVHQSSEIILKVTPVEAPVTATTNTNEKRKPVPWTSKERELLLNTYLNYIKANKGTSINTEDMWKEIAKKFPKKSPIGCRRKYIKMRNDHKKNANDERYKTSLYCELFKQIFAVKPKFSAAPIDDCFLRESQIFKDVTLPIWKVEQALKYYLEHLEEFISPRYEKKYVWKELSNATSLPLYKTFKKINYLKQVYRSDTSQVLDKNAEALGLDFSGLLKQIFEKEFAIKIATYEPKPQVEDESETSWSDEETEQLLNWYLENLSKFKNPKFVPSYLWMDAAEALNKSALACSKKMSEIRTRYSGMVKETPEALVEWKFHELCQKIYGTGKKSTPVNSD
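The coding and protein sequences above should agree under translation. As a rough consistency check, the sketch below translates a nucleotide string with the standard genetic code and is applied only to the first 21 and last 15 bases of the coding sequence; the `translate` helper is a hypothetical name, and use of the standard code table is an assumption, since the record does not name a translation table.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 15 bases of the coding sequence in this record.
print(translate("ATGGAGCGCCATCAGATTGTG"))
print(translate("GTGAATAGTGATTAG"))
```

The two fragments translate to `MERHQIV` and `VNSD`, which match the start and end of the listed protein sequence.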
Similar Transcription Factors
Sequences clustered by similarity with MMseqs2 at the identity thresholds below:
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -