Basic Information

Gene Symbol
-
Assembly
GCA_945859575.2
Location
OX243853.1:13289896-13296499[-]

Transcription Factor Domain

TF Family
MYB
Domain
Myb_DNA-binding domain
PFAM
PF00249
TF Group
Helix-turn-helix
Description
This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 7 6.1 1.1e+04 -2.3 0.0 3 13 73 83 72 86 0.87
2 7 0.024 42 5.4 0.0 23 44 371 391 358 393 0.89
3 7 0.48 8.7e+02 1.2 0.3 22 43 467 490 442 493 0.68
4 7 0.023 41 5.4 0.1 22 45 608 635 590 636 0.74
5 7 0.00067 1.2 10.3 0.1 3 45 687 738 685 739 0.83
6 7 0.00095 1.7 9.8 0.0 22 42 882 904 864 908 0.77
7 7 0.24 4.2e+02 2.2 1.4 3 15 1063 1075 1061 1080 0.88

Sequence Information

Coding Sequence
ATGGATCAGAGTATAGTAGTTAAGACGGAGATGCAATCAAATGGGGAGATATTGCTTTTCTATGTTGATGAAAATGGGGGCAATGAGGAAGGAGTACTAACAACAGTCCAAAACATTGAAAACCAAGCAATAGAACTTCAACAAGACAATTACGTCTTACCAGATGTTGGAGATCTTGACAACATTAACATAGCTCAATCCAGCACTCAAAATGACAACTGGAGCGAAGATGACATCAAAAGACTTCTTGTATTCTATATGGACAACAAACAAATATTTATATCAGGAGCAACTAAAAAGGAACATTTATGGACTGTTGCCTGCAAGACCATGCTCATTGGAAAAAATCCCAATGCATGTCAAGCCCAACTTGACAGTCTTAGAGCAAAATACTTTGAACTCTGTGGCTATATACATAAAGGTCTCTATGTAAAATGGCCACACTTTGAACTCTCCCATCAAGTTTTTGAGGATGAGAATAAAAGCACTGAGGAGAGTGATCTTCTGACTGAACCTCAACTATCTAATCTACCAGTACCAAAACCATATACAGATAATGTAATGGTTGTGAAAAAAATAAATACCGGTGAAGAATATAGTGTTACAACTGAACCACCTGTTAACAATGTACATGTTACAAAACTGAGTAACGAAAGTGGTAAAAAAAACAATAACAGGGCTCCAGGTGATGAAAAGGTGGAAATGATGCTAAAGTTGTATCTTAAATATAAGAAGAATTTCCAACAGGAATATCGAAGGAAAGGTCTATGGGAAACTATCGCTATGGAGTTGGGAGAGGATGATGGGGAATATTGGCAGAAAAGGTTCTTGAATTACAAACAACATTATTTACGTCTGGTTGAAAAGAGACAGGCTAATGGCTCTGAAGGCATCAACTGGCCTTACATAGACTTATTCGATAAAATATTTGAAGATGATGTAGATTTTAATCGTAAATATATAAATCAAGAGTATAGGATGATTGAAAATCAAGCAATTTCAGAGGAACCTCCACAAGATTGGGATAATACTGAAATGACAGTACTCGTCAAATATTGCTATGATTGTTTTGATGAATTTGAAGATACAACGATACCGAACAGTTTTCTTTGGAACGAGATTGGTCGTCTTTTAGACAAGACTGCAGACGCTTGTAAGGAGAAATTTGAAGAAATGAAAAATGCTCATTTAGACAAATATATAGAAGGTGATTATGAGCTACGGAATCGCAAACCTATAGCTATATTATATGACAACATTATATCAATGGAGATTAAATCGGAAATGATTAAATTAATCAATAAACCGGAACAATTAGAGATATGGAAGACAGAAGAATTGGATGAGTTGGTGCAATTTTTTTATGATAATATTGAAATGTATAAAGACCTTATTTGTCATTACGTTTGTTGGGCAGCCCTTTTGAAAAAGTTGAAGAGGAATTTACAAAGTTGTAAAGGGCAATGGGAGGATTTAGTGACTCTTTACAAGACAATTTTGGATGATAAGAAGGAGAATCCTGATATGCAAATTGACTGGAGGTATATTGAACTGTTTGATAGGCTATTTGATTATGGTATGGATATCAATTTGCTTTCGGGGTATGAAAAGCTGCAGCCACCCACTCCGGAAACTGGGAAAGTAGGTGTAAAAAAAATAAAAATTAGAACCGAAGATTTGGGCGAGGAATTTTCAGAAGACGACGAATCGTACGACGAGAGAGGTTTCACCAAACGCACTAAACGCCGCGCTGGGGACTCCAAAGCCTTCAAAATACTAGAATATTACCAAAAGAACAAAGATAAATTTGCCACTACAAACAAAAACAAACATTCCTTATGGGAAGTCTTAGCCAAACAAATAGGCATCCCAGCGACTCAGTGTGCACATAGATTTAGAAATTTCAAACAAGTTTACACTGCATATGTCCAAAGAGAAATCAATAAACCCGAGATGCCAATTCTGTGGCCTTACTACGCTCTATGCAAGAAAGTATTTGGCTATAGAGCGATCAAAACTAAATTAAAGAATGGAAAACTGGATTCCGATGATAGTGAAGATTGGTCTGCAAAAGAAATTAAACAGCTTATAAACTATTTCTCGAAGAATTACGATCAAATCAACAATAATGTTGATGATAGTAGTCGCTGGTCAGATTTGGCTGGCGATATTGGCAAAGGTGAAACATCTTGTAAGGAGAAATTCTTGGAATTGAGGAAGTCTTATAGGAAATTGAAGACTATGAGGACAAGAAACCCCGAAGTGAAGATATCGTGGAAATACTTTGCGATGTTTGAGCAGATTTATAACTCTAAGGAAGAAAGTGGCCATGAAGCTATGGATGTGGATGACAATTTAGGAAGTTATGTGGAACTTCATGGTGATGTTTCTGATGAGAAGAATGAACAAGAAGACGAGGACTACCAATGCATAATAGTAATACCAGAAGGCGAAGACATGGAAAATGCTCAAATTATACTACAAGAACATCCACAATCATTAGAAGACATGGTGACCACACAAGAGATAGAAACAACTGAACCACAGAAAACTGTAGTCAAATGGAACAAACGAAGCAAGAAGCGACTTCTTATCCTTTACATAAACTACATAAGAACTAACAAAGGCAAAGAAATAAAACCAAAAGAGATGTGGGCAGAAATAGCAACAAAACTGCAAGACAAAACCCCATTGTCTTGCCGAAAAATGTTCGCGAAATTGAAGGCTAACCATAGACAAGTTGACAATGAAGATCCGAATGTTAAAAAGACTCCTTACTATGGACTAATGCAAAAAGTATTGCGTTTAAAACCTAAATTCGTAAAAACTAAGCAAAGTAAATCTTCAAAAGATGGTAAAATATACAAAGATGTTTCTTTACCTGATGAGAAAGTAGTCCAAGTGTTAGAGTATTACTTACAAAACATTGAAGACTTTCTTAGTCCCAGATATGAGAAGAAATATCTATGGACAGAGCTTGCCAACTATGTCTGTGAACCAGTTACTAAAGTGTTTAATAAACTGAATTACTTAAAACAGTGTTATAATAAGGACACTGATGAAATTGCCGGCCAAAAATCACCATTTGCTTCATTACTAAAAGAAATCTTGGCTAAAGAAATAGCTGTGAAATTGGTCATTGAAAATGAGCCCAAACCTGTTGTTGAAGACGAAGGCAATGAAGAAGTATGGACCGATGACGAAACGGAACAACTTTTGGAATGGTATTTGAGTAATTTGGAGAAGTTTAAAAATCCAAAATTCGTTAGAAGTTACTTGTGGATGGAAGTTACAGGCATTTTAAACAAAAGTGCTATTGCTTGTTCAAAGAAAATGTCTGAAATTAGGACTCAGTATAGAAATATGGTCAGGGAAACTCCTGGTGAATTGACTGAATGGAGATTTTTGGAGTTATGTCAGAAAATCTATGGGACAGGGAAGAAAGGGACTCCGTATACATCGAGCCAGTCTGATGCCTGA
Protein Sequence
MDQSIVVKTEMQSNGEILLFYVDENGGNEEGVLTTVQNIENQAIELQQDNYVLPDVGDLDNINIAQSSTQNDNWSEDDIKRLLVFYMDNKQIFISGATKKEHLWTVACKTMLIGKNPNACQAQLDSLRAKYFELCGYIHKGLYVKWPHFELSHQVFEDENKSTEESDLLTEPQLSNLPVPKPYTDNVMVVKKINTGEEYSVTTEPPVNNVHVTKLSNESGKKNNNRAPGDEKVEMMLKLYLKYKKNFQQEYRRKGLWETIAMELGEDDGEYWQKRFLNYKQHYLRLVEKRQANGSEGINWPYIDLFDKIFEDDVDFNRKYINQEYRMIENQAISEEPPQDWDNTEMTVLVKYCYDCFDEFEDTTIPNSFLWNEIGRLLDKTADACKEKFEEMKNAHLDKYIEGDYELRNRKPIAILYDNIISMEIKSEMIKLINKPEQLEIWKTEELDELVQFFYDNIEMYKDLICHYVCWAALLKKLKRNLQSCKGQWEDLVTLYKTILDDKKENPDMQIDWRYIELFDRLFDYGMDINLLSGYEKLQPPTPETGKVGVKKIKIRTEDLGEEFSEDDESYDERGFTKRTKRRAGDSKAFKILEYYQKNKDKFATTNKNKHSLWEVLAKQIGIPATQCAHRFRNFKQVYTAYVQREINKPEMPILWPYYALCKKVFGYRAIKTKLKNGKLDSDDSEDWSAKEIKQLINYFSKNYDQINNNVDDSSRWSDLAGDIGKGETSCKEKFLELRKSYRKLKTMRTRNPEVKISWKYFAMFEQIYNSKEESGHEAMDVDDNLGSYVELHGDVSDEKNEQEDEDYQCIIVIPEGEDMENAQIILQEHPQSLEDMVTTQEIETTEPQKTVVKWNKRSKKRLLILYINYIRTNKGKEIKPKEMWAEIATKLQDKTPLSCRKMFAKLKANHRQVDNEDPNVKKTPYYGLMQKVLRLKPKFVKTKQSKSSKDGKIYKDVSLPDEKVVQVLEYYLQNIEDFLSPRYEKKYLWTELANYVCEPVTKVFNKLNYLKQCYNKDTDEIAGQKSPFASLLKEILAKEIAVKLVIENEPKPVVEDEGNEEVWTDDETEQLLEWYLSNLEKFKNPKFVRSYLWMEVTGILNKSAIACSKKMSEIRTQYRNMVRETPGELTEWRFLELCQKIYGTGKKGTPYTSSQSDA

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-