Basic Information

Gene Symbol
-
Assembly
GCA_907165245.1
Location
OU015611.1:5432270-5437432[-]

Transcription Factor Domain

TF Family
MYB
Domain
Myb_DNA-binding domain
PFAM
PF00249
TF Group
Helix-turn-helix
Description
This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 8 1.4 2e+03 -0.6 0.0 23 45 110 133 79 134 0.54
2 8 8.2 1.2e+04 -3.0 0.0 17 31 234 250 230 251 0.73
3 8 0.0038 5.4 7.6 0.0 21 44 352 376 344 378 0.86
4 8 0.00076 1.1 9.9 2.8 3 44 425 475 424 477 0.78
5 8 0.021 30 5.2 0.1 22 45 598 623 580 624 0.81
6 8 0.022 32 5.2 0.4 9 45 675 723 671 724 0.76
7 8 2.7e-05 0.039 14.5 0.2 3 40 893 941 892 946 0.86
8 8 0.18 2.6e+02 2.3 0.2 4 15 1098 1109 1096 1142 0.90

Sequence Information

Coding Sequence
ATGGAACAACAAATAGTGGTTAAAACCGAGATGGGCACGAATGGAGAAATACTGTTATTCTATGTTGATGAAAATGAAGAAAGCGTCTTGACAGCAGTTGAAAACATGGATGGTCAAGGCATACAACTACAGCAGGATGCTCAAGGTGAATACATATTAGGGGATGCTATAGATATTGAAGATGCTGTTCAAGATGATCAAGAGGAATCTGAAAGCATCCAAGGAAATCCATACACTTGGCTGGATGAGGAAACAAAACGCCTCCTCATGTTTTATAATGACAACAAACAGACATTTTTATCAGGCGTCACACAAAAGGCCCATTTGTGGTCTGTTGCTTGTAAGACAATGTTACTTGGCAAAACTGCTGATTCTTGTGAAGCAGAGTTTCAAAATCTTAAGATGAAATACATTCAAATCCGTGGTCATATGCAAAGGGGCATCTACATCAAATGGCCGTTCTTCGAACAGTGTCATGAGGCATTCCAAGACGATGAATCTATCAAATTAATGTACACAGAAGATACTGAAACTGATCAATTAATAAAAATACCAGCTACACCTAAAACAGTGCCTGTGGTTCCTCAAATAGAGAATGATGGTATTATGGTGGTCAAAAAAGTTAATAGAAATTCTTCTTGTGATGAGAAAGTTGAAATGATGCTCAATTTATATCTCAAATACAAGAAAGATTATCAAAAGCAATATCGGAGGAGAGGTCTGTGGGAAACAATTGCATTAGAATTGGGCGAAGATGACCCAGACTACTGGCACAAACGTTTCTTAAATTTCAAGCAACATTACATTCATATACTTGACAAACGCAAAGAGAGTGGTGATGATACAGTCGGCACTTGGCCATATACAACATTGTTTGATAAGATATTTGAAGATGATGAGGAATTTCAAAGAAAATATGCCAATTCTCAGGAGTCTCAGGTGGTCACAGATAATGTCATTGTCAATGAAAATTATTGGAATGATACTGAATTGACAGTTTTAGTCAAATATTGTTTTGATTGTTTAGATGAGTTCCAAGATAAAACTATACCTAATAATTTTCTTTGGAATGAAATTGGACGGCTTTTAGATAAAACACCTGAGAAGTGCAAGGAAAAGTATGAGGAGCTGAAGAATGATCACCTAGGTAGTTACATAGATGGTGGTTATGTCCTACGTACCCGTAAACCTCTAGCAATATTATTTGATAACATAATATCCAAAGAAGTTGAACAAGAATTGCTTTTTAGTCAAAAGGCTGATCATTATGAAGAGTGGACTAATGAGGAACTAGATGAACTTGTGCATTACTTTTATGAGAACATTGACATGTATAAAGATCCAATATGTTATTTTGTTTGTTGGGCTAAAATTGGAAAAAAATTGCAGCGAAATGTCATAAGTTGTAGAAAAAAGTGGGAAGATCTTAAGTTGTTATACAAATCCATATTAGAGGATAAGAAGGAGAATCCTGATATGGAGATAGATTGGAGATACATAGAGTTGTTTGACAGGATATTTGACTTTGGCATGGATACTACCTTGTTGGCTGGCTATGACACTATAAAAGAAGAAAGGTCCAATAACCTCAACAAAATTGGGGTAAGGAAATTAAACATTAAAATGGACAGCACGCCCAAGGCCATTGAGCCATTGCTATCAGACAATGAGGAAGAATCTGAAGACAATGGACCCACGAAACGTGCAAAAGAATATATTGGATCATCAAAAACTGTCAGGATCCTTGAATTTTACCATAAAAACAAGGAAAAGTTCAATTCTCAACCTAACAAATATGGGTTATGGAACATAATAGCGAAACAGCTGGGGGTAACTGCCAGTCAATGCATGAAGAAATTTAAGAGTTTAAAACAACATTACACATCCTATGTACAAAAGGAATTGAACAATGAACCGGTAACATGGCCACATTATGCACTTTGTAAAAAAGTCTATGGCTATCGGGCTATAAGAACTAAACTGAAGGGCAAAAAGGTAGATTGGGAAGCGACAGACTGGTCTGATGTACAAATTAAGAAGTTAATTCACTATTTTGGTAACAATTTCGAAGATATTGACCAAAACACTGATGATGTTAGTAAATGGTCAGTGTTAGCGGAACAAATAAAAAAGTCTAATTATGCGTGTAAGGAAAAGTTTTTGGAATTGAGGAAGTCTTATAGAAAGTTGCGGACGAGGTCACGGAACCCCGACGTGAGGATATCGTGGAAGTTCTTTCAATTATTGGATCAGATTTACAGCGCAAAGGAAGACAATGATCAATTGATGATGGAAGAGCATATTGATGATGTACACAATGACTCTGATTTCCGGGTTGATACACAAGAAGATGATGAGTACCAATGCATCATCGTATTACAAGAGGGTCAAGATCTAAGTGACATAAGTGAAAATCAAGTTATCATTCAAAGTAAAACCAATATTCAAGCAGAAGAAATAAATGATCAAGATAATCAAGACATCATTGATGATGATCAAGTTATTGATCAAAATAATCAAGTCATTATAAAAGTAGATCAATTGGCTGATCAAAATAATATCATAAATAATGAAGATCAAGTTATTGATCAAAACAGTCACATGATTAATGAAAATCAAATTGTAGGTCACGGTGATCAAACATATATAACCCCACAGCCAGAGAAGACATTGACAAAATGGACTAAGCAATCCAAGAAAAGACTACTTGGTTTATACATCAATTACTTAAGAACCCATAAAGGCAAAGAGATAAATGCACGAGTAATGTGGAGAGAAATATCATCAAAACTTCAAGGCAAAACACCTCTATCATGCAGGAAAATGTTCGCCAAACTAAATGCTAATCATAAGAAAATAGCCCCTGAGGATATCAACATGAAAAAGACCCCATACTACACATTACTTGAAAAAATACAAGCTATCACACCAAAATTTACCAAAACCAATCAAAATAAAGTGTTGGAAGAAGGAAAAACATACAAAAATGTTCAAATGGATACAAGTAAGATTGAGCAAGCATTACAATACTATCTGTATCATATAGAAGACTTTGCCAGTCCGAGATATGAGAAGAAATACCTATGGACAGAGCTTGCTAACTTTATTTCTGAGCCTGTTACAAAAGTATTTAATAAGATAAATTATTTGAAACAATGTTTCAACAATGACGATCCAGATGATATCGAAATAGGTCCATACAGAGAGGTCTTACAAGATATCACAGCTAAAGAATTAGCTATAAAACTTGTGACTGACACCGTTGTACAAAACGAAGACTTCAATGAAATATGGACAGATGAGGAAACTGAAATGCTTCTGGAATGGTATTTAAGTAATTTAGATAAGTTTAAAAATCCAAAATTCGTCAGGAGCTACCTTTGGGTGGAGGTTTCGAGTATTCTGAAGAAGAGCCCCTTAGATTGTTCAAAGAAAATGGGAGAAGTTAGGACTCTGTATAGGAATATGGTGAGGGAAAATTCAGAGGTATTGAGTACTTGGAGGTTCCATGATCTGTGCCAGAAAATCTATGGTACAGGCAAGAAAGGTAATCCTGCGAGTAATTAG
Protein Sequence
MEQQIVVKTEMGTNGEILLFYVDENEESVLTAVENMDGQGIQLQQDAQGEYILGDAIDIEDAVQDDQEESESIQGNPYTWLDEETKRLLMFYNDNKQTFLSGVTQKAHLWSVACKTMLLGKTADSCEAEFQNLKMKYIQIRGHMQRGIYIKWPFFEQCHEAFQDDESIKLMYTEDTETDQLIKIPATPKTVPVVPQIENDGIMVVKKVNRNSSCDEKVEMMLNLYLKYKKDYQKQYRRRGLWETIALELGEDDPDYWHKRFLNFKQHYIHILDKRKESGDDTVGTWPYTTLFDKIFEDDEEFQRKYANSQESQVVTDNVIVNENYWNDTELTVLVKYCFDCLDEFQDKTIPNNFLWNEIGRLLDKTPEKCKEKYEELKNDHLGSYIDGGYVLRTRKPLAILFDNIISKEVEQELLFSQKADHYEEWTNEELDELVHYFYENIDMYKDPICYFVCWAKIGKKLQRNVISCRKKWEDLKLLYKSILEDKKENPDMEIDWRYIELFDRIFDFGMDTTLLAGYDTIKEERSNNLNKIGVRKLNIKMDSTPKAIEPLLSDNEEESEDNGPTKRAKEYIGSSKTVRILEFYHKNKEKFNSQPNKYGLWNIIAKQLGVTASQCMKKFKSLKQHYTSYVQKELNNEPVTWPHYALCKKVYGYRAIRTKLKGKKVDWEATDWSDVQIKKLIHYFGNNFEDIDQNTDDVSKWSVLAEQIKKSNYACKEKFLELRKSYRKLRTRSRNPDVRISWKFFQLLDQIYSAKEDNDQLMMEEHIDDVHNDSDFRVDTQEDDEYQCIIVLQEGQDLSDISENQVIIQSKTNIQAEEINDQDNQDIIDDDQVIDQNNQVIIKVDQLADQNNIINNEDQVIDQNSHMINENQIVGHGDQTYITPQPEKTLTKWTKQSKKRLLGLYINYLRTHKGKEINARVMWREISSKLQGKTPLSCRKMFAKLNANHKKIAPEDINMKKTPYYTLLEKIQAITPKFTKTNQNKVLEEGKTYKNVQMDTSKIEQALQYYLYHIEDFASPRYEKKYLWTELANFISEPVTKVFNKINYLKQCFNNDDPDDIEIGPYREVLQDITAKELAIKLVTDTVVQNEDFNEIWTDEETEMLLEWYLSNLDKFKNPKFVRSYLWVEVSSILKKSPLDCSKKMGEVRTLYRNMVRENSEVLSTWRFHDLCQKIYGTGKKGNPASN

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-