Basic Information

Gene Symbol
-
Assembly
GCA_949315895.1
Location
OX438874.1:8096267-8105916[-]

Transcription Factor Domain

TF Family
MYB
Domain
Myb_DNA-binding domain
PFAM
PF00249
TF Group
Helix-turn-helix
Description
This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 8 0.15 3.4e+02 2.7 0.1 3 13 69 79 67 83 0.89
2 8 0.3 7.1e+02 1.7 0.1 21 44 340 364 311 366 0.68
3 8 3.8 8.9e+03 -1.8 0.1 3 15 411 423 409 425 0.86
4 8 0.26 6.1e+02 1.9 0.1 27 44 445 461 441 463 0.81
5 8 0.00025 0.59 11.6 0.2 12 46 561 605 559 605 0.77
6 8 0.017 41 5.7 0.1 3 44 656 705 654 707 0.82
7 8 0.00057 1.3 10.4 0.0 23 39 864 880 853 881 0.88
8 8 0.62 1.5e+03 0.7 0.2 4 13 1041 1050 1039 1066 0.91

Sequence Information

Coding Sequence
ATGGAACAAATCATTGTTAAAACTGAGGTGCAGGCAAATGGTGAAATATTGCTGTTCTATGTTGatgaaaatgaACAACAAAATGGTTCAATTGAAAGCCAATTACTGCAAATGCACGAGTCTAATGATATCAATTATCAAGTTCAGTTGGATGAATCTGCTGAAAGTACTGATTGGGCACAGCCAGAACCGGTCAGAAAATTTCGATGGACAGAGGAGGAAGTTCAAAGGCTACTTGTGTTTTATGTTGATAACAAAGATACATTTGTCACTGGAGCAGCTAAGAAAAAAGACTTATGGGCTGTTGCCTGCAAAACTATGCTAGTTGGTAAGAACCCTGATACCTGTGAAGTCAAGTTGCGTAATTTGAAACACAAGTATTCAGCTCTGCTCTTGGAGCAACAGAGAGGTGTAGATATTACCTGGCCATTGTTTAGTCTATGCCATCAAGCGTTTCATGATGATACTTATGTACAATATCTCCTTCGGGAACATTTGGCTCAGGAGAAAAAAGAAGTTAAAGTGCCCATTGAATCCAAACCTATAGAAGATACAAACAGTAGCATTATAGTTGTCAAAAATGTAccaaatacaaacaaaacatcAGGAGATGCAAATGTTGAGGCGATGTTAAATTTGTACttgagatataaaaaaaactttcagaaAGAATATTGGCATAAAGGGTTATGGGAGACAATTGCCATAGAATTAGGAGAGGAAAACGCAGATTATTGGCATAAACGGTTCTTGAATTTTAAACAACATTACCTAAGATTGCTGGCTAAGCGAGAATCCCAAGGCAATGACAGTATTAATTGGCCTTATATGCACCTGTTTGATCAGATATTTAAGGATGATCCACAATTTCAGAGAAAGTTTCAAAATGTTGAAAATGGTATTACGAAGACAGAAGTTTTACCAGTTGATTGTAATGAATGGAATGAGACGGAAAAAATGATTCTAGCGAAGTACTACTTTGACTGTTATGACGAATTTCAAGATAAAACAATACCAAATAATTTTCTTTGGACTGAAGTTGGTCGATTAGTAGATAAGAAGCCAGAGGCATGTAGGTTGAAGTTCGAGGAATTGAAACGGGCTCACTTGGACTTGTATCTTGAAGGTGGTTATAATCTAGAAACACGCAAACCTCTAGCAATATTATTCGATAATATAATAGCAAAAGATTCTGAACTagaaattaatacaaaaaaagcaTATGGCGAGCAATGGAGTACCGAAGACTTAGATACGTTGGTACAGTTTTTGTATGACAACATGATTGTTTTGAAAGATCCAGTTTGTTTCTATGTATTTTGGCCATGTTTGGcacaaaaatttaacaaaagtgTGGCTTCTTGTAAGGAGCAATGGGAGGAATTAAAGACGCTCTACAAATCAATTTTAGATGACAAGAAAGAGAACTCGGACATGCAAATAGATTGGAGGTACATAGAATTGTTTGACAGAATATTTGATTATGGCATGGACACTAATTTGCTTAATGAATTCAAACAGCTAAAAAAGAATTCAAGCAGGAGTGATAAAATTGGAGTAAAAAAGGTGCGTATCAAGTCAGAGAATGAATTACATGAAATCTCAGACGACGAGGAGTTTGACGAAAGAGGTTTTACCAAGCGCACCAAACGCGGGGTTGGTGATTCGAAAGCATTCAAAATACTCGAGTACTATCAGAAGAATAAAGACAGGTTCTCAACAACTCGACGTAAGAAGCAAGTCCTGTGGGATACCCTGGCTCAACAGATTGGAATAACGGGAGAACAATGTGCTCataggtTCAGAAATCTGAAGCAAGTCTACATGTCGTACGTCCAACGAGAAATCACAAAACCAGAAATGCCAATTCTTTGGCCTTATTACGCATTATGCAAAAAAGTCTTTGGGTATAGAGCCATAAaatctaaattgaaaaataGCAAATATGATTCAGAAGAAGCGGAAGAATGGGCTCCTAaagaaatcaaacaaataatacaatatcttGCAAACAACTTTCATGAAATTTCCGGTACCGATGATATAAGCAAATGGACACGTTTAGCTCGGGATATGAACAAATCTGAAACGTCAATTAACAGTAAGTTCTTGGAGCTTCAAAAATCATATAAGAGGTTGAAGACGATGAAAGAAAACCATCCCAAGTGTAAAGTTTCATGGAAATACTACAATTTGTTTGATAGTGTGTATGCTAATTTGGGAATCAATGAGATAGTCGAAATGGAGGTTGAAGAGTTAGATGATGAAGATGAAATGGTTACTGAAGAAGTTATGGAATCACAGGATGATGATGATTATCAATGCATCATCGTGCTACCAGAAGGTCAAGAATTATCAGACATTAGTAACGCTCAAATCATCATACAAACATCAACTGGTGAAGTCGTgcaacagacagacagcgaaaccGCACAGCAAACTTACCATGAAGTCCCAGTTGAAAATATCGAGCAAATACAGAAACCAAGAGTAACAATGTGGAACAAAAGGTCCAAAAAGAAACTACTGCTTCTATACTTGAACTATATCAGGTCGAGAAAAGGTACAGAGATCAATCCCAAAGAAATGTGGACGGAAATCGCGGCACAAATGCCAGAAAAAACGCCATACTCATGCAAGAAGATGGTAGCTTTATTAAAAGCTAAACACTTAGAACAGCCTAGCAATGAAGATGCTAGTAAAAAGAAGTCACTGTACCACACgcttattgaaaaaataatcccTATAAAGGTGAAATTCGCGAAGAAATGTCAGAGTAATTCAAAAGATCGTAAAACGTACAAAGATGTCCCTCTTCCGACGCCCAAAGTGGAACTGGCTCTTCAATATTATTTGCAGAACATTCAAGATTTTACCAGCCCTAAATTTGAGAAAAAGTATTTATGGACAGAGCTAGCTAACTTTGTGTCTGAACCAGCtaataaactatttaataaGATCAACTATTTAAAACAATCCTTTGATGTTGAAACTGAAGAAGTGGCTGGGGAAAAGACGCTGTTTAGTGAACtattaaaagaaattattaaCAAAGAAAATACGCTTAAAACAACCGCCAAGACTGATCCTATATGCTTAGAAGAAATCAAGGAAGTGATATGGTCCGACGAGGAAACTGAACAACTCCTTGTATGGTACTTGGCTAATTTGGAGAAGTTTAAAAATCCTAAATTCGTTCGGAAATATCTTTGGATTGAAGCCTCTGAGATTTTGAAGAAAAGTCCTTTGGCTTGTTCGAAGAAAATGACAGAAATAAGAACGCAGTATAAGACAATGATAAAGGAGAATCCCGAGGAGTTGAACAGCTGGAGGTTTTATGAGCTGTGCCAGAAGATTTATGGCACGGGTAAAAAGAACGAACCCAGTATGCAGCATGTGGAAGTAACTGCTAGTATAATGGAATAA
Protein Sequence
MEQIIVKTEVQANGEILLFYVDENEQQNGSIESQLLQMHESNDINYQVQLDESAESTDWAQPEPVRKFRWTEEEVQRLLVFYVDNKDTFVTGAAKKKDLWAVACKTMLVGKNPDTCEVKLRNLKHKYSALLLEQQRGVDITWPLFSLCHQAFHDDTYVQYLLREHLAQEKKEVKVPIESKPIEDTNSSIIVVKNVPNTNKTSGDANVEAMLNLYLRYKKNFQKEYWHKGLWETIAIELGEENADYWHKRFLNFKQHYLRLLAKRESQGNDSINWPYMHLFDQIFKDDPQFQRKFQNVENGITKTEVLPVDCNEWNETEKMILAKYYFDCYDEFQDKTIPNNFLWTEVGRLVDKKPEACRLKFEELKRAHLDLYLEGGYNLETRKPLAILFDNIIAKDSELEINTKKAYGEQWSTEDLDTLVQFLYDNMIVLKDPVCFYVFWPCLAQKFNKSVASCKEQWEELKTLYKSILDDKKENSDMQIDWRYIELFDRIFDYGMDTNLLNEFKQLKKNSSRSDKIGVKKVRIKSENELHEISDDEEFDERGFTKRTKRGVGDSKAFKILEYYQKNKDRFSTTRRKKQVLWDTLAQQIGITGEQCAHRFRNLKQVYMSYVQREITKPEMPILWPYYALCKKVFGYRAIKSKLKNSKYDSEEAEEWAPKEIKQIIQYLANNFHEISGTDDISKWTRLARDMNKSETSINSKFLELQKSYKRLKTMKENHPKCKVSWKYYNLFDSVYANLGINEIVEMEVEELDDEDEMVTEEVMESQDDDDYQCIIVLPEGQELSDISNAQIIIQTSTGEVVQQTDSETAQQTYHEVPVENIEQIQKPRVTMWNKRSKKKLLLLYLNYIRSRKGTEINPKEMWTEIAAQMPEKTPYSCKKMVALLKAKHLEQPSNEDASKKKSLYHTLIEKIIPIKVKFAKKCQSNSKDRKTYKDVPLPTPKVELALQYYLQNIQDFTSPKFEKKYLWTELANFVSEPANKLFNKINYLKQSFDVETEEVAGEKTLFSELLKEIINKENTLKTTAKTDPICLEEIKEVIWSDEETEQLLVWYLANLEKFKNPKFVRKYLWIEASEILKKSPLACSKKMTEIRTQYKTMIKENPEELNSWRFYELCQKIYGTGKKNEPSMQHVEVTASIME

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-