Basic Information

Gene Symbol
-
Assembly
GCA_951449985.1
Location
OX602131.1:2819464-2825142[+]
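The location string can be split into its parts for downstream use. The sketch below is a minimal illustration; the `Locus` type and `parse_location` helper are hypothetical names, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for a parsed location string."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Locus:
    """Parse a string of the form 'SEQID:START-END[STRAND]'."""
    match = re.fullmatch(r"(\S+):(\d+)-(\d+)\[([+-])\]", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Locus(seqid, int(start), int(end), strand)


locus = parse_location("OX602131.1:2819464-2825142[+]")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span_bp = locus.end - locus.start + 1
print(locus, span_bp)
```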

Transcription Factor Domain

TF Family
MYB
Domain
Myb_DNA-binding domain
PFAM
PF00249
TF Group
Helix-turn-helix
Description
This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  6   3         3.1e+03   -1.3   0.0   23        42      359       377     354       380     0.82
2  6   0.0015    1.6       9.3    0.4   10        45      440       480     430       481     0.79
3  6   0.23      2.4e+02   2.3    0.1   20        45      596       622     578       623     0.80
4  6   0.064     67        4.1    0.0   22        45      703       725     696       726     0.89
5  6   8.2e-05   0.086     13.3   0.4   13        40      846       880     832       886     0.81
6  6   0.03      32        5.1    0.8   2         15      1033      1046    1032      1055    0.90
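The per-domain rows above are whitespace-separated, so they can be read into typed records and ranked. The following is a minimal sketch: the `DomainHit` type, the `parse_row` helper, and the `I_EVALUE_CUTOFF` value are illustrative assumptions and do not reflect the threshold used to build this record.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record for one row of the per-domain hmmscan table."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# Rows copied from the table in this record.
ROWS = """
1 6 3 3.1e+03 -1.3 0.0 23 42 359 377 354 380 0.82
2 6 0.0015 1.6 9.3 0.4 10 45 440 480 430 481 0.79
3 6 0.23 2.4e+02 2.3 0.1 20 45 596 622 578 623 0.80
4 6 0.064 67 4.1 0.0 22 45 703 725 696 726 0.89
5 6 8.2e-05 0.086 13.3 0.4 13 40 846 880 832 886 0.81
6 6 0.03 32 5.1 0.8 2 15 1033 1046 1032 1055 0.90
"""


def parse_row(line: str) -> DomainHit:
    """Convert one whitespace-separated row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 columns, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *map(int, fields[6:12]),
        float(fields[12]),
    )


hits = [parse_row(line) for line in ROWS.strip().splitlines()]

# The hit with the lowest independent E-value is the strongest single domain.
best = min(hits, key=lambda h: h.i_evalue)

# Assumption: an arbitrary, permissive cutoff chosen only for illustration.
I_EVALUE_CUTOFF = 10.0
passing = [h.index for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

print(best.index, best.ali_from, best.ali_to, passing)
```

Ranking by i-Evalue rather than by score is a design choice here; either column could serve, and a stricter cutoff would keep fewer domains.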

Sequence Information

Coding Sequence
ATGGATCAAGTTGTAGTCAAAACTGAGATGGGGACAAATGGGGAGATATTACTCTTCTATGTGGACGAAAATGGTGTCACAGAGCAAGCAGTCTTATCAGGTGTGGATGGTATTGAAAATCATGGCCTACAACTGCAGCAGAATTCTGAAGGGGAATACATTCTAGCAGATGCCTGTGAGGTAGAGGATGCTAGACATGAGGTACCGTTATTGGATGCCATTCAGCAAGAAGGTATTGAAGAACCCATAGAAAACACCAGTGATGATGTGGAATCATGGCTTGAAGATGATGTGAAACGCCTATTGGTATTCTACAATGACAACAAGCAAACATTCAGAGCAGTATTCCAAAAAACTCACTTATGGACCGTTGCATGCAAGACAATATTGATTGGTAAAAATCCTGATGCATGCGAGTCCCAATTACAAAGTTTAAAGTTGAAATACATTCAAATTCAAGGTCATTTGCAGAGAGGTATTTACATCAAGTGGCCACTCTTTGAACTCTGCCATCAAGCATTCCAGGATGATGAATCTGTCATGTTATTAGATGATGAGGACACTCAGACTGACCAGCTATTGAAGTTCCCAGCTACTccaaaattagagccgccatATCAGGAAAATATTGTAGTTGAAAAGAGTAACAAAAATTCAAGTGATGAAAAAGTTGAAACAATGCTGACTTTGTATCTGAGGCATAAAAAGGATTACCAGAACCAGTATTGGACCAGGGGGCTGTGGGAAACGATTGCTCTGGAGCTGGGTGAAGATGATGTTGAATATTGGCACAAACGCTTCCTCAATTACAAACAGCATTACATAAGGGTATTAGAGAAACGCATAGAAACTGGTTCTGACAGTAGTAACTGGCCATATACAGAATTGATGGATCAAATATTTGCAGATGATGAGGAATTTCAAAGGAAGTATGTGAATGATGTGAAAAATATAGTTACGGAAAACATTGCAGATGATAATTACTGGAATGATACTGAAAATATAGTTTTGATCAAGTACTGCTTTGACTGTTTTGATGAGTTTCAAGATCCAACCATTCCGGACAGTTTCCTTTGGAGCGAAATTGGTAGACTTTTAGACAAGCTGCCTGATAGATGTAAGGAGAAATATGATGAGCTAAGAACTATACATTTGGACAAATATGTAGAAGGAGCTTACAGTTTACGCGCCCGTAAGCCTTTGCACTTATTATTCGACCATTTATTATCAAAAGAAGTAGAAATCCAGATGATAAAGGATAAAAATAAGCTAGATCAGGTGGAAATTTGGAAAACTGAGGAATTGGATGAGTTAgtaaaatttttttatgaaaatattgagatGTTCAAAGATAGGGTATGCTACTTTGTCTGTTGGTCAGCGGTGAGCAGGAAGCTTGGTAGAAAGGTTTTCAGCTGCTATAAGCACTGGGTTGAGTTGAAAGGGCTTTATAAATCAATACTCGATGACAAGAAGGAGAATTCGGATATGCAGATAGATTGGAGGTATATAGAGCTTTTTGACAGAATATTTGACTATGGCATGGATACCAACCTACTTATTGGATATGAGAACCTGAAAGAGAACCAAGTTTTAGATATTGGAAGAGTTGGTGtaagaAAATTGAACATAAGGTCGGAGAACAATTCAACAGAAAATGGTTATGCAAGCAAAGATGAATCCCACAGTGAGGTATCTACAAAGCGTCACAATATAAACGATTCAAAAACTGTCAAGATATTGGAATTCTACATCAAGCATAAGGATAAATTATCAAAACCTCAGGGCAACAAGTCAGTGTGGAACACCCTGGCTAAGCAACAAGGAATCTCGGTCGATCAATGCATAAGCAAGTTCAGAAGCATCAAACAGCAGTATGTATCCTATGTACAGAAAGAAGTAGAGGGTTCTGAGCCGATAACTTGGCCGTACTACACTTTATGCAAGAAAGTGTTCGGCTATCGAGccataaaatctaaaatcaagAATAAAAAGCTAGA
ATCTGATACTGGATTCGATGACTGGCTCGATTCTGACATCACGAAAATTATAAACCATTTTGACAGAAACTTTGACGCAATCAATAATGACACAGAGGACATCAGTAAATGGGCCGACTTGGCCTCTGAGCTTGGTAAAAGCGAGTTTGCATGTAAAGAGAAGTTCTTGGAGCTGAGGAAGTCTTACAGAAAGTTACGGACGAGATCAAGGAACCCTGATGTGAAGATATCGTGGAAGTTTTTCCATGCGTTGGACCAGATCTATATGTCGAGAGAGGATAGCAACTATGAGCTGATGGATGTTGGTGAGCAAGAGAGGAATGATGGGTACTTGGAGGAGAGGCTGATGGATACGCAAgaagAAGACGACTTCCAATGCATCATCGTGATACCAGAAGGCCAAGAACTGACCGACATTAACAACGCTCAAATAATTGTGCAAAAGAACTCAGATGTTGAAACACCGATTGTTAACAAACAACCCAAGGTTATCAAATGGACTAAACGCACAAAGAAACTACTACTTATCCACTATATAAATTACCTAAAAACACACAAAGGCAATGAGATCAATGCCAAAGAAATGTGGAGAGAAATAGCATCGAAATTGTCGAACAAAACTCCCCTATCGTgcagaaaaatatttgctaaacTTAAAGCGAGTCTTGTACAAGCACCAAAAGACGAAGACAGTTTGAACAAGTTGCCATATTACAGGTTATTGCAAAAGATTCTGGCAATGAAACCAAAGTTTGCCAAGACTGCACAGGATAAAGTACAGGGGAAGAATTTTAAAGATGTCGacttgccagtagccaaagtAGAAAATGccttgaattattatttacagcaTATTGAAGATTTCGCCAGTCCGAGGTATGAGAAGAAATATCTTTGGACAGAACTAGCTAATTATGTTTCAGAGCCAGTATCAAAAGTCTACAACAAAATCAATTTCTTAAAGCAATCTTTCAATAATAATGCGCTTTTGAATGAAGTGCTTCCGTATGGTGAGGTTTTAAAAGAAATAGCGGCTAAGGAAATCGCTATAAGCTTAGTAACGGAAACAGTAGAGAAAAATAATGAGGACTTTAATGAAAACTGGACAGACGAGGAGACAGAACAATTGTTAGAGTGGTATCTGAGTAATTTAGACAAGTTTAAAAATCCAAAATTTGTCAGACGTTATCTTTGGATGGAAGTTTCGAGTATTTTGCACAAAAGCCCATTGGGTTGTTCGAAGAAAATGTCCGAAATCAGAACTCAATATCGCAACATGGTCAGGGAAAATCCTGGACAGTTAAACGCTTGGAGGTTCTACGACTTGTGCCAGAAGATCTATGGTACAGGCAAGAAGGGCAGTGCTACGAGTGAAAGTAATTGA
Protein Sequence
MDQVVVKTEMGTNGEILLFYVDENGVTEQAVLSGVDGIENHGLQLQQNSEGEYILADACEVEDARHEVPLLDAIQQEGIEEPIENTSDDVESWLEDDVKRLLVFYNDNKQTFRAVFQKTHLWTVACKTILIGKNPDACESQLQSLKLKYIQIQGHLQRGIYIKWPLFELCHQAFQDDESVMLLDDEDTQTDQLLKFPATPKLEPPYQENIVVEKSNKNSSDEKVETMLTLYLRHKKDYQNQYWTRGLWETIALELGEDDVEYWHKRFLNYKQHYIRVLEKRIETGSDSSNWPYTELMDQIFADDEEFQRKYVNDVKNIVTENIADDNYWNDTENIVLIKYCFDCFDEFQDPTIPDSFLWSEIGRLLDKLPDRCKEKYDELRTIHLDKYVEGAYSLRARKPLHLLFDHLLSKEVEIQMIKDKNKLDQVEIWKTEELDELVKFFYENIEMFKDRVCYFVCWSAVSRKLGRKVFSCYKHWVELKGLYKSILDDKKENSDMQIDWRYIELFDRIFDYGMDTNLLIGYENLKENQVLDIGRVGVRKLNIRSENNSTENGYASKDESHSEVSTKRHNINDSKTVKILEFYIKHKDKLSKPQGNKSVWNTLAKQQGISVDQCISKFRSIKQQYVSYVQKEVEGSEPITWPYYTLCKKVFGYRAIKSKIKNKKLESDTGFDDWLDSDITKIINHFDRNFDAINNDTEDISKWADLASELGKSEFACKEKFLELRKSYRKLRTRSRNPDVKISWKFFHALDQIYMSREDSNYELMDVGEQERNDGYLEERLMDTQEEDDFQCIIVIPEGQELTDINNAQIIVQKNSDVETPIVNKQPKVIKWTKRTKKLLLIHYINYLKTHKGNEINAKEMWREIASKLSNKTPLSCRKIFAKLKASLVQAPKDEDSLNKLPYYRLLQKILAMKPKFAKTAQDKVQGKNFKDVDLPVAKVENALNYYLQHIEDFASPRYEKKYLWTELANYVSEPVSKVYNKINFLKQSFNNNALLNEVLPYGEVLKEIAAKEIAISLVTETVEKNNEDFNENWTDEETEQLLEWYLSNLDKFKNPKFVRRYLWMEVSSILHKSPLGCSKKMSEIRTQYRNMVRENPGQLNAWRFYDLCQKIYGTGKKGSATSESN
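The protein sequence can be checked against the coding sequence by translating codons. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly; `translate` is a hypothetical helper. It is applied only to the first 30 nucleotides of the coding sequence and to its final three codons.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered T, C, A, G
# at each position. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 0; unknown codons become 'X'."""
    seq = cds.upper()  # the record mixes upper- and lower-case bases
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 nt and last 9 nt of the coding sequence in this record.
cds_start = "ATGGATCAAGTTGTAGTCAAAACTGAGATG"
cds_end = "AGTAATTGA"

print(translate(cds_start))
print(translate(cds_end))
```

Translating the full coding sequence the same way and comparing it with the protein sequence (minus the terminal stop) would confirm that the two fields agree.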

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-