Basic Information

Gene Symbol
-
Assembly
GCA_963556515.1
Location
OY748314.1:7753520-7780900[+]

Transcription Factor Domain

TF Family
MYB
Domain
Myb_DNA-binding domain
PFAM
PF00249
TF Group
Helix-turn-helix
Description
This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 10 0.076 1.6e+02 3.8 0.0 1 23 142 167 142 192 0.74
2 10 0.33 7e+02 1.7 0.0 1 22 460 481 460 503 0.78
3 10 0.35 7.5e+02 1.6 0.0 1 22 722 743 722 764 0.79
4 10 0.33 7e+02 1.7 0.0 1 22 972 993 972 1015 0.78
5 10 0.33 7e+02 1.7 0.0 1 22 1160 1181 1160 1203 0.78
6 10 0.33 7e+02 1.7 0.0 1 22 1378 1399 1378 1421 0.78
7 10 0.33 7e+02 1.7 0.0 1 22 1596 1617 1596 1639 0.78
8 10 0.33 7e+02 1.7 0.0 1 22 1814 1835 1814 1857 0.78
9 10 0.33 7e+02 1.7 0.0 1 22 2032 2053 2032 2075 0.78
10 10 0.33 7e+02 1.7 0.0 1 22 2149 2170 2149 2192 0.78

Sequence Information

Coding Sequence
ATGAAATTAAAGGATATTTTTCAAGATGTCGTTGAGTTAGAGGAACCAGAATGTATGGATAAAATACTAACATACTTAGAAGAAATAAAGGAAGAAGATGATTTATTCATTGTGCAAACAAATTCCGTGTCACGACCATTGAAGGAAAATAGTGAGAATAGTATTGATAGATATAGAACTGATGCACTTTGCAAAGCTGAACATGATCATCTACTACAGAACTGGAAGAACTTTGTACAGGAGTATAATGTACCAGATAAATTAATATGTTTGGCACGCTGGAGAAATAAACGTAAAAGTAGATTACCAACCACACCTGAAGAATGCGCTAGACGTTTCGTTTTGGCATATATGGCTCGAGGACTAAAAAGAACTATATTTCATGTATTCAGACATATACAAACAGCTTTTGGAAGTCCTGTAAAAGGTGCGTACTCACCTGAAGAGGAGAAAATGATCAAAGTTTGTTTCATACATCACCCAAACAATGCTGTGACGCTATCTAAAGTTTTGGGACGAGAACCTCGGGGAATATATAAAAGACTAAAGCAAATGTATAATGGTAAACCagacaaaaagaaaattaaatggaCCATACCATTGGCATCAAAGTTTGTCAAACTGTTGATGAAATATACAGGTTTGCCATTAGAAGAACTGCAATATAAGAGGTTCGACAAACAAGTTTGGTTGAAACTTGAGGAAGATTTTGACCACAACTCTAATTACCTGCAAAACTTCTGGTATTACACGCTTCATGTTCAAGTGTGCGTTAAAACTGATGTCAAGTTCAGAGGTTTAAGGAAGAGCATCATCAAAATACTTAGAGAGTCACCATATCAAATCTGGAGTGACATTCGCTGGAAGGATGTTGTCaaacgttttcctgatggatataCACATCACTTTATATATAGGGCCGGTTATAATCATATTGTGAAAAATAAGCCAAAATACCACAAAGTCCCTTTAGAACAGCTTCTAGATCGTGCGATGGTGAAGCTACAACATATTCCTAAACGACGCATAAGGAGGTTAGCATTCAATGAAAAAGAAGAATTAGAACAACAAACAAATTCCGTGTCACGACCATTGAAGGAAAATAGTGAGAATAGTATTGATAGATATAGAACTGATGCACTTTGCAAAGCTGAACATGATCATCTACTACAGAACTGGAAGAACTTTGTACAGGAGTATAATGTACCAGATAAATTAATATGTTTGGCACGCTGGAGAAATAAACGTAAAAGTAGATTACCAACCACACCTGAAGAATGCGCTAGACGTTGCGTTTTGGCATATTTGGCTCGAGGACTAAAAAGAACTATATTTCATGTATTCAGACATATACAAACAGCTTTTGGAAGTCCTGTAAAAGGTGCGTACTCACCTGAAGAGGAGAAAATTATGAAAGTTTGTTTCATACATCACCCAAACAATGCTGTGACGCTATCTAAAGTTTTGGGACGAGAACCTCGGGGAATATATAAAAGACTAAAGCAAATGTATAATGGTTTGCCATTAGAAGAACTGCAATATAAGAGGTTCGACAAACAAGTTTCGTTGAAACTTGAGGAAGATTTTGACCACAACTCTAATTACCTGCAAAACTTCTGGTATTACACGCTTCATGTTCAAGTGTTCGTTAAAACTGATGTCAAGTTCAGAGGTTTAAGGAAGAGCATCATCAAAATACTTAGAGAGTCACCATATCAAATCTGGAGTGACATTCGCTGGAAGGATGTTGTCaaacgttttcctgatggatataCACATCACTTTATATATAGGGCCGGTTATAATCATATTGTGAAAAATAAGCCAAAATACCACAAAGTCCCTTTAGAACAGCTTCTAGATCGTGCGATGGTGAAGCTACAACATATTCCTAAACGACGCATGAGGAGATATAGAACTGATGCACTTTGCAAAGCTGAACATGATCATCTACTACAGAACTGGAAGAACTTTGTACAGGAGTATAATGTACCAGATAAATTAATATGTTTGGCACGCTGGAGAAATAAACGTAAAAGTAGATTACCAACCACACCTGAAGAATGCGCTAGACGTTGCGTTTTGGCATATTTGGCTCGAGGACTAAAAAGAACTATATTTCATGTATTCAGACATATACAAACAGCTTTTGGAAGTCCTGTAAAAGGTGCGTACTCACCTGAAGAGGAGAAAATTATGAAAGTTTGTTTCATACATCACCCAAACAATGCTGTGACGCTATCTAAAGTTTTGGGACGAGAACCTCGGGGAATATATAAAAGACTAAAGCAAATGTATAATGGTTTCCCATTAGAAGAACTGCAATATAAGAGGTTCGACAAACAAGTTTGGTTGAAACTTGAGGAAGATTTTGACCACAACTCTAATTACCTGCAAAACTTCTGGTATTACACGCTTCATGGTCAAGTGTTCGTTAAAACTGATGTCAAGTTCAGAGGTTTAAGGAAGAGCATCATCAAAATACTTAGAGAGTCACCATATCAAATCTGGAGTGACATTCGCTGGAAGGATGTTGTCaaacgttttcctgatggatataCACATCACTTTATATATAGGGCCGGTTATAATCATATTGTGAAAAATAAGCCAAAATACCACAAAGTCCCTTCAGAACAGCTTCTAGATCGTGCGATGGTGAAGCTACAACATATTCCTAAACGACGCATAAGGAGGTTAGCATTCAATGAAAAAGAAGAATTAGAACAAGAGTATAATGTACCAGATAAATTAATATGTTTGGCACGCTGGAGAAATAAACGTAAAAGTAGATTACCAACCACACCTGAAGAATGCGCTAGACGTTGCGTTTTGGCATATTTGGCTCGAGGACTAAAAAGAACTATATTTCATGTATTCAGACATATACAAACAGCTTTTGGAAGTCCTGTAAAAGGTGCGTACTCACCTGAAGAGGAGAAAATTATGAAAGTTTGTTTCATACATCACCCAAACAATGCTGTGACGCTATCTAAAGTTTTGGGACGAGAACCTCGGGGAATATATAAAAGACTAAAGCAAATGTATAATGGTTTGCCATTAGAAGAACTGCAATATAAGAGGTTCGACAAACAAGTTTGGTTGAAACTTGAGGAAGATTTTGACCACAACTCTAATTACCTGCAAAACTTCTGGTATTACACGCTTCATGTTCAAGTGTTCGTTAAAACTGATGTCAAGTTCAGAGGTTTAAGGAAGAGCATCATCAAAATACTTAGAGAGTCACCATATCAAATCTGGAGTGACATTCGCTGGAAGGATGTTGTCaaacgttttcctgatggatataCACATCACTTTATATATAGGGCCGGTTATAATCATATTGTGAAAAATAAGCCAAAATACCACAAAGTCCCTTCAGAACAGCTTCTAGATCGTGCGATGGTGAAGCTACAACATATTCCTAAACGACGCATGAGGAGACATATACAAACAGCTTTTGGAAGTCCTGTAAAAGGTGCGTACTCACCTGAAGAGGAGAAAATTATGAAAGTTTGTTTCATACATCACCCAAACAATGCTGTGACGCTATCTAAAGTTTTGGGACGAGAACCTCGGGGAATATATAAAAGACTAAAGCAAATGTATAATGGTTTGCCATTAGAAGAACTGCAATATAAGAGGTTCGACAAACAAGTTTGGTTGAAACTTGAGGAAGATTTTGACCACAACTCTAATTACCTGCAAAACTTCTGGTATTACACGCTTCATGTTCAAGTATTCGTTAAAACTGATGTCAAGTTCAGAGGTTTAAGGAAGAGCATCATCAAAATGGCCGGTTATAATCATATTGTGAAAAATAAGCCAAAATACCACAAAGTCCCTTCAGAACAGCTTCTAGATCGTGCGATGGTGAAGCTACAACATATTCCTAAACGACGCATGAGGAGGTTAGCATTCAATGAAAAAGAAGAATTAGAACAAGAGTATAATGTACCAGATAAATTAATATGTTTGGCACGCTGGAGAAATAAACGTAAAAGTAGATTACCAACCACACCTGAAGAATGCGCTAGACGTTGCGTTTTGGCATATTTGGCTCGAGGACTAAAAAGAACTATATTTCATGTATTCAGACATATACAAACAGCTTTTGGAAGTCCTGTAAAAGGTGCGTACTCACCTGAAGAGGAGAAAATTATGAAAGTTTGTTTCATACATCACCCAAACAATGCTGTGACGCTATCTAAAGTTTTGGGACGAGAACCTCGGGGAATATATAAAAGACTAAAGCAAATGTATAATGGTTTGCCATTAGAAGAACTGCAATATAAGAGGTTCGACAAACAAGTTTGGTTGAAACTTGAGGAAGATTTTGACCACAACTCTAATTACCTGCAAAACTTCTGGTATTACACGCTTCATGTTCAAGTATTCGTTAAAACTGATGTCAAGTTCAGAGGTTTAAGGAAGAGCATCATCAAAATGGCCGGTTATAATCATATTGTGAAAAATAAGCCAAAATACCACAAAGTCCCTTCAGAACAGCTTCTAGATCGTGCGATGGTGAAGCTACAACATATTCCTAAACGACGCATGAGGAGGTTAGCATTCAATGAAAAAGAAGAATTAGAACAAGAGTATAATGTACCAGATAAATTAATATGTTTGGCACGCTGGAGAAATAAACGTAAAAGTAGATTACCAACCACACCTGAAGAATGCGCTAGACGTTGCGTTTTGGCATATTTGGCTCGAGGACTAAAAAGAACTATATTTCATGTATTCAGACATATACAAACAGCTTTTGGAAGTCCTGTAAAAGGTGCGTACTCACCTGAAGAGGAGAAAATTATGAAAGTTTGTTTCATACATCACCCAAACAATGCTGTGACGCTATCTAAAGTTTTGGGACGAGAACCTCGGGGAATATATAAAAGACTAAAGCAAATGTATAATGGTTTGCCATTAGAAGAACTGCAATATAAGAGGTTCGACAAACAAGTTTGGTTGAAACTTGAGGAAGATTTTGACCACAACTCTAATTACCTGCAAAACTTCTGGTATTACACGCTTCATGTTCAAGTATTCGTTAAAACTGATGTCAAGTTCAGAGGTTTAAGGAAGAGCATCATCAAAATGGCCGGTTATAATCATATTGTGAAAAATAAGCCAAAATACCACAAAGTCCCTTCAGAACAGCTTCTAGATCGTGCGATGGTGAAGCTACAACATATTCCTAAACGACGCATGAGGAGGTTAGCATTCAATGAAAAAGAAGAATTAGAACAAGAGTATAATGTACCAGATAAATTAATATGTTTGGCACGCTGGAGAAATAAACGTAAAAGTAGATTACCAACCACACCTGAAGAATGCGCTAGACGTTGCGTTTTGGCATATTTGGCTCGAGGACTAAAAAGAACTATATTTCATGTATTCAGACATATACAAACAGCTTTTGGAAGTCCTGTAAAAGGTGCGTACTCACCTGAAGAGGAGAAAATTATGAAAGTTTGTTTCATACATCACCCAAACAATGCTGTGACGCTATCTAAAGTTTTGGGACGAGAACCTCGGGGAATATATAAAAGACTAAAGCAAATGTATAATGGTTTGCCATTAGAAGAACTGCAATATAAGAGGTTCGACAAACAAGTTTGGTTGAAACTTGAGGAAGATTTTGACCACAACTCTAATTACCTGCAAAACTTCTGGTATTACACGCTTCATGTTCAAGTATTCGTTAAAACTGATGTCAAGTTCAGAGGTTTAAGGAAGAGCATCATCAAAATGGCCGGTTATAATCATATTGTGAAAAATAAGCCAAAATACCACAAAGTCCCTTCAGAACAGCTTCTAGATCGTGCGATGGTGAAGCTACAACATATTCCTAAACGACGCATGAGGAGGTTAGCATTCAATGAAAAAGAAGAATTAGAACAAGAGTATAATGTACCAGATAAATTAATATGTTTGGCACGCTGGAGAAATAAACGTAAAAGTAGATTACCAACCACACCTGAAGAATGCGCTAGACGTTGCGTTTTGGCATATTTGGCTCGAGGACTAAAAAGAACTATATTTCATGTATTCAGACATATACAAACAGCTTTTGGAAGTCCTGTAAAAGGTGCGTACTCACCTGAAGAGGAGAAAATTATGAAAGTTTGTTTCATACATCACCCAAACAATGCTGTGACGCTATCTAAAGTTTTGGGACGAGAACCTCGGGGAATATATAAAAGACTAAAGCAAATGTATAATGGTTTGCCATTAGAAGAACTGCAATATAAGAGGTTCGACAAACAAGTTTGGTTGAAACTTGAGGAAGATTTTGACCACAACTCTAATTACCTGCAAAACTTCTGGTATTACACGCTTCATGTTCAAGTATTCGTTAAAACTGATGTCAAGTTCAGAGGTTTAAGGAAGAGCATCATCAAAATACATATACAAACAGCTTTTGGAAGTCCTGTAAAAGGTGCGTACTCACCTGAAGAGGAGAAAATTATGAAAGTTTGTTTCATACATCACCCAAACAATGCTGTGACGCTATCTAAAGTTTTGGGACGAGAACCTCGGGGAATATATAAAAGACTAAAGCAAATGTATAATGGTTTGCCATTAGAAGAACTGCAATATAAGAGGTTCGACAAACAAGTTTGGTTGAAACTTGAGGAAGATTTTGACCACAACTCTAATTACCTGCAAAACTTCTGGTATTACACGCTTCATGTTCAAGTATTCGTTAAAACTGATGTCAAGTTCAGAGGTTTAAGGAAGAGCATCATCAAAATGTAA
Protein Sequence
MKLKDIFQDVVELEEPECMDKILTYLEEIKEEDDLFIVQTNSVSRPLKENSENSIDRYRTDALCKAEHDHLLQNWKNFVQEYNVPDKLICLARWRNKRKSRLPTTPEECARRFVLAYMARGLKRTIFHVFRHIQTAFGSPVKGAYSPEEEKMIKVCFIHHPNNAVTLSKVLGREPRGIYKRLKQMYNGKPDKKKIKWTIPLASKFVKLLMKYTGLPLEELQYKRFDKQVWLKLEEDFDHNSNYLQNFWYYTLHVQVCVKTDVKFRGLRKSIIKILRESPYQIWSDIRWKDVVKRFPDGYTHHFIYRAGYNHIVKNKPKYHKVPLEQLLDRAMVKLQHIPKRRIRRLAFNEKEELEQQTNSVSRPLKENSENSIDRYRTDALCKAEHDHLLQNWKNFVQEYNVPDKLICLARWRNKRKSRLPTTPEECARRCVLAYLARGLKRTIFHVFRHIQTAFGSPVKGAYSPEEEKIMKVCFIHHPNNAVTLSKVLGREPRGIYKRLKQMYNGLPLEELQYKRFDKQVSLKLEEDFDHNSNYLQNFWYYTLHVQVFVKTDVKFRGLRKSIIKILRESPYQIWSDIRWKDVVKRFPDGYTHHFIYRAGYNHIVKNKPKYHKVPLEQLLDRAMVKLQHIPKRRMRRYRTDALCKAEHDHLLQNWKNFVQEYNVPDKLICLARWRNKRKSRLPTTPEECARRCVLAYLARGLKRTIFHVFRHIQTAFGSPVKGAYSPEEEKIMKVCFIHHPNNAVTLSKVLGREPRGIYKRLKQMYNGFPLEELQYKRFDKQVWLKLEEDFDHNSNYLQNFWYYTLHGQVFVKTDVKFRGLRKSIIKILRESPYQIWSDIRWKDVVKRFPDGYTHHFIYRAGYNHIVKNKPKYHKVPSEQLLDRAMVKLQHIPKRRIRRLAFNEKEELEQEYNVPDKLICLARWRNKRKSRLPTTPEECARRCVLAYLARGLKRTIFHVFRHIQTAFGSPVKGAYSPEEEKIMKVCFIHHPNNAVTLSKVLGREPRGIYKRLKQMYNGLPLEELQYKRFDKQVWLKLEEDFDHNSNYLQNFWYYTLHVQVFVKTDVKFRGLRKSIIKILRESPYQIWSDIRWKDVVKRFPDGYTHHFIYRAGYNHIVKNKPKYHKVPSEQLLDRAMVKLQHIPKRRMRRHIQTAFGSPVKGAYSPEEEKIMKVCFIHHPNNAVTLSKVLGREPRGIYKRLKQMYNGLPLEELQYKRFDKQVWLKLEEDFDHNSNYLQNFWYYTLHVQVFVKTDVKFRGLRKSIIKMAGYNHIVKNKPKYHKVPSEQLLDRAMVKLQHIPKRRMRRLAFNEKEELEQEYNVPDKLICLARWRNKRKSRLPTTPEECARRCVLAYLARGLKRTIFHVFRHIQTAFGSPVKGAYSPEEEKIMKVCFIHHPNNAVTLSKVLGREPRGIYKRLKQMYNGLPLEELQYKRFDKQVWLKLEEDFDHNSNYLQNFWYYTLHVQVFVKTDVKFRGLRKSIIKMAGYNHIVKNKPKYHKVPSEQLLDRAMVKLQHIPKRRMRRLAFNEKEELEQEYNVPDKLICLARWRNKRKSRLPTTPEECARRCVLAYLARGLKRTIFHVFRHIQTAFGSPVKGAYSPEEEKIMKVCFIHHPNNAVTLSKVLGREPRGIYKRLKQMYNGLPLEELQYKRFDKQVWLKLEEDFDHNSNYLQNFWYYTLHVQVFVKTDVKFRGLRKSIIKMAGYNHIVKNKPKYHKVPSEQLLDRAMVKLQHIPKRRMRRLAFNEKEELEQEYNVPDKLICLARWRNKRKSRLPTTPEECARRCVLAYLARGLKRTIFHVFRHIQTAFGSPVKGAYSPEEEKIMKVCFIHHPNNAVTLSKVLGREPRGIYKRLKQMYNGLPLEELQYKRFDKQVWLKLEEDFDHNSNYLQNFWYYTLHVQVFVKTDVKFRGLRKSIIKMAGYNHIVKNKPKYHKVPSEQLLDRAMVKLQHIPKRRMRRLAFNEKEELEQEYNVPDKLICLARWRNKRKSRLPTTPEECARRCVLAYLARGLKRTIFHVFRHIQTAFGSPVKGAYSPEEEKIMKVCFIHHPNNAVTLSKVLGREPRGIYKRLKQMYNGLPLEELQYKRFDKQVWLKLEEDFDHNSNYLQNFWYYTLHVQVFVKTDVKFRGLRKSIIKIHIQTAFGSPVKGAYSPEEEKIMKVCFIHHPNNAVTLSKVLGREPRGIYKRLKQMYNGLPLEELQYKRFDKQVWLKLEEDFDHNSNYLQNFWYYTLHVQVFVKTDVKFRGLRKSIIKM

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-