Basic Information

Gene Symbol
-
Assembly
GCA_001015335.1
Location
NW:83542-103245[-]

Transcription Factor Domain

TF Family
MYB
Domain
Myb_DNA-binding domain
PFAM
PF00249
TF Group
Helix-turn-helix
Description
This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1  4   0.12      1.1e+02   2.7    0.0   12        42      27        57      8         61      0.75
2  4   0.0027    2.5       7.9    0.1   22        45      142       172     120       173     0.69
3  4   2.7e-06   0.0024    17.6   0.2   18        45      240       266     216       267     0.86
4  4   0.0025    2.3       8.0    0.1   23        46      355       377     325       377     0.81
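
The per-domain rows above can be read programmatically. The following is a minimal sketch, assuming the thirteen whitespace-separated fields appear in the order given in the header; the `DomainHit` type, the `parse_hits` helper, and the 0.01 i-Evalue cutoff are illustrative assumptions, not part of the database or of HMMER itself.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """One per-domain row of the hmmscan output (hypothetical container)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# Field converters, in the same order as the header columns.
CONVERTERS = (int, int, float, float, float, float,
              int, int, int, int, int, int, float)

# The four data rows copied from this record.
ROWS = """\
1 4 0.12 1.1e+02 2.7 0.0 12 42 27 57 8 61 0.75
2 4 0.0027 2.5 7.9 0.1 22 45 142 172 120 173 0.69
3 4 2.7e-06 0.0024 17.6 0.2 18 45 240 266 216 267 0.86
4 4 0.0025 2.3 8.0 0.1 23 46 355 377 325 377 0.81
"""


def parse_hits(text: str) -> List[DomainHit]:
    """Parse whitespace-separated rows; skip lines without 13 fields."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != len(CONVERTERS):
            continue
        hits.append(DomainHit(*(conv(f) for conv, f in zip(CONVERTERS, fields))))
    return hits


hits = parse_hits(ROWS)

# The hit with the lowest independent E-value is the best-supported domain.
best = min(hits, key=lambda h: h.i_evalue)

# Assumed cutoff for illustration only; choose one suited to the analysis.
I_EVALUE_CUTOFF = 0.01
significant = [h for h in hits if h.i_evalue < I_EVALUE_CUTOFF]

print(best.index, best.ali_from, best.ali_to)  # 3 240 266
print([h.index for h in significant])          # [3]
```

Under this assumed cutoff only the third hit (alignment coordinates 240 to 266) passes; the other three rows have i-Evalues above 1.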

Sequence Information

Coding Sequence
ATGGGGAGGCCACAACTGTTTGAGCAGAAATTGATAACAGAAGTTCAAAAATATCCATGCCTGTACGATAGAGCATTATTCAAAGAAGGAATAATTAGCGAAAGGGCAAAAACATGGGAAAAGATTGCACCCATAGTCGGTGCTCCAGTCGATGCTTGCAAAATTAGATGGGGCCGGTTAAGAGATCGCTATATTGCTTTAACTTTGAAGACCATTCAAGAACCGGGCTTCACATGTGTCTGGAAGTACACCGAATTGATGTCATTTATGaggcaacatttgcatttgaaaagTGACAGCTTGGAATCAAAGGATCTTCAATTGCCGCAGAGTAAACACTTCAACCGAGAAATGTTCGAAGAAAACCTTATAgaagaagtaaaaaagcatGAAGCTATATACAACCCAGCCCATGTAGATAAACGAAACACAAAAGTAATTGAACAAATTTGGGTGACCATTGCATCTTCTTTGGGAAGTACTGTCAAACAATGCACTAGTCGTTGGAGTACATTAAAAGATATGTTTGTACGACACAACAATATTATGCTAACCTCGGATAAAAAGGGTGAATATCAACCAACACGTTGGAAGTACTACAATGAAATGTCATTTATGAGAGATTATGTAAATATATGTGATGGGCTATTGATTTCTGAGGTAAAGAAACATGAAGCTTTATACAATAGAAAACATCCTGAATATAGCAACTATTCCAAACAAGGCGAAATATGGTCGATAATTGCTTCGGAAGTAAAACGAACGGCCGATGCTTGTAAATCTAGATGGAGGCAATTAAGGGATCGCTATATTGCTTTATTGTCGAAGATCAAAGAAGAACCGGGATACGTAAGTACCTGGAAGTACGCCGAAATGATGTCATTTATGAAGGATCATTTGGATTTAGAAGATAAAAGCTTGCAATCAAATCCTCAACTGAAGCAAAATAAAACGAATCGAgaaatttttggagaaaaacttttaaaagaaATAAGAAAGCATGAAGCTCTTTATAATCCGGCCCATGCAGACAAACGTAACCCAGAAATAATAGAACAAATTTGGATAACCATTGCATCTTCTTTGGGAAGCACTGTCAAAGAATGCCTTAGTCGTTGGAGAACATTGAAAGAGATGTTTGTACTGCACAACAATAAAATACTTACCTCGGACACAAATGATGAGCAAAAGCCATATTGGAAGTACTACAATGCAATGTCGTTTATGAAGGATTATGTCAAAATGCCGAAATTGTAAAACTCGATGGAGACGGTTGGTAAATCGTTATAAAAATGAATACACAAAAAAGACATTGTCCAAAGATTACACTTTCAGTTGGAAATATGCATCTGCCATGAGCTTTATGACTGATTTTATATTTCCCAAAGGCTCCAACAACGACAATTTCAAGAATAACTTTTGTAGGGCTTGCGCTTGCGATATAAGTGTAGATACTGCAGTGAAATACGATATTTTCAATACGTCaggtttaaaagaaaaatttgtgaCCTGCGCCAATCTGGAATTGGCCACAAGTGATGAATTTCCTCGGTCAGTGTGCCAAAAGTGCTACGACAAAATTCTCGATTTCTTTCAGTTCCAAGGTATGTGTCGGAAATCCTTGCAAAAGTTTGACAACATGAAGAAGGAAGAATTTGAGGAAAATGCTTCGATGGATTCGCATCAATCAACGGAATATTCATTTCAGCATGTAAAAACCCCTAATATATTATCCCCTAACTCACAATTTCATGAGTTTGGGAGCAATCAGAAAGTGGAACTGGAGAAAAGTGATTCGGAGGATTCGCAGGAATCAACTGAATACTTAATTCAGGATGTGCTATCCCCTAAAATATTTTCCACTCACTCACATTTGGACAACACAAAGAAAACTAACACCACATTTGTTGCGACAGAACGTGCGACAACTCAGCAGGATCCCTTAAACGTAAATTTGGAGTACAAATATAATATGACCGATGATTCTTATTTAGAACGAA
CTGTGAGCCAACAGGACAACAATGAAGCTTATGAGGGAAATAAACTTGAATGTATTGATGATGACGATACTAAATGTGATGCGAAAATCACAGAGTTGGAAATATACGAATtaaaagagcttgacaaatcGCCCGAGTACAAAGAGATTATCTGCCCGCCATTAGAGGACCTGACTAGGAAAGAAAACCCTCAAAAGCCTTGCTTCGTGTGTGAACTTTGCCAAAAAACCTTTAGAACCAAATATTGTTTTATAGCCCATCAGCGAAAGCATCAAGGTCTATCGGGATATGTATGTACACATACAAATTGTGACCGTATTTTCAATGGGGTAAGAGACTTAAGGGGACATCTGAGCCGACACAAAGGCGTTCGACCAGATTTTATATGCAACATCAATAATTGTGGCGAacgttttaaagaaaattatctTTTAAGATTTCACAAGCAAAAGGTTCACAACTATAGCGAAACACGAAAAAAAGTAACTAAAACAACAATTGCAGCTAAAGAGACCTTTGTCTGTGAGGTCTGCGGCAAAGTGTTCAACTTCAAGCGGCGTCTCGACAATCATAGTCTCGTACATGTCGATGAATCACAATGGCCTTTTGCGTGCGATGAGCCTGGTTGTGGCAAAAGATTTCGGATGAAAACAAGACTCCAAACACATACACTTCGTCATAAGGGTATTAAGAATTATACATGCCCCCATTGTGGACTAAAGAAAGTCACAAGAAATGAATTAAATATTCACATTAATTTTCACACCTTCGAAAAGAAGTATGCCTGTTCTTTATGTTCCAAAGTATTTAAGAGCGTCGGATGCCTTAGTACACATAGGCATCAAGTCCATGAAGGCAAACCAAAGCCAAAGAAAACGGCAGAGAAGAACGCTCAGTTATATGAATGCAGACACTGTGGACGAATGCTAAGCAGTGAGCAAACGAGAAAAAATCATGAAATGGGACACACCAACGAGAAACCGCACGTTTGCGAAAACTGTGGCAAACGATTTTCGACTTCCTTCAATCTGAAGAAACACATGATTATGCATGCTGTTCACAAAACCTTTGCTTGCGATGTATGTGACAAACAATTCAAACATCGTTCAGGGTTAACGACTCATATGCGAACACATTCGGAAGAAACAAGACTTAAGTGTGATGAATGTGGTAAGTGCTTTGAATGGCCTTCTGTCTTGCATGCTCACAAAAAGCTACACGTTGAGGGGTCGGTTCCCCATGTTTGTGGTATATGTGGCAAAGGATTCCGTTGGCCAGGATCCTTTTATGCTCATAAGAAAAAACATGTGGAGAATAGTAAAGAAAATGAGGATAAACAGAAGGAGATGCAAGTTGTATAA
Protein Sequence
MGRPQLFEQKLITEVQKYPCLYDRALFKEGIISERAKTWEKIAPIVGAPVDACKIRWGRLRDRYIALTLKTIQEPGFTCVWKYTELMSFMRQHLHLKSDSLESKDLQLPQSKHFNREMFEENLIEEVKKHEAIYNPAHVDKRNTKVIEQIWVTIASSLGSTVKQCTSRWSTLKDMFVRHNNIMLTSDKKGEYQPTRWKYYNEMSFMRDYVNICDGLLISEVKKHEALYNRKHPEYSNYSKQGEIWSIIASEVKRTADACKSRWRQLRDRYIALLSKIKEEPGYVSTWKYAEMMSFMKDHLDLEDKSLQSNPQLKQNKTNREIFGEKLLKEIRKHEALYNPAHADKRNPEIIEQIWITIASSLGSTVKECLSRWRTLKEMFVLHNNKILTSDTNDEQKPYWKYYNAMSFMKDYVKMPKL*NSMETVGKSL*K*IHKKDIVQRLHFQLEICICHELYD*FYISQRLQQRQFQE*LL*GLRLRYKCRYCSEIRYFQYVRFKRKICDLRQSGIGHK**ISSVSVPKVLRQNSRFLSVPRYVSEILAKV*QHEEGRI*GKCFDGFASINGIFISACKNP*YIIP*LTIS*VWEQSESGTGEK*FGGFAGIN*ILNSGCAIP*NIFHSLTFGQHKEN*HHICCDRTCDNSAGSLKRKFGVQI*YDR*FLFRTNCEPTGQQ*SL*GK*T*MY***RY*M*CENHRVGNIRIKRA*QIARVQRDYLPAIRGPD*ERKPSKALLRV*TLPKNL*NQILFYSPSAKASRSIGICMYTYKL*PYFQWGKRLKGTSEPTQRRSTRFYMQHQ*LWRTF*RKLSFKISQAKGSQL*RNTKKSN*NNNCS*RDLCL*GLRQSVQLQAASRQS*SRTCR*ITMAFCVR*AWLWQKISDENKTPNTYTSS*GY*ELYMPPLWTKESHKK*IKYSH*FSHLRKEVCLFFMFQSI*ERRMP*YT*ASSP*RQTKAKENGREERSVI*MQTLWTNAKQ*ANEKKS*NGTHQRETARLRKLWQTIFDFLQSEETHDYACCSQNLCLRCM*QTIQTSFRVNDSYANTFGRNKT*V**MW*VL*MAFCLACSQKATR*GVGSPCLWYMWQRIPLARILLCS*EKTCGE**RK*G*TEGDASCI
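
The relationship between the coding sequence and the protein sequence can be checked with a short translation routine. This is a sketch under stated assumptions: it uses the standard genetic code (NCBI translation table 1), reads the sequence in frame from its first base, and treats `*` as a stop codon; the function names `translate` and `internal_stops` are hypothetical helpers, not part of any database tooling.

```python
from typing import List

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1.

    Whitespace and line breaks are removed and lowercase (soft-masked)
    bases are upper-cased first. Codons with unknown bases become 'X';
    a trailing partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        protein.append(CODON_TABLE.get(seq[pos:pos + 3], "X"))
    return "".join(protein)


def internal_stops(protein: str) -> List[int]:
    """Return 1-based positions of stop symbols other than a terminal one."""
    body = protein[:-1] if protein.endswith("*") else protein
    return [i + 1 for i, aa in enumerate(body) if aa == "*"]


# First 33 bases of the coding sequence in this record.
prefix = "ATGGGGAGGCCACAACTGTTTGAGCAGAAATTG"
print(translate(prefix))  # MGRPQLFEQKL
```

Running `internal_stops` on a full translation lists every in-frame stop before the end, which is one way to flag a sequence whose reading frame may have shifted.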

Similar Transcription Factors

Clustering by sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-