Basic Information

Gene Symbol
-
Assembly
GCA_905147815.1
Location
LR990633.1:7311991-7319892[+]
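
The Location field packs the contig accession, start, end and strand into one string. Below is a minimal sketch of splitting it into parts, assuming the `contig:start-end[strand]` layout seen above and 1-based inclusive coordinates (the record does not state the coordinate convention). The names `Locus` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Locus:
    """Split a 'contig:start-end[strand]' string into its four fields."""
    match = re.fullmatch(r"(\S+):(\d+)-(\d+)\[([+-])\]", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Locus(contig, int(start), int(end), strand)


locus = parse_location("LR990633.1:7311991-7319892[+]")
# Genomic span under the assumed 1-based inclusive convention.
span = locus.end - locus.start + 1
print(locus, span)
```

The genomic span can exceed the coding sequence length, which would be consistent with introns in the gene model.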

Transcription Factor Domain

TF Family
MYB
Domain
Myb_DNA-binding domain
PFAM
PF00249
TF Group
Helix-turn-helix
Description
This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1  6   0.062     1.8e+02   3.8    0.0   23        44      345       365     330       367     0.90
2  6   0.00028   0.81      11.3   0.1   7         43      423       464     421       467     0.89
3  6   0.44      1.3e+03   1.1    0.4   22        46      583       611     565       611     0.74
4  6   0.00023   0.67      11.6   0.3   3         45      662       713     660       714     0.85
5  6   0.00014   0.39      12.3   0.2   13        42      832       868     817       872     0.78
6  6   0.037     1.1e+02   4.5    1.2   3         15      1023      1035    1021      1039    0.89
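
Each row of the table is one domain hit with thirteen whitespace-separated fields. The sketch below parses the six rows and lists the hits whose independent E-value falls under a cutoff. The cutoff of 1.0 is an arbitrary illustrative assumption, not a threshold used by the database, and `DomainHit` and `parse_row` are hypothetical names.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# Rows copied from the Hmmscan Out table of this record.
ROWS = """\
1 6 0.062 1.8e+02 3.8 0.0 23 44 345 365 330 367 0.90
2 6 0.00028 0.81 11.3 0.1 7 43 423 464 421 467 0.89
3 6 0.44 1.3e+03 1.1 0.4 22 46 583 611 565 611 0.74
4 6 0.00023 0.67 11.6 0.3 3 45 662 713 660 714 0.85
5 6 0.00014 0.39 12.3 0.2 13 42 832 868 817 872 0.78
6 6 0.037 1.1e+02 4.5 1.2 3 15 1023 1035 1021 1039 0.89
"""


def parse_row(line: str) -> DomainHit:
    """Convert one whitespace-separated table row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


hits = [parse_row(line) for line in ROWS.splitlines() if line.strip()]

# Illustrative cutoff only (assumption): keep hits with i-Evalue below 1.0.
I_EVALUE_CUTOFF = 1.0
best = [h for h in hits if h.i_evalue < I_EVALUE_CUTOFF]
for h in best:
    print(f"hit {h.index}: protein residues {h.ali_from}-{h.ali_to}, score {h.score}")
```

Under this assumed cutoff only hits 2, 4 and 5 remain. The other three have independent E-values above 100, so they offer little support for a domain on their own.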

Sequence Information

Coding Sequence
ATGGAGCAAATAGTGGTGAAAAGTGAGATGCCTAGCAATGGGGAGATATTGCTTTTCTACGTTGATGAAAACGGAGGAAATGAAGAAGGTGTATTAACAACCGTCGAAAACATAGAAAGTCAAGCAGTACAGCTGCAACAAGACAATTCGTATGTCATAGAAGATGTTGGTGAGCTATCCAACAACTTAAGCCTTCTCCAATCCAGCGGCCAAAAGGACCATTGGAGTGACATAGACATAAAAAGACTTCTAACATTCTACAATGACAATAAACAATCATTTATATCCGGCACAACGAAAAAGACACACTTATGGACCGTAGCTTGTAAAACAATGCTCATAGGTAAGAACCAAAATTCCTGTGAAGCACAACTGCAGATCTTAAAAGAGAAATATATGGATATATGCAGACATATACAGAAAGGAGACTATGTATCCTGGCCGTATTTCGAACTCTGTCATCAAGTTTTCCAAGAAGATTCTTTTTACACATCAAAATCAAACGATGATTACGATCTTCTGATAGGAAATACCACTATCAAATTGCCAGTTCCTCAACAAAACCAGGATAATTTTGTGGTTAAGAAAGTCAATACTCGTTCTTTAGTTGACGAGAAAGTTGAAATGATGTTGAATTTGTACTTGAAGTACAAGAAAAACTTTCAACAGGATTATTGGAGGCGAGGTTTGTGGGAAACTATAGCTATGGAGATTGGGGAGGAAGACGGAGAGTACTGGCAGAAACGCTTCTTAAACTATAAGCAGCATTACTTAAGGTTACTGGACAAACGTCGGGAAAATGGCTCGGACAACATCAATTGGCCATACTTAGAGTTATTTGATAAGATCTTTGAAGGGGATGAGGGTTTTCGCAAAAAATATGGGCCTGAAGACGCTAAACCGAGTTCAACCCAAATAAACTCAATTGAAATGCCTATTGAATGGGACAACACTGAAAAAACAGTTCTAGTTAAGTACTGCTTTGATTGTTTGGAAGAATTCGAAGACCCAACTATACCGGACAGCTTTCTTTGGAACGAAATTGGACGACTACTAGACAAAACTGCAGATGTATGTAGGCAGAAGTACGAAGAACTTAAAAATGATcatttagatagatatattgaGGGTGTGTACAACATGCGGAGTAGGATACCAATAGAGATATTATTTGACAATATCATATCGAGGGAGATCGAAACAGAACTAGTTAAATCTAAAAACACACCGGAGCATCTAGAAGTATGGAAAATGGCGGAATTAGACGAATTAGTGCAGTTTTTCTATGATAATGTAGAGATGTATAAAGATCGTCTATGCCATTTTGTTTGCTGGTCCGCTATAGCTAAAAAACTTAAAAGGGACCTGCAGAGCTGTAGGGGCCAGTGGGAAGACCTGGTGACCCTCTACAAGACAATATTGAATGATAAAAAAGAGGATTCAGACATGCAAATCGATTGGCGATATATTGAATTGTTTGATAGGATATTTGATTATGGTATGGATGAAAACTTGCTCGATGGCTATGAGAAAAAACAGACGAATCAGAATCAAGATATTGGAAAAATTAGtgtaaaaaaggtaaacatCAACTTGGACGACAACACAGAAGATTTATCAGATGATGACGAATCATACGACGAGAAAGGATACAATAAACGCACCAAACGCCACGTCGGGGACTCAAAAGCCTTCAAAATACTTGAATATTAtcagaaaaataaagaaaaatttaatacGACGAAACGAAACAAACATTCATTATGGGAAGTTCTCGCTAAACAGATCGGAATATCAGCTTTGAAATGCTCACACCGATTCAGAAATCTTAAGCAAGTTTACACGGCATATGTGCAGAGAGAGCTTAATAAGCCAGAAATGCCAATCGTCTGGCCTTATTACGCATTATGCAAGAAAGTCTTCGGCTACCGTGCCATTAAAAGCAagcttaaaaatagtaaaacagACTCCGATGATGCCGAAGAATGGTCTGCAAAAGA
aattaaacaattaataaattacttctCTGAAAACTTCCATGAGATCAATAGTAATATAGAATGTATTGATAAATGGTCAGGATTAGCCAGGATTATTGGGAAAAGTGAGGGTTCGTGTAAGGAGAAGTTTCTGGAATTGAGAAAGTCTTACAGGAAGTTGAAGACTATGAGGAGTAGGAACCCAGAAGTGAAGATCAATTGGAAGTATTTTGGTATGTTTGAGCAGATTTATAGGGAACGAGAGGAAAACCAAGATGCTATGGAGGTGGATGACGTCCAGGGTTATGCTGAAATGCATTTGGAACAGGAGGGGCAGGAagAAGACGATTATCAATGCATCATCGTTATACCAGAAGGCCAAGACATATCCCAGATCGAAAACGCACAAATAATAATGCGAGACAACTCTGTGGAGTTACAGCAAATCGAAAGCAATCCTAAACCCCTGAACAAATGGACGAAACGCACCAAAAAACGCTTACTTATCTTCTATATCAACTACATAAGATCGCATAGAGGAAAGGAGATAATAGCTAAGGAAATGTGGGCAGAGATCGCCTCGAAATTACCGGAAAAAACACCTTTATCCTGCAGAAAAATGTTCGCGAAACTTAAAGCGAATCACAAACATCTGGACCAAGATGATGTTAACAAGAAGAAGACTCCATACTACACATTACTGGAAAAGATAATGCGTTTGAAGCCTAAATTTGCAAAGTCTGAACAAAATAAAGTGCTGTCTGGACAGACATATAAAGACGTTGAATTACCTGATGAGAAAGTTGAACAAGCTTTAAATTACTATCTGCAACATATAGAAGACTTTGTTAGTCCGAAATTCGAGAAGAAATACTTATGGACAGAACTAGCCAATCATATTTCGGAGCCAGTGACAAAAGTGTTTaacaaaatcaattatttgaAGCAGTATTACAGTGATAATACAGAGGCAGCGACCAGCTCTCCATATGCTGAGTTGCTTAAAGATATAACAGCAAAGGAAATCGCGATAAAACTGGTAATTGAGAATCAACCGAAGCCTGTTATCGAGGAGGCAGTTGCAGAATCATGGACTGATGAAGAAACAGAACAATTATTGGAGTGGTATCTGAGTAATCTGGACAAGTTTAAGAACCCCAAGTTTGTTCGGAGCTACCTGTGGATGGAAGCTTCTAGTATTTTGAATAAAAGTGCTATAGTTTGCTCGAAAAAGATGTCTGAAATTAGGACTCAGTATAGGAATATGGTTAGGGAGACTCCTGAGGAATTGGGCATTTGGAGATTCCATGATCTGTGTCAGAAGATTTATGGGACGGGGAAGAAAGGGACTGGTGTTACTGCAActtaa
Protein Sequence
MEQIVVKSEMPSNGEILLFYVDENGGNEEGVLTTVENIESQAVQLQQDNSYVIEDVGELSNNLSLLQSSGQKDHWSDIDIKRLLTFYNDNKQSFISGTTKKTHLWTVACKTMLIGKNQNSCEAQLQILKEKYMDICRHIQKGDYVSWPYFELCHQVFQEDSFYTSKSNDDYDLLIGNTTIKLPVPQQNQDNFVVKKVNTRSLVDEKVEMMLNLYLKYKKNFQQDYWRRGLWETIAMEIGEEDGEYWQKRFLNYKQHYLRLLDKRRENGSDNINWPYLELFDKIFEGDEGFRKKYGPEDAKPSSTQINSIEMPIEWDNTEKTVLVKYCFDCLEEFEDPTIPDSFLWNEIGRLLDKTADVCRQKYEELKNDHLDRYIEGVYNMRSRIPIEILFDNIISREIETELVKSKNTPEHLEVWKMAELDELVQFFYDNVEMYKDRLCHFVCWSAIAKKLKRDLQSCRGQWEDLVTLYKTILNDKKEDSDMQIDWRYIELFDRIFDYGMDENLLDGYEKKQTNQNQDIGKISVKKVNINLDDNTEDLSDDDESYDEKGYNKRTKRHVGDSKAFKILEYYQKNKEKFNTTKRNKHSLWEVLAKQIGISALKCSHRFRNLKQVYTAYVQRELNKPEMPIVWPYYALCKKVFGYRAIKSKLKNSKTDSDDAEEWSAKEIKQLINYFSENFHEINSNIECIDKWSGLARIIGKSEGSCKEKFLELRKSYRKLKTMRSRNPEVKINWKYFGMFEQIYREREENQDAMEVDDVQGYAEMHLEQEGQEEDDYQCIIVIPEGQDISQIENAQIIMRDNSVELQQIESNPKPLNKWTKRTKKRLLIFYINYIRSHRGKEIIAKEMWAEIASKLPEKTPLSCRKMFAKLKANHKHLDQDDVNKKKTPYYTLLEKIMRLKPKFAKSEQNKVLSGQTYKDVELPDEKVEQALNYYLQHIEDFVSPKFEKKYLWTELANHISEPVTKVFNKINYLKQYYSDNTEAATSSPYAELLKDITAKEIAIKLVIENQPKPVIEEAVAESWTDEETEQLLEWYLSNLDKFKNPKFVRSYLWMEASSILNKSAIVCSKKMSEIRTQYRNMVRETPEELGIWRFHDLCQKIYGTGKKGTGVTAT*
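
The Protein Sequence should correspond to a codon-by-codon translation of the Coding Sequence. The sketch below checks this on the first and last few codons only, assuming the standard genetic code (NCBI translation table 1) and treating the mixed-case letters of the coding sequence as case-insensitive. `translate` is a hypothetical helper written for this example.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks, ignore letter case
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 15 codons and last 6 codons of the Coding Sequence above.
head = translate("ATGGAGCAAATAGTGGTGAAAAGTGAGATGCCTAGCAATGGGGAG")
tail = translate("GGTGTTACTGCAActtaa")
print(head, tail)
```

Both fragments agree with the start (`MEQIVVKSEMPSNGE`) and end (`GVTAT*`) of the Protein Sequence. Because `translate` strips whitespace, the full Coding Sequence can be passed in even when it is wrapped across lines.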

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-