Basic Information

Gene Symbol
-
Assembly
GCA_963082885.1
Location
OY720196.1:4577264-4582283[+]
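The location string above packs the sequence accession, start, end, and strand into a single field. A minimal sketch of splitting it apart follows. It assumes the `accession:start-end[strand]` layout seen in this record and treats the coordinates as 1-based and inclusive, which the record itself does not state. The `Location` type and `parse_location` helper are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed location field."""
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for "accession:start-end[strand]", e.g. "OY720196.1:4577264-4582283[+]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("OY720196.1:4577264-4582283[+]")
# Genomic span in bp, ASSUMING 1-based inclusive coordinates.
span = loc.end - loc.start + 1
print(loc, span)
```

Under the inclusive-coordinate assumption, the span covers the whole locus on the genome, so it can be longer than the coding sequence if the gene contains introns.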

Transcription Factor Domain

TF Family
MYB
Domain
Myb_DNA-binding domain
PFAM
PF00249
TF Group
Helix-turn-helix
Description
This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1  8   3.7       6.7e+03   -1.7   0.0   2         13      74        85      73        89      0.85
2  8   7.8       1.4e+04   -2.7   0.0   22        31      226       237     217       239     0.73
3  8   0.031     55        5.0    0.0   21        44      339       363     327       365     0.86
4  8   3         5.3e+03   -1.3   0.1   22        43      442       462     434       465     0.87
5  8   0.0032    5.6       8.2    0.2   22        46      577       611     565       611     0.74
6  8   0.0016    2.8       9.2    0.1   3         45      662       713     660       714     0.83
7  8   0.001     1.8       9.8    0.2   22        42      844       871     821       875     0.76
8  8   0.78      1.4e+03   0.5    0.1   3         15      1025      1037    1025      1084    0.82
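The eight domain hits above vary widely in significance, so it can help to read them into records and filter on the independent E-value. The sketch below parses the thirteen whitespace-separated columns of each row and keeps hits at or below a cutoff. The cutoff of 10 is an arbitrary value chosen for illustration, not a threshold used by this database, and `DomainHit`, `parse_hits`, and `filter_hits` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """One row of the per-domain hmmscan table (hypothetical record type)."""
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# Rows copied from the hmmscan table in this record.
ROWS = """\
1 8 3.7 6.7e+03 -1.7 0.0 2 13 74 85 73 89 0.85
2 8 7.8 1.4e+04 -2.7 0.0 22 31 226 237 217 239 0.73
3 8 0.031 55 5.0 0.0 21 44 339 363 327 365 0.86
4 8 3 5.3e+03 -1.3 0.1 22 43 442 462 434 465 0.87
5 8 0.0032 5.6 8.2 0.2 22 46 577 611 565 611 0.74
6 8 0.0016 2.8 9.2 0.1 3 45 662 713 660 714 0.83
7 8 0.001 1.8 9.8 0.2 22 42 844 871 821 875 0.76
8 8 0.78 1.4e+03 0.5 0.1 3 15 1025 1037 1025 1084 0.82
"""


def parse_hits(text: str) -> list[DomainHit]:
    """Parse whitespace-separated rows; lines without 13 fields are skipped."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13:
            continue
        hits.append(
            DomainHit(
                int(fields[0]),
                int(fields[1]),
                float(fields[2]),
                float(fields[3]),
                float(fields[4]),
                float(fields[5]),
                *(int(value) for value in fields[6:12]),
                float(fields[12]),
            )
        )
    return hits


def filter_hits(hits: list[DomainHit], max_i_evalue: float = 10.0) -> list[DomainHit]:
    """Keep hits whose independent E-value is at or below the cutoff.

    The default cutoff is an illustrative assumption only.
    """
    return [hit for hit in hits if hit.i_evalue <= max_i_evalue]


hits = parse_hits(ROWS)
kept = filter_hits(hits)
for hit in kept:
    print(hit.index, hit.i_evalue, hit.ali_from, hit.ali_to)
```

With this illustrative cutoff, only domains 5, 6, and 7 remain, spanning protein positions 577-611, 662-713, and 844-871 in alignment coordinates. A different cutoff would change which hits are kept.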

Sequence Information

Coding Sequence
ATGGAGCAAACTGTAGTTGTGAAGACGGAGATGCAATCCAATGGAGAGATACTGCTGTTTTATGTTGATGAAAATGGTGGCAATGAAGAAGGAGTACTAACAACAGTTCAAAACATAGAAAACCAAGCCTTAGAGCTTCAACAAGACAACTCATACATAATACCAGATGTAGGAGACCTatcaaatgatataaatataccACAGTCAAGTACACACACTGAAAACTGGTCAGAAGATGATATTAAAAGACTCCTTGTGTTTTATAACGATAATAAGCAAACATTCATATCAGGTACaactaaaaagaaacatttatggACCGTTGCATGTAAGACGATGCTAGTTGGCAAAAACCCTCTTGCATGTGATATACAACTCAACAGTCTTAGAGAAAAGTATATAGAAATCTGCAGCCATATACAAAGTGGGGTTTATGTTAAATGGCCATATTTTGAACTGTGTCATCAAGTCTTTCACGATGAAACTGCAAAAACGGGtgaagattttaaaataataaccgaGTCACAAATTGAGAACGTACCGGGTTCAAATGATTATGACAATGTAATGGAGGTAAAGAAAGGGAATAATCGAAGTTCAGGTGATGAAAAAGTTGAAATGatgttaaatttgtatttgaagtataaaaagaattttcaacaAGAGTATCGGAGGAAAGGTTTATGGGAATCTATAGCAATGCAGTTAGGGGAAGATGATGGGGAATATTGGCAGAAACGATTCTTAAATTACAAGCAACATTATTTACGTTTACTGGATAAGAGACGTGATAATGGCTCTGAAGGCATTAATTGGCCGTACATGCAGTTTTTTGACAATATTTTCGAAGACGACGAAGAATTTAATCGAAAATACGTTAATCAAGAATACAAGGCTATTGAGAATCAAGTTATATCCGAAGATTCTAGCCTAGATTGGAATAATACCGAAATGACAGTACTGGTTAAGTACTGCTATGACTGCTTTGATGAATTTGAAGATAAGACAATCCCTAATAATTTCTTATGGAACGAAATCGGCCGTCTTTTAGACAAAACTGCCGATGTTTGCAAGCAGAAATATGAGGAATTGAAAAACGCGCATTTAGACAAATATATTGAGGGTGGTTATGAGTTGCGGAATCGTAaacctatatctatattattcgataatattatatcaaaggAAATAGAATTGCAGGTAATTAAAATGAGAAATCAACCGGAACAATTGGAATTATGGAAAACTAATGAACTTGACGAGTTGGTGCAGTTTTTTTACGATGATATAGAAATGtataaagattttgtttgtcatTATGTTTGTTGGGCGGctgttgtaaaaaaattaaaacgaaaTTTACAAAGTTGTGTAGGTCAATGGGAGGACCTTATGACATTATATAAGACCATATTGAATGATAAGAAAGAAAACCCAGACATGCAAATAGATTGGAGATATATAGAATTGTTTGATCGAATATTTGATTACGGTATGGATACTAATTTGATGACTGGATATCACAATAAATTACAGACATCATCAACACGTCAGGATTCTGGGAAGGTTGGTGttaaaaaagtaaaaataaatatggatGACTTACCGGAAGATTATTCTGATGACGACGAATCCTTTGATGAGAGGGGCTTCACAAAACGAACTAAACGTCGCACTGGAGACTCCAAAGCTTTCAAAATACTCGAATACTATCAAAAGAATAAGGATAAATTCTCTACAACCAACAGAAATAAACACTCCTTATGGGAGGTATTGGCTAAACAGATAGGAATATCGGCCACTCAATGCGCTCATAGATTTAGAAATCTAAAACAAGTATACACAGCATATGTTCAAAGAGAAATAAGCAAACCAGAGATGCCAATTCTATGGCCTTACTACGCTCTATGCAAAAAAGTTTTTGGTTACCGAgcaattaaaactaaattaaagaaTGGCAAAATTGATTCTGATGACAGCGAGGATTGGTCAGCGAAAGA
AATAAAgcagttaattaattatttcgcTCAGAATTTTGATGACATTAATAGTAATGTTGATGACAGTACTAAGTGGTCGGATTTGGCTGGTGAATTAGGTAAAGGAGAAATATCTTGTAAGGAAAAGTTTTTGGAATTAAGGAAATCCTATAGGAAGTTAAAGACTATGAGGTCTAGGAATCCTGATGTTAAGATATCCTGGAAGTACTTTTCTATGTTTGAGCAGATTTATAATTATAGAGAAGGAAATGCCATGGATATTGATAGCAAAACTGATGTAAAAATACATATGGATGAGAATAATGATCAAGAAGacGATGATTACCAATGCATAATAGTAATACCAGAGGGAGAAGATATAGAAAACGCACAAATTATTATACAAGAACATCCCACCATACATCAGGAAGAAACTGTACACACGGAACAATTAATCCCTGCTGAGCACAAACCACCTGCTAAATGGACCAAACGGACTAAAAAACTACtcataatacattatattaactACATAAGGTTAAATAAAGGCAAGGAAATCAAACCCAAAGATATGTGGGCAGACATAGCCGCTAAACTACAAGACAAATCACCGCTTTCTTGTAGGAAAATGTTTGCAAAATTAAAAGCCAATCACAGACAAGCTAGCGATGaccctaatataaaaaagactcCTTACTATACACTAATGGAAAAAGTATTCCGTTTAAAACCGAAATTTATCaaaacagaacaaaataaatcaaatgatACTAAAATATACAAAGATGTTGTATTACCAGACGACAAGGTTTTCCAAGcactacaatattatttagaaaatttaactGATTTTGTTAGTCCAAGAtatgaaaagaaatatttatggacTGAATTAGCAAATTATGTTTGCGAACCAGTAACTAAGGTGTTTAATAAGattaattatttgaaacaaCTGTTTAATAGTGATACTGATGAAGCTTCAAAATCACCATTTGCCGCCATATTGAAAGATATATTAGCGAAAGAAATCGCGATAAAACTAGTTATAGAAAATGAACCGAAGCCTGTTATAGATTCTGGCGTAGAAATAAAATGGAATGATGATGAAACTGAACAATTATTAGAATGGTATTTGAGCAATTTGGAGAAATTTAAAAATCCCAAATTTGTTAGAAGTTATTTATGGATGGAAGTATCtgatattttgaataaaagtgCGCTTGCTTGTTCAaagaaaatgtctgaaattagGACTCAATATAGGAATATGGTGAGAGAAACGCCGGGTGAATTGGCCGGTTGGCGATTTTTAGAGTTATGTCAGAAAATATATGGGACGGGAAAGAAAGGAAATCAGATTACTTCTGAACAATCTGATGTTTAA
Protein Sequence
MEQTVVVKTEMQSNGEILLFYVDENGGNEEGVLTTVQNIENQALELQQDNSYIIPDVGDLSNDINIPQSSTHTENWSEDDIKRLLVFYNDNKQTFISGTTKKKHLWTVACKTMLVGKNPLACDIQLNSLREKYIEICSHIQSGVYVKWPYFELCHQVFHDETAKTGEDFKIITESQIENVPGSNDYDNVMEVKKGNNRSSGDEKVEMMLNLYLKYKKNFQQEYRRKGLWESIAMQLGEDDGEYWQKRFLNYKQHYLRLLDKRRDNGSEGINWPYMQFFDNIFEDDEEFNRKYVNQEYKAIENQVISEDSSLDWNNTEMTVLVKYCYDCFDEFEDKTIPNNFLWNEIGRLLDKTADVCKQKYEELKNAHLDKYIEGGYELRNRKPISILFDNIISKEIELQVIKMRNQPEQLELWKTNELDELVQFFYDDIEMYKDFVCHYVCWAAVVKKLKRNLQSCVGQWEDLMTLYKTILNDKKENPDMQIDWRYIELFDRIFDYGMDTNLMTGYHNKLQTSSTRQDSGKVGVKKVKINMDDLPEDYSDDDESFDERGFTKRTKRRTGDSKAFKILEYYQKNKDKFSTTNRNKHSLWEVLAKQIGISATQCAHRFRNLKQVYTAYVQREISKPEMPILWPYYALCKKVFGYRAIKTKLKNGKIDSDDSEDWSAKEIKQLINYFAQNFDDINSNVDDSTKWSDLAGELGKGEISCKEKFLELRKSYRKLKTMRSRNPDVKISWKYFSMFEQIYNYREGNAMDIDSKTDVKIHMDENNDQEDDDYQCIIVIPEGEDIENAQIIIQEHPTIHQEETVHTEQLIPAEHKPPAKWTKRTKKLLIIHYINYIRLNKGKEIKPKDMWADIAAKLQDKSPLSCRKMFAKLKANHRQASDDPNIKKTPYYTLMEKVFRLKPKFIKTEQNKSNDTKIYKDVVLPDDKVFQALQYYLENLTDFVSPRYEKKYLWTELANYVCEPVTKVFNKINYLKQLFNSDTDEASKSPFAAILKDILAKEIAIKLVIENEPKPVIDSGVEIKWNDDETEQLLEWYLSNLEKFKNPKFVRSYLWMEVSDILNKSALACSKKMSEIRTQYRNMVRETPGELAGWRFLELCQKIYGTGKKGNQITSEQSDV

Similar Transcription Factors

Sequences clustered by sequence identity using MMseqs2

100% Identity
iTF_00869589; iTF_01491900
90% Identity
-
80% Identity
-