Basic Information

Gene Symbol
-
Assembly
GCA_949126895.1
Location
OX421488.1:9807127-9815839[-]

Transcription Factor Domain

TF Family
MYB
Domain
Myb_DNA-binding domain
PFAM
PF00249
TF Group
Helix-turn-helix
Description
This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 7 1.4 4.5e+03 -0.4 0.0 3 13 73 83 71 84 0.87
2 7 0.00046 1.5 10.7 0.0 21 45 344 369 332 370 0.88
3 7 0.056 1.8e+02 4.1 0.1 10 43 429 467 425 470 0.75
4 7 0.0063 21 7.1 0.2 22 46 587 613 567 613 0.75
5 7 0.00016 0.51 12.2 0.1 3 45 664 715 662 716 0.85
6 7 0.00033 1.1 11.2 0.2 16 42 864 897 846 901 0.73
7 7 0.027 88 5.1 0.1 2 15 1055 1068 1054 1115 0.87

Sequence Information

Coding Sequence
ATGGATCCAACAGTGGTAGTGAAGACGGAAGCCAATGGagacattttattgttttatgttGATGAGAATGGTGGCAATGAAGAAGGGGTTCTTACAACAGTACAAAACATTGAAAACCAAGCAATAGAGCTCCAACAAGACAACTCATATATCTTACCAGAAGTCGGTGACCTCGCCCATGAGATCAATGCTGCACAATCCAGCACCCAGAATGATAATTGGACTGATGACGAGATCAGAAGACTTCTCATCTTCTACACCGATAACAAACAAACTTTTATATCAGGCACAAcgaaaaagaaacatttatGGACTGTGGCATGCAAGACAATGTTGGTCGGAAAGCATCCTAACGCTTGCGAGGCACAGCTTAACAGTCTCCGcacaaaatattttgaaatttgcaACCTTATGCAGCAAGGTGATTATGAGAAAGGCGTATATGTCAAATGGCCTTATTTAGAACTATGTCATTCAGTATTTCATGATGAATCTTCATTAATTGAAGAATATGACACAAGTGATACTCAGATAGCTAACATGCCGGTTGCAAATGCGAACATTGAAACGATGGTTGTAAAAAAAGTTAGCAGTCGGGCATCACCTGATGAAAAAGTTGAGTCAATGCtgaatttatatttgaaatataagaaGAACTTTCAACAGGAGTATCGAAGAAAGGGGCTATGGGAGACTATTGCAATGGAGTTGGGAGAAGATGATGGGGAGTATTGGCAGAAACGATTTTTGAATTACAAGCAGCATTATTCACGATTGCTTGACAAGAGGCGTGTCAGTGGCCCTGAAGGCATCAACTGGCCGTACATGGaattatttgataaaattttcGAAGGAGATGATGACTTCAATCGGAAATACACAAATGTCGATTATAATAAAGATTATAAGACTATTGAAAATCAAACCTTATCGGAAGAACCTAAACTAGATTGGGACAACACGGAAATGACTGTCCTAGTTAAGTACTGCTATGACTGTTTTGATGAATTTGAAGATAAGACCATTCCAAATAATTTCCTTTGGAACGAAATTGGCAGACTTCTCGACAAAACAGCCGATGATTGCAAGCAAAAATATCAAGAAATGAAAAATGCtcatttaaacaaatatatagaAGGTGGCTATGACCTGCGTAGTCGGAATCCTATAGCAATATTATATGACAATATAATATCTAAGGAGGTTGAATTAGAGATGATACAAATAAGGAATCAACCAGAAGAATTGGAACTATGGAAGACCGAATCATTAGATGAACTAGTACAATTCTTTTACGAAAATGTAGAAATGTATAAAGATCTCATCTGTCATTATGTTTGCTGGGCAGCCATTGTGAAAAAGTTAAACAGAGGCTTGCAGAGCTGTAAAGGTCAATGGGAGGACCTAGTCGCTCTTTATAAAACGATTTTAAATGATAAGAAAGAAAATCCTGACATGCAAATTGATTGGAGATATATAGAGTGGTTTGATAGAATATTTGATTATGGTATGGATACAAAGCTGCTGTCTGGATATGAGACGTTGAAGGCACCAGTTCAGGAGTCCAGCAAAGTTGgtgttaaaaaaattaaaatcaaacagGAGGAATTGAACGATGATATCACTGATGACGATGAATCTTTCGACGAGAGGGGTTTCACAAAACGCACCAAGCGTCGAGCTGGTGAATCCAAAGCGTTCAAAATACTCGAATACTACCAGAAGAATAAAGAAAAATTTTCAACCACAAGCAGAAACAAACATTCCCTATGGGAGGTACTAGCCAGACAAATAGGAATATCTGCTACACAGTGTGCCCACAGATTTAGAAACCTCAAACAAGTATACACAGCATATGTTCAAAGAGAAATTAATAAACCAGAGATGCCAATACTCTGGCCTTACTATGCTCTATGCAAAAAAGTTTTTGGTTATCGAGcaattaaaagtaaattgaaGAACGGAAAACTAGATTCCGATGACAGTGAGGAATGGTCTGCTAAAGAAATCAAACAACTTATCAACTATTTCTCACAGAATTACGACgatattgataataatgttgatGATAACAGCAAATGGTCTGATTTGGCTGCTGAAATCGGGAAAGGCGAGACTTCCTGTAAAGAGAAATTCTTAGAACTCAGGAAGTCTTATAGAAAATTGAAGACTATGAGGTCCAGGAACCCAGATGTGAAGATATCTTGGAAGTATTTTGCCATGTTTGAGCAGGTTTACAGTGCGAGGGAGCAAGAGGGAAGGCAGGGTGTTATGGAGGTAGATGAAGAGGCTTATATGGAGATACCAGTAGACTCCTATGAGAAGAATGACCAAGAAGAAGATGATTACCAATGCATAATAGTCATCCCAGAAGGCGAAGATATAGAAAACgctcaaataataatacaagaaCACCCTAATGCACCACAAGATATGATGACTACAGAAGCAATCATAGAAGCATCAGAAATCGAAGCAGCAGAAATGGCAGAAGCAACAGAAACCGTAGAAGCGATAGAACCACAGAGAACATTTGTTAAGTGGACAAAACGATCTAAGAAACGACTTCTTATCTATTATATAAACTATGTACGATCCAATAAgggaaaagaaataaaacccaAAGAAATGTGGGCGGAAATAGCATCAAAATTACAAGATAAAACTCCATTGTCTTGTAGAAAAATGTTCGCGAAACTGAAAGCAAATCACAGACAATTAGATTCTGATGAAATCAATGCTAAAAAGACTCCTTACTTTGCATTGATGGAAAAAGTTATTCGATTAAAACCTAAATTCGTCAAAACAGagcaaaataaagcacttaaagatggtaaaatatataaagatgtTGCATTACCTGACGAAAAAGTAGTTCAAGCACTTCAATATTATTTAGAAAACATTGAAGATTTTGTTAGCCCGAGATATGAAAAGAAATACCTTTGGACAGAATTGGCTAACTTTGTTGGTGAGCCTATAACAAAAGTGTTTAACAAACTGaattatttgaaacaattttACAACAGTGACACCGACGAAGTCGCTGGAAATAGTTCCATTTTTGCGCCTCTattgaaagaaatatttatcAAAGAGATAGCTATCAAGTTGGTATTGGAAAATCAACCTAAACCTGTAGCACACGACTCCGACATCGAAGAGCCGTGGACTGATGAAGAAATTCTACAACTTTTAGAATGGTATTTAAGTAATTTGGAGAAATTCAAAAACCCAAAATTTGTAAGGAGCTACTTGTGGATGGAAGTGTCTTCCATTTTGAATAAAAGTGCGATTGCTTGTTCTAAGAAGATGTCTGAAATAAGAACTCAATACAGGAATATGGTGAGGGAATCTCCAGGGGAATTGGCTGGGTGGAGATTTCTGGAGTTGTGTCAAAAGATTTATGGGACAGGCAAGAAGGGGAATCCAGTTCAGGGTTCTGGAGATGCCTGA
Protein Sequence
MDPTVVVKTEANGDILLFYVDENGGNEEGVLTTVQNIENQAIELQQDNSYILPEVGDLAHEINAAQSSTQNDNWTDDEIRRLLIFYTDNKQTFISGTTKKKHLWTVACKTMLVGKHPNACEAQLNSLRTKYFEICNLMQQGDYEKGVYVKWPYLELCHSVFHDESSLIEEYDTSDTQIANMPVANANIETMVVKKVSSRASPDEKVESMLNLYLKYKKNFQQEYRRKGLWETIAMELGEDDGEYWQKRFLNYKQHYSRLLDKRRVSGPEGINWPYMELFDKIFEGDDDFNRKYTNVDYNKDYKTIENQTLSEEPKLDWDNTEMTVLVKYCYDCFDEFEDKTIPNNFLWNEIGRLLDKTADDCKQKYQEMKNAHLNKYIEGGYDLRSRNPIAILYDNIISKEVELEMIQIRNQPEELELWKTESLDELVQFFYENVEMYKDLICHYVCWAAIVKKLNRGLQSCKGQWEDLVALYKTILNDKKENPDMQIDWRYIEWFDRIFDYGMDTKLLSGYETLKAPVQESSKVGVKKIKIKQEELNDDITDDDESFDERGFTKRTKRRAGESKAFKILEYYQKNKEKFSTTSRNKHSLWEVLARQIGISATQCAHRFRNLKQVYTAYVQREINKPEMPILWPYYALCKKVFGYRAIKSKLKNGKLDSDDSEEWSAKEIKQLINYFSQNYDDIDNNVDDNSKWSDLAAEIGKGETSCKEKFLELRKSYRKLKTMRSRNPDVKISWKYFAMFEQVYSAREQEGRQGVMEVDEEAYMEIPVDSYEKNDQEEDDYQCIIVIPEGEDIENAQIIIQEHPNAPQDMMTTEAIIEASEIEAAEMAEATETVEAIEPQRTFVKWTKRSKKRLLIYYINYVRSNKGKEIKPKEMWAEIASKLQDKTPLSCRKMFAKLKANHRQLDSDEINAKKTPYFALMEKVIRLKPKFVKTEQNKALKDGKIYKDVALPDEKVVQALQYYLENIEDFVSPRYEKKYLWTELANFVGEPITKVFNKLNYLKQFYNSDTDEVAGNSSIFAPLLKEIFIKEIAIKLVLENQPKPVAHDSDIEEPWTDEEILQLLEWYLSNLEKFKNPKFVRSYLWMEVSSILNKSAIACSKKMSEIRTQYRNMVRESPGELAGWRFLELCQKIYGTGKKGNPVQGSGDA

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00404870;
90% Identity
iTF_00637609; iTF_01166653;
80% Identity
-