Basic Information

Gene Symbol
-
Assembly
GCA_947037095.2
Location
OX344839.1:35081233-35087209[-]
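A minimal sketch of splitting the location string above into its parts. The helper name `parse_location` and the `Location` type are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string shaped like 'SEQID:START-END[STRAND]'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)-(\d+)\[([+-])\]", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("OX344839.1:35081233-35087209[-]")
# Span length, ASSUMING 1-based inclusive coordinates (not stated in the record).
span = loc.end - loc.start + 1
print(loc, span)
```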

Transcription Factor Domain

TF Family
MYB
Domain
Myb_DNA-binding domain
PFAM
PF00249
TF Group
Helix-turn-helix
Description
This family contains the DNA-binding domains from Myb proteins, as well as the SANT domain family [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  6   0.39      6.4e+02   1.6    0.1   3         17      75        89      73        130     0.61
2  6   0.054     88        4.3    0.0   21        42      341       363     333       367     0.86
3  6   5e-05     0.083     14.0   0.1   9         44      425       465     415       467     0.86
4  6   0.0052    8.5       7.6    0.1   22        46      586       612     567       612     0.85
5  6   0.00099   1.6       9.9    0.1   3         44      663       713     662       715     0.86
6  6   0.015     24        6.1    0.1   13        35      841       870     826       880     0.76
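The per-domain rows above can be read into records and ranked, as in the following sketch. The field names are shortened forms of the column headers, `parse_rows` is a hypothetical helper, and ranking by independent E-value is only one possible way to pick a representative hit, not something this record prescribes.

```python
# Column order as given in the Hmmscan header of this record.
COLUMNS = ["idx", "of", "c_evalue", "i_evalue", "score", "bias",
           "hmm_from", "hmm_to", "ali_from", "ali_to",
           "env_from", "env_to", "acc"]
INT_COLUMNS = {"idx", "of", "hmm_from", "hmm_to",
               "ali_from", "ali_to", "env_from", "env_to"}

# Rows copied from the table above.
ROWS = """\
1 6 0.39 6.4e+02 1.6 0.1 3 17 75 89 73 130 0.61
2 6 0.054 88 4.3 0.0 21 42 341 363 333 367 0.86
3 6 5e-05 0.083 14.0 0.1 9 44 425 465 415 467 0.86
4 6 0.0052 8.5 7.6 0.1 22 46 586 612 567 612 0.85
5 6 0.00099 1.6 9.9 0.1 3 44 663 713 662 715 0.86
6 6 0.015 24 6.1 0.1 13 35 841 870 826 880 0.76
"""


def parse_rows(text):
    """Turn whitespace-separated rows into dicts keyed by column name."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != len(COLUMNS):
            continue  # skip blank or malformed lines
        hits.append({
            name: int(value) if name in INT_COLUMNS else float(value)
            for name, value in zip(COLUMNS, fields)
        })
    return hits


hits = parse_rows(ROWS)
# Lowest independent E-value first.
best = min(hits, key=lambda h: h["i_evalue"])
print(best["idx"], best["ali_from"], best["ali_to"], best["score"])
```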

Sequence Information

Coding Sequence
ATGGAACAACAAATCATGGTTAAAACAGAGGTGCAGTCCAATGGAGAAATATTGCTGTTTTATGTGGATGAAAATGGAGGCAATGAAGAAGGAGTAATAACAACTGTAGAAAACATTGAAAACCAAGCTATACAATTGCAGCAAGATAACTCTTTCATCATCCAAGATGTTGGGGATCTCACTACTGAAATTAGTATTCCACAGACAAGTGTTCAAAGTGATACCTGGACTGATGATGAGATCAAAAGGCTACTGGTCTTCTTTAgtgataacaaacaaacatttataacAGGTACAACAAAGAAAACACATCTTTGGACTGTGGCTTGCAAAACAATGCTCATGGGGAAAACCCCACAAGCTTGTGAACAACAGTTGAACACTCTCCAACTAAAATACTTTGAAATTATTGCTCAAATCCAAAAAGGACATTACACTAAATGGCCATACTTTGAACTTTGTCATCAAGTATTTGAAGATGAGAGCTCTCAAGATTATATTATTGATGATTACTCAATAGAATGTGAACCACAACCTGTTAAAGTACCGGTATCAAAACCTAGTGATAACACAGTTATGGTAgttaaaaaagtaaacacaAACATCAAAAATGTTGATGAGAAAGTTGAAACTATGCTCAATTTGTATATGAAGTATAAAAAGGACTTTCAAAAAGACTATTGGAGGCATGGATTGTGGGAAACGATAGCAATGGAACTTGGCCAAGACGATGGAGAATATTGGCAAAAACGATTCCTCAATTATAAACAGCATTATACAAGATTAGTGGAAAAACGACGTTTGAGCGGTTCTGATGGTATTACTTGGCCTTACATGGAATTGTTTGACAAAATATTTGAAGGTGACCCAGATTTTGGGAAAAGACATCCTAATCAGGAACAGGTTAAAACTGAGGCTGTTGCAGAGGAACCTTGTGTTGATTGGGATAACACTGAAATGACAGTGTTAGTCAAATACTACTTCGACTGTTACGATGAATTTGAAGACAAGACTATCCCAAACAACTTCCTTTGGAATGAAATTGGCAGACTGTTAGATAAAAACCCAGAAGATTGTAGAACAAAGTATGAGTTACTAAAAAATGTACATTTAGAGAAATATATTGCAGGTTCTTATACTATAAGGATTAGAAAAcctatagatatattatttgataatataatcAGCAAAGATATTGAAGGAGAGTTCGCAAAATTCAGAAGTTTACCGGAACAGTTGGAAGAATGGGGTTCTACCGAAACCGATACACTAGTTCAGTTTTTCTATGAaaatgttgaaatgtataaaGACAGTATTTGTTACTATGTGTGTTGGTCTGCAATAGCTAAGGCCTTGAATAAAAGTTTACAAAGCTGCCGAGGTCAATGGCAAGATCTCGTGGAAATCTACAAGACTGTATTGAATGATAAAATGGAAAATCCTGATGTGCAGTTTGATTGGCAGTACATTGATCTTTTTGACAGGATATTTGACTATGGAATGGACACGAATTTACTGTCGGGctatgaaaaaacaaaaatacaaactgaaAAACGTGACACAGTTGTTGGTGtaaaaaaaatcaacatcaAAACAGCAGAAACTCCGGAAGAGACTACATATGACGACAATGAAGATTTCAACGAAAGAGGCTTTCGGAAGCGTTCAAAGCGTAGCTCAGGAGAATCCAAAGCTATTCAAATTCTCGAATAttaccagaaaaataaagaacTGTTCTCTTCCacaaatagaaacaaacattcGTTTTGGGAGGTCCTCGCTAAACAAATTGGAATATCAGCTACCCAATGTGCCCATAGATTCAGGAATTTGAAGCAAGTGTACACCGCATACGTACAGAGGGAAATCAACAAGCCAGATATGCCAATACTATGGCCTTACTATGTTCTATGCAAAAAAGTATTTGGATACAGAGCTCTCAAAtcaaaattgaaaaatgttaaaaCCGAAGCTATTGAAGTCGATCAGTGGTCTCCTaa
agaaataaaactattaattaaatacttatctGACCATTTTGATGAGATCAATAATAATACTGATGATAGCAGTAAATGGTCAGGTTTGAGTACACAGATAGGTAagagtgaaaatatttgtcgaGAAAAATTTATAGAATTGAGAAAGTCTTACAGAAAGCTTAGTACTATGAGGAAGAAGGACCCTGATGTCAATATAATGTGGAAATATTACAAGATGTTTGaagatatttacaataatagatcAGTACAGGAATTGCAGGAGCCTACGGAGGAAGATTATGACAGAGGTTATATAGAAATACCTATATCTGATGAGATGAGGCTTGAGGGTGatgatgacTATCAATGCATCATAGTCATGGAAGACGGTCAAGATCTGTCACAGATGGAACACGCTCAAATAATTGTACAGCACTCAGGAGACGTAAGCATGGAACAATCAGCAATAGAAACCAGTGAACCTAAACCCCTCACTAAATGGACGAAACGAAGTAAAAAGAAGCTgcttatattttacattaattatgttAGATCGCACAAAGGCCATGAAATTAAACCTAAAGATATGTGGGCAGAAATAGCACAGAAGTTAAAAGACAAAACACCTGGGGCTAGTAGAAAAATGTTTGCGAAATTAAAAGCAAACCACAAGAATATTGATGAAGCGGACGAAAATACGAAAACGAATCCTTTATATAATCTAATGGAGAAAGTCATGCGATTGAAGCCGAAATTTGTGAAAACGCaacaaaataaagaattaagGGACTCCAAAATATACAAAGATGTAATCTTGCCTTGCGAAAAAGTTGAACTAGCATTACAGTACTATCTCGCAAATTTAGAGGACTTTGTCAGTCCCAAGTACGAGAAAAAATACCTTTGGACCCAACTGGCGAACCACATATCTGAACCTGTGACAAAAATATTCAACAAAATTAACTTTTTGAAGCAGTCGTTTAACTTTGAAACTCTCGAAATAGACGGCCAGCCATCACCATTCTCTGGACTGTTAAAAGACATTTTAGAGCAAGAAATAACAACGAGATTATCTTTAGAAAGTCAGCCGAAACCATCAGTCGAAGATTCTACAATTGAAGAAACATGGAGTGATTATGAAACCGGACAACTTTTGGAATGGTATCTGGGTAACTTAGAGAAATTCAAGAACCCCAAATTTGTTAGAAGCTACCTCTGGATGGAAGCTTCTAGTATTCTGAATAAAACTGCATTGTCTTGCTCAAAGAAAATGTCGGAAATAAGGACTCAGTATCGGAACATGGTGCGAGAGACACCTGAACAATTGAGTGGATGGAAATTCCTAGActtgtgtcaaaaaatatatgGAACTGGCAAAAAAGGAAGCCAGGGTAATTAA
Protein Sequence
MEQQIMVKTEVQSNGEILLFYVDENGGNEEGVITTVENIENQAIQLQQDNSFIIQDVGDLTTEISIPQTSVQSDTWTDDEIKRLLVFFSDNKQTFITGTTKKTHLWTVACKTMLMGKTPQACEQQLNTLQLKYFEIIAQIQKGHYTKWPYFELCHQVFEDESSQDYIIDDYSIECEPQPVKVPVSKPSDNTVMVVKKVNTNIKNVDEKVETMLNLYMKYKKDFQKDYWRHGLWETIAMELGQDDGEYWQKRFLNYKQHYTRLVEKRRLSGSDGITWPYMELFDKIFEGDPDFGKRHPNQEQVKTEAVAEEPCVDWDNTEMTVLVKYYFDCYDEFEDKTIPNNFLWNEIGRLLDKNPEDCRTKYELLKNVHLEKYIAGSYTIRIRKPIDILFDNIISKDIEGEFAKFRSLPEQLEEWGSTETDTLVQFFYENVEMYKDSICYYVCWSAIAKALNKSLQSCRGQWQDLVEIYKTVLNDKMENPDVQFDWQYIDLFDRIFDYGMDTNLLSGYEKTKIQTEKRDTVVGVKKINIKTAETPEETTYDDNEDFNERGFRKRSKRSSGESKAIQILEYYQKNKELFSSTNRNKHSFWEVLAKQIGISATQCAHRFRNLKQVYTAYVQREINKPDMPILWPYYVLCKKVFGYRALKSKLKNVKTEAIEVDQWSPKEIKLLIKYLSDHFDEINNNTDDSSKWSGLSTQIGKSENICREKFIELRKSYRKLSTMRKKDPDVNIMWKYYKMFEDIYNNRSVQELQEPTEEDYDRGYIEIPISDEMRLEGDDDYQCIIVMEDGQDLSQMEHAQIIVQHSGDVSMEQSAIETSEPKPLTKWTKRSKKKLLIFYINYVRSHKGHEIKPKDMWAEIAQKLKDKTPGASRKMFAKLKANHKNIDEADENTKTNPLYNLMEKVMRLKPKFVKTQQNKELRDSKIYKDVILPCEKVELALQYYLANLEDFVSPKYEKKYLWTQLANHISEPVTKIFNKINFLKQSFNFETLEIDGQPSPFSGLLKDILEQEITTRLSLESQPKPSVEDSTIEETWSDYETGQLLEWYLGNLEKFKNPKFVRSYLWMEASSILNKTALSCSKKMSEIRTQYRNMVRETPEQLSGWKFLDLCQKIYGTGKKGSQGN
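The relationship between the coding sequence and the protein sequence can be checked with a short translation sketch. It assumes the standard genetic code, which this record does not state, and upper-cases the input so that lower-case bases are translated like any others. The helper name `translate` is hypothetical, and only the first 21 bases of the coding sequence are used here.

```python
# Standard genetic code (ASSUMED), laid out in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove whitespace (including line breaks) and normalise case.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")  # X for unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the coding sequence in this record.
prefix = translate("ATGGAACAACAAATCATGGTT")
print(prefix)  # MEQQIMV, matching the start of the protein sequence
```

Because whitespace is stripped before translation, a sequence wrapped over several lines can be passed in unchanged.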

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-