Basic Information

Gene Symbol
-
Assembly
GCA_905332915.1
Location
HG995313.1:8576917-8582104[-]

Transcription Factor Domain

TF Family
MYB
Domain
Myb_DNA-binding domain
PFAM
PF00249
TF Group
Helix-turn-helix
Description
This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 7 0.12 1.9e+02 3.3 0.9 3 39 72 120 70 127 0.59
2 7 0.0044 7 7.9 0.0 21 44 342 366 331 368 0.87
3 7 0.031 50 5.2 0.2 12 43 429 465 415 468 0.72
4 7 0.0049 7.9 7.7 0.2 22 46 582 616 570 616 0.74
5 7 0.00059 0.94 10.7 0.1 3 44 667 717 665 719 0.80
6 7 2.3e-05 0.037 15.2 0.3 13 42 848 884 834 888 0.85
7 7 0.046 74 4.6 0.1 3 15 1043 1055 1041 1102 0.80

Sequence Information

Coding Sequence
atGGATGGCTCTATAGTGGTTAAAACTGAGATGGGAACTAATGGCGAAATACTGCTTTTCTATGTTGATGAAAATGGTGGCAATGAAGAAGGGGTAATAACAACTGTTGAAAGCATAGAAAATCAAGCGATACAACTGCAACAGGATAACTCCTACATCATTCAAGATGTTGACTCCAATGAAATCAGCATGGATCAGTCTGCGTTAACTGATCACTGGACCGAGGATGAAACTAAAAAACTCCTGGTCTTCTACAATGATAATAAACAGACTTTCATcaatggtacaacaaagaaGAAACATTTATGGACTGTAGCATGTAAGACCATGATCGTTGGTAAAAATCCAAACTCATGTGAAGCTAAACTCAACAGCTTACAAGCAAAGTATAACGAAATTTGTGGCCACATACAAAAAGGTGTCTACGTAAAGTGGCCATACTTTGAACTATgccatcaaatatttcacgatGAGACCCCTATGATTACAGTTGAAACCTTAAATACACCAGAaccacaaataataaaagttccCGCTTTAAAACAGAATTATGATAATGTTATGGTGGTTAAAAAGGTGAACAGTAGAGCAACAGCTGATGAAAAAGTTGAAATGATGCTGAAGTTGTATCTGAAGTACAAGAAAAACTTCCAAGCTGAGTATTGGAGACGTGGCATATGGGAAACCATCGCGTTAGAGATAGGTGAAGATGATGGAGAATACTGGCAGAAACGTTTCTTAAACTATAAGCAACATTATCTGAGATTGATAGACAAAAGACGAGAGTGTGGCTCAGAGGGAATCAACTGGCCTTACTTAGAACTATTTGACAAAATCTTTGAAGGTGACGAGGACTTCCATAGAAAATATCTCACCGAGGAGTATAGACAAATTGAAAACCAGGCCATATCTGAAGTTGAAGAGCCTCCATCTAAAGTTATGgattgggacacaactgaaaTGACAGTATTAGTAAAATACTGTTTTGATTGCTTTGATGAATTTGAAGACAGAACCATACCCAACAATTTCCTTTGGACTGAAATTGGCCGTTTACTAGACAAGACCGCAGAAGcttgcaaatcaaaatatgagGAACTAAAGAACGCACATTTAGACAAATACATAGAAGGTGGTTACGACTTACGAACCCGAAAACCTATAGCTATATTATTCGACAATATAATATCCAAAAACATTGagaatcaaataataaaaattggtaAAATACCTGAGCAACTAGAGATGTGGAAGACAGAGGAATTAGATGAATTAGTGCAATTCTTCTATGACAACATAGAGATGTACAAAGACATGGTCTGCCATTTTGTTTGCTGGGCAGGGGTTACCAAAAAGTTGAAGCGAAATCTGCAAAGTTGCCGAAGCCAATGGGAGGATCTTGTAAGCTTGTATAAGACAATATTGAATGATAAAAAAGAGGATCCTGATATGCAGATTGATTGGAGATATATTGAAGTTTTTGATAGGATCTTTGATTATGGCATGGATACTAACTTGCTATCTGGGTATGAAACTTTGAAGGGATTTGGACAGAATCAGAAAACTGAAAATTCAGGGAAGATTGGTGtAAAGAAAGTCAACATTAAACTGGACGACGCTATGGAAGAATTTACCGACGATGATGAGTCGTACGATGAACGAGGCTTCACGAAACGCACCAAAAGACGCTCAGGCGACTCTAAAGCGTTCAAAATACTCGAATACTACCAGAAAAACAAAGACAAGTTCTCTACGACAAACCGGAACAAACATTCCCTATGGGACATACTAGCCAAGCAGATTGGCATATCAGCTACGCAATGCGCTCACCGCTTCAGAAACCTAAAACAAGTCTACACGGCTTACGTCCAAAGAGAAATCAACAAACCGGAAATGCCAATCCTCTGGCCGTATTACGCACTGTGCAAGAAAGTTTTTGGTTATAGAGCCATCAAATCTAAACTGAAAAACGGAAAATTAGATTCTGATGACAGTGAGGACTGGTCAGCGAAAGAAATCAAACGATTAATAAACTATTTCTCTCACAACTTTGATGATATTAACCTTAACATAGAAGATACAACAAAATGGTCGGACTTAGCTGCTGAAATAGGAAAAGGGGTAAATTCGTGCAAAGAAAAGCTGTTAGAATTACGCAAGTCTTATAGGAAGCTTAAAACAATGAGAAGTAGGAACCCTGATGTTAAGATTTCTTGGAAGTATTTCAACATGTTTGAAGATATTTATAATGCTAAAGAAAATGGCGTGGAGACGCTCGAGGTGGATGATAGTGAGACCACGTATATGGAGTTACAGGCGTCTGATGATAGGGTTGAGCAAGAAGAAGACGACTACCAATGCATCATAGTTATTCCAGAAGGGCAAGATATATCACAGATCGAAAATGCTCGAATTATAATACAAGATAATTCAATGCCACAAGAACAAGAAATTGTCCAAACAGAGCCAGAACCTCCCAAGGAAGTCAGACCACTTGCAAAATGGACGAAAAGAACTAAAAAGAGGTTGATCATATTCTATATAAACTATATCAGAATGCATAAAGGGAAGGAAATTAATGCTAAAGAAATGTGGGCAGAAATTGCgtcaaaaatacctaacaaaacaCCACTTGCTTGTAGAAAAATGTTCGCCAAACTCAAAGCTAATCACAAGCAAATTGATGAGTCTAACCCTTGCATGAAGAAGACTCCTTATTTTGCGTTGATGGAAAAAGTCATGCGTTTAAAACCAAAATTCTTGAAAACTGAACAGAATAAGGCATTGAAAGACGGAAAAGTATACAAAGATGTAGCCTTACCTGATGAAAAAGTTGTACAAGCATTGCAGTACTATTTAGAAAACATCGAGGACTTTGTCAGTCCAAGATTTGAGAAAAAATACCTCTGGACTGAACTTGCAAATTACGTTTGTGAGCCAATAACTAAAGTCTTCAACAAAATCAACTATTTAAAACAGGCTTATAACATGGATACCGATGAAGTAGCCGGAGTAAAGACTCCGTTTGCAGAGTACTTGAAAGAAATCTTTGCCAAGGAGATAGCAATCAAACTCTTTTTAGAAAATCAACCAAAGCCGGTCATCGAAGAACCAGGTATCGAGGAAACCTGGTCTGATGAAGAGACAGAACAGTTACTAGAATGGTATCTAAGCAATTTAGACAAGTTCAAGAACCCCAAATTCGTCAGAAGTTACCTCTGGATGGAAGTTTCTAGCATGCTAAACAAAAGTGCTATCACCTGTTCCAAGAAAATGTCTGAAATTCGGACGCAGTACAGGAATATGGTGAGGGAAAGACCGGAGGAATTGAATGAGTGGAGATTCCTTGATCTTTGTCAGAAGATATATGGGACAGGAAAGAAAGGTACTCCAATAAATAGTAATTAA
Protein Sequence
MDGSIVVKTEMGTNGEILLFYVDENGGNEEGVITTVESIENQAIQLQQDNSYIIQDVDSNEISMDQSALTDHWTEDETKKLLVFYNDNKQTFINGTTKKKHLWTVACKTMIVGKNPNSCEAKLNSLQAKYNEICGHIQKGVYVKWPYFELCHQIFHDETPMITVETLNTPEPQIIKVPALKQNYDNVMVVKKVNSRATADEKVEMMLKLYLKYKKNFQAEYWRRGIWETIALEIGEDDGEYWQKRFLNYKQHYLRLIDKRRECGSEGINWPYLELFDKIFEGDEDFHRKYLTEEYRQIENQAISEVEEPPSKVMDWDTTEMTVLVKYCFDCFDEFEDRTIPNNFLWTEIGRLLDKTAEACKSKYEELKNAHLDKYIEGGYDLRTRKPIAILFDNIISKNIENQIIKIGKIPEQLEMWKTEELDELVQFFYDNIEMYKDMVCHFVCWAGVTKKLKRNLQSCRSQWEDLVSLYKTILNDKKEDPDMQIDWRYIEVFDRIFDYGMDTNLLSGYETLKGFGQNQKTENSGKIGVKKVNIKLDDAMEEFTDDDESYDERGFTKRTKRRSGDSKAFKILEYYQKNKDKFSTTNRNKHSLWDILAKQIGISATQCAHRFRNLKQVYTAYVQREINKPEMPILWPYYALCKKVFGYRAIKSKLKNGKLDSDDSEDWSAKEIKRLINYFSHNFDDINLNIEDTTKWSDLAAEIGKGVNSCKEKLLELRKSYRKLKTMRSRNPDVKISWKYFNMFEDIYNAKENGVETLEVDDSETTYMELQASDDRVEQEEDDYQCIIVIPEGQDISQIENARIIIQDNSMPQEQEIVQTEPEPPKEVRPLAKWTKRTKKRLIIFYINYIRMHKGKEINAKEMWAEIASKIPNKTPLACRKMFAKLKANHKQIDESNPCMKKTPYFALMEKVMRLKPKFLKTEQNKALKDGKVYKDVALPDEKVVQALQYYLENIEDFVSPRFEKKYLWTELANYVCEPITKVFNKINYLKQAYNMDTDEVAGVKTPFAEYLKEIFAKEIAIKLFLENQPKPVIEEPGIEETWSDEETEQLLEWYLSNLDKFKNPKFVRSYLWMEVSSMLNKSAITCSKKMSEIRTQYRNMVRERPEELNEWRFLDLCQKIYGTGKKGTPINSN*