Basic Information

Gene Symbol
-
Assembly
GCA_951394065.1
Location
OX596066.1:1429984-1449657[+]
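
The Location field packs the sequence accession, coordinates, and strand into a single string. A minimal sketch of splitting it into parts is shown here. The names `Location`, `parse_location`, and `span_length` are illustrative and not part of the record, and the length calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "OX596066.1:1429984-1449657[+]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'seqid:start-end[strand]' string into its four parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("OX596066.1:1429984-1449657[+]")
print(loc.seqid, loc.strand, span_length(loc))
```

Under that coordinate assumption the locus spans 19,674 bp on the forward strand.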

Transcription Factor Domain

TF Family
MYB
Domain
Myb_DNA-binding domain
PFAM
PF00249
TF Group
Helix-turn-helix
Description
This family contains the DNA-binding domains from Myb proteins, as well as the SANT domain family [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 19 0.022 26 5.7 1.0 3 44 153 210 151 212 0.66
2 19 0.011 13 6.6 0.0 1 36 398 449 398 457 0.80
3 19 0.28 3.3e+02 2.2 0.0 23 36 479 491 469 499 0.84
4 19 0.45 5.4e+02 1.5 0.1 23 36 521 533 511 540 0.86
5 19 0.42 5.1e+02 1.6 0.0 23 36 563 575 553 580 0.84
6 19 0.42 5e+02 1.6 0.0 23 36 605 617 596 624 0.84
7 19 0.28 3.3e+02 2.2 0.0 23 36 647 659 637 667 0.84
8 19 0.45 5.4e+02 1.5 0.1 23 36 689 701 679 708 0.86
9 19 0.42 5.1e+02 1.6 0.0 23 36 731 743 721 748 0.84
10 19 0.4 4.8e+02 1.6 0.1 23 36 773 785 764 793 0.84
11 19 0.39 4.7e+02 1.7 0.0 23 36 815 827 804 834 0.84
12 19 0.4 4.8e+02 1.6 0.0 23 36 857 869 847 876 0.84
13 19 0.42 5.1e+02 1.6 0.0 23 36 899 911 889 916 0.84
14 19 0.42 5e+02 1.6 0.0 23 36 941 953 932 960 0.84
15 19 0.4 4.8e+02 1.6 0.0 23 36 983 995 973 1002 0.84
16 19 0.42 5.1e+02 1.6 0.0 23 36 1025 1037 1015 1042 0.84
17 19 0.00019 0.23 12.3 0.7 23 43 1067 1086 1057 1089 0.87
18 19 0.014 16 6.3 1.8 3 44 1173 1230 1171 1232 0.80
19 19 5.9e-09 7e-06 26.7 0.3 1 45 1288 1345 1288 1346 0.94
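
Most of the 19 domain hits above have large independent E-values, so it can help to separate the well-supported hits from the weak repeats. The sketch below parses a few rows copied from the table and keeps those under an i-Evalue cutoff. The names `DomainHit`, `parse_row`, and `significant_hits` are illustrative, and the cutoff of 0.01 is an assumed threshold, not one stated in this record.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated row of the 13-column domain table."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]),
        int(fields[1]),
        float(fields[2]),
        float(fields[3]),
        float(fields[4]),
        float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


def significant_hits(hits: List[DomainHit], max_i_evalue: float) -> List[DomainHit]:
    """Keep hits whose independent E-value is at or below the cutoff."""
    return [hit for hit in hits if hit.i_evalue <= max_i_evalue]


# A subset of rows copied verbatim from the table above.
ROWS = [
    "1 19 0.022 26 5.7 1.0 3 44 153 210 151 212 0.66",
    "2 19 0.011 13 6.6 0.0 1 36 398 449 398 457 0.80",
    "3 19 0.28 3.3e+02 2.2 0.0 23 36 479 491 469 499 0.84",
    "17 19 0.00019 0.23 12.3 0.7 23 43 1067 1086 1057 1089 0.87",
    "19 19 5.9e-09 7e-06 26.7 0.3 1 45 1288 1345 1288 1346 0.94",
]

hits = [parse_row(row) for row in ROWS]
# 0.01 is an assumed cutoff chosen only for illustration.
for hit in significant_hits(hits, max_i_evalue=0.01):
    print(hit.index, hit.i_evalue, hit.ali_from, hit.ali_to)
```

With this assumed cutoff, only domain 19 (alignment positions 1288 to 1345) passes among the sampled rows.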

Sequence Information

Coding Sequence
ATGTGTATAGATTATCAAGAATTCAGTGTGTTTCAGCAAGATAAAAACCTTTCTCAATCAGATGAACACACCCTTGGGTCCAACAAACCTGCCCCACCCTTATTTGAAAGTACATGTGCTTTGGAGGGAGGAGACTCTGCATTATTTAAACTGGAAGATGACCCTTTGGACCAGTTTAAaTCCGaagaggaaataaataagaaaacttatggtgaaaatgatgatgatgacggtgAAGCTTATAATGATGAAGAAtctgacgatgatgatgaagttggcgatgatgacgatgatgaagcaggcgatgatgatgaagatgagaTGTCTGACCAGGAATCTTCACAGGAATTTAAAAGTGAACCGGAGAGAAAGGCTAAAAGAAAACTACCTCCCATTACAAAGATTGACTACGATAGCGATGGTGACCAAGCGTCCTCGCCTGCAACCCCGCGCCTATCCTGGACAAGCGACAGCGACAAGACCCTCATAGAATTCGTGAGGTGCAACGAAGCCCTGTACAATCCTAAGAGCAAATTGTACAAAATAGCGGTGTTGAAGAAAAAACTGTGGATTGATATAGGGAACATACTCGATAAGACTGACGCGGATTGCTACAAGCGTTGGTGTTACGTCCGCGACTACTACATCCGGCGCCGACACAAGATGGCCGCTGGCGGCGAAGCGGCAAAGAAGAGGGCGCAGTTACTGTCTTTCTTGGACCGTGTGCCCACGGCTAGGGGGGGGTGGGCGCCGAAAGTAGGGGGGAGAGATAACCGGAGCGACGATAACGAGTGGATAGAGACCGTGTGCCTAGCAGCTGGAGACGGGATCAGGGGGGGGGGGGTGAAAGTAGGGGGTAGAGATAACCGGAGCGACGATAACGAGTGGATAGAGACCGTGTGCCTAGCAGCTGGAGACGGGATCAGGGGGGGGGTGAAAGTAGGGGGTAGAGATAACCGGAGCGACGATAACGAGTGGATAGAGACCGTGTGCCTAGCAGCTGGAGACGGGATCAGGGGGGGGGGGGTGAAAGTAGGGGGTAGAGATAACCGGAGCGACGATAACGAGTGGATAGAGACGCTGCCTGAGCGACCACCATCCACCGACTCCCGCGACAATACTGTCAAGGAGGAAGCGAGTAATAACGACCCGTGGGAGAACATTCGCATTGGCGAGCAGTATTCGAAAACGCGTGAGCCTTGGACCGCCGCCAAGGAAGACATATTGATACAGTTTGTGAGTCGCAACGAGGCGCTATACAACCCGCGCTGTAAGGACTATCGGCGGGCTCACATCAAACGACAACTGTGGAGTGAGATAGCCTCTCTACTCAACAAGACAGGTAACATATTGATACAGTTCGTGAGACGCAACGAGGCGCTATACAACCCGCGCTGTAAGGACTATCGGCGGGCTCACATCAAACGACAACTGTGGAGTGAGATAGCCTCTCTACTCAACAAGACAGATAACATATTGATACAGTTCGTGAGACGCAACGAGGCGCTATACAACCCGCGCTGTAAGGACTATCGGTGGGCTCACATCAAACGACAACTGTGGAGTGAGATAGCCTCTCTACTCAACAAGACAGGTAACATATTGATACAGTTCGTGAGACGCAACGAGGCGCTATACAACCCGCGCTGTAAGGACTATCGGCGGGCTCACATCAAACGACAACTGTGGAGTGAGATAGCCTCTCTACTCAACAAGACAGGTAACATATTGATACAGTTCGTGAGTCGCAACGAGGCGCTATACAACCCGCGCTGTAAGGACTATCGGCGGGCTCACATCAAACGACAACTGTGGAGTGAGATAGCCTCTCTACTCAACAAGACAGGTAACATATTGATACAGTTCGTGAGACGCAACGAGGCGCTATACAACCCGCGCTGTAAGGACTATCGGCGGGCTCACATCAAACGACAACTGTGGAGTGAGATAGCCTCTCTACTCAACAAGACAGATAACATATTGATACAGTTCGTGAGACG
CAACGAGGCGCTATACAACCCGCGCTGTAAGGACTATCGGTGGGCTCACATCAAACGACAACTGTGGAGTGAGATAGCCTCTCTACTCAACAAGACAGGTAACATATTGATACAGTTCGTGAGACGCAACGAGGCGCTATACAACCCGCGCTGTAAGGACTATCGGCGGGCTCACATCAAACGACAACTGTGGAGTGAGATAGCCTCTCTACTCAACAAGACAGGTAACATATTGATACAGTTCGTGAGTCGCAACGAGGCGCTATACAACCCGCGCTGTAAGGACTATCGGCGGGCTCACATCAAACGACAACTGTGGAGTGAGATAGCCTCTCTACTCAACAAGACAGGTAACATATTGATACAGTTCGTGAGACGCAACGAGGCGCTACACAACCCGCGCTGTAAGGACTATCGGCGGGCTCACATCAAACGACAACTGTGGAGTGAGATAGCCTCTCTACTCAACAAGACAGGTAACATATTGATACAGTTCGTGAGACGCAACGAGGCGCTATACAACCCGCGCTGTAAGGACTATCGGCGGGCTCACATCAAACGACAACTGTGGAGTGAGATAGCCTCTCTACTCAACAAGACAGGTAACATATTGATACAGTTCGTGAGACGCAACGAGGCGCTATACAACCCGCGCTGTAAGGACTATCGGCGGGCTCACATCAAACGACAACTGTGGAGTGAGATAGCCTCTCTACTCAACAAGACAGGTAACATATTGATACAGTTCGTGAGTCGCAACGAGGCGCTATACAACCCGCGCTGTAAGGACTATCGGCGGGCTCACATCAAACGACAACTGTGGAGTGAGATAGCCTCTCTACTCAACAAGACAGGTAACATATTGATACAGTTCGTGAGACGCAACGAGGCGCTATACAACCCGCGCTGTAAGGACTATCGGCGGGCTCACATCAAACGACAACTGTGGAGTGAGATAGCCTCTCTACTCAACAAGACAGGTAACATATTGATACAGTTCGTGAGACGCAACGAGGCGCTATACAACCCGCGCTGTAAGGACTATCGGCGGGCTCACATCAAACGACAACTGTGGAGTGAGATAGCCTCTCTACTCAACAAGACAGGTAACATATTGATACAGTTCGTGAGTCGCAACGAGGCGCTATACAACCCGCGCTGTAAGGACTATCGGCGGGCTCACATCAAACGACAACTGTGGAGTGAGATAGCCTCTCTACTCAACAAGACAgACACAGAATGCTACAAGCGCTGGAGCTACGTCCGCGACTACTACATCCGGCGGAAGTCAAATCCGGGCGCTCTGTCCACCGGCAAGGCGGCCAAGAGACGCGCTCAGCTTCTCGCCTTCCTCACGCTTCacgaacagacagacgaacagGAGGAGCAAACCCTTAAATGCGAGTTGGAACAGCTGGCCGCCTCTAATGCGATCGCAACAGACCACGAGGCTGGCTCAGATATTGGACAAGATGAGGATAATACGAGAGAAGACATGTGCCCCCGAGGTCCCAAGGCGTGGACCACAGACTCGGATGAGGTTCTCATAGACCTGGTCGGCAGCCATCAGGCGTTGTTCAATCCGCGCTGTAAGGAGTTTAGGAGACCGCAGCTGAAGAGGCGACTGTGGCGGGACATAGCACGTGACCTGAGGACCACAGAAACGGATTGCTACAAGCGCTGGTGCTACGTCCGCGACTACTACATCAGACGGCGCGGTAAGGTCGGCGTGCTCTCCACCGGGAAAGCTGCCAAGCGACGAGCCGACAAACTCGTGTTTCTCGACCATATTTCTTCACATGAGGAGAAAGAGAGTTCACCGTCGCCATTACAGCCAGAAGAGACTCGGTGCGAGGAGAGCCGCGTGTCGTGGTCCGGGGAGGACGACGCGGCGCTGATACAGTTCGTGCGGCGCCACGGCTCGCTGCACGACCGCAAGCGGCGGGCGGGCCGGCTGAAGGGCGCGCTGTGGACCGAGATCGCTCACACCCTGAACAAAA
CAGATGCGGCGTGCAACTCGCGTTGGAACTACGTCCGCGACTACTACATCCGTCGGCGAGGCAAGCCGTGCGGCGAGGCGGCGGCCAAGCGAGCGGCGCTACTCACCTTCCTTGACAAAGGCACGCGGCGGAAAGTGAAACGGAGTCATCACTAA
Protein Sequence
MCIDYQEFSVFQQDKNLSQSDEHTLGSNKPAPPLFESTCALEGGDSALFKLEDDPLDQFKSEEEINKKTYGENDDDDGEAYNDEESDDDDEVGDDDDDEAGDDDEDEMSDQESSQEFKSEPERKAKRKLPPITKIDYDSDGDQASSPATPRLSWTSDSDKTLIEFVRCNEALYNPKSKLYKIAVLKKKLWIDIGNILDKTDADCYKRWCYVRDYYIRRRHKMAAGGEAAKKRAQLLSFLDRVPTARGGWAPKVGGRDNRSDDNEWIETVCLAAGDGIRGGGVKVGGRDNRSDDNEWIETVCLAAGDGIRGGVKVGGRDNRSDDNEWIETVCLAAGDGIRGGGVKVGGRDNRSDDNEWIETLPERPPSTDSRDNTVKEEASNNDPWENIRIGEQYSKTREPWTAAKEDILIQFVSRNEALYNPRCKDYRRAHIKRQLWSEIASLLNKTGNILIQFVRRNEALYNPRCKDYRRAHIKRQLWSEIASLLNKTDNILIQFVRRNEALYNPRCKDYRWAHIKRQLWSEIASLLNKTGNILIQFVRRNEALYNPRCKDYRRAHIKRQLWSEIASLLNKTGNILIQFVSRNEALYNPRCKDYRRAHIKRQLWSEIASLLNKTGNILIQFVRRNEALYNPRCKDYRRAHIKRQLWSEIASLLNKTDNILIQFVRRNEALYNPRCKDYRWAHIKRQLWSEIASLLNKTGNILIQFVRRNEALYNPRCKDYRRAHIKRQLWSEIASLLNKTGNILIQFVSRNEALYNPRCKDYRRAHIKRQLWSEIASLLNKTGNILIQFVRRNEALHNPRCKDYRRAHIKRQLWSEIASLLNKTGNILIQFVRRNEALYNPRCKDYRRAHIKRQLWSEIASLLNKTGNILIQFVRRNEALYNPRCKDYRRAHIKRQLWSEIASLLNKTGNILIQFVSRNEALYNPRCKDYRRAHIKRQLWSEIASLLNKTGNILIQFVRRNEALYNPRCKDYRRAHIKRQLWSEIASLLNKTGNILIQFVRRNEALYNPRCKDYRRAHIKRQLWSEIASLLNKTGNILIQFVSRNEALYNPRCKDYRRAHIKRQLWSEIASLLNKTDTECYKRWSYVRDYYIRRKSNPGALSTGKAAKRRAQLLAFLTLHEQTDEQEEQTLKCELEQLAASNAIATDHEAGSDIGQDEDNTREDMCPRGPKAWTTDSDEVLIDLVGSHQALFNPRCKEFRRPQLKRRLWRDIARDLRTTETDCYKRWCYVRDYYIRRRGKVGVLSTGKAAKRRADKLVFLDHISSHEEKESSPSPLQPEETRCEESRVSWSGEDDAALIQFVRRHGSLHDRKRRAGRLKGALWTEIAHTLNKTDAACNSRWNYVRDYYIRRRGKPCGEAAAKRAALLTFLDKGTRRKVKRSHH
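
The protein sequence can be checked against the coding sequence by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The names `CODON_TABLE` and `translate` are illustrative. Lowercase (soft-masked) bases are upper-cased before lookup.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters outside ACGT are emitted as 'X'.
    """
    seq = cds.upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 48 nt and last 15 nt of the coding sequence above.
cds_start = "ATGTGTATAGATTATCAAGAATTCAGTGTGTTTCAGCAAGATAAAAAC"
cds_end = "CGGAGTCATCACTAA"
print(translate(cds_start))  # MCIDYQEFSVFQQDKN
print(translate(cds_end))    # RSHH
```

Both fragments agree with the start (MCIDYQEFSVFQQDKN) and end (RSHH) of the listed protein sequence.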

Similar Transcription Factors

Sequences clustered by similarity with MMseqs2 at each identity threshold

100% Identity
-
90% Identity
-
80% Identity
-