Basic Information

Gene Symbol
-
Assembly
GCA_946811585.1
Location
CAMPFD010005459.1:19522-21156[+]
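
The Location field packs contig, coordinates, and strand into one string. A minimal sketch of splitting it apart is shown here. The `contig:start-end[strand]` layout is inferred from this single record, and the names `Location` and `parse_location` are hypothetical helpers, not part of any database API. The span calculation assumes 1-based, inclusive coordinates, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Layout inferred from the record: contig:start-end[strand]
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Parse a 'contig:start-end[strand]' string into its parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("CAMPFD010005459.1:19522-21156[+]")
print(loc.contig, loc.start, loc.end, loc.strand, loc.span)
```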

Transcription Factor Domain

TF Family
MH1
Domain
MH1 domain
PFAM
PF03165
TF Group
Unclassified Structure
Description
The MH1 (MAD homology 1) domain is found at the amino terminus of MAD-related proteins such as Smads. It is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain shows that a highly conserved 11-residue beta hairpin binds the DNA consensus sequence GNCN in the major groove, an interaction shown to be vital for the transcriptional activation of target genes. Not all MH1 domains bind DNA, however: Smad2 cannot, because it carries a large insertion within the hairpin that presumably abolishes DNA binding. A basic helix (H2) in MH1 containing the nuclear localisation signal KKLKK has been shown to be essential for Smad3 nuclear import. Smads also use the MH1 domain to interact with transcription factors such as Jun, TFE3, Sp1, and Runx [2, 1].
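
The two motifs named in the description, the GNCN DNA consensus and the KKLKK nuclear localisation signal, can be located with a simple pattern scan. The sketch below is illustrative only. `find_motif` is a hypothetical helper, the two input strings are made-up toy sequences rather than data from this record, and a textual match says nothing about whether binding or nuclear import actually occurs.

```python
import re


def find_motif(sequence: str, pattern: str) -> list[int]:
    """Return 0-based start positions of all matches, including overlapping ones."""
    # A lookahead makes each match zero-width, so overlapping hits are reported.
    lookahead = re.compile(f"(?=({pattern}))")
    return [m.start() for m in lookahead.finditer(sequence.upper())]


# N in the GNCN consensus stands for any nucleotide.
GNCN = "G[ACGT]C[ACGT]"
NLS = "KKLKK"

# Toy inputs, invented for illustration.
dna_hits = find_motif("AGTCTAGACA", GNCN)
nls_hits = find_motif("MSKKLKKTG", NLS)
print(dna_hits, nls_hits)
```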
Hmmscan Out
#   of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1   15  0.0039    44        6.2    0.8   77        101     23        47      9         49      0.81
2   15  0.31      3.5e+03   0.1    0.1   78        96      50        68      46        75      0.81
3   15  0.0034    38        6.4    0.2   77        101     75        99      69        101     0.83
4   15  0.0025    28        6.8    0.1   77        102     101       126     97        127     0.79
5   15  0.0019    21        7.2    0.3   78        101     128       151     125       153     0.78
6   15  0.16      1.8e+03   1.0    0.1   77        96      179       198     172       204     0.77
7   15  0.32      3.6e+03   0.0    0.0   77        96      205       224     199       231     0.84
8   15  0.086     9.7e+02   1.9    0.1   86        101     241       256     227       258     0.74
9   15  0.012     1.4e+02   4.6    3.5   59        101     292       334     259       336     0.72
10  15  0.0017    19        7.4    0.1   74        102     333       361     330       362     0.80
11  15  0.0027    30        6.7    0.7   77        101     388       412     381       414     0.81
12  15  0.0016    17        7.5    0.1   78        101     415       438     410       440     0.79
13  15  1.4       1.6e+04   -2.0   0.0   78        95      441       458     436       460     0.79
14  15  0.0066    74        5.5    0.3   77        101     466       490     457       492     0.74
15  15  0.0055    61        5.7    0.1   78        101     493       516     489       518     0.78
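
Each row of the hmmscan table holds 13 whitespace-separated values: domain index, domain count, conditional and independent E-values, score, bias, HMM/alignment/envelope coordinate pairs, and accuracy. A minimal sketch of loading such rows for filtering follows. `DomainHit` and `parse_row` are hypothetical names, only the first five rows are copied in to keep the example short, and the score cutoff of 6.0 is an arbitrary illustration rather than a threshold used by the database.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Convert one whitespace-separated table row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# First five rows of the table above, copied verbatim.
ROWS = """
1 15 0.0039 44 6.2 0.8 77 101 23 47 9 49 0.81
2 15 0.31 3.5e+03 0.1 0.1 78 96 50 68 46 75 0.81
3 15 0.0034 38 6.4 0.2 77 101 75 99 69 101 0.83
4 15 0.0025 28 6.8 0.1 77 102 101 126 97 127 0.79
5 15 0.0019 21 7.2 0.3 78 101 128 151 125 153 0.78
"""

hits = [parse_row(line) for line in ROWS.strip().splitlines()]

# Lowest independent E-value among the loaded rows.
best = min(hits, key=lambda h: h.i_evalue)

# Illustrative cutoff only: domain indices with a bit score of at least 6.0.
high_scoring = [h.index for h in hits if h.score >= 6.0]
print(best.index, best.ali_from, best.ali_to, high_scoring)
```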

Sequence Information

Coding Sequence
ATGGACGGTATGTTGCAATCCAACACATGGAAGGTATGTTGCGATCCAACACAGGGACGGTATGTTGCAATCAAACACAAGGAAGGTATGTTGCAATCCAACACATGGAAGGTATGTTGCAATCCAACACATGGAAGGTATGTTGCGATCAAACACAGGGCCGGTATGTTTCAATCCAACACATGGAAGGTATGTTGCGATCCAACACAGGGACGGTATGTTGCGATCAAACACAGGGACGTTATGTTGCAATCCAACTCCAGGAAGGTATGTTGCGATCCGACACATGGAGAGTATGTTGCAATCCAACACATGGACGGTATGTTGCGATCCGACAGGGGGAGAGTATGTTGCAATCCAACACATCGAAGGTATGTTGCGATCCAACACAGGGACGGTATGTTGCAATCAAACACAAGGAAGGTATGTTGCAATCCAACACATGGAAGGTATGTTGCAATCCAACACATGGAAGGTATGTTGCGATCAAACACAGGGACGGTATGTTTCAATACAACACATGGAAGGTATGTTGCGATCCAACACATGAACGGTATGTTGCGATCCGACACATGGAGAGTATGTTGCAATCCAACTCCTGGAAGGTATGTTGCGATCCGACACATGGAGAGTATGTTGCAATCCAACACATGGACGGTATGTTGCGATCCGACAGGGGGAGAGTATGTTGCAATCCAACACATCGAAGACGGTATCTTGAGATCCGACACATGGAGTGTATGTTGCGATCCAACACATGGACGGTATGTTGCAATCCAACACATGGAAGGTATGTTGCGATCCAACACAGGGACGGTATGTTGCAATCAAACACAAGGAAGGTATGTTGCAATCCAACACATGGAAGGTATGTTGCAATCCAACACATGGAAGGTATGTTGCAATCAAACACAAGGAAGGTATGTTGCAATCCGACACATGGAGAGTATGTTGCAATCCAACTCCTGGAAGGTATGTTGCGATCCGACACATGGAGAGTATGTTGCAATCCAACACATGGACGGTATGTTGCGATCCGACAGGGGGAGAGTATGTTGCAATCCAACACATCGAAGGTATGTTGCGATCAAACACAGGGACGGTATGTTGCAATTCATTACATTGACGGCGTGTTTCAATCCAACACATAGACGGTATCTTGAGATCCGACACATGGAGTGTATGTTGCGATCCAACACATGGACGGTATGTTGCAATCCAACACATGGAAGGTATGTTGCGATCCAACACAGGGACGGTATGTTGCAATCAAACACAAGGAAGGTATGTTGCAATCCAACACATGGAAGGTATGTTGCAATCCAACACATGGAAGGTATGTTGCGATCAAACACAGGGACGGTATGTTGCAATCAAACACAAGGAAGGTATGTTGCAATCCAACACATGGAAGGTATGTTGCAATCCAACACATGGGGAGTATGTTGCAATCCAACACATGGACGGTGTGTTGCAATCCAACACATTGACGATATGTTTCGATCCGACACATGGGGCGTATGTTGTAATCCAACACATGGAAGGTATGTTGCGATCTAA
Protein Sequence
MDGMLQSNTWKVCCDPTQGRYVAIKHKEGMLQSNTWKVCCNPTHGRYVAIKHRAGMFQSNTWKVCCDPTQGRYVAIKHRDVMLQSNSRKVCCDPTHGEYVAIQHMDGMLRSDRGRVCCNPTHRRYVAIQHRDGMLQSNTRKVCCNPTHGRYVAIQHMEGMLRSNTGTVCFNTTHGRYVAIQHMNGMLRSDTWRVCCNPTPGRYVAIRHMESMLQSNTWTVCCDPTGGEYVAIQHIEDGILRSDTWSVCCDPTHGRYVAIQHMEGMLRSNTGTVCCNQTQGRYVAIQHMEGMLQSNTWKVCCNQTQGRYVAIRHMESMLQSNSWKVCCDPTHGEYVAIQHMDGMLRSDRGRVCCNPTHRRYVAIKHRDGMLQFITLTACFNPTHRRYLEIRHMECMLRSNTWTVCCNPTHGRYVAIQHRDGMLQSNTRKVCCNPTHGRYVAIQHMEGMLRSNTGTVCCNQTQGRYVAIQHMEGMLQSNTWGVCCNPTHGRCVAIQHIDDMFRSDTWGVCCNPTHGRYVAI
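
The protein sequence can be checked against the coding sequence by translating it codon by codon. The sketch below assumes the standard genetic code, which the record does not state, and uses a hypothetical `translate` helper. To stay short, it translates only the first 30 nucleotides and the final 9 nucleotides of the coding sequence above rather than the full string.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; trailing bases are ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 9 nt of the coding sequence in this record.
head = translate("ATGGACGGTATGTTGCAATCCAACACATGG")
tail = translate("GCGATCTAA")
print(head, tail)
```

The translated head matches the first ten residues of the listed protein, and the tail ends in a stop codon after the final two residues.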

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-