Basic Information

Gene Symbol
DBX1
Assembly
GCA_963924495.1
Location
OZ004658.1:7666661-7683663[-]
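
The Location value packs the sequence accession, start, end and strand into one string. A minimal Python sketch for splitting it apart is shown below. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the names `parse_location` and `Location` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "ACCESSION:start-end[strand]".
_LOCATION_RE = re.compile(r"^([^:]+):(\d+)-(\d+)\[([+-])\]$")


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("OZ004658.1:7666661-7683663[-]")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = loc.end - loc.start + 1
```

Under that assumption the locus spans 17003 bases on the minus strand of OZ004658.1.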

Transcription Factor Domain

TF Family
Homeobox
Domain
Homeobox
PFAM
PF00046
TF Group
Helix-turn-helix
Description
This entry represents the homeodomain (HD), a protein domain of approximately 60 residues that usually binds DNA. It is encoded by the homeobox sequence [7, 6, 8], which was first identified in a number of Drosophila homeotic and segmentation proteins, but is now known to be well conserved in many other animals, including vertebrates [1, 2], as well as in plants [4], fungi [5] and some species of lower eukaryotes. Many members of this group are transcriptional regulators, some of which operate differential genetic programs along the anterior-posterior axis of animal bodies [3]. The domain folds into a globular structure with three α-helices connected by two short loops that harbour a hydrophobic core. The second and third helices form a helix-turn-helix (HTH) motif, which makes intimate contacts with the DNA: the first helix of this motif helps to stabilise the structure, while the second helix binds to DNA via a number of hydrogen bonds and hydrophobic interactions, which occur between specific side chains and the exposed bases and thymine methyl groups within the major groove of the DNA. One particularity of the HTH motif in some of these proteins arises from the stereochemical requirement for glycine in the turn, which is needed to avoid steric interference of the β-carbon with the main chain: for cro and repressor proteins the glycine appears to be mandatory, while for many of the homeotic and other DNA-binding proteins the requirement is relaxed.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 41 1.5e-10 3.4e-08 33.7 0.1 3 45 160 202 159 204 0.96
2 41 1.5e-10 3.4e-08 33.7 0.1 3 45 215 257 214 259 0.96
3 41 1.5e-10 3.4e-08 33.7 0.1 3 45 270 312 269 314 0.96
4 41 1.5e-10 3.4e-08 33.7 0.1 3 45 325 367 324 369 0.96
5 41 1.5e-10 3.4e-08 33.7 0.1 3 45 380 422 379 424 0.96
6 41 1.5e-10 3.4e-08 33.7 0.1 3 45 435 477 434 479 0.96
7 41 1.5e-10 3.4e-08 33.7 0.1 3 45 490 532 489 534 0.96
8 41 1.5e-10 3.4e-08 33.7 0.1 3 45 545 587 544 589 0.96
9 41 1.5e-10 3.4e-08 33.7 0.1 3 45 600 642 599 644 0.96
10 41 1.5e-10 3.4e-08 33.7 0.1 3 45 655 697 654 699 0.96
11 41 1.5e-10 3.4e-08 33.7 0.1 3 45 710 752 709 754 0.96
12 41 1.5e-10 3.4e-08 33.7 0.1 3 45 765 807 764 809 0.96
13 41 1.5e-10 3.4e-08 33.7 0.1 3 45 820 862 819 864 0.96
14 41 1.5e-10 3.4e-08 33.7 0.1 3 45 875 917 874 919 0.96
15 41 1.5e-10 3.4e-08 33.7 0.1 3 45 930 972 929 974 0.96
16 41 1.5e-10 3.4e-08 33.7 0.1 3 45 985 1027 984 1029 0.96
17 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1040 1082 1039 1084 0.96
18 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1095 1137 1094 1139 0.96
19 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1150 1192 1149 1194 0.96
20 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1205 1247 1204 1249 0.96
21 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1260 1302 1259 1304 0.96
22 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1315 1357 1314 1359 0.96
23 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1370 1412 1369 1414 0.96
24 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1425 1467 1424 1469 0.96
25 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1480 1522 1479 1524 0.96
26 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1535 1577 1534 1579 0.96
27 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1590 1632 1589 1634 0.96
28 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1645 1687 1644 1689 0.96
29 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1700 1742 1699 1744 0.96
30 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1755 1797 1754 1799 0.96
31 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1810 1852 1809 1854 0.96
32 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1865 1907 1864 1909 0.96
33 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1920 1962 1919 1964 0.96
34 41 1.5e-10 3.4e-08 33.7 0.1 3 45 1975 2017 1974 2019 0.96
35 41 1.5e-10 3.4e-08 33.7 0.1 3 45 2030 2072 2029 2074 0.96
36 41 1.5e-10 3.4e-08 33.7 0.1 3 45 2085 2127 2084 2129 0.96
37 41 1.5e-10 3.4e-08 33.7 0.1 3 45 2140 2182 2139 2184 0.96
38 41 1.5e-10 3.4e-08 33.7 0.1 3 45 2195 2237 2194 2239 0.96
39 41 1.5e-10 3.4e-08 33.7 0.1 3 45 2250 2292 2249 2294 0.96
40 41 1.5e-10 3.4e-08 33.7 0.1 3 45 2305 2347 2304 2349 0.96
41 41 4.1e-09 9.5e-07 29.1 0.2 3 43 2360 2400 2359 2400 0.96
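
The domain table above has 13 whitespace-separated values per row. A hedged Python sketch for reading such rows follows. The field names are an interpretation of the header line (domain index, total domains, conditional and independent E-values, score, bias, then HMM, alignment and envelope coordinates, and accuracy), and `DomainHit` and `parse_hit` are hypothetical names, not part of HMMER. Only the first three rows of the record are included as sample input.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated row of the domain table."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# First three rows copied from the table in this record.
rows = [
    "1 41 1.5e-10 3.4e-08 33.7 0.1 3 45 160 202 159 204 0.96",
    "2 41 1.5e-10 3.4e-08 33.7 0.1 3 45 215 257 214 259 0.96",
    "3 41 1.5e-10 3.4e-08 33.7 0.1 3 45 270 312 269 314 0.96",
]
hits = [parse_hit(r) for r in rows]

# Distance between the alignment starts of consecutive hits.
spacing = [b.ali_from - a.ali_from for a, b in zip(hits, hits[1:])]
# Number of protein residues covered by the first alignment.
first_ali_len = hits[0].ali_to - hits[0].ali_from + 1
```

For these rows each aligned region covers 43 residues and consecutive hits start 55 residues apart, in line with the regular tandem repeat visible in the protein sequence.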

Sequence Information

Coding Sequence
ATGCTACCTAGTGACGAGCGAGCCAAGTGGCGCGACGACATGATACACAGTCTGCAGACCATACGGGCGGGGGGACACTTCTTGGTGGAGAACCTGCTCCGGAGCACCACCAGGGGGGACCAGAGTGATGATGACGAGTGTAACAGTAGTACTACGTGCTCCCCCCCGCCCCGCACGGTCCCTCTCAAGTTTAGTGTCAGTGCTATACTGGGAGACACGCTCACACAGGCAGACTCAGATCTGCGAGCATGCAGCGAGGTGGGGATGGTGACGTCTTCATGGTCCCCGGCGGTGGTCACTAAGCCGGTCGCCAGTCGAGCTATACCTATACATACACATCTACATCATCTACTAGCACATTGCCATCATCCATACAGAGCTGCAGTTGGTCTCACATCACATACACAAGTGTTCCCCCTACCTGGTGCCTTCCCCTGGGCTCACAGCTCTAGAGGCAAACCTCGACGCGgtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggatt
agagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactac
agaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacct
gaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcaggttagttacagctacttacttagttacctagttagtatgatgcgacgtgcagtattctccgacctacaacgtaaaggattagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttaggacttaaggatagtcagagaagaggttccaactacagaaatacatatctaaacctgaccgtaagaaactggcagagaagttag
Protein Sequence
MLPSDERAKWRDDMIHSLQTIRAGGHFLVENLLRSTTRGDQSDDDECNSSTTCSPPPRTVPLKFSVSAILGDTLTQADSDLRACSEVGMVTSSWSPAVVTKPVASRAIPIHTHLHHLLAHCHHPYRAAVGLTSHTQVFPLPGAFPWAHSSRGKPRRGMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKP
DRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVSYSYLLSYLVSMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQRRGSNYRNTYLNLTVRNWQRS
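
The correspondence between the Coding Sequence and Protein Sequence fields can be spot-checked by translating the start of the coding sequence. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. `translate` is a hypothetical helper, and only the first 30 bases of the coding sequence are used as input.

```python
from itertools import product

# Standard genetic code (assumed), laid out in the conventional
# T, C, A, G order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    seq = cds.upper()  # the record mixes upper- and lower-case bases
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 bases of the Coding Sequence field.
cds_prefix = "ATGCTACCTAGTGACGAGCGAGCCAAGTGG"
protein_prefix = translate(cds_prefix)
```

The translated prefix is MLPSDERAKW, which matches the first ten residues of the Protein Sequence field.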

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-