Open Access Article
M. Matovina
a,
D. Agić
b,
M. Abramića,
S. Matića,
Z. Karačića and
S. Tomić
*a
aDivision of Organic Chemistry and Biochemistry, Ruđer Bošković Institute, Bijenička cesta 54, Zagreb, Croatia. E-mail: sanja.tomic@irb.hr
bJosip Juraj Strossmayer University of Osijek, Faculty of Agriculture in Osijek, Vladimira Preloga 1, Osijek, Croatia
First published on 21st July 2017
Dipeptidyl peptidase III (DPP III) is a cytosolic enzyme belonging to the metallopeptidase family M49, involved in the final steps of protein catabolism. More than a hundred missense mutations can be found in the coding region of the human DPP3 gene when searching cBioPortal for Cancer Genomics. The role of two highly conserved residues in the family M49, whose mutations G313W and R510W were detected in human cancer, was investigated using combined experimental and computational approaches (substrate docking and MD simulations). Several mutants of human DPP III were expressed and purified as recombinant proteins, and their biochemical properties were determined. The conservative substitution of Arg510 with lysine mildly decreased enzyme activity activity for Arg-Arg-2-naphtylamide substrate, while the substitutions of Arg510 with glutamine and Gly313 with alanine substantially decreased enzyme activity, and tryptophan substitutions found in cancer, G313W and R510W, almost abolished enzyme activity. MD simulations showed that substitution of Gly313, and especially Arg510 with tryptophan, significantly increases the enzyme flexibility, particularly that of the binding site including the H450ELLGH455 motif, and influences the substrate interactions with the catalytic His568. The results clearly indicate that, besides the enzyme structure, its dynamics properties also significantly influence the human DPP III activity.
From a structural point of view, the DPP III is a two-domain protein with the motifs HEXXGH and EE*XR(K)AE(D) considered as its trademark.8,9 Histidines from the first motif and glutamate* from the second one take part in the zinc ion coordination, while the glutamate from the first motif is crucial for the catalytic activity of DPP III, namely the hydrolysis of dipeptides from the N-terminus of its substrates consisting of three or more amino acids.10
Until now several 3D structures of h.DPP III have been determined, ligand free and in complex with natural peptides. The structure of the ligand-free enzyme (PDB codes: 3FVY and 5EGY) was determined in the so-called open and closed form, while that of the liganded enzyme only in a more compact, closed, form (PDB codes: 3T6B, 3T6J, 5E33, 5E3A, 5E2Q, 5EHH, 5E3C).11 The open, ligand-free enzyme structure is characterized with the wide inter-domain cleft (∼40 Å wide and ∼40 Å high) with the catalytic zinc ion bound at its “roof”. Using computational approaches we have traced the protein closure,12,13 gained better insight into enzyme broad substrate specificity and determined the reaction mechanism of h.DPP III.14–16
More than hundred missense mutations can be found in the coding region of h.DPP III gene by searching cBioPortal for Cancer Genomics (http://www.cbioportal.org/; accessed September 2014).17,18 Most of those mutations were predicted to have low or medium impact on protein function, however, two of them, G313W and R510W, which reside in the conserved regions 1 and 4 of DPP III,9 were predicted to have high influence on protein functionality. Those mutations were found in the tumour samples from lung adenocarcinoma, and colorectal adenocarcinoma samples, respectively. Mutation G313W was found in the provisional TCGA (The Cancer Genome Atlas) study, and was removed from newer releases of cBioPortal (http://www.cbioportal.org/; accessed October 2015), however, this study was aimed at finding the cause of the DPP III inactivation by this mutation, and not at studying the potential role of these mutations in the development of cancer so we decided to proceed with the research of this mutant regardless of that. We investigated the influence of those mutations on the enzyme biochemical and physicochemical properties. Those mutants exhibited almost complete loss of activity towards synthetic substrate, Arg-Arg-2-naphthylamide (Arg2-2NA; RRNA) compared to the wild type (wt) h.DPP III, and we set out to determine the cause of this activity loss by combining experimental approaches with computational simulations. In order to determine the root of the activity loss in these two mutants, we investigated three other amino acid substitutions, in which the native amino acid residues were substituted by an amino acid that differs less from the original than mutants found in tumour cells. In this way we were able to gain better understanding of how amino acid substitutions at particular position influence enzyme structure and activity. Altogether, we analysed 5 different mutants, G313W and R510W that were found in cancer, and the more similar (conserved) amino acid substitutions at the same positions, G313A, R510K and R510Q.
| Protein h.DPP III | Specific activity/nmol min−1 mg−1 | RA Ala-Arg-2NA/% | RA Phe-Arg-2NA/% |
|---|---|---|---|
| a RA Relative activity towards substrates (normalized to that for Arg2-2NA). | |||
| WT | 47 900 |
26 | 27 |
| G313W | 2 | 40 | 40 |
| G313A | 76 | 6 | 20 |
| R510W | 31 | 18 | 23 |
| R510K | 21 515 |
25 | 19 |
| R510Q | 3284 | 5 | 5 |
Measurements of specific activity of the wt h.DPP III and the mutants towards Arg2-2NA showed that tryptophan mutants, G313W and R510W, have approximately 24
000, and 1600 fold lower specific activity than the wt h.DPP III, respectively. Substitution of alanine at amino acid residue 313 increased the activity of the protein compared to the G313W mutant, however, it was still considerably lower than the wt h.DPP III, showing that glycine at position 313 is essential for the h.DPP III activity. On the other hand, substitution by lysine instead of tryptophan at position 510 returned the specific activity of the mutant to approximately half of the activity of the wt h.DPP III, while substitution by glutamine raised enzyme activity comparing to the R510W mutant, but the mutant R510Q still had approximately 14 fold lower specific activity than the wt h.DPP III (Table 1). That result showed that for the efficient enzymatic activity h.DPP III requires a positively charged amino acid at position 510. The pH optimum for Arg2-2NA hydrolysis was around 8.6 for R510 mutants, while G313W, and G313A had the optimum at pH 8.0, and 7.4, respectively. The studied substitutions slightly change the relative preferences for different dipeptidyl 2-naphthylamides (Arg2-2NA, Ala-Arg-2NA and Phe-Arg-2NA) compared to the wt, however, Arg2-2NA was still the preferred substrate for all mutants (Table 1).
The catalytic properties of the wt h.DPP III and all mutated forms were determined for the hydrolysis of characteristic synthetic substrate Arg2-2NA from two or three independent measurements at pH 8.6 (Table 2), except for the G313A whose catalytic properties were determined from one measurement at pH 7.4. We did not determine the G313W kinetic parameters due to extremely low enzymatic activity of that mutant. All tested proteins showed similar KM as the wt h.DPP III, except G313A that had more than 20 times higher KM, while kcat differed by 2–3 orders of magnitude for mutants G313A and R510W, respectively. The kcat value of R510Q was 26 times lower than the wt, and R510K had similar kcat as the wt.
| Protein h.DPP III | KM/μM | kcat/s−1 | kcat/KM × 106 M−1 s−1 |
|---|---|---|---|
| a The values are the average ± st. dev. of 2 (WT and R510W) or 3 (R510K and R510Q) independent measurements at pH 8.6, except G313A for which we conducted 1 measurement at pH 7.4. | |||
| WT | 5.45 ± 0.53 | 35.400 ± 6.364 | 6.475 ± 0.538 |
| G313W | — | — | — |
| G313A | 114.57 | 0.224 | 0.002 |
| R510W | 4.48 ± 1.49 | 0.012 ± 0.003 | 0.003 ± 0.000 |
| R510K | 4.06 ± 0.87 | 27.533 ± 9.905 | 7.066 ± 3.269 |
| R510Q | 2.10 ± 1.32 | 1.350 ± 0.365 | 0.864 ± 0.533 |
We have revealed or confirmed the importance of several conserved amino acid residues situated outside of the two active-site motifs H450ELLGH455 and E507ECRAE512, in human DPP III (Tyr318, Trp300, His568, Asp496), so far.19–22 Among previously characterized h.DPP III mutants, H568N showed the lowest kcat value for the hydrolysis of Arg2-2NA, three orders of magnitude lower than the wt h.DPP III,21 without significant change in KM value. According to steady-state kinetic parameters, this protein variant is similar to R510W characterized in our present study.
| h.DPP III variant | Zn2+ coordination | Rgyr | |||
|---|---|---|---|---|---|
| H450 | H455 | E451 | E508 | Range (Å) | |
| a In the case of two simulation of variants, R510K and R510Q, His455 reoriented and during about 4.5% of the total simulation time (200 ns) have not coordinated the zinc ion. | |||||
| WT | + | + | m | m | 26.9–28.4 |
| G313W | + | + | m | m/b | 26.2–28.6 |
| G313A | + | + | m | m/b | 26.4–28.5 |
| R510K | + | ±a | m | m/b | 26.3–28.2 |
| R510W | + | + | m | m | 26.2–28.3 |
| R510Q | + | ±a | m | m | 26.4–28.4 |
| WT-closed | + | + | m | m/b | 25.0–25.5 |
| WT-RRNA | + | + | m | b | 24.9–25.5 |
| G313W-RRNA | + | + | m/b | b | 24.8–25.6 |
| G313A-RRNA | + | + | m/b | b | 24.9–25.5 |
| R510W-RRNA | + | + | m | b | 24.9–25.7 |
| R510Q-RRNA | + | + | m/b | m/b | 24.8–25.5 |
| R510K-RRNA | + | + | m | m/b | 24.8–25.7 |
For the G313W variant, two different orientation of the indole ring were considered. In one, the Trp side chain at position 313 points towards the active site and in the other it points toward the lower domain interior. However, since the orientation toward the lower domain interior turned out to be unfavourable, in this work we consider only results obtained with Trp313 pointing towards the interdomain cleft active site. According to the results of MD simulations, the point mutations at positions 313 and 510, G313A, G313W, R510K, R510Q, and R510W do not influence the metal ion coordination (Table 3). The simulations showed that, like in the wt protein,25 the Zn2+ ion is mostly hexacoordinated in the simulated mutants, as well. In the ligand free variants the metal dication was coordinated by two histidines, H450 and H455, two glutamates, E451 and E508, and one or two water molecules, depending on the position of E451 and E508 carboxyl groups, during the simulations. The water molecules coordinating Zn2+ often exchanged with bulk water during the simulations and they are not shown in Fig. 1. In the case of two simulation (variants R510K and R510Q) His455 reoriented, and during about 4.5% of the total simulation time have not coordinated the zinc ion. However, during the simulations of the other two replicas of the same variants the zinc coordination has been preserved during entire simulation time. Apparently, the zinc ion coordination is mostly preserved in all simulations and rarely occurring reorientation of H455 can be attributed to the stochastic nature of the simulations.
MD simulations revealed that, although Gly313 does not interact with the substrate, it dictates the position of Glu316.
Indeed, Trp at position 313 distorts the β6 sheet conformation, changes the Glu316 position and prevents proper substrate positioning in the active site (Fig. 2 and S2†). The kinetic parameters of G313W mutant could not be determined due to its low activity and KM of the more conservative mutant G313A was significantly increased compared to the wt (Table 2). This supports the finding that the mutation in G313 in general does not prevent the substrate binding, but disturbs its positioning in the active site in the catalytically productive orientation.
In order to get an insight into the overall protein flexibility, and to find out how the point mutations influence this flexibility, we computed the average Root Mean Square Fluctuations (RMSF) for the wild type protein and its mutants (Fig. 3). The results show that the lower (satellite) domain, especially its N and C terminal segments (residues 4 to 300 and 680 to 726, respectively) are the most flexible parts of the enzyme, followed by the substrate binding β-sheet region, consisting of two β-strands, β9 and β10. Analysis of the RMSF clearly indicated that mutations, either at position 313 or in position 510, increase protein flexibility, especially of the lower domain β-core (β-sheets β2–β3 and β9–β10). The R510K flexibility closely resembles that of the wild type protein.
![]() | ||
| Fig. 3 Root Mean Square Fluctuations (RMSF) profiles of the human DPP III variants obtained by averaging over two 100 ns long simulations (see Table S1†). | ||
Apparently, increased flexibility of the lower domain β-core, especially of β-sheets β2–β3 and β9–β10, is negatively correlated with the enzyme activity. The mutants with the highest fluctuations determined during MD simulations were also showing the lowest activity. On the other hand, both the activity and flexibility of R510K are very similar to those of the wild type protein. Further on, the RMSF profile revealed that substitution of Arg510 by Trp significantly increases fluctuations of the H450ELLGH455 motif and its surrounding (Fig. 3 and S3†).
Consequently, the interactions between the amino acid residues from this region with the rest of the protein are weaker in mutants than in the wt h.DPP III. In the wt protein, Arg510 interacts electrostatically with carbonyl groups of Gly454, Leu453 and His455 (Fig. 4). The Arg510 to Trp and Gln mutations resulted with decrease of these interactions and as a consequence flexibility of this motif increased in accord with the RMSF results. The highest fluctuation of the H450ELLGH455 motif has been found in the R510W mutant, followed by R510Q, while the fluctuations in the R510K mutant and the wild type protein are similar (Fig. S3†). It seems that destabilization of the H450ELLGH455 motif does not disturb the Zn2+ coordination in the R510W and R510Q mutants but it significantly influences the enzymatic activity. This is not surprising if we know that Glu451 directly participates in the peptide bond hydrolysis, acting as a general base during the deprotonation of water molecule that attacks the scissile amide bond.16 The increased fluctuations of this motif in the Arg510 mutants has been observed in both open and closed protein forms. Apparently, the H450ELLGH455 instability can be accounted for the decreased peptidase activity of h.DPP III mutants.
![]() | ||
| Fig. 4 Position of highly conserved residue R510 (pink) in the structure of h.DPP3 (PDB 3FVY) interacting with Gly454 (yellow, distance in Å) from the conserved H450ELLGH455 motif (shown as sticks) responsible for coordinating Zn2+ (grey sphere). | ||
Additionally, the hydrogen bond between residue 510 and Ser457, present in the wild type enzyme, is significantly less stable in Arg510 mutants, while the Arg510–Gln626 hydrogen bond is preserved in the R510Q mutant, but not in R510W and R510K mutants (Table 4). Further on, the hydrogen bond between fully conserved Tyr318 and Glu327 is preserved in the R510Q but not in R510K and R510W mutants.
| Residues | % H-bond | |||||
|---|---|---|---|---|---|---|
| Variant | ||||||
| wt | G313W | G313A | R510W | R510Q | R510K | |
| Y318–E327 | 66 | 87 | 52 | 31 | 78 | 34 |
| H450–H455 | 22 | 22 | 19 | 17 | 19 | 14 |
| E451–N406 | 59 | 41 | 11 | 49 | 52 | 55 |
| H455–S457 | 2 | 3 | 3 | 2 | 3 | 16 |
| H455–E507 | 84 | 89 | 86 | 63 | 77 | 73 |
| E508–S504 | 8 | 11 | 6 | 5 | 4 | 8 |
| E512–H568 | 68 | 74 | 32 | 73 | 70 | 61 |
| X510–G454 | 78 | 80 | 74 | 35 | 12 | 13 |
| X510–S457 | 62 | 50 | 72 | 6 | 1 | 4 |
| X510–Q626 | 73 | 71 | 79 | 2 | 78 | 7 |
The hydrogen bond analysis revealed that, similarly to the ligand free protein, interaction between residues at position 510 and Gln626 is completely abolished in the R510W–Arg2-2NA mutant complex, while the hydrogen bond between Arg510 and Gly454 has not been detected during the simulation of the R510Q complex (Table 5). Oppositely, the Tyr318 stabilization by Asp496 and His568 is the most stable in the complex with the G313A and the R510Q mutant, respectively (Fig. S7†) as confirmed by the hydrogen bond analysis (Table 5). We have showed that the 100% conserved Tyr318 is important for substrate stabilization in the active site.16 Its reorientation into direction of either Asp496 or His568 might be one of the reasons for the enzymatic inactivation of the R510Q mutant, also. Analysis of the substrate–protein hydrogen bonds revealed that in general there are no significant differences among the complexes (Fig. 6, S4 and S5†). However, the stability of the hydrogen bond between the substrate and highly conserved residue G385 as well as the catalytic histidine H568 is the most stable in the complex with wt enzyme (Table 5 and Fig. 6 and S6†). It has been shown that His568 plays a significant role in the transition state stabilisation and the enzymatic reaction. By forming a stable hydrogen bond with the substrate carbonyl oxygen, His568 is dragging the negative charge away from the peptide C atom and in this way facilitates nucleophile attack of the activated water.16 So the lack of this hydrogen bond in the complexes with mutated protein may explain the enzymatic inactivity of these variants.
| H-bonds | WT-RRNA | R510W-RRNA | R510Q-RRNA | G313W-RRNA | G313A-RRNA |
|---|---|---|---|---|---|
| Y318–D496 | <1 | <1 | 40 | 3 | 97 |
| Y318–E508 | 76 | 44 | 1 | 85 | <1 |
| Y318–H568 | <1 | <1 | 21 | <1 | 1 |
| X510–G454 | 57 | 34 | <1 | 74 | 41 |
| X510–Q626 | 68 | <1 | 71 | 76 | 63 |
| E512–H450 | 88 | 73 | 71 | 80 | 90 |
| E316–RRNA | 100 | 99 | 100 | 100 | 56 |
| E329–RRNA | 21/63 | 87 | 100 | 100 | 95 |
| G385–RRNA | 49 | <1 | 5 | <1 | <1 |
| G389–RRNA | 54 | 77 | 74 | 55 | <1 |
| N391–RRNA | 51 | 100 | 88 | 100 | 77 |
| N394–RRNA | 89 | 64 | 90 | 88 | 83 |
| D396–RRNA | 50 | 23 | 45 | 62 | 84 |
| D496–RRNA | 41 | 85 | 95 | 100 | 96 |
| H568–RRNA | 47 | 7 | 8 | <1 | <1 |
Apparently, the disrupted hydrogen bond between Arg2-2NA and catalytic His568 in combination with the increased active site fluctuations, especially of the H450ELLGH455 motif provide possible explanation for inactivity of the G313W and R510W mutants.
This is the first report on negative correlation of DPP III activity with increased flexibility of the lower domain β-core.
Protein concentrations were determined using the protein-dye binding assay, with bovine serum albumin as a standard.24
The enzyme variants, either ligand free or its complexes with the synthetic substrate Arg2-2NA, were neutralized by adding Na+ ions and solvated in an octahedron box filled with TIP3P28 water molecules ensuring an at least 11 Å thick water molecules buffer around the protein. The systems were parametrized by the AMBERTools14 modules antechamber and tleap, using GAFF29 and ff14SB30 force fields to parameterize the substrate and the protein, respectively. For the zinc cation, Zn2+, parameters were derived from our previous work12 and modified according to the PDB survey.31,32
000 (ligand free) and ∼64
000 (complexes) atoms (∼27
500 and ∼17
500 molecules of water, respectively), were simulated using periodic boundary conditions. The electrostatic interactions were calculated using the particle-mesh Ewald method.33 The PMEMD module running at GPU was used to conduct the molecular dynamics (MD) simulation. Before MD simulations, three rounds of geometry optimization, with different degree of the protein restraints, were performed by the module sander. In the first cycle 1500 optimization steps were performed, where the first 450 steps were of the steepest descent method, and the rest was the conjugate gradient. Both, the protein atoms and the metal ion, were constrained using a harmonic potential of 32 kcal (mol Å2)−1, in order to equilibrate water molecules. In the second cycle 2500 steps of optimization was done. The metal ion and the protein backbone were constrained with 12 kcal (mol Å2)−1. Finally, in the third cycle the same number of optimization steps was used as in the first cycle, and the protein backbone and the metal ion were constrained with 1 kcal (mol Å2)−1 and 12 kcal (mol Å2)−1, respectively. The minimized system was heated from 0 to 300 K during 30 ps of MD simulation using a canonical ensemble (NVT), followed by equilibration stage of 30 and 100 ps (for the ligand free enzyme and complexes), during which the water density was adjusted. The equilibrated system was then subjected to 100 ns of the productive MD simulations, 50 ns of conventional MD (cMD) followed by 50 ns of the dual boost accelerated MD (aMD) simulations. One boost was applied to torsional terms and the other to total potential. The latter facilitates diffusion of the explicit solvent molecules around the biomolecule and thus reduces hindrance that water induce on the large amplitude displacements of a solute. In this way more efficient conformational sampling is enabled.Type and duration of the performed simulations is given in Table S1.†34 The constant temperature (300 K) and pressure (1 atm) were ensured using Langevin dynamics with a collision frequency of 1 ps−1, and Berendsen barostat, respectively.35,36 The time step during the periods of heating and cooling was 1 fs, and for the rest of the simulation 2 fs (to restrain the motion of hydrogens the SHAKE algorithm was used).
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/c7ra02642k |
| This journal is © The Royal Society of Chemistry 2017 |