A. Marcel
Willemsen‡
a,
Diana M.
Hendrickx‡
bc,
Huub C. J.
Hoefsloot
*bc,
Margriet M. W. B.
Hendriks
d,
S. Aljoscha
Wahl
e,
Bas
Teusink
f,
Age K.
Smilde
bc and
Antoine H. C.
van Kampen
ab
aBioinformatics Laboratory, Department of Clinical Epidemiology, Biostatistics and Bioinformatics, Academical Medical Centre, Amsterdam, The Netherlands
bBiosystems Data Analysis, Swammerdam Institute for Life Sciences, University of Amsterdam, The Netherlands. E-mail: H.C.J.Hoefsloot@uva.nl
cNetherlands Metabolomics Centre, Leiden, The Netherlands
dDSM Biotechnology Center, Delft, The Netherlands
eKluyver Centre for Genomics of Industrial Fermentation, Biotechnology Department, Delft University of Technology, The Netherlands
fSystems Bioinformatics, Centre for Integrative Bioinformatics, Free University of Amsterdam, The Netherlands
First published on 10th October 2014
Understanding cellular adaptation to environmental changes is one of the major challenges in systems biology. To understand how cellular systems react towards perturbations of their steady state, the metabolic dynamics have to be described. Dynamic properties can be studied with kinetic models but development of such models is hampered by limited in vivo information, especially kinetic parameters. Therefore, there is a need for mathematical frameworks that use a minimal amount of kinetic information. One of these frameworks is dynamic flux balance analysis (DFBA), a method based on the assumption that cellular metabolism has evolved towards optimal changes to perturbations. However, DFBA has some limitations. It is less suitable for larger systems because of the high number of parameters to estimate and the computational complexity. In this paper, we propose MetDFBA, a modification of DFBA, that incorporates measured time series of both intracellular and extracellular metabolite concentrations, in order to reduce both the number of parameters to estimate and the computational complexity. MetDFBA can be used to estimate dynamic flux profiles and, in addition, test hypotheses about metabolic regulation. In a first case study, we demonstrate the validity of our method by comparing our results to flux estimations based on dynamic 13C MFA measurements, which we considered as experimental reference. For these estimations time-resolved metabolomics data from a feast-famine experiment with Penicillium chrysogenum was used. In a second case study, we used time-resolved metabolomics data from glucose pulse experiments during aerobic growth of Saccharomyces cerevisiae to test various metabolic objectives.
One manifestation of adaptation on the molecular level is the alteration of fluxes in metabolic networks. This occurs when the availability of substrates changes, which results in changes in fluxes in the affected metabolic pathways.1,10 The (change in) flux can provide important information about cellular physiology, product yield, and response mechanisms to perturbations. Unfortunately, intracellular fluxes cannot be measured directly but have to be estimated from concentration measurements.11,12 Various experimental and associated computational methods have been developed to estimate dynamic fluxes through metabolic networks including kinetic models,1313C-metabolic flux analysis (MFA),14–16 and dynamic flux balance analysis (DFBA).17,18 The development of kinetic models is hampered by limited availability and accuracy of information about condition-specific (in vivo) kinetic parameters,1913C MFA is expensive and tracer availability is limited.20
Flux balance analysis (FBA) is a constraint-based modelling approach to determine fluxes in steady state. Mass balance constraints imposed by the stoichiometry of the metabolic network define an under-determined system of linear equations because the network contains more reactions than metabolites. To reduce the solution space, additional constraints on the upper and lower bounds of the fluxes, also called capacity constraints, are imposed. The linear system is solved as an optimization problem by determining the fluxes that minimize a pre-specified mathematically described objective function.21 This objective function defines the phenotype (e.g., maximum growth yield or energetic efficiency) in the form of a biological objective such as biomass production. The objective function quantifies the relative contribution of each reaction to the phenotype.17,22 It has been proposed in the literature that there are conditions for which the optimality principles cannot be described by a single objective, but by a combination of multiple objectives.23
DFBA, developed by Mahadevan and co-workers,18 is a non-steady state extension of FBA that accounts for dynamic changes in the cellular metabolism as introduced, for example, after a perturbation. Although FBA could be used for a perturbed system to determine fluxes for the different steady state conditions, this would not describe the transient behavior after a perturbation. DFBA estimates the transient behavior of the fluxes after a perturbation by optimizing the objective function over a time interval of interest. The objective function is subjected to capacity constraints and dynamic mass balance constraints, formulated as differential equations, which specify the change in metabolite concentrations as a function of fluxes, biomass and other available (kinetic) parameters. If available, constraints that specify the maximum change rate of the fluxes at any moment in time can be added. By specifying the stoichiometry, the initial concentrations, biomass and constraints, the fluxes and metabolite concentrations as function of time are calculated for the specified time interval.
In contrast to FBA where the change in metabolite concentration is zero and only the fluxes are unknown, DFBA also needs to estimate the change (derivatives) of the metabolite concentrations as a function of time given the initial conditions. This results in a system with even more unknown parameters which is solved as a dynamic optimization problem by parameterizing the dynamic equations through the use of orthogonal collocation on finite elements.18 This makes the approach less suitable for larger metabolic networks or for longer-term extracellular dynamics.
Classical DFBA is used to estimate dynamic flux profiles when it is known which objective drives an organism under changing conditions, for example a diauxic shift.18 Due to the large number of parameters to estimate, it is only used for small systems, mostly describing the flux between extracellular metabolites. In this paper, we describe a new approach that reduces both the computational complexity and the number of parameters to estimate in DFBA. We discuss two applications of this approach, that were hampered by the computational burden of classical DFBA. A first application is the estimation of dynamic flux profiles for larger systems, describing both internal and external fluxes over time after a perturbation, given the objective driving the organism under this particular perturbation. A second application is the evaluation of objective functions driving the transient cellular behavior after perturbations.
Metabolite concentration profiles can directly be measured through, for example, chromatography (liquid or gas) combined with mass spectrometry (LC/GC-MS). From these profiles the changes in concentrations over time can be easily calculated. Hence, instead of estimating time derivatives and metabolite concentrations in DFBA, these can be derived from experimental data. Unfortunately, DFBA cannot incorporate measured concentration profiles. Therefore, we developed a new approach (MetDFBA) which directly uses derivatives calculated from the measured concentration profiles, which are substituted in the mass balance equation leading to a system of linear equations. This approach largely reduces the computational burden of DFBA. The reduced complexity and the smaller number of unknowns makes this framework especially suitable for larger systems.24 In this paper we present our method (MetDFBA) and underlying mathematical framework to determine flux changes after perturbation of a biological system. We first demonstrate the validity of our method by comparing our results to flux estimations based on dynamic 13C experiments, which we consider as a experimental reference. For these estimations time-resolved metabolomics data from a feast-famine experiment with Penicillium chrysogenum was used. We only used the metabolite concentration measurements to estimate fluxes based on a single objective function. Subsequently, we compared our flux estimations to 13C MFA estimates from the 13C mass isotopomer measurements, which shows that our method approximates the benchmark estimates. With a second case study we demonstrate that MetDFBA can be used to generate hypotheses about cellular objectives by comparing different putative objective functions.23 For this application we used time-resolved metabolomics data obtained from glucose pulse experiments during aerobic growth of Saccharomyces cerevisiae. The results are compared with known information about physiology and results from previous kinetic models to determine the most reasonable objective function.
In case of estimating dynamic fluxes given an assumed objective, the results are compared with flux estimations based on concentration measurements in time and 13C mass isotopomer measurements. In contrast to 13C MFA, our method only uses metabolite concentration measurements (and not 13C mass isotopomer measurements) to estimate the fluxes.
In case of testing hypotheses, different objective functions are implemented. For each objective function, the resulting fluxes are plotted and compared with external information from the literature to decide which hypotheses can be rejected. The different steps in the mathematical framework are described below and details can be found in ref. 25.
In DFBA, the differential equations are rewritten as linear equations using a finite collocation method.18 For larger systems DFBA takes an unreasonable amount of computational time. When metabolite concentration measurements are available, the time derivatives dC/dt in the mass balances can be substituted by time derivatives calculated from the data. In this way the differential equation constraints become linear constraints. This makes MetDFBA a computational much simpler problem than DFBA. Consider
minimize or maximize F |
subject to |
A·vt = b |
vmin ≤ vt ≤ vmax |
In DFBA with the Dynamic Optimization Approach (DOA),18 an objective function has the form where t0 and tf are the start (steady state) and the end time of the experiment respectively. The integral is approximated with the finite collocation method. In MetDFBA, we write the integral as a sum of integrals over intervals between subsequent time points
. We apply the trapezoidal rule on each integral, which means that we approximate the integral (area under the curve) of f(t) by the area of a trapezoid in order to reduce computational complexity.
We compared our estimated fluxes from MetDFBA to fluxes based on isotopomer measurements from de Jonge et al.3 to validate our method, using an assumed objective: the minimization of the sum of squared fluxes. We believe that minimizing the cells total enzyme usage is a reasonable objective during constantly changing environmental conditions (e.g. intermittent feeding). We focused on the upper glycolysis, the pentose phosphate pathway (PPP) and storage metabolism. See Tables S1 and S2 (ESI†) for an overview of the metabolic network and used abbreviations. Intracellular and extracellular metabolite concentrations were measured over time during one feast-famine cycle. Time-series of 17 metabolites, participating in 22 reactions were used to apply our method. The dataset is described in more detail in the ESI,† Section S1. Since xylitol 5-phosphate was not measured we lumped four reactions. We constrained the optimization problem by assuming that most reactions are irreversible (Table S2, ESI†). The feed rate was set experimentally. We aim to test whether MetDFBA, with only concentration measurements in time as input, is capable of estimating dynamic fluxes.
![]() | ||
Fig. 3 Estimations of the average fluxes through the considered metabolic network in nmol gDW−1 s−1 including upper glycolysis (r1_2:r1_5), pentose phosphate pathway (r2_1:r2_7) and storage metabolism (trehalose, glycogen and mannitol) in Penicillium. See caption Fig. 2 for a description of the legend. |
A steady state flux distribution required for MetDFBA with minimal glucose uptake as objective, was obtained from FBA applied on a genome scale model of S. Cerevisiae.42 When glucose is limited it is realistic to suppose that the glucose uptake is minimal, which was used as objective to obtain the optimal steady state flux distribution. The lower bound of the growth rate was set equal to the dilution rate. The lower and upper bounds of the remaining fluxes were taken from the genome scale model. After lumping and removing reactions that did not occur under the given experimental conditions, the model consisted of 33 metabolites and 62 reactions. The reaction scheme and a detailed list of the reactions are given in Fig. S11 and Table S5 (ESI†).
We applied MetDFBA in conjunction with seven objective functions to establish the most likely response of the organism. Results from MetDFBA were evaluated against the following five criteria:
1. The solution space of optimal flux estimates should be significantly reduced (compared to the solution space defined by the capacity constraints);
2. The directionality of the metabolic reactions should be in agreement with literature;43
3. After a glucose pulse, S. cerevisiae switches from respiratory to respiro-fermentative conditions.44 The activity of the TCA cycle, crucial for ATP production under respiratory conditions, becomes low when switching to fermentation;45
4. The fluxes for the upper glycolysis (G6P → F6P → FBP → GAP + DHAP) are lower than for the lower glycolysis (GAP → 3PG → PEP → PYR);45
5. The qualitative behavior of the fluxes corresponds with the kinetic model simulations of Vaseghi et al. of the pentose phosphate pathway (PPP) in which the reactions rates show a fast increase immediately after the pulse, followed by a very slow decrease.46
Objective functions are rejected if the flux profiles do not fit cell physiology.
Table S8 (ESI†) shows the results of the evaluation of the five conditions for each of the seven optimization problems shown in Table S7 (ESI†). None of the objectives satisfied all the five criteria mentioned in the previous section.
Previous studies showed that objective functions compete against each other.27,47,50 This means that one objective function can only be improved if another is worsened. Optimal solutions for competing objectives are called Pareto optimal.47 The set of Pareto optimal solutions is called the Pareto front.51 A method to calculate Pareto optima is optimizing a weighted sum of the objectives.52 The following multi-objective optimization problem is solved:
minimize F = F1 − w·F2 |
subject to |
A·vt = b |
vmin ≤ vt ≤ vmax |
In Table S10 (ESI†), the results of checking the five criteria for the solutions of the multi-objective optimization problems (Table S9, ESI†) are shown. The MetDFBA fluxes satisfy all five criteria if MOMA is combined with maximization of ethanol production, and if the weight (w) is in the range of 104–106 (the orders of F1, F2 and w·F2 are 107, 105 and 109–1011 respectively; See Fig. 4 and Fig. S13–S15, ESI†). For weights lower than 104, the solution is in contradiction with criterion 5. This implies that maximizing ethanol production (F2) is more important than MOMA (F1). However, only maximizing ethanol did not lead to results satisfying the five evaluation criteria (see Table S7, ESI†). There is no significant difference among the optimal solutions for w = 104, 105 and 106. However, the solution space becomes larger when increasing w.
![]() | ||
Fig. 4 S. cerevisiae study. Results of combining maximize ethanol yield with MOMA. (a) Average fluxes through glycolysis (in mmol gDW−1 h−1) during the first two minutes after the glucose pulse. The flux through lower glycolysis is higher than the flux through upper glycolysis. (b) The fluxes through the TCA cycle remain low compared with the fluxes through the rest of the network (figures a and c). (c) The flux from G6P → 6PG shows a large increase after the pulse, followed by a slow decrease. Similar profiles for the other PPP fluxes are shown in Fig. S16 (ESI†). Legend: optimum = optimal solution found by MetDFBA for the given weight; min and max are lower and upper bounds found by dynamic FVA for the given weight. |
In the Penicillium study, it was observed, that a series of intracellular fluxes was estimated comparable to the more complex 13C tracer method, except those of the pentose phosphate pathway (PPP). The flux through the PPP was overestimated significantly. This overestimation indicated a property of the applied objective function: the minimal total enzyme usage is obtained by equally distributing the flux at split nodes. Therefore we decided to constrain the entrance reaction of the PPP according to the values obtained in the reference study which indeed improved the results. The choice to fix this flux in particular, was based on the comparison between results and experimental reference, which of course could not be used in situations where this method is applied to predict unknown fluxes. However the split ratios could be further constraint by expert knowledge like kinetic parameters from the literature to calculate particular fluxes. The fluxes should be selected based on the network topology, especially highly connected nodes will require additional constraints.
Additionally we demonstrated how the MetDFBA framework could be used to elucidate the biological objective of a yeast cell through the comparison of different objective functions that represent phenotypes.
For Saccharomyces cerevisiae we showed that minimization of the overall flux and MOMA resulted in a significantly reduced solution space for the fluxes, as determined by dynamic FVA. These results can be explained by the fact that both these objectives constrain all fluxes while the other objectives only constrain flux(es) corresponding to a specific metabolite. The MOMA solution for glycolysis and the TCA cycle was in accordance with the physiological information in the literature, but the qualitative behaviour of the estimated fluxes for the pentose phosphate pathway did not correspond to the model simulations of Vaseghi.46 However, when combining MOMA with maximization of the ethanol yield, the five evaluation criteria (see Section 3.2.) were fulfilled. These results show that maximizing ethanol yield is very important compared to MOMA, but only maximizing ethanol did not lead to results satisfying all five evaluation criteria (see Table S7, ESI†). The results of this case study show that optimizing the yield or uptake of a single compound was insufficient to significantly reduce the solution space of reaction rate profiles determined by the capacity constraints. A reduced solution space could only be obtained from at least two objectives or from one objective that is a function of all the fluxes. From this case study it can be concluded that both the objectives MOMA and ethanol production are important. In biological terms this means that yeast converts the glucose to ethanol with as little as possible adaptations to the steady state fluxes.
MetDFBA requires the measurement of (all) extracellular and intracellular metabolite concentrations in response to perturbations. Because of the low intracellular amounts and high flux, methods for rapid sampling and quenching are required. With current, quantitative metabolomics approaches using MS technologies about 100–150 metabolites, especially of central carbon metabolism can be obtained. Computationally, MetDFBA is a linear programming problem with can easily scale to genome scale models. For dynamic 13C flux identification, the microorganism is supplied with labeled substrate. Next to the metabolite concentrations, the labeling enrichment of the metabolites is measured. This results in the double amount of samples and mass isotopomer measurements. Generally, if the concentration can be measured, also the mass isotopomer enrichments can be measured. Computationally, flux identification using 13C metabolic flux analysis requires to balance metabolites as well as labeling states. In contrast to metabolite balances, the labeling balances lead to non-linear differential equation systems that require advanced computation, even at metabolic steady-state.58,59 The parameter identification, and especially obtaining the global optimum are challenging already for medium sized networks (order of 100 metabolites). In general dynamic 13C metabolic flux analysis is experimentally and computationally more intensive than MetDFBA.
We presented the incorporation of time-resolved metabolomics data into DFBA. Considering the upcoming availability of high-throughput expression data of the last decade, it is probably a logical next step to further develop MetDFBA to make it also suitable for the integration of expression data. Several methods that combined expression data with FBA were recently published.60–62 They all have in common that they try to further reduce the flux distribution solution space. Some use an arbitrary threshold above which corresponding reactions are assumed to be inactive and excluded. For MetDFBA the ones that constrain the maximum possible flux through the reaction, according to the expression levels of the corresponding enzymes, are probably most suitable because MetDFBA uses a predefined network. More tight constraints will probably result in more accurate solutions and with a smaller solution space, a smaller (more lumped) network might be satisfactory and thus fewer metabolite measurements might be needed. Another potentially interesting approach to further reduce the solution space is the addition of extra energy balance constraints, as described by Beard et al.,63 which ensures that all estimated fluxes are thermodynamically feasible. Finally, when available, kinetic descriptions about metabolic regulation (e.g. allosteric activation or inhibition) can be easily incorporated into the constraints in a similar way as described by Chowdhury et al.64 for classical FBA.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c4mb00510d |
‡ These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2015 |