Aoife E.
McNamara‡
ab,
Xiaofei
Yin‡
ab,
Cassandra
Collins
ab and
Lorraine
Brennan
*ab
aUCD School of Agriculture and Food Science, Institute of Food and Health, University College Dublin, Belfield, Dublin 4, Ireland. E-mail: lorraine.brennan@ucd.ie; Tel: +353 1 7166815
bUCD Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Dublin, Ireland
First published on 29th August 2023
It is well-established that consumption of cruciferous and brassica vegetables has a correlation with reduced rates of many negative health outcomes. There is an increased interest in identifying food intake biomarkers to address limitations related to self-reported dietary assessment. The study aims to identify biomarkers of broccoli intake using metabolomic approaches, examine the dose–response relationship, and predict the intake by multimarker panel. Eighteen volunteers consumed cooked broccoli in A-Diet Discovery study and fasting and postprandial urine samples were collected at 2, 4 and 24 hours. Subsequently the A-Diet Dose–response study was performed where volunteers consumed different portions of broccoli (49, 101 or 153 g) and urine samples were collected at the end of each intervention week. Urine samples were analysed by 1H-NMR and LC-MS. Multivariate data analysis and one-way ANOVA were performed to identify discriminating biomarkers. A panel of putative biomarkers was examined for its ability to predict intake through a multiMarker model. Multivariate analysis revealed discriminatory spectral regions between fasting and fed metabolic profiles. Subsequent time-series plots revealed multiple features increased in concentration following the consumption. Urinary S-methyl cysteine sulfoxide (SMCSO) increased as broccoli intake increased (0.17–0.24 μM per mOSM per kg, p < 0.001). Similarly from LC-MS data genipin, dihydro-β-tubaic acid and sinapic acid increased with increasing portions of intake. A panel of 8 features displayed good ability to predict intake from biomarker data only. In conclusion, urinary SMCSO and several LC-MS features appeared as potentially promising biomarkers of broccoli consumption and demonstrated dose–response relationship. Future work should focus on validating these compounds as food intake biomarkers.
Traditional commonly used dietary assessment techniques are self-reported methods which are subject to well documented limitations such as inaccurate portion size estimation, recall bias and misreporting.5–8 These limitations highlight the need to develop more accurate and objective dietary assessment measures, for example, the identification of food intake biomarkers. Food intake biomarkers are single metabolites, or a combination of metabolites, reflecting the consumption of either a specific food or food group, displaying a clear time- and dose–response after intake.9
Metabolomics, the study of small metabolites, plays a key role in the discovery of food intake biomarkers. Multiple studies examining cruciferous vegetable intake assessment are present in literature.10–15 The most promising biomarkers of cruciferous vegetables discovered to date are sulphur-containing compounds such as: sulforaphane,10,11 SMCSO (S-methylcysteine sulfoxide)12 and other compounds from the isothiocyanate class.10,11,15 Sulforaphane, an isocyanate, is abundantly present in cruciferous vegetables,16 and sulforaphane N-acetylcysteine and sulforaphane N-cysteine were identified by Andersen et al., (2013)11 as urinary markers of brassica containing meals in a cross-over meal study representative of the new Nordic diet. Upon further analysis, sulforaphane N-acetylcysteine was discovered to be a strong brassica intake biomarker (sensitivity and specificity >80%). SMCSO and 3 of its metabolites were identified as urinary biomarkers of cruciferous intake following a high cruciferous intake dietary intervention (500 g d−1 of broccoli and Brussels sprouts for 14 days).12 A range of acetyl-cysteine derivatives of isothiocyanates have been identified as biomarkers of brassica-containing diets including N-Acetyl-(N′-benzylthiocarbamoyl)-cysteine, Iberin N-acetyl cysteine,10,11 and iberin.15 However there has been no studies attempting to validate the compounds as broccoli biomarkers or to assess their ability to predict intake. To this end the objectives of the present study were as follows: (1) to use a metabolomics approach to identify and confirm biomarkers related to broccoli consumption; (2) to examine the dose–response relationship and (3) to assess the ability of a panel of potential biomarkers for predicting food intake.
Fasting first void urine was collected after an overnight 12 h fast at the end of each test week after participants had consumed the test meal for 4 days and chilled until processing. All urine samples were centrifuged at 1800g for 10 min at 4 °C. Aliquots of 1 mL were stored at −80 °C for later analysis.
Urine samples were analysed by LC-MS as previously described.18 A mixture of five isotope labelled compounds (malic acid d3, methionine d3, myristic acid 13C, adipic acid d4 and succinic acid d4, each at 10 μg mL−1 in 20% (v/v) EtOH/Millipore H2O) as internal standards were used. Test urine and pooled urine samples (used as Quality Control (QC) samples) were thawed on a roller for 30 min and centrifuged at 1800g for 5 min at 4 °C. Urine samples (100 μL) were mixed with 100 μL internal standards, and then vortexed at 35 Hz for 10 s and centrifuged at 2000g for 2 min. The supernatant was subsequently transferred to LC-MS vials with 250 μL inserts and placed into LC autosampler at 4 °C.
An Agilent LC-QTOF-MS, consisting of a 1290 Infinity II LC system and an Agilent Jetstream Electrospray ionization (ESI) source coupled to a 6545 QTOF mass spectrometer were applied to perform urine metabolomic analysis. The chromatography was performed in reverse phase mode with a Zorbax eclipse plus C18 column (2.1 × 50 mm, 1.8 μm) coupled to a Zorbax eclipse plus C18 guard column (2.1 × 5 mm, 1.8 μm). The column temperature was set as 30 °C, and injection volume was 5 μL. The mobile phase A and B were water and 80% of acetonitrile, respectively, both with 0.1% formic acid. The flow rate was 0.4 mL min−1, and the gradient conditions were set as follows: 1% B (0–1.5 min), 11% B (1.5–9 min), 25% B (9–15 min), 50% B (15–18 min), 99% B (18–18.05 min), 99% B (18.05–21 min), 1% B (21–21.05 min) and 1% B (21.05–23 min). The MS parameters used for the analysis: drying gas temperature, 325 °C; drying gas flow rate, 10 L min−1; sheath gas temperature, 350 °C; sheath gas flow rate, 11 L min−1; nebulizer pressure, 45 gauge pressure (pounds per square inch); capillary voltage, 3500 V; nozzle voltage, 1000 V; fragment or voltage, 100 V; skimmer, 45 V. The mass range of mass-to-charge-ratio (m/z) was between 50 and 1600, and the 2 GHz extended dynamic range mode was used. Application of centroid mode at scan rate of 1 spectra per s was used to collect data. Samples were run by using both positive and negative ionization modes.
Data were acquired using MassHunter acquisition B.08.00 software (B.08.00.8058.3 Sp1 Agilent Technologies) and were further processed in MassHunter qualitative analysis software (B.07.00 Sp2 Agilent Technologies). Molecular features were extracted using the molecular feature extractor (MFE) algorithm, and a list of features were generated with retention time, m/z, adducts or isotopes of compounds, signal intensity and accurate mass. The target data type was set to small molecules and the threshold value for peak height was set to 5000 counts. The isotopic peak spacing tolerance was 0.0025 m/z with a maximum charge state of 1 and isotope model was set to common organic molecules. The Compound Exchange Format (.cef) files, transformed from the original spectral data, were generated and imported into MassProfiler (Version B.07.01; build 99.0 Agilent Technologies) in order to align compounds and filter data. The alignment parameters were set with a retention time tolerance ±0.3 min and a mass tolerance of ±15 ppm + 2.0 mDa. The final dataset was exported including molecule features with retention time, mass and feature abundance. For features with missing values, they were replaced with half the minimum abundance value for that particular feature. Feature abundances were normalised to osmolality before performing multivariate data analysis.
The interesting features were initially identified in MassHunter ID Browser (B 7.0.799.2 Agilent Technologies) by putative formula generation and an accurate mass data search against a local METLIN (METabolite LINk, https://metlin.scripps.edu) database. Tandem MS (MS/MS) with 3 collision energies 10 eV, 20 eV and 40 eV under the same chromatographic conditions were further performed to attain structural confidence for the metabolites identified by the accurate mass. MS/MS compound identification efforts included fragmentation modelling using CFM ID19 and fragment matches with the Human Metabolome Database (HMDB).20 Fragment data generated from MS/MS was also input into METLIN MS/MS Spectrum Match Search (precursor m/z and top 30 most intense fragments m/z) in order to confirm identification. Furthermore, some authentic standards were analysed by the same LC-MS/MS method to confirm the identification. All metabolites were attributed to one of four levels of confidence in metabolite identification according to Metabolomics standards initiative: Level I for identified compounds; Level II for putatively annotated compounds; Level III for putatively characterized compound classes; Level IV for unknowns.21
Using the multiMarker web application a panel of putative biomarkers was examined for its ability to predict intake.22 The multiMarker approach is a factor analytic model that expresses the relationship between the biomarkers and the latent intake. The model is built within a Bayesian framework which allows estimation of uncertainties in the predicted intake. The training set was established from randomly selecting 90% of the data to build the model and the rest of data were used to predict intake using biomarker data. This was repeated in an iterative fashion to investigate the agreement between the predicted and actual intake.
Characteristics | Discovery study (N = 18) | Dose–response study (N = 32) |
---|---|---|
Data are mean ± SD. N, number of subjects; M, male; F, female; y, years; SD, standard deviation; BMI, body mass index; W![]() ![]() |
||
Gender | 8M 10F | 16M 16F |
Age (y) | 34 ± 12 | 29 ± 10 |
Anthropometrics | ||
BMI (kg m−2) | 23.84 ± 2.98 | 23.96 ± 2.99 |
W![]() ![]() |
0.77 ± 0.07 | 0.84 ± 0.06 |
After conducting MEBA for Time Series on 1H-NMR data, the top 30 spectral regions were selected and examined against the control food at 0, 2, and 4 h time points. The analysis revealed multiple spectral regions which exhibited an increased peak intensity with time following broccoli consumption only (ESI Fig. 2†) and these spectral regions were listed in Table 2 with their potential identification.
Spectral region (ppm)a | Hotelling-T2 | Potential identification |
---|---|---|
a A selection of top-ranking regions from MEBA timecourse analysis are presented. These regions represent peaks and one metabolite can have multiple peaks depending on the chemical structure of the metabolite. ppm, parts per million; SMCSO, s-methyl cysteine sulfoxide. | ||
2.7675 | 39.066 | Levulinate |
2.7685 | 30.431 | Levulinate |
1.3455 | 27.932 | 2-Hydroxyisobutyrate |
2.7925 | 27.835 | Unknown 1 |
2.7935 | 27.657 | Unknown 1 |
2.7695 | 25.597 | Levulinate |
2.7395 | 24.35 | Sarcosine |
3.0365 | 23.95 | Creatinine |
2.8195 | 22.839 | SMCSO |
1.8925 | 22.575 | Acetate |
1.8915 | 22.125 | Acetate |
2.8205 | 21.969 | SMCSO |
A number of these spectral regions of interest were assigned to SMCSO and identification confirmed through comparison with an authentic analytical standard (ESI Fig. 3†). Other regions were identified as acetate, levulinate, 2-hydroxyisobutyrate, creatinine and sarcosine (Table 2). Profiling of urinary SMCSO revealed that there was a significant increase in the postprandial samples at 2 and 4 hours compared to fasting and 24 hours (Fig. 1A). After 24 hours, the level of SMCSO was back to the similar level as 0 hour. On the contrary, the level of SMCSO after apple intake remained the same across 4 time points.
Using the LC-MS data a total of 1741 features were obtained from urine samples in positive mode and 2086 features were obtained in negative mode. Following MEBA analysis the top 30 features were selected and examined for increases following broccoli consumption compared to control food. A total of 14 features in positive and 7 features in negative indicated an increase after the consumption and also displayed discriminating time profiles when compared to the control food. The results displayed in Fig. 1B–O and Fig. 2A–G show the increased intensity following broccoli intake, whereas the peak intensities didn't change following consumption of the control food, apple.
A molecular formula for each feature was generated by MassHunter using single MS accurate mass data for the molecular ion and its isotopes. Fourteen features of interest in positive, and seven in negative mode, were selected for LC-MS/MS for more complete metabolite identification (Table 3). Matching of MS/MS fragments between urine sample and standard confirmed the identification of Sinapic acid (ESI Fig. 4†). The identification of 3-hydroxy-sebacic acid (m/z 219.1206) was based on comparison with an authentic standard of the functional parent compound Sebacic acid. The fragmentation pattern obtained from the standard reflected key fragments from the urine sample with a spectral shift of 2 Da (m/z 97.1010 and 95.0853, 121.1012 and 119.0854, 139.1119 and 137. 0960). The difference in mass between sebacic acid (∼202 Da) and 3-hydroxysebacic acid (∼218 Da) is ∼16 Da which corresponds to the mass of the addition of a hydroxy group (–OH) and loss of a double bond (ESI Fig. 5†). Matching of MS/MS fragments between urine sample and standard confirmed the identification of Genipin: the main fragment in the standard was m/z 149.0595 however in the urine sample m/z 167.0703 was a prominent fragment, which would indicate addition of a hydroxy group (ESI Fig. 6†).
RT | Mass | Prec. m/z | Ion | MS/MS | Hotelling-T2 | Putative name | Putative formula |
---|---|---|---|---|---|---|---|
RT, retention time; Prec. m/z, precision mass to charge ratio; MS/MS, tandem mass spectrometry; putative identification from HMDB after analysis by CFM-ID. LC-MS/MS in positive mode [M + H]+ and negative mode [M − H]−. Analysis shows putative formula and results of interrogation of the METLIN and HMDB databases.a METLIN formula database.b CFM-ID.c METLIN MSMS spectrum match.d Authentic standard.e Level I identification.f Level II identification.g Level IV identification. For the molecule features without identification level indicated the formula and identification from single MS library search. | |||||||
14.2 | 236.1048 | 237.1087 | [M + H]+ | 134.0961, 133.0645, 197.1283, 84.0802, 43.0173 | 47.14 | Dihydro-beta-tubaic Acidc,f | C13H16O4 |
14.4 | 238.1215 | 239.1297 | [M + H]+ | 69.0695, 134.0961, 57.0699, 123.0438, 43.0535 | 44.826 | Unidentifieda,g | C16H14O2 |
16.4 | 226.0844 | 227.0911 | [M + H]+ | 167.0703, 121.0645, 168.0736, 122.0678, 85.0283 | 40.61 | Genipind,e | C11H14O5 |
14.9 | 305.1300 | 306.1376 | [M + H]+ | 149.096, 122.0267, 167.1059, 185.118, 85.0278 | 37.144 | Unidentifieda,g | C14H19N5OS |
16.6 | 413.2418 | 414.2519 | [M + H]+ | 414.2482, 416.3141, 416.2666, 415.2508, 223.1690 | 36.066 | Unidentifieda,g | C20H29N8O2 |
15.8 | 166.0627 | 167.0692 | [M + H]+ | 35.47 | 3-(4-Hydroxyphenyl)propionic acida | C9H10O3 | |
16.9 | 369.2152 | 370.2225 | [M + H]+ | 370.2219, 85.0282, 191.1060, 374.252, 372.2363 | 35.321 | Unidentifieda,g | C18H15N8O |
15.3 | 389.2411 | 390.2480 | [M + H]+ | 113.0611, 67.0546, 287.9454, 43.0173, 114.0670 | 34.664 | Unidentifieda,g | C19H35NO7 |
17.5 | 415.2567 | 416.2642 | [M + H]+ | 34.55 | Unidentifieda | C21 H37NO7 | |
11.2 | 218.1156 | 219.1206 | [M + H]+ | 119.0854, 137.0960, 95.0853, 109.1014, 91.0542 | 34.416 | 3-Hydroxy-sebacic acidb,d,e | C10H18O5 |
16.0 | 256.1311 | 257.1381 | [M + H]+ | 34.142 | 1-Methoxy-1-(2,4,5-trimethoxyphenyl)-2-propanola | C13H20O5 | |
18.0 | 391.1852 | 392.1917 | [M + H]+ | 33.728 | Orysastrobin | C18H25N5O5 | |
9.5 | 224.0688 | 225.0751 | [M + H]+ | 55.0537, 69.0692, 147.0444, 57.0686, 119.0487 | 33.706 | Sinapic acidc,d,e | C11H12O5 |
14.2 | 271.1427 | 272.1482 | [M + H]+ | 55.0539, 60.0806, 69.0342, 69.0688, 83.0845 | 28.647 | Unidentifieda,g | C14 H25NS2 |
11.8 | 198.0532 | 197.0455 | [M − H]− | 50.044 | Ethyl gallatea | C9 H10O5 | |
15.1 | 198.0892 | 197.0821 | [M − H]− | 41.914 | Guaifenesina | C10H14O4 | |
17.3 | 430.1481 | 429.1403 | [M − H]− | 41.559 | Unidentifieda | C19 H26O11 | |
14.6 | 270.1109 | 269.1033 | [M − H]− | 40.696 | Unidentifieda | C13 H18O6 | |
13.9 | 444.1268 | 443.1197 | [M − H]− | 34.745 | Unidentifieda | C19 H24O12 | |
14.4 | 256.1310 | 255.1238 | [M − H]− | 30.969 | 1-Methoxy-1-(2,4,5-trimethoxyphenyl)-2-propanola | C13 H20O5 | |
11.6 | 216.0998 | 215.0926 | [M − H]− | 30.047 | Unidentifieda | C10H16O5 |
Urinary SMCSO concentrations were determined following intake of low (49 g), medium (101 g) and high (153 g) portions of broccoli. The average urinary SMCSO concentrations increased as broccoli intake increased (from 0.17 to 0.24 μM per mOsm per kg) (Fig. 3A). Repeated measures ANOVA indicated that SMCSO exhibits a dose–response relationship to broccoli intake in fasting urine samples (p < 0.001).
Features which exhibited an increased and differential time-course following broccoli consumption using LC-MS analysis were also examined for a dose–response relationship. Peak areas were examined for the 14 features of interest in positive and 7 features in negative following consumption of the different broccoli portions. Genipin (m/z = 227.0911), dihydro-β-tubaic acid (m/z = 237.1087), sinapic acid (m/z = 225.0751) and 5 unidentified compounds (m/z = 239.1297, 370.2225, 257.1381, 255.1238 and 306.1376) increased in normalised peak area as portion of broccoli increased (Fig. 3B–I). All were significantly increased with the exception of one feature (m/z = 306.1376).
Observation | Actual intake (g) | Predicted intake (g) | Standard deviation | 2.5% percentile | 97.5% percentile |
---|---|---|---|---|---|
The good agreement between actual and predicted intake was highlighted in bold. | |||||
1 | 49 | 53.46 | 24.84 | 30.66 | 110.96 |
2 | 49 | 48.96 | 13.68 | 28.58 | 93.52 |
3 | 49 | 91.56 | 26.64 | 37.42 | 124.37 |
4 | 49 | 50.00 | 18.37 | 29.91 | 104.23 |
5 | 49 | 47.16 | 10.27 | 25.22 | 65.4 |
6 | 49 | 48.39 | 11.06 | 28.81 | 73.02 |
7 | 49 | 100.73 | 15.75 | 59.14 | 136.24 |
8 | 49 | 47.47 | 9.65 | 25.45 | 64.85 |
9 | 49 | 50.18 | 14.27 | 31.81 | 98.83 |
10 | 49 | 49.59 | 11.26 | 32.19 | 49.59 |
11 | 49 | 100.39 | 18.15 | 49.32 | 100.39 |
12 | 49 | 94.99 | 22.01 | 42.12 | 94.99 |
13 | 49 | 48.16 | 10.46 | 28.57 | 68.79 |
14 | 49 | 48.63 | 10.26 | 29.42 | 69.18 |
15 | 49 | 48.50 | 11.22 | 29.22 | 75.83 |
The panel of biomarkers were combined using the multiMarker model, a latent variable model, which allows prediction of food intake when only biomarker data are available. Using the data from the A-Diet study, the model was successfully applied to estimate the relationship between biomarkers and apple intake and to subsequently determine apple intake from biomarker data.21 In the current study, the biomarker panel performed excellently to identify low and medium levels of broccoli intake. For high level of intake the model had difficulties discriminating between 101 and 153 g day−1.
This study employed an untargeted metabolomic approach for the discovery of broccoli intake biomarkers identifying discriminating compounds between fasting and 4 hours postprandial metabolomic profiles. Agreeing with previous literature we identified SMCSO as a biomarker of brassica intake. SMCSO is an organosulfur non-protein amino acid found naturally in brassica.23 SMCSO, and 3 of its metabolites, were identified in the urine samples of 20 men following 14 day consumption of a high cruciferous vegetable diet (500 g day−1 of both broccoli and Brussel sprouts).12 These NMR-identified metabolites were able to differentiate urinary metabolomes after a high cruciferous diet from urinary metabolomes observed after a diet excluding cruciferous vegetables and alliums in the same participants. The present study demonstrated that urinary SMCSO was detectable after a smaller, more achievable intake of cruciferous vegetables, 135 g compared to the 500 g day−1 used by Edmands et al.,12 which supports the use of SMCSO when examining cruciferous vegetable intake after experimental and habitual intakes. Our study has added to the research area by confirming that urinary concentration of SMCSO demonstrated a dose–response relationship, increasing as broccoli intake increased.
From the LC-MS analysis multiple potential broccoli biomarkers were identified. The 3-hydroxysebacic acid is a derivative of 3-hydroxydicarboxylic. It is possible that this potential biomarker is reflective of metabolism of the plant polymers ingested in broccoli and warrant further investigation, however it is unlikely this compound is specific to broccoli intake. Another feature identified by LC-MS was genipin, a geniposide-derived aglycone present in fruit of Gardenia jasminoides. Genipin has multiple food industry applications such as a functional food ingredient, a natural blue colorant, and it is used as part of traditional Chinese medicine.24 Therefore, it is difficult to determine the exact dietary source of genipin identified in urine samples, however it did demonstrate a dose–response relationship with broccoli intake so future investigation is necessary to confirm this compound as a broccoli intake biomarker. LC-MS analysis also identified some flavonoid derivatives. Fruit and vegetables are natural sources of flavonoids.25 As flavonoid derivatives are commonly found in multiple fruit and vegetables, individually, they will not be specific broccoli biomarkers. However, the combination of these compounds into a multi-biomarker panel brings important information.
Even though the majority of previous research has focused on isothiocyanates and their association with brassica consumption, there are a number of limitations associated with verifying these organosulfur compounds as robust markers of intake.12 Glucosinolates are consumed intact but the bioavailability of the isothiocyanates is affected by food processing techniques, pH of stomach and intestines and microbial digestion.26,27 As isothiocyanates are highly reactive compounds they bind easily with macromolecules reducing bioavailability. This rapid interaction leads to a huge diversity of isothiocyanates differing structurally based on their side chain.28 This diversity can affect metabolism and lead to increased concentrations of different conjugated metabolic products excreted as well as the mercapturic acid of parent isothiocyanate. Many studies have limited their analyses to derivatives of specific mercapturates based on the structure of parent glucosinolate and isothiocyanate structures which only count for a proportion of metabolites of original isothiocyanate dose.28 Hence, there is a need to perform mechanistic studies to identify metabolites of structurally related compounds metabolised through separate pathways to get a full overview of potential cruciferous exposure markers.
A potential limitation of the research is that the discovery and dose–response studies were carried out only in the Republic of Ireland and therefore confirmation of these findings would be required in other ethnic populations. This study has multiple strengths associated: a range of different study designs were used to examine the biomarkers in different scenarios. Confirmation of a dose–response in the background of habitual diet is a noteworthy strength. To the best of our knowledge this is the first study to examine the dose–response relationship of urinary biomarkers across multiple levels of broccoli intake. Furthermore, the predictive ability of the panel of biomarkers was examined. Further research is needed into biomarkers of broccoli-intake in order to establish their qualitative or quantitative applications for higher quantities of intake. Further work is also needed to fully validate the biomarkers identified to meet the validation criteria previously outlined.29
The present study has illustrated the successful implementation of an untargeted metabolomics approach in search of dietary biomarkers of broccoli intake. Urinary SMCSO was identified as a potential intake biomarker of broccoli in NMR analysis and exhibited a dose–response relationship with increasing broccoli intake. Several LC-MS features, a number of which are flavonoid derivatives were identified and dose–response relationships revealed. For both NMR and LC-MS analysis, the panel of molecule features as putative broccoli biomarkers indicate their potential to predict the intake and pave the way forward for its application in supporting and improving objective measurement of broccoli intake. Future work should examine the use of the panel of biomarkers in a range of ethnic groups and in larger observational studies.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d2fo03988e |
‡ These authors contributed equally. |
This journal is © The Royal Society of Chemistry 2023 |