Carlos Eduardo Rodríguez-López,
Carmen Hernández-Brenes and
Rocío I. Díaz de la Garza*
Tecnologico de Monterrey, Campus Monterrey, Eugenio Garza Sada 2501, Monterrey, NL 64849, Mexico. E-mail: rociodiaz@itesm.mx; Tel: +52-818-328-4262
First published on 10th December 2015
Lauraceous acetogenins are fatty acid derivatives with an odd-carbon aliphatic chain found in avocado (Persea americana Mill.). These compounds display a wide range of bioactivities that make them candidates for use as antimicrobial agents in the food industry and as proapoptotic agents against cancer cells. Existing knowledge about their metabolism in planta is scarce. This work quantifies eight different acetogenins accumulated in fruit tissues (peel, seed, and pulp) from 22 avocado cultivars to sample the existing variation using a targeted metabolomics approach. Multivariate analyses uncovered correlations among acetogenins present in fruit tissues and their chemical backbone that allowed a proposal for classifying them into three families (Avocatins, Pahuatins and Persenins). The seed acetogenin profile differed from that of the pulp and peel, which, while different in concentration (peel accumulated low acetogenin amounts), had the same profile. Acetogenins from samples of known origin were also separated by variety using descriptive Linear Discriminant Analysis (LDA), and a chemotaxonomic model was generated via predictive LDA and tested on samples of unknown origin. This work effectively sampled acetogenin contents and profile variability in seed (1.09–8.33 mg per g FW), peel (0.22–12.5 mg per g FW), and pulp (0.49–9.58 mg per g FW) from avocado fruit, and provides a putative classification for seven avocado cultivars. Results from this work show that the eight acetogenins followed are produced in all 22 avocado cultivars, which points to conserved metabolism among avocado plants.
Among the main active components detected in avocado leaves and fruits are lauraceous acetogenins, which are fatty acid derivatives that typically contain an odd-carbon aliphatic chain (17, 19 or 21) and an acetoxy group that contributes two additional carbons.3 Acetogenin bioactivities have been studied and cover a broad range that includes antimicrobial4 and antifungal5 activity, inhibition of the production of nitric oxide and superoxide in cells,6,7 selective pro-apoptotic activity against several cancer cell lines,2,8,9 and, recently, promising activity against Acute Myeloid Leukemia (AML) cell lines.10 Their bactericidal and sporostatic capacities have specifically increased the interest of the food industry, due to their potential use as food additives.11 Their lipophilic properties, together with the increasing demand for additives of natural origin and the fact that they are already being consumed by humans at bioactive levels, make their potential uses quite promising.12
As an approach for studying acetogenin functions in planta, several works have attempted to correlate their concentrations with particular resistance traits, effectively demonstrating toxicity of an acetogenin-rich extract against late instars of Spodoptera exigua.13 Persin ((Z,Z)-1-acetoxy-2-hydroxy-12,15-heneicosadien-4-one) accumulation was followed as a response to Colletotrichum gloeosporioides infection in different avocado cultivars;14 although no correlation was found between resistant variants and Persin concentration in leaves,15 other acetogenins not considered in that work may have contributed to the trait. It has been observed that acetogenin bioactivity is highly dependent on the aliphatic chain structure, to such an extent that a change from an olefinic (1,2,4-trihydroxy heptadec-16-ene; avocadene) to an ethyne (1,2,4-trihydroxy heptadec-16-yne; avocadyne) bond enhances the pro-apoptotic effects against a human prostate adenocarcinoma cell line (PC-3) more than 7-fold.8
Despite the impact of chemical differences on the bioactivity of these moieties, prior works have mainly focused on measuring a single acetogenin, mainly Persenone A or Persin, in avocado tissues. Moreover, the majority of existing literature reports the analysis of only three avocado cultivars that include ‘Reed’ (P. americana var. guatemalensis),16 ‘Fuerte’ (P. americana var. drymifolia),16,17 and ‘Hass’ (P. americana var. guatemalensis × drymifolia).17 To the best of our knowledge, only a single work has evaluated Persin levels in avocado leaves from 21 different avocado cultivars and strains,15 and no work has been published concerning other tissues (particularly fruit) or different acetogenin derivatives.
Thus, the objective of this study was to characterize and quantify acetogenins in fruits from 22 avocado cultivars in order to sample the existing natural variation. The present work was also undertaken to assess acetogenin variations in fruit tissues (peel, seed and pulp), considering the genotypic diversity of these cultivars. A targeted metabolomics approach that involved uni- and multivariate techniques was used, along with classification algorithms, in the pursuit of establishing possible chemotaxonomic rules that distinguish the cultivars' group of origin. This is the first study that, taking advantage of a targeted metabolomics approach, focuses on the characterization of avocado fruit acetogenins.
Fig. 1 Acetogenin profile and structures in avocado: (A) typical HPLC chromatogram, each peak number corresponds to an acetogenin; and (B) chemical structures of acetogenins present in avocado fruit.11 Structures for peaks 2, 4, 5, 6, 7 and 8 were confirmed by NMR; peak 1 was tentatively assigned as AcO-avocadenyne based on comparisons of mass and fragmentation spectra with the available literature;27 and peak 3, an Unknown Putative Acetogenin (UPA), was assigned a molecular formula of C19H34O4 based on its fragmentation pattern. Since UPA may correspond to three reported acetogenins, which differ only in the position of the unsaturations, these are shown as dotted bonds.
Descriptive Linear Discriminant Analysis (dLDA), used for canonical weight analysis, was performed on PCA-normed values by means of the ade4 package in R,22 in which total variance was constrained to a value of 1 to better explain variation of the existing dataset. On the other hand, predictive Linear Discriminant Analysis (pLDA) equations, used for assignation of putative genotypes, were obtained by processing raw individual concentrations with the DiscriMiner library,23 whose algorithms limit the within variance to 1, so that external data, such as that generated from new extractions, can be properly evaluated in a dataset-independent manner. These classification algorithms were performed on concentrations of seven acetogenins in each tissue, expressed as one column per acetogenin and tissue and one row per individual fruit. Only cultivars with known genetic backgrounds (Guatemalan, Hybrids and Mexicans; with 4, 5 and 6 members respectively) were kept, resulting in a matrix of 45 rows (15 varieties, in triplicate) and 21 columns (7 acetogenin concentrations in peel, pulp, and seed). Both dLDA and pLDA were performed on individual fruits, and final variety assignations of cultivars were decided by prediction of all (unanimity) or two out of three individuals (majority) belonging to the same group; if no majority was found, the cultivar was labeled as a conflicting assignation.
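The unanimity/majority rule described above can be illustrated with a minimal sketch. The function name `assign_cultivar` and the example labels are hypothetical, and the snippet assumes that each fruit has already received a class prediction; it is not the code used in this work.

```python
from collections import Counter

def assign_cultivar(predictions):
    """Combine per-fruit class predictions into one cultivar-level call.

    Returns (label, rule), where rule is 'unanimity', 'majority' or
    'conflicting'. The label is None when no class wins a strict majority.
    """
    counts = Counter(predictions)
    label, votes = counts.most_common(1)[0]
    if votes == len(predictions):
        return label, "unanimity"
    if votes * 2 > len(predictions):
        return label, "majority"
    return None, "conflicting"

# Three fruits per cultivar, as in the triplicate design described above.
print(assign_cultivar(["Mexican", "Mexican", "Mexican"]))
print(assign_cultivar(["Mexican", "Guatemalan", "Mexican"]))
print(assign_cultivar(["Mexican", "Guatemalan", "Hybrid"]))
```

With three fruits and three possible classes, a conflicting result can only arise when each fruit is assigned to a different group.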
Avocados have been classified by fruit morphology since pre-Hispanic times, as documented by friar Bernardino de Sahagún ca. 1590, into three main cultivars: ahuacatl, with small, black fruits; quilahuacatl, with green, savory fruits; and tlacazolahuacatl, with big fruits with large seeds.24 In modern times, varieties were renamed Mexican, Guatemalan, and West-Indian, respectively; however, the botanical classification has barely changed over the centuries, with the same characteristics used to differentiate Mexican fruits (thin skin, large and detached seeds, and nutty flavor) from Guatemalan (thick or woody skin, small and attached seeds, savory pulp) and West-Indian (smooth skin, big fruits, with a pulp not as palatable as the previous two), the latter being used mainly for rootstock, and not consumption.25 With this in mind, the avocado fruits sampled seem to agree well with the morphological classification (ESI Fig. 1 and Table 1†). The samples of unknown genotype, however, are more difficult to classify by morphology, with the exception of fruits from Los Catorce (L14Ch and L14NE), which would fall in the Mexican variety given their edible, black skin and detached seed. This is also in part due to the difficulty of classifying hybrids, which has led to long-held misclassifications, such as that of the ‘Hass’ avocado, which was classified as a pure Guatemalan until recently, when molecular tools classified it as a balanced hybrid (M × G – 42%, 58%).26
Mean separation conducted on TACs from seed tissues (Fig. 2B) indicated that most cultivars fell into a large homogeneous group with a wide range of mean values (1.1–6.3 mg per g FW). Concentrations were not statistically different in that group due to the high variance encountered in seeds, with an average coefficient of variation (COV) of 44%, and COVs as high as 140%. The ‘Aries’ cultivar, although it belonged to the same large grouping, contained TACs that exceeded average seed concentrations by slightly more than 2-fold, having a TAC of 8.33 ± 0.622 mg per g FW. This contrasts with the group of statistically discernible cultivars, formed by ‘Aguilar’, ‘Reed’, ‘Comcar’, ‘Aquijic’, ‘Larrainzar’, L14NE and L14Ch, which presented a lower range of TACs (1.24–2.30 mg per g FW).
Pulp grouping by TAC (Fig. 2A) resulted in a situation similar to that of seed: the largest statistically homogeneous group (‘f’) contained 16 cultivars and spanned almost an order of magnitude (0.49–4.3 mg per g FW), with an average of 2.36 ± 1.39 mg per g FW. The remaining 6 cultivars were divided equally in two groups: discernible and non-discernible from the group with the highest concentration (‘a’). Correspondingly, peel (Fig. 2C), the tissue with the lowest TAC levels, had a large group of cultivars that showed no statistically significant differences, comprising 19 cultivars, with a range that spans an order of magnitude (0.22–2.8 mg per g FW) and a TAC of 1.1 ± 0.84 mg per g FW, contrasting with the rest: ‘Ag. Negro’ (5.5 ± 1.3 mg per g FW), L14NE (4.6 ± 1.1 mg per g FW) and 264 PTB (12.5 ± 3.0 mg per g FW). The latter avocado line, corresponding to a non-commercial accession, presented consistently high amounts of acetogenins in all tissues tested.
When acetogenin profiles were compared among tissues, Persenone A was, on average, the main acetogenin in peel and pulp, accounting for 46 and 48% of total acetogenins, respectively. However, for three cultivars (‘Vargas’, ‘Aries’ and ‘Larrainzar’), Persin was the most abundant acetogenin. For the peel tissue, there was a larger number of cultivars that had Persin as the most abundant acetogenin. In seeds, however, less than half of the cultivars had Persenone A as the main acetogenin. AcO-avocadene was the main acetogenin in seeds from ‘Pionero’, ‘Aries’, ‘Encinos’, ‘Hass’, ‘Fundación 2’ and ‘Larrainzar’ cultivars, and AcO-avocadenyne was the most abundant compound in six other cultivars (L14NE, L14Ch, ‘Ag. Negro’, ‘Almoloya’, ‘Aquijic’, and ‘Comcar 1’). Notably, almost all cultivars in which AcO-avocadenyne was the main contributor to the seed acetogenin profiles were either confirmed or presumed Mexican varieties, with the exception of ‘Comcar 1’, which was labeled as Guatemalan according to data provided at collection.
The quantitation of these eight compounds across 22 avocado cultivars from Mexican and Guatemalan varieties along with their hybrids (ESI Table 1†) showed that almost two thirds of the examined cultivars (14 out of 22) contained low contents of acetogenins (Fig. 2). Although this may seem counterintuitive from an evolutionary perspective (it would be expected for high acetogenin content to be positively selected, due to its role in plant defense),13,14,29,30 it may be explained by co-selection of the trait during domestication, since acetogenins have been reported to present a bitter, unpleasant flavor.3,31 Therefore, it is possible that varieties with a high concentration of acetogenins may have been selected out. In light of this rationale, it seems coherent that commercially accepted cultivars such as ‘Reed’ and ‘Hass’ have low concentrations of acetogenins in pulp, while the highest concentration was found in a non-commercial accession (264PTB, Fig. 2). However, when considering the few cultivars that were statistically discernible, average contents of TACs span more than one order of magnitude in peel (0.22–12.5 mg per g FW) and in pulp (0.49–9.6 mg per g FW) and less than one in seed (1.1–8.3 mg per g FW, Fig. 2). These observations are in accordance with a previous study conducted in avocado leaves with 21 avocado lines, which revealed that Persin concentrations varied within a similar range (0.4–4.5 mg per g FW).15 In the present work, Persin concentrations ranged from 0.09–0.9, 0.05–1.3 and 0–0.3 mg per g FW for peel, pulp and seed, respectively.
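As a quick check of the order-of-magnitude statement, the span of each range can be expressed as the base-10 logarithm of the ratio between its upper and lower bounds. The sketch below uses only the TAC ranges quoted in the text; the variable and function names are illustrative.

```python
import math

# TAC ranges (mg per g FW) as quoted in the text: (lowest, highest) mean.
tac_ranges = {
    "peel": (0.22, 12.5),
    "pulp": (0.49, 9.6),
    "seed": (1.1, 8.3),
}

def orders_of_magnitude(low, high):
    """Span of a range in orders of magnitude, i.e. log10(high / low)."""
    return math.log10(high / low)

spans = {tissue: orders_of_magnitude(lo, hi) for tissue, (lo, hi) in tac_ranges.items()}
for tissue, span in spans.items():
    print(f"{tissue}: {span:.2f} orders of magnitude")
```

The spans come out at roughly 1.75 for peel, 1.29 for pulp and 0.88 for seed, consistent with the statement that only seed stays below one order of magnitude.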
Fig. 3 Multivariate analysis for unsupervised classification of acetogenin distribution among tissues. (A) Ternary diagram grouped by carbon number (C17, C19, C21), separated by tissue (seed: red squares; pulp: green circles; and peel: gray triangles); each point is the average of three measurements. (B) Average of PCA scores (depicted as points) by cultivar plotted by tissue (seed: red; pulp: green; and peel: gray) and loadings (blue arrows) projected on the two main components (n = 3). Size of the arrow is proportional to the magnitude of the loadings; vectors are scaled and therefore, the magnitude does not correspond to the axes; numbers indicate the corresponding acetogenin as depicted in Fig. 1, only shown for relevant loadings. PC, Principal Component.
Since the ternary diagram suggested a role of the number of carbons of avocado acetogenins in their accumulation and therefore capacity to classify seed tissue, the present work proposes acetogenins to be grouped in three families C17, C19 and C21 acetogenins, considering carbon numbers of their de-acetoxylated backbones (Fig. 1B). From this arrangement, a descriptive nomenclature is here being proposed, using the roots avocatl (Nahuatl for avocado) for 17 carbon-, pahuatl (Nahuatl for fruit, still in use for some Mexican North-Eastern avocado cultivars) for the 19 carbon-, and Persea (from the genus) for the 21 carbon-containing acetogenins. Therefore, acetogenins are to be separated in three families: Avocatins (C17), Pahuatins (C19) and Persenins (C21). This nomenclature is consistent with all the known lauraceous acetogenins and most of the published trivial names: Persin and Persenone A belong to Persenins, and all reported avocadenols32 and avocadenes31 (acetoxylated or not) would be in the Avocatins family, along with the avocatin B (a mixture of AcO-avocadene and AcO-avocadyne).10 It also recovers the use of the term Avocatins, employed in the seminal work that named 17-carbon acetogenins.4 This nomenclature is to be fully developed to include all known lauraceous acetogenins in further reports.
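The proposed nomenclature amounts to a lookup on the carbon number of the de-acetoxylated backbone. The following minimal sketch illustrates it; the helper names are hypothetical, and the total carbon counts in the examples are derived from the backbone lengths given in the text plus the two carbons contributed by the acetoxy group.

```python
# Family names proposed in the text, keyed by backbone carbon number.
FAMILIES = {17: "Avocatins", 19: "Pahuatins", 21: "Persenins"}

def backbone_carbons(total_carbons, acetoxylated):
    """Carbon count of the de-acetoxylated backbone (hypothetical helper).

    The acetoxy group contributes two carbons that are not counted.
    """
    return total_carbons - 2 if acetoxylated else total_carbons

def acetogenin_family(n_backbone):
    """Return the proposed family for a given backbone carbon number."""
    if n_backbone not in FAMILIES:
        raise ValueError(f"no family proposed for a C{n_backbone} backbone")
    return FAMILIES[n_backbone]

# Persin: C21 backbone plus acetoxy group (23 carbons in total).
print(acetogenin_family(backbone_carbons(23, acetoxylated=True)))
# AcO-avocadene: C17 backbone plus acetoxy group (19 carbons in total).
print(acetogenin_family(backbone_carbons(19, acetoxylated=True)))
# Avocadene: C17 backbone, not acetoxylated.
print(acetogenin_family(backbone_carbons(17, acetoxylated=False)))
```

Counting the backbone rather than the whole molecule is what places AcO-avocadene, with 19 carbons in total, among the Avocatins rather than the Pahuatins.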
Results from the different approximations converge in establishing that pulp and peel contain similar profiles. CCA indicated that acetogenin concentrations in peel were largely explained by pulp (72% of redundancy) with the variation of a set of acetogenins in pulp explaining the behavior of their exact counterparts in peel, therefore hinting at direct transport. However, peel generally accumulated minute amounts of acetogenins when compared to those from pulp and seed (Fig. 2). Our observations suggest that peel tissue cannot synthesize acetogenins and that it relies on transport from the pulp for supply, therefore having a very similar profile. These inferences made from the multivariate analysis are supported by previous works, which noted that peel was incapable of synthesizing Persin from isotopically labeled linoleic acid or acetate.16
If we try to connect these findings with acetogenin production and possible transport among tissues, this would imply that, while some transport may be involved among the tissues, AcO-avocadenyne seems to be locally synthesized in seeds. Pulp is known to be able to synthesize acetogenins;16 however, seed tissue has not been evaluated for this capacity. The fact that this tissue clearly differentiates itself from the other two raises the hypothesis that it is capable of synthesis. Regarding the acetogenin biosynthetic route, evidence from previous works indicates that linoleic acid (C18:2) is a precursor of Persin (C21:2).16 The vast array of acetogenins found here, and their backbone differences, hint at differences in the use of precursors for their biosynthesis. For example, both AcO-avocadenyne and AcO-avocadene are calculated to have a 17-carbon backbone, which implies that their biosynthesis possibly has a different precursor than that of Persenone A, which has a 21-carbon backbone (Fig. 1B). Persin probably has common precursors with Persenone A, as evidenced by the similarity of their accumulation and structures (particularly the two cis double bonds, shared also, in the same ω position, with linoleic acid, Fig. 1B). However, it would be unlikely for Avocatins (such as AcO-avocadenyne and AcO-avocadene) to share linoleic acid as a precursor, not only because of the lack of cis bonds in reported avocatins,31,32 but also because their carbon number is lower, requiring cleavage and oxidation steps that would make their production from linoleic acid improbable.
It is more likely that avocatins' precursors are saturated, medium-to-long chain (C < 18) fatty acids.28 Interestingly, it has been reported that seed tissue preferentially stores odd-chain fatty acids of the same carbon number as avocatins (C17:0, 1.7% and C17:1, 0.37%) compared to pulp (0.033 and 0.11%, respectively), along with other odd-chain fatty acids (C19:0, while not detected in pulp, totals 0.61% of seed fatty acids).28 Taking all this into account, it is tempting to speculate that seed is capable of independent acetogenin production, and that such biosynthesis may be related to fatty acid availability.
As shown in Fig. 4A, the main Discriminant Score (DS1) was capable of separating Hybrids from Mexican and Guatemalan varieties, which were almost identical in projection on DS1 but formed antipodes on the second dimension (DS2). A canonical weight analysis (Fig. 4B) was also performed, which provided insights into the acetogenin profiles that contributed to the genotypic classification observed in Fig. 4A. Results indicated that the main discriminant factors were pulp acetogenins (green vectors), followed by peel acetogenins (gray vectors) and, as a distant third, seed acetogenins (red vectors). For example, Hybrids were separated from the other cultivars mainly by their pulp concentrations of UPA and Persenone C, which contributed almost nothing to the separation of Mexican from Guatemalan varieties. In contrast, concentrations of Persenone A and AcO-avocadene in pulp had a major effect on separating Mexican from Guatemalan genotypes, while failing to separate them from hybrid varieties.
Fig. 4 Model reduction and classification of unknown cultivars. (A) Score plot showing the projection of individuals (each point is a measurement) on the main discriminant planes, showing grouping by genotype (Guatemalan, blue; Mexican, pale green; and Hybrids, orange) and ellipses corresponding to the mean (center) and variance (ellipse) for each class. Density plots are shown along the margins. (B) Canonical weights, shown as vectors, corresponding to coefficients of the linear discriminant functions, colored by tissue (seed: red; pulp: green; and peel: gray). Size of the vector indicates normalized magnitude, and labels are located at the mean of each specified genotype; numbers correspond to each acetogenin as shown in Fig. 1. (C) The resulting classification using the reduced model, where the center is the mean, ellipses represent the within variance of each group, circles are the known samples used to generate the model and triangles depict the predicted samples of unknown genotype. DS, Discriminant Score.
It is important to note that, while the significance of the loadings in canonical analysis was proportional to their weights, the model and its usefulness depend on the contribution of all coefficients. Hence, in order to reduce the number of components needed for a useful predictive model, a linear regression discriminative analysis was applied to the concentrations matrix data, adding one column per cycle in the order given by the canonical weights of the dLDA (Fig. 4B). Afterwards, the matrix with the reduced number of columns was subjected to pLDA with 3-fold cross validation and bootstrapping with 5000 iterations, and the percentage of misassignments was calculated for each variety (ESI Fig. 4†).
In order to obtain the combination of weights that resulted in the minimum number of total wrong assignations, the sum of all error matrices was plotted, and a global minimum, composed of the sum of local minima in the single plots (ESI Fig. 4†), was found when using the best 5 weights for DS1 and 2 for DS2. It is important to note that using more weights was as detrimental to the model as using fewer, due to the over-fitting characteristic of LDA models. Also, taking advantage of this global minimum, best fitting results can be obtained by using only one third of the available data.
A reduced dLDA model was then generated, in which pulp and peel had the highest number of corresponding coefficients, capable of accurately describing varieties, while seed tissue did not appear to contribute to this task. Interestingly, for the majority of the acetogenins, Guatemalan genotypes were characterized as having significantly lower amounts of acetogenins than the rest, particularly in peel. This phenotype correlates with the differences in texture and thickness found in the peel of Guatemalan and Mexican fruits: Guatemalan varieties have a thicker, lignified skin, while Mexican ones have a soft, even edible peel, in which the transport from the pulp that we are suggesting could probably happen with fewer constraints. Notably, hybrids showed significantly higher accumulation of the selected acetogenins, suggesting this trait could be heterotic. The average TAC of hybrid varieties doubles that of non-hybrid varieties in pulp (4.75 ± 2.81 vs. 2.36 ± 1.91 mg per g FW) and peel (3.41 ± 4.94 vs. 1.59 ± 1.57 mg per g FW), and almost doubles it in seed (5.32 ± 2.32 vs. 2.99 ± 1.67 mg per g FW). This observation is relevant, as it opens the possibility of increasing acetogenin content by selective breeding, with a particular emphasis on outcrossing.
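The fold differences behind "doubles" and "almost doubles" can be worked out directly from the mean TAC values quoted above. This is plain arithmetic on the reported means and ignores the reported dispersion; the variable names are illustrative.

```python
# Mean TAC (mg per g FW) quoted in the text: (hybrids, non-hybrids).
mean_tac = {
    "pulp": (4.75, 2.36),
    "peel": (3.41, 1.59),
    "seed": (5.32, 2.99),
}

# Ratio of the hybrid mean to the non-hybrid mean for each tissue.
fold = {tissue: hybrid / other for tissue, (hybrid, other) in mean_tac.items()}

for tissue, ratio in fold.items():
    print(f"{tissue}: hybrids accumulate {ratio:.2f}-fold the non-hybrid mean")
```

The ratios are about 2.01 for pulp, 2.14 for peel and 1.78 for seed.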
As a validation of the model, prediction equations were applied to avocados of known genotype, correctly classifying all cultivars: 13 by unanimity and 2 by majority (ESI Table 1†). The results of the cross validation are summarized in ESI Table 4† for individual predictions. Given that a genotype assignation requires two out of three individuals to share it, the adjusted probability of a prediction being wrong is 10.4%, 4.6% and 6.9% if the resulting classification is Guatemalan, Mexican and hybrid, respectively. Conversely, the chance of a Guatemalan, Mexican or hybrid variety being misclassified is 2.3%, 10.4% and 11.4%, in that order. This translates into Mexican and hybrid cultivars having a one in ten probability of being misclassified, probably as Guatemalan.
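The two sets of percentages above answer different questions: how often a given predicted class is wrong, and how often a given true class is missed. The sketch below shows how both can be read from a confusion matrix. The counts are hypothetical and chosen only for illustration; they are not the cross-validation results of ESI Table 4, and the sketch does not reproduce the two-out-of-three adjustment.

```python
classes = ["Guatemalan", "Mexican", "Hybrid"]

# Hypothetical confusion matrix: rows are true classes, columns are
# predicted classes, in the order given by `classes`.
confusion = [
    [40, 1, 1],
    [3, 50, 2],
    [2, 1, 45],
]

def error_rates(confusion, classes):
    """Per-class error rates read along the columns and along the rows."""
    rates = {}
    for k, name in enumerate(classes):
        correct = confusion[k][k]
        true_total = sum(confusion[k])                       # row sum
        predicted_total = sum(row[k] for row in confusion)   # column sum
        rates[name] = {
            # Probability that a prediction of this class is wrong.
            "wrong_given_predicted": 1 - correct / predicted_total,
            # Probability that a sample of this class is misclassified.
            "misclassified_given_true": 1 - correct / true_total,
        }
    return rates

rates = error_rates(confusion, classes)
for name, r in rates.items():
    print(name, {key: round(value, 3) for key, value in r.items()})
```

A class can have a low miss rate and still attract many wrong predictions from other classes, which is the pattern described for the Guatemalan group in the text.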
Genotype prediction for the seven avocados of unknown origin, obtained by a pLDA model, resulted in two unanimous assignments and five by majority, with no conflicting assignation. ‘Aguilar’ and ‘Pionero’ cultivars were classified as hybrids, an origin that seems correct, since they were a product of an effort to produce new cultivars by CICTAMEX.34 Likewise, the classification of both cultivars from “Los Catorce” (L14NE and L14Ch) as Mexican genotypes also corresponded to their morphological characteristics. However, classification of ‘Fundación 2’ and ‘Aries’ as having a Mexican origin contrasts with the observed morphology. These particular cultivars do not show phenotypical characteristics of Mexican cultivars, probably being hybrids with a chemical phenotype similar to Mexican avocados, explaining the classification of one of their replicates as Guatemalan and Hybrid, respectively. An increase in sample number is recommended to increase accuracy of the assignation.
Similarly, ‘Ariete’, the only cultivar assigned as Guatemalan, has been reported to be a segregate of the ‘Colin V-33’ line,35 which is a reported hybrid.36 However, morphological characterization studies show a strong separation from different Mexican avocados,35 which corresponds to the results of the present study. It is important to note that classification of avocado varieties has been challenging, at best. Often, assignation varies depending on the method used for determination, differing between morphological and molecular biology based techniques. As an example, the ‘Hass’ cultivar has been classified by morphology as Guatemalan,25 but microsatellite markers classified it as a hybrid of Guatemalan and Mexican genotypes.26,36 This is the first attempt to catalog avocado varieties using fruit chemotype as the classifying trait, establishing a basis for phenotype-level assignation with well-defined features that can complement morphological classification. If enriched by increasing the sample size and the tissues analyzed, this method has the potential to be useful for high-throughput screenings, since it is easily scalable, faster and cheaper than molecular biology techniques, and also for complementing genotyping with information at a different level, improving the robustness of assignations.
Footnote
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c5ra22854a
This journal is © The Royal Society of Chemistry 2015