Wolfgang
Reindl
a,
Benjamin P.
Bowen
a,
Michael A.
Balamotis
b,
Jeffrey E.
Green
c and
Trent R.
Northen
*a
aDepartment of Bioenergy/GTL & Structural Biology, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA. E-mail: TRNorthen@lbl.gov; Fax: +1 (510) 486-4545; Tel: +1 (510) 486-5240
bDepartment of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
cTransgenic Oncogenesis and Genomics Section, Laboratory of Cancer Biology and Genetics, National Cancer Institute, Bethesda, Maryland, USA
First published on 6th January 2011
The tissue microenvironment critically influences the molecular characteristics of a tumor. However, as tumorous tissue is highly heterogeneous it may harbor various sub-populations with different microenvironments, greatly complicating the unambiguous analysis of tumor biology. Mass spectrometry imaging techniques allow for the direct analysis of tumors in the spatial context of their microenvironment. However, discovery of heterogeneous sub-populations often depends on the use of multivariate statistical methods. While this is routinely used for 2D images, multivariate statistical approaches are rarely seen in the context of 3D images. Here we present the automatic alignment of 2D images recorded by nanostructure-initiator mass spectrometry (NIMS) to reconstruct a 3D model of a mouse mammary tumor. Multivariate statistical analysis was applied to the whole 3D reconstruction at once, revealing distinct tumor regions, an observation that would not have been possible in such clarity through the analysis of isolated 2D sections. These sub-structures were confirmed by H&E and Oil Red O stains. This study shows that the combination of 3D imaging and multivariate statistics can be used to define tumor regions.
Insight, innovation, integrationThe broad understanding of a tumor's biology is only possible in the context of its microenvironment. Therefore, only techniques that can study the 3D structure of a tumor are able to map all the salient molecular processes. We apply automatic mass spectral image analysis for the generation and characterization of a 3D tumor reconstruction. Multivariate statistical analysis was used to identify different regions of the tumor. The presented technique is an approach to better understand the microenvironmental heterogeneity of tumorous tissue. |
All of the above observations demonstrate that for a broad understanding of tumor biology it is not sufficient to study isolated tumor sections. The highly complex spatial and heterogeneous organization of cancer cells and surrounding ECM needs to be analyzed. Standard metabolite and proteomic approaches that apply tumor extraction (‘grind-and-find’) techniques result in a loss of spatial information and bulk averaging of important cell sub-populations. Therefore, there is great interest in imaging approaches since they allow analysis of tumors in their microenvironment. Traditionally, research efforts have focused on localization of proteins but more recently have shifted to study metabolites.14,15
A wide range of mass spectrometry based imaging techniques are being developed for the analysis of tissue sections and are now being extended for 3D reconstructions, e.g. secondary ion mass spectrometry (SIMS) was used to create a 3D image of lipids in a Xenopus laevis oocyte,16 3D images based on desorption electrospray ionization (DESI) showed the distribution of lipids in mouse brain,17 or matrix-assisted laser desorption/ionization (MALDI) was used for a 3D reconstruction of proteins in rat brain.18 However, these approaches have been restricted to the 3D targeted visualization of metabolites or peptides/proteins.
In contrast to these targeted approaches, untargeted, global analysis critically depends on the use of multivariate statistical tools. Analysis of 2D mass spectral imaging datasets using multivariate statistical approaches has successfully revealed important structures and sub-populations. For example, principal component analysis (PCA) was used to define spatially distinct regions in rat brain19 or in pancreatic adenocarcinomas.20 Yet, physiological features within tissues do not exist in 2D planes. Simultaneous imaging of regions above and below any particular 2D section together with a 3D tissue analysis is important to help define distinct regions of a tumor and eventually provide a more complete understanding of tumor physiology.
Here we present the multivariate statistical analysis of a 3D mass spectral image of a mouse mammary tumor. This approach utilizes automated alignment of 2D mass spectral metabolic images of tumor sections converted into a 3D reconstruction. Multivariate analysis using non-negative matrix factorization (NMF) based 3D reconstruction revealed different regions based on ion abundance patterns and were confirmed using biological tissue stainings. To our knowledge, this is the first time multivariate analysis was applied to a 3D image acquired by mass spectrometry.
![]() | ||
Fig. 1 Scheme presenting the workflow from imaging single sections to 3D reconstruction. Initially, serial sections were prepared from a single tumor. Each of these sections was imaged by mass spectrometry imaging, yielding a dataset of complete mass spectra at each x and y coordinate. Major ions were identified and 2D mass spectral images were aligned by rotation of each section about the z-axis and translation in x any y directions. The resulting 4D datasets (x, y, z, m/z) were operated on to create 3D reconstructions for each ion. |
The correct spatial alignment of all imaged 2D sections is the first challenging step in the reconstruction of a 3D tissue image. Most standard acquisition software does not allow for a precise imaging of the tissue only, but rather images the content of a rectangular box placed around the tissue. This means, that due to variations of the tissue location in that box, sequential tissue sections will not align perfectly on top of each other. This effect could clearly be observed in this study where unaligned sections showed large variations in their x-y localizations or slight rotations in comparison to previous or successive sections (Fig. 2A). A multitude of different approaches has been used for the alignment of 2D sections, primarily relying on manually selected optical markers or semi-automatic software. In this study, a fully automatic alignment script was used. Each section was aligned to its previous section by maximizing the pixel to pixel correlation between subsequent sections to yield the aligned 3D reconstruction shown in Fig. 2B. Here the largest x-y translation was 24 pixels and 12 degrees was the largest rotation required to align the sections.
![]() | ||
Fig. 2 Section alignment of acquired 2D images. (A) An unaligned 3D image of intensities for an ion at m/z 97 demonstrates the small but significant shifts in x and y and the slight rotational variation of the x-y plane when comparing sequential sections. (B) Following the automated alignment strategy, sequential sections in 3D appear to be highly correlated along the z-axis. |
The three dimensional dataset was split into 30 components by non-negative matrix factorization (NMF). This number of dimensions was sufficient to reveal spatially variable patterns in the reconstruction. In general, each component could be classified as belonging to one of 4 groups: background, bulk of tissue, interior, or edge of tissue. Consequently, many of the identified components were spatially overlapping. Of the 30 components, 12 could be associated with the tissue and 18 could be associated with the background. Maximum intensity projections for each component are available in the ESI (supplementary information 2).‡ Three representative components within the tissue were used to generate a 3D volume reconstruction (Fig. 3): component 1 represented the bulk of pixels within the tissue (Fig. 3A), component 11 was the largest interior component (Fig. 3B), and component 15 was the region localized on the edge (Fig. 3C). In the following text component 1 will be referred to as region 1, component 11 as region 2, and component 15 as region 3. The 3D tissue volume-reconstruction assembled by the three NMF-component clusters clearly shows that region 3 (Fig. 3D, blue) frames the edge of region 1 (Fig. 3D, red) and region 2 (Fig. 3D, green) forms a sub-cluster within the red region. This becomes even more obvious when the front side of the 3D model gets ‘trimmed’ (Fig. 6) for better visualization.
![]() | ||
Fig. 3 Multivariate analysis of the 3D tumor model. Non-negative matrix factorization (NMF) was used to identify regions within the 3D dataset. (A) The 3D volume defined by component 1/region 1 represents the bulk of the tumor. (B) Component 11/region 2 lies within the interior of the tumor. (C) A region along the edge of the tumor was identified by component 15/region 3. (D) The 3D intensity data for each region can be used to create a 3D space-filling model, where each color represents a different region within the dataset (region 1: red; region 2: green; region 3: blue). |
For a better understanding of the biology behind the identified components, we coupled ion abundance analysis with histological tissue staining to identify the detected sub-structures. One of the most prominent ions in regions 1 and 2 is detected at m/z 104.1, which is likely choline, a known tumor marker.21–23 As the abundance of m/z 104.1 is significantly lower in region 3, this suggests that regions 1 and 2 together may represent the actual tumor mass (Fig. 4A, red), while region 3 might be formed by a different type of tissue (Fig. 4A, blue). Mouse mammary glands are known for their high content of fat.24 To test for the presence of adipocytes in the analyzed tissue, we stained sections adjacent to the mass spectrometry sections with Oil Red O, a dye that selectively stains fat and lipids. Surprisingly, only a very small part of the tissue showed a positive Oil Red O stain (Fig. 4B). But most interestingly, the stained portion fits exactly to the tissue area described by region 3 (Fig. 4A, blue). This data suggests that region 3 represents fat tissue on the edge of the tumor. For a more detailed analysis of the tumor tissue described by regions 1 and 2, the histology of the imaged tissue was further analyzed by H&E stains of sections adjacent to the imaged sections. For example, Fig. 5 shows the localization of region 2 (Fig. 5A) and an H&E stain (Fig. 5B) for section 24. In general, the hematoxylin staining is dominant throughout the whole tissue, revealing relatively small and densely packed cells. This also shows that with the exception of the detected fat tissue most of the tissue is formed by tumor cells. However, the H&E stain also clearly shows that the cell density is higher on the left side of the tissue, peaking in the area of the tissue that is equivalent to region 2 (Fig. 5A). Magnifications of small areas from the left side of the tissue within region 2 (Fig. 5C) and from the right side within region 1 (Fig. 5D) show the differences in cell density in a more detailed way. These data suggest that regions 1 and 2 represent the same general tissue type (i.e. the tumor), but due to putative differences in the microenvironment, form different populations within the tissue; this is evidenced by their unique ion profiles.
![]() | ||
Fig. 4 Identification of tumor and fat cells. (A) An ion at m/z 104.1, which is likely to be choline, can be used to distinguish the tumor regions. Red in the model represents high abundance of the ion at m/z 104.1 (equivalent to regions 1 and 2 in Fig. 3D), blue represents an area with high levels of an ion at m/z 198.1 (equivalent to region 3 in Fig. 3D), which exists only at low abundance in the red region. (B) Oil Red O stained tissue section 23 reveals the localization of adipocytes at the edge of the tumor. The staining matches the localization of region 3. Scale bar represents 2 mm. |
![]() | ||
Fig. 5 Histological analysis of the tumor tissue. The detected regions match morphological characteristics identified by H&E staining. (A) Localization of region 2 in tissue section 24. (B) H&E stain of tissue section 24 revealing different cell densities within the tissue. Dashed box shows tissue section enlarged in panel C, dotted box shows tissue section enlarged in panel D. Scale bar represents 2 mm. (C) Enlarged image of cells from region 2 shows high cell density. (D) Enlarged image of cells from region 1 shows low cell density. |
Cutting away several virtual planes from the 3D reconstruction to show the three identified regions allowed a more detailed view of spatial sub-structures inside the tissue (Fig. 6). The possibility to access every plane inside the 3D image allows for a precise analysis of ion abundances in the context of any surrounding tissue parts. This presents an important advantage in comparison to 2D tissue analysis, which is limited in explaining the effects of tissues above and below the imaged section. For example, if the tissue localization marked by the arrow in Fig. 6 would be analyzed as an isolated 2D image any observed effects would be attributed to region 1 (Fig. 6, red). However, the 3D image shows that the marked position lies directly adjacent to region 2 (Fig. 6, green). Any effects at the marked position in region 1 could be directly influenced by region 2. Therefore, to understand the molecular effects in tissue it is important to consider the full 3D space for physiological relevance.
![]() | ||
Fig. 6 Cut-away view of 3D tumor model. The reference viewpoint can be set to any arbitrary position and cut-aways of any planar surface allow inspection of the interior of the tumor. By looking at depth profiles along dimensions other than the z-axis, one can see how various ions are distributed in micro-encapsulated pockets (marked by white arrow) which might remain undetected when looking only at single-sections. |
The most challenging and important aspect of the above workflow was the construction of the 4D dataset because all intensity values must be registered on defined x, y, z, m/z axes. Without this alignment step one would be comparing the same ions, albeit slightly shifted in m/z, as if they were unique ions. Similarly, one could mistakenly compare ions in sequential sections that are not spatially registered. To illustrate this point, it is known that in some cases ions shift in m/z as a result of minor changes in tissue properties, making it difficult to maintain high mass accuracy and detection of the same ion as multiple m/z across tissue sections. Multivariate analysis could mistakenly define a component based on the similar spatial distribution of these seemingly unique ‘ions’, actually referring to only one single ion. Since the purpose of multivariate statistics is to discover regions and clusters of ions for subsequent analysis, we have used the strategy of binning ion intensities to minimize the effects of shifting m/z, though it does require subsequent analysis to confirm accurate mass.
Many groups have described the use of marker points for image registration.25 This is particularly useful for alignment of histological images with mass spectral images, particularly of brain regions which have clear anatomical features to aid alignment.18 However, this method is not as well suited for alignment of tumor sections which have less clear morphological features. Therefore, we have employed an automated image registration approach that is based on the correlation of ion intensities between sequential sections. Optimization proceeds by exploring the x and y shifts and rotation of the x-y plane about the z-axis. This process, like most optimizations, is prone to finding false minima. For this reason, we have employed the Pattern Search algorithm that overcomes this limitation and find the global optimum.26 Using this approach, a 143% improvement in the correlation was achieved after re-locating the image.
Multivariate statistical analysis was performed on the resulting coherent 4D dataset. While principle component analysis (PCA) has been the most widely used multivariate analysis technique, we have found it preferable to use non-negative matrix factorization (NMF) because all of the spectral loadings are positive and easier to interpret. This technique has historically been used for a variety of applications including interpreting complex spectra.27 The appearance of the NMF results has the same format as a mass spectrum and can therefore be easily interpreted.
The resulting statistically analyzed and surface rendered 3D image provides a detailed view of variations in the tumor, which has several advantages over conventional 2D image analysis. Fig. 6 shows that region 2 (green) was not detected in the bottom tissue sections. If tissue analysis would have been based solely on one or only very few 2D sections from that part of the tissue, a whole molecular sub-population would not have been detected. Furthermore, local sub-structures in individual sections might be misinterpreted in a 2D context. For example, small sub-structures in a 2D section, that seem to be of minor influence on the whole sections, might be part of a much bigger structure across the whole 3D tissue (e.g. vasculature). Therefore the 3D reconstruction is a more realistic view of the microenvironment by linking physiological structures above and below individual slices that may influence the observed phenotype of any individual section. Therefore, 2D tissue analysis is somewhat dependent on the random selection of an appropriate representative tissue section. To some extent this is also true for the 3D sampling; if a tumor is excised imprecisely, sub-populations can also be missed out. However, we think that by the analysis of a 3D model such imprecisions can be detected in a much clearer way as in an isolated 2D section. For example when one identified component sits at the very edge across several sections of the imaged tissue and was obviously split by the excision. This might have happened for the identified region 3, representing the fat tissue. Using our approach, complete tumor excision would be seen as one component representing the tumor mass being completely surrounded by normal stromal cells. To this end, the analysis of non-tumorous control tissue would be beneficial to define the molecular background for the type of analyzed tissue. Unfortunately, control tissue was not available for this study.
The resolution of the acquired 3D image is 75 μm, both in x- and y-direction, and 28 μm in z-direction. With cell sizes of 10 μm and more, this means that the resolution is not high enough for the analysis of single cells or of a detailed cellular microenvironment. However, this resolution is below the focus of current commercial instrumentation. Therefore, this study was mainly focused on developing the automated statistical analysis of 3D imaging data required to study the tissue microenvironment. For that purpose the resolution was high enough; different cell types and cellular environments could be detected. In comparison to conventional 3D analysis techniques, the big advantage of the presented method is direct label-free detection of analytes. In 3D microscopy for example, a very limited number of molecules can be detected in parallel through tagging with exogenous labels. In a mass spectrometric approach up to several thousand molecules can be analyzed simultaneously. Therefore, the output of data from a single experiment has the potential to greatly exceed any conventional 3D imaging technique.
Footnotes |
† Published as part of an Integrative Biology themed issue in honour of Mina J. Bissell: Guest Editor Mary Helen Barcellos-Hoff. |
‡ Electronic supplementary information (ESI) available. See DOI: 10.1039/c0ib00091d |
This journal is © The Royal Society of Chemistry 2011 |