Mirkó Palla‡
*ab,
Filippo G. Bosco‡d,
Jaeyoung Yang‡b,
Tomas Rindzeviciusd,
Tommy S. Alstrome,
Michael S. Schmidtd,
Qiao Linb,
Jingyue Juc and
Anja Boisen*d
aWyss Institute for Biologically Inspired Engineering, Harvard University, Boston, Massachusetts 02115, United States. E-mail: anja.boisen@nanotech.dtu.dk; mirko.palla@wyss.harvard.edu
bDepartment of Mechanical Engineering, Columbia University, New York, New York 10027, USA
cDepartment of Chemical Engineering, Columbia University, New York, New York 10027, USA
dDepartment of Micro- and Nanotechnology, Technical University of Denmark, Lyngby 2800, Denmark
eDepartment of Applied Mathematics and Computer Science, Technical University of Denmark, Lyngby 2800, Denmark
First published on 2nd October 2015
Surface-enhanced Raman spectroscopy (SERS) based on nanostructured platforms is a promising technique for quantitative and highly sensitive detection of biomolecules in the field of analytical biochemistry. Here, we report a mathematical model to predict experimental SERS signal (or hotspot) intensity distributions of target molecules on receptor-functionalized nanopillar substrates for biomolecular quantification. We demonstrate that by utilizing only a small set of empirically determined parameters, our general theoretical framework agrees with the experimental data particularly well in the picomolar concentration regimes. This developed model may be generally used for biomolecular quantification using Raman mapping on SERS substrates with planar geometries, in which the hotspots are approximated as electromagnetic enhancement fields generated by closely spaced dimers. Lastly, we also show that the detection limit of a specific target molecule, TAMRA-labeled vasopressin, approaches the single molecule level, thus opening up an exciting new chapter in the field of SERS quantification.
During a typical measurement, Raman signals are amplified by large electromagnetic enhancements, so-called hotspots, and are collected from a SERS substrate. These highly localized hotspots are believed to be formed between metallic junctions of noble nanostructures or nanoparticles.12,13 However, the acquisition of uniform SERS signals over a large area is particularly challenging as the signals significantly vary between hotspots with slightly different junction dimensions. Theoretically, an ideal approach to design such a sensing unit would be to precisely position one Raman active molecule (with surface functionalization strategies such as DNA origami or click chemistry) in the hotspot and eventually create an array-like configuration containing many of these units. Unfortunately, such precision for SERS substrate design and molecular positioning has not yet been achieved.
Typically, analyte molecules are adsorbed onto the surface of a nanostructured SERS substrate at random locations by either drop coating or incubation methods using liquid samples containing the analyte. These methods for SERS measurements present several shortcomings. The surface coverage of the analyte molecules is not uniform and difficult to estimate due to the stochastic nature of the adsorption process; thus large spatial variations are expected in measurements over the SERS substrate area. At ultra-low concentrations, often used in single molecule (SM) studies, the probability for a molecule to be located exactly in a hotspot is extremely low, which leads to unreliable statistics due to the large portion of unoccupied hotspots, i.e., low number of detectable SERS signals. On the other hand, for each type of nanostructured substrate and analyte to be detected, there is a concentration regime for which the surface coverage of adsorbed molecules is optimal, such that an exponentially increasing relationship can be observed in the SERS signal intensity (emanating from the Raman active molecules) as a function of analyte concentrations.14
Here, we developed a statistical method based on large-scale Raman mapping, which improves statistical reliability and reproducibility of collected SERS signals, by studying the intensity distribution of an ensemble of SERS signals and its statistical implications. This analytical model has the potential to be used as a quantitative tool in SERS sensing applications. To demonstrate the significance of our approach, the following issues will be examined in this paper: (1) development of a theoretical model to fit and (2) quantification of experimental results based on SM principles, and (3) dependence of SERS responses on the number of probed molecules per unit area at picomolar levels.
In brief, the approach developed here takes advantage of the characteristic power law distribution of SERS hotspots collected from an ensemble of measurements over a large area. Thus, it provides not only a general theoretical framework to describe statistics of SERS signals requiring a minimum set of parameters, but also a novel method for low abundance biomolecular quantification.
SERS enhancements on most substrates are highly non-uniform on the molecular scale. Points of large electromagnetic enhancement or hotspots are highly localized and are so sparsely distributed that they can be often found within tens of nanometers of points with zero enhancements. For this reason, we defined the SM enhancement factor (SMEF) as the SERS enhancement induced by a given molecule at a specific point on the substrate. The SMEF is dependent upon the Raman tensor of the probe molecule, which describes its orientation on the substrate and with respect to the local electromagnetic field at that point. It is also dependent on the SERS substrate orientation with respect to incident laser polarization19 and direction. We assumed these parameters to be constant, since the substrate orientation was kept orthogonal to the incident laser beam, i.e., it was fixed, and 0° polarization was used in all experiments. Therefore, the SMEF is mathematically defined as:
![]() | (1) |
![]() | (2) |
Thus, we showed that F is proportional to the measured intensity I by a constant factor 〈ISMRS〉. In general, F can be calculated using the Raman tensor of the probe molecule, its adsorption and scattering geometry, as well as its vibrational mode energy.18 For simplicity, F can be expressed in a more general form such that:
![]() | (3) |
p(F) ≈ AF−(1+k) | (4) |
d(I) ≈ AI−(1+k) | (5) |
While the intensity distribution model developed in the previous section only described a single hotspot, it is relevant to hotspot mapping experiments of leaning nanopillars, in particular for the picomolar concentration regime and <1 μm2 signal sampling areas. With increasing number of hotspots, the PDF of SERS intensity (eqn (5)) tends to a Gaussian distribution (central limit theorem), whose upper tail – used in data processing – can be approximated fairly well with a TPD.
Additionally, the detected SERS signal results from the summation of many SM signals located within the probed surface area (∼1 μm2), since the observed signal intensity can be expressed as the sum of signals from independent hotspots by the superposition principle. Since the Au nanopillar leaning direction is random, each dimer will be randomly oriented with respect to the incident field polarization. Only a dimer with (i) a smallest gap and (ii) parallel to the incident field polarization displays highest electromagnetic field enhancement, and (iii) a dimer should also contain a number of target molecules within the hotspot area (within 10–20 nm, picomolar concentration regime) which then produces a detectable SERS intensity. We therefore believe that recorded SERS intensities for each measurement point are dominated by a single hotspot.
Specifically, when the nanopillars are functionalized with vasopressin-specific aptamers, upon liquid evaporation, the pillars lean on each other forming highly enhancing hotspots, modeled as dimers of metallic spheres. When the TVP is introduced, it binds to the aptamers randomly distributed on the pillar heads. When vasopressin is trapped by an aptamer close enough to a hotspot, the TAMRA SERS signal is highly enhanced. Because of the random nature of Au–thiol bonding between aptamers and Au-coated pillar heads and the TVP binding to aptamers, statistically only a few TAMRA molecules may be positioned close to the hotspot at a time in the scattering volume. However, the majority of SERS signal will be derived from these particular molecules due to the predominant enhancement effects.
From the above formulated statistical arguments, in the theoretical domain, this particular problem deals with statistically equivalent observations of SERS signals collected on identical dimers of metallic spheres. In the case of vasopressin detection, using the leaning nanopillar approach, we experimentally measure the SERS signal from TAMRA molecules deposited on the Au-coated pillar heads. Here, in the experimental domain, statistics are determined by collecting a large ensemble of random measurements from the SERS substrate and compiling them into histograms. These two approaches are analogous to each other and are the basis for correlating SERS signal intensity to analyte concentration in this analytical quantification method.
To represent the SERS intensity distribution in another way, one can generate an intensity histogram rendering a large number of SERS events of randomly distributed molecules, in which the smallest and largest value on the x axis corresponds to the minimum and maximum of the top X% of all SERS signals measured during the Raman mappings. At 100 pM TVP, for the diagnostic peak 1370 cm−1, such histograms were constructed for the top 5%, 10%, 20% and 100% of measured SERS signals (Fig. 2A–D), and was fitted with a Pareto function in the general form: y(x) = axn. It was observed that there is a functional relationship between the coefficient of determination value (R2) for the Pareto fit and the number of hotspots. This is demonstrated in Fig. 3 for the full range of hotspot percentages, which plateaus approximately at the top 20% cutoff value (R2 ≈ 1.00, X = 20%). At this inflection point (Fig. 2C), the collective SERS intensities reflect a highly skewed or long-tailed TPD distribution similar to that described by eqn (5). Then, the R2 values moderately decrease until the full range of SERS intensities are utilized (X = 100%) (Fig. 3). This observation is consistent with the general Pareto principle and experimental SM SERS intensity distributions studies,21,22 thus this threshold was chosen in further analysis for the development of our quantification model.
![]() | ||
Fig. 3 Coefficient of determination (R2) of power (Pareto) fit for the SERS intensity distribution as a function of percentage (%) of hotspots evaluated in the fitting of peak 1370 cm−1 of TAMRA-labeled vasopressin (TVP) at 100 pM (blue curve), 10 pM (red curve) and 1 pM (green curve) concentrations. Note, that the R2 values follow a similar trend for all TVP concentrations and plateau at about 20%, which is predicted by the theoretical Pareto principle.22 Error bars represent standard deviations for three independent mapping measurements. Fig. 2 represents four slices from this figure. |
The constructed SERS intensity histograms, containing the top 20% of hotspots, were fitted with a power function for the diagnostic TAMRA peak 1370 cm−1 at the three different concentration levels in the picomolar regime (Fig. 4). The high coefficient of determination (R2) values (>0.9) demonstrate a strong power relationship between TVP concentration and SERS intensity integrals (Table 1). Since the experimentally obtained SERS intensity distribution fits well to the predicted long-tailed TPD distribution, we believe that the analytical model – based on single hotspot theory developed in eqn (3)–(5) – is suitable to describe the SERS enhancement mechanism of leaning nanopillars for TVP detection. By arbitrarily defining α = 1 + k, we can now rewrite eqn (5) for a given TVP concentration (C) as:
d(I) ≈ A(C)I−α(C), | (6) |
TVP concentration (pM) | Coefficient of determination (R2) |
---|---|
100 | 0.9270 |
101 | 0.9562 |
102 | 0.9003 |
The SERS substrate is considered as a collection of distinct hotspots. The distribution of corresponding SERS intensities – to a first approximation – can be obtained by summing their respective intensities. Now, based on the experimentally obtained fitting parameters at the three picomolar concentration levels (Table 2), it is reasonable to assume, that α and A may be (arbitrarily) represented as logarithmic functions of the analyte concentration with general forms:
α(C) = b![]() | (7) |
A(C) = g![]() | (8) |
d(I) ≈ (g![]() ![]() | (9) |
TVP concentration (pM) | 102 | 101 | 100 |
---|---|---|---|
A | 5.7244 × 109 | 6.9896 × 1011 | 1.7156 × 1012 |
α | 2.2933 | 3.0187 | 3.2350 |
Imax | 4.1550 × 104 | 3.5370 × 104 | 2.3710 × 104 |
Imin | 4.9875 × 103 | 2.0253 × 103 | 1.6108 × 103 |
For a given analyte concentration, the integral of the intensity distribution between the local intensity minimum (Imin) and maximum (Imax) values give the sum of all intensities (Isum) in a particular mapping experiment. More precisely, Imin and Imax are defined as the lowest (minima) and highest (maxima) experimentally determined hotspot intensity values in each experiment. Now, this integral can be expressed as a function of analyte concentration (C) such that:
![]() | (10) |
![]() | (11) |
By substituting eqn (7) and (8) into (11), we finally gain an analytical expression for the total intensity as a function of the TVP concentration, C:
![]() | (12) |
It should be noted that in eqn (12) the upper and lower limits of integration need to be substituted with their function counterparts, since Imin and Imax may also be described as logarithmic functions of the analyte concentration (Fig. S5C and D in ESI†). Here, we simplified our quantification model by redefining Imin and Imax as the lowest and highest experimentally determined hotspot intensity values in all mapping experiments, i.e., global extremes (Table 2); but this observation may be investigated in a future work using a more rigorous framework.
Fig. 5 shows the total intensities obtained from experiments at various analyte concentrations (solid black line) along with their theoretical fits based on the analytical expression for the intensity integral (eqn (12)) – using the global intensity extremes (dotted red line) or the mean of local extremes (dashed red line) respectively. This result demonstrates that the long-tailed (Pareto-like) intensity distribution model developed for a single hotspot (eqn (6)) can predict experimental values for the TVP quantification using the leaning nanopillar-based SERS substrate (Table 3). The model fits the experimental data well in the tested picomolar concentration regimes and calls for further experiments to evaluate its feasibility in ultralow (femtomolar) analyte concentrations.
![]() | ||
Fig. 5 Comparison of intensity integrals of experimental data and theoretical fit for the diagnostic TAMRA peak 1370 cm−1 as a function of TAMRA-labeled vasopressin (TVP) concentration. The experimental curve (solid black line) is predicted by theoretical fits relatively well in the picomolar TVP concentration regimes. Both theoretical models using the global intensity maxima/minima values (dotted red line) and the mean of local extremes (dashed red line) as limits of integration in the analytical expression for the intensity integral defined by eqn (12) are shown. |
Theoretical limits of integration | TVP concentration (pM) | Experimental mean value (cnt) | Theoretical prediction (cnt) | Percentage error (%) |
---|---|---|---|---|
Global maxima/minima | 100 | 5.5073 × 105 | 5.2994 × 105 | −3.78 ± 6.04 |
101 | 6.6482 × 105 | 7.2652 × 105 | +9.28 ± 8.70 | |
102 | 1.0502 × 106 | 1.0749 × 106 | +2.35 ± 0.83 | |
Mean of local maxima/minima | 100 | 5.5073 × 105 | 5.2994 × 105 | −3.78 ± 6.04 |
101 | 6.6482 × 105 | 6.5573 × 105 | −1.37 ± 8.70 | |
102 | 1.0502 × 106 | 8.9447 × 105 | −14.83 ± 0.83 |
In order to minimize the variability in measurements for individual substrates with the same functionalization/treatment conditions, coefficient of variation (CV) analysis was used to determine the optimal mapping area (see Materials and methods). This statistical method determines the extent of variability in relation to the mean of the population, i.e., it is a normalized measure of the dispersion in the intensity distribution obtained from the Raman mapping experiments. Nanopillar substrates functionalized with the vasopressin-specific aptamer and treated with 1 nM TVP were sampled using the CV algorithm, computing the mapping-to-mapping variability under the same substrate functionalization conditions. The 2D scatter plots in Fig. 6 indicate that the CV exponentially decays as the mapping area increases when the diagnostic TAMRA peak 1370 cm−1 is investigated. A total of 100 pixels – equivalent to a 10 μm × 10 μm SERS substrate area – are required to reach a variability threshold of 1% for both 1 pM and 100 pM TVP concentrations (Fig. 6A and B). Thus, the CV analysis verified that the measurements from the different substrates were not statistically different, using at least square mapping areas of ∼100 μm2 in the picomolar concentration regime, demonstrating the robustness of experimental repeatability of the mapping technique and providing an optimal area threshold to minimize variability between experiments.
The quantitative method denoted above provides information about the ensemble of SERS signals in a collective way, but does not detail the number of molecules per SERS enhancement event, which might be a critical parameter for SM studies. To address this issue, we approximated the leaning nanopillar heads as prolate spheroids and the pillar columns as cylinders based on high resolution SEM images with dimensions shown in Fig. 7. Semi-axes a, b (a = b in our case) and c are 75 nm, 75 nm and 120 nm respectively, while the column radius (d/2) was measured to be about 40 nm. Now, the surface area of the elongated spheroid (S) was calculated using the formula:
![]() | (13) |
![]() | (14) |
Given the calculated nanopillar density, Dpill = 20 μm−2,15 we estimated the total number of nanopillars on the 5 × 5 mm chip (Npill), which was used in all Raman mapping measurements, to be: Npill = AchipDpill = 5 × 108. By using these approximations, the total active gold area of the nanopillars (Atot) – assuming that gold is only deposited on the spheroid and not on the pillar – over the entire chip was calculated to be: Atot = NpillApill = 0.436 cm2. Now, by considering the maximum packing density of thiol–DNA bonds (1012 cm−2),23 the number of adsorbed aptamers (Napt) on this surface was estimated to be: Napt = 4.36 × 1011. Then, using the Langmuir isotherm, we calculated the total number of vasopressin molecules bound by aptamers on the nanopillar surfaces: where C is the TVP concentration and K is the dissociation constant (1.17 nM)24 (Table 4). We then estimated that less than ∼20 molecules per cluster would be probed by a single SERS measurement at 1 pM TVP concentration. This estimation suggests the possibility of SM detection, just one order of magnitude away, using this statistical SERS quantification method.
TVP concentration (pM) | Total number of absorbed molecules (#) | Number of molecules per cluster (1/μm2) |
---|---|---|
100 | 3.96 × 1010 | 1585 |
10 | 4.32 × 109 | 172 |
1 | 4.35 × 108 | 17 |
SERS | Surface-enhanced Raman spectroscopy |
SM | Single molecule |
EF | Enhancement factor |
TVP | TAMRA (5-carboxytetramethyl-rhodamine)-labeled vasopressin |
Probability density function | |
TPD | Truncated Pareto distribution |
CV | Coefficient of variation |
Footnotes |
† Electronic supplementary information (ESI) available: Statistical algorithm and experimental procedures for determining the minimum mapping area; alternative representation of the SERS intensity distributions; figures supplementing the statistical tests; and additional references. See DOI: 10.1039/c5ra16108h |
‡ These authors contributed equally. |
This journal is © The Royal Society of Chemistry 2015 |