Open Access Article
Alex
Dunlop
a,
Kate
Bowman
b,
Olav
Aarstad
c,
Gudmund
Skjåk-Bræk
c,
Bjørn T.
Stokke
d and
Andrew N.
Round
*b
aHH Wills Physics Laboratory, University of Bristol, Tyndall Avenue, Bristol BS8 1TL, UK
bSchool of Pharmacy, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, UK. E-mail: a.round@uea.ac.uk
cDepartment of Biotechnology, The Norwegian University of Science and Technology, NTNU, NO-7491 Trondheim, Norway
dDepartment of Physics, The Norwegian University of Science and Technology, NTNU, NO-7491 Trondheim, Norway
First published on 2nd October 2017
We evaluate an AFM-based single molecule force spectroscopy method for mapping sequences in otherwise difficult to sequence heteropolymers, including glycosylated proteins and glycans. The sliding contact force spectroscopy (SCFS) method exploits a sliding contact made between a nanopore threaded over a polymer axle and an AFM probe. We find that for sliding α- and β-cyclodextrin nanopores over a wide range of hydrophilic monomers, the free energy of sliding is proportional to the sum of two dimensionless, easily calculable parameters representing the relative partitioning of the monomer inside the nanopore or in the aqueous phase, and the friction arising from sliding the nanopore over the monomer. Using this relationship we calculate sliding energies for nucleic acids, amino acids, glycan and synthetic monomers and predict on the basis of these calculations that SCFS will detect N- and O-glycosylation of proteins and patterns of sidechains in glycans. For these applications, SCFS offers an alternative to sequence mapping by mass spectrometry or newly-emerging nanopore technologies that may be easily implemented using a standard AFM.
The SCFS method uses a cyclodextrin (CD)-based macrocycle tethered to the AFM probe, with the polymer to be interrogated tethered likewise to another surface and induced to form a host–guest complex with the macrocycle to form a polyrotaxane11 or molecular ring-thread complex. Rotaxanes are examples of a broad group of supramolecular complexes that can be induced to do mechanochemical work, and which can be described as ‘molecular machines’.12 This rapidly expanding group includes molecular ‘walkers’,13 shuttles and switches that can in some cases do work against significant external loads.14 The SCFS experiment has parallels with the nanopore sequencing approach that is under continuous development as a DNA sequencing tool15,16 and which has recently been shown to discriminate between different poly(ethylene glycol) (PEG) polymers on the basis of their molecular weight, with monomer resolution.17 In particular, AFM has been used to measure the forces acting on ssDNA as it slides by either a “frictionless” or a “stick-slip” mode in a nanopore.18 The most common terminology19 for describing the processes occurring in a rotaxane depicts a macrocyclic ‘bead’ shuttling between ‘stations’ on the polymer axle. In the SCFS method described here and previously,8–10 the bead is α- or β-cyclodextrin (α-CD and β-CD hereafter) and the stations are the individual monomers comprising the polymer axle, while the AFM probe supplies the unidirectional driving force for shuttling the bead between stations (hereafter ‘sliding’) under a load generated by the controlled separation of probe and sample. Fig. 1 illustrates the parallels between a conventional rotaxane system and the assembly constructed for SCFS. The concept of manipulating a rotaxane using a local force probe has been explored before: Komiyama and coworkers20 used STM (Scanning Tunnelling Microscopy) to manipulate α-CD beads forming a polyrotaxane with poly(ethylene glycol) (PEG) back and forth along the PEG axle, while Stoddart et al.21 and Leigh and Duwez14 have used AFM to measure the force required to drive a bead between two stations in a rotaxane. None of these works addressed the use of a sliding contact between a bead and a polymer as a sequencing tool. Previous analyses of the challenges to polymer sequencing by single molecule force spectroscopy (SMFS), with or without a sliding contact, have focused on DNA sequencing.22,23
We have shown previously that measurements made with the SCFS approach described here yielded excellent agreement with the predicted positions of aromatic rings substituted into PEG polymers based upon the measured molecular weights of the polymers,8 and that the CD bead could be used to unzip interactions between the polymer axle and molecules bound to specific sequences within that polymer.9 Thus the available evidence suggests that SCFS may offer a method for mapping or sequencing long, linear polymers where there are large differences between monomers or blocks, or where specific sequences are recognised by other molecules. However, the limits on the size and nature of the different monomer stations and macrocyclic beads for which differences in force may be distinguished remain undefined.
In the present work we compare the forces measured during the forced sliding of α- and β-CD beads along PEG-based polymers possessing one or more of 4 different stations representative of 2 classes of monomer: aromatic rings and glycans. As well as representing common polymers, these groups encompass a wide range of molecular cross-sectional areas, aqueous solubilities and affinities for complexation with α- and β-CDs. We apply the Friddle–Noy–de Yoreo (F–N–Y)24,25 method for analysing single molecule force spectra in order to extract the energies involved in the bead-station interactions and consider the parameters that have predictive power in determining the resolution of the SCFS sequencing approach. Finally, we consider the potential and the limits of the method for sequencing common polymers.
![]() | ||
| Fig. 2 The monomers used as stations in this study: pyromellitic acid (1), aminoaniline (2), (poly)ethylene glycol (3), guluronic acid (4) and mannuronic acid (5). | ||
Fig. 3 shows examples of force curves, the dynamic force spectrum and histogram of forces for each dataset, while the values of equilibrium force feq and free energy of sliding ΔGsl for the interactions studied are presented in Table 1. The equilibrium forces for sliding over the stations ranged from 29 to 98 pN and are always larger for the α-CD interaction than for the β-CD interaction. They occupy a comparable range to that predicted and observed for single molecule ligand–receptor unbinding events24 and intercalation into DNA.28 Correspondingly, the energies calculated for the sliding interactions (ΔGsl) are found to range from 20 to 160 kJ mol−1; equivalent to between approximately 1 and 8 hydrogen bonds (free energy of hydrogen bond in water = 23.3 kJ mol−1).29,30 In subsequent analyses presented below we use the free energy of sliding ΔGsl rather than equilibrium force feq since the latter quantity is dependent on the spring constant of the particular cantilever used,24 making direct comparison of values obtained with different cantilevers more difficult. ΔGsl is calculated from the value of feq and the spring constant of the cantilever used and is therefore directly comparable across different experiments. However, direct comparison with literature data can only be made when the spring constants of the cantilevers used for each specific data set are reported. As an example, the range of magnitudes of the forces observed in the present study is consistent with those found for the unbinding of host–guest complexes between β-CD and a range of aromatic groups,31,32 although when the equilibrium forces and spring constants reported in the first of those works are used to calculate ΔGbu by Friddle and Noy's method the values range between 13 and 87 kJ mol−1 (see ESI†). The discrepancy in values of ΔGbu calculated from the data in ref. 31 and the ΔGsl values obtained in the present work highlights the distinction between the dissociation of equilibrated host–guest complexes and the forced sliding of the host cyclodextrin ring over the guest monomers.
![]() | ||
| Fig. 3 (a) Example force curves for the interactions investigated here: from the bottom, the first two curves were collected when sliding α-CD over oligoguluronic and mannuronic acids (stations 4 and 5); the remaining curves were collected when (top two curves) α-CD and (middle two curves) β-CD were pulled over a polymer consisting of PEG (station 3) and individual monomers of aminoaniline (station 2) and pyromellitic acid (station 1). Asterisks mark the rupture points at which forces and loading rates are measured. For more details of the polymer characterisation see ref. 1. (b) Dynamic force spectra for the interactions of α-CD (filled circles) and β-CD (open circles) with each of the five stations. For stations 4 and 5, all data is for α-CD; lighter symbols are for guluronic acid and darker symbols for mannuronic acid. (c) Histograms of the most probable sliding force, equivalent to feq, the equilibrium force, for the five stations. Colours of bars follow those described for symbols in part (b). | ||
K, the binding constant; feq, the equilibrium force; kc, the cantilever spring constant; and ΔGsl, the free energy of sliding) for interactions of stations 1 to 5 with α- and β-CD
| Station | Bead | Log K, literature values |
Log K, UV valuesa |
f eq ± sd (pN) | k c (pN nm−1) | ΔGsl ± sd (kJ mol−1) |
|---|---|---|---|---|---|---|
a Values for K were calculated from UV measurements using the Benesi–Hildebrand method.
b For stations 4 and 5, the value of K is the mean of values for several pyranoses. Details of method for the calculation of log K are available in ESI.
|
||||||
| 1 | α-CD | — | 1.4 | 98 ± 8 | 25 | 159 ± 17 |
| β-CD | — | 2.81 | 86 ± 12 | 22 | 134 ± 26 | |
| 2 | α-CD | 1.41 [ref. 23] | 1.47 | 76 ± 6 | 25 | 95 ± 13 |
| β-CD | 2.05 [ref. 23] | 1.74 | 63 ± 6 | 22 | 72 ± 14 | |
| 3 | α-CD | 2.51–4.43 [ref. 24] | — | 29 ± 4; 37 ± 3 | 14; 25 | 22 ± 4; 23 ± 4 |
| β-CD | 3.03–3.17 [ref. 24] | — | 34 ± 5 | 22 | 21 ± 6 | |
| 4b | α-CD | 1.56 [ref. 25] | — | 45 ± 5 | 14 | 54 ± 12 |
| 5b | α-CD | 1.56 [ref. 25] | — | 46 ± 7 | 16 | 50 ± 15 |
Additional comparison can be made with the values predicted and observed for the sliding of a β-CD bead over single-stranded DNA: Lindsay and Williams23 predicted that the force required to drive a β-CD bead along a single strand of DNA was 75–78 pN (corresponding to 31–33 kJ mol−1 using the F–N–Y relation between feq and ΔGbu and a reported spring constant of 0.3 N m−1, see Table S2 in ESI†), depending on whether the base passed over was a purine or a pyrimidine. Therefore, no distinguishing force signature between purine and pyrimidine nucleotides would be detected above instrumental noise (∼15 pN or more). These authors proposed that this rather low value was due to the mobility of the bases around their point of attachment to the (deoxy)ribose backbone, allowing them to fold flat against the phosphate-deoxyribose backbone to pass through the CD pore. The same group subsequently published experimental data10 showing force plateaus for the sliding of β-CD along DNA somewhat larger than this value, at approximately 125 pN. We report here most probable sliding forces of 63–98 pN for differently-substituted aromatic groups and 45–46 pN for a monosaccharide (passing through α-CD), at comparable or higher loading rates (instantaneous loading rates from 400 to 8000 pN s−1).
K, where K is the binding constant for the H–G complex, derived from ΔG° as measured by Auletta et al. using ITC (Fig. 4a). We then compared values of the same binding constant K for each of the monomers used in the present work, (presented in Table 1, and derived from literature reports,33–36 and/or measured by a UV spectroscopic method, see ESI†), to ΔGsl, and find no clear relationship between ΔGsl and log
K, as depicted in Fig. 4b. The lack of dependence observed in our SCFS data, in contrast to the clear relationship observable in Auletta et al.'s data, reflects the distinction between H–G complexation and forced threading of the CD over the monomer in SCFS: in H–G complexation the geometry that favours the lowest (kinetically accessible) energy state for the guest in the host may not involve complete inclusion of that guest in the host; while in SCFS the monomer is forced to pass through the CD pore, driving the complex through energetically unfavourable transition states that constitute the largest energy barriers to the passage of the monomer through the pore. This may be expected to be most relevant when the monomer is large and rigid, as would be the case for substituted aromatic groups. We sketch the differing mechanisms and resulting energy pathways in Fig. 4c. As an illustrative example, we consider that the ΔG° of rupturing the H–G complex between aniline and β-CD, when the aniline guest has adopted the most energetically favourable configuration, was measured to be 2.3 kcal mol−1 (9.6 kJ mol−1) by ITC (and ΔGbu calculated to be 12.7 kJ mol−1 using that work's data and applying the F–N–Y formalism; see Table S2 in ESI†), while we find a ΔGsl for sliding β-CD from a PEO chain over the very similar station p-aminoaniline and on to a subsequent PEO chain to be much higher at 72 kJ mol−1 (see Table 1). Thus, since the binding constants measured or calculated for H–G complexes will not necessarily reflect the main energy barrier to sliding the same CD host over the same monomer guest, we reject using H–G binding constants as a basis for predicting the expected sliding energy.
![]() | ||
| Fig. 4 (a) Plot of the relationship between ΔGbu (calculated from the AFM data in ref. 31 using the Friddle–Noy method) and log K (derived from ITC measurements described in ref. 31) for the host–guest complexes interrogated by Auletta et al.31 (b) Plot of the relationship between ΔGsl and log K for the sliding contact experiments described in this work. (c,d) Illustrations of the unbinding processes and sketches of possible corresponding energy pathways for disrupting the H–G complex (c), where the depth of the energy well for the H–G complex binding is the principal energy barrier, and the sliding contact experiment (d), where other barriers to the passage of the guest may dominate. | ||
In order to estimate the contributions of friction and solvation to the overall energy of passing a particular monomer through the CD pore, we looked for measured or calculable parameters that reflect these two contributions. As already described, the friction component will depend in some part upon the ‘tightness of fit’ of the monomer inside the CD pore, so the ratio of cross-sectional areas of the monomer and the CD pore, called the dimensionless space-filling parameter Φ19 (calculated using the cross-sectional areas of the monomers39 and the cross-sectional area of the interior pore of the bead) and already shown to have predictive power in estimating the stability constants of host–guest complexes where CDs are the host,19,40 may be considered as a proxy for the friction component. Complexation has been observed to occur between guests and cyclodextrin hosts where values of Φ have varied between 0.9 and 1.2.19 A value of Φ greater than 1 implies that the guest is larger than the host cavity, but structural motions of the guest and host, including opening out of the α-(1 → 4) C–O–C bond between neighbouring glucose units in cyclodextrin, allow the host to accommodate larger guests. The introduction of a driving force in the form of an AFM probe may be expected to drive the accommodation and passage of even larger guests at the cost of frictional energy, and indeed the passage of single stranded DNA through β-CD, for which Φ > 3.3, has been experimentally observed.10 The calculation of Φ for stations 1 to 5 is presented in ESI,† and ranges from 0.6 to 3 (Table S3†). Likewise, the solvation component reflects the passage of the monomer from an aqueous environment, into the hydrophobic interior of the CD pore, and then back out into the aqueous phase again, so that P, the dimensionless octanol
:
water partition coefficient commonly used (in its log form) as a measure of hydrophobicity in drug design, can be used to describe the relative favourability of these two environments for a particular monomer. Values of log
P for stations 1–5 are in the range −3.3 to 0.6, corresponding to values of P between 0 and 4.
We then looked for correlations between ΔGsl, Φ and P, as shown by Fig. S8 in ESI.† Taken in isolation, Φ does not show a straightforward relationship with ΔGsl, likely due to the anomalously low values of ΔGsl for the two uronic acids, while there is a clear linear dependence of ΔGsl on P (R2 = 0.90). When we look at the dependence of ΔGsl on the sum Φ + P, we find that all datapoints collapse onto a straight line. Using the method of least squares, we can therefore equate the two eqn (1) and (2):
| ΔGsl = Wfr + Wadh | (1) |
| ΔGsl = k(Φ + P) | (2) |
| ΔGsl = kΦΦ + kPP | (3) |
This analysis produces values of 19.8 (SE = 1.4) and 25.6 (SE = 1.2) kJ mol−1 for kΦ and kP respectively. Fig. 5 shows the very close correspondence between the value of ΔGsl measured by SCFS (ΔGsl(meas)) for the series of stations passing through α- and β-CD and the value calculated from kΦ × Φ + kP × P, (ΔGsl(calc)). The data used to construct this relation encompasses large and small monomers that are hydrophilic or mildly hydrophobic (max. log
P = 0.6). For small, hydrophobic monomers such as ethylene, or for the guest molecules investigated by Auletta et al.31 the relation predicts very high energies (more than 500 kJ mol−1) due to the dependence on P rather than log
P, so clearly our empirical model is applicable over a limited range. Replicating the above analysis, replacing the dependence on P with a dependence on log
P, yields values of 52.9 kJ mol−1 for kΦ and 28.4 kJ mol−1 for klog P, which still corresponds reasonably well to ΔGsl(meas) but does predict negative energies in some cases, while providing more likely values of ΔGsl(calc) for hydrophobic monomers. Nevertheless, other monomers that fall within the model's applicable range include those constituting many important linear and short-branched heteropolymers (essentially all monosaccharides, amino acids, nucleic acids and biocompatible polyhydroxyalkanoates), some of which are not amenable to conventional sequencing methods.
![]() | ||
| Fig. 5 Plot of ΔGsl(meas)vs. ΔGsl(calc) for stations 1–5 passing through α- and β-CD. The dashed line is the fit to the line y = x. The colours of the datapoints follow the pattern used in Fig. 3. Error bars are 2 × SD of the ΔGsl(meas) data, encompassing 95% of data assuming normal distribution. | ||
Examples where this new approach may yield new information include the study of micro- and macroheterogeneity in protein glycosylation41 and the pattern of monosaccharide decoration in polysaccharides, including hemicelluloses whose structure helps determine plant cell wall recalcitrance in bioenergy applications.42,43 In both cases, the current state of the art method of analysis is mass spectrometry (MS). SCFS offers advantages over MS methods where the elucidation of sequence patterns over large distances is required. Therefore, SCFS offers the prospect of an alternative route to mapping critical post-translational modifications of proteins and a first method for mapping the pattern of sidechains in linear glycans that can be easily implemented in any standard AFM.
Samples for AFM were prepared by depositing aqueous solutions of the polymers and rotaxanes on template-stripped gold as follows: 0.4% w/w of each polymer was mixed with a 1
:
1 mole equivalent of amino-functionalized α- or β-CD for 24 hours, and deposited onto template-stripped gold from water for 24 hours.
AFM probes (MLCT silicon nitride from Veeco Instruments, Santa Barbara, CA, USA) with nominal spring constants of 10 and 20 pN nm−1 were prepared by coating under vacuum with 1 nm Cr and 10 nm Au (both from Goodfellow Corp., Berwyn, PA, USA) before incubation with 1 mM 11-11′-dithio-bis(succinimidyl undecanoate) in 1,4-dioxane for 10 minutes. Functionalised probes were used immediately or stored in an inert atmosphere.
Samples for AFM were prepared by an alternative method to that used for stations 1–3: freshly-cleaved mica was functionalised with 3-mercaptopropyl triethoxysilane (MTS) (Sigma-Aldrich) from a 2% solution in acetone (200 μL, 20 min, washed 5× water). To crosslink the amine-terminated PEG-alginate polymer to the thiol-terminated substrate, a short PEG spacer with maleimide and succinimide end groups (SM(PEG)12, Thermo Fisher Scientific) was used (100 μL of 1
:
300 dilution in water deposited on to thiol-functionalised mica for 2 h at RT or overnight at 4 °C).
AFM probes for the alginate-pseudorotaxane experiments (MLCT silicon nitride from Veeco Instruments, Santa Barbara, CA, USA) with nominal spring constants of 10 and 20 pN nm−1 were silanised with thiol-terminated alkylsilane and then further functionalised with (α-maleimido-ω-N-hydroxysuccinimide)-propylene glycol as described above. Both probe treatments resulted in probes functionalized with succinimidyl groups for in situ reaction with the amine groups on the cyclodextrins and gave comparable success rates.
P
P for each of the stations used in this work, April, 2017, https://chemicalize.com/ developed by ChemAxon (http://www.chemaxon.com).
:
guest complex. Instead ΔGsl is proportional to the sum of the values of two dimensionless, easily calculable quantities: the octanol
:
water partition coefficient P and the space-filling parameter Φ, according to the equation ΔGsl = kΦ × Φ + kP × P, where kΦ = 19.8 kJ mol−1 and kP = 25.6 kJ mol−1. Based on these results, we conclude that SCFS will detect the existence and position of branch points in glycans and glycosylated proteins and that it therefore represents a new tool to map patterns of heterogeneous branching and post-translational modifications over long sequences in glycans and proteins.
Footnote |
| † Electronic supplementary information (ESI) available: Details and results of SMFS data selection and analysis; UV data and method; calculation of Φ; table of predicted values of ΔGsl; experimental method and results of gel permeation chromatography of the PEG-uronic acid conjugate. See DOI: 10.1039/c7nr03358c |
| This journal is © The Royal Society of Chemistry 2017 |