Open Access Article
Andy I.
Nguyen
a,
Ryan K.
Spencer
b,
Christopher L.
Anderson
a and
Ronald N.
Zuckermann
*a
aMolecular Foundry, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA. E-mail: rnzuckermann@lbl.gov
bDepartment of Chemistry, Department of Chemical Engineering & Materials Science, University of California, Irvine, CA 92697, USA
First published on 15th November 2018
Synthesis of biomimetic multimetallic clusters is sought after for applications such as efficient storage of solar energy and utilization of greenhouse gases. However, synthetic efforts are hampered by a dearth of ligands that are developed for multimetallic clusters due to current limitations in rational design and organic synthesis. Peptoids, a synthetic sequence-defined oligomer, enable a biomimetic strategy to rapidly synthesize and optimize large, multifunctional ligands by structural design and combinatorial screening. Here we discover peptoid oligomers (≤7 residues) that fold into a single conformation to provide unprecedented tetra- and hexadentate chelation by carboxylates to a [Co4O4] cubane cluster. The structures of peptoid-bound cubanes were determined by 2D NMR spectroscopy, and their structures reveal key steric and side-chain-to-main chain interactions that work in concert to rigidify the peptoid ligand. This efficient ligand design strategy holds promise for creating new scaffolds for the abiotic synthesis and manipulation of multimetallic clusters.
Typical ligand designs use branching motifs with idiosyncratic synthetic routes that can take substantial amounts of time to optimize.5 The few known examples of synthetic ligands that chelate clusters depend on high symmetry, in order to simplify organic synthesis.6–9 An ideal ligand synthesis would permit full access to asymmetry, site-differentiation, and donor diversity, while also remaining easy to synthesize.
An alternative approach to ligand design is to mimic the way nature bind metals (Fig. 1). The linear and modular architecture of polypeptides is an efficient way to modulate size, shape, and function of suitable ligands. Individual metal binding moieties dispersed in a linear sequence are brought into the appropriate geometry through folding of the polypeptide chain. The folding of sequence-defined oligomers is complex, but the modular and efficient synthesis permits the use of combinatorial approaches to seek out properly folded or functional sequences. We aim to develop modular peptidomimetic systems to rapidly create and evaluate large organic ligands for metal cluster chelation. Recently, a short minimalist peptide comprised of natural and mirror-image amino acids was designed to chelate a Fe4S4 cubane, demonstrating an important first step toward non-natural metalloenzymes.10
N-Substituted glycine oligomers, also called “peptoids”, are a class of synthetic peptidomimetics with great potential for ligand design (Fig. 1).11–13 Like polypeptides, they are synthesized in an iterative fashion with their linear monomer sequence precisely defined at each elongation step. However, their synthesis via the solid-phase submonomer method is uniquely rapid and chemically diverse compared to other classes of sequence-defined polymers since the side-chain functional groups are readily introduced from easily obtainable primary amine synthons.14 Furthermore, the properties of peptoids can be efficiently evaluated by high-throughput synthesis and subsequent screening.15 Many laboratories have engineered peptoid side-chains that locally influence the backbone dihedral angles to achieve robust secondary structures such as helices and sheets.16–25 In addition, peptoids can fold into a variety of other secondary structures including ribbons and square-helices.25–29 However, using peptoids as ligands for targeted metal coordination is still has only recently gained momentum,30–39 with very few atomically-defined metallopeptoid structures. To date, most metallopeptoids have mainly been characterized by circular dichroism spectroscopy.40–42 Recently, the first examples of crystal structures of metallopeptoids were reported, however, their solution-phase characterization was limited.31 It is important to note that in cases like this where the molecules have multiple degrees of freedom and multiple pendant functional groups, crystal packing effects can yield solid-state peptoid conformations that do not correlate with their solution-phase structure.19,43 Here we address cluster chelation and peptoid folding in solution, through the design and discovery of conformationally stable peptoids that provide tetradentate to hexadentate chelation to a [Co4O4] cluster via several simple carboxylate moieties. The only previous example of peptoid–metallocluster interaction featured a lanthanide cluster capped with a peptoid monomer.44 NMR spectroscopy was used to elucidate the atomic structures of the peptoid cluster complexes in solution, and single-site mutation studies reveal several non-covalent interactions that contribute to stable peptoid folding.
The tetrametallic cobalt oxo cluster, Co4O4(OAc)4py4 (OAc = acetate, py = pyridine)45,46 (1) has garnered attention as a water splitting catalyst in the context of artificial photosynthesis and as a structural mimic for the oxygen-evolving center (OEC) in the active site of photosystem II (PSII).47–50 It also has convenient properties that make it well-suited for combinatorial and structural studies: kinetic stability, established ligand substitution chemistry,51 strong color (useful for colorimetric binding assay), and diamagnetism (useful for NMR spectroscopy). The cluster has a cubane geometry comprised of four cobalt(III) ions and four oxides. Designing a single peptoid chain that can encapsulate this cubane and determining its atomic structure should give general insight on the key elements required for metal cluster chelation, and provide inroads for synthesis of stable biomimetic “cutouts” of multimetallic enzyme cofactors and their ligands outside of the native protein environment (Fig. 1).
:
1.5
:
0.7
:
1 ratio) belonging to the formyl proton indicating that there are four conformational isomers (Fig. S6†). 1H–13C HSQC showed two distinct classes of NAla methines (1
:
1.3 ratio) (Fig. S8†). The chemical shifts of the methine, in addition to the nuclear Overhauser effect (NOE) crosspeaks, identify them as cis/trans amide isomers (ω ∼ 0 or 180, respectively) of NAla with a Kcis/trans ∼ 0.8 (Fig. S10†). There is also cis/trans isomerization from the formyl cap, giving a total of 22 conformational isomers. The ω of the second amide is undetermined, but it is either frozen in a single isomer or its conformation is coupled with that of the other residues. This distribution of isomers is consistent with the fact that in general, cis/trans isomerization of tertiary amides is nearly isoenergetic (Kcis/trans ∼ 1).23 The multiple overlapping resonances of these isomers precluded full assignment of the NMR spectrum for 1-A.
To remove the amide isomerization at each terminus, NAla was changed to Ala-OH (simulating cis-NAla since it places the 2-carboxyethyl functionality cis with respect to the carbonyl O atom) and formyl was replaced with p-toluenesulfonyl (Ts). Reaction of one and two equivalents of Ts–Ncm–Nbn–Ala–OH (H2B) with 1 produced the mono-peptoid complex, Co4O4(OAc)2(B)(py)4 (1-B), and bis-peptoid complex, Co4O4(B)2(py)4 (1-2B), respectively, demonstrating that termini modification did not significantly affect binding ability. The expected masses for 1-B and 1-2B were observed (m/z = 1238.06 and 1623.24, respectively; Fig. S11 and S13†), and their 1H NMR spectra in d4-methanol showed only one conformer (Fig. S12 and S14†). In contrast, the free peptoid shows two rotamers due to isomerization about the amide bond between Nbn and Ncm (Fig. S2†). While the spectra for 1-B and 1-2B show similar features, the higher symmetry of 1-2B allowed a more facile assignment. The NMR spectrum of 1-2B is consistent with a C2 symmetric molecule, evidenced by two sets of equally intense para-pyridine protons and a singular set of peptoid resonances. This C2 point group symmetry implies that the two chains wrap around the equatorial plane of the cubane in a “head-to-tail” or antiparallel fashion. In contrast, parallel alignment of the peptoid chains around the cubane would result in C1 symmetry that would be distinguishable by NMR spectroscopy. Methylene protons of the peptoid backbone and side-chains are diastereotopic with geminal protons separated up to 1.5 ppm (Fig. S12 and S14†), suggestive of a rigid fold. In addition to mass spectrometry and NMR data, evidence that the [Co4O4] core remains intact upon peptoid coordination comes from solution electrochemistry data, which is a known metric that is sensitive to the [Co4O4] coordination environment.51 For context, the [Co4O4] core in compound 1 exhibits a characteristic reversible redox event in the cyclic voltammogram (CV) corresponding to the [CoIII4O4]4+/[CoIVCoIII3O4]5+ couple (0.35 V vs. Fc+/Fc in methanol, Fig. S33†), and this redox couple has been previously shown to be also sensitive to the ligand environment around the cubane.51 Compound 1-2B exhibits a nearly identical feature at 0.34 V in the CV (Fig. S33†), suggesting that the cubane core is preserved and surrounded in a highly similar ligand environment to that of 1.
Multiple NOE crosspeaks were used as distance restraints for structural determination (Fig. S34, S35, S42†) by replica-exchange molecular dynamics (REMD) to obtain low energy structures consistent with the NMR data (Fig. 2c). In particular, a strong NOE crosspeak between the backbone methylene protons of Ncm and Nbn indicates a less than 3 Å separation, which is diagnostic of a cis-amide linkage between those two residues. The determined NMR structure is consistent with expectations, with the two μ2-carboxylate donors per peptoid interacting with three cobalt ions, for an overall tetradentate chelation per peptoid. Analysis of the backbone dihedral angles, phi (ϕ) and psi (ψ), gives insight into the validity of the structure (Fig. 3a). Satisfyingly, dihedral combinations (ϕ, ψ) of the backbone reside in the low energy regions of the cis-peptoid Ramachandran plot (Fig. 3b).27,54
![]() | ||
| Fig. 3 (a) The ϕ and ψ dihedrals of a peptoid. (b) Backbone dihedral combinations (ϕ, ψ) of the lowest energy NMR structure for 1-2B (triangles) overlaid on the cis-peptoid Ramachandran free energy plot.27 | ||
It is noteworthy that Ncm adopts a cis-amide, whereas PSII's Asp–Leu–Ala–OH sequence is all trans; perhaps the shorter Cα–Cα distance in a cis-amide backbone is needed to account for the longer linkage (12 atoms) between the two carboxylate groups in the peptoid (vide supra). Titration of a monodentate competitor, d4-acetic acid into 1-2B, monitored by NMR spectroscopy, gave stepwise stability constants of log(Ka1) = 2.0 ± 0.6 and log(Ka2) = 0.9 ± 0.1, and overall stability log(β) = 2.9 ± 0.7, demonstrating a stabilization from the chelate effect (Fig. S30–S32†).
Next, we attempted to extend the sequence of 1-2B to allow carboxylate binding to the third face within a single ligand. Similar to the tripeptide fragment that binds the OEC, the N-terminus of [Co4O4]-bound B is pointed outward and slightly away from the third face of the cluster such that more than one peptoid monomer would be needed to connect to the third face in a μ2-fashion. Preliminary geometry-optimized 3-dimensional computer models suggested that a three peptoid monomer chain should be long enough to link the Ncm–Nbn–Ala–OH segment to a carboxylate residue, Nce (N-2-carboxyethyl glycine), bound on the third face. It was unclear whether there would be a preference for cis or trans amides in this loop (as either appeared reasonable), and concern that a loop that is too flexible would diminish binding.
Thus, we employed a combinatorial approach for discovering classes of loop sequences that would result in high-affinity cluster-binding peptoids.32,33,55 The “split-and-pool” strategy,56 was used to synthesize a “one-bead-one-compound” library of 125 resin-bound sequences varied at the three loop positions (Scheme S2†). In this method, the ensemble of beads contains all possible sequence combinations, but a single bead contains only a homogeneous population of a unique sequence. The peptoids were covalently attached to a TentaGel macrobead support by a labile methionine linkage that can be cleaved using cyanogen bromide, which is orthogonal to the synthesis and screening conditions (see ESI† for details). The monomer basis set comprised of two hydrogen bond donors, Nae (N-2-aminoethyl glycine) and Nhe (N-2-hydroxyethyl glycine), a chiral Nrpe (N-(R)-1-phenylethyl glycine), a strictly trans-enforcing residue Nanis (N-4-methoxyphenyl glycine), and a neutral residue, sarcosine (N-methyl glycine). The peptoid library was synthesized by solid-phase synthesis, and side-chains were deprotected (without cleavage from the resin) by trifluoroacetic acid. The [Co4O4] unit was complexed onto the resin-bound peptoids by ligand exchange with 1.
Since the [Co4O4] unit is a dark green color, direct colorimetric analysis was used to identify hit sequences. Subjecting the resin-bound clusters to increasing concentrations of acetic acid at 50 °C for 20 h and selecting for the remaining green-colored beads allowed identification high-affinity sequences. As a benchmark, the resin-bound tripeptoid, Ts–Ncm–Nbn–NAla, showed complete loss of green color at 1.0 M acetic acid (Fig. S3†). While most beads in the heptameric library lost color, ∼25% of the beads retained color up to 3.0 M acetic acid (Fig. S3†). The peptoids belonging to the colored beads were cleaved and sequenced with tandem MS/MS (see ESI† for details).57 A hit sequence, Ts–Nce–Nae–Nhe–Nrpe–Ncm–Nbn–NAla, appeared multiple times and was chosen for further analysis. The hit was resynthesized as the C-terminal acid derivative, Ts–Nce–Nae–Nhe–Nrpe–Ncm–Nbn–Ala–OH (H3C), in order to prevent amide isomerization at the termini (vide supra). Note that due to trifluoroacetic acid used in HPLC purification procedures, Nae is in the ammonium form with a trifluoroacetate counterion.
Peptoid H3C reacts with 1 in a 1
:
1 ratio to form the expected complex 1-C, with expected m/z= 1669.27 (Co4O4(OAc)(C)(py)4+) and 835.14 (Co4O4(OAc)(C)(py)4H2+) (Fig. S17†). The 1H NMR spectrum of purified 1-C in d4-methanol exhibits ∼51% of a single conformation species having sharp resonances, and the remaining less folded conformer exhibiting very broad resonances (Fig. S18†). Backbone methylene protons of the single conformation species are diastereotopic with large separation consistent with rigid folding. This species also exhibits a nearly identical redox couple to that of 1 and 1-2B at 0.36 V that suggests the [Co4O4] core remains intact (Fig. S33†).
The NMR resonances were fully assigned (Fig. S21†), and many NOE crosspeaks were resolved for structure determination (Fig. S36–S38†). Notably, the appearance of medium strength NOE crosspeaks between i, i + 1 and i, i − 1 backbone methylene protons correspond to cis amides for all peptoid residues. The side-chain methylene protons of Nce and Nae are also diasterotopic, and show a J-coupling pattern consistent with a gauche relationship of the non-hydrogen substituents (Fig. S39†). While the side-chain of Nce bound to the cluster should adopt a rigid rotational state, the rigidity of the Nae side-chain is more surprising and suggests that it may be involved in a strong non-covalent interaction. Several crosspeaks between the loop residues and the pyridine ligands of the cluster are also observed. NMR-constrained REMD was used to elucidate the structure of 1-C (Fig. 4 and S43†). All backbone dihedral angles fall into the stable regions of the peptoid Ramachandran plot (Fig. 5). Travelling from the C to N terminus, the chain begins to dip far below the equatorial plane of the cubane after Ncm. The chiral Nrpe4 residue initiates a turn, directing the backbone, via sterics, to steer around the corner of the cubane (Fig. 6a). Nhe3 adopts similar dihedrals as Nrpe4, beginning a partial helical stretch that exits back up to the cubane equatorial plane at Nae2. The chirality of the backbone is reversed at Nae2, creating a ∼90° bend that positions the chain parallel with the cubane face, allowing Nce1 to coordinate the cubane. The backbone can sample both positive and negative stable (ϕ, ψ) regions due to the prochiral nature of peptoids (Fig. 5). This flexibility seems advantageous in allowing such a short oligomer to “freely” mold into the most suitable fold. The cis-amides of the backbone are reinforced in two ways – (1) Nae forms a 7-membered H-bond to the i − 1 carbonyl (average N⋯O distance of 3.2 ± 0.3 Å, average N–H–O angle of 122.3° ± 13.6°), and (2) the steric bulk of Nrpe favors a cis-amide in order to minimize allylic strain (Fig. 6a).
![]() | ||
| Fig. 5 Backbone dihedral combinations (ϕ, ψ) of the lowest energy NMR structure for 1-C (triangles) overlaid on the cis-peptoid Ramachandran free energy plot.27 The residue numbers are indicated. | ||
Single mutation of each of the three loop residues to sarcosine,58 analogous to alanine scanning59 in peptides, was used to probe the structure–function relationship of each residue (Fig. 6b). Liquid chromatography mass spectrometry (LCMS) of the crude reaction mixture of 1 with H3C exhibited a large major peak corresponding to the desired complex, 1-C. Removing the cationic H-bond donor via a Nae2Sar mutation was highly detrimental, producing significant amounts of side products, including partially chelated and a dimeric cluster complex. Even a conservative mutation to an isosteric, weaker H-bond donor, Nae2Nhe, yielded similar side products. However, chelation ability is unaffected by an Nhe3Sar mutation. NMR characterization of the purified cluster-bound Nhe3Sar mutant (1-C-Nhe3Sar) shows a conformation with many characteristic NOE features of 1-C, indicating an identical fold (Fig. S24, S27, S40, S41†). This suggests that the third residue may be useful as a site for future substitution or conjugation. An Nrpe4Sar mutation does not produce as many side products, but gives two identical-mass species having different retention times, suggestive of isomers (possibly isomers about the ω or Φ dihedral angle between residues 3–4). The results of this sarcosine scan are in line with the NMR structure of 1-C – the H-bond from Nae and steric constraints of Nrpe are critical in decreasing backbone flexibility leading to better binding.
A New Design Paradigm for the Construction of Discretely Folded Peptoid Structures, J. Am. Chem. Soc., 2006, 128, 14378–14387 CrossRef CAS PubMed.
A, Nature, 2011, 473, 55–60 CrossRef CAS PubMed.Footnote |
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/c8sc04240c |
| This journal is © The Royal Society of Chemistry 2018 |