Yang
Shen‡
*a,
Ioanna
Kalograiaki‡
bc,
Alessio
Prunotto
d,
Matthew
Dunne
a,
Samy
Boulos
e,
Nicholas M. I.
Taylor
f,
Eric T.
Sumrall
a,
Marcel R.
Eugster
a,
Rebecca
Martin
a,
Alicia
Julian-Rodero
a,
Benjamin
Gerber
a,
Petr G.
Leiman
g,
Margarita
Menéndez
ch,
Matteo Dal
Peraro
d,
Francisco Javier
Cañada
bc and
Martin J.
Loessner
a
aInstitute of Food, Nutrition and Health, ETH Zurich, Schmelzbergstrasse 7, 8092 Zurich, Switzerland. E-mail: yang.shen@hest.ethz.ch
bCentro de Investigaciones Biológicas, Margarita Salas, Consejo Superior de Investigaciones Científicas, Ramiro de Maeztu 9, 28040 Madrid, Spain
cCentro de Investigación Biomédica en Red-Enfermedades Respiratorias (CIBERES), Avenida de Monforte de Lemos 3-5, 28029 Madrid, Spain
dLaboratory for Biomolecular Modeling, EPFL, IBI-SV Station 19, 1015 Lausanne, Switzerland
eInstitute of Food, Nutrition and Health, ETH Zurich, Schmelzbergstrasse 9, 8092 Zurich, Switzerland
fStructural Biology of Molecular Machines Group, Protein Structure & Function Programme, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Blegdamsvej 3B, Copenhagen 2200, Denmark
gUniversity of Texas Medical Branch, Department of Biochemistry and Molecular Biology, Sealy Center for Structural Biology and Molecular Biophysics, 301 University Blvd, Galveston, TX 77555-0647, USA
hInstituto de Química-Física Rocasolano, Consejo Superior de Investigaciones Cientificas, Serrano 119, 28006 Madrid, Spain
First published on 23rd October 2020
Endolysins are bacteriophage-encoded peptidoglycan hydrolases targeting the cell wall of host bacteria via their cell wall-binding domains (CBDs). The molecular basis for selective recognition of surface carbohydrate ligands by CBDs remains elusive. Here, we describe, in atomic detail, the interaction between the Listeria phage endolysin domain CBD500 and its cell wall teichoic acid (WTA) ligands. We show that 3′O-acetylated GlcNAc residues integrated into the WTA polymer chain are the key epitope recognized by a CBD binding cavity located at the interface of tandem copies of beta-barrel, pseudo-symmetric SH3b-like repeats. This cavity consists of multiple aromatic residues making extensive interactions with two GlcNAc acetyl groups via hydrogen bonds and van der Waals contacts, while permitting the docking of the diastereomorphic ligands. Our multidisciplinary approach tackled an extremely challenging protein–glycopolymer complex and delineated a previously unknown recognition mechanism by which a phage endolysin specifically recognizes and targets WTA, suggesting an adaptable model for regulation of endolysin specificity.
The vast majority of endolysins from bacteriophages infecting Gram-positive bacteria possess modular architectures, encompassed in at least one enzymatically-active domain (EAD) and one cell wall-binding domain (CBD).7 The N-terminal EAD is responsible for cleaving a particular peptidoglycan (PG) bond, and the C-terminal CBD usually holds an independent regulatory function and determines the lytic spectrum upon targeting specific cell-wall elements distributed in a genus-, species- or strain-specific manner. The binding ligands for CBDs are usually of peptidoglycan or carbohydrate nature.8–10 Given their binding specificity, endolysins and in particular the CBDs constitute highly attractive scaffolds for antibacterial protein engineering.11,12
Listeria phage A500 encodes the endolysin Ply500, which comprises L-alanoyl-D-glutamate endopeptidase activity against Listeria spp. serovars 4, 5, and 6.10 Notably, Ply500 is able to rapidly lyse all serovar 4b strains, which are responsible for the vast majority of listeriosis outbreaks,13,14 rendering Ply500 an appealing anti-Listeria agent to use in food safety applications.15–17 Ply500 displays a modular architecture comprising largely independent CBD and EAD moieties connected via a short and flexible linker peptide (Fig. 1a).18 Although both modules seem to be required for full activity of this endolysin on its natural murein substrate,10,18 the isolated C-terminal domain (termed CBD500) binds live bacteria cells,10,19 as well as to purified wall teichoic acid (WTA)20 with high affinity.
Fig. 1 Correlation of Listeria WTA structure with the bacteriolytic activity of Ply500 endolysin and the binding activity of its C-terminal domain CBD500. (a) Modular architecture of the full-length Ply500. (b) Lytic activity of Ply500 detected by spot-on-lawn assay (upper panel; scale bar 300 μm) and corresponding types of WTA reported in the genus Listeria.20 Images are representative of three experiments, and contrast was adjusted for clarity. (c) Relative abundance (%) of the repeating unit variants within Listeria WTAs derived from strain WSLC1042 (WTA1042), determined by UPLC-MS analysis. Latin numerals (I–VI) denote the repeating unit species of WTA1042 identified. Data represent means ± SEM; n = 3. (d) Binding of GFP-CBD500 to wild-type WSLC1042 and mutants lacking specific substituents in the backbone of the WTA repeating unit. Scale bar represents 1 μm. Images are representative of three experiments, and contrast was adjusted for clarity. (e) Reduction of bacterial colony counts (Δlog10CFU ml−1) after 1 h incubation of WSLC1042 and selected WTA mutants in the presence of 100 μg ml−1 of Ply500. Results are the mean ± SEM of three experiments. Statistical significance was assessed using one-way ANOVA test (****p < 0.0001; ns: not significant). |
In Listeria, WTA polymers account for up to 60% of the dry weight of the cell-wall, and are covalently bound to N-acetylmuramic acid (MurNAc) of the PG by a phosphodiester bond through a conserved linkage unit.21 The WTA chains extend beyond the PG layer, comprising 20 to 40 repeating units that are highly variable due to further glycosidic substitutions or modifications.22 Depending on their backbone units, two types of WTA exist (Fig. 1b): type I WTAs possess poly-[5)-Rbo-1-P-(O→]n chains of ribitol phosphate (Rbo-P) units, while type II WTAs feature repeating units of GlcNAc-Rbo-P, following the pattern [4)-GlcNAc-(β1 → 2/4)-Rbo-1-P-(O→]n. WTAs represent the O-antigen determinants in Listeria spp., and confer serovar diversity along with some H-antigens.23 We previously demonstrated that fluorescently labelled CBDs with distinct recognition and binding pattern could differentiate Listeria cells down to the serovar level.24,25 In particular, CBD500 was shown to specifically decorate Listeria strains carrying type II WTAs equipped with 3′O-acetylated GlcNAc, while tolerating further hexose substitution of the integrated GlcNAc moiety (Fig. 1b).20 Yet, the underlying recognition mechanism remains elusive with respect to polymer length, GlcNAc/ribitol connectivity, inherent flexibility and microheterogeneity (due to non-uniform, variable decoration of the repeating units). Thus, the attainment of an atomic-scale model required the combination of multiplex ligand- and receptor-based methods.
We used a state-of-the-art multidisciplinary approach to elucidate the mechanism underpinning the recognition of Listeria WTAs by CBD500 from both the ligand and protein perspectives. Employing X-ray crystallography, we solved the structure of CBD500 to 1.6 Å resolution, which comprises two copies of beta-barrel SH3b-like repeats held together by means of swapped beta-strands. Saturation Transfer Difference (STD) NMR26 studies of CBD500 in complex with full length WTAs derived from a selected panel of Listeria strains and mutants, highlighted the prerequisite of GlcNAc 3′O-acetylation for interaction with CBD500. “Blind” molecular docking on the solved structure suggested three putative binding pockets whose involvement in binding was furtherly assessed by site-directed mutagenesis and functionality assays. STD NMR-driven molecular dynamics simulations and isothermal titration calorimetry (ITC) aided to model the interaction, which is driven by hydrogen bond networks and van der Waals contacts. For the first time, we provide the atomic evidence that swapped SH3b-like repeats found in a Listeria phage endolysin permit the selective recognition of carbohydrate ligands,10,27 instead of PG-related peptides.28–30 This pioneering study fosters the existing knowledge on WTA recognition by endolysins and provides evidence on the diversification of the SH3b domain folding and binding site.
Next, we sought to validate bacterial binding using CBD500 alone and extracted WTA polymers. As such, we immobilized CBD500 onto a surface plasmon resonance (SPR) chip, and used wild type and mutant WTA as the analytes. The SPR analysis revealed that the association of WTA with CBD500 is abolished for WTA lacking the O-acetylation (ESI Fig. 1c†). Likewise, binding of CBD500 to Listeria ivanovii WSLC 3009 (SV 5) was also prevented upon loss of GlcNAc 3′O-acetylation (ESI Fig. 1d and e†). In summary, these results indicate that CBD500 specifically recognizes O-acetylated WTA as its ligand.
Fig. 2 WTA1042 and WTA1020 epitopes recognized by CBD500. (a) Representative Saturation Transfer Difference (STD) NMR (red, scaled 16×) and off-resonance (black) spectra of WTA1042 and WTA1020 recorded in presence of CBD500 upon irradiation at 7.1 and 100 ppm, respectively (1:25 CBD:WTA mixtures containing 30 μM CBD500 and approx. 0.75 mM WTA repeating units). Select STD signals are highlighted. Blue arrows point to O-acetylated GlcNAc protons and orange arrows to GlcNAc protons, bearing –OH or –OGlc at C3′, (inserts correspond to zoomed H3′ signals of the 3′-O-acetylated units). (b) STD epitopes for WTA1042-I, WTA1042-III and WTA1020O-acetylated GlcNAc moieties as derived from at least three independent experiments; blue and yellow spheres indicate higher and lower saturation transfer intensities. (c) Representative STD (red, scaled 16×) and off-resonance (black) spectra of CBD500–WTA1042 complexes where WTA were derived from Lm 1042 mutants ΔoatT ΔgttA, ΔgltB and ΔgalU as indicated; reference STD spectra acquired in absence of CBD500 are also depicted for all mutants in purple (ligand-only control, same scale as STD in presence of protein). Select STD signals are highlighted and absolute STD values are given for the methyl-group protons of relevance. All spectra were acquired as above and the chemical shifts are measured in ppm. Detailed information on STD absolute and normalized values is summarized in ESI Tables 3 and 4.† |
To establish the minimum WTA epitope required for CBD500–WTA interaction and investigate the relevant contributions of 3′O-acetylation or hexose (3′-OGlc or 6′-OGal) substitutions of the GlcNAc moiety, STD-NMR experiments were also performed using WTA isolated from defined WTA mutants (Fig. 1d). Lack of 3′O-acetylation completely abolished recognition by CBD500 (1042ΔoatT, Fig. 2c), in line with the previous observations (ESI Fig. 1c†). Indeed, the absolute STD value measured for the GlcNAc N-Ac-methyl protons was equivalent to the non-specific ligand saturation observed in absence of CBD500 (reference STD, Fig. 2c). For all three variant WTAs derived from the 1042ΔgttA, ΔgltB or ΔgalU mutants, magnetization transfer values were of similar intensity (ESI Table 4†), suggesting that 3′O-Glc and 6′O-Gal modifications are not essential for CBD500 recognition. Overall, findings obtained by NMR established 3′OAc-GlcNAc moiety as the key epitope for CBD500, and, in the case of WTA1042, indicate that Gal substitution at position 6′ is well tolerated.
Fig. 3 Structural features of CBD500. (a) Cartoon representation of the proximal (p; green, Cα atoms 153–164 and 238–289) and distal (d; cyan, Cα atoms 165–237) SH3b-like repeats of CBD500 (number corresponds to the full length Ply500). The β-strands β1p and β8d (magenta) are structurally swapped between the two repeats, thus corresponding to β1′d and β1′p, respectively. Loops connecting β2p-β3p and β2d-β3d closely resemble the ligand-binding RT loops44 of canonical SH3 domains (red). (b) Topology diagram of CBD500 with secondary structure elements shown as cylinders for α-helices and arrows for β-strands. (c) Wire representation of CBD500 and CBDPSA (PDB ID 1XOV, Cα atoms 180–314) structures upon alignment (RMSD of 0.39 Å for 120 Cα atoms). (d) Superposition of the pseudo-symmetric (proximal and distal) repeats of CBD500 (RMSD of 1.23 Å, for 34 Cα atoms). (e) Superposition of the distal repeat of CBD500 (cyan) and the single SH3b domain of lysostaphin calculated by the DALI server37 (PDB ID 5LEO; RMSD of 1.7 Å, Cα atoms 401–493, orange) with the RT loops colored red. |
A DALI search37 of the PDB with full-length CBD500 identified the CBD of PlyPSA27 as a structural homolog (RMSD 0.9 Å for 135 residues), which features the same structurally swapped β-strands connecting its SH3b repeats (Fig. 3c). Interestingly, as individual repeats, the proximal and distal subdomains of CBD500 feature high structural similarity (Fig. 3d), despite only 17% sequence identity. Both domains also present high structural similarity to staphylococcal bacteriocin CBDs consisting of single SH3b domains, such as ALE-1 (PDB ID 1R77; RMSD 2.0 Å for 54 Cα atoms)38 and lysostaphin (PDB ID 5LEO; RMSD 2.1 Å for 58 Cα atoms),35 including similar positioning of their individual RT loops (Fig. 3e).
The molecular weight, chain integrity, and conservation of the protein fold for all GFP-CBD500 mutants were confirmed by SDS-PAGE, analytic gel filtration, and circular dichroism spectroscopy, respectively (ESI Fig. 3a–c†). Only in the case of N248A a size reduction was observed, consistent with a 20 aa truncation at C-terminus identified by MS peptide mapping (data not shown). Another mutant (N248D) did not reveal any sign of truncation, yet still exhibited abolished binding (Fig. 3b). Interestingly, the double mutant (K246A_N248A) was able to restore the binding, albeit not to WT level. Furthermore, to reduce the influence of GFP tag, we examined the conformation of several non-tagged CBD500 mutants by analytical gel filtration. The elution profiles of all the mutants were similar to that of the WT (ESI Fig. 3d and e†).
We next tested if the lack of cell wall recognition by select mutants (W198A, W242A, N248A, and W254A) had an effect on the lytic ability of full-length Ply500 endolysin. Ply500 mutants were purified and tested against purified cell wall substrate (ESI Fig. S3f and g†). As expected, all Ply500 mutants bearing an inactivated CBD showed significantly reduced activity,18 confirming the importance of host recognition by the CBD and of the residues within cavity 1 for modulating the activity of Ply500.
Collectively, the site-directed mutagenesis experiments identified a tweezer-like site in the cavity 1 that possesses a distinctive patch of several hydrophobic and polar residues adjacent to the two RT-loops (Fig. 4c).
For the WTA1020 monomer, the CORCEMA-ST analysis pointed towards a single docking conformation for β-GlcNAc-(1 → 2)-Rbo in line with experimental STD values (Fig. 2a, ESI Table 3†). This docking output was used as input for MD simulations as described in the Methods section.† Throughout the simulation the monomeric ligand remained strongly bound at the interface of the RT loops, while the loops and the overall protein structure did not suffer notable distortions (Fig. 5a), therefore advocating for the establishment of a single WTA1020 repeating unit as the minimal epitope recognized by CBD500. The complete MD 5000-frame trajectory of WTA1020 was subjected to CORCEMA-ST analysis. The profile of the mean theoretical STD values was found to be analogous to the absolute STD experimental values (Fig. 5b), further supporting the use of a single repeating unit in the complex modelling. Protons that received the highest saturation transfer were the methyl protons of O- and N-acetyl groups of 3′OAc-GlcNAc moiety, followed by protons H3′ and H5′ of the sugar ring (Fig. 5b). The MD output was screened for experimentally conformist poses using CORCEMA-ST, yielding two representative monomeric structures for which the calculated absolute STD profile of the sugar ring protons reproduced experimental mean values (Fig. 5b). The position of the sugar ring remains unchanged at the binding cavity surrounded by several tryptophan residues (Fig. 5c). Both selected structures, belonging to the same and most populated group generated by geographical clustering (see Methods section†), were found to occupy a position close to the centroid of the cluster (Fig. 5d). Even considering with high confidence the GlcNAc ring anchored through the two acetyl groups, the Rbo protons exhibit significant motility throughout the MD (Fig. 5b and c), which could be attributed at least in part to the inherent flexibility of Rbo, as expected for a monomeric ligand with open edges.
Fig. 5 STD NMR-driven MD simulations and structure selection for complex modeling. (a) Time evolution of the complex of CBD500 with the 3′OAc-bearing WTA1020 repeating unit by MD simulations. Representative time-frames (top) and root mean square deviation of the monomeric ligand (bottom). The WTA monomer is represented in sticks, and the CBD500 loops that conform the binding pocket are colored green. (b) Absolute STD values per proton for the recognized WTA1020 repeating unit. The mean ± SD values per proton for the 5000 structures (80 ps intervals), extracted from the MD trajectory (orange) and calculated using CORCEMA, is plotted along with the average experimental values from three independent STD-NMR experiments (blue) and the calculated values for the two structures selected on the basis of best correlation to the experimental results (grey). (c) Superposition of the two selected structures for the CBD500:WTA1020 monomer complex. The ligand and relevant Trp residues for reference are represented in sticks and the CBD500 binding pocket in the surface hydrophobicity mode (scale as in Fig. 4, panel a). (d) Eigenvalues projection map of the 5000 structures (80 ps intervals) extracted from the 400 ns MD trajectory of WTA1020. The two identified clusters are represented in red and violet respectively, and the black dots represent their centroids. The yellow dots represent the complex structures that were selected on the basis of correlation with experimental results. (e and f) Superposition of the two selected structures for the CBD500:WTA1042-I and CBD500:WTA1042-III complexes (mode A), respectively. Ligand and protein representation as in (c). |
Docking of WTA1042 I and III monomers onto CBD500 yielded two feasible solutions (modes A and B) per complex. Notably, they were up-down opposite orientations (ESI Fig. 4a and b†), where both acetyl groups of 2′-NAc and 3′-OAc of the GlcNAc exchange their relative positions, while still pointing inside the cavity, thus resulting in an inverted orientation of the polymer chain. MD simulations were run for mode A, with the equivalent orientation to the WTA1020 monomer pose. Although the ligand remained strongly bound throughout the MD trajectory, the mean theoretical STD profile for both WTA1042 monomers did not totally fit with the experimental values obtained with the WT polymer (ESI Fig. 4c and d†). While the calculation predicts the highest saturations for both, N- and O-acetyl proton groups, larger deviations originate from the calculated values for H4′ and H5′ protons in the opposite faces of GlcNAc pyranose ring, in contrast with MD results of WTA1020. As done for WTA1020, two representative structures for each monomer of WTA1042 I and III were selected after the clustering analysis. After superimposition into the binding site of CBD500, increased motility was observed for both WTA1042 monomers compared to that of WTA1020 (Fig. 5e and f). In fact, the two solutions of WTA1042-I showed a considerable shift of the GlcNAc ring, which was significantly reduced by the presence of Gal at position 6′ in WTA1042-III. In the context of a polymeric chain, the longer monomer chain and increased rotational freedom provided by ribitol in βGlcNAc-(1 → 4)-Rbo backboned WTAs could also account for a differential response in their recognition by CBD500, compared to β(1 → 2) in WTA1020.
Fig. 6 Binding thermodynamics of CBD500 variants to WTA1020 and WTA1042 and models of complexes. (a) ITC titration curves of wild-type GFP-CBD500 with WTA1042 (left panels) and WTA1020 (right panels). Top panels show the raw data, and bottom panels depict the dependence of the heat evolved per mol of ligand injected with various ligand/protein molar ratios. Solid lines correspond to the theoretical fit of the experimental data (■). See Methods† for details. (b) Thermodynamic dissection of the binding energetics (ΔG = ΔH − TΔS) of wild-type GFP-CBD500 and its active-binding mutants to WTA1042 (left) and WTA1020 (right). Black bars, ΔG; white bars, ΔH; grey bars, TΔS. (c) Feasible key non hydrogen-bonded contacts of CBD500 with 3′OAc-βGlcNAc-Rbo-PO4 as derived from the MD analyses (1020 in left panel, 1042-I middle panel and 1042-III right panel). Contacts below 4 Å are indicated schematically; the amino-acid residue (in orange) contacts are represented by a purple arc with spokes radiating towards the ligand atoms they contact. The contacted ligand atoms are shown with purple spokes radiating back. |
Constructs | Binding to Listeriaa | N | K b (M−1) | −ΔH (kcal mol−1) | −ΔG (kcal mol−1) | TΔS (kcal mol−1) |
---|---|---|---|---|---|---|
a Binding was visualized by fluorescence microscopy using the GFP-CBD500 construct and its variants: ‘+’, strong binding; ‘(+)’, weak binding; ‘—’, no binding. b N is the number of CBD molecules bound per WTA chain, Kb, is the binding constant per monomer of CBD bound to WTA as measured by ITC, and ΔG (−RTlnKb), ΔH and ΔS represent the corresponding thermodynamic parameters. c ITC measured with CBD500 mutants. d ITC measured with CBD500 and GFP-CBD500 mutants. | ||||||
WT | + | 1.88 ± 0.03 | (3.5 ± 0.2) × 104 | 9.5 ± 0.1 | 6.20 ± 0.03 | −3.3 ± 0.1 |
K162A | + | 2.1 ± 0.2 | (2.8 ± 0.4) × 104 | 6.7 ± 0.2 | 6.07 ± 0.08 | −0.6 ± 0.3 |
Y197A | (+) | 1.8 ± 0.1 | (3.3 ± 0.4) × 104 | 4.3 ± 0.2 | 6.16 ± 0.07 | 1.9 ± 0.3 |
Y211A | (+) | 2.2 ± 0.2 | (2.5 ± 0.2) × 104 | 3.0 ± 0.2 | 6.00 ± 0.05 | 3.0 ± 0.25 |
K246A | + | 2.0 ± 0.1 | (2.3 ± 0.2) × 104 | 10.5 ± 0.2 | 5.95 ± 0.05 | −4.55 ± 0.45 |
S249A | (+) | 1.5 ± 0.1 | (3.6 ± 0.5) × 104 | 8.2 ± 0.2 | 6.30 ± 0.03 | −1.9 ± 0.2 |
K251D | + | 1.0 ± 0.1 | (3.5 ± 0.5) × 104 | 8 ± 1 | 6.20 ± 0.08 | −1.8 ± 1 |
K289A | + | 2.0 ± 0.2 | (4.8 ± 0.7) × 104 | 5.7 ± 0.2 | 6.39 ± 0.09 | 0.7 ± 0.3 |
K246A_N248A | + | 2.3 ± 0.5 | (2.9 ± 0.3) × 104 | 8.4 ± 0.3 | 6.09 ± 0.06 | −2.3 ± 0.4 |
I252A | (+) | Protein aggregation prevented quantification of ITC data | ||||
W254A | — | Extremely weak affinity not quantifiable by ITC | ||||
F175Ac | — | No binding detected by ITC | ||||
W198A | — | Extremely weak affinity not quantifiable by ITC | ||||
W242Ac | — | No binding detected by ITC | ||||
N248Ad | — | Extremely weak affinity not quantifiable by ITC | ||||
GFP | — | No binding detected by ITC |
Apart from the aforementioned key residues, additional amino acids participated in an intricate H bond network (ESI Fig. 5b†, right panel), such as Ser249, whose main chain gripped either GlcNAc:O2N (WTA1042) or Rbo:HO-5 (WTA1020). The neighboring amino acid, Gly250, frequently captured Rbo:O5 in all repeating units, yet atom distance showed high deviation (ESI Table 7†), probably due to the Rbo inherent flexibility. Taken together, the prevalent establishment of a distinct H-bonding pattern in WTA1042 compared to WTA1020 advocates for a distinct, comprehensive CBD500 recognition mode in a global perspective.
In addition to H bonding, van der Waals interactions41 and carbohydrate–aromatic interactions42 were expected to be particularly relevant in the formation of CBD500-complex (Fig. 6c), thanks to methyl-protons sets of C–H moieties and the GlcNAc pyranose ring on the ground of a binding site with high density of aromatic residues. Interestingly, Tyr197 appeared as a good candidate to stack with GlcNAc pyranose ring in some docking poses. However, the interaction was not maintained throughout the 400 ns MD considering the distance and the angle between aromatic and pyranose rings. Alanine scanning indicated that Tyr197 switching to Ala did not induce significant affinity loss when the Y197A mutant was titrated with WTA1042 (Table 1), despite some alteration in thermodynamic behavior of the complex. Furthermore, the quantified fluorescent binding of Y197A mutant labelled with GFP to the Listeria cells was in the same range as for the WT (Fig. 4b). Altogether, these data suggest that pyranose-aromatic geometrical stacking does not play an essential role in the binding. On the other side, MD simulations with the WTA1020 monomer also indicated the establishment of van der Waals contacts between the 3′OAc-methyl group of the ligand and the side chain of Asn248, whose loss might also alter the interactions mediated by the Ser249 main chain in the N248A mutant. Phe175, buried in the bottom of the binding groove, was not found to interact directly with WTA1042 as with WTA1020, but it may play an important structural role through pi coordination of aromatic rings. Therefore, our data suggest that both H bonding and van der Waals interactions greatly contribute to the specificity of the carbohydrate-binding CBD500, allowing for ligand selection with structural heterogeneity.
However, structural studies of WTAs are extremely challenging due to their inherent microheterogeneity and flexibility. Yet, the here described multidisciplinary strategy allows to pinpoint the mechanism by which CBD500 specifically recognizes the 3′OAc-GlcNAc moiety in the context of a Rbo backbone via a surface-exposed cavity located at the interface of SH3b-like repeats. This cavity comprises multiple aromatic residues that are commonly involved in protein–carbohydrate recognition,43 despite no direct pyranose-aromatic stacking was identified. The residues located at the cavity make extensive interactions with the two acetyl groups via hydrogen bonds and van der Waals contacts, while permitting the docking of the WTA ligand regardless of the diastereomorphic connectivity (C2 vs. C4) on Rbo and hexose substitution.
We proposed a model for the CBD500–WTA complex (ESI Fig. S6a†), which is new and distinct from current knowledge of ligand interaction with other SH3b-like domains. For instance, the single SH3b of lysostaphin has been shown to recognize the stem peptide and pentaglycine bridge of PG using a two binding-site mechanism9 (ESI Fig. S6b†). The CBD of Bacillus endolysin LysPBC1530 was shown to bind the peptidoglycan through separate binding sites located on opposite sides of its SH3b domains (ESI Fig. S6c†). Similarly, the CBD of Clostridium endolysin PhiSM10129 features putative peptide-binding sites facing in opposite directions of ‘pseudo dimer’-like SH3b repeats (ESI Fig. S6d†). This bivalent mode of binding is in strong contrast to the single binding site of CBD500 at the interface between the pseudo-symmetric SH3b repeats. Based on these observations, it appears that the SH3b domains are not homogenous, but have evolved to produce diverse binding sites for recognition of different peptide and carbohydrate ligands. In addition, we found that the SH3b repeats of CBD500 are joined together through β-strand swapping, while the SH3b repeats of both Psm and LysPBC5 fold separately and connect in a more conservative fashion via a short linker. The difference in topology may also contribute to the different mechanisms of ligand binding.
The RT loops appear to be fundamental for binding by SH3 and SH3b domains. In eukaryotic systems, SH3 domains feature a canonical binding site involving the region between the RT and Src loops, which typically binds through protein–protein interactions.44 In fact, this closely resembles the properties of CBD500 and the lysostaphin SH3b domain that interacts with PG through their RT loops.35 Despite most Listeria phage CBDs feature similar domain architectures, some of them, e.g., CBDP35 and CBD025, appear to recognize different types of WTA polymers.19,20 Further research is required to dissect this subtle structural disparity and understand how phage endolysins evolved diverse binding strategies to different ligands at the atomic level. Interestingly, three known Listeria virulence factors, InlB, autolysin-like IspC and P60 also feature multiple SH3b repeats at their C-terminus, which have been demonstrated to adhere to glycosyl-polyol teichoic acids.31,45,46 Our data may provide guidance on the target specificity and support future studies of the role of other SH3b-like repeats in virulence and immune defense.
In addition to SH3b, several cell wall-binding modules and their recognizing ligands have been described in structural detail. For instance, the multimodular LysM motifs bind to the PG disaccharide β-N-acetylmuramic acid (1 → 4)-β-N-acetylglucosamine in a ‘beads on a string’ fashion,47 or form a binding groove through the intrachain dimerization when interacting with N-acetylglucosamine-containing chitin.48 Intriguingly, the molecular diversity of LysM is reminiscent of what we found in diverse SH3b-like domains. The three tandem α-helical repeats of Cpl-7 that recognize the PG MurNAc-L-Ala-D-isoGln muropeptide;8 the CBDs of Bacillus phage endolysin PlyL and PlyG that binds to the secondary cell wall polysaccharides;49 and the choline binding repeats described in Streptococcus pneumoniae PG hydrolases that interact with the phosphorylcholine residues on teichoic acids through a left-handed β-solenoid domain.50 These variable cell-wall binding modules provide a pool of scaffolds that may be engineered as molecular tools or payloads to target specific bacteria for diagnostic and therapeutic applications.
O-Acetylation of the MurNAc residues in PG is known to modulate cell wall biosynthesis in S. pneumoniae.51 In a large number of Gram-positive pathogens, this peptidoglycan modification has been shown to confer resistance to lysozyme, a key effector of the innate immunity.52 However, the presence of O-acetylation on other cell wall components has only been uncovered recently, such as in Listeria WTA,20,53Mesorhizobium loti LPS,54 and the GlcNAc residue of Lactobacillus plantarum PG.55 These findings underline the important roles of O-acetylation in modulation of autolysin activity, interaction with protozoans, and even virulence.31 Interestingly, O-acetylation of other bacterial surface polysaccharides has also been identified as essential modifications for antibody recognition, and thus needs to be considered in vaccine development.56,57 Here, we identified another eminent function of O-acetylation of WTA in the recognition by phage endolysins, which are important for phage–bacteria interaction. Surprisingly, the binding affinity (ca. KD = 10−4 M measured in solution by ITC analysis) appears significantly lower than the previously reported nanomolar affinities determined by immobilizing whole cells on the SPR surfaces.10 It is conceivable that weaker local biomolecular interaction may allow the endolysins to better diffuse within the cell wall based on higher dissociation and re-association, and results in an increased avidity through WTA multivalent presentation at cell surface, thereby potentiating the hydrolysis of PG bonds. An intriguing hypothesis is that the mutual adaptability between the binding site and the ligand not only determines the binding specificity, but also boosts the activity of endolysin. The atomic insights and dynamic structures shown in this study will pave new ways towards research in this direction.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d0sc04394j |
‡ These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2021 |