Mathilde
Briday
a,
François
Hallé
b,
Lauriane
Lecoq
*a,
Sylvie
Radix
b,
Juliette
Martin
a,
Roland
Montserret
a,
Marie
Dujardin
a,
Marie-Laure
Fogeron
a,
Michael
Nassal
c,
Beat H.
Meier
d,
Thierry
Lomberget
*b and
Anja
Böckmann
*a
aMolecular Microbiology and Structural Biochemistry (MMSB) UMR 5086 CNRS/Université de Lyon, Labex Ecofect, 7 Passage du Vercors, 69367 Lyon, France. E-mail: lauriane.lecoq@ibcp.fr; anja.bockmann@ibcp.fr
bUniversité de Lyon, Université Lyon 1, CNRS UMR 5246 Institut de Chimie et Biochimie Moléculaires et Supramoléculaires (ICBMS), Faculté de Pharmacie-ISPB, 8 Avenue Rockefeller, FR-69373 Lyon Cedex 08, France. E-mail: thierry.lomberget@univ-lyon1.fr
cDepartment of Medicine II/Molecular Biology, University Hospital Freiburg, Medical Center, University of Freiburg, Freiburg 79106, Germany
dPhysical Chemistry, ETH Zürich, 8093 Zürich, Switzerland
First published on 7th July 2022
Hepatitis B virus (HBV) is a small enveloped retrotranscribing DNA virus and an important human pathogen. Its capsid-forming core protein (Cp) features a hydrophobic pocket proposed to be central notably in capsid envelopment. Indeed, mutations in and around this pocket can profoundly modulate, and even abolish, secretion of enveloped virions. We have recently shown that Triton X-100, a detergent used during Cp purification, binds to the hydrophobic pocket with micromolar affinity. We here performed pharmacomodulation of pocket binders through systematic modifications of the three distinct chemical moieties composing the Triton X-100 molecule. Using NMR and ITC, we found that the flat aromatic moiety is essential for binding, while the number of atoms of the aliphatic chain modulates binding affinity. The hydrophilic tail, in contrast, is highly tolerant to changes in both length and type. Our data provide essential information for designing a new class of HBV antivirals targeting capsid–envelope interactions.
The capsid-forming core protein (Cp) is the building block of nucleocapsids and is involved in many crucial steps of the HBV life cycle; it is therefore emerging as a new antiviral target.3,4 Cp comprises 183 amino acids, of which the first 140 constitute the assembly domain required to form ordered capsid shells, connected via a short linker to the arginine-rich C-terminal domain (CTD) involved notably in nucleic acid binding.2,5–7 The X-ray crystallographic structure of truncated CTD-less Cp1498 is shown in Fig. 1A. The assembly domain features five αhelices of which α3 and α4 from two monomers associate into a four-helix bundle that protrudes as a spike from the capsid surface. 120 of the so-formed Cp dimers then assemble into the icosahedral T = 4 capsid (Fig. 1B),20 or into a minor class of T = 3 capsids which comprise only 90 dimers. The interdimer contacts are mediated by the last 30 residues of the assembly domain, including helix α5. The currently most advanced anti-Cp drugs are the Capsid Assembly Modulators (CAMs),9 all targeting the interdimer interface in the capsid and interfering with the assembly of proper, genome containing nucleocapsids (as recently reviewed in2,10,11). A new and unexplored option to interfere with nucleocapsid dynamics in general, and nucleocapsid envelopment in particular, is offered by the hydrophobic pocket formed in the Cp dimer at the base of the spike (Fig. 1C), which we have recently shown by solid-state NMR to bind the detergent Triton X-100 (TX100),12 followed by the resolution of cryo-EM structure of the complex.13 Binding causes well-detectable conformational alterations in the capsid structure which essentially localize to a region implicated in HBV nucleocapsid envelopment, i.e. virion formation.
Fig. 1 Workflow to assess binding to the hydrophobic pocket. (A) Organization of the five helices of Cp149 dimer (PDB 1QGT8) coloured on one monomer. (B) Structure of the HBV capsid, assembled from 120 copies of Cp149 dimer.8 (C) Structure of the Cp149 dimer in surface representation. The residues of the hydrophobic pocket impacted by the interaction of TX10012 are highlighted in blue. The eight residues whose NMR signals were used to quantify CSPs are highlighted in red. The pocket entrance is indicated by an arrow. Naturally occurring mutations are plotted in green. (D) Sequence of the full-length core protein (Cp183) with mutations highlighted. (E) Structure of the TX100 molecule (compound 1a), with the hydrophilic tail in red, the aromatic moiety in blue, and the alkyl part in para-position in orange. (F) Solution-state NMR spectra of the 2H13C15N-Cp149 apo dimer in grey, and with four equivalents TX100 in red ([Cp149monomer] = 80 μM and [TX100] 320 μM). Peaks used to assess CSPs are circled and labelled with their amino-acid type and position in the sequence. The assigned apo spectrum is shown in Fig. S2,† and the full set of CSPs between the two spectra are shown in Fig. S3.† (G) ITC raw data and binding isotherms displaying the titration of 4-hexylphenol (compound 2a) against Cp149 reassembled capsids at pH 7.5 and 298 K (KD = 13 ± 3 μM). For an overview of ITC runs see Fig. S4.† |
Envelopment occurs once the initially encapsidated viral pregenomic (pg) RNA is reverse transcribed, inside the capsid, into mature, double-stranded DNA, signalling readiness of the capsid for interaction with the envelope proteins. The mechanism of this regulation is still undetermined, although several hypotheses have been proposed.12,14–16 Indeed, envelopment is modulated by several naturally-occurring Cp mutations,17,18 many of them being located in and around Cp's hydrophobic pocket, such as F/I97L, P5T and L60V (Fig. 1C and D). Mutations in other envelopment-disabling positions also localize to the hydrophobic pocket, namely at L95 and K96.19 Interestingly, uncoupling of envelopment from genome maturation by the F/I97L mutation can be reverted by the A119F mutation in the preS envelope protein domain.20 In all cases, the most plausible interpretation for the mutationally altered envelopment phenotypes is a modification of the core-envelope interaction. In addition to the hydrophobic pocket in the Cp spike base, the spike tip has also been identified as interacting domain, as indicated by inhibition of envelopment by spike-binding peptides.21,22 The detailed binding site(s) of the envelope proteins on the core protein, as well as a molecular view of the interaction, remain to be determined.
We have recently found that TX100 binds to the hydrophobic pocket in Cp.12 The residues impacted by TX10012 binding are highlighted in blue in Fig. 1C, and colocalize with the residues described to alter envelopment phenotypes (F/I97L, P5T and L60V). Our finding thus opens a new avenue to target the capsid–envelope interaction, by taking this molecule as a lead compound for the development of new antivirals being capable of blocking capsid envelopment. Having determined TX100′s affinity to Cp,12 we here investigated the pharmacophore of Cp hydrophobic pocket binders to establish how the three chemical moieties of the molecule contribute to capsid binding. The variations we explored concern, as shown in Fig. 1E, firstly the TX100 highly hydrophobic branched aliphatic chain carrying five methyl groups (in orange); secondly, the aromatic ring (in blue); and thirdly the (–CH2–CH2-O-)n hydrophilic tail (in red), connected to the aromatic ring in para position to the aliphatic chain.
To assess binding, we first used solution-state NMR on the truncated Cp dimer (Cp149), which allowed to speed up analysis of the large number of compounds compared to the use of solid-state NMR on assembled capsids.23,24 As shown in the example spectra in Fig. 1F, chemical shift perturbations (CSPs) are a clear indicator for TX100 binding, and represent, when the exchange rate is in slow regime, the chemical shifts of the bound form. This is for example the case for TX100 and 4-tert-octylphenol (OP) (compounds 1a and 1c) as can be seen in the spectra in Fig. S5 and S6† where upon titration only peaks of the free and the bound forms can be observed. For compounds with lower affinity (higher KD) resulting in a fast exchange regime, peaks can be found at intermediate positions. This is the case for 4-hexylphenol (4HF, compound 2a) for which a titration is shown in Fig. S7.† In this case, if one considers that the compounds have similar binding modes and contacts, the extent of the CSPs induced by the different compounds can be correlated with the strength of binding.
Where binding constants were in an adequate range, we complemented the NMR data by isothermal titration calorimetry (ITC) measurements to quantify binding affinities with Cp149 reassembled capsids, as shown in the example of 4HF in Fig. 1G. ITC reveals that only compounds 1a and 1c have a KD below 10 μM, while all other compounds tested have lower affinities (see Table 1) and are therefore, as shown for 4HF (Fig. S7†), in fast exchange. Thus, one can consider that CSPs are representative of the binding strength at same molar ratios (4 molar equivalents).25
Ligand | K D (μM) | ΔH (kcal mol−1) |
---|---|---|
1a | 8.3 ± 0.5 | −4.59 ± 0.05 |
1c | 7.5 ± 0.5 | −4.79 ± 0.14 |
2a | 13 ± 3 | −6.0 ± 0.6 |
4a | 18 ± 7 | −5 ± 3 |
4b | 14 ± 7 | −7 ± 4 |
4c | 30 ± 4 | −12.7 ± 1.2 |
6g | 68 ± 7 | −3.41 ± 0.09 |
Chemical-shift assignments of the Cp149 dimer at pH 7.5, used as reference for CSP calculations, are shown in Fig. S2† and were deposited at the BMRB under accession number 51294. The assigned reference spectrum is shown in Fig. S2A.† Unlike in solid-state NMR analysis, where we mainly used 13C detection on protonated samples,12 in solution NMR not all amino acids of Cp are visible due to incomplete back exchange of several NH in the hydrophobic core of the Cp dimer after expression in deuterated medium. While this prevented the use of some residues localized directly in the hydrophobic pocket, we selected eight amino acids located near the entry of the pocket (in red in Fig. 1C), which can be detected to measure CSPs (as shown on the sequence in Fig. S2B†). As exemplified in Fig.S3† for TX100, CSPs can then be derived upon addition of each ligand.
Fig. 2 Effects of modulations of the TX100 aromatic ring and hydrophilic tail. (A) Chemical structures of compounds tested from categories 1 and 2. (B) NMR CSPs induced by the compounds for the eight selected capsid resonances (for full spectra see Fig. S8 and S9†). CSP averages are indicated by dotted lines. Residue numbers of the representative peaks are given below the CSPs; all CSPs throughout the manuscript are shown at the same scale. |
Fig. 3 Effects of modulations of the TX100 hydrophobic moiety. (A) Compounds tested from categories 3 and 4. (B) NMR chemical-shift perturbations induced by compounds on the eight selected capsid resonances (for full spectra see Fig. S10 and S11†). CSPs averages are indicated by dotted lines. |
The CSPs indicate an optimal atom number of four to eight carbons, as indicated by the similar CSPs induced by compounds 1c, 2a, 2h, 3a, 3b and 4c (average CSP between 0.046 and 0.142 ppm) (see Table S1† for solution NMR parameters and Table S2† for the list of all averaged CSPs). Linear or branched three-carbon chains reduced CSPs to below 0.050 ppm, as observed for compounds 4a and 4b. When long branched alkyl chains were introduced para to the phenolic OH group, the interaction also weakened, as seen for compounds 4f and 4g. The CSPs also revealed that investigated groups with heteroatoms did not establish new favourable interactions, as demonstrated by the small CSPs of compounds 3c, 3d, 3e, 4d, and 4e (average CSP < 0.029 ppm). Surprisingly, while a cyclopropyl moiety (3e) did not bind to the pocket, a benzyl moiety (3f) displayed substantial binding, as indicated by a CSP of 0.071 ppm. ITC measurements showed comparable affinities (between 14 and 30 μM) for compounds 4a, 4b, and 4c, and no measurable interactions for 4f and 4g, consistent with the NMR data.
Fig. 4 Effect of modulations of the TX100 aromatic ring, and concurrently on the hydrophilic and hydrophobic groups. (A) Compounds tested from categories 5 and 6. (B) NMR CSPs induced by the compounds on the eight selected capsid resonances (for full spectra see Fig. S12 and S13†). CSPs averages are indicated by dotted lines. |
Subsequently, we also established a variety of simultaneous modifications of the hydrophobic and hydrophilic groups (category 6). The only compound of this category displaying a significant interaction with the capsid was 6c, with an average CSP of 0.048 ppm (for full spectra, see Fig. S13†). Analysis of CSPs induced by 6b and 6c bearing both a benzyl alcohol on the hydrophilic part shows that a terminal tert-butyl group on the hydrophobic chain seems to favor the interaction, since the CSPs for 6c are more than twice as high as for 6b. Yet, CSPs for 6c still remain lower than for 1d, confirming the even higher impact of the 4-tert-butyl moiety on binding. Both compounds 6d and 6g were tested by ITC (Table 1), the latter displaying weak affinity, and the former no measurable data. The lack of affinity of 6d, compared to its para isomer 4a is revealed by ITC (Fig. S4†), which highlights that the relative position of the hydrophilic and hydrophobic groups is central for the interaction. NMR however report small, but, within the error bars, same CSPs for 4a and 6d.
Fig. 5 Computed models of OP and TX100 bound in the hydrophobic pocket of the Cp149 dimer. Docked model of (A) OP (compound 1c) and (B) TX100 (1a) in the hydrophobic pocket of Cp149 dimer. Residues located in the direct proximity of ligands (<5 Å) are represented in blue sticks. (C and D) Comparison of bound TX100 in the cryo-EM structure (in magenta, protein in wheat, PDB 7PZK)13 with the docked models (in green, protein in grey) for (C) TX100 and (D) OP. Neighbouring residues V13, E64, M93, K96 and F97 are shown as sticks. In both cryo-EM structure13 and docked models, TX100 contains n = 3 ethylene oxide units in the PEG tail. |
Fig. 6 Solid-state NMR can probe the impact of the ligand's hydrophilic tail on the hydrophobic pocket. (A) Comparison of 2D extracts from 13C–13C-DARR solid-state NMR spectra of Cp149 capsids reassembled without compounds (empty pocket, grey), and in presence of TX100 (bound pocket, red), OP (blue), or 4HF (orange). Full aliphatic regions of DARR spectra are shown in Fig. S14.† (B) Comparison of 13C-CSPs induced by the three compounds. Three groups could be identified based on their behaviour when bound to 4HF: P5 at the entrance of the pocket showing a mixture of free/bound pocket; the residues showing intermediate CSPs; and the residues showing no CSPs with 4HF. (C) Structure of Cp183-TX100 (PDB 7PZK)13 with residues involved in the three CSP classes shown in orange. The right panel is rotated by 180° to display the bottom of the hydrophobic pocket and the proximity with F97 sidechain. |
Interestingly, OP induces CSPs highly similar to binding of TX100, with spectra almost perfectly overlapping for most residues. This confirms that the PEG tail is not involved in the interaction, and that replacement by a hydroxyl group has almost no impact, even on residues surrounding the pocket. On the contrary, the binding with 4HF induces smaller carbon CSPs for residues within the hydrophobic pocket. In particular, the P5 Cγ-Cδ cross peak in presence of 4HF reveals the presence of two species, respectively corresponding to the bound and free hydrophobic pockets with each about 50% occupancy, as based on peak intensities, suggesting that only half of the pockets are occupied. P5 Cγ is located in the direct vicinity of the ligand, at around 4 Å from the aromatic ring according to the cryo-EM structure13 and docked models (Fig. 5). As such, it is a good indicator of binding and can be used to estimate the percentage of occupancy. However, for the other residues in the vicinity of the ligand such as Y6, A58, N92 and L100, we detect a broad peak, at an intermediate chemical shift between the free and the bound state, as illustrated for A58 (Fig. 6A-B). Finally, a third group could be identified, represented by residues V27, I59 and R98, which show almost no CSPs upon 4HF binding, but high CSPs upon binding of TX100 and OP (Fig. 6B). Interestingly, these residues are located at the bottom of the hydrophobic pocket, behind F97 (Fig. 6C). Hence, the lack of significant CSPs for these residues, as illustrated for R98, is likely due to an absence of a larger conformational rearrangement in presence of 4HF, in contrast to the F97 side-chain rotation observed with TX100.13 We therefore conclude that, compared to the branched tert-octyl moiety, the less sterically hindered n-hexyl group might not be able to induce the rotation of the F97 side chain.13
A recent cryo-EM structure13 allowed to determine the position of the previously identified TX100 molecule12 in the hydrophobic pocket; while electron density was observed in the pocket already in previous EM studies,27,28 the nature of the molecule had not been determined, likely due to the lack of resolution. Indeed, while the EM density allows positioning of TX100 mainly based on the aromatic ring and the terminal –C(CH3)3 group, chemical composition and hydrophilic tail were not accessible from the electron densities.
A docking study based on NMR chemical shifts, coupled with the reasonable assumption that the hydrophobic part, rather than the hydrophilic part, fits into the hydrophobic pocket, results in a bundle of structures, many of which show an orientation very similar to that obtained by cryo-EM.13 Both structural models position the oxygen of the phenolic ring in a position where it can interact with residue K96 at the pocket entry.
In contrast to NMR,12 cryo-EM reported only limited structural changes13 in the presence of TX100, predominantly repositioning of the F97 aromatic ring. NMR indeed detects numerous and beyond-the-pocket-extending CSPs between the apo and bound states. This is likely due to the higher sensitivity of NMR which can identify, with atomic resolution, changes in local dihedral angle orientations, but also differences in the local magnetic field as induced by different positioning of aromatic side chains. When considering that the largest change described in the cryo-EM study13 is the movement of the aromatic F97 ring, one can deduce that cryo-EM can detect changes which give rise to 13C CSPs > 1.5 ppm, which is 5 times higher than the limit under which we consider the CSP insignificant (Fig. 3A in ref. 12). Thus, while cryo-EM has the clear advantage to position the binders inside the pocket, solid-state NMR can identify the residues impacted by binding, including those which react in an allosteric manner.
One should mention that TX100 is known to cause endocrine disruption29 and, besides a variety of other phenols, particularly 4-tert-octylphenol is considered as a major environmental pollutant,30 as it is toxic to many organisms. In order to consider TX100 as a starting point for further compound development, this toxicity must be addressed. Alternatives to TX100 are currently under development, as for instance the new detergent Nereid, an environmentally-friendly replacement candidate for TX100 based on substitution of the phenol with a benzylic alcohol.31 Interestingly, we here show that the benzylic alcohol version (compound 1d) retains strong binding capacity to Cp, which points out a direction towards less endocrine-disrupting Cp pocket binders.
Footnote |
† Electronic supplementary information (ESI) available. See https://doi.org/10.1039/d2sc02420a |
This journal is © The Royal Society of Chemistry 2022 |