Open Access Article
Jindřich
Fanfrlík‡
*a,
Jiří
Brynda‡
a,
Michael
Kugler
a,
Martin
Lepšík
a,
Klára
Pospíšilová
a,
Josef
Holub
b,
Drahomír
Hnyk
b,
Jan
Nekvinda
b,
Bohumír
Grüner
b and
Pavlína
Řezáčová
*a
aInstitute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo nam. 2, 166 10, Prague 6, Czech Republic. E-mail: fanfrlik@uochb.cas.cz; pavlina.rezacova@uochb.cas.cz
bInstitute of Inorganic Chemistry of the Czech Academy of Sciences, 250 68 Husinec-Řež, Czech Republic
First published on 3rd January 2023
Among non-covalent interactions, B–H⋯π and C–H⋯π hydrogen bonding is rather weak and less studied. Nevertheless, since both can affect the energetics of protein–ligand binding, their understanding is an important prerequisite for reliable predictions of affinities. Through a combination of high-resolution X-ray crystallography and quantum-chemical calculations on carbonic anhydrase II/carborane-based inhibitor systems, this paper provides the first example of B–H⋯π hydrogen bonding in a protein–ligand complex. It shows that the B–H⋯π interaction is stabilized by dispersion, followed by electrostatics. Furthermore, it demonstrates that the similar C–H⋯π interaction is twice as strong, with a slightly smaller contribution of dispersion and a slightly higher contribution of electrostatics. Such a detailed insight will facilitate the rational design of future protein ligands, controlling these types of non-covalent interactions.
![]() | ||
| Fig. 1 (A) The structural formulae of the carborane propyl-sulfonamide compounds 1, 2 and 3 and the values of the inhibition constant Ki for the inhibition of CA II. (B) Details of the crystal structure of compound 1 bound in the active site of CA II (PDB code 6YZT). Interacting residues are marked with sticks. The inhibitor and the residue Phe131 occupy two alternative conformations: the major conformation (an occupancy of 0.8) is depicted in stick representation, whereas the minor conformation (an occupancy of 0.2) is in line representation. Polar interactions are represented by blue dashed lines, while the black dashed line marks the potential B–H⋯π interaction between the carborane cage and Phe131. | ||
Carbonic anhydrases (CAs) are metalloenzymes catalyzing the interconversion between CO2 and bicarbonate. These enzymes participate in many physiological processes and some of their isoforms are involved in human pathologies, such as cancer development (isoform CA IX).11,12 CAs have thus been targets for inhibitor development; dozens of diverse families of CA inhibitors (CAIs) have been prepared.13 Due to this wealth of gathered information, these systems are an attractive model for biophysical studies of protein–ligand binding.14 Human CAs can be efficiently and selectively inhibited by carboranes linked to a sulfamide or sulfonamide moiety.15–18 More than 30 X-ray crystal structures of carborane-containing inhibitors with CA II or CA IX have been determined at high or even atomic resolution, which has provided insight into the structural basis of the interactions of carborane with enzyme active sites.15 Here, we focus on a series of carborane propyl-sulfonamides that differ in the geometric positions of carbon atoms within the carborane cage and on their affinities toward CA II (Fig. 1(A)). Some compounds have exhibited selective inhibitory activity toward CA IX over CA II,19 and the structural basis of this selectivity has been explained by a detailed crystallographic study of compound 1 in these complexes.17 Interestingly, the crystal structure of 1 in complex with CA II (PDB code 6YZT) has revealed that compound 1 has acquired two alternative binding orientations in the CA II active site and the minor conformation (with an occupancy factor of 0.2) is accompanied by a rearrangement of the Phe131 side chain (Fig. 1(B)). The mutual position and distance between the carborane cage and the side chain of Phe131 indicate a potential B–H⋯π interaction (the dashed black line in Fig. 1(B)).
mg ml−1 protein solution in 50
mM Tris–H2SO4, pH 7.8. The final concentration of DMSO in crystallization drops did not exceed 10%. Crystals were prepared by the hanging-drop vapor-diffusion method at 18 °C using EasyXtal® 15-well plates (Qiagen). Drops containing 2 μl of the complex solution were mixed with 1 μl of precipitant solution. Subsequently, these mixtures were equilibrated over a reservoir containing 1
ml of the precipitant solution. The precipitation solution consisted of 1.6
M sodium citrate and 50
mM tris–H2SO4, pH 7.8. Crystals formed within one to three weeks. Prior to data collection, the crystals were soaked for 10
s in the reservoir solution supplemented with 20% (v/v) sucrose and stored in liquid nitrogen. X-ray diffraction data at 100
K were collected on BL14.1 and BL14.2, operated by the Helmholtz-Zentrum Berlin (HZB) at the BESSY II electron storage ring (Berlin-Adlershof, Germany).20 Diffraction data were processed using the XDS suite of programs.21,22 Crystal parameters and data-collection statistics are summarized in the ESI† (Table S1).
The crystal structures of the CA II complexes with compounds 2 and 3 were determined by the difference Fourier technique. Coordinates from the 4Z1N entry in the PDB23 were used as models for the CA II complexes. Atomic coordinates of inhibitor molecules were generated by quantum-mechanical (QM) optimizations in the Turbomole package24 by means of the density functional theory (DFT) method augmented with empirical dispersion correction25 using the B-LYP functional and the SVP basis set. The geometric library for the compounds was generated using the Libcheck program.26 The Coot program27 was used for inhibitor fitting, model rebuilding, and the addition of water molecules. Refinement was carried out with the Refmac5 program28 with 5% of the reflections reserved for cross-validation. The structures were first refined with isotropic atomic-displacement parameters (ADPs). After the addition of solvent atoms and zinc ions, building inhibitor molecules in the active site, and several alternate conformations for a number of residues, anisotropic ADPs were refined for nearly all atoms. The refinement of ADPs was not carried out for spatially overlapping atoms in segments with alternate conformations or for oxygen atoms of water molecules with an unrealistic ratio of ellipsoid axes. The quality of the crystallographic model was assessed with MolProbity.29 The final refinement statistics are summarized in the ESI† (Table S1). All the figures representing structures were created using PyMOL.30 Contacts were analyzed using the Contact program included in the CCP4 suite.31 Structure factors and coordinates were deposited in the PDB under the accession codes 8AA6 and 8AAE for the CAII complex with 2 and 3, respectively.
The score was computed as the sum of the gas-phase interaction energy (ΔEint), the interaction solvation free energy (ΔΔGsolv), and the change of the conformational “free” energy of the ligand and protein
.38 ΔEint was computed using the QM/MM methodology described above. For solvation free-energy calculations, the implicit solvent IGB7 model was utilized.
was computed as the “free”-energy change between the monomer taken from the optimized protein–ligand complex and the monomer structure optimized in solution by the same method. The “free” energy (G′) was defined as the sum of the gas-phase energy (E) and the solvation free energy (G).
The ΔE values of the selected motif were decomposed using the symmetry-adapted perturbation-theory (SAPT) methodology.39 The simplest truncation of SAPT (SAPT0) decomposition was performed with the recommended jun-cc-pVDZ basis set using the PSI4 program.40
![]() | ||
| Fig. 2 The active site of CA II with the bound compounds 1 (panel A), 2 (B) and 3 (C). The inhibitor residue Phe131 occupies two alternative conformations. Major populations are depicted in stick representation, whereas minor conformations are shown in line representation. The 2Fo-Fc map is contoured on 1.2 sigma in blue (for OMIT difference maps see Fig. S1, ESI†). On this level, electron density is shown for the major conformation of the ligand and the side chain of Phe131. Panels D and E depict two inhibitor orientations in CA II active sites: orientations I and II exemplified by compound 3. Interacting residues are highlighted as sticks and labeled. The dashed lines have the same meaning as in Fig. 1. | ||
A comparison of the crystal structures of all the three isomers (Fig. 2(A)–(C)) shows that the interaction of the sulfonamide head moiety with the catalytic zinc ion and residues at the bottom of the active-site cavity is conserved, with differences being located in the position of the carborane cage. For ligands 1 and 3, we can identify two possible ligand orientations: orientations I and II (Fig. 2(D) and (E)). Orientation I is characterized by the staggered position of the S–C and S=O bonds in the –CH2–CH2–SO2–NH− moiety, which enables the binding of the carborane cage to a preformed hydrophobic pocket formed by residues Val121 and Phe131. This orientation is more prevalent in our crystal structures and can be found in all three structures as: (a) the major conformation A of compound 1, (b) a single conformation of 2, and (c) the minor conformation B of compound 3. Orientation II has an extended propyl linker (eclipsed position of the S–C and S
O bonds in the –CH2–CH2–SO2–NH− moiety) and is only possible when the Phe131 side chain changes its rotamer position such that the phenyl ring is more exposed to the solvent (Fig. 2). This orientation occurs as: (a) the minor conformation B of compound 1 and (b) the major conformation A of compound 3.
of −8.5, −9.0 and −9.2 kcal mol−1, respectively, Table 1). It should be noted here that entropic effects, coming for example from two possible conformations of the propyl linker of bound 1 and 3, which are not treated in the present version of the scoring function, could add a higher penalty to the score of 2 with respect to those of 1 and 3. Compound 1, the weakest-binding inhibitor, had the least favorable score because of the least favorable interactions with the CA II active site as reflected in the least negative ΔEint value of −194.5 kcal mol−1 (Table 1). The score of 3 was the most favorable due to the strongest interactions with the protein (the most negative ΔEint of −203.6 kcal mol−1). Considering the conserved positions of the sulfonamide head moiety, the catalytic zinc ion and the residues of the active-site cavity in the modeled structures, the only difference was in the position of the C–H vertex of the carborane cage. The C–H vertex of 3 was oriented toward Phe131, thus replacing the B–H···π interaction of the 1:CA II complex with the C–H⋯π interaction, which implies that the latter is stronger. The stronger interaction (ΔEint) is only partly offset (163.3 vs. 169.8 kcal mol−1) by solvation (ΔΔGsolv; Table 1).
) and scores. The total score was computed as the sum of the following terms: the interaction energy (ΔEint), the interaction solvation free energy (ΔΔGsolv), and the change of the conformational ‘free’ energy
of the ligand (L) and protein (P). The more favorable scores of the two orientations of 1 and 3 are in bold. All energies are in kcal mol−1 and Ki in nM
| Complex | Orientation | Occupancy in crystal | K i | Score | ΔEint | ΔΔGsolv | (L) | (P) | |
|---|---|---|---|---|---|---|---|---|---|
a As determined in ref. 15.
b
is derived from Ki values using .
|
|||||||||
| 1:CA II | I | 0.8 | 622 ± 87 | −8.47 | −8.7 | −190.1 | 168.0 | 5.6 | 7.8 |
| II | 0.2 | −10.6 | −194.5 | 163.3 | 7.4 | 13.4 | |||
| 2:CA II | I | 1.0 | 254 ± 23 | −9.01 | −12.0 | −197.7 | 171.5 | 6.7 | 7.5 |
| 3:CA II | I | 0.3 | 171 ± 17 | −9.24 | −10.1 | −196.6 | 171.8 | 6.9 | 7.8 |
| II | 0.7 | −13.3 | −203.6 | 169.8 | 7.2 | 13.3 | |||
Compound 2 only had a single conformation in orientation I in the crystal structure. In such an arrangement, the C–H vertex of 2 still interacted with the π ring of Phe131, albeit in a less favourable tilted arrangement (as opposed to an ideal perpendicular arrangement). In contrast to other inhibitors, orientation I of compound 2 enabled the formation of C–Hδ+⋯δ−H–B dihydrogen bonds (Fig. 3), a typical non-covalent interaction of heteroboranes with proteins.3
![]() | ||
| Fig. 3 The models of the active site of CA II with the bound compounds 1 (A), 2 (B) and 3 (C) in orientation I (crystal structures of orientations II are shown in Fig. 2). B–H⋯π and C–H···π are shown as dot-dashed lines. Hydrogen contacts are shown as dashed lines. Hydrogen separations are in Å. | ||
The applied scoring function did not rank correctly the orientations of 1. The orientation II with a minor occupancy of 0.2 had erroneously a more favorable score by 1.9 kcal mol−1, which was due to more favorable interaction (ΔEint) and desolvation (ΔΔGsolv) terms. This may be caused by inherent limitations of the methodology used, such as the lack of explicit solvation or dynamical treatment and entropy effects.
To gain an even deeper insight into the nature of the B–H⋯π and C–H⋯π interactions evidenced in CA II complexes with 1 and 3, we further dissected the gas-phase interaction energy ΔE into individual terms using the SAPT methodology on small models derived from the crystal structures.39 The results showed favorable interactions of the carborane cages of both inhibitors with the side chain of Phe131 but more so for C–H⋯π (−3 kcal mol−1) than for B–H⋯π (−1.6 kcal mol−1) (Table 2). The leading term was dispersion (Edisp) as expected for weak hydrogen bonding.43 It was more pronounced for B–H⋯π than for C–H⋯π (74 and 66% of the sum of all attractive terms, respectively). The second largest attractive term was electrostatics (Eelec), which was more important for C–H⋯π than for B–H⋯π (26 and 17% of the sum of all attractive terms, respectively). It is worth mentioning that the Eelec for B–H⋯π is attractive, which has confirmed the amphiphilic noncovalent bonding character of the B–H group.9,44 In both cases, the induction term (Eind) was of little importance (below 10% of the sum of all attractive terms). The exchange-repulsive term (Eexch) was more repulsive for B–H⋯π (3.1 kcal mol−1) than for C–H⋯π (1.7 kcal mol−1). The larger repulsion of B–H⋯π might be due to the smaller spatial requirement of the acidic Hδ+ atom of the C–H group in comparison with the hydridic Hδ− atom of the B–H group. Moreover, the C atom is also smaller than the B atom (van der Waals radius of C being smaller by 0.22 Å45 than that of B), which overall reduces the spatial requirement of the entire C–H vertex.
Footnotes |
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d2cp04673c |
| ‡ These authors contributed equally to this work. The manuscript was written through contributions of all authors. |
| This journal is © the Owner Societies 2023 |