Shikshya
Bhusal
and
Omar
Valsson
*
Department of Chemistry, University of North Texas, Denton, Texas 76203, USA. E-mail: omar.valsson@unt.edu
First published on 6th November 2025
Targeted protein degradation has emerged as a promising strategy for developing novel therapeutics, particularly for “undruggable” disease-related proteins. One approach is the use of PROteolysis TArgeting Chimeras (PROTACs) degraders, which induce the formation of ternary complexes between the target protein and E3 ligase, leading to ubiquitination and degradation of the target protein. Understanding the conformational behavior of PROTACs in solutions and how it relates to their pharmacokinetic properties and membrane permeability is crucial for optimizing PROTAC design and efficacy. Due to the large size and flexibility of PROTACs and their chameleonic character, it is essential to understand their conformational ensembles, and how they depend on the environment. Here, we introduce a novel methodology for exploring PROTAC conformational behavior using atomistic simulations. We employ the enhanced sampling method parallel bias metadynamics, where we bias generic local collective variables, specifically all rotatable dihedral angles, thereby avoiding the considerable challenge of identifying suitable global collective variables for biasing. The methodology allows for obtaining free energy surfaces of global CVs via reweighting. We apply the method to the prototypical case of the MZ1 PROTAC degrader, which targets bromodomain-containing protein-4 (BRD4) for degradation via the von Hippel-Lindau (VHL) E3 ligase, and elucidate its conformational behavior in different solvents, allowing us to gain insights into the chameleonic property of MZ1. Our results confirm that MZ1 adopts distinct conformations depending on the solvent, exhibiting collapsed conformations in water and chloroform, and extended conformations in DMSO. Collapsed conformations in chloroform have been correlated to increased cell permeability. Thus, our results show that MZ1 takes on conformations suitable for membrane permeation in apolar environments. Our methodological framework is generally applicable to large flexible molecules like PROTACs, and the results demonstrate its efficiency, laying the groundwork for similar investigations for other PROTACs and other “beyond-rule-of-5” drug candidates. This work provides valuable insights into the design and optimization of PROTACs, ultimately contributing to developing novel therapeutics for “undruggable” proteins.
Targeted protein degradation is a promising approach in the realm of therapeutics, offering solutions for proteins that have long resisted conventional drug targeting.2 This innovative field is gaining attention because it allows us to manipulate proteins that were previously challenging to target using standard small-molecule drugs and considered “undruggable”.3 These difficulties arise from factors such as the complex shapes of some protein active sites, which make them hard to bind to, or the smooth surfaces of certain proteins, which provide few binding sites for traditional drugs.4 PROteolysis TArgeting Chimeras (PROTACs) and molecular glues have emerged as alternative therapeutic modalities to target proteins.5 They proficiently overcome the limitations of traditional small-molecule inhibitors by leveraging the ubiquitin-proteasome system to tag and degrade target proteins, rather than merely inhibiting their activity. This mechanism allows for the targeting of “undruggable” proteins with complex structures or limited binding sites, enhancing selectivity and reducing off-target effects.5,6
PROTACs are highly specific molecules that degrade unwanted or harmful proteins in cells by hijacking the ubiquitin-proteasome system of the cell.5,7 The unique mechanism of PROTACs has attracted considerable attention in the pharmaceutical industry and shows promise in addressing a wide range of conditions, including cancers, immune disorders, viral infections, and neurodegenerative diseases.8 PROTAC molecules consist of two ligand-binding warheads that are connected via an intermediate linker. One of the ligands binds to the protein of interest, while the other one binds to an E3 ligase protein, inducing proximity and creating a ternary complex with a stable protein–protein interaction between the two proteins.9 Examples of E3 ligases are von Hippel-Lindau (VHL), cereblon (CRBN), MDM2, KEAP1, and CHIP.10 The role of E3 ligase in the cell is to tag protein for degradation by adding ubiquitin to it. Thus, by inducing proximity, the PROTAC molecule facilitates that the E3 ligase will ubiquitinate the protein of interest, leading to subsequent degradation in the proteasome.9 The intermediate linker also plays a vital role in physicochemical properties and bioactivity of the PROTAC molecule. The linker architecture determines to what degree the two ligands interact with the protein of interest and E3 ligase, and thus the maximal activity of the PROTAC molecule.11,12 An important aspect is that the PROTAC molecule is recycled and can engage another protein pair. A formidable challenge preventing PROTACs from realizing their therapeutic potential is their lack of compliance with the well-established drug-like properties associated with oral drugs, namely Lipinski's “rule-of-5” (Ro5) and Veber's rules.13,14 Over the past decade, the field of PROTACs has rapidly evolved, with numerous PROTACs developed for various E3 ligases and protein ligands, enabling the degradation of diverse proteins of interest. Numerous PROTACs are already in the advanced clinical trials including vepdegestrant (ARV-471).15 These advancements in PROTACs have resulted in achievements such as overcoming drug resistance, targeting previously “undruggable” proteins, and improving drug selectivity and specificity.2
Computational methods, including AI-driven drug design, molecular docking, molecular dynamics (MD) simulations, and quantum mechanics/molecular mechanics modeling, are increasingly essential in PROTAC development, enabling the prediction of degradation activity, structural modeling of ternary complexes, ubiquitination site accessibility, and optimization of linker properties and pharmacokinetics.9,16–24 These approaches have led to advances such as dual-target PROTACs and machine learning models capable of forecasting degradation efficiency and suggesting molecular modifications to enhance performance and selectivity.19,25
PROTACs are large and structurally complex molecules, comprising two ligand-binding warheads connected by a flexible linker group.9 This complex architecture poses significant challenges for membrane permeability and cellular uptake, ultimately affecting their bioavailability and intracellular activity. However, the flexibility of the linker group enables PROTACs to adopt multiple conformations, which is crucial for their function. This ability of PROTACs to change conformations or behavior in different environments is referred to as chameleonicity.26 This ability allows molecular chameleons to display both high aqueous solubility and high cell permeability.27 In the context of PROTACs, chameleonicity can be exhibited in several ways, such as changes in solubility, stability, or conformations in different solvents and cellular membrane environments, or in the presence of their target proteins and E3 ligases. For instance, PROTACs may adopt different conformations when bound to their target proteins and E3 ligases, forming ternary complexes that are necessary for their protein degradation activity.28 Moreover, this property can also impact their physicochemical properties, such as lipophilicity and polarity, which are critical for their oral bioavailability and pharmacokinetics.29 As such, understanding and optimizing PROTACs’ chameleonicity is an essential aspect of their development as therapeutic agents.
As a result of the flexible nature of the molecule and chamelonicity, studying a single conformation of PROTAC is insufficient, and instead, a conformational ensemble approach is necessary to capture their dynamical behavior. This approach, also explored in previous studies,30,31 involves characterizing the conformational behavior of PROTACs to understand their chameleonic behavior in different environments, such as aqueous and nonpolar solvents. By doing so, researchers can gain insights into how PROTACs adapt to various environments and maintain their activity, which is essential for their development as therapeutic agents. To accurately characterize these molecules, it is necessary to study the conformational ensemble, which can be represented as a collection of structures weighted by their Boltzmann factors.30 This approach allows researchers to capture the dynamic behavior of PROTACs and understand how they adapt to different environments. Atomistic molecular dynamics (MD) simulations are a powerful tool for studying the conformational ensemble of PROTACs, but they generally require enhanced sampling techniques32,33 to accurately sample the phase space and efficiently explore the vast conformational space. By employing these techniques, researchers can, in principle, reconstruct the free energy surface of PROTACs and their conformational ensemble and identify the most stable conformations, ultimately gaining insights into their chameleonic behavior and its implications for their therapeutic activity.
Dimethyl sulfoxide (DMSO), chloroform, and water are commonly used solvents to study the conformational behavior of PROTACs, as they mimic different cellular environments.34 Due to their varying polarities, these solvents help reveal how PROTAC structures adapt in polar versus apolar environments. DMSO supports permeability studies and is compatible with cellular systems.35 Chloroform plays an important role in mimicking the apolar environment of the cell membrane as it has a dielectric constant (ε = 4.8) similar to that of a lipid bilayer (ε = 3.0).11,36,37 The propensity of PROTACs to adopt collapsed conformations in apolar environments, such as chloroform, correlates with increased cell membrane permeability.11 Water, being highly polar, is ideal for exploring hydrophilic interactions and biological activity in physiological conditions. Together, these solvents provide a comprehensive picture of PROTAC behavior across biologically relevant environments.
In this study, we concentrate on the prototypical case of the MZ1 PROTAC molecule, that has been extensively studied both experimentally18,39–44 and using simulations.18,41 This compound induces the degradation of bromodomain-containing protein 4 (BRD4) by forming a ternary complex in a positively cooperative manner with a VHL E3 ligase, see Fig. 1(a). MZ1 is “sandwiched” between the two proteins, inducing extensive new protein–protein and protein–ligand contacts of both hydrophobic and electrostatic nature. It is known that BRD4 is overexpressed in a variety of tumor cells;45 thus, BRD4 has been an important target protein for degradation. Structurally, MZ1 consists of a VHL warhead linked to the BRD4 warhead via a three-unit poly(ethylene glycol) (PEG) linker, as shown in Fig. 1(c). The VHL warhead plays a critical role in recognizing and recruiting the E3 VHL ligase, which mediates the ubiquitination and subsequent degradation of the target protein, making it a key component in the function of PROTACs like MZ1.18,46,47
![]() | ||
| Fig. 1 (a) Crystal structure of the BRD4–MZ1–VHL ternary complex (PDB entry 5T3538,39), with BRD4 (red) and VHL (blue) bound by the PROTAC MZ1. (b) Molecular structure of the MZ1 PROTAC molecule. (c) Detailed view of MZ1, highlighting the BRD4 warhead, VHL warhead, and linker region. (d) The MZ1 molecule, highlighting 29 rotatable local dihedral angles used as collective variables in the parallel bias metadynamics simulations. | ||
The main aim of this paper is to develop a simulation methodology that allows us to accurately and efficiently characterize the conformational ensemble of PROTACs in different solvents, including water, chloroform, and DMSO. To achieve this, we employ the enhanced sampling method called parallel bias metadynamics (PBMetaD).48 A crucial aspect of enhanced sampling methods is the selection of collective variables (CVs), which can be challenging but is critical for the success of the simulation.49 The selection of inappropriate CVs can result in poor convergence and inaccurate outcomes. In this study, we address this challenge by considering generic local CVs, specifically all rotatable dihedral angles. We apply this strategy to the PROTAC molecule MZ1, identifying a total of 29 dihedral angles within the MZ1 structure. One of the benefits of parallel bias metadynamics is that it allows for biasing a large number of CVs simultaneously.48 A similar procedure was recently applied to characterize the conformational ensemble of intrinsically disordered peptides in solution.50–52 Employing generic local CVs like dihedral angles is a key aspect of our work as it allows for developing an effective sampling strategy that is generally applicable to most all PROTACs, and other large flexible molecules, but avoids the challenging task of selecting appropriate global CVs for biasing. By using this approach, we can efficiently explore the conformational space of MZ1 in different solvents and gain insights into its chameleonic behavior. Furthermore, we employ a reweighting method32,53–55 to remove the effect of the PBMetaD bias potential and obtain the free energy surface (FES) as a function of global CVs, such as radius of gyration, distances between different moieties within MZ1, solvent accessible surface area (SASA), and polar surface area (PSA). This approach enables us to obtain a more comprehensive understanding of the conformational ensemble of MZ1, including its overall shape, compactness, and solvent interactions. By analyzing these global CVs, we can gain insights into how the conformational ensemble of MZ1 changes in response to different solvents, and how these changes affect its chameleonic behavior and potential for therapeutic applications. In particular, understanding the conformational behavior in chloroform is of great importance as this apolar solvent mimics the hydrophobic membrane environment,11,36,37 and it has been observed that a higher likelihood of adopting collapsed conformations in chloroform correlates with increased cell permeability.11
523 solvent molecules for the water setup, 3070 solvent molecules for the DMSO setup, and 3808 solvent molecules for the chloroform setup. For the water solvent setup, we add a physiological solution of sodium and chlorine ions with a salt concentration of 0.15 molar. The ions parameters are taken from the Amber99sb force field.66–69
All the molecular dynamics (MD) simulations are run using the GROMACS 2022.5 code70 patched with the PLUMED 2.8.2 enhanced sampling plug-in code.71,72 We use the leap-frog integrator with a 2 fs timestep, periodic boundary conditions with a dodecahedron box, Verlet cutoff scheme73 with a 10 Å (1 nm) cutoff distance for electrostatic and van der Waals interactions, and the particle mesh Ewald (PME) method for long-range electrostatics.74 We employ long-range dispersion corrections for the energy and pressure. All bonds between heavy atoms and hydrogens are constrained with LINCS algorithm.75 The rigid TIP3P water model is constrained using the SETTLE algorithm.76 To maintain a stable temperature and pressure during the simulations, we employ the stochastic velocity rescale thermostat77 with a coupling time constant of 0.1 ps and a reference temperature of 300 K, and isotropic stochastic cell rescaling barostat78 with a coupling time constant of 1.0 ps, reference pressure of 1 atm, and compressibility of 4.5 × 10−5 bar−1. To validate the compressibility parameter for different solvents, we calculated the densities of neat chloroform and DMSO and found them to be approximately similar to experimental values.
The initial setup of the molecule in each solvent is performed through a multi-step process. First, to remove bad contacts, we perform energy minimization of 50
000 steps using the steepest descent algorithm with default convergence criteria. This is followed by an NVT (constant number of particles, volume, and temperature) equilibration run at 300 K for 2 ns, and then an NPT (constant number of particles, pressure, and temperature) equilibration simulation at 300 K and 1 atm pressure for 2 ns, with both simulations using position restraints on the heavy atoms of the MZ1 molecule to allow the solvent to relax (force constant of 1000 kJ mol−1 nm−2). Finally, we perform an NPT equilibration without position restraints at 300 K and 1 atm pressure for 2 ns. The final frame from this run is used as the starting point for further production molecular dynamics simulations.
The parallel bias metadynamics simulations48 are run for 5 μs in the NPT ensemble at 300 K and 1 atm pressure. As CVs for biasing, we consider all 29 rotatable local dihedral angles shown in Fig. 1(d). The fluctuations of the dihedral angles within the planar ring moieties of the ligands binding to BRD4 and VHL are largely determined by the rigid planar geometry rather than specific interactions or conformational transitions so we exclude them from the biasing, thereby minimizing the risk of introducing artificial correlations in the biasing and ensuring a more accurate description of the systems' conformational landscape. We update the PBMetaD bias potentials by depositing Gaussians every 0.5 ps. We employ a bias factor of 24 and an initial Gaussian height of 0.3 kJ mol−1. The Gaussian width for all the dihedral angle CVs is set to 0.2 radians. The bias potentials are expanded on grids ranging from −π and +π radians and using a default grid spacing of 1/5 of the CVs’ Gaussian width. For the water solvent setup, we perform three independent PBMetaD simulations, and for the DMSO and chloroform solvent setup, we perform two independent PBMetaD simulations, with each simulation starting using different random seeds for the thermostat and barostat.
We also perform PBMetaD simulations for the water solvent setup where, in addition to the 29 dihedral angle CVs, we consider the radius of gyration of the MZ1 molecule as an additional biased CV. The radius of gyration CV is defined using all the heavy atoms of MZ1. The Gaussian widths for the radius of gyration CV is set as 0.5 Å (0.05 nm). The bias potential for the radius of gyration is expanded on a grid with a range from 0.0 to 15 Å (0.0 to 1.5 nm) and using a default grid spacing of 1/5 of the CVs' Gaussian width. All other PBMetaD biasing parameters are the same as above. We perform two independent simulations for this computational setup.
For analysis, we save trajectory frames and CV values every 1 ps. For trajectory analysis, we use VMD 1.9.3.79 In addition to the local dihedral angles, we also consider several global CVs to analyze the simulations. These include the distances between the planar ring moieties, specifically the ring structure of MZ1 (5, 6, or 7 membered rings), selected from both the BRD4 and VHL warheads, as shown in Fig. S2 and Table S1 in the SI. We also consider the radius of gyration, defined using all the heavy atoms of MZ1. All these CVs are calculated using the PLUMED 2.8.2 enhanced sampling plug-in code.71,72 We also consider the solvent accessible surface area (SASA)80 and the polar surface area (PSA).81 For PSA, we defined it to include all polar atoms and atoms with high electronegativity and the potential to engage in polar interactions, specifically oxygen, nitrogen, sulfur, chlorine, and hydrogens attached to these atoms. As discussed below and in the SI, we also considered PSA defined using several other definitions. SASA and PSA are calculated using the Shrake and Rupley algorithm82 implemented in GROMACS 2022.5, employing a default probe radius of 1.4 Å (0.14 nm). We employ PLUMED to extract conformations from free energy minima by applying conditional filtering in the minima region based on ranges of CV values, as shown in Table S2 in the SI.
To obtain an unbiased estimate of the free energy surfaces, we employ reweighting to remove the effect of the PBMetaD bias potential.32,54,83 To calculate the weights, we employ last-bias reweighting where we load all the Gaussians deposited after 5 μs and calculate the statistical weight for each frame in the simulation trajectory assuming a fixed bias potential.83 Additionally, we perform last-bias reweighting at several intermediate time points like 100 ns, 200 ns, 500 ns, 700 ns, 1 μs, 2 μs, and 3 μs to assess the convergence of the simulations and the reweighting procedure as shown in Fig. S7–S9 in the SI. All densities and free energy surfaces are obtained using kernel density estimation in PLUMED 2.8.2 where we validate the bandwidths employed by comparing to one-dimensional free energy profiles obtained using discrete histograms. All density and free energy surface figures are made using matplotlib.84
The GROMACS and PLUMED input files required to reproduce the results reported in this paper are available on PLUMED-NEST (https://www.plumed-nest.org), the public repository of the PLUMED consortium,72 as plumID:25.006 at https://www.plumed-nest.org/eggs/25/006/.
We start by considering the radius of gyration, which is a key global variable that provides valuable insights into the overall shape and compactness of a molecule. It measures the average distance of the atoms from the center of mass.85 Smaller radius of gyration values indicate more compact and globular structures, while larger values suggest more extended or unfolded conformations. Changes in the radius of gyration can signal conformational changes, such as folding or unfolding. Additionally, a compact molecule with a smaller radius of gyration may be less soluble in a solvent, whereas a more extended molecule with a larger radius of gyration may exhibit greater solubility.86 In Fig. 2, we present the distribution of the radius of gyration values obtained for the simulations in the different solvents. We observe that the radius of gyration values are quite low in water, as compared to the other solvent, indicating a more compact structure. This suggests that the molecule is more globular in water, potentially affecting its solubility and interactions with other molecules. In contrast, the radius of the gyration values are much larger in DMSO, indicating more extended and unfolded conformations. In addition, the radius of gyration distribution is quite broad in DMSO, indicating that the molecule can more easily sample different conformations, that is, it is more flexible, as compared to the water solvent. Similarly, like water, we observe that the radius of gyration values are also low for chloroform, indicating more collapsed structures than DMSO. Previous studies have shown that the tendency of PROTACs, such as MZ1, to adopt collapsed conformations in apolar environments like chloroform is associated with improved cell permeability.11 The distribution of radius of gyration for chloroform also shows a broader distribution than water but is not as broad as in DMSO. We can observe two peaks in the chloroform distribution that correspond to two distinct molecular conformations, as discussed below (see Fig. 6). These findings suggest that the molecule adopts different conformations in the three solvents, which may have implications for its behavior and function.
To understand the convergence behavior of our simulations, we obtain the radius of gyration distributions at different times and present the average and standard deviation as a function of time in Fig. 3. The corresponding radius of gyration distributions are shown in Fig. S7 in the SI. In Fig. 3, we can observe that the overall trends for the conformational behavior in the different solvents become evident after around 200–500 ns, while nearly full convergence is achieved after around 2 μs. The same conclusion can be obtained by considering Fig. S7 and S9 in the SI, where Fig. S9 exhibits the two-dimensional FES for the radius of gyration and distances between two warheads (see Fig. 4 below) obtained at different times. This demonstrates that shorter simulation times can be employed to characterize the differences in conformational behavior between solvents, especially when one is only interested in understanding trends, such as when characterizing and screening PROTACs in a drug discovery environment.
To ensure the reliability of our simulation procedure, we conducted multiple independent runs: three for water and two for both chloroform and DMSO. Analysis of global CVs revealed that all independent runs converged to the same results for each solvent, indicating strong consistency. The results for the water solvent setup are shown in Fig. S3–S5 in the SI, where further discussion on the methodological validation can be found. These results validate our methodology and demonstrate its effectiveness in sampling the conformational space of flexible PROTAC molecules like MZ1. We also perform a simulation for the water solvent setup where we consider the radius of gyration as an additional biased CV, in addition to the local dihedral angles CVs, similar to what was done in a previous study on an intrinsically disordered peptide.50 In these simulations, we obtain the same results for the global CVs as for the runs that only bias the dihedral angles as shown in Fig. S6 in the SI. This further validates our methodology and shows it is sufficient to consider only the local dihedral angles as biased CVs to obtain an accurate sampling of conformational space.
In addition to the radius of gyration, we consider various distances to illustrate the behavior of the MZ1 in different environments. Specifically, we identify important moieties in MZ1 from two warheads, as shown in Fig. S2 in the SI. We define the center of geometry of each moiety and calculate possible distances between these centers from both BRD4 and VHL warheads. One important distance that plays a significant role in our analysis is the end-to-end distance, or the distance between the two warheads, defined as the distance between the 6-membered ring moieties in the two warheads, as shown in Fig. 4(a). We show the distance distributions obtained in the different solvents in Fig. 4(b), where we can observe rather similar behavior as for the radius of gyration. The results indicate that in water, the molecule is compact and collapsed, while in DMSO it is more extended, flexible, and samples a wider range of conformations. The results in chloroform indicate that the conformations are more collapsed compared to those in DMSO. The collapsed conformations that we observed for MZ1 in apolar solvents like chloroform are significant, as it correlates with increased cell permeability.11 In Fig. 4(c), we present two-dimensional energy surfaces of the distance between the two warheads and the radius of gyration where we can observe that these two variables are highly correlated, as one might expect. The correlation is especially clear for DMSO. We also observe a single minimum in water, multiple minima in chloroform, and single broad minimum in DMSO.
Another important distance we consider is that between the tert-butyl group from the VHL warhead and the phenyl group from BRD4 warhead as shown in Fig. 5(a). From the results shown in Fig. 5(b) and (c), we can observe a similar pattern as before, where MZ1 is collapsed in water and more extended and flexible in DMSO.
To better understand the conformational behavior, we extract representative structures from the minima in the free energy surfaces for the distance between its two warheads and radius of gyration shown in Fig. 4(c). We illustrate the different MZ1 conformations in Fig. 6. As previously discussed, in water, the conformational landscape reveals a single minimum (panel a), corresponding to fully collapsed structures. In contrast, chloroform displays four distinct minima. The first minimum (panel b) resembles the collapsed conformations observed in water. The second (panel c) and third (panel d) minima also represent collapsed structures. However, the fourth minimum (panel e) shows semi-collapsed conformations. As discussed above, the chloroform radius of gyration distribution in Fig. 2 has two peaks. Here, the peak with radius of gyration values around 5.8 Å corresponds to structures with similar distance between warheads (panel c). The peak with radius of gyration values around 6.6 Å corresponds to structures with different distance between warheads (panels b, d, and e). For DMSO, the free energy minimum is much broader, so we analyze three representative areas. Structures obtained from the lower edge of the minimum (panel h) closely resemble the semi-collapsed conformation as seen in chloroform (panel e). Meanwhile, structures obtain from the center of the minimum (panel g) and the upper edge (panel f) show extended conformations of MZ1. This analysis provides a clearer understanding of how MZ1 adopts different conformations in water, chloroform, and DMSO.
![]() | ||
| Fig. 6 Representative MZ1 structures extracted from the minima of the free energy surfaces for the distance between its two warheads and radius of gyration (Fig. 4(c)). The blue box (panel (a)) shows conformations in water, the orange boxes (panels (b)–(e)) represent conformations in chloroform, and the green boxes (panels (f)–(h)) correspond to conformations in DMSO. These structures are obtained by extracting the conformations located within the small black box (indicating the low-free-energy region) shown in the figure. | ||
Solvent accessible surface area (SASA) is a key global collective variable that quantifies the total surface area of a biomolecule accessible to the solvent, see Fig. 7(a). This parameter provides valuable insights into the compactness and conformations of a molecule.80 Specifically, smaller SASA values indicates more compact, globular structures that have a smaller surface area exposed to the solvent, whereas larger SASA values suggests more extended or unfolded conformations with larger surface area accessible to the solvent. A molecule with larger SASA values may be more soluble in a solvent, as it has a larger surface area exposed to the solvent. Furthermore, SASA can be used to monitor important conformational changes in a molecule, such as folding or unfolding.87 Our results, depicted in Fig. 7, reveal a similar trend as above: SASA values are lower in water, indicating more compact structures, while they are higher in DMSO, suggesting extended conformations. Chloroform, on the other hand, exhibits SASA values lower than those in DMSO, reflecting collapsed conformations. We can observe from the two-dimensional free energy surfaces for SASA and the radius of gyration that these two global CVs are correlated, though the correlation is not as strong as for distances and the radius of gyration in Fig. 4 and 5.
Polar surface area (PSA) is a widely used molecular descriptor in drug transport studies, as it reflects the potential of a molecule to form hydrogen bonds and has been shown to associate with aqueous solubility and permeability.81 PSA is often correlated with the ability of a compound to pass through cell membranes and its potential for oral absorption, as molecules with high PSA typically exhibit reduced membrane permeability.88 In addition to its role in solubility and permeability, PSA can also be employed to estimate the potential for adverse drug reactions, such as interactions with transporters or enzymes in the body.14
There are different definitions of PSA used in literature; thus we have employed various definitions of PSA based on the existing literature to understand the differences among them. These different definitions are summarized in Table 1. First, we define PSA as the total surface area attributed solely to polar atoms and atoms with high electronegativity and the potential to engage in polar interactions, specifically oxygen, nitrogen, sulfur, chlorine.89 Second, PSA can be defined as the surface area contributed by selected atoms and the hydrogen atoms bonded to these electronegative atoms.81 Third, we define PSA as the combined surface area of selected atoms and carbon atoms that are either double-bonded to these electronegative atoms or bonded to two polar atoms. Finally, we expand the PSA definition to include the surface area from selected atoms, and hydrogen and carbon atoms included in the previous two definitions. It has been emphasized that incorporating carbon and hydrogen atoms in calculations such as PSA is essential, as these atoms significantly influence how a molecule interacts with its environment, including its solubility in solvents and its ability to bind to biological targets, which are critical factors in the design of effective drugs.14 The results for the different PSA definitions are shown in Fig. 8. We can observe that all definitions show a consistent trend of similar PSA values in water and chloroform, while PSA values in DMSO are higher. We can observe that including the hydrogens in PSA has a larger effect on differences between solvents (compare panels a and b), than including the attached carbon atoms (compare panels a and c). For PSA with only polar atoms (panel a), the water and chloroform values are almost the same. However, including the hydrogens in PSA (panel b) leads to a small relative shift of chloroform PSA to higher values compared to water. Adding carbon atoms (panel c) only leads to a near-constant shift of all distributions to higher values, due to PSA being calculated over larger number of atoms, while the relative differences between solvents are similar. These results indicate that it is beneficial to include the bonded hydrogen atoms in PSA calculation. Thus, in the following, we will employ the second definition of PSA calculated over the polar atoms and the hydrogen atoms bonded to these electronegative atoms, see Fig. 9(a).
| Definition | Selected atoms |
|---|---|
| 1 | Polar atoms and atoms with high electronegativity (N, S, O, Cl) |
| 2 | Polar atoms and atoms with high electronegativity + hydrogens attached to these atoms |
| 3 | Polar atoms and atoms with high electronegativity + carbons attached to two or more of these atoms |
| 4 | Polar atoms and atoms with high electronegativity + hydrogens attached to these atoms + carbons attached to two or more of these atoms |
![]() | ||
| Fig. 9 Conformational analysis of MZ1 in different solvents: (a) schematic representation of the MZ1 molecule, highlighting all the polar atoms and hydrogens attached to it, used to define polar surface area (PSA) (definition 2 in Table 1) (b) density plot of PSA in water, chloroform, and DMSO. The upper panel shows the average and standard deviation for each solvent. (c) Free energy surfaces of MZ1 as a function of PSA and the radius of gyration. | ||
Our results for PSA in different solvents in Fig. 9 reveal distinct solvation effects. Specifically, PSA in chloroform is lower than in the DMSO, reflecting the weaker interactions between the PROTAC molecule and the relatively non-polar environment of chloroform. In contrast, DMSO exhibits the highest PSA values (approximately 450 Å2), indicating stronger solute–solvent interactions due to the high polarity and hydrogen-bonding capacity of DMSO. For water and chloroform, the PSA value distributions have considerable overlap. In water, PSA values are slightly lower, with the polar atoms being less accessible to the solvent. In contrast, in chloroform, PSA values are slightly higher, indicating that the polar surface is more accessible to the solvent. These results suggest that the solvent's polarity and its ability to interact with the polar regions of the PROTAC molecule play a crucial role in determining the exposed polar surface area, which can influence the molecule's behavior in different environments. We also present two-dimensional free energy surfaces of PSA and the radius of gyration (panel c), where we can observe that the two variables are somewhat correlated, though the correlation is not as strong as for distances and the radius of gyration above (Fig. 4 and 5). Additionally, the results for other definitions of PSA are shown in SI, see Fig. S12–S14.
Interestingly, looking at the water and chloroform results, there is a slight difference between SASA and PSA values. SASA distinguishes the water and chloroform, though there is some overlap of the two distributions, with the chloroform SASA distribution tending to higher values than for water (see Fig. 7(b)). However, for PSA, the chloroform and water distributions have much more significant overlap, with the chloroform distribution tending to slightly higher values as compared to the water distribution for the second PSA definition (see Fig. 9(b)), though this depends on the definition of PSA employed as discussed above. The two-dimensional free energy surface of SASA and PSA is shown in Fig. S11 in the SI, where we can observe that these two variables are somewhat correlated, though the correlation appears to be less significant in water, as compared to chloroform and DMSO. These results indicate that SASA and PSA can measure different properties of the conformational ensemble.
The conformational flexibility of the MZ1 molecule, influenced by the solvent environment, plays an important role in its bioavailability and membrane permeability. Since MZ1 is flexible, studying its conformational ensemble is essential, as a single structure does not adequately represent its behavior. In polar solvents like water, MZ1 may adopt more compact conformations, which can potentially facilitate higher membrane permeability by reducing their size and increasing lipophilicity. However, this compact form and the reduced flexibility can affect the molecule's ability to effectively engage both the target protein and the E3 ligase, which is crucial for its mechanism of action. Chloroform exhibits collapsed conformational behavior similar to that of water. Chloroform mimics the hydrophobic membrane environment, and it has been observed that the propensity to adopt collapsed conformations in chloroform correlates with increased cell permeability.11 Therefore, our observations of collapsed conformations in chloroform are significant, suggesting that in apolar environments, MZ1 takes on forms suitable for membrane permeation. This highlights the potential of chloroform to serve as a proxy for studying membrane interactions and permeability.
Our results are consistent with NMR experiments and MD simulation findings reported in a recent study on MZ1, which also observed similar conformational behavior under different solvent conditions, with MZ1 being collapsed in water and extended in DMSO.18
Looking ahead, applying polarizable force fields could enhance the accuracy of the simulations by better capturing solvent effects and molecular interactions.90,91 Furthermore, machine learning potentials offer a promising direction for future research, as integrating such machine learning-based potentials with the current methodology could improve predictions of PROTAC behavior in diverse biological environments.92,93 Another key area for future work is the integration of experimental data (e.g., from NMR spectroscopy and small-angle X-ray scattering) with molecular simulations.94–97 This would refine and validate our models, ensuring that the simulated predictions closely align with real-world behavior.
The GROMACS and PLUMED input files required to reproduce the results reported in this paper are available on PLUMED-NEST (https://www.plumed-nest.org), the public repository of the PLUMED consortium,72 as plumID:25.006 at https://www.plumed-nest.org/eggs/25/006/. Further data supporting the results reported in this paper is available upon request from the authors.
| This journal is © the Owner Societies 2025 |