Sanat Kumar Mahapatra
a,
Bijoy Ghosh
b and
Lisa Roy
*b
aInstitute of Chemical Technology Mumbai – IOC Odisha Campus Bhubaneswar, Bhubaneswar 751013, India
bDepartment of Education, Indian Institute of Technology Kharagpur, Kharagpur 721302, India. E-mail: L.Roy@edu.iitkgp.ac.in
First published on 11th February 2026
DFT calculations uncovered the crucial role of non-structural water molecules trapped inside the cavity of a self-assembled hexameric resorcinarene capsule in the reaction between N-methylpyrrole and phenyl isocyanate to generate an amidopyrrole through two crucial steps: C–C coupling and a 1,3C → N proton wire mechanism. Incidentally, the reaction in the bulk solvent is energetically demanding and underscores the role of encapsulation. In fact, non-structural incorporation of water molecules modulates the acidity of the hexameric resorcinarene capsule through hydrogen-bonded networks. This results in a shift in the rate-determining transition state to the initial C–C coupling step inside the capsule, as opposed to the energy intensive proton-wire mechanism in solution. Thus, the incorporated water molecules within the supramolecular cavity helps in accelerating the overall rate of the acid-catalyzed transformation. Our work, thus, indicates the advantage of the ubiquitous presence of water-enriched local domains in providing utmost control to the supramolecule in the catalytic process. These local hydrophilic domains with water clusters, encapsulated within organic molecules, emulate the kinetics, selectivity and mass transfer observed in some natural processes. The enhanced catalytic activity is primarily attributed to water molecule’s ability to stabilize the reactants, products, intermediates or transition states through remote bond polarization, or proton shuttling, or ability of water to act as a co-catalyst.
![]() | ||
| Scheme 1 Amidopyrrole formation from pyrrole and isocyanate in the hexameric resorcinarene capsule.7b | ||
Recently, several supramolecular research groups have unfolded key reactions utilizing the self-assembled hexameric resorcinarene capsule C (Scheme 1).7 Particularly interesting is the coupling of pyrroles and isocyanates to yield amidopyrroles in water-saturated chloroform in the mandatory presence of the capsule (Scheme 2), as reported by La Manna and co-workers in a recent study.7b This is extremely relevant in the context of druggable targets and inhibitory activities by molecular cage systems.8 Originally reported by Atwood and MacGillivray,9 capsule C in its solid state consists of a chiral spherical molecular assembly of six resorcinarene molecules and eight water molecules, held together by 60 hydrogen bonds. In an apolar solvent medium, the capsule features a well-defined cavity with an internal volume of about 1375 Å3 and elevated Brønsted acidity (pKa = 5.6 as referenced by protonation of pyridine)7d compared with the monomeric units due to the optimal existence of the H-bond network. Furthermore, a combined NMR spectroscopy and classical molecular dynamics (MD) simulation study identified two distinct conformers of the hexameric undecyl-resorcin[4]arene capsule (CA and CB), featuring 8 and 15 hydrogen-bonded water molecules.10,11 In fact, peripheral structural changes in the capsule effectuated by water molecules are believed to modulate the acidity of the supramolecular assembly, resulting in a 10-fold enhancement in the catalytic rate of an acid-catalysed Diels–Alder reaction by CB as compared to CA.10 Additionally, NMR diffusion studies detected several chloroform molecules inside the cavity of this superstructure, further adding layers of complexity to the self-assembly strategy.12 Indeed, the capsule is surmised to have a strong affinity towards zwitterionic tetrahexylammonium bromide over chloroform molecules, indicating a strongly electrostatic host environment.12 Hence, during self-assembly in H2O-saturated chloroform (Scheme 2), one cannot rule out the trapping of activated non-structural water molecules in addition to the structural water molecules in the H-bonded networks. This concept is in line with the discovery of isolated water droplets inside a Ga4L612− supramolecular tetrahedral cage employing THz absorption spectroscopy and ab initio molecular dynamics.13 Although mechanistic studies on cavity-controlled selectivity are reported in the literature,4,10,14 to date, an understanding of the role of trapped water molecules in reactivity is underexplored and warrants a thorough investigation. This, we believe, will open new channels to achieve highly efficient organic transformations in locally water-enriched media, as pervasive and regulated in Nature.
![]() | ||
| Scheme 2 Reaction conditions for the coupling of 2a and 3a to yield 4aa in the presence of the capsule C, with snapshots of the proposed reaction species. | ||
In the present work, therefore, we undertake a DFT study to explore the binding of additional non-structural water molecules, which have greater geometrical flexibility than the water molecules in the hydrogen-bonded networks of capsule C, and unravel the mechanistic intricacies of the coupling between N-methylpyrrole (2a) and phenyl isocyanate (3a) (Scheme 2). We hypothesize two crucial steps: C–C bond formation through nucleophilic attack of the pyrrole on the carbonyl of the isocyanate and a 1,3C → N proton wire mechanism through acid–base catalysis (Scheme 2).
| EONIOM(QM/MM) = EQM,model + EMM,real − EMM,model, | (1) |
| Ehigh,real ≈ EONIOM2(high:low) = Ehigh,model + Elow,real – Elow,model. | (2) |
:
1 ratio within the domain of the CPCM solvation model, the following dielectric parameters of water were used: static dielectric constant (ε = 78.35), dynamic dielectric constant (ε∞ = 1.77), hydrogen-bond acidity (1.17) and basicity (0.4), the surface tension of the solvent at interface (23.23), carbon aromaticity (0.0) and electronegative halogenicity (0.0), while the following parameters of chloroform were considered: static dielectric constant (ε = 4.7), dynamic dielectric constant (ε∞ = 2.09), hydrogen-bond acidity (0.15) and basicity (0.02), the surface tension of the solvent at interface (38.39), carbon aromaticity (0.0) and electronegative halogenicity (0.75), following Paton and co-workers’ methodology.19 Relative Gibbs free energies are evaluated by adding thermal corrections obtained from vibrational analyses to the solvent-phase electronic energies. Additionally, a correction term of 1.89 kcal mol−1 (at 298.15 K) was included as necessary to account for the standard-state concentration of 1 M, especially to account for the solvation of many species, including micro-solvated water molecules, taking part in the transition state.20 Thus, the discussions in the main text are provided based on Gibbs free energies (kcal mol−1) at the B3LYP-D3(BJ)/CPCM {H2O
:
CHCl3 = 1
:
1}/Def2-TZVP//ONIOM[M06-2X/CPCM(CHCl3)/6-31G(d,p):PM6] level of theory. Furthermore, to evaluate the dependence on the DFT functional, energies are computed at B3LYP/CPCM {H2O
:
CHCl3 = 1
:
1}/Def2-TZVP//ONIOM[M06-2X/CPCM(CHCl3)/6-31G(d,p):PM6] and M06-2X/CPCM {H2O
:
CHCl3 = 1
:
1}/Def2-TZVP//ONIOM[M06-2X/CPCM(CHCl3)/6-31G(d,p):PM6] (see Table S2). To check for dependence on solvent dielectric constant, single-point energies are also conducted in chloroform as solvent with B3LYP-D3(BJ), M06-2X and B3LYP functionals (see Table S3).
To verify the dependence of the low-level method on predicting the structures and thereby affecting the overall relative energies, we have conducted additional geometry optimizations at ONIOM[B3LYP-D3(BJ)/ALPB(CHCl3)/Def2-SVP:XTB] with ORCA 5.0.315b and further single-point energy refinements at the B3LYP-D3(BJ)/ALPB(CHCl3)/Def2-TZVP level of theory. For all geometry optimizations, the following thresholds were used: convergence tolerance of 5 × 10−6 Eh for energy changes, 3 × 10−4 Eh per Bohr for maximum gradients, 1 × 10−4 Eh per Bohr for root-mean-square (RMS) gradients, 4 × 10−3 Bohr for maximum displacements, and 2 × 10−3 Bohr for RMS displacements. The numerical integration Grid6 was used throughout. Tight convergence criteria with an energy tolerance of 1 × 10−8 Eh for self-consistent field (SCF) achievement were utilized for all ORCA computations.
To gather an in-depth analysis of the key interactions responsible for relative energies, local energy decomposition (LED) partitioning of DLPNO-CCSD(T)/def2-TZVPP was utilized.21 The theoretical background of this useful tool has been detailed elsewhere.21 In principle, the binding energy (ΔE) between a pair of fragments X and Y is hypothesized to consist of “geometric preparation energy” (ΔEgeo-prep) or the amount of strain required to distort them from their equilibrium structures to an optimal point of interaction and the inter-fragment interaction energy (ΔEint). The latter is further decomposed into various physical components, such as electrostatic interaction (ΔEelstat), quantum mechanical exchange (ΔEexchange), charge-transfer correlation (ΔECT), and the London dispersion energy (ΔEdisp). In short, ΔEint is summed up into dispersive (ΔEdisp) and non-dispersive (ΔEnon-disp) components. Hence, according to this scheme, the relative free energy of activation for a transition state is given by:
![]() | (3) |
NBO second-order perturbation theory analysis and Wiberg bond indices were calculated at the B3LYP-D3(BJ)/6-311+G(d,p) level, taking only the fragments of the substrates out of the relevant optimized geometries (Fig. S2). Within the context of conceptual DFT, Parr et al. defined the global electrophilicity index ω as:
![]() | (4) |
| ω+(r) = ωf+(r), | (5) |
![]() | (6) |
Next, we consider the assistance of a single monomeric resorcin-arene unit to facilitate the coupling. However, here, the optimum anchorage of the monomer to the substrates through dispersion interactions enables the reactant complexes as the thermodynamic reference to be used to compute the activation-free energy barrier (see Fig. 1). Interestingly, the rearrangement of the two reactants into the intermediate is a mildly endergonic process in the presence (INT1@MWA, ΔG = 1.1 kcal mol−1) or absence (INT1@MU, ΔG = 2.6 kcal mol−1) of water assistance with the monomer (M) and features a new amido-NH bond. Furthermore, strong, attractive H-bonds stabilize the preceding (TS1@MWA and TS1@MU) and succeeding (TS2@MWA and TS2@MU) transition states as compared to the significantly energy-intensive reaction in solution (Fig. 1). In fact, encapsulation of the substrates in the single resorcin-arene moiety is impacted by a change in the rate-determining barrier to the initial nucleophilic addition of pyrrole to isocyanate (TS1@MWA = 14.2 kcal mol−1).
At this stage, our analysis reveals that a large geometric distortion is required to constrain the pyrrole and isocyanate in TS1, particularly the angular bending of the N–C–O moiety (Fig. 1, inset). Notably, bifurcation of the activation energy (ΔE‡) into distortion energy (ΔEdist), due to geometric rearrangement, and interaction energy (ΔEint), due to overlapping orbitals of the two substrates, provides meaningful insights into the C–C coupling transition states (Fig. 2).23 As shown in Fig. 2, ΔEdist dictates the extent of ΔE‡. A detailed investigation shows that TS1@SU suffers the greatest geometric distortion in the substrates (46.2 kcal mol−1), while TS1@MWA experiences the least (25.3 kcal mol−1). This observation is in line with the trend in second-order perturbation analyses, E(2), between the donor and the acceptor orbitals24 in transition states as follows: TS1@SU (29.0 kcal mol−1) > TS1@SWA (37.1 kcal mol−1) > TS1@MU (71.3 kcal mol−1) > TS1@MWA (97.9 kcal mol−1) (Fig. S1), emphasizing the crucial role of anchorage through non-covalent interactions.
![]() | ||
| Fig. 2 Energy decomposition analysis TS1@SU, TS1@SWA, TS1@MU, and TS1@MWA at B3LYP-D3BJ/Def2-TZVP/CPCM in kcal mol−1. | ||
To further understand the factors responsible for the unusual C–C coupling in bulk implicit solvent as compared to monomeric confinement, we analysed the various physical interactions between the substrates and the resorcin-arene monomer using the DLPNO-CCSD(T)/LED methodology. As observed in Table 1, for all the transition states, a large amount of energy is invested in geometric preparation. As expected, detailed investigation demonstrates that ΔE‡geo-prep-3a for substrate 3a (isocyanate) is the largest in TS1@SU, followed by TS1@SWA, TS1@MWA, and TS1@MU, arising from the addition of pyrrole to the bent, congested orientation of isocyanate. Hence, a comparison of TS1@SU and TS1@SWA shows that the former is hugely destabilizing. Decomposition of ΔE‡int into dispersive and nondispersive components reveals that London dispersion forces preferentially stabilize the transition states, further emphasizing the crucial role of H-bonding networks.21c The attractive dispersion forces are significantly larger in TS1@MWA compared to its competing transition state, TS1@MU, reversing the trend of ΔE‡geo-prep-3a, and suggesting that a balance of repulsive steric interactions and dispersive interactions is responsible for the desired outcome (Table 1).
| TS1@SU | TS1@SWA | TS1@MU | TS1@MWA | |
|---|---|---|---|---|
| ΔG‡ | 41.6 | 25.6 | 23.1 | 18.2 |
| ΔE‡ | 38.9 | 23.4 | 23.0 | 15.0 |
| ΔE‡geo-prep | 45.7 | 33.0 | 25.4 | 23.7 |
| ΔE‡geo-prep-3a | 35.0 | 27.2 | 18.8 | 21.4 |
| ΔE‡int | −6.8 | −9.6 | −2.4 | −8.8 |
| ΔE‡disp | −8.4 | −7.8 | −0.1 | −7.5 |
| ΔE‡no-disp | 1.6 | −1.8 | −2.2 | −1.2 |
Thereby, we diverge on the reactivity inside the capsules, which are inherently reaction vessels, bio-mimicking the active site of several (metallo)enzymes.25 Notably, for reactions within the confined environment, the self-assembly binding enthalpy is expected to overshadow the entropic cost; hence, reactant complexes would be the reference point necessary to channelize the reaction. Analogous to RC@CU (Fig. 3), in the experimental study, the authors propose the formation of a heterocomplex 2a + 3a@C, on the basis of reactivity inhibition in the presence of competitive hosts such as [NEt3]+ and during H-bond disruption by DMSO.7b The optimized geometries of RC@CU and RC@CWA show the guest molecules align in a T-shape, such that any susceptible steric repulsions within the cavitand are judiciously avoided. The energy barriers for the C–C coupling step in the presence (via TS1@CWA, ΔG‡ = 14.2 kcal mol−1) or absence (via TS1@CU, ΔG‡ = 18.3 kcal mol−1) of the non-structural water molecule inside the capsule are comparable to the reactions with M, highlighting similar dispersion effects imposed by the cavity. It may be presumed that intermediate INT1@MWA is formed through monomer assistance,14c and is trapped inside the capsule, facilitated by the capsule's acidity, where it undergoes the proton-wire mechanism (Fig. 3). Indeed, our calculations reveal an enhancement in the average pKa values of the alcoholic –OH groups on going from MU(8.2) and MWA(8.0) to CU(3.2) and CWA(2.3). Thus, while TS2@MWA and TS2@MU display activation energies in the order of 7.9–8.6 kcal mol−1, the corresponding barriers for TS2@CWA and TS2@CU are significantly curtailed to 2.6 and 5.6 kcal mol−1, respectively. Notably, the formation of product 4aa@CWA is calculated to be highly exergonic compared to all the other cases studied.
Furthermore, molecular electrostatic potential (MEP) plots of the catalytic frameworks in INT1 show an electropositive environment around the –OH groups at the rim, while a distinct electronegative cloud emerges surrounding the phenoxide (O–) moiety (Fig. 4). Additionally, the calculated high nucleophilicity (ω− = −7.8 eV) on the catalytic framework in INT1@CWA can be directly correlated to the capsule's augmented Brønsted acidity in the presence of the additional hydrogen-bonded water molecule. In fact, the electrophilicity (ω+ = 15.1 eV, Table S4) of the pyrrolic C–H unit and the low Wiberg bond index (0.75, Fig. S3) in INT1@CWA highlight that an electronic balance is achieved in TS2@CWA for the synchronous C–H activation coupled to the proton shuttle. Thus, our results validate that the water-assisted capsule is the active catalyst. Reduced density gradient (RDG) plots further support our understanding of augmented NCIs in water-assisted transition states for C–C coupling (Fig. S5).
![]() | ||
| Fig. 4 MEP plot and computed nucleophilicity of the catalytic framework in INT1. The colour spectrum ranges from negative (blue) to positive (red) potential within ±0.01 au. | ||
Nevertheless, our work also hints towards the advantage of the ubiquitous presence of water-enriched local domains, which regulate reaction kinetics, selectivity, and mass transfer in natural processes, primarily due to the unique properties of water molecules, such as remote bond polarization, proton shuttling, and the ability to act as co-catalysts.26 Interestingly, such a natural phenomenon is laterally existent in several industrial-grade processes, working in confined or unconfined environments.26 Recently, the promotional role of water as an active mediator in C–C coupling reactions has been verified by isotope effect experiments and molecular dynamics simulations.26b Indeed, a comparison with one-, two- and three-water molecule-assisted pathways in confinement of the capsule together with an unassisted mechanism strongly support our argument that non-structural water molecules form a bridge between the catalyst and the substrates, polarizing the neighbouring terminals, discharging the steric constrains associated with the optimally oriented substrates, and thereby promoting the C–C coupling step with predominantly low-barriers ranging between 12.7 and 16.4 kcal mol−1 as opposed to the unassisted mechanism with a 26.6 kcal mol−1 reaction barrier predicted by B3LYP-D3BJ/Def2-TZVP/ALPB(CHCl3) calculations on ONIOM[B3LYP-D3(BJ)/ALPB(CHCl3)/def2-SVP:XTB] optimized geometries (Fig. 5).
![]() | ||
| Fig. 5 Gibbs free-energy profiles in kcal mol−1 at B3LYP-D3BJ/Def2-TZVP/ALPB(CHCl3) on ONIOM[B3LYP-D3(BJ)/ALPB(CHCl3)/def2-SVP:XTB]15b optimized geometries for the coupling of 2a and 3a to yield 4aa inside capsule C. @CWA = water-assisted capsule (solid black line); @C2WA = two-water-assisted capsule (dashed black line); @C3WA = three-water-assisted capsule (dotted black line) @CU = unassisted capsule (shown by red line). Subscript O refers to reactions studied with ORCA 5.0.3. | ||
| This journal is © the Owner Societies 2026 |