Open Access Article
Haojiang Yaoab,
Zhenhao Zhoubc,
Ye Liu
*d,
Dong H. Zhang
*b and
Guohui Li
*bd
aSchool of Chemistry and Materials Science, University of Science and Technology of China, Hefei, 230026, China
bState Key Laboratory of Chemical Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, China. E-mail: ghli@dicp.ac.cn; zhangdh@dicp.ac.cn
cUniversity of Chinese Academy of Sciences, Beijing 100049, China
dInterdisciplinary Research Center for Biology and Chemistry, Liaoning Normal University, Dalian, 116029, China. E-mail: yeliu@lnnu.edu.cn
First published on 31st March 2026
The HSiR3/KOtBu catalytic system affords a notable regioselective C–H silylation reaction using alkaline earth metals, yet its mechanistic origins and the temperature-dependent selectivity inversion remain unresolved. Here, we combine active learning neural network potentials with long-timescale ab initio quality molecular dynamics and enhanced sampling to simulate the HSiEt3/KOtBu catalyzed deprotonation reaction of N-methylindole in a complete operando solution environment. We found that KOtBu can rapidly aggregate in organic solution to form catalyst clusters, which are able to activate and stabilize a hydride derived from silane, and this operando generated hydride is the catalytically active species. The hydride is able to deprotonate C2 and C3 protons on N-methylindole, and this reaction site selectivity is affected by the stability of the cluster controlled by solvent and temperature coupling, which gives rise to nonlinear activation entropy of the C3 pathway in the neat silane environment. Unlike tetrahydrofuran solutions, the weak polarity of neat silanes makes it difficult to support deprotonation reactions at the less acidic C3 position at high temperature, thus affecting temperature-dependent selectivity. These results provide a microscopic predictive framework for tuning the selectivity in base-promoted hydrosilane chemistry and illustrate how explicit operational solvation and cluster formation can reshape mechanistic conclusions drawn from static models.
A notable breakthrough was reported by Grubbs and co-workers, who developed an Earth-abundant metal C–H silylation protocol using potassium tert-butoxide (KOtBu) and hydrosilanes as the silicon source (Scheme 1a).8 This HSiR3/KOtBu system exhibits broad substrate scope, high regioselectivity, and operational simplicity and has since been extended to numerous heteroaromatic and carbocyclic substrates. Despite its synthetic utility, however, the microscopic mechanism of this catalytic system remains under active debate. Several pathways have been proposed, including electron transfer,9–12 hydrogen atom transfer13 and hydride transfer mechanisms (Scheme 1),10,13–15 reflecting the mechanistic complexity of this system and its sensitivity to reaction conditions.
Particularly intriguing is the experimentally observed dependence of regioselectivity on solvent and temperature, including cases of temperature-dependent selectivity inversion.11 Desorption electrospray ionization mass spectrometry (DESI-MS) experiments have identified transient deprotonated heteroarenes and pentacoordinate silicate species consistent with an ionic hydride-transfer pathway.15 The origin of regioselectivity, and how it is affected by reaction conditions, has not been explained at a molecular level. In particular, it remains unclear how the solvent environment and temperature cooperatively modulate the stability of reactive intermediates and transition states in this strongly ionic system.
From a theoretical perspective, most mechanistic studies of the HSiR3/KOtBu system have relied on static quantum chemical calculations, often employing implicit solvent models or small, predefined coordination clusters. While these approaches can provide qualitative insights, they inherently neglect the dynamic formation of multimeric catalytic species and the fluctuating solvation structures present under operando conditions.16 This limitation becomes especially severe when charged intermediates or hydride species17,18 are involved, as static models fail to capture localized polarization, nonlinear dielectric responses,19 and temperature-dependent entropic effects.20 As a result, key experimental phenomena, such as solvent-specific reactivity trends and the nontrivial temperature dependence of selectivity, remain inaccessible to conventional static models.
Here, we address these challenges by employing ab initio quality machine learning molecular dynamics (MLMD) simulations combined with an active-learning workflow to investigate the HSiR3/KOtBu catalytic system under realistic solution-phase conditions. This fully dynamic approach enables explicit treatment of solvent molecules, operando formation of multimeric KOtBu clusters, and temperature-dependent structural fluctuations, without imposing preconceived mechanistic assumptions. Crucially, this workflow extends mechanistic studies of solution-phase reactions beyond static, few-molecule models to simulations of chemically realistic, many-body environments, allowing direct modeling of operando experimental conditions at ab initio accuracy.
By systematically comparing tetrahydrofuran (THF) solvent and neat silane environments over a broad temperature range, we uncover how solvent–temperature coupling governs hydride generation, free energy of deprotonation, and ultimately product selectivity. Our results provide a microscopic and thermodynamic framework that rationalizes experimentally observed selectivity patterns and their temperature-dependent inversion, establishing dynamic hydride transfer as the key mechanistic origin of catalysis in this system.
200
000 steps, covering approximately 150
000 conformations.
![]() | ||
| Fig. 1 (a) Active learning for neural network potential training and chemical space exploration workflow. (b) Boxes constructed to simulate catalytic reactions in real solvent environments. | ||
We employed the well-tempered metadynamics (WT-MetaD) method,26 which is an enhanced-sampling technique used in MD to efficiently explore free-energy landscapes and accelerate rare events. In WT-MetaD, a history-dependent bias potential is added along selected collective variables (CVs) to encourage the system to escape free-energy minima. A Gaussian bias V(s,t) is added in CV space:w
An equalized metadynamics bias is applied along the CV to increase the probability of molecular collisions leading to a reaction. The advantage of temperature-controlled metadynamics is that it limits the size of the deposition bias, prevents the exploration of unphysical regions of the reaction space, and allows for accurate calculation of the free energy.27 Free energy curves calculated using WT-MetaD are verified for convergence (Fig. S3 and S4).
i is the averaged force on atom i from the four models, and Fαi is the predicted force on atom i from model α. Two thresholds were set for labeling: σflow = 0.20 eV Å−1 and σfhigh = 0.35 eV Å−1. For each exploration iteration, the average model deviation σfavg was calculated. Conformations with σf values between (σflow + σfavg) and (σfhigh + σfavg) were considered informative and were selected for labeling via DFT calculations and then added to the training set. Structures with σ below the lower bound were regarded as accurately described, while those exceeding the upper bound were discarded as unphysical. This iterative workflow, combining NNP training with WT-MetaD, was repeated until the training dataset was sufficiently comprehensive (Fig. S2).
The formation of a free hydride species in solution is exceptionally rare and mechanistically intriguing. Its microscopic pathway is often ambiguous because a hydride can only exist when a catalyst provides a strongly electrostatic and highly stabilizing environment, typically through operando formed clusters.38–40
In the first stage of the reaction, we defined a collective variable (CV) based on the KOtBu-HSiEt3 distance to monitor their mutual approach and subsequent bond rearrangement.
In the initial THF environment simulation box, KOtBu ion pairs were placed in a dispersed configuration (Fig. 2). As the simulation progressed, KOtBu gradually organized into stable clusters.
![]() | ||
| Fig. 2 (a–d) Sequential snapshots at 0 ps, 20 ps, 100 ps, and 2 ns, highlighting the evolution of KOtBu molecules; other molecules are rendered transparent. | ||
The time-resolved MLMD trajectory clearly shows the aggregation process: a dimer appears in the first 20 ps, followed by the formation of a trimer at around 100 ps. After a lot more time has passed, a tetramer (the primary clusters capable of forming hydrides) appears for the first time around 2 ns.
To place the simulated potassium alkoxide aggregate in structural context, it is useful to compare it with representative single-crystal potassium alkoxide structures. Chisholm et al.41 showed that KOtBu can be isolated as a cubane-like tetramer [KOtBu]4, while crystallization from tBuOH-containing media affords the hydrogen-bonded ribbon-chain precursor [KOtBu·tBuOH]n. Under strongly donor-stabilizing conditions, lower-order species such as elemental [K(18-crown-6)(OtBu)] can be obtained, where crown ether coordination saturates the potassium center.42 It can thus be seen that in weakly polar solvents, potassium alkoxides tend to adopt self-aggregation structures. Therefore, when considering potassium alkoxide catalysts, it is recommended to treat aggregates as the fundamental structural units.
In our simulations, the radial distribution function (RDF) between K+ and the oxygen atom of THF in the system exhibits a first coordination shell peak at r = 2.85 Å (Fig. S5a), with an average coordination number CN = 3.2 for K+ in THF (Fig. S5b). These values are in close agreement with results obtained from similar THF-solvated potassium complex crystals.43,44
We conduct additional simulations with larger and more dilute boxes that more closely approximate experimental composition regimes (Table S1). In these extended systems, tetrameric KOtBu aggregates persist as the dominant and enduring catalytic units throughout molecular dynamics simulations even as collision frequencies decrease (Fig. S6). This indicates that tetramer formation does not solely result from simulation bias originating from the initial high-density setup.
Because KOtBu self-aggregation and complexation of KOtBu with silane constitute competing processes (Fig. S7), these trajectories directly establish that aggregation is the dominant pathway, leading to the operando formation of tetrameric KOtBu clusters that serve as the key reactive units.
Importantly, the formation of the tetramer does not merely “collect” ions; it qualitatively reshapes the reactive coordination chemistry by enabling the tBuO− anion to dynamically interconvert between two coordination motifs: the tetrameric KOtBu cluster and a pentacoordinate silicon intermediate formed upon interaction with Et3SiH (Fig. 3a and S8b). Pentacoordinate and, more generally, hypercoordinate silicon species are well precedented in the literature and have been structurally characterized in a variety of settings.45 These include silicon hydride-containing hypervalent species and hydridosilicates.46–48 In our case, the optimized pentacoordinate silicon adopts a distorted trigonal-bipyramidal arrangement (Fig. S8b), with an elongated axial Si–O bond of 2.03 Å, a trans Si–C bond of 2.01 Å, and two shorter equatorial Si–C bonds of 1.91 and 1.92 Å. The Si–H bond length increased from 1.50 to 1.52 Å upon activation (Fig. S8a).
![]() | ||
| Fig. 3 (a and b) Representative snapshots of the hydride-formation transition state (TS), i.e., hydride transfer and Si–H cleavage from the pentacoordinate silicon, and the corresponding final state (FS). (c) Free energy curves for hydride generation at 300 K under different solvent environments, with IS, TS, and FS labelled. (d) Atomic charges49 of the hydride-origin hydrogen atom at IS, TS, and FS. The coloring of the atoms in all snapshots: C—cyan, H—white, O—red, Si—yellow, K—pink, and H−—purple. | ||
Hydride generation occurs only when the pentacoordinate silicon approaches the highly charged pocket of the tetramer (Fig. 3a and b). In this configuration, the catalytic cluster establishes an intense electrostatic field and a locally polarized electrostatic environment, which collectively weaken the Si–H bond and lower the rearrangement barrier.50,51 Once the Si–H bond undergoes heterolytic cleavage, the nascent hydride is immediately captured by the cluster core, preventing its diffusion into bulk solution. This cluster-assisted Si–H activation is inaccessible in monomeric KOtBu (Fig. S9).
Long-timescale MLMD simulations further confirm that the resulting hydride-bound cluster is kinetically stable. This observation is consistent with experimental intuition: the hydride is an extremely hard base with an exceptionally high charge density, and it cannot exist as a freely solvated species in an organic solution. Instead, it remains sequestered within a tightly multimeric cluster immediately following the cleavage of the Si–H bond.52–54
With this cluster-enabled mechanism established, we next quantify how the solvent environment modulates the same hydride-generation event. The free energy barriers for hydride generation differ slightly under the two examined conditions (Fig. 3c). In THF, the TS appears at 18.6 kcal mol−1, whereas in the neat environment, the TS increases to 21.5 kcal mol−1. Although the barrier is higher in the neat environment, both values fall within the range accessible at ambient temperatures for organic reactions. After crossing the TS, the reaction reaches the FS shown in Fig. 3b, in which a hydride is captured by the KOtBu catalyst cluster.
The charge evolution along the reaction coordinate further highlights the differentiated electrostatic environments created by THF and neat silane (Fig. 3d). In THF, the reacting hydrogen consistently carries a more negative partial charge, from the initial state to the transition state and finally to the hydride-rich product state. This indicates that the polar solvent effectively stabilises the progressive charge separation associated with Si–H heterolysis. The enhanced stabilization is most evident in the product region, where the hydride reaches a substantially more negative charge in THF, reflecting the combined effect of solvent polarity and the well-organised tetrameric potassium cluster. Conversely, the neat silane medium provides weaker dielectric screening; the hydrogen remains less negatively charged at every stage, revealing a reduced ability to accommodate charge redistribution during bond cleavage. This result also implies a difference in the free energy barrier for the deprotonation step under the two environments.
At the same time, the limited stability in a nonpolar environment explains the higher and more temperature-sensitive free energy barriers observed in neat silane. From a mechanistic perspective, THF readily supports hydride polarization through solvation, whereas neat silane relies heavily on the integrity of the multimeric KOtBu cluster to compensate for their weak solvating ability, which makes the reaction pathway more susceptible to thermal disruption of the cluster structure (see below).
The reaction is characterized by a highly ordered transition state in which the hydride nucleophile approaches the C–H bond while the surrounding solvent molecules reorganize to stabilize the charge separation being formed. As a result, the process is entropically disfavored, and its free energy barrier exhibits a pronounced temperature dependence.
To capture these effects, we performed more than 10 ns of well-tempered metadynamics simulations for each solvent environment (THF and neat silane) and temperature (300–450 K); the extended thermal range enables a systematic decomposition of temperature dependence, solvation structure, and entropy contributions.
The simulations reveal a clear and physically intuitive trend: the free energy barriers for both C2 and C3 deprotonation increase monotonically with temperature. This increase reflects the entropic penalty associated with the formation of a tightly organized transition state, where hydride binding, K+ coordination, and solvent reorganization reduce configurational freedom.
Specifically, under THF dissolution conditions, moderate polarity and modest dielectric shielding are more effective in stabilizing the progressive charge redistribution that accompanies deprotonation from the IS (Fig. 4a) to the TS (Fig. 4b and c) than in the weakly polarized neat silane environment; as a consequence, THF yields overall lower computed free-energy barriers and a near-linear temperature dependence of ΔG≠. At 300 K the calculated ΔG≠ values for C2 and C3 deprotonation are 18.1 and 21.3 kcal mol−1, respectively; by 450 K these barriers increase to 22.0 and 22.8 kcal mol−1, reducing the intrinsic kinetic gap between the two sites (Fig. 5a). We emphasise that activation entropies drive this temperature dependence: the activation entropy is obtained from the temperature derivative of the barrier, and in THF the C2 pathway incurs a substantially larger entropy loss (ΔS≠ = −25.7 cal mol−1 K−1) than the C3 pathway (ΔS≠ = −9.8 cal mol−1 K−1) (Fig. 5c). The larger negative ΔS≠ for C2 causes its barrier to rise more rapidly with temperature, which explains why the kinetic preference for C2 diminishes as T increases in THF.
By contrast, in the neat silane environment, the C3 pathway exhibits a pronounced, non-linear entropic response: between 300 and 350 K, the C3 barrier rises sharply (21.5 to 24.6 kcal mol−1), while the C2 barrier changes only modestly (18.6 to 19.8 kcal mol−1) (Fig. 5d). Structural analysis indicates that this behaviour arises from temperature-sensitive disruption of the multimeric K-hydride aggregation that particularly stabilises the ordered TS required for C3 deprotonation (Fig. 6). The abrupt change in cluster stability produces a large, non-linear change in the effective ΔS≠ for C3 (Fig. 5d), which is not captured by implicit-solvent or static calculations.
We also note two important caveats relevant to experimental selectivity. First, our computed ΔG≠ values do not show a strict barrier crossover within the simulated temperature range; rather, they converge, such that kinetic discrimination is reduced at high temperature in THF. Second, the final product distribution observed experimentally depends not only on these deprotonation barriers but also on the relative thermodynamic stability of the silylated products and specialized reverse reaction mechanisms different from forward reactions. Some experiments report product interconversion and small amounts of reversion to N-indole, and such long-timescale, multi-step processes are difficult to sample exhaustively in finite MD trajectories. For these reasons, we present the deprotonation free-energy barriers primarily as mechanistic indicators of trends (entropic sensitivity, solvent coupling, and cluster-dependence) rather than as definitive predictors of absolute product ratios. Taken together, the computed entropic trends rationalise why C3 silylation becomes increasingly competitive at elevated temperature in THF, while the strong, non-linear entropic penalty in neat silane disfavors C3 across the accessible temperature range.
These findings are consistent with the experimentally observed trends: in THF solution, the reaction predominantly affords the C2-silylated product at low temperature, but the selectivity reverses at elevated temperature, favoring the C3-silylated product.11 In contrast, in a neat environment, C2 selectivity persists across all temperatures, and no inversion is observed.
For completeness, our NNP has been extended to silylation reactions and simulates the third step of the catalytic sequence, namely the silylation of the deprotonated indole intermediate (Fig. S10). Potassium does not play a direct mechanistic role in the final C–Si bond-forming event. Instead, the reactivity of this step is determined primarily by the position of the reactive carbon center generated in the preceding deprotonation step. Thus, the silylation process follows the regioselectivity already established upstream, rather than introducing a new potassium-controlled selectivity element.
To rationalize the observed nonlinear entropic effects and site-selectivity inversion, we analyzed the structural statistics of the hydride cluster extracted from the MLMD trajectories. Throughout the reaction, from the initial state to the transition state and until H2 is formed, the reactive hydride is tightly stabilized by cooperative ion pairing among tBuO−, K+ and H−. The extent of this aggregation can be quantitatively described using the K–H coordination number. In THF, the hydrides preferentially adopt a tetrameric arrangement (coordination number = 3), with a lower probability density of trimers (Fig. 6a). In the neat silane environment, although tetramers are still the dominant polymer species, the distribution is more moderate and the probability density of trimers is elevated (Fig. 6a).
The probability of forming well-ordered tetramers decreases systematically with increasing temperature, and the distribution of tetramers and trimers becomes gentler (Fig. 6b). This change reflects enhanced thermal motions and weakened ionic associations at higher temperatures, which destabilize the synergistic stabilization provided by multiple potassium cations. Importantly, no conformations corresponding to “2K-H”, “1K-H”, or uncoordinated hydride species were observed, even at 450 K. The results of this study are summarized below.
Differences in aggregation states in THF and in neat silane provide a mechanistic origin for the different temperature dependences of the C2 and C3 deprotonation barriers. This can be seen in the RDFs between the hydride and solvent molecules. In THF, the structural environment of the hydride does not change significantly when the temperature increases from 300 K to 450 K. The solvent environment does not undergo rearrangement within this temperature range (Fig. 6c). This stable aggregation leads to nearly linear ΔG≠ behavior for both C2 and C3 positions.
In the neat environment, from 300 K to 350 K, the solubility of the hydride decreases considerably and the stability decreases abruptly (Fig. 6d). Deprotonation of C3 is more sensitive to partial disruption of ionic clusters, as it requires a more ordered transition state and tighter hydride polymerization than C2. Loss of ion cluster integrity disproportionately destabilizes the C3 transition state, producing the nonlinear steep increase in ΔG≠ observed only for the C3 pathway (Fig. 5b). Importantly, this drop is not driven by intrinsic solvent structural changes: the solvent–solvent RDFs for THF and silane show only mild, uniform temperature broadening (Fig. S11).
The electronic environments of the C2 and C3 sites are fundamentally different: whereas the enhanced acidity of the C2–H bond in N-methylindoles arises from the excellent resonance stability of the resulting carbanion, in which the negative charge is efficiently dispersed across the indole framework and partially transferred to the nitrogen atom, deprotonation of C3 leads to a more localized anion with reduced stability.55
Therefore, deprotonation at C3 is expected to require more efficient solvent stabilization. More generally, explicit solvation and solvent reorganization can play a decisive role in solution-phase ionic reaction pathways and in the associated free energy barriers.56,57 We further quantified the solvent contributions by tracking the coordination number between the reactive hydride and solvent molecules along the reaction coordinate (Fig. 6e–h). Two representative temperatures, 300 and 400 K, were chosen to reflect typical experimental conditions.
These statistical trends reveal different solvent recombination behaviors at the two reaction sites. The significant increase in the coordination of C2 and C3 upon deprotonation in THF (Fig. 6e and f) suggests that the additional THF molecules effectively stabilize the charged species formed in TS. In a neat environment, the deprotonation of C2 and C3 showed opposite trends. The deprotonation of C2 was essentially insensitive to the silane solvent, and the coordination number of the hydride at the reaction center to silane remained almost the same with the progress of the reaction (Fig. 6g), suggesting that the solvent's ability to stabilize the charge-transfer product at this position was constant, and there was no solvent recombination. In contrast, the coordination number of silane decreases during C3 deprotonation (Fig. 6h), internal interactions dominate in the TS, and solvent molecules with weaker interactions are displaced. The ability of the solvent to stabilize the transition state of the C3 deprotonation reaction decreases, leading to an anomalous entropic effect on the free energy barriers of the reaction.
Relatively speaking, the temperature increase has a slightly greater effect on the neat environment (Fig. 6e–h), which is consistent with the previously calculated trend of a more significant change in the neat deprotonation free energy barrier with temperature.
For the first time, we constructed a model very close to the experimental environment and accomplished the simulation of the HSiR3/KOtBu catalytic system with the help of a neural network potential. Our results strongly support the reaction mechanism of hydride transfer, and by means of this reaction mechanism, we discuss the selectivity of the deprotonation process of de-C–H bonds under different experimental conditions. Our trend studies and analyses strongly suggest a microscopic mechanism of selectivity reversal and disappearance of selectivity reversal, which is in accordance with the experimental results and shows the correctness of our hydride transfer mechanism.
From the perspective of the tetramer KOtBu catalytic reaction, if the catalytic mechanism in solution proceeds through stepwise transformations involving hydride intermediates, the system must be capable of maintaining a structurally confined and electrostatically stabilized space for the hydride to persist.
Related potassium-based catalytic systems provide useful comparisons. For example, potassium graphite (KC8) possesses a layered intercalation structure, in which potassium atoms are periodically inserted between graphene sheets.58 In such materials, potassium atoms occupy well-defined positions within the carbon lattice, which can generate localized electrostatic environments and potential trapping sites for hydride-like species formed under reaction conditions.59,60 Experimental studies have shown that KC8 can catalyze the same transformation,61,62 albeit with lower activity and turnover frequency compared to KOtBu.11 The confined interlayer regions of KC8 may therefore provide transient stabilization sites for reactive hydride species within the carbon matrix.
Potassium hydride (KH) represents another relevant reference system. In contrast to molecular or oligomeric potassium alkoxides, KH exists as an extended ionic solid adopting a NaCl-type lattice, where hydride ions are octahedrally coordinated by six potassium cations.63 In addition, KH is essentially insoluble in common organic solvents such as THF, meaning that reactions typically occur at the solid–liquid interface rather than through discrete solvated hydride species.64,65 Palumbo et al. proposed that the active catalyst for potassium-based catalytic silylation reactions is not KOtBu, but rather the operando formed KH. This form differs from commercially available solid aggregates of KH and exhibits higher reactivity.14
While our machine learning molecular dynamics simulations, enabled by an active learning workflow, allow for large-scale modeling of reactive systems, extending the neural network potential to additional catalytic environments still incurs considerable computational cost. This lies beyond the present scope of discussion. Nevertheless, we plan to expand our simulation dataset in future work to explore how other alkali metal based catalysts modulate hydride reactivity and stability, thereby providing a broader mechanistic framework for base-promoted silane transformations.
H Functionalization/Silylation of Heteroarenes, Angew. Chem., Int. Ed., 2008, 47, 7508–7510 CrossRef CAS PubMed.| This journal is © The Royal Society of Chemistry 2026 |