New insights into the biphasic “CO-free” Pauson–Khand cyclisation reaction through combined in situ spectroscopy and multiple linear regression modelling

Multiple linear regression modelling is used to analyse in situ Raman spectra recorded during a “CO-free” Pauson–Khand type cyclisation which enables a knowledge-driven optimisation protocol.


Introduction
Carbonylation reactions are cornerstones of the pharmaceutical and chemical industries, employed for the production of highly relevant synthetic products, such as methyl propionate, 1 ibuprofen, 2 and vanillin, 3 at the multi-million tonne scale annually. Usually, carbonylation reactions take place in the liquid phase employing a homogeneous catalyst. This makes, to the best of our knowledge, carbonylation reactions one of the most important applications of homogeneous catalysis in industry. 4 On an industrial level, carbonylation reactions are conducted using a synthesis gas feedstock, a mixture of CO and H 2 gas. 5 This feedstock is easily generated via steam reforming, but requires adequate safety measures to handle the toxic and flammable gas mixture. These properties render synthesis gas unattractive for carbonylations in the scope of pharmaceutical products and intermediates as well as speciality chemicals, as small chemical plants or laboratories do not always have the proper infrastructure to safely handle a synthesis gas stream.
As carbonylation reactions are very useful transformations in organic synthesis, considerable efforts have been directed towards the development of "CO-free" carbonylation reactions where the synthesis gas is replaced by less harmful surrogate molecules that are safe to handle on a laboratory scale. 6 The most interesting CO surrogate molecules are formaldehyde, formed from paraformaldehyde (PFA), 7,8 and formic acid 9 as they are very atom efficient and can be formally seen as CO + H 2 or H 2 O, respectively. The role of the utilized transition metal catalysts is thus twofold. Firstly, they decompose the surrogate molecule into a CO (equivalent) species, and secondly, they insert this species into the organic product. 9 The concept of "CO-free" carbonylations has, for example, been successfully applied in the carbonylation of alkenes, 10 the carbonylation of aryl bromides, 11 and the synthesis of 9-fluorenones. 12 One of the earliest works on "CO-free" carbonylations was published by Morimoto et al., describing the use of Rh-phosphine complexes in a biphasic Pauson-Khand type reaction for the production of bicyclopentenone compounds (see Scheme 1). 13 This reaction features many interesting concepts, such as the use of two different phosphine ligands (i.e., 1,3-bis(diphenylphosphino)propane (dppp) and 3,3′,3″-phosphanetriyltris(benzenesulfonic acid) trisodium salt (TPPTS)) to separate the formaldehyde decomposition and the carbonylation reaction by using water and the organic substrate to form a biphasic reaction mixture. The suspension is further stabilized by sodium dodecyl sulphate (SDS), which acts as a surfactant and enhances the reaction rate. However, the underlying reaction mechanism has not been investigated in detail.
Recently, we have studied Pd-catalysed "CO-free" carbonylation reactions and found that, to our own surprise, no CO is formed during this reaction, and postulated a new formyl group-based mechanism. 14 The Rh-catalysed Pauson-Khand type reaction, reported by Morimoto et al., 13 caught our attention as it can be compared to the previously studied Pd-catalysed reaction. Hence, in this work, we investigate the reaction mechanism of the "CO-free" Pauson-Khand type reaction to gain new insight into "CO-free" carbonylation reactions and to enable a knowledge-driven optimisation of the Rh-catalysed reaction. However, its biphasic nature and the application of phase-specific phosphine ligands make it particularly challenging to characterise intermediates by in situ spectroscopy. In order to tackle this obstacle, we have developed a new analysis approach, which is based on in situ bulk Raman spectroscopy coupled with advanced data analysis and a multiple linear regression model. This allowed us to evaluate the influence of the reaction conditions (i.e., temperature, metal precursor, ligand, (co)-substrate and surfactant concentration) on the reaction kinetics. Furthermore, the molecular origin of the influencing parameters was studied using NMR spectroscopy as well as dynamic light scattering (DLS). Finally, a new reaction mechanism is proposed that was verified by density functional theory (DFT) calculations. In what follows, we present our novel analytical approach and illustrate our new mechanistic findings for the "CO-free" Rh-catalysed Pauson-Khand type cyclisation.

Chemicals and materials
All reactions were carried out under an Ar atmosphere using standard Schlenk techniques.

Raman spectroscopy
Raman spectra were recorded using a Renishaw InVia Raman microscope (Renishaw, UK), a 532 nm diode laser, a 50× objective (0.75 NA, Leica, Germany) and a grating with 1200 lines mm −1 .
For reference purposes and the subsequent data analysis, 32 Raman spectra with an integration time of 10 s were taken of 3-phenyl-2-propyne-  S1-S14 †) The reference spectra were recorded between 250 and 4000 cm −1 .
For a typical in situ Raman experiment, a microwave vial (Biotage, Cardiff, United Kingdom) was charged with [Rh(cod)Cl] 2 , dppp, TPPTS, paraformaldehyde, SDS and a stirring bar before being evacuated and flushed with argon three times. Degassed H 2 O (5 mL) and enyne were added to the microwave vial under argon atmosphere before the vial was sealed. The utilized amounts for each individual in situ experiment are summarized in Table S1. † The microwave vial was heated to the intended measurement temperature while Raman spectra were recorded before (3 spectra), during and after the heating period. The samples were heated using a sand bath on a feedback-controlled heating plate (IKA, Germany). Care should be taken as the vial auto-pressurizes at elevated temperatures! For each in situ measurement, 340 Raman spectra between 250 and 3400 cm −1 with an integration time of 30 s were recorded. The power at the sample was 30.1 mW. To validate the linear relationship between the enyne concentration and its Raman signal at 2238 cm −1 , a calibration experiment was performed. For this experiment, 9 Raman spectra, each with an integration time of 10 s, were collected from different enyne/H 2 O suspensions (0.00, 0.04, 0.11, 0.22, 0.37, 0.54, 0.81 mmol L −1 ). The result is depicted in Fig. S21. † The subsequent data analysis procedure is summarized in Scheme 2 and Fig. S15 and described in detail in the ESI. †
Scheme 1 Rh-Catalysed Pauson-Khand type carbonylation reaction to transform enyne components into bicyclopentenones. The functional groups participating in the cyclisation reaction are highlighted in red. The asterisk highlights the newly formed chiral centre.
All NMR spectra show small contributions from the oxidized phosphine ligands and trace amounts of water. These contributions are unfortunately unavoidable and stem from the sample preparation process.

Dynamic light scattering
Dynamic light scattering (DLS) was measured on a Zetasizer Nano-S from Malvern Panalytical (Malvern, United Kingdom). The accompanying Zetasizer Software (version 7.13) automatically controlled the measurement volume as well as the light intensity. The subsequent autocorrelation and fitting procedure were also done via the measurement software. The samples were prepared by mixing enyne (0.08 mL) with H 2 O (2.0 mL) and different amounts of SDS (0, 37

Quantum chemical simulations
All quantum chemical simulations were performed using the Gaussian16 software. 22 The ground state equilibrium structures and electronic properties of the rhodium complexes, i.e.
[Rh(TPPTS) 3 and mer-[Rh(dppp)(κ 1 -dppp)(CO-enyne)] + (III.4) as well as of enyne, BCP, dppp, TPPTS, CO, H 2 and HCHO were obtained at the density functional theory (DFT) level utilizing the B3LYP XC functional. 23 The def2-SVP basis set as well as the respective core potentials were applied for all atoms. 24 A subsequent vibrational analysis was carried out for each optimized ground state structure to verify that a minimum on the potential energy (hyper-)surface (PES) was obtained. To correct for the lack of anharmonicity and for the approximate description of electron correlation, the harmonic frequencies were scaled by a factor of 0.95. 25 All calculations were performed including D3 dispersion correction with Becke-Johnson damping. 26 An analogous computational setup was applied for the optimisation of transition states (TSs) and the intrinsic reaction coordinate (IRC) calculations. For the TS search, an initial guess in the vicinity of the saddle point was first obtained via the nudged elastic band (NEB) 27 method as implemented in pysisyphus 28 with xtb. 29 Thereafter, the TSs were obtained in Gaussian16 via the Berny algorithm, 30 followed by a vibrational analysis to verify that a first-order saddle point on the PES was obtained. Starting from the optimized TS structures, IRC calculations based on a local quadratic approximation (LQA) algorithm 31 were performed in order to verify that the TSs were located on the minimum energy path (MEP) connecting the desired educt and product states.
Raman intensities were obtained based on the calculated Raman activities using the following expression: 32 18 cm −1 ). It was assumed that the Gaussian and Lorentzian part contribute equally to the Voigt function. Thus, both convoluted functions featured a FWHM of 11 cm −1 , which leads to the described FWHM of 18 cm −1 for the resulting Voigt function. 33
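The stated widths can be checked with a short calculation. The sketch below uses the widely quoted Olivero-Longbothum approximation for the FWHM of a Voigt profile; this formula is an assumption chosen for illustration and is not necessarily the relation used in ref. 33. The function name `voigt_fwhm` is hypothetical.

```python
import math


def voigt_fwhm(fwhm_gauss: float, fwhm_lorentz: float) -> float:
    """Approximate FWHM of a Voigt profile from its Gaussian and Lorentzian FWHMs.

    Uses the Olivero-Longbothum approximation (an assumption of this sketch):
    f_V ~ 0.5346 f_L + sqrt(0.2166 f_L^2 + f_G^2)
    """
    return 0.5346 * fwhm_lorentz + math.sqrt(
        0.2166 * fwhm_lorentz ** 2 + fwhm_gauss ** 2
    )


# Both convoluted functions have a FWHM of 11 cm^-1, as stated in the text.
print(round(voigt_fwhm(11.0, 11.0), 1))
```

With both widths set to 11 cm −1 the approximation gives a value very close to the 18 cm −1 quoted above.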

In situ Raman spectroscopy
Raman spectroscopy is ideally suited to follow the changing concentrations during the transformation of an enyne into a bicyclopentenone. The substrate features a distinct alkynyl group that gives a strong Raman signal at 2238 cm −1 . The product includes a newly formed carbonyl group, which exhibits a vibrational band at 1690 cm −1 . A set of in situ Raman spectra with the aforementioned bands highlighted can be seen in Fig. 1.
During the data acquisition it became clear that the Raman spectra, due to the complex nature of the reaction process, suffered from poor signal-to-noise ratios (SNRs). Thus, a simple single band integration was susceptible to errors and consequently featured large standard errors. To overcome this problem, a new and more advanced data analysis scheme was developed. Each individual time-dependent in situ Raman spectrum was fitted as a linear combination of the Raman spectra of the pure components. Initially, all compounds (Rh salt, phosphine ligands, enyne, BCP, H 2 O, SDS, PFA and formaldehyde) were considered for the fit (see Fig. S16 †). However, most components did not contribute significantly to the overall Raman signal; therefore, we analysed which components could be omitted from the fitting procedure without significantly influencing the quality of the final fit (see Fig. S18 †). It was found that only the substrate (enyne), the product (BCP) and the solvent (H 2 O) were necessary to reliably reproduce the in situ Raman spectra (see Fig. 1, Fig. S17 and S19 †). This finding was to be expected as these three components feature the largest concentrations in the reaction solution. The remaining components are merely present as traces; therefore, their contribution to the in situ Raman spectra is marginal.
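The idea of fitting one measured spectrum with pure-component spectra can be sketched as an ordinary least-squares problem. The example below is a minimal illustration under stated assumptions: the five-channel "spectra" are invented toy numbers, the function names `solve_linear` and `fit_components` are hypothetical, and the normal-equation solver stands in for whatever fitting routine the authors actually used (described in the ESI). In practice a library routine such as `numpy.linalg.lstsq` would usually be preferred.

```python
def solve_linear(a, b):
    """Solve the square system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        known = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - known) / m[r][r]
    return x


def fit_components(spectrum, references):
    """Least-squares coefficients c_j so that sum_j c_j * references[j] ~ spectrum."""
    gram = [[sum(p * q for p, q in zip(ri, rj)) for rj in references]
            for ri in references]
    rhs = [sum(p * q for p, q in zip(ri, spectrum)) for ri in references]
    return solve_linear(gram, rhs)


# Toy reference "spectra" on five channels (invented numbers, not measured data).
enyne = [0.0, 1.0, 0.2, 0.0, 0.1]
bcp = [0.1, 0.0, 0.9, 0.3, 0.0]
water = [0.5, 0.1, 0.1, 0.6, 1.0]

# Synthetic mixture built with known coefficients so the fit can be checked.
true_coeffs = [0.7, 0.2, 1.5]
mixture = [true_coeffs[0] * e + true_coeffs[1] * b + true_coeffs[2] * w
           for e, b, w in zip(enyne, bcp, water)]

coeffs = fit_components(mixture, [enyne, bcp, water])
print([round(c, 6) for c in coeffs])
```

Repeating such a fit for every spectrum in a time series yields one coefficient per component and time point, which is the kind of coefficient profile discussed in the following paragraphs.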
The fitting approach was cross validated by principal component analysis (PCA) (see Fig. S20 †) and multivariate curve resolution (MCR). The first two pure component spectra extracted by MCR are very similar to the Raman spectra of enyne and BCP (see Fig. S23 and S24 †) while the PCA revealed that the majority of the variance in the datasets is described by three components.
The linear fitting procedure results in time-dependent coefficient profiles that are linked to the concentration of enyne, BCP, and H 2 O (see Fig. 1 and S22 †). As the reaction proceeds, the contribution of enyne to the fit decreases while simultaneously the contribution from BCP increases. Thus, the extracted fit coefficients can be used to follow the reaction progress. Interestingly, the contribution of H 2 O also increases. This effect does not stem from a change in water concentration but is rather associated with the change in the Raman scattering cross section of the reaction solution when enyne is consumed in favour of BCP. The Raman scattering cross section of enyne is higher than that of BCP. Thus, at first the organic molecules contribute more to the Raman signal of the reaction solution than H 2 O, but as the reaction proceeds the Raman cross section of the organic molecules decreases as enyne is transformed into BCP, and therefore the contribution of water to the overall Raman signal increases. Nevertheless, the fit coefficients of the organic molecules are proportional to their concentration. For further analysis, the coefficient profiles of enyne were fitted with a first-order kinetic exponential function to extract a kinetic rate constant k from each in situ measurement (see ESI † for details). The first-order kinetics of the reaction were cross validated by integrating the ν(C≡C) band at 2238 cm −1 , i.e. by summing up the relevant Raman channel intensities, to follow the concentration of enyne (see Fig. S22 †).
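Extracting a first-order rate constant from a coefficient profile can be illustrated as follows. This is a sketch only: it assumes the profile follows c(t) = c0 exp(−k t) and uses a straight-line fit of ln c against t, which is a simplification swapped in here for the exponential fit described in the ESI. The function name `first_order_rate` and the numbers are hypothetical.

```python
import math


def first_order_rate(times, coeffs):
    """Return (k, c0) for c(t) = c0 * exp(-k t) from a linear fit of ln(c) vs t."""
    logs = [math.log(c) for c in coeffs]
    n = len(times)
    t_mean = sum(times) / n
    y_mean = sum(logs) / n
    sxx = sum((t - t_mean) ** 2 for t in times)
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(times, logs))
    slope = sxy / sxx
    intercept = y_mean - slope * t_mean
    return -slope, math.exp(intercept)


# Synthetic, noise-free profile: one point every 30 s (the integration time
# quoted in the experimental section), with an invented k and c0.
k_true, c0_true = 2.0e-3, 0.8
times = [30.0 * i for i in range(20)]
profile = [c0_true * math.exp(-k_true * t) for t in times]

k_fit, c0_fit = first_order_rate(times, profile)
print(k_fit, c0_fit)
```

For noisy data a log-linear fit weights small coefficients too strongly, so a direct non-linear fit of the exponential is generally the safer choice.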

Multiple linear regression
With the kinetic rate constants k at hand, the influence of the initial reaction conditions on the reaction rate was analysed, i.e. how to tune the experimental conditions to accelerate the investigated Pauson-Khand type cyclisation. Morimoto et al. performed some basic optimisation and found: i) a combination of dppp/TPPTS is more effective than dppp alone, ii) TPPTS alone is not able to catalyse the carbonylation, and iii) an increased SDS concentration is favourable for the reaction. 13 However, the molecular origin of the altered reactivity was not investigated in depth.
To enable a knowledge-driven optimisation procedure, we used a multiple linear regression model to predict the kinetic rate constants k from the reaction conditions. The basic idea behind the approach is to vary the reaction conditions and evaluate their impact on the reaction's rate constant. In physical chemistry, this increase in reaction rate is described by an increase in the associated kinetic rate constant k which is available from the in situ Raman measurements. By using a multiple linear regression model, multiple experiments can be evaluated at the same time, making it possible to vary multiple reaction conditions in one experiment while still being able to extract the influence of each individual reaction condition on the kinetic rate constant k. Thus, the multiple linear regression model enables an overarching multiparameter evaluation on the reaction conditions via in situ Raman experiments.
The seven reaction conditions considered for the multiple linear regression model are the reaction temperature as well as the starting molar concentrations (in units of mmol L −1 ) of [Rh(cod)Cl] 2 , dppp, TPPTS, SDS, PFA and enyne. Formally, the model assigns one linear parameter a x to each reaction condition x. The obtained parameters a x are summarized in Table 1. The sign of a x indicates whether an increased or decreased reaction rate is observed under the altered conditions, i.e. temperature or concentrations, respectively.
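How several experiments with simultaneously varied conditions can be evaluated together is sketched below. Everything specific in this example is an assumption made for illustration: the model form k ≈ a_0 + Σ a_x·x with an intercept a_0, the restriction to three conditions instead of seven, the invented condition values and parameters, and the function names `solve_linear` and `fit_mlr`. It is not the authors' implementation.

```python
def solve_linear(a, b):
    """Solve the square system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        known = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - known) / m[r][r]
    return x


def fit_mlr(conditions, rate_constants):
    """Return [a_0, a_1, ..., a_p] for k ~ a_0 + sum_x a_x * condition_x."""
    design = [[1.0, *row] for row in conditions]  # leading 1.0 gives the intercept
    p = len(design[0])
    gram = [[sum(r[i] * r[j] for r in design) for j in range(p)] for i in range(p)]
    rhs = [sum(r[i] * k for r, k in zip(design, rate_constants)) for i in range(p)]
    return solve_linear(gram, rhs)


# Invented experiments: (temperature, c_Rh, c_dppp) in arbitrary units.
conditions = [
    (60.0, 1.0, 2.0),
    (70.0, 1.0, 4.0),
    (80.0, 2.0, 2.0),
    (90.0, 2.0, 4.0),
    (70.0, 3.0, 3.0),
    (80.0, 1.5, 1.0),
]
# Invented parameters [a_0, a_T, a_Rh, a_dppp]; signs mimic the trends in the text.
true_params = [0.5, 0.02, 1.5, -0.8]
rate_constants = [true_params[0] + sum(a * x for a, x in zip(true_params[1:], row))
                  for row in conditions]

params = fit_mlr(conditions, rate_constants)
print([round(a, 6) for a in params])
```

Because every experiment contributes one row to the design matrix, several conditions can change between runs and the individual a x are still recovered, provided the set of experiments is not collinear.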
The influence of the temperature on the reaction is straightforward: a higher temperature results in a faster reaction rate (a T > 0). The influence of the remaining reaction conditions is not quite as obvious. Increasing the Rh concentration has a positive influence on the reaction rate (a Rh > 0) while an increasing phosphine ligand concentration has a negative effect (a dppp < 0, a TPPTS < 0). An increased SDS concentration accelerates the reaction (a SDS > 0), while large amounts of enyne substrate and paraformaldehyde slow the reaction down (a Enyne < 0, a PFA < 0).
When comparing the linear influence parameters in a quantitative fashion, it becomes evident that the [Rh(cod)Cl] 2 , dppp and TPPTS concentrations have a major impact, as the rate constant is changed by one or even two orders of magnitude, while the influence of the SDS, enyne and PFA concentrations is less prominent. This is not surprising, as the formed Rh-phosphine complexes play the key role in enabling the reaction.
It is remarkable that this type of information can be extracted from bulk in situ Raman measurements. This was only possible due to the introduced advanced data analysis procedure, which allowed us to rationalise the kinetic rate constant k by a linear fit of the in situ Raman measurements followed by a multiple linear regression linking the experimental reaction conditions with the reaction rate. This analysis approach is a clear improvement over the simple qualitative variation of experimental parameters without in situ spectroscopy. The presented approach is not limited to the Rh-catalysed Pauson-Khand cyclisation or tied to the use of Raman spectroscopy presented here but can be extended to other reactions and spectroscopic techniques, as long as the rate of the reaction of interest can be linked to its experimental conditions.

Nuclear magnetic resonance and dynamic light scattering
The next step was to address the molecular origin of the observed influence of the reaction conditions on the reaction rate. The influencing parameters can be grouped as follows: firstly, the Rh, TPPTS and dppp concentrations are associated with the active catalytic species; secondly, the substrate and SDS concentrations are linked with the exchange rate between the organic and aqueous phase; and finally, the formaldehyde and substrate concentrations are important for the ratio between CO formation and Pauson-Khand type carbonylation.
Firstly, our explanation for the positive influence of an increasing Rh concentration and the negative influence of increased phosphine ligand concentrations on the reaction rate is the existence of multiple equilibria between mono- and dinuclear Rh-phosphine complexes. We further studied the Rh-dppp equilibria by using 31 P NMR spectroscopy (see Fig. 2a and b). The NMR spectra reveal a signal at 23.5 ppm ( 1 J Rh-P = 112 Hz) at sub-stoichiometric dppp concentrations, which changes into a signal at 11.3 ppm ( 1 J Rh-P = 121 Hz) with a small signal at 33.1 ppm ( 1 J Rh-P = 135 Hz) at a Rh : dppp ratio of 1 : 1, which transforms into a signal at 7.8 ppm ( 1 J Rh-P = 131 Hz) at higher dppp concentrations. In accordance with Heller et al., 34 the signals are assigned to [Rh(cod)(dppp)(μ 2 -Cl) 2 ], [Rh(cod)(dppp)(Cl)], [Rh(dppp)(μ 2 -Cl)] 2 and [Rh(dppp) 2 ]Cl, respectively. Therefore, the NMR spectra clearly show how an increasing dppp concentration forces the formation of more stable Rh-dppp complexes. Our interpretation of the kinetic results and the NMR experiments is that a lower dppp concentration enables the coordination of labile cod ligands, which can be easily displaced when either HCHO or enyne approaches the complex.
A similar experiment with TPPTS was conducted (see Fig. 2c and d). Surprisingly, only the mononuclear complex [Rh(TPPTS) 3 ]Cl at 29.1 ppm ( 1 J Rh-P = 150.0 Hz) 17 is visible in the 31 P NMR spectra, independent of the TPPTS concentration. This finding can be explained by the observation that excess [Rh(cod)Cl] 2 is not soluble in D 2 O. Thus, only as much Rh + is present in the aqueous phase as there is TPPTS to form [Rh(TPPTS) 3 ]Cl. Consequently, the negative influence of an increasing TPPTS concentration can be explained by an increasing tendency to leach Rh + from the organic phase into the aqueous phase, which, in turn, reduces the amount of catalytically active Rh-dppp complexes. We conclude from our kinetic analysis based on in situ Raman spectroscopy and the supporting NMR experiments that [Rh(cod)(dppp)(Cl)] is more active than [Rh(dppp) 2 ]Cl in the Pauson-Khand type carbonylation reaction, and that the mononuclear complex [Rh(TPPTS) 3 ]Cl is the catalytically active species in the decomposition of formaldehyde in the aqueous phase.
Secondly, an increased SDS concentration leads to the formation of smaller organic micelles in the aqueous phase. This hypothesis was confirmed by DLS measurements. The DLS experiments reveal that at high SDS concentrations, micelles with an average diameter of 250 nm are formed from the organic substrate in the aqueous environment (see Fig. 2e and f). Below a critical SDS concentration, which lies between 42.4 and 62.5 mmol L −1 , the micelles are unstable, causing them to aggregate and form larger bubbles which are too large (d > 10 μm) to be measured by DLS. The same effect is observed when the relative enyne concentration increases, leading to larger bubbles. This change in average micelle diameter from >10 μm to 250 nm with increasing SDS concentration leads to an increase in the aqueous-organic exchange surface by a factor of >40. The positive influence of an increased SDS concentration was also observed by Morimoto et al., although they did not explore its origin. 13 In line with their proposed mechanism (formaldehyde decomposition in the aqueous phase, carbonylation in the organic phase), we propose a molecular exchange across the aqueous-organic interface, which is enhanced by a larger micellar surface.
Finally, a large formaldehyde concentration, stemming from a large initial PFA concentration, 7 competitively hinders the carbonylation reaction by blocking Rh-dppp complexes. As Rh-dppp complexes are able to catalyse both the Pauson-Khand cyclisation and the HCHO decomposition while Rh-TPPTS complexes can only catalyse the HCHO decomposition, the availability of Rh-dppp complexes is crucial for the overall reaction progress. NMR experiments on the decomposition of formaldehyde reveal that not only does [Rh(TPPTS) 3 ]Cl form CO from HCHO, but [Rh(dppp) 2 ]Cl also yields [Rh(dppp)(CO) 2 ] + (ref. 35) when CO is formed from HCHO (see Fig. S28 † for 13 C and 31 P NMR spectra).
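The factor of >40 for the exchange surface follows from simple geometry, as the sketch below shows. It assumes that the organic phase is dispersed as equally sized spheres and that its total volume stays constant; the 0.08 mL volume is taken from the DLS sample preparation, and the function name `total_interface_area` is hypothetical.

```python
import math


def total_interface_area(total_volume: float, diameter: float) -> float:
    """Total surface area of equal spheres of the given diameter that together
    hold total_volume (SI units). Algebraically this equals 6 * V / d."""
    droplet_volume = math.pi * diameter ** 3 / 6.0
    droplet_area = math.pi * diameter ** 2
    return total_volume / droplet_volume * droplet_area


organic_volume = 0.08e-6  # 0.08 mL of enyne expressed in m^3
area_small = total_interface_area(organic_volume, 250e-9)  # 250 nm micelles
area_large = total_interface_area(organic_volume, 10e-6)   # 10 um droplets

ratio = area_small / area_large
print(ratio)
```

Since the total area scales as 1/d at fixed volume, the ratio is simply 10 μm / 250 nm = 40, and it grows further if the large droplets exceed 10 μm.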
Thus, an excess of formaldehyde can suppress the desired carbonylation reaction by reacting more efficiently with Rh-dppp complexes than the enyne substrate does. Based on the NMR spectra, we propose that the structure of [Rh(dppp)(CO) 2 ] + is either trans-square planar (which seems unlikely with dppp as a ligand) or labile on the NMR time scale. Both hypotheses can explain the origin of the triplets observed in the 13 C and 31 P NMR spectra.
In conclusion, the Rh-catalysed Pauson-Khand type reaction can be improved by i) an elevated temperature, ii) an excess of Rh + favouring the formation of catalytically active complexes, iii) the formation of small micelles by an excess of SDS and iv) avoiding an excess of HCHO, which would block all active Rh centres.

Proposed mechanism and quantum chemical simulations
We summarize the results, obtained by in situ Raman spectroscopy, NMR and DLS, in a newly proposed mechanism, which is illustrated in Scheme 3.  .3). Subsequently, CO inserts into the Rh-C bond in cis position, expanding the rhodacycle by one carbon atom (III.4). Finally, [Rh(dppp) 2 ] + is regenerated by a reductive elimination of BCP. Here it is important to note that in the present case, a chiral molecule is formed from an achiral substrate. Thus, the initial formation of the rhodacycle can potentially be influenced by chiral phosphine ligands favouring the formation of a specific enantiomer.
For cycles II and III we favour the idea that all Rh-dppp-cod complexes identified by NMR spectroscopy (see Fig. 2a) are able to catalyse the decomposition and cyclisation reactions but that the reactions proceed at different speeds because dppp ligands dissociate from the Rh centre much more slowly than chloride and cod ligands. This is especially important for the equilibria II. 2 2 will also contribute to the catalytic reaction when they are present. This assumption is consistent with our earlier kinetic analysis which revealed the decelerating effect of added dppp.
Finally, the proposed reaction mechanism was studied in detail using DFT. All structures and TSs were calculated in the gas phase. It is therefore important to keep in mind that solvent interactions, whether explicit or implicit, are neglected, although they play an important role, especially in the aqueous phase. Therefore, activation energies related to intermediates featuring a vacant coordination site are likely overestimated, as such species would be stabilised by the surrounding solvent (or reactants). Unfortunately, including an explicit solvent environment without a biased preselection of specific structures comprising one water molecule interacting with the catalyst is computationally too demanding to be applied along the entire reaction profile. Nevertheless, the DFT calculations provide a solid foundation to qualitatively rationalise our experimental findings.
The first question we tried to answer was why [Rh(TPPTS) 3 ] + is better than [Rh(dppp) 2 ] + in decomposing HCHO into CO (see Fig. 3a). From an energetic point of view, both reactions are possible, which is in line with the experimental observations. The energy profile for [Rh(TPPTS) 3 ] + shows that the addition and decomposition of HCHO at the complex is favoured when compared to the pure complex. This is not the case when [Rh(dppp) 2 ] + is used. Here, the intermediate steps are all higher in energy than the initial complex. As the activation energies for all steps are comparable, the main reason why [Rh(TPPTS) 3 ] + is better than [Rh(dppp) 2 ] + in decomposing HCHO is that the intermediary complexes are lower in energy than the starting complex which is not the case for [Rh(dppp) 2 ] + .

Furthermore, we were interested in the geometry of the Rh complexes throughout the reaction. The findings can be seen in Scheme 3. The HCHO activation at [Rh(TPPTS) 3 ] + and [Rh(dppp) 2 ] + in both cases results, as expected, in a cis coordination of the resulting hydride and formyl group. Both complexes reorganize into a square-pyramidal geometry with a vacancy in the position trans to the formyl group. This is expected as formyl groups are known for their strong trans effect. An important finding is that the necessary dissociation of a dppp phosphorus from the Rh centre in [Rh(dppp) 2 (CHO)(H)] + (II.2a to II.2b in Scheme 3) seems to be an endergonic reaction step. This is in line with the NMR experiments discussed earlier, which revealed that [Rh(cod)(dppp)(Cl)] is a better complex for the HCHO decomposition due to the easier displacement of cod. Subsequently, the formyl group decomposes into CO and a second hydride.
The Pauson-Khand type carbonylation (cycle III in Scheme 3) starts with the oxidative addition of enyne to [Rh(dppp) 2 ] + (III.1-TS). Subsequently, the Rh-P bond trans to the alkenyl carbon breaks (III.2a to III.2b). Again, this behaviour can be rationalised by the trans effect. The open coordination site is occupied by a CO molecule, which is located trans to the alkenyl carbon but cis to the alkyl carbon (III.3). Thus, the following CO insertion can only proceed at the alkyl Rh-C bond, as shown by structure III.4. Finally, the catalyst is regenerated by the reductive elimination of BCP. The formation of BCP from enyne and CO is thermodynamically highly favoured, with ΔG = −179 kJ mol −1 (ΔH = −246 kJ mol −1 ).
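The two thermodynamic values can be related through ΔG = ΔH − TΔS, as sketched below. The temperature of 298.15 K is an assumption of this sketch (it is the default for thermochemistry in Gaussian, but the text does not state the temperature used), and the variable names are hypothetical.

```python
# Values quoted in the text for enyne + CO -> BCP.
delta_g = -179.0  # kJ mol^-1
delta_h = -246.0  # kJ mol^-1

# ASSUMPTION: standard temperature; not stated in the source.
temperature = 298.15  # K

# From delta_g = delta_h - T * delta_s:
t_delta_s = delta_h - delta_g                  # kJ mol^-1
delta_s = t_delta_s * 1000.0 / temperature     # J mol^-1 K^-1

print(t_delta_s, round(delta_s, 1))
```

The negative entropy term is what one would expect when two molecules (enyne and CO) combine into a single product molecule, which is why ΔG is less negative than ΔH.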
It is worth noting that the Pauson-Khand cyclisation is susceptible to enantioselective induction. The calculations show that the right-handed isomer Δ-cis-[Rh(dppp) 2 (enyne)] + slightly favours the (S)-enantiomer of enyne by about 10 kJ mol −1 when compared to the (R)-enantiomer. This is due to steric constraints enforced by the two dppp ligands (see Fig. 3c). Of course, dppp is not a chiral ligand and, therefore, the formation of Δ-cis-[Rh(dppp) 2 (enyne)] + and Λ-cis-[Rh(dppp) 2 (enyne)] + , which favours the (R)-enantiomer of enyne, is equally likely. Thus, it comes as no surprise that a racemate of BCP is formed. Our calculations indicate that it is possible to perform an enantioselective "CO-free" Pauson-Khand cyclisation when a chiral bisphosphine ligand is used. This is an improvement over the Ti-catalysed approach for the synthesis of chiral bicyclopentenones 36 and in line with the results from Kim et al. 37

Conclusions
We have investigated the mechanism of one of the oldest and most challenging "CO-free" carbonylation reactions using in situ Raman spectroscopy, nuclear magnetic resonance (NMR) and dynamic light scattering (DLS) coupled with advanced data analysis and density functional theory (DFT) calculations. Our data analysis approach revealed, and is able to predict, how the studied catalytic reaction can be accelerated in a quantitative fashion. Furthermore, all influencing reaction conditions (i.e., temperature, metal precursor, ligand, (co)-substrate and surfactant concentration) were studied on the molecular level, revealing a series of relevant Rh-phosphine equilibria and the formation of micelles in the biphasic system. Finally, the newly proposed catalytic cycles take into consideration the biphasic nature of the reaction and explain all experimental findings. DFT calculations unravelled how the reaction can be performed in an enantioselective way by the utilization of chiral bisphosphines, which is a subject of great interest in asymmetric catalysis.
Overall, the presented approach of combining in situ spectroscopy and advanced data analysis enables a knowledge-driven optimisation and improvement of not only the studied "CO-free" Pauson-Khand type cyclisation but of catalytic reactions in general. Our findings encourage us to investigate how our analysis scheme can be used as a building block to enhance automated high throughput experiments.

Conflicts of interest
There are no conflicts of interest to declare.