Daniel A.
Holland-Moritz
*a,
Sarah R.
Moor
*a,
Joseph B.
Parry
a,
Elliot J.
Medcalf
c,
Claire M.
Eberle
a,
Andrew C.
Strakham
b,
Shane T.
Grosser
a,
Hang
Hu
a,
Noah P.
Dunham
a and
Maximilian
Gantz
c
aProcess Research & Development, Merck & Co., Inc., Rahway, New Jersey 07065, USA. E-mail: sarah.moor@merck.com; daniel.holland-moritz@merck.com
bAnalytical Research and Development, Merck & Co., Inc., Rahway, New Jersey 07065, USA
cDepartment of Biochemistry, University of Cambridge, 80 Tennis Court Road, CB2 1GA Cambridge, UK
First published on 6th August 2025
In this report, we detail direct inject mass spectrometry via a robotically automated vacuum enabled (RAVE) interface that utilizes commercially available capillary electrophoresis hardware to directly inject samples for mass spectrometry (MS) at a sampling rate of approximately 12 s per sample. This system enables direct electrospray ionization from standard 48, 96 or 384-well plates with minimal investment in hardware and utlilizes custom developed open source software that provides both autosampler control and analysis of raw extracted data from the mass spectrometer. We show a high level of correlation among results obtained with RAVE coupled MS, acoustic ejection (Echo) MS, and liquid chromatography coupled MS (LCMS) on 384 biocatalytically driven reactions. We additionally utilize RAVE MS on an array of 96 chemocatalytic reaction conditions to show that, while direct MS analysis can be challenging in complex mixtures, simple dilution followed by direct injection is often sufficient for analysis. With these results, we demonstrate the potential for RAVE MS to be utilized as a low-cost, low barrier to entry tool for rapid direct-inject MS analysis.
A growing body of research and development has focused on developing faster analytical methodologies to match the pace with which samples are generated.5,6 In screening workflows where photometric methods of detection are available and unambiguous, spectroscopy remains the dominant technology for rapid analytics in HTS.7–9 However, when assays are either too complex or challenging to read directly with fluorescence or absorption (as is often the case in pharmaceutical chemistry), liquid chromatography (LC) predominates as the analytical tool of choice.10 Unfortunately, chromatographic separations greatly increase the time required to read samples generated in HTS, as each reaction must be sampled, injected, eluted, and the column regenerated for the next analytical run to begin. When scaled across hundreds or thousands of reactions, this analytical burden becomes a critical bottleneck, and a growing need for selective, rapid, and direct analysis of complex reactions has become evident.11
Direct analysis by mass spectrometry has emerged as a key technique that bridges the gap between the higher analytical throughput of spectroscopic interrogation and the richness of analytical data provided by chromatographic separations.12–14 The growing capabilities of mass spectrometry to both isolate and monitor individual species in complex reaction mixtures enable direct sample analysis without the prerequisite of chromatographic separation.15–17 The speed with which direct MS analysis may be implemented has in turn necessitated the development of new methods for sample introduction directly to the MS source.13,18,19
Rapid sample introduction to mass spectrometry has taken shape in two major approaches – analysis via direct sampling such as in acoustic ejection (Echo) MS and desorption electrospray ionization DESI, and analysis via reformatting into an MS-interfaceable format. Both Echo and DESI enable sampling at a throughput of less than one second per sample with minimal sample preparation but require highly specialized ionization equipment to execute.20 Matrix assisted laser desorption/ionization (MALDI) and direct analysis in real time (DART)20–29 utilize a reformatting strategy, with sample spotted onto a substrate plate with ionization matrix or mesh for ionization, respectively. This reformatting process often becomes the bottleneck for these methods, and utilization is hindered by the speed and reliability of the reformatting process, as well as the complexity of sample preparation necessary to interrogate reactions.26,29,30 These tools are also costly analytical options due to the distinctive (and often, non-standard) methods by which they achieve rapid ionization of their reformatted samples, requiring specialized ionization sources and expensive additional hardware.
The alternative to reformatting samples for rapid MS analysis (that does not require specialized sampling interfaces) is the direct injection of samples into electrospray ionization sources. The simplest way to automate this for HTS is to take advantage of existing auto sampling systems to perform serial direct injections.25 The drawbacks to this approach are sample-to-sample carryover, sample matrix effects, and the low throughput at which a traditional LC autosampler can send liquid to the analyzer and reset for the next injection (1–2 min).31,32 Matrix effects can be reduced or eliminated using online solid phase extraction at the front end of a direct inject system, and employment of an autosampler optimized for direct inject workflows significantly reduces the timescales of direct-inject MS. The Agilent RapidFire MS is a commercialized platform for achieving these goals.33
The Echo MS (commercialized by Sciex) can be used to analyze samples as fast as 1/s by coupling heavy dilution in an open port interface to acoustic ejection from a well plate to deliver sample to a continuous dilution stream.34,35 Due to the low carryover achieved by acoustic ejection, the minimization of matrix effects via dilution, and the rapid sample introduction, Echo MS has rapidly become the preferred method for direct injection mass spectormetry.30,36 Despite these benefits, the cost of both the RapidFire and Echo MS systems is high, rendering it beyond the reach of many academic and small-scale laboratories and necessitating the development of more accessible alternatives to achieve similar efficiency.
In this work, we describe the principles of operation for robotically automated vacuum enabled (RAVE) MS, a simple direct injection mass spectrometry tool. We detail its experimental usage in both bio- and chemo-catalysis screening and demonstrate its utility as a rapid and low-cost tool with a low barrier to entry. We directly compare the operation of RAVE MS on a single quadrupole mass spectrometer to the performance of commercial direct-inject tools such as Echo MS, demonstrating strong agreement on a flexible, standard, and accessible platform. RAVE MS enables rapid qualitative assessment of relative reaction yields, and throughputs that, prior to this point, came only with significant capital investment for most laboratories. This workflow can be implemented using commercially available ionization sources and autosamplers and is operated using open-source Python libraries. We share here two use cases for RAVE MS and envision additional utilization across a wide range of MS-enabled reaction monitoring and screening applications.
In our work, we utilized Agilent's CE-ESI (G1607-60002) triple tube source (Fig. 1D) that utilizes a two layered co-axial flow of sheath buffer and drying gas around an inserted capillary (Fig. 1E) to achieve electrospray ionization.37 The sheath flow ensures a steady stream of liquid for Taylor Cone formation at the emitter tip, irrespective of the relatively low flow rates typical of capillary electrophoretic flow.38 The combination of liquid sheath flow and heated drying gas has been previously reported to exert a hydrodynamic force on the liquid within CE capillaries, an effect postulated to be due to the Bernoulli effect caused by the high velocity of gas flow at the source tip.39 This hydrodynamic force drives samples into the mass spectrometer through vacuum suction, enabling continuous and reliable infusion of samples.
The capillary was cut to a length of ≤65 cm to minimize capillary back pressure – the resistance of the system to flow when filled with sample. The robotic autosampler was placed on top of the mass spectrometer, as close to the source as possible (Fig. 1A). The capillary was threaded through the sampling needle on the Sielc Alltesta autosampler and held in place by a capillary fitting in the modified robot arm (Fig. S5). The robot was then used to immerse the sampling needle with the protruding end of the capillary first in fluorinated oil to provide sample separation, and then in sample, alternating between the two liquids during operation.
Additionally, the lack of separation on the front end of the analysis in RAVE made it important to carefully account for differences in reaction composition across samples, which affects the ionization efficiency and suppression of the target analyte.43,44 Samples with significantly different background matrices were therefore challenging to compare directly to one another due to the differences in ionization efficiency across samples. We resolved this by tuning our dilution factor for each sample matrix screened.
To analyze RAVE data, we first extracted the SIM trace on all utilized channels. This produced up to four files with a time and an intensity read for the entirety of the run. To detect all peaks in each run, it was critical that at least one of these four files showed peaks for every sample injection to ensure that wells without target analyte present were accounted for. This signal trace, which we referred to as the marker trace, could be extracted from a read of the substrate signal because all reactions contained this analyte. However, in instances where substrate signal proved challenging to monitor, background ions from the remaining matrix of injected samples or internal standards in the dilution matrix could be used instead to monitor the injection time for each sample. With the help of this marker trace, well sampling times could be identified and applied to the other three extracted spectra to measure target analyte signal intensity during screen analysis (Fig. 2 steps 1 and 2).
Using custom-built software incorporated into the RAVE MS autosampler control user interface, we analyzed extracted files for each RAVE run. The marker trace was uploaded to the application, where the user could adjust the peak width, minimum intensity, plateau size, and the minimum distance between peaks (Fig. S4). These arguments were then used to identify each of the peaks from a screen by utilizing the SciPy peak picking algorithm.45 To aid in peak identification, the data could be smoothed prior to peak picking by calculating the moving average of the peaks or applying a Savitzky–Golay filter.46 This generated a list of retention times that were then applied to each of the other uploaded traces to mark the location of each sample. The average ion intensity in the region identified from the marker file was then used as the reported value for the analyte of interest at each marked retention time, generating a list of values for each sample. These values were then used directly as a measure of relative intensity (Fig. 2 steps 3 and 4).
It is important to note that, due to the on/off nature of sampling in RAVE MS, peak area alone was a poor measure of analyte response because injection volumes were controlled only through the hydrodynamic pull to the source, and the volumes of RAVE injection can therefore vary from sample to sample. A larger injection volume will spray for longer than a smaller one, and raw peak areas will therefore unintentionally bias the response towards larger injections. We addressed this by both reporting an average peak height for each peak (corrected for peak width at half maximum) and a maximum. We have found that both correlated well to the relative performance of individual reactions (Fig. S7).
Lysine has recently been reported as a smart amine donor for transaminase reactions due to the thermodynamically favored cyclization of the 6-amino-2-oxohexanoic acid side product to the corresponding piperidine-2-carboxylic acid imine.48 In their original publication, our colleagues utilized LCMS to assess a panel of 384 commercially available, distinct, wild-type enzymes for their compatibility with lysine as an amine donor. Their work looked at the transamination of pyruvate to alanine to assess the preference of each panel member for L- and D-lysine in the reaction. This initial analytical endeavour required >27 h of LCMS run time, even with an optimized LC method less than 4 min in length. The stereoselectivity of each panel member was also assessed by running the reaction separately with a feed of L- and D-lysine, doubling the analytical burden. We therefore set out to demonstrate the utility of RAVE MS for the same screen to reduce analytical run time and achieve comparative quantitative analysis.
We repeated the work of our colleagues with L-alanine to generate samples that could be directly compared across analytical platforms. Reactions were performed (SI section 4b) in a total solvent volume of 50 μL within a 384 well plate pre-plated with 1 mg of fermentation powder from each enzyme variant. After a 24 h reaction time, the plate was sampled and diluted (5% v/v) into a quench solution of 1:
1 MeCN/100 mM potassium phosphate buffered to pH 7.0. Quenched samples were centrifuged to spin down any particulate and reduce the opportunity for capillary clogging during RAVE MS analysis.
To rapidly assess the analytical sensitivity of RAVE MS to alanine and pyruvate concentrations, stock solutions in the quench and diluent solution were created. These were diluted further in MeCN/potassium phosphate buffer to create calibration curves for alanine and pyruvate that approximate the background matrix of the panel samples (Fig. 3A). Prior to obtaining the calibration curve, direct injections of both alanine and pyruvate (1.25 mM) were read in full scan mode from 50–160 m/z in positive and negative ion mode to determine the optimal SIM trace to monitor. These experiments showed that negative ion mode monitoring resulted in clear signal for the target analytes while positive mode did not (SI – SIM target selection). Given this, negative ion mode was used to monitor alanine production (88 m/z) and pyruvate (87 m/z) consumption.
The calibration plot for alanine can be seen in Fig. 3B. A strong linear correlation (R2 = 0.995) between RAVE MS peak height and concentration was obtained, indicating that average peak height can be used to accurately compare analyte concentrations across samples. Limits of detection for alanine and pyruvate in RAVE injections were found to be 99 μM and 68 μM, respectively (Fig. S11).
Following calibration, we analyzed the 384 well plate of transaminases using RAVE MS, LCMS, and Echo MS. After completion of RAVE MS analysis, peak heights for each monitored ion were extracted using our custom data analysis software. The extracted RAVE MS peak height averages and Echo MS peak areas are plotted in Fig. 3A and exhibit a strong correlation (R2 = 0.912). Correlations between RAVE MS and LCMS were also in alignment (R2 = 0.922, Fig. S10), further validating the RAVE MS assay for HTS and hit selection. The correlation between pyruvate signal among the three analytical methods was similarly robust, with R2 values above 0.970 for all comparisons (Fig. S10). Prior to visualization, the product peak height and peak area were Z-score normalized to facilitate comparison between RAVE MS and Echo MS. As responses from RAVE MS and Echo MS have different units and ranges, Z-score normalization allows for meaningful cross-assay comparison by placing the data on a similar scale. These observations highlight the effectiveness of RAVE MS as a rapid and reliable analytical tool, offering significant time savings and strong agreement with established mass spectrometry techniques and enabling high-throughput screening in biochemical assays and providing an exciting path forward to address a critical bottleneck in evaluating the many unique engineered enzymes accessible through directed evolution.
In this investigation, we used RAVE MS to assess the performance of 24 phosphine ligands, encompassing both monodentate and bidentate scaffolds, in combination with 4 bases, to identify the optimal reaction conditions for the C–N cross-coupling of 4-methoxyaniline and 4-bromo-1,1′-biphenyl. Our aim was to demonstrate utilization of RAVE MS to rapidly triage reactions for catalyst/ligand selection and optimize the reaction with minimal method development and analytical run time.
24 pre-plated phosphine ligands were combined with 4 pre-plated bases (NaOtBu, Na2CO3, Cs2CO3 and K2CO3), resulting in the generation of 96 reaction conditions in a plate format, as depicted in Fig. 4B. Following the completion of the reactions (see SI, Pd-catalyzed Buchwald–Hartwig coupling), two plates were prepared for analysis: one for RAVE MS and the other for LCMS, serving as an orthogonal analytical approach for the validation of our RAVE MS results. Analytical plates were prepared by diluting reaction plates with a solution of internal standard. For analysis via RAVE MS, the peak heights corresponding to the molecular ions formed by the product ([M + H+] = 276.14) and lysine ([M + H+] = 147.11) internal standard from the dilution were tracked in positive single-ion- monitoring mode. For LCMS analysis, the extracted ion chromatogram for the target product was integrated over its elution.
To determine the most suitable concentration range for plate dilution and to perform similar calibration studies to those described in our first case study, we synthesized an authentic product standard and subjected it to analysis via RAVE MS in the expected dilution matrix. At concentrations between 0–60 μM, we observed a linear response (Fig. S18). We subsequently diluted our reaction samples to a concentration of 25 μM product at 100% conversion and compared the RAVE response to LCMS conducted at an analyte concentration of 125 μM at 100% conversion. A strong linear correlation of R2 = 0.9461 was achieved (Fig. 4A).
The results obtained from both assays were visualized as 96 well heatmaps to evaluate the performance of the screen and to determine any assay-to-assay variability. Reactions analyzed on RAVE MS show high levels of product formation when Cs2CO3 is used as the base in concert with the following ligands: XPhos (L1), SPhos (L2), BrettPhos (L7), CataCXium A (L8) and [(tBu)3P][HBF4] (L10), which is echoed in the LCMS data. This indicates that RAVE MS is suitable for hit identification, even in systems that feature a high degree of variation in reaction matrix composition.
Analysis of the 96 well plate took >570 min to complete when using a 6 min per-well LCMS method. In contrast, analysis using RAVE MS took only 3.5% of that analytical time at only 20 min per plate. We found that the screening data provided by RAVE MS enabled facile identification of reaction conditions of interest for further investigation. The rapid and efficient analysis provided by RAVE MS on these chemocatalytic reactions underscores its potential as an accessible analytical tool across chemistries for high-throughput screening, with its impact only likely to increase with the size of screen conducted.
It is important to note that because RAVE MS does not incorporate the chromatographic separation that LCMS does, it is not capable of distinguishing isomers or isobaric species present in the sample matrix. It was important for this work to validate the target mass and calibrate the response in the reaction matrix for each case study (see SI 5b) to ensure that the mass analyzed corresponds directly to the target product of interest. RAVE MS in its current form would be less well suited to reactions that produce stereoisomers, but one could easily envision using it to triage catalysts that improve overall yield prior to more detailed selection for stereoselectivity. This approach is particularly attractive in protein engineering workflows that rely heavily on the screening of large catalyst libraries with (initially) low substrate affinity.
Future integration of the RAVE autosampler with higher-resolution MS tools and/or gas phase separation and fragmentation—such as those enabled by triple quadrupole (TQ), Orbitrap, or ion mobility (IM) mass spectrometers—may also help expand the capabilities of RAVE-MS in the near future. For instance, the μM limits of detection reported herein could likely be further improved by utilizing more advanced MS systems, with fragmentation and selection enabling gas-phase elimination of competing species and higher sensitivity to low-abundance analytes. This has been demonstrated already by commercial echo-MS systems (the one used in this work is configured with a triple quadrupole). The utilization of the standard Agilent source geometry for RAVE MS makes adaptation across platforms a relatively simple pursuit.
The analytical pipeline described in this work outlines the development and implementation of RAVE MS as a platform that can be readily incorporated into high-throughput screening assays. At our company, we have already begun to use it to accelerate hit identification within the process research and development space – deploying it for rapid screening on tight timelines to power our complex protein engineering portfolio. With the expansion into chemocatalytic screening, we anticipate further proliferation across small molecule process development in the coming years.
The data supporting this article have been included as part of the SI. The code for RAVE MS can be found at https://github.com/MSDLLCpapers/dre-rave-ms.
This journal is © The Royal Society of Chemistry 2025 |