Open Access Article
Ricardo Monge Neriaa,
Rachel A. Saylorb and
Lydia Kisley
*ac
aDepartment of Physics, Case Western Reserve University, Cleveland, Ohio 44106, USA. E-mail: lydia.kisley@case.edu
bDepartment of Chemistry and Biochemistry, Oberlin College, Oberlin, OH 44074, USA
cDepartment of Chemistry, Case Western Reserve University, Cleveland, Ohio 44106, USA
First published on 1st May 2026
The stochastic model of chromatography mathematically represents the molecular mass transfer that occurs during a separation. Single-molecule microscopy allows for direct visualization of molecular analytes adsorbing within column materials. Experimental single-molecule data and the stochastic model can predict elution profiles using the adjustable variable
m, the average number of adsorptions per molecule. Previously, only a single
m value was used to match either peak location or shape, while the effects of adjusting np, the number of modeling points, have not been studied. Here, we systematically explore a wide range of these two variables in the stochastic model to determine if it is possible to optimize agreement between modeled single-molecule and high-performance liquid chromatography (HPLC) chromatograms. A metric to quantify chromatogram agreement is introduced by taking the weighted difference in the elution time and shape of the chromatograms. We determine the non-linear effects of
m and np on peak height, width, and asymmetry and link the observations to the molecular behavior. Applying our approach to experiments with variable flow rate shows that increased sampling of rare, long time adsorption events affects agreement between simulated and HPLC elution profiles. Finally, we make quantitative recommendations that the single-molecule experiments should sample available binding sites following an exponential association model and that np must be > 1.5 ×
m to achieve accurate results. Overall, we verify that the current form of the stochastic model based solely on mass transfer is unable to simultaneously match both peak location and shape and make recommendations for future improvements to the model by separation scientists.
In contrast, microscopic models of chromatography view the separation process as the result of a statistical distribution of single-molecule-scale behaviors. This bottom-up approach, originally proposed by Giddings and Eyring,5 models the behavior of individual solute molecules as they travel through the column. The microscopic (or stochastic) theory models the migration of single molecules (SM) as random walks, where molecules adsorb to the stationary phase and then desorb. The adsorption–desorption events are treated as a Poisson process, with exponentially distributed residence times in each phase. The final chromatographic peak shape is then the result of the summed behavior of a vast number of individual molecules, with the elution profile described by the probability density function of the retention times for all the molecules.
However, while both the macroscopic and microscopic approaches are complementary and, under linear conditions, can be shown to be mathematically equivalent,12–14 the microscopic description is less widely used. The microscopic model has been expanded into a Lévy process representation,6 and applied to study chromatographic retention in relation to experimental single-molecule microscopy observables15–19 and Monte Carlo simulations.7,20 Previous applications7,8,16,18,21 have shown that rare, long-lasting adsorptions can have a significant influence on elution time and the skewness of chromatograms. However, these studies have struggled to obtain consistent agreement between the experimental on-column elution and the SM-based simulations, with some inconsistencies in the use of the model, as well as the adjustment and inclusion of free parameters.
Here, we explore if the Lévy process representation of the stochastic model can match both peak elution time and shape of HPLC on-column elution peaks from single-molecule adsorption data through the iterative variation of model inputs to reveal nonlinear behaviors. We first review the history, theory, and derivation of the Lévy process representation of the stochastic model and show how simulated chromatograms vary depending on the average number of SM adsorptions (
m), as well as the number of modeling points (np). Systematically scanning over a large range of variables that represent realistic column lengths, we quantify nonlinear trends in peak height, width, and asymmetry with respect to both parameters, as well as under sampling that can occur due to the choice of np. Making use of experimentally measured SM adsorptions on commercial stationary phase particles and associated HPLC on-column experiments,19 we implement an error minimization approach to match stochastic modeled results to the experimental HPLC data and explain the sources of discrepancy. Finally, we discuss sampling issues that arise from the experimental SM conditions. Overall, we show how free parameters can be adjusted to match specific characteristics of the experimental HPLC elution profiles, while revealing nonlinearities in the model behavior with respect to the average number of adsorptions, as well as under sampling that can occur in both the model and SM experiments. We make recommendations for total data collection and np to ensure adequate sampling in future single-molecule measurements and modeling of chromatography.
The trajectories of molecules were defined in terms of probabilities of adsorption on the surface of the stationary phase and a constant mobile phase velocity that is maintained for the overall column. Molecules through the chromatographic bed spend an average fly time (τm) in the mobile phase between adsorption events and an average sojourn time (τs) adsorbed on the surface of the stationary phase. These two quantities (τm, τs) are randomly distributed and account for the unique residence times of each individual molecule with its own trajectory. From this, the chromatogram then becomes the probability-density function of retention times for individual molecular trajectories, modeled as a Poisson distribution.5,24
Adsorption–desorption kinetics are related to the sojourn times in terms of mass transfer rate constants. These are defined as the probability per unit time that a molecule enters (
, for adsorption) and leaves (
, for desorption) the stationary phase, with a retention factor
. Applying these to a Poisson distribution of molecules that spend time t0 (unretained, also known as void time) in the mobile phase, and time tR (retained) in the stationary phase results in the probability density function P(t) that defines the chromatographic peak shape as:
![]() | (1) |
![]() | (2) |
.5,24
Unfortunately, at the time when Giddings and Eyring first published their work (1955), the stochastic theory saw little application due to computation methods being incredibly laborious and it being experimentally impossible to measure the necessary molecular observables for practical implementation. By 1963, D. A. McQuarrie worked to expand the model, and provide a more usable solution to the many sites version of the stochastic model.26 Then, in 1999, the multiple-site model was further refined to account for more complex molecular interactions by using a characteristic function (CF) definition of the equations by A. Cavazzini et al.27 Their work depicted the chromatogram as originally defined by the Giddings–Eyring expression in eqn (2) by the inverse Fourier transform of the CF and was successfully used to generate model elution curves. The same group, under A. Felinger, then continued to develop the model, including creating a novel Monte Carlo simulation based approach for nonlinear chromatography.28 By 2003, L. Pasti et al.29 provided experimental validation of the stochastic theory by applying it to size exclusion chromatography (SEC) column experiments eluting multiple sized polymer standards. However, at this point the CF-SEC was complex and difficult to decipher, and the lack of experimental single-molecule observables made it challenging to further bridge the gap between the ensemble-scale chromatograms and the theory.
Under the Lévy process formalism, the CF expression exists as a frequency (ω) distribution of different solute retention times and is re-written as a discrete sum to relate this formalism back to SM experimental observables, e.g. adsorption times. Defining the original Poisson probability distribution that during time tm a given molecule will adsorb rm times as:
![]() | (3) |
![]() | (4) |
, the average time spent in the mobile phase between adsorptions. Then, for the distribution of random rm values, the CF is defined in the frequency domain as:
![]() | (5) |
For the Lévy process representation,15 the CF is defined as:
![]() | (6) |
![]() | (7) |
The CF function (an array of values) is joined by the mirror (i.e. flipped vector left-to-right, or a 180° rotation) of its complex conjugate, and an Inverse Fast Fourier Transform (IFFT) is applied to return the chromatogram distribution.
![]() | (8) |
In application, ΔFS(τS,i) is defined by the measured (normalized) frequency distribution of observed single-molecule adsorption times (τS,i), and ω is defined by the number of model sampling points (np; i.e. number of points in the final chromatogram) as:
![]() | (9) |
Importantly, only the real part of the IFTT is kept, and it is scaled by
to account for sampling points, with the time axis defined by steps of ΔτS (the time resolution of the measured single-molecule observations).
This Lévy process formulation was then applied by Pasti et al. to experimental single-molecule data, using eqn (8) with experimental measurements of adsorption–desorption times obtained by M. J. Wirth and D. J. Swinton30 and S. H. Kang, M. R. Shortreed, and E. S. Yeung31 through single-molecule spectroscopy on stationary phase-mimicking thin film surfaces. The model results were compared to Wirth and Yeung's associated on-column experimental elution results. The work by Pasti et al. represented the first successful application of the stochastic theory in direct connection to the experimental observables from single-molecule fluorescence microscopy. The stochastic model with the Lévy process formulation was an essential bridge between a molecular perspective of chromatography and ensemble separations via the model, single-molecule microscopy, and ensemble HPLC and electrophoretic data.
Agreement between the model and ensemble results were limited by differences in sample conditions and the inaccessibility of the parameter
m from single molecule experiments. In practice,
m, the average number of adsorptions throughout the column was left as an adjustable free parameter which had a heavy influence on the resulting elution time and shape of the model peaks. As such, Pasti et al. approached this disconnect by defining two different
m values to try to either match experimental elution peak shape (chosen by inference and trial-and-error), or elution time by defining
m in relation to the ratio of the experimental peak elution time (tS) and the average adsorption (dwell) time (τs) from the SM measurements as:
![]() | (10) |
However, it was not clear how these definitions of
m affect other aspects of the model peak, with Pasti et al. noting that “[t]wo simulations were thus performed aiming at attaining the best agreement, for one, in peak shapes and, for the other one, in peak positions”.15 Further, the experimental single-molecule data was limited to model thin film surfaces which can have different chemistries and steric pore structure than the more complex, commercially-prepared stationary phases packed in columns typically used in chromatography.32
Two open questions were left about the applicability of the Lévy representation of the stochastic model of chromatography. First, is it possible to optimize and achieve agreement between both shape (width and asymmetry) and position of a chromatogram and single-molecule measurement? And second, was the inability to match the elution profiles a result of experimental differences in the thin film samples compared to the chromatographic packing material, the adjustable variables
m and np used, or a limitation of the model itself? To tackle these questions, here we explore the stochastic model and accuracy in relation to packed bed (fully porous spherical particles) liquid chromatography columns and recent single-molecule microscopy measurements in the exact same stationary phase particles.19
:
1
:
6 volume ratio, respectively) bath at 70 °C for 90 s, water rinsed, dried with nitrogen gas (5.0 grade, Airgas), then plasma cleaned in an O2 (Industrial grade, Airgas) plasma cleaner (PDC-32G, 115 V, Harrick Plasma) at 140–280 Torr, medium power, for 2 minutes. Different water suspensions of 0.1 wt% solids were used to deposit the stationary phases by drop casting 8 μL volumes unto the glass slides and allowed to dry overnight. The slides were then covered by silicon flow cells (Hybriwell, 13 mm diameter, 0.15 mm depth, Grace Biolabs) and rinsed multiple times with water, then left to rehydrate for at least 2 hours before imaging.19
Single-molecule measurements of adsorption were achieved with highly inclined and laminated optical sheet (HILO) fluorescence imaging of nanomolar concentrations of fluorescent rhodamine 6G dye (99%, Fisher) under flow at ambient 22 °C, as previously described.32 The instrument consists of an Olympus IX-73 inverted microscope, with a 100× magnification oil immersion objective (Olympus, 100×, NA 1.49, UAPON100XOTIRF), and a CNI diode laser (532 nm, MGL-III-100 mW) used for excitation. The laser light is set to a 77° exit angle from the objective to achieve HILO. Imaging was done with 1 nM rhodamine 6G solutions in 20 mM HEPES buffer (Biotang Inc, pH 7.33) passed through the flow cells using a syringe pump (NE-1000, New Era Pump Systems Inc.). Flow rates were varied between 1 to 100 μL min−1, corresponding to linear velocities 0.06 to 6.0 cm min−1 (Table S2). An EMCCD camera (iXon Life 897, Andor) collected the emission signal at 30 ms exposure, 400 electron multiplying gain, in Frame Transfer mode under 532 nm laser excitation of 10 mW (379 W cm−2) at the sample. Single molecules were identified and analyzed using a home-written, publicly available33 MATLAB (2022b) code that uses 2D Gaussian fitting strategies, paired with radial symmetry34 centroid position refinements, to obtain super-resolved, single-molecule locations and adsorption times (Fig. S1).
:
30% HEPES) at 0.1 mL min−1 for 2 h prior to experiments.
For comparison to microscopy measurements, rhodamine 6G (3 µL, 100 µM in HEPES buffer, from Biotang Inc) was run with an isocratic mobile phase of 70% EtOH
:
30% HEPES, and varying flow rates (see Table S2) up to 0.5 mL min−1, and a column temperature of 30 °C. Detection was accomplished using a photodiode array over the range 200–600 nm at 5 Hz. All elution runs were done in triplicate. Resulting chromatograms (Fig. S2) were visualized in Shimadzu Lab Solutions software, extracted at 530 nm (4 nm bandwidth), exported, and further analyzed.
m to match HPLC elution curve characteristicsA modified version of the stochastic model implementation from Pasti et al. was implemented in MATLAB (2024a), which can be found at (Zenodo https://doi.org/10.5281/zenodo.20272588).18,19 We systematically tested elution profile behavior under varying np (50 to 2000, variable step sizes of 10, 50, and 100) and
m (1 to 2000, variable step sizes of 1, 5, 10, and 100) and with different single-molecule dwell time (i.e. adsorption) distributions (see Fig. S1) obtained from microscopy experiments at varying flow rates (Table S2). An option to select
m between the theoretical value defined by eqn (10), or an error minimization fitting with respect to experimental chromatogram elution and shape between the model elution and HPLC data was implemented in our code. All results presented here use the error minimization approach unless stated otherwise, or if
m is manually selected/defined for the purposes of testing model outputs. The error minimization scales as three varied forms of error parameter, E, calculations for (1) elution time, (2) peak width, and (3) peak asymmetry defined as:
![]() | (11) |
![]() | (12) |
![]() | (13) |
Here the LC subscript labels correspond to the HPLC bulk chromatography measurements compared to the model subscript referring to the single-molecule model run parameters. We characterized the elution peak time and shape as shown in Fig. 1. The asymmetry factors (As = b/a, red) are defined by the left (a) and right (b) peak widths with respect to the peak center (max), with the width (w0.1 = a + b, green) defined using the tangential method at 10% signal (0.1 of the normalized peak height, Pmax, blue). For the experimental elution time (tS,LC), we defined it in relation to the void time (t0,LC), signaled by the dip caused by the unretained volume during sample injection (Fig. 1b), and the elution peak location (tR,LC). Thus, the corrected experimental elution time is defined by tS = tR − t0, where time t0 (unretained, also known as void time) in the mobile phase, and time tR (retained) are used. Meanwhile, the model peak elution time (tR,model) is defined as the peak mode, which coincides with the peak maximum for low asymmetry (near As = 1.0) peaks. Note that the model peak is simulated using time units (ΔτS, from the time resolution of SM observations, 30 ms in our case) of ms, but the resulting time axis (i.e. the time delta between model points, np) is defined by unit conversion
.
There is no void time to correct for in the model peaks. The net fitting error for minimization is defined as the sum of the desired fitting parameters, weighted by the parameter Wi, as:
![]() | (14) |
m
m to avoid any model sampling issues (see section 4.2), and a full dataset of SM observations for our 0.29 cm min−1 flow velocity SM experiments.
Most of the peak characteristic parameters vary non-linearly in relation to increasing average number of single-molecule adsorptions. We observe a consistently increasing delay in elution time (tR), linearly proportional to the value of
m (Fig. 2c, black) and peak widening (w0.1, Fig. 2c, green) following what matches a two-component exponential association curve shape (see Fig. S3 for curve fits). Meanwhile, both the peak height (Pmax, Fig. 2c, blue) and asymmetry (As, Fig. 2c, red) follow a two-component exponential decay (see Discussion). Initial observations from varying
m over a large range (5–2000) show that the elution time relation to
m is directly proportional, but suggest that wide and high asymmetry peaks are not accurately represented outside of low (≪200) or very high (≫200) average number of adsorptions. These factors indicate that a combination of long adsorption times and a high frequency of adsorptions can significantly dictate the peak shape.
m on model elution peaks reveals a threshold for adequate sampling regimes and consistent
m dependance
m, the sampling of the resulting elution profile is dictated by the user-defined variable, np. We simultaneously quantify how np and
m affect the elution time, peak asymmetry, width, and peak height of the modeled elution peaks (Fig. 3). The elution peak shape parameters demonstrate that in small np conditions (np ≪
m), both elution time (Fig. 3a) and peak height (Fig. 3b) show variable rates of change with respect to
m depending on which region of the np model space sampling they are in. Closer inspection (Fig. S4) suggests that the observed discontinuities (Fig. 3a, red arrows) correspond to undersampling that occurs in Fourier space, scaling as approximately 1.5 ×
m, which results in non-real solutions to the inverse Fourier transform of the CF in eqn (8). Meanwhile, both peak width (Fig. 3c) and peak asymmetry (Fig. 3d) show no direct relation with np in comparison to
m, except for discontinuities in the under-sampled range (np < 1.5 ×
m). All parameters show discontinuities except for peak height due to our definition as signal maximum to avoid simulation errors. Notably, peak height (Fig. 3b) is the most sensitive parameter to both variables.
m to accurately match HPLC elution times but does not capture peak width and asymmetry
m can be systematically adjusted to match experimental HPLC elution times with stochastic model data from measured SM adsorption time distributions. HPLC data was collected of the same rhodamine 6G analyte and Cellulose B stationary phase packed within a column as was imaged via SM microscopy. The
m values are determined through our error minimization fitting with respect to three chromatogram parameters (position, width, and asymmetry) compared between the model elution and HPLC data as detailed in section 3.3.
Using the SM dwell-time datasets of rhodamine 6G solution with flow velocities of comparable magnitude to the HPLC conditions (Table S2) through the stationary phases (Fig. S1), we test the model fitting results of using an error minimization approach, defined by eqn (14) when we vary the weighting scheme of the error calculation. The weights were defined by the values of Wtime, Wwidth, and Wasym (which we hereto present in said order), with set values of 0, 1, or 2 to prioritize certain elution time vs. shape characteristics (e.g. ‘W(2,1,1)’ corresponds to weights of Wtime = 2, Wwidth = 1, and Wasym = 1). For error minimization fitting of the model, we tested the measured HPLC data in comparison to the SM data with the most similar flow velocity (Fig. 4). This corresponded to the 2.9 ± 0.5 cm min−1 SM data for all HPLC conditions except for the 5.0 ± 1.0 cm min−1 HPLC runs, which better coincide with the 6.0 ± 1.0 cm min−1 SM experimental conditions (Table S2). By fitting the model with different parameter weights through adjustment of the free parameter
m, we obtain multiple simulated elution peaks that variedly match the HPLC elution time and shape (Fig. 4a).
![]() | ||
Fig. 4 Elution time-based error minimization fitted m values for SM data of varying flow velocity. Simulated chromatograms of eluting rhodamine 6G based on SM adsorption kinetics distributions obtained at varying flow rate conditions were fitted to experimental HPLC elutions (of varying flow velocities) through error minimization using the free parameter m. (a) The resulting simulated peaks, showing agreement in elution time with the HPLC experimental data. The model weights were adjusted based on the parameters of elution time, width, and asymmetry (as shown in bottom panel legend). (b) The obtained m from fitting each flow condition based on the four tested weighting schemes (legend matches order in (a)), compared to the m values predicted by eqn (10). (c–e) The elution peak shape parameters from the fitting show that model peaks closely match the HPLC experimental elution time (c, dotted black line), but much less so for the HPLC width (d, dotted green) and asymmetry (e, dotted red). The value of np was fixed to 5000 for all simulated chromatograms, and the weights set as shown in the legend, written as W(Wtime,Wwidth,Wasym). | ||
The best agreement between the SM model and HPLC data (Fig. 4a, brown fits) occurs when the model is weighted to prioritize elution time (i.e. Wtime > Wwidth and >Wasym). When only elution time is considered (W(1,0,0)), the obtained
m values from error minimization without a-priori knowledge end up closely matching the theoretical values predicted by eqn (10) (Fig. 4b), with good matching of the experimental elution times (Fig. 4c). The agreement based on elution time supports Pasti et al.'s derivation and parameter definitions (eqn (10), Fig. S5 and S6). By adjusting the error minimization weighting for width (Fig. 4d) or asymmetry (Fig. 4e), the HPLC peak shapes can be better approximated by the simulated chromatograms, but the elution time and shape could not be simultaneously matched by adjusting the value of
m (Fig. S7). When the weights prioritize only peak shape, ignoring elution time, as W(0,1,1), the resulting fits show high agreement in peak width (Fig. 4d, dark aquamarine), but less so for asymmetry (Fig. 4e). Interestingly, the gap between model and HPLC narrows for both width (Fig. 4d) and elution time (Fig. 4c) as the flow velocity increases, regardless of the chosen weighting scheme. Notably, asymmetry was the parameter most poorly captured by the model (Fig. 4e), with it having the least influence over the other weights. This corresponds with the inverse relation seen between these parameters in Fig. 2c.
, with 0.999 R2). This can be attributed to the increasing saturation of available adsorption sites on the imaged surfaces. As the number of sampled single molecules increases, the standard average and standard deviations in measured dwell times (Fig. 5b) also increase, despite the mode remaining constant. This would align with the wider and more asymmetric simulated peaks with increasing SM experimental flow rate (Fig. S5–S8). Furthermore, comparison of the linear velocities measured in the SM experiments and those in the HPLC column runs (Table S2) shows that different velocity regimes were approached, with the higher SM flow rates being closer to the HPLC conditions.
SM data collected at increased flow rates has increasing tailing more similar to the HPLC data. We observe that higher SM flow velocities result in wider peaks (Fig. S5D green, and Fig. S7D green) and increased asymmetry (Fig. S5D red, and Fig. S7D red), regardless of whether it is fitted by elution time or shape. The very distinct tailing visible in the simulated peaks that arises at the higher flow velocity datasets (Fig. S5A, S5B, S7A, and S7B, blue lines) demonstrate the significant influence that even rare SM adsorption times have on the model. We also find that these trends remain true across a wide range (5 to 2000) of
m values (Fig. S8), with datasets of higher SM flow velocities showing drastic differences in asymmetry when
m is large (Fig. S8B).
m), can be directly linked to physical, molecular behaviors occurring in a column. The elution time scales directly to
m (Fig. 2c, black), while peak width (w0.1) follows a two-component exponential association function (Fig. 2c and S3, green) with respect to
m. On a column, an increasing
m corresponds to an increasing number of available adsorption sites, which can occur with either an increased density of sites, increased accessibility, or a longer column. Then, the exponential association behavior matches known peak broadening trends for longer columns39 as well as recent Monte Carlo simulations8 that show that increased site densities and column lengths can lead to similar non-linear peak broadening. In terms of equilibrium adsorption behavior, the increasing
m can be likened to increasing occupancy of adsorption sites, which scales non-linearly, as seen in Langmuir and similar isotherms.40 Isotherms like the Jovanovic–Freundlich model combine kinetics with allowed surface heterogeneity and are often written in terms of exponential association functions of the form q(C) = q0(1 − e−Cn).41
In the transition from equilibrium modeling to adsorption kinetics, the Langmuir isotherm itself describes equilibrium (the steady state), while the approach to that equilibrium is often mathematically characterized by exponential functions, particularly in complex or heterogeneous systems.42 Relatedly, we observe that simulated elution peak heights (Pmax, Fig. 2c and S3, blue) follow a two-component decaying exponential function with increasing
m, which likely arises from the same number of measured single molecules (i.e. constant concentration) being spread out over a broader range of adsorption sites and total flight times – thus being inversely related to the peak width. In the field of single-molecule kinetics, it is not uncommon to describe dwell time distributions (Fig. S1) and similar datasets in terms of multi-component decay exponentials, which are attributed to the multiple heterogeneous forms of adsorption that can exist on a sample surface.16,18–20,30,32,43,44
The model peak asymmetry factor (As, Fig. 2c and S3, red) also follows a two-component decaying exponential behavior. Similar to peak broadening, this can be related back to the increasing number of sites leading to a lower degree of skewing due to non-mass transfer behavior (e.g. diffusion and convection) dominating the separation, with adsorption kinetics having a reduced effect. The total time spent in the column follows the relation
, which dictates that at low values of
m, the net elution time is dominated by time spent in the mobile phase.15 Thus, as it can also be gathered from careful inspection of the CF (eqn (6)), the initial degree of skewing is dominated by the frequency function of adsorption times at low number of average adsorptions.
m is fixed, we observe that the model does not output singular, complete elution peaks at low numbers of model points (Fig. S4 and Fig. 3) but then settles at a single elution time position at higher np values. By plotting the elution peak parameters (Fig. S4c and Fig. 3), it becomes evident that discontinuities can occur depending on the value of np, where the solution to the IFFT of the CF results in only non-real values. When simultaneously probing np and
m (Fig. 3) we observe that these discontinuities and inconsistencies occur throughout the whole variable space, until a threshold is passed, corresponding to approximately when np is at least 1.5 times greater than
m. The discountinuity at np < 1.5 ×
m likely directly results from the mathematical construction of the model (eqn (6)) and the numerical limitations of the IFFT calculation itself, where the frequency term (ω, in the Fourier domain) scales inversely to np (eqn (9)). Given a sufficiently small np (relative to
m), the value of the imaginary component of the IFFT (primarily ω dependent) will dominate over the influence of the purely
m scaling components. Thus, we find that an important requirement (np > 1.5 ×
m) exists in selecting a sufficiently high number of model points when applying this discrete form of the Lévy process representation of the stochastic model; a threshold and important caveat that to our knowledge was not discussed in the original report,15 nor previous applications of the model.8,16,18
m free parameter. By tuning the weighting scheme (Wi values in eqn (14)) for the error minimization, we can prioritize fitting (Fig. 4) by elution time (i.e. Wtime > Wwidth and >Wasym, Fig. S5–6) or fitting by peak shape (i.e. Wtime < Wwidth and <Wasym, Fig. S7). We find that the model very accurately matches the HPLC elution time when
m is at or near the values predicted by the theory (Fig. 4c, eqn (10)), corresponding to prioritizing elution time fitting, matching previous observations15,16 and demonstrating applicability across a range of different elution times with changing HPLC flow rates (Fig. S6). The agreement in elution time supports Pasti et al.'s original derivation and parameter definitions. Interestingly, we also observe that regardless of the chosen weighting scheme, the gap between model and HPLC narrows for both width (Fig. 4d) and elution time (Fig. 4c) as the flow velocity increases. However, both the elution peak shape and time cannot be fully matched simultaneously in this application of the model (Fig. S5D vs. Fig. S7D). We find that when the weights prioritize only peak shape (e.g. W(0,1,1)), the resulting fits show high agreement in peak width (Fig. 4d), but much less so for asymmetry (Fig. 4e), which never fully converges with the HPLC experiment. We suspect that the narrowing gap between model and HPLC with increasing flow velocity, and the inaccurate asymmetry point to the necessity to improve the model and SM experimental design as sources of peak broadening other than mass transfer (e.g. diffusion) are neglected in the derivation and formalism of the model, as well as in the SM experimental observations.
m is applied as a single average value in the discrete form of the CF; additional work could add a mathematically rigorous expansion to include a distribution of values that would be more representative of the on-column availability of adsorption sites.
SM experimental factors of sampling can affect agreement between simulated and HPLC elutions and can hint at sources of peak broadening in the on-column experiments. Higher SM flow velocity conditions (2.9 and 6.0 cm min−1) show distinctively higher degrees of elution peak asymmetry (Fig. S5–S8), which also affects trends in
m behavior (Fig. S8B). The significantly lower sampling of SM adsorption events (Fig. 5a) that occurs at the lower flow conditions (0.06 and 0.29 cm min−1) correspond to a lower likelihood of observing the rare long-lasting adsorption events (Fig. 1a and 5b) which disproportionately contribute to peak broadening and tailing,16 and generate a wider distribution of SM elution times with more significant tailing (Fig. 5b). Peak tailing can also be linked to the effects that long observation times can have by introducing irreversible adsorption into the measured dwell time distributions (causing increasingly exponential cumulative distributions).45 However, in our experiments we do not venture out to very long observation timescales (measurements are ∼1 minute per collection), and our distributions do not show any dwell times >5.5 seconds (Fig. S1A), suggesting little to no observed irreversible adsorption. Further, here, the cellulose-based stationary phase exists as matrix that has been modified with 3,5-dimethylphenyl carbamate groups at the hydroxy groups. With the choice of the small (∼1 nm46,47) cationic rhodamine 6G dye analyte, the predominant interactions should be van der Waals interactions between hydrophobic moieties of the dye and the stationary phase and pi–pi-interactions, which result in primarily short-lived adsorption.48–50 However, similarly to what has been observed in previous work44,51 exposed silanol groups could remain on the modified silica surfaces that make up the solid support of the stationary phase, leading to the observed rare relatively long-lasting adsorptions.
Other sources of discrepancy between HPLC and our model results could be the effects of diffusion phenomena based on the mobile phase polarity and stationary phase porosity. While our HPLC and SM experiments both use the same stationary phase materials and analytes, they were ultimately performed under different mobile phase conditions due to experimental constraints, which can lead to significant differences in elution behavior.19 Most notably, the addition of ethanol on columns that was necessary for any rhodamine analyte to reach the detector might lead to shorter overall adsorption times.52 Increasing the eluent strength (primarily from addition of more polar solvent modifiers) has been shown to decrease analyte-stationary phase interactions, thereby increasing the effective contribution of analyte-mobile phase interactions (by saturating the stationary phase with mobile phase) for cellulose-type stationary phases.53,54 The incorporation of the stronger eluent ethanol could result in increased peak broadening and asymmetric effects due to diffusion mechanisms, compared to the mass-transfer dominated SM measurements. As we have previously shown19 the high degree of surface functionalization present on the cellulose-B “fully porous” stationary phases can limit accessibility to the porous network. Confined pores can cause flow channeling (Fig. S1B), which leads to flow heterogeneity and consequently peak broadening in the HPLC elution. However, as we demonstrated,19 decreasing the degree of functionalization directly speeds up elution, despite the increased time spent diffusing within the porous stationary phases. This would mean that the longer retention on column can be directly attributed in large part to the high density of adsorptive sites causing more frequent adsorptions (i.e. higher average number of adsorptions, rm), matching the behavior presented by this model. Overall, differences between SM stochastic modeled elution and HPLC data can point to factors beyond simple adsorption–desorption mass transfer contributing to the elution.
Further consideration should be given to the diffusion mechanisms, driven by flow conditions, that affect on-column separations. In our microscopy flow cells, we are limited by pressure, which we estimate19 to be well under 1000 psi, with our shown flow rates (Table S2) corresponding to ∼678, 681, 714, and 750 psi (or ∼47 to 52 bar). Higher pressures in HPLC have been associated with improved separation factors, which is partly attributed to conformational changes in the case of proteins.55 However, pressure related effects become less pronounced for small molecules, and indeed the primary components of the dwell times distributions (Fig. S1A) in our system – which are the primary data used within the model – remain mostly unaffected by increasing pressure (i.e. flow rate). Another consideration is the corresponding shearing rates (Table S2) that occur due to flow near the walls, which have been shown56 to lead to increased ka values (i.e. shorter association times), which we also observe (Fig. S1C).
Overall, we believe that future work should aim to better bridge these gaps between HPLC and SM experimental conditions, as well expand the model to more rigorously include diffusive information. At present we are working to better replicate the high-pressure packed column environment of HPLC columns in our microscopy set-up. We are working towards imaging at >30
000 psi using capillary-based approaches developed in biophysics,57 trying to explore open questions about diffusive behaviors and the influence of back pressure. Future research should focus on quantifying eddy diffusion in packed beds and membranes while pushing for higher operating pressures. By simultaneously enhancing microscopic resolution and speed, we can better connect individual molecular behaviors to full-scale column performance. Simultaneously, the model should be expanded to present
m as a distribution of values (rather than an average) and expand the Brownian components of the Lévy process representation to better include diffusion information based on experimental conditions.
m (the average number of single molecule adsorptions) and np (the number of modeling points) in the stochastic model of chromatography to determine if it is possible to optimize and achieve agreement between SM modeled and HPLC chromatograms from measurements carried out on the same commercial stationary phase materials under varying flow velocity conditions. We also introduced a metric to quantify the agreement between the single-molecule and HPLC by taking the weighted difference in the elution time and shape of the chromatograms and explored the effects that experimental sampling can have on the model outputs. We showed how elution peak shapes vary depending on the average number of SM adsorptions (
m), as well as the number of modeling points (np). Our results reveal two-component exponential association curve behaviors occur in peak broadening (width), and two-component decaying exponentials in peak height and asymmetries, as well as under sampling that can occur due to the choice of np, setting a minimum threshold value of np > 1.5 ×
m. Using the error minimization approach we showed simulated elution times and peak shape can be independently matched to experimental HPLC data, which was applied to new experimentally measured SM adsorptions on commercial stationary phase particles and comparable HPLC on-column experiments.19 Sampling issues that can occur in experimental SM measurements were also explored. We conclude, below, with some suggestions for future implementation of the model.
From our observations, we summarize the following recommendations for other separation scientists interested in applying the stochastic model to single-molecule data in the future. (1) The experimental SM observation time scales should be significantly longer than the scale of the adsorption times, with a high degree of sampling of single-molecule events. For this we recommend tailoring both analyte concentrations, flow conditions, number of available adsorptions sites (e.g. number of stationary phase particles in the FOV or increased number of samples), and total observation time to increase the robustness and statistics of the SM datasets. Collecting data at or near conditions of the plateau of exponential association curve, as in Fig. 5a, are advised. (2) The model free parameter
m can be fit with weighted error towards either elution time or shape, but not both simultaneously. Given that elution time has a more direct theory-based connection with
m, it is the recommended fitting target. If the SM observations are sufficiently robust as noted in the first point, agreement with elution peak shape should improve. (3) The number of model points should be scaled with the average number of adsorption events, following the rule of np > 1.5 ×
m at a minimum. For our application, we have kept np > 2·
m as a minimum to avoid any Fourier space undersampling. Beyond direct application of the model, future attempts to connect SM experimental observables and HPLC experiments should continue to bridge the gap between their experimental conditions, such as by matching mobile phase conditions, including pressure and linear velocities, more closely. Finally, further development of the stochastic model can be implemented by including terms to help add diffusional information into the microscopic theory of elution,16 extending beyond the focus on mass transfer kinetics, as well as designing SM experiments to measure these diffusion parameters.
Code for analyzing the raw microscopy data and simulating chromatograms are available at https://github.com/KisleyLabAtCWRU/
Matlab code package found in: https://doi.org/10.5281/zenodo.20272588.
Supplementary information (SI): Fig. S1–S8. See DOI: https://doi.org/10.1039/d6an00199h.
Any additional data or analysis requests can be directed to lydia.kisley@case.edu.
| SM | Single molecule | τs | Time spent in the stationary phase |
| HPLC | High performance liquid chromatography | τm | Time spent in the mobile phase |
| tR | Elution time | ||
| CF | Characteristic function | t0 or tvoid | Detector void time for unretained solutes. Only applies for experimental HPLC data. |
| FFT | Fast Fourier Transform | Pmax | Elution peak height. |
| IFFT | Inverse Fast Fourier Transform | w0.1 | Peak width defined by tangential method at 10% peak maximum. |
m |
Average number of single-molecule adsorptions | As | Peak asymmetry, defined as the ratio of the right over left widths of the peak. |
| This journal is © The Royal Society of Chemistry 2026 |