Open Access Article
Fabrizio
Ruggieri
*a,
Milena
Casalena
a,
Mariagiovanna
Accili
a,
Elisa
Mattei
b,
Fabrizio
Stecca
b and
Mosè
Lamolinara
b
aDipartimento di Scienze Fisiche e Chimiche, Università degli Studi dell’Aquila, Via Vetoio, 67010 Coppito, L'Aquila, Italy. E-mail: fabrizio.ruggieri@univaq.it
bAgenzia Regionale per la Protezione dell' Ambiente (ARPA) Abruzzo, 67100 L'Aquila, Italy
First published on 13th October 2025
This study presents a robust, statistically validated analytical method for the quantification of C5–C10 volatile petroleum hydrocarbons (VPHs) in aqueous matrices using headspace gas chromatography with flame ionization detection (HS-GC-FID). A central composite face-centered (CCF) experimental design was employed to optimize critical extraction parameters, including sample volume, temperature, and equilibration time. The response variable, defined as the chromatographic peak area per microgram of analyte (Area per μg), was used to model the extraction efficiency. Analysis of variance (ANOVA) confirmed the global significance of the fitted model (R2 = 88.86%, RMSE = 4.997, p < 0.0001), with significant main, quadratic, and interaction effects. Sample volume showed the strongest negative impact, while temperature and interaction terms demonstrated synergistic behavior. The optimized conditions improved both sensitivity and reproducibility. The proposed method aligns with ISO 9377-2 principles and provides a reliable, environmentally relevant protocol for trace-level VPH monitoring in water samples.
Accurate and sensitive determination of VPHs in aqueous samples is a critical component of environmental surveillance and public health protection. In this context, the ISO 9377-2:2000 standard outlines a reference method for the determination of hydrocarbons in water using gas chromatography, with particular emphasis on the quantification of extractable and volatile fractions.7,8 This international guideline provides criteria for method validation, quality control, and analytical performance, emphasizing the need for reproducibility, recovery efficiency, and the ability to detect hydrocarbons in complex aqueous matrices.9–12 The method described in the present study aligns with the technical principles and objectives of ISO 9377-2, aiming to offer a statistically validated, reproducible alternative tailored to volatile hydrocarbons (C5–C10) that are typically not covered by standard extraction protocols.13 Regulatory bodies such as the U.S. Environmental Protection Agency (EPA) and the European Union have developed standardized protocols, including EPA Method 8015 and the European Water Framework Directive (2000/60/EC), to guide analytical practices.14,15 These frameworks mandate the use of reliable, reproducible, and sensitive methods capable of detecting VPHs at trace levels in complex aqueous matrices.16,17
Among the various analytical strategies available, headspace gas chromatography coupled with flame ionization detection (HS-GC-FID) remains one of the most effective and widely used techniques for the quantification of volatile hydrocarbons.18–20 HS-GC-FID offers distinct advantages in terms of sample cleanliness, ease of automation, and compatibility with volatile organic compounds. However, the accuracy and sensitivity of HS-GC-FID measurements are highly dependent on the optimization of experimental parameters such as sample volume, headspace equilibration temperature, and extraction time.21–24 These variables influence the distribution of analytes between the aqueous and vapor phases and can significantly affect signal response and reproducibility.25–27
To address this challenge, the current study employs a multivariate statistical approach based on design of experiments (DoE) to optimize HS-GC-FID extraction conditions for VPHs in water.28 Unlike the traditional one-variable-at-a-time (OVAT) methodology, which fails to account for synergistic effects and often leads to inefficient and incomplete optimization, DoE allows for the simultaneous assessment of multiple factors and their interactions. This enables the construction of predictive mathematical models and facilitates a more thorough understanding of the extraction dynamics. A central composite face-centered (CCF) design was selected for its efficiency and capability to model curvature and interaction effects.
The use of DoE not only enhances experimental efficiency by reducing the number of required runs but also strengthens statistical confidence in identifying significant variables.29 This approach is especially valuable in complex analytical systems, where multiple interdependent parameters govern method performance.30–33 Through this framework, the present study aims to derive scientifically validated optimal conditions and to highlight the utility of DoE as a powerful tool in analytical method development.34,35
The objective of this research is to develop a robust, reproducible, and environmentally relevant analytical protocol for the quantification of volatile petroleum hydrocarbons (VPHs, C5–C10) in water samples. Current international standards do not adequately cover this fraction: the ISO 9377-2 method is based on liquid–liquid extraction (LLE) and was designed mainly for heavier hydrocarbons, U.S. EPA Method 8015 relies on purge-and-trap strategies, and the European Water Framework Directive (2000/60/EC) establishes monitoring requirements without prescribing headspace protocols. In this context, our study addresses a methodological gap by developing and statistically optimizing a headspace-based HS-GC-FID procedure specifically for volatile hydrocarbons. The application of a CCF design allowed us to elucidate the influence of key extraction parameters, improve sensitivity and precision, and capture interaction effects not revealed by traditional OVAT approaches. The advantages of the proposed protocol include full automation, reduced solvent consumption, enhanced reproducibility, and compliance with international performance criteria, ultimately contributing to the development of reliable tools for regulatory monitoring and environmental risk assessment.Beyond controlled experiments on spiked ultrapure water, the optimized HS-GC-FID protocol was also applied to real groundwater samples, confirming its practical applicability for environmental monitoring and its alignment with international regulatory frameworks.
:
1 split ratio), with a fixed injection volume of 1.0 mL of vapor phase introduced into the GC inlet The GC oven program was 40 °C hold for 2 min and successively ramped linearly to 180 °C over 12 min with a final hold of 1 min, for a total run time of 13 min; injector temperature 250 °C; detector temperature 300 °C; carrier gas helium at 1.2 mL min−1. The GC oven, injector, and detector parameters were optimized to achieve sharp peak resolution and linear detector response across the tested concentration range. Routine maintenance and performance checks, including septum integrity, leak testing, and detector calibration, were conducted to ensure data quality and instrumental reliability throughout the experimental campaign.
The vials were immediately sealed with PTFE/silicone septa and aluminum crimp caps to prevent analyte loss. Stirring during equilibration was not included, as the Agilent 7697A autosampler does not allow vial agitation. Sample volume (V), incubation temperature (T), and equilibration time (t) were varied systematically according to the experimental design matrix. All samples were equilibrated in the headspace autosampler oven prior to injection into the GC system. Replicates were included at the center point of the design to assess repeatability and estimate pure error. Blanks and quality control samples were analyzed in parallel to monitor for contamination and assess method performance.
The linearity was assessed over a concentration range of 0.1–20 μg mL−1 using six replicate injections per level.
The precision of the method was evaluated through comprehensive intra-day and inter-day studies to assess method repeatability and reproducibility. Intra-day precision was assessed by performing six replicate injections at three concentration levels (0.5, 5, and 15 μg mL−1) within a single day, while inter-day precision was assessed by repeating the same procedure across three consecutive days.
The accuracy of method was determined through recovery experiments by spiking blank water samples with standard mixtures of C5–C10 hydrocarbons at three concentration levels: low (0.5 μg mL−1), medium (5 μg mL−1), and high (15 μg mL−1). Each level was analyzed in six replicates to ensure statistical reliability. The method sensitivity was rigorously assessed following ICH Q2(R1) guidelines by determining the limit of detection (LOD) and limit of quantification (LOQ) using the calibration curve approach. The standard deviation of the response (σ = 0.153), derived from multiple blank injections, and the slope of the calibration curve (S = 124.78) were used in the formulas LOD = 3.3σ/S and LOQ = 10σ/S. The robustness was evaluated by deliberately varying method parameters (±2 °C equilibration temperature and ±2 min extraction time).
Replicates were performed with variable frequency (two or three runs) to allow estimation of both intra-day and inter-day error, thereby improving the robustness of the statistical model. The inclusion of replicates allowed for the estimation of pure experimental error and improved the robustness of the model by enabling the detection of lack-of-fit. The response variable was defined as the total chromatographic peak area per microgram of analyte (Area per μg), obtained from GC-FID analysis. The response variable was defined as the sum of normalized chromatographic peak areas per microgram of analyte for the C5–C10 hydrocarbons, thus providing a global measure of extraction efficiency. Regarding calibration, petroleum hydrocarbon methods commonly adopt a range-based approach for gasoline range organics or total petroleum hydrocarbons, calibrating with a representative petroleum standard and integrating the total area over the specified carbon window rather than requiring compound-specific calibration for every constituent. This practice is documented for EPA and ISO 9377-2. This statistical structure is particularly suited for response surface methodology (RSM) and facilitates the construction of a second-order polynomial model that captures both curvature and interaction effects.
This normalized measure served as a quantitative indicator of the extraction efficiency under varying experimental conditions, accounting for differences in analyte concentration and ensuring consistency across the design space. The data were subjected to analysis of variance (ANOVA) to evaluate the statistical significance of main effects, interaction terms, and quadratic contributions. Model accuracy and predictive reliability were assessed using the root mean square error (RMSE) and the coefficient of determination (R2). This rigorous design and analysis strategy ensured a comprehensive understanding of the influence of experimental variables on the analytical response and facilitated the identification of robust and optimized extraction conditions.
| Run | Independent variable (coded form) | Indipendent variable (actual form) | Response (peak area per μg) | ||||
|---|---|---|---|---|---|---|---|
| X 1 | X 2 | X 3 | T (°C) | t (min) | V (mL) | ||
| 1 | −1 | −1 | −1 | 60 | 15 | 4 | 94.012 |
| 2* | −1 | −1 | −1 | 60 | 15 | 4 | 98.321 |
| 3 | −1 | −1 | 0 | 60 | 15 | 7 | 61.133 |
| 4 | −1 | −1 | 1 | 60 | 15 | 10 | 51.817 |
| 5* | −1 | −1 | 1 | 60 | 15 | 10 | 56.651 |
| 6* | −1 | −1 | 1 | 60 | 15 | 10 | 55.011 |
| 7 | −1 | 0 | −1 | 60 | 20 | 4 | 92.226 |
| 8 | −1 | 0 | 0 | 60 | 20 | 7 | 64.579 |
| 9* | −1 | 0 | 0 | 60 | 20 | 7 | 68.815 |
| 10* | −1 | 0 | 0 | 60 | 20 | 7 | 71.797 |
| 11 | −1 | 0 | 1 | 60 | 20 | 10 | 57.626 |
| 12 | −1 | 1 | −1 | 60 | 25 | 4 | 78.835 |
| 13* | −1 | 1 | −1 | 60 | 25 | 4 | 79.904 |
| 14 | −1 | 1 | 0 | 60 | 25 | 7 | 50.145 |
| 15 | −1 | 1 | 1 | 60 | 25 | 10 | 60.552 |
| 16* | −1 | 1 | 1 | 60 | 25 | 10 | 65.519 |
| 17* | −1 | 1 | 1 | 60 | 25 | 10 | 62.943 |
| 18 | 0 | −1 | −1 | 70 | 15 | 4 | 99.689 |
| 19 | 0 | −1 | 0 | 70 | 15 | 7 | 62.488 |
| 20 | 0 | −1 | 1 | 70 | 15 | 10 | 60.551 |
| 21 | 0 | 0 | −1 | 70 | 20 | 4 | 86.064 |
| 22* | 0 | 0 | −1 | 70 | 20 | 4 | 80.705 |
| 23* | 0 | 0 | −1 | 70 | 20 | 4 | 89.424 |
| 24 | 0 | 0 | 0 | 70 | 20 | 7 | 71.035 |
| 25 | 0 | 0 | 1 | 70 | 20 | 10 | 66.085 |
| 26* | 0 | 0 | 1 | 70 | 20 | 10 | 68.121 |
| 27* | 0 | 0 | 1 | 70 | 20 | 10 | 70.885 |
| 28 | 0 | 1 | −1 | 70 | 25 | 4 | 77.183 |
| 29 | 0 | 1 | 0 | 70 | 25 | 7 | 46.319 |
| 30 | 0 | 1 | 1 | 70 | 25 | 10 | 69.976 |
| 31 | 1 | −1 | −1 | 80 | 15 | 4 | 100.96 |
| 32* | 1 | −1 | −1 | 80 | 15 | 4 | 94.284 |
| 33 | 1 | −1 | 0 | 80 | 15 | 7 | 68.951 |
| 34 | 1 | −1 | 1 | 80 | 15 | 10 | 67.532 |
| 35 | 1 | 0 | −1 | 80 | 20 | 4 | 91.096 |
| 36 | 1 | 0 | 0 | 80 | 20 | 7 | 73.705 |
| 37* | 1 | 0 | 0 | 80 | 20 | 7 | 78.982 |
| 38* | 1 | 0 | 0 | 80 | 20 | 7 | 75.629 |
| 39 | 1 | 0 | 1 | 80 | 20 | 10 | 67.722 |
| 40 | 1 | 1 | −1 | 80 | 25 | 4 | 61.354 |
| 41* | 1 | 1 | −1 | 80 | 25 | 4 | 77.088 |
| 42* | 1 | 1 | −1 | 80 | 25 | 4 | 65.868 |
| 43 | 1 | 1 | 0 | 80 | 25 | 7 | 51.461 |
| 44 | 1 | 1 | 1 | 80 | 25 | 10 | 67.690 |
| 45* | 1 | 1 | 1 | 80 | 25 | 10 | 68.459 |
| 46* | 1 | 1 | 1 | 80 | 25 | 10 | 71.460 |
Analysis of variance (ANOVA) confirmed the model's global significance, with a high coefficient of determination (R2 = 89.1%) and adjusted R2 of 86.9%, indicating a reliable model fit to the data, as shown in Table 2. The root mean square error (RMSE) of 4972 supports the low dispersion of residuals. The inclusion of numerous replicates allowed accurate estimation of pure error and improved the model's diagnostic capability, particularly in detecting curvature and interactions.
| Parameters | Value ± SD | R 2 | Adj-R2 | Q 2 |
|---|---|---|---|---|
| Intercept | 67.9 ± 1.6 | |||
| X 2 | 2.0 ± 0.8 | |||
| X 2 | −5.5 ± 0.9 | |||
| X 3 | −9.3 ± 0.9 | |||
| X 1·X2 | −2.3 ± 1.1 | 0.891 | 0.868 | 0.811 |
| X 1·X3 | 3.4 ± 1.9 | |||
| X 2·X3 | 7.3 ± 1.1 | |||
| X 2 2 | −6.3 ± 1.6 | |||
| X 3 2 | 12.0 ± 1.7 |
| Variation source | Sum of squares | Degrees of freedom | Mean square | F-value | p-value |
|---|---|---|---|---|---|
| Lack of fit | 612.81 | 18 | 34.04 | 2.1423 | 00 541 |
| Pure error | 301.94 | 19 | 15.89 | ||
| Model | 7496.47 | 8 | 937.06 | 37.90 | <00 001 |
| Residual | 914.76 | 37 | 24.72 |
The regression coefficients revealed statistically significant effects for the main factors (temperature, extraction time, and sample volume), with the volume of the aqueous phase (X3) showing the strongest negative effect (−9.3, p < 0.0001).
Interaction terms (T·V, t·V) and quadratic effects (t2, V2) were also significant, confirming non-linearity and factor interplay in the extraction process. Calculated vs. actual plots demonstrated excellent correlation with no apparent systematic deviations as reported in Fig. 1.
The scatter plot demonstrates a strong linear relationship between the actual and calculated values of the extraction response (Area per μg), with minimal dispersion around the regression line. This high degree of alignment confirms the model's predictive capability and indicates that the majority of the experimental variability is well captured by the model.
Lowering the sample volume shifts this equilibrium toward the gas phase, thereby increasing the analyte concentration in the headspace and consequently the GC-FID signal. Moreover, reduced liquid volumes may also decrease matrix effects and lower the solvation energy barrier for analyte release, further enhancing extraction efficiency. These effects become especially relevant in the analysis of moderately volatile hydrocarbons, where partitioning dynamics are sensitive to even small variations in liquid-phase volume. Therefore, the systematic negative correlation between volume and response is both thermodynamically and kinetically justified.
From a practical perspective, the graphical shape of the response surfaces also provides visual confirmation of experimental robustness. The presence of wide and smooth maximum regions, especially at 20 minutes, suggests that the optimized conditions are not acutely sensitive to small deviations in volume or temperature. This characteristic is desirable in routine environmental analyses, where minor procedural fluctuations may occur. Moreover, the curvature and tilt of the surfaces offer mechanistic insight: sharp peaks or valleys would imply strong local sensitivity and potential instability, whereas gently sloping optima indicate more forgiving conditions. Thus, these surface plots are not only predictive but also diagnostically valuable, enabling the analyst to balance efficiency with operational resilience.
At short extraction time (Fig. 2), the response is predominantly governed by the sample volume effect, with temperature contributing marginally to the signal intensity. This behavior suggests that the system has not yet reached full thermodynamic equilibrium between the aqueous and headspace phases. Under these conditions, the volatilization of analytes is driven primarily by the headspace-to-liquid ratio, as governed by Henry's law, but the lower exposure time restricts the ability of thermal agitation to enhance mass transfer. As a result, temperature-induced increases in vapor pressure are not fully realized, and the thermal equilibrium necessary for optimal analyte desorption is likely incomplete.
Consequently, kinetic limitations dominate the extraction dynamics at short equilibration times, minimizing the observable influence of temperature.
In Fig. 3 response surface is reported with X2 fixed at an intermediate level; a synergistic interaction between temperature and volume becomes evident.
![]() | ||
| Fig. 3 Response surface plot showing the effect of sample volume and extraction temperature on HS-GC-FID response (Area per μg), with equilibration time fixed at 20 min. | ||
The response surface displays a well-defined optimal region, where moderate temperatures and low sample volumes (X3 < 0) lead to the highest extraction efficiency. This configuration enhances both the volatilization kinetics and the equilibrium partitioning of the analytes. At 20 minutes, the system likely approaches thermodynamic equilibrium, allowing the elevated temperature to significantly increase analyte vapor pressures and thus their transfer into the headspace. Simultaneously, the reduced liquid volume maintains a favorable headspace-to-liquid ratio, maximizing mass transfer. These concurrent effects confirm the joint role of temperature-induced vapor pressure increase and volume-controlled phase distribution, yielding a cooperative enhancement of the extraction yield.
At t = 25 min, as reported in Fig. 4, the overall GC-FID response decreases slightly and the response surface becomes notably flatter, especially at higher temperatures. This trend reflects a non-linear effect of extraction time, where prolonged equilibration does not further enhance analyte partitioning and may in fact lead to marginal losses in signal intensity. Such losses could arise from volatilization beyond the optimal headspace saturation point, condensation of higher boiling compounds on cooler vial walls, or sample leek. Under these conditions, the system is likely to have surpassed its thermodynamic optimum, where analyte partitioning into the vapor phase is no longer favored.
Prolonged equilibration may also introduce kinetic disturbances, potentially shifting or destabilizing the equilibrium previously attained, thereby compromising extraction efficiency and analytical reproducibility.
The observed flattening suggests that prolonged heating no longer contributes positively to analyte recovery and may introduce variance. The positive curvature associated with the X32 term supports this observation, revealing that very low sample volumes under these conditions may lead to reduced reproducibility, possibly due to increased susceptibility to vapor-phase variability or edge effects within the vial.
Taken together, these surfaces underscore the critical role of not only optimizing individual experimental variables but also understanding their synergistic and antagonistic interactions. The presence of significant interaction and quadratic terms in the fitted model highlights the complexity of the headspace extraction system, where temperature, time, and volume exert interdependent effects on analyte volatilization and partitioning dynamics. Among the conditions evaluated, the intermediate extraction time of 20 minutes provides the most favorable balance between efficient mass transfer and signal stability. This setting allows sufficient time for the system to approach equilibrium while minimizing potential losses due to overexposure or degradation, thus maximizing both analytical sensitivity and method robustness.
![]() | ||
| Fig. 5 HS-GC-FID chromatogram of a representative groundwater sample showing quantified hydrocarbons; other analytes were below the LOD. | ||
The other two samples displayed analogous patterns, with detectable hydrocarbons limited to a few analytes and at concentrations generally close to the low μg L−1 range.
These results confirm that the method is fully applicable to real environmental matrices and capable of detecting and quantifying volatile hydrocarbons even in complex aqueous samples without additional pre-treatment. The concentrations observed, particularly for n-pentane, reflect the high volatility and environmental mobility of the lighter hydrocarbons, which are among the most frequently encountered contaminants in groundwater impacted by fuel leaks and industrial emissions. Importantly, the method achieved quantification well within the limits established by international standards, with accuracy and reproducibility consistent with ISO 9377-2 requirements.
Overall, the successful application to real groundwater samples demonstrates that the optimized HS-GC-FID protocol is not only statistically robust under controlled experimental conditions but also reliable and effective in routine environmental monitoring scenarios.
The method's optimization resulted in improved extraction efficiency, reproducibility, and predictive reliability, as supported by strong statistical parameters (R2 = 88.86%, RMSE = 4.997, p < 0.0001). The validated response surface model accurately predicts analytical behavior within the tested domain and permits valuable guidance for method transferability and routine application. These findings underscore the importance of integrating experimental design strategies into analytical method development, enabling precise control of key variables and improving the robustness of determinations in environmental matrices.
This work advances the development of environmentally relevant analytical methods by introducing a model-driven and statistically validated framework for optimizing sample preparation in headspace gas chromatography. Such an approach enhances both reliability and sensitivity while aligning with regulatory and sustainability objectives through reduced solvent consumption and improved method standardization. The optimized HS-GC-FID protocol for C5–C10 hydrocarbons in water complies with international performance requirements (recoveries 92–105%, RSD < 6.5%, LOQ = 0.123 μg mL−1) and outperforms conventional HS-GC-FID in terms of sensitivity and reproducibility. Combined with its automation capability, minimal solvent demand, and demonstrated applicability to real samples, the method represents a robust and sustainable solution for routine environmental monitoring. While the present work was designed to optimize the overall extraction efficiency of the C5–C10 volatile fraction, future studies will be directed towards evaluating the behavior of individual hydrocarbons within this range. Applying DoE to each analyte separately may reveal compound-specific differences related to volatility or matrix interactions, thereby offering additional mechanistic insights. Such an approach could further strengthen the applicability of the method for regulatory monitoring, where both total fractions and individual components can be of environmental and toxicological relevance. The successful application of the protocol to real groundwater samples further confirms its reliability and suitability for routine environmental monitoring of volatile hydrocarbons.
| This journal is © The Royal Society of Chemistry 2025 |