Andrej
Grobin
,
Robert
Roškar
* and
Jurij
Trontelj
*
University of Ljubljana, Faculty of Pharmacy, Aškerčeva cesta 7, 1000 Ljubljana, Slovenia. E-mail: robert.roskar@ffa.uni-lj.si; jurij.trontelj@ffa.uni-lj.si; Fax: +386 1 4258 031; Tel: +386 1 4769 500
First published on 16th May 2023
Estrogenic endocrine disruptors are one of the biggest ecotoxicological threats in water that pose a significant ecological burden and health-risk for humans due to their high biological activity and proven additive effects. Therefore, we have developed and validated the most comprehensive and ultra-sensitive analytical method published to date, for reliable quantification of 25 high-risk endocrine disruptors at their ecologically relevant concentrations: naturally excreted hormones (estradiol, estrone, estriol, testosterone, corticosterone, and progesterone), synthetic hormones used for contraception and menopausal symptoms (ethinylestradiol, drospirenone, chlormadinone acetate, norgestrel, gestodene, tibolone, norethindrone, dienogest, and cyproterone) and bisphenols (BPS, BPA, BPF, BPE, BPAF, BPB, BPC, and BPZ). It is based on a solid-phase extraction of water samples, followed by a robust dansyl chloride derivatization with detection by liquid chromatography-tandem mass spectrometry with a single sample preparation and two analytical methods using the same analytical column and mobile phases. The achieved limits of quantitation are in the sub-ng L−1 range, and detection limits as low as 0.02 ng L−1, meeting the newest proposal for environmental quality standards (EQS) by the EU water framework directive for estradiol and ethinylestradiol. The method was extensively validated and applied to seven representative Slovenian water samples, where we detected 21 out of 25 analytes; 13 were quantified in at least one sample. Estrone and progesterone were quantified in all samples, reaching levels up to 50 ng L−1; ethinylestradiol was higher than the current EQS (0.035 ng L−1) in three samples, and estradiol was above its EQS (0.4 ng L−1) in one sample, proving the method's applicability and the necessity for monitoring these pollutants.
Most EDs are continually released into the environment, which is why they are “pseudo-persistent”4 even with relatively short environmental half-lives (i.e. <1 day for BPA) compared to other, more persistent pollutants with similar structures (i.e. polychlorinated biphenyls).5,6 The main sources of ED entry into the aquatic environment are via wastewater treatment plants, incorrect medicine disposal, agriculture, run-offs, veterinary use, hospitals, and industrial manufacturing sites.7–11
The chronic toxic effects of EDs are important not only in terms of decreasing populations of aquatic wildlife and their biodiversity, but they can also be detrimental to human health, especially considering that the estrogenic hormonal system in advanced aquatic organisms (i.e. fish) is similar to that in humans.12 These effects are mainly related to their ability to disrupt the homeostasis of the endocrine system which may manifest in serious conditions such as the development of cancer and disorders of reproductive, neuroendocrine, and immune systems.13–15
Non-target chronic effects on humans are associated with chronic exposure to contaminated water through ingestion of treated and untreated water, contaminated food, and dermal exposure.16,17 This poses a risk for negative effects on fertility rates and constitutes a serious modern public health issue.18,19 Therefore, EDs are often deemed priority substances and were present on monitoring watchlists in the EU (Vella, 2018). Estrone, estradiol and ethinylestradiol have been present on the watchlist for 4 years. The hormones mentioned are no longer on the latest watchlist, even though it is not clear whether sufficient data on their environmental presence was obtained.21
There are numerous studies related to the occurrence of hormones or bisphenols in surface waters, and a few with environmentally relevant limits of quantitation (LOQs). In these studies, different methods of sample preparation were utilized. Natural hormones were determined utilizing solid phase extraction (SPE),22,23 and also on-line SPE,24–26 which has become increasingly popular in recent years.27 The volumes of samples used in standard extraction were 200 mL or more, while in on-line SPE, only 10 mL of sample was used. The latter enables a high throughput as sample preparation times are significantly shorter, although the sensitivity is decreased.
Studies of synthetic hormones in surface waters are scarcer, and are usually performed with SPE extraction using volumes as large as 1 L.28,29 The analysis of synthetic hormones has proven to be quite challenging and the LOQ values obtained have only become environmentally relevant in recent years.30,31
Analytical methods related to the occurrence of bisphenols are more numerous (more than 20 studies), and were performed with SPE extraction using sample volumes up to 2 L10,32 and liquid–liquid extraction (LLE) with volumes up to only 20 mL.33,34 All the mentioned studies utilized LC-MS/MS for analysis.
EDs are highly effective even at very low environmental amounts, as their predicted no-effect concentration levels are as low as sub-ng L−1,35 with endocrine effects that are most likely additive.36–38 Therefore, analytical methods used for determining their presence in the environment must be suitably sensitive and suitable for studying a larger set of substances which cause similar effects in organisms. For the substances of interest, the number of studies found in the literature, their array of analyte selection and their measured concentrations are considerably lower, in comparison with other, more researched substances (i.e. antibiotics).39
Based on a detailed overview of analytical methods with the corresponding analytes in published literature, we have concluded that the aim of most analytical methods is not focused on the simultaneous detection of a larger scope of natural, synthetic hormones and bisphenols. A summary of methods, which encompass at least 3 analytes also included in our study is presented in Table S1† with their linearity ranges, LOQ values and their method of determination.40–54 A more comprehensive summary of methods where at least one bisphenol or hormone was analyzed is reported in our previous work.35
The LOQ values in the studies are environmentally relevant for some analytes. However, most of them were calculated from a simple signal to noise ratio of 10, at much higher than actual concentrations which were in most cases either not described, or higher than the obtained LOQ values by at least a few orders of magnitude. In some methods, the principle of LOQ value determination was not even described. The recommended method detection limits by the EU watchlist for hormones such as ethinylestradiol are very low (35 pg L−1) given their efficacy in sub-ng L−1 concentrations, and the use of SPE is recommended. To further improve the reliability of methods at such low concentrations, a more stringent and standardized methodology for LOQ determination is necessary. It seems that most studies are limited to just one group of analytes (or less than 10 substances from different groups) from the 2018 EU watchlist, out of which only 3 are hormones. Most commonly, estrogens and bisphenols are monitored, although never in a combination of more than 10 analytes.
The primary aim of this study was therefore to develop and validate a method suitable for the monitoring of 25 endocrine disruptors in surface waters by SPE-LC-MS/MS from the groups of natural, synthetic hormones, and bisphenols with suitable detection and quantitation (low-ng L−1) limits with regard to the simplicity of sample preparation. Therefore, a method with such a simple sample preparation and environmentally relevant LOQ values for as many as 25 analytes is a substantial improvement over the existing methodology, which is less sensitive and has not included a majority of the analytes in our method to date, even though they all elicit similar effects in organisms in sub-ng L−1 to ng L−1 levels. This goal was achieved for a wide range of analytes with the use of a mid-range mass analyzer and a simple derivatization for substances suitable for derivatization such as estrogens, which are present in the environment in low concentrations and have relatively low MS signal intensity. This was achieved by having two LC-MS/MS methods using the same analytical column and mobile phases with only one sample preparation. The method is validated according to stringent parameters and its quantification additionally confirmed with a standard addition principle. Furthermore, the method was applied to different surface water matrices to prove its suitability for the environmental monitoring of hormones and bisphenols. This further underlines the usability of the method for determining the risks attributed to an important range of EDs in the aquatic environment.
Bisphenols BPA, BPAF, BPC, BPS, BPZ and BPG, corticosterone, and isotopically labeled internal standards [13C6]-carbamazepine, [2H5]-diazepam and [2H16]-BPA were purchased from Sigma-Aldrich (Steinheim, Germany). Testosterone was purchased from Fluka (Seelze, Germany) and progesterone from Pion UK (Forest Row, United Kingdom). BPF, BPAP and BPE were obtained from TCI Europe (Zwijndrecht, Belgium), while BPB was purchased from Fluorochem (Glossop, United Kingdom). [2H4]-estrone, [2H5]-17β-estradiol, [2H4]-17α-ethinylestradiol, [2H4]-17β-estriol, [2H4]-testosterone, [2H9]-progesterone and [13C6]-diclofenac were purchased from Alsachim (Illkirch-Graffenstaden, France).
HPLC and LC-MS grade methanol, ammonium fluoride, hydrochloric acid, HPLC grade acetonitrile, sodium bicarbonate, dansyl chloride and acetone were obtained from Merck (Darmstadt, Germany). Ultra-pure (MQ) water was prepared by using a Milli-Q water purification system A10 Advantage (Millipore Corporation, Bedford, MA, USA).
An Agilent 6460 triple quadrupole (Agilent Technologies, Santa Clara, CA, USA) was used for MS/MS analysis. The instrument was operated with an Agilent jet stream electrospray ionization source (ESI) in mixed mode, with multiple-reaction monitoring (MRM). The optimized instrument conditions were as follows: gas temperature 300 °C, gas flow 5 L min−1, sheath gas temperature 375 °C and sheath gas flow 11 L min−1. All other MS/MS parameters and chromatographic conditions are listed in Sections 2.2.1 and 2.2.2 and ionization parameters in Table S2† (non-derivatized analytes) and Table S3† (derivatized analytes). For all analytes, the detector dwell time was 10 ms and the cell accelerator voltage was 4.
Due to a significant difference in the retention of non-derivatized and derivatized analytes, two chromatographic (and MS/MS methods) methods were used, both using the same analytical column and mobile phase, while also keeping most MS parameters identical, as listed in this section. No additional equilibration time, except for the run time of a blank sample, was required between switching the methods for the analysis of both non-derivatized and derivatized analytes.
The signal gain obtained by derivatization in comparison with that of non-derivatized samples was evaluated on ten non-zero concentration point calibration curves of derivatized and non-derivatized samples.
The combined solution of standards was used for spiking the samples in 200 mL amber glass volumetric flasks to obtain calibration standards as presented in Table S4.† After acidifying (as described in Section 2.3), each sample (for validation and real samples) was also spiked with 200 μL of a mixture of isotopically labeled internal standards.
Two quality control (QC) samples at low (QCL) and high concentrations (QCH) were prepared (in triplicate) in the same manner as described above: a separate combined stock standard solution was spiked into sample flasks to obtain the desired concentrations (Table S4†) which were afterwards spiked with 200 μL of the isotopically labeled internal standard stock solution.
For the determination of validation parameters where different concentrations were required (for LOQ samples and for the utilization of the standard addition principle), two different combined stock solutions were prepared for further spiking, depending on the concentrations needed for each substance in each assay.
The linearity was evaluated for each analyte using a combined stock solution, at seven non-zero concentration points, each prepared in duplicate (Table S4†). The correlation coefficient for determining linearity was calculated by using least-square linear regression and each calibration curve was also inspected visually. The determination coefficient acceptance criterion was set at R2 > 0.99.
LOQs were determined by evaluating several parameters determined at samples with the lowest possible concentration (at the LOQ value). The parameters were the signal to noise ratio (S/N) > 10 (calculated peak-to-peak based on area with the closest baseline region to the peak with a duration of at least 2 minutes), signal to average blank sample signal (S/B) > 5 (calculated peak-to-peak based on area), accuracy of 50–150% and relative standard deviation (RSD) of less than 50%, each performed on two parallel samples. LODs were calculated as LOQ/3.
The method precision was evaluated in terms of repeatability and reproducibility from measurements performed on the same day (intra-day precision) and three consecutive days (between-day precision) for QC samples (in triplicate) and at each calibrator and sample (in duplicate) on each day, calculated as the RSD. The acceptance criterion was RSD ≤ 20%.56 Injection repeatability was evaluated by re-injecting QC samples 10 times. The acceptable values were RSD ≤ 10%.
The method intra-day accuracy was calculated as the ratio of the calculated and theoretical concentrations of analytes at two QC levels and was denoted as a percentage (%). The acceptable values were set in the range of 70–130%.56
Recovery was determined by performing an extraction of a spiked MQ solution in triplicate and one calibration curve in MeOH with no extraction. It was calculated by using the following equation: (average slopes of spiked MQ solutions/slope of an equivalent calibrator curve) × 100%. All calibrator slopes used in determining the recovery were composed of seven non-zero concentration points. The recovery was denoted as a percentage (%) and was deemed acceptable for values 50–130%. These limits are preferred, although they can also be lower for specific analytes, if adequate repeatability is achieved.
ME for each respective matrix was determined by performing an extraction of each spiked matrix sample in duplicate and an extraction of a spiked MQ solution in triplicate. It was calculated by using the following equation: ((average slopes of spiked matrix samples/average slopes of equivalent spiked MQ solutions) − 1) × 100%.57 All calibration curves used in determining the ME were comprised of seven non-zero concentration points. The values were denoted as a percentage (%) and were expected to fall between ±30%, where 0% means no matrix effect, negative values represent signal suppression and positive values a signal enhancement in comparison with the spiked standard solution, without a matrix.
Detector specificity was determined by observing the ion-channel crosstalk (ICCT). The intensity of signals in samples of each individual analyte with a concentration corresponding to approximately the middle of its respective calibration curve in MRM channels not corresponding to the analyte was examined. The acceptable quotient of the obtained peak area (PA) in other MRM channels and the PA in the analyte MRM channel was ≤0.01% as this represents instrument noise and would prove an insignificant interference even at concentration differences of a factor higher than the method's complete concentration range.
The satisfactory sample stability in relation to method accuracy was deemed acceptable in the range of 70–130%. In the case of our analytes, their long-term stability is suitable, although it should always be determined.58,59 The requirements for autosampler stability are more stringent, depending on the analytes. It was evaluated on QC samples at 8 °C. Stability is expressed as the ratio of the PA corresponding to a certain time (24 or 48 h) and the PA of the sample at the initial time.
At each sampling event, one field blank was also prepared. For this, two of the same sampling bottles were used as for the collection of the real samples. 1 L of MQ water was recurrently transferred between bottles once at each sampling location, simulating the worst-case scenario for sample contamination.
Additionally, to further confirm the method performance, three additional surface water samples were selected for evaluating the method performance with the standard addition (SA) principle. A clean river sample (a few kilometers away from major cities and industries), a burdened river sample (along the banks of a river near a larger town) and a lake sample (a river-based lake in a small town) were collected. All three samples were collected in larger volume (2 L). Based on the measured concentrations of analytes, one separate combined stock solution of standards was prepared with approximately the average of 100, 250 and 500% of the determined concentrations in the real samples. This solution was then used for spiking in a three-point calibration for all three samples. Finally, the analyte's concentration was calculated back from the obtained SA calibration curve for each analyte and compared to the value calculated directly from the method calibration curves.
Several different mobile phases such as acetic and formic acid, ammonium acetate, ammonium carbonate, ammonium formate and ammonium fluoride were tested on the separated mixture for signal intensity. The signal intensity of ammonium fluoride was comparably better by at least one order of magnitude, as was also determined by some other researchers.60–62 A 0.2 mM concentration of ammonium fluoride proved optimal, as lower concentrations showed greater variations in retention times of analytes, and higher concentrations, decreasing signal intensity. In terms of the organic modifier, the use of methanol has enabled signals of a 2- to 10-fold higher intensity than acetonitrile and was therefore selected for further method development.
Since the analytes with the lowest signal intensity in our method were expectedly hydroxy substances (group on position 3) that ionize in negative mode (Table S2†), a derivatization was planned beforehand to obtain suitable sensitivity.63 As the mentioned phenolic hydroxy group is quite reactive, dansyl chloride was selected as the derivatization agent, because an established derivatization procedure which is described as simple, quick, and cheap64 was optimized for our purpose (Fig. S2†). Extensive variations of different derivatization parameters were evaluated on selected substances, such as different bases (NaOH and triethylamine), dansyl solvents (MeOH, acetonitrile, and dichloromethane), dansyl concentrations (0.5 to 5 mg mL−1) and different combinations of solvent volumes (from 10 to 1000 μL), temperatures (room temperature to 80 °C) and reaction times (1 minute to a few hours). After roughly estimating parameters based on the analytes' signal gain, a more fine-tuned optimization of different solvent volumes, reaction temperatures and reaction times was evaluated to further improve the signal gain. To improve the accuracy of the determination, this was performed with the comparison of slopes (Fig. S3†) and correlation coefficients of four non-zero concentration point calibration curves instead of just a single sample experiment. On variating the volume of reagents, the differences in reaction efficiencies were mostly negligible, while variations in temperature determined that 70 °C has proven beneficial compared to lower and higher temperatures (especially for E3 and E2). Reaction times shorter and longer than 5 minutes yielded marginally decreasing efficiencies, so a 5 minute reaction time was selected.
All analytes that ionize in negative mode were successfully derivatized (shown in Table S2†). The derivatized analytes obtained one (steroids) to two (bisphenols) dansyl fragments (the signals for mono-derivatized BPs were insignificant in a separate analysis of derivatization efficacy). A gradient elution program was developed for derivatized analytes in a 9.8 minute run time, which proved to be satisfactory in terms of selectivity (Table S3†). As all the water extracts were split into two parts (one for derivatization and one remaining) with the remaining part being chemically unchanged, it was deemed suitable to also analyze the non-derivatized portion of the sample alongside the derivatized one. This both eliminates any issues resulting from derivatization for non-derivatized analytes such as SHs (possibility of degradation due to the heating and alkaline conditions) and also enables the analysis of analytes by both methods to further confirm the accuracy of the method and to enable a much wider concentration range.
Furthermore, no interference from the non-derivatized analytes was observed in the method for derivatized analytes, as they are eluted from the column with a lower percentage of organic modifier than the starting conditions of the derivatized chromatographic method. No other coeluting peaks or other interferences (related to substances that can be derivatized in samples but were not analyzed in the method) were observed.
A signal gain obtained by derivatization compared to that of non-derivatized samples was determined for all the derivatized analytes, at least by a factor of 2 for BPS and to more than 450 for EE2 (Table 1). This is especially important for analytes where the lowest LOQ values are required because of their sub-ng L−1 effective concentrations in the environment (and consequently the requirements by the 2018 EU watchlist), as these values could not be obtained without derivatization.65 This is mainly the case for estrogens and EE2 where LOQ values without derivatization are ranging from 2.3 to 40 ng L−1 and are as low as 0.060 to 0.32 ng L−1 after derivatization. A graphical representation of derivatization signal gain on an example of E2 is also presented in Fig. S4.†
E3 | E2 | EE2 | E1 | BPS | BPF | BPE | BPA | BPAF | BPB | BPC | BPAP | BPZ | BPG | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Slope (pre-dansylation) | 251 | 258 | 139 | 2005 | 110![]() |
4125 | 844 | 14![]() |
10![]() |
766 | 4160 | 1826 | 7032 | 148 |
Slope (post-dansylation) | 12![]() |
34![]() |
63![]() |
45![]() |
266![]() |
381![]() |
38![]() |
349![]() |
91![]() |
144![]() |
247![]() |
20![]() |
86![]() |
3974 |
Signal gain factor | 49 | 133 | 453 | 23 | 2.4 | 93 | 46 | 23 | 8.4 | 189 | 60 | 11 | 12 | 27 |
The procedure was based on a method previously developed in-house for monitoring other pharmaceuticals in surface waters.55 Key parameters important for the sensitivity of the method were tested to assess the optimal loading parameters on spiked MQ samples (Table S7†). At volumes higher than 500 mL, decreasing recovery values for EE2, testosterone and progesterone were observed. Due to the increased loading time of a 500 mL sample and the fact that adequate LOQ values were obtained for most analytes even with 200 mL of sample, method development was continued with the latter volume. Flow rates up to 5 mL min−1 were satisfactory both in terms of recovery and precision and were deemed adequate for further method development, as the parameters were worse with higher flow rates for most analytes.
Ideally, only a small volume of sample would be loaded with a fast flow rate, achieving greater time efficiency, but would prove obtaining low-ng L−1 LOQ levels difficult. On the other hand, large loading volumes and fast loading rates can cause decreased (and inconsistent) recovery,66,67 as was also observed in our case.
The samples were not pre-filtered before being loaded onto cartridges, as this is not recommended due to the potential loss of analytes bound to particles and should be avoided.68 As most methods in the literature filter samples before loading, we deem this to be an important distinction of our method. All samples were loaded in full volume successfully even though partial clogging was observed, which additionally supports the selection of the 200 mL sample loading volume.
The visually inspected linearity of the calibration curves and the high determination coefficients (mostly above 0.999) showed a wide calibrator concentration range ratio for all the analytes (1000), except for BPC, where the range was only 200 (Table 2). Because analytes that have been derivatized (E1, E2, E3, EE2 and bisphenols) can also be quantified non-derivatized, their linearity ranges overlap. As the signal intensities of non-derivatized analytes are relatively low even in the highest points used in the calibration curve (Table S5†) it is possible to combine both calibrations to obtain an even wider monitoring concentration range, for monitoring samples with higher concentrations, such as wastewaters. This is a significant advantage of the method, as it enables even greater utility, if needed.
Compound name | Calibration range (ng L−1) | R 2 | LOD (ng L−1) | LOQ (ng L−1) | Accuracy (%) | Intra- and between-day precision (%) | Injection repeatability (%) | Recovery (%) | ME (%) | ||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
QCL | QCH | QCL | QCH | QCL | QCH | DW | RW | ||||||
a Notes: QCL is the low and QCH the high concentration QC sample. LOD. limit of detection; LOQ, limit of quantitation; ME, matrix effect; DW, drinking water; RW, river water; *, analytes denoted with -D are derivatized with dansyl chloride; /, the signal of the corresponding analyte is too low at the QCL level to be adequately assessed and is therefore relevant at this level only when derivatized. | |||||||||||||
BPS | 0.3–300 | 0.9998 | 0.08 | 0.24 | 109–112 | 96–97 | 1.0/7.7 | 1.2/5.8 | 1.4 | 1.0 | 99.7 | −9.4 | −17.8 |
BPF | 4.0–400 | 0.9995 | 1.3 | 4.0 | — | 97–100 | — | 1.8/5.4 | — | 1.4 | 100.8 | 9.4 | 17.4 |
Estriol | 2.3–75 | 0.9990 | 0.75 | 2.3 | — | 97–99 | — | 6.2/9.0 | — | 2.3 | 89.5 | 5.9 | 22.2 |
BPE | 1.5–300 | 0.9995 | 0.5 | 1.5 | 103–117 | 93–94 | 0.3/14.6 | 2.4/5.6 | 0.4 | 1.5 | 104.4 | 6.5 | 24.2 |
BPA | 0.8–300 | 1.0000 | 0.28 | 0.84 | — | 95–96 | — | 0.5/7.0 | — | 0.4 | 109.8 | 0.3 | 17.1 |
BPAF | 1.5–300 | 0.9981 | 0.5 | 1.5 | — | 90–99 | — | 1.0/6.6 | — | 0.7 | 112.9 | −5.8 | 20.9 |
BPB | 1.5–300 | 0.9998 | 0.5 | 1.5 | 108–115 | 97–98 | 8.9/10.5 | 3.0/4.8 | 7.2 | 2.6 | 118.0 | 8.3 | 16.5 |
BPC | 9.0–300 | 0.9989 | 3.0 | 9.0 | — | 88–94 | — | 3.2/6.8 | — | 2.9 | 111.3 | −12 | 19.6 |
Estradiol | 40–400 | 0.9994 | 13.3 | 40 | — | 94–97 | — | 1.5/5.2 | — | 1.3 | 111.2 | 8.6 | 19.5 |
Ethinylestradiol | 25–75 | 0.9992 | 8.8 | 26 | — | 88–95 | — | 5.2/18.5 | — | 2.3 | 88.2 | 5.2 | 10.2 |
BPAP | 0.4–300 | 0.9999 | 0.14 | 0.42 | 100–110 | 98–101 | 2.0/8.0 | 1.9/5.9 | 2.2 | 1.5 | 103.8 | −3.6 | 6.1 |
BPZ | 2.0–200 | 0.9999 | 0.67 | 2.0 | — | 101–102 | — | 1.5/7.5 | — | 1.5 | 100.0 | −5.5 | 0.6 |
Corticosterone | 0.9–850 | 0.9998 | 0.23 | 0.68 | 81–82 | 109–110 | 4.4/7.3 | 1.0/7.8 | 1.8 | 0.7 | 88.9 | 5.1 | −13.4 |
Dienogest | 0.4–400 | 1.0000 | 0.11 | 0.32 | 90–92 | 106–107 | 2.3/4.9 | 1.4/5.8 | 1.9 | 1.0 | 89.1 | −8 | −20.6 |
Estrone | 3.0–30 | 0.9999 | 1.0 | 3.0 | — | 98–102 | — | 3.2/5.6 | — | 2.5 | 98.8 | −1.3 | 21.0 |
Testosterone | 0.1–100 | 1.0000 | 0.046 | 0.14 | 96–97 | 101–102 | 2.1/6.6 | 1.3/2.8 | 1.4 | 1.1 | 99.2 | −3.3 | 1.7 |
Norethindrone | 0.8–750 | 0.9994 | 0.20 | 0.60 | 114–115 | 95–98 | 1.5/5.3 | 0.4/4.3 | 0.5 | 0.6 | 99.9 | −4.6 | 5.8 |
Tibolone | 2.0–700 | 0.9976 | 0.65 | 2.0 | 73–75 | 113–117 | 3.1/9.2 | 0.4/3.9 | 2.9 | 0.4 | 96.4 | 6 | 10.7 |
BPG | 6.0–600 | 0.9994 | 2.0 | 6.0 | — | 95–98 | — | 1.5/6.8 | — | 1.4 | 97.4 | −17.6 | 14.0 |
Gestodene | 0.55–550 | 0.9999 | 0.15 | 0.44 | 88–91 | 104–105 | 0.6/3.9 | 0.7/3.3 | 0.5 | 0.5 | 101.4 | −3.1 | 8.3 |
Cyproterone | 1.25–1250 | 0.9986 | 0.33 | 1.0 | 71–72 | 109–110 | 1.7/4.5 | 1.0/2.5 | 1.4 | 1.1 | 102.4 | 2.8 | 8.5 |
Norgestrel | 0.5–500 | 0.9998 | 0.13 | 0.4 | 106–109 | 98–100 | 1.3/3.2 | 1.1/3.0 | 0.8 | 0.8 | 104.9 | −5.6 | 13.0 |
Chlormadinone | 3.5–3500 | 0.9998 | 0.93 | 2.8 | 106–106 | 100–102 | 1.9/4.5 | 1.0/7.9 | 1.9 | 0.9 | 93.6 | −2.4 | 12.3 |
Drospirenone | 0.4–400 | 0.9998 | 0.11 | 0.32 | 106–106 | 99–102 | 2.1/4.1 | 1.6/7.3 | 2.0 | 1.3 | 100.4 | −7.6 | 6.5 |
Progesterone | 0.1–100 | 0.9997 | 0.027 | 0.080 | 102–108 | 97–102 | 3.8/5.4 | 2.3/6.1 | 1.0 | 2.0 | 94.6 | −7.9 | −0.6 |
Estriol-D* | 0.075–26 | 0.9999 | 0.02 | 0.06 | 92–99 | 101–103 | 4.9/8.0 | 3.5/4.0 | 2.4 | 0.8 | 100.3 | −5.3 | 0.5 |
Ethinylestradiol-D | 0.075–26 | 10![]() |
0.02 | 0.06 | 93–94 | 100–102 | 2.8/3.9 | 2.4/3.5 | 1.2 | 0.8 | 97.1 | −2.1 | 23.4 |
Estradiol-D | 0.4–140 | 0.9999 | 0.11 | 0.32 | 103–108 | 98–102 | 3.0/5.1 | 0.9/3.2 | 0.6 | 0.7 | 100.9 | −10.5 | −5.4 |
Estrone-D | 0.03–11 | 0.9999 | 0.028 | 0.084 | 86–91 | 100–103 | 1.6/4.2 | 3.2/4.8 | 1.2 | 1.0 | 94.6 | 6.2 | 13.7 |
BPS-D | 0.3–105 | 1.0000 | 0.08 | 0.24 | 99–103 | 102–103 | 1.0/5.5 | 1.3/5.9 | 1.3 | 0.5 | 102.0 | −1.8 | −12.1 |
BPAF-D | 0.3–105 | 0.9995 | 0.08 | 0.24 | 98–101 | 99–103 | 5.1/17.4 | 2.9/6.9 | 0.9 | 0.5 | 87.3 | 9.7 | −8.4 |
BPF-D | 0.4–140 | 10![]() |
0.11 | 0.32 | 97–107 | 100–104 | 2.3/7.7 | 3.1/5.2 | 1.4 | 0.7 | 88.2 | −8.9 | 2.5 |
BPE-D | 0.3–105 | 10![]() |
0.08 | 0.24 | 97–100 | 98–101 | 3.2/6.9 | 0.3/4.2 | 1.0 | 0.8 | 98.3 | −4.8 | 4.4 |
BPA-D | 0.3–105 | 10![]() |
0.08 | 0.24 | 93–117 | 98–99 | 0.5/15.6 | 0.9/5.4 | 0.3 | 0.7 | 97.7 | −4.0 | −1.8 |
BPB-D | 0.3–105 | 0.9999 | 0.08 | 0.24 | 99–100 | 99–100 | 2.1/6.5 | 2.3/4.8 | 0.8 | 0.6 | 87.6 | 0.0 | 1.8 |
BPC-D | 1.5–105 | 0.9997 | 0.5 | 1.5 | 87–105 | 96–98 | 8.5/18.4 | 1.2/7.5 | 2.6 | 1.1 | 73.0 | −10.1 | 20.2 |
BPG-D | 0.6–210 | 1.0000 | 0.16 | 0.48 | 91–108 | 100–102 | 6.1/16.0 | 1.3/17.3 | 2.9 | 1.1 | 80.5 | −17.1 | 22.7 |
BPAP-D | 0.3–105 | 0.9997 | 0.08 | 0.24 | 89–89 | 94–97 | 7.5/9.2 | 3.0/6.9 | 2.6 | 0.6 | 77.0 | 9.9 | −9.6 |
BPZ-D | 0.2–70 | 0.9999 | 0.053 | 0.16 | 90–90 | 93–97 | 7.8/9.1 | 1.5/4.2 | 2.1 | 1.0 | 79.6 | 1.7 | −2.8 |
Our assessment of acceptable LOQ values was based on S/N, S/B, accuracy, and precision at the representative concentration (Table S8†). The S/B and accuracy parameters are based on the EPA guideline, with additional stringent requirements based on existing analytical experience in related fields of environmental and bioanalytical studies. The representative chromatograms used in calculations of S/N values for each analyte are shown in Fig. S5† (non-derivatized) and Fig. S6† (derivatized). The LOQ values of most analytes are limited by the S/N value, while some analytes have shown some level of background contamination at relatively high S/N values. This was the case for BPA (S/B ratio over 5 and S/N ratio > 1800), and to a lesser extent also for some other bisphenols (BPS, BPZ and BPAF). Calculating LOQ values simply from S/N would therefore deem much lower values which proves that adding additional S/B requirements was needed for obtaining relevant LOQ values.
Some published methods report only estimates of theoretically achievable LOQ values, by calculation from the S/N ratios obtained at much higher concentration levels.10,70–72 This is not a good practice, as in this way, the performance of the method can be artificially improved but may not be representative of the true sensitivity.25 Therefore, in our study, the LOQ values were determined at the relevant concentrations (the samples with the same concentrations as the LOQ values). Furthermore, it is also important that the results are reliable in terms of the acceptable accuracy and repeatability proposed, as many measurements in the environment are close to these values.
The LOQ values for all analytes were in the sub-ng L−1 range (Table 2), as was the aim of the study (with the exception of tibolone, chlormadinone and BPC). The lowest values were obtained for E3 and EE2, even as low as 0.060 ng L−1. The acceptable method detection limit for EE2 is 0.035 ng L−1 in the 2018 EU watchlist (Vella, 2018) and is therefore easily achieved with our method (LOD = 0.020 ng L−1). Furthermore, the LOQ values in our method were not extrapolated but determined in actual samples with sub-ng L−1 concentrations, and were obtained with 200 mL of sample loaded, and considering the fact that method development has proven volumes up to 500 mL acceptable in terms of recoveries, the acceptable limit could easily be set even lower.
The developed method was accurate on both QC levels, as the results are well within the proposed levels. Most of the values fall in an even stricter range of 90–110% which further underlines the accuracy of the method (Table 2).
The results for intra- and between-day precisions are also within the proposed limits of RSD ≤ 20%, while values for injection repeatability were well within the limits with most values below 3%. Altogether, this verifies the precision of the method (Table 2).
SPE recoveries were all in the 50–130% range, with most values close to 100%.
All the ME values were in the ±30% range (Table 2). The MEs in drinking water (obtained from the tap in-house) and river water (obtained as a composite grab sample from the Drava River, located in a sparsely populated area about 5 km downstream from a major city) were low, mostly due to the inclusion of isotopically labeled internal standards in the method, especially in river water. The ME effect could also be tested by spiking the empty matrix samples after extraction, but we have chosen to spike before it as it is difficult to obtain an environmental matrix without any trace of analytes. We believe this is advantageous, as the ME encompasses the total effect of the matrix on the instrument signal (i.e., both the matrix effect on recovery and the ion suppression on the MS system). The comparison of matrix effects with and without the use of internal standards is presented in Table S9.†
All analytes were found to be stable over 24 and 48 hours in the autosampler at 8 °C where the loss of signal was <10% for most analytes, except for BPZ where the loss is 15%, and E1, with slightly over 20% signal loss at both QC levels (Table S10†). Both analytes have shown better stability in their derivatized form, so this was deemed acceptable, as they are primarily determined as derivatized. Sample stability is an important factor and was deemed acceptable for all analytes for at least 48 hours, especially as the combined analysis times for a larger number of samples can extend for longer periods of time. This was also valid for the derivatized analytes, as the derivates were suitably stable and did not convert back into their non-derivatized form.
Sample name | LK | R1 | R2 | R3 | R4 | ST | CH |
---|---|---|---|---|---|---|---|
Latitude | 46.440431 N | 46.459027 N | 46.516857 N | 46.527754 N | 46.543028 N | 46.465591 N | 46.461673 N |
Longitude | 15.680078 E | 15.752519 E | 15.699987 E | 15.712321 E | 15.703320 E | 15.657304 E | 15.670147 E |
a Notes: n.d., analyte not detected; <LOQ, measurement under the method quantitation limit;*, analytes denoted with -D are derivatized with dansyl chloride. | |||||||
Measured concentrations (ng L−1) | |||||||
BPS | n.d. | 3.7 ± 0.1 | 4.0 ± 0.1 | 4.0 ± 0.2 | 3.6 ± 0.2 | 28 ± 1 | 7.0 ± 0.1 |
BPF | n.d. | n.d. | n.d. | n.d. | n.d. | <LOQ | n.d. |
Estriol | n.d. | n.d. | n.d. | n.d. | n.d. | <LOQ | n.d. |
BPE | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. |
BPA | <LOQ | 4.9 ± 0.2 | 4.7 ± 0.4 | 3.0 ± 0.4 | 5.2 ± 1.2 | 47 ± 7 | 5.0 ± 0.6 |
BPAF | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. |
BPB | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. |
BPC | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. |
Estradiol | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. |
Ethinylestradiol | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. |
BPAP | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. |
BPZ | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. |
Corticosterone | n.d. | <LOQ | n.d. | <LOQ | <LOQ | n.d. | <LOQ |
Dienogest | <LOQ | n.d. | n.d. | n.d. | n.d. | 4.6 ± 0.2 | n.d. |
Estrone | 3.8 ± 0.4 | n.d. | n.d. | n.d. | n.d. | 46 ± 4 | 5.8 ± 0.4 |
Testosterone | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. |
Norethindrone | n.d. | <LOQ | n.d. | n.d. | <LOQ | n.d. | n.d. |
Tibolone | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. |
BPG | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. |
Gestodene | n.d. | n.d. | <LOQ | n.d. | <LOQ | n.d. | n.d. |
Cyproterone | <LOQ | <LOQ | <LOQ | <LOQ | <LOQ | n.d. | <LOQ |
Norgestrel | <LOQ | <LOQ | <LOQ | <LOQ | <LOQ | 12 ± 0 | 38 ± 3 |
Chlormadinone | <LOQ | 12 ± 0 | <LOQ | n.d. | <LOQ | n.d. | n.d. |
Drospirenone | n.d. | 9.0 ± 0.1 | n.d. | n.d. | <LOQ | n.d. | n.d. |
Progesterone | 2.8 ± 0.1 | 2.2 ± 0.0 | <LOQ | 2.2 ± 0.0 | 2.8 ± 0.1 | 6.0 ± 0.1 | 3.0 ± 0.1 |
Estriol-D* | 0.14 ± 0.05 | n.d. | n.d. | n.d. | n.d. | 1.8 ± 0.1 | 0.51 ± 0.02 |
Ethinylestradiol-D | n.d. | 0.06 ± 0.06 | n.d. | n.d. | <LOQ | 0.23 ± 0.06 | 0.23 ± 0.00 |
Estradiol-D | n.d. | <LOQ | n.d. | n.d. | n.d. | 2.1 ± 0.1 | <LOQ |
Estrone-D | 3.7 ± 0.1 | 0.49 ± 0.05 | 0.43 ± 0.05 | 0.58 ± 0.04 | 0.52 ± 0.03 | 50 ± 1 | 6.0 ± 0.1 |
BPS-D | <LOQ | 3.2 ± 0.2 | 4.1 ± 0.1 | 3.5 ± 0.1 | 3.8 ± 0.0 | 24 ± 2 | 7.1 ± 0.5 |
BPAF-D | n.d. | 0.41 ± 0.24 | <LOQ | <LOQ | n.d. | <LOQ | <LOQ |
BPF-D | <LOQ | <LOQ | <LOQ | <LOQ | <LOQ | 2.0 ± 0.2 | 0.33 ± 0.03 |
BPE-D | <LOQ | n.d. | n.d. | n.d. | n.d. | <LOQ | n.d. |
BPA-D | <LOQ | 5.1 ± 0.1 | 4.6 ± 0.3 | 2.7 ± 0.1 | 5.5 ± 0.9 | 46 ± 6 | 5.4 ± 1.6 |
BPB-D | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. |
BPC-D | <LOQ | <LOQ | <LOQ | <LOQ | <LOQ | <LOQ | <LOQ |
BPG-D | n.d. | <LOQ | <LOQ | n.d. | n.d. | n.d. | n.d. |
BPAP-D | <LOQ | n.d. | n.d. | n.d. | n.d. | <LOQ | n.d. |
BPZ-D | <LOQ | <LOQ | <LOQ | <LOQ | <LOQ | <LOQ | <LOQ |
The sensitivity of the method for derivatized analytes is higher. Therefore, with lower LOQ values, more analytes are present in samples in quantifiable concentrations, which further underlines the benefit of derivatization. Among natural hormones, E3, EE2, E2 and E1 are all quantifiable in at least one sample, where the concentrations are usually the highest by (at least) one order of magnitude in the stream sample, compared to in other samples (Fig. 3 for non-derivatized analytes and Fig. 4 for derivatized analytes).
However, it should be noted that it is impossible to evaluate the general state of any watershed included in the study from just one sample, as more sampling events would be needed for proper evaluation. The noticeably highest measured concentrations of estrogens for E1 and E3 are expected as they are both metabolites of E2 and are excreted by humans into the environment.76 Among bisphenols, alongside BPS and BPA which are present in measurable concentrations already in the non-derivatized samples, BPAF and BPF are also detectable in quantifiable concentrations with the derivatized method. The sensitivity of the derivatized method is especially important for E1, E2, and EE2, as their environmentally toxic concentrations are ranging from 3 to 74 pg L−1 and also for E3, BPA and BPS (0.47 to 1.0 ng L−1).35 This shows that all the determined concentrations in samples (except one for E3), are higher than their representative safe values. This further underlines the need for the accurate ultra-trace level determination the method facilitates, and consequently enables a much more accurate risk assessment of the watersheds, as even the sub-ng L−1 presence of these substances can now be adequately quantified.
As some analytes are quantifiable by both methods, the determined concentrations can be compared. Such an assessment is possible for BPA and BPS in all samples. All the determined values are comparable (all fall in the ±20% range, with most less than ±10%), highlighting that both calibrations are comparable and useable at the same time, although in different concentration ranges. Estrone, however, was determined in a quantifiable concentration with the method for non-derivatized analytes only in three samples, two slightly above the LOQ and the other with a much higher concentration (all comparable in the ±10% range), while being above the LOQ in all seven samples with the method for derivatized analytes. This further underlines the applicability of the method including derivatization, as it enables higher sensitivity for quantification in ultra-trace levels of analytes, without sacrificing the method performance for non-derivatized analytes.
We believe that it is crucial that the quantitation method performs accurately and precisely in the sub-ng L−1 to low-ng L−1 range since this is exactly the region where the safety limits are currently set for most high-priority EDs. This is also the concentration range that is frequently observed in environmental samples in our region. It is therefore essential that the LOQ values are set correctly and low enough for establishing any meaningful assessment of the toxicological burden imposed by the measured EDs. This is the case for many measured analytes in similar studies from similar watersheds as our study, such as the tributaries of the Danube and other rivers in central Europe. The reported concentrations for estrone, estradiol and estriol were ranging from below theoretically calculated LOD values (the lowest being 0.44 ng L−1),79,80 to below the LOQ (0.08 to 0.31 ng L−1) up to low-ng L−1 values.24,81,82 In the mentioned studies, ethinylestradiol was only found in concentrations below LOD or LOQ values (0.08 to 3 ng L−1).
This is the same order of magnitude or higher as its potentially toxic concentrations (from 3 pg L−1 to 0.47 ng L−1 for NHs).35 The existing methodology therefore does not enable accurate determination of such low concentrations and does not provide an accurate risk assessment of watersheds (concentrations under high LOD values are reported as zero and those under LOQ values usually as LOQ/2). On the other hand, our method provides clear improvement with reliable results even in such low ranges, further confirmed by a standard addition experiment, which can greatly improve the determination of the waters' burden with both NHs, SHs and BPS at the same time.
Our methodology is therefore useful for monitoring the quality of surface waters, according to the 2018 EU directive, which is important, as the data on the environmental presence of the analytes are still insufficient,20 and a contaminant candidate list from the US EPA (US EPA, 2018).83 As the method was also validated in a drinking water matrix with adequate sub-ng L−1 LOQ values, it is also suitable for the monitoring of drinking water pollutants such as estradiol and BPA.84,85 Furthermore, the method was already successfully used for the analysis of large sample batches from an international project in Central Europe, boDEREC-CE (data not published yet).
The method's suitability for routine application in ultra-trace level quantification was confirmed by analysis of seven representative samples, i.e. one lake, four river, one stream, and one channel sample where twenty-one analytes were detected, and thirteen out of those were quantified in at least one sample. The method's simple sample preparation combined with a wide linear range and robustly determined environmentally relevant LOQ values as low as 60 pg L−1 exceed regulatory requirements (i.e. for ethinylestradiol). Altogether, the method provides a reliable tool for the routine monitoring of 25 important endocrine disruptors in order to provide detailed information on their presence in the environment.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d3ay00602f |
This journal is © The Royal Society of Chemistry 2023 |