Issue 11, 2023

Adding open spectral data to MassBank and PubChem using open source tools to support non-targeted exposomics of mixtures

Abstract

The term “exposome” is defined as a comprehensive study of life-course environmental exposures and the associated biological responses. Humans are exposed to many different chemicals, which can pose a major threat to the well-being of humanity. Targeted or non-targeted mass spectrometry techniques are widely used to identify and characterize various environmental stressors when linking exposures to human health. However, identification remains challenging due to the huge chemical space applicable to exposomics, combined with the lack of sufficient relevant entries in spectral libraries. Addressing these challenges requires cheminformatics tools and database resources to share curated open spectral data on chemicals to improve the identification of chemicals in exposomics studies. This article describes efforts to contribute spectra relevant for exposomics to the open mass spectral library MassBank (https://www.massbank.eu) using various open source software efforts, including the R packages RMassBank and Shinyscreen. The experimental spectra were obtained from ten mixtures containing toxicologically relevant chemicals from the US Environmental Protection Agency (EPA) Non-Targeted Analysis Collaborative Trial (ENTACT). Following processing and curation, 5582 spectra from 783 of the 1268 ENTACT compounds were added to MassBank, and through this to other open spectral libraries (e.g., MoNA, GNPS) for community benefit. Additionally, an automated deposition and annotation workflow was developed with PubChem to enable the display of all MassBank mass spectra in PubChem, which is rerun with each MassBank release. The new spectral records have already been used in several studies to increase the confidence in identification in non-target small molecule identification workflows applied to environmental and exposomics research.

Graphical abstract: Adding open spectral data to MassBank and PubChem using open source tools to support non-targeted exposomics of mixtures

Supplementary files

Article information

Article type
Paper
Submitted
28 huhti 2023
Accepted
25 kesä 2023
First published
10 heinä 2023
This article is Open Access
Creative Commons BY license

Environ. Sci.: Processes Impacts, 2023,25, 1788-1801

Adding open spectral data to MassBank and PubChem using open source tools to support non-targeted exposomics of mixtures

A. Elapavalore, T. Kondić, R. R. Singh, B. A. Shoemaker, P. A. Thiessen, J. Zhang, E. E. Bolton and E. L. Schymanski, Environ. Sci.: Processes Impacts, 2023, 25, 1788 DOI: 10.1039/D3EM00181D

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements