How to judge whether QSAR/read-across predictions can be trusted: a novel approach for establishing a model's applicability domain†
Abstract
The EU REACH legislation, the OECD and US EPA official guidance documents, and the 3Rs principle (replacement, reduction, refinement of animal testing) all advocate the development of comprehensive computational methods (e.g. quantitative structure–activity relationship, read-across) that would enable the predictive modeling of both chemical (e.g. nanoparticle) specific functionalities and their hazards. However, since computational (nano)toxicology continues to ‘learn on the fly’ and relies on a vast array of innovative machine-learning algorithms, serious concerns have been raised about the reliability of in silico predictions. This study aimed to answer the following question: how can one judge whether QSAR/read-across predictions are reliable? Here, an effective approach for the graphical assessment of the limits of a model's reliable predictions (the so-called applicability domain, AD) was introduced. The probability-oriented distance-based approach (ADProbDist) was proposed as a robust and automatic method for defining the interpolation space within which true and reliable predictions can be expected. Its usefulness was confirmed using four nano-QSAR/read-across models recently reported in the literature. The results showed that the ADProbDist approach is more restrictive, in terms of the chemical space that falls within a model's AD, than the range, geometrical, distance and leverage approaches. The advantages of the proposed ADProbDist approach include (but are not limited to) the fact that it works with relatively small datasets and enables the identification of (un)reliable predictions for newly screened chemicals without experimental data. Further, to facilitate the use of the ADProbDist approach, this study provides the in-house R codes developed for it.
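To make the general idea of a distance-based applicability domain concrete, the following is a minimal sketch of a generic distance-based AD check. It is an illustration only, not the paper's ADProbDist algorithm (whose probability weighting and thresholding are defined in the full text and accompanying R codes): a query compound is flagged as inside the AD when its mean distance to the training compounds, in descriptor space, does not exceed a threshold derived from the training set's own distance distribution. The function name, percentile cut-off, and Euclidean metric are all illustrative assumptions.

```python
import numpy as np

def in_applicability_domain(X_train, x_query, percentile=95.0):
    """Generic distance-based AD check (illustrative; not ADProbDist).

    X_train    : (n_compounds, n_descriptors) training descriptor matrix
    x_query    : (n_descriptors,) descriptors of the screened compound
    percentile : cut-off on the training set's mean-distance distribution
                 (95 is an arbitrary illustrative choice)
    """
    # Pairwise Euclidean distances within the training set
    diffs = X_train[:, None, :] - X_train[None, :, :]
    d_train = np.sqrt((diffs ** 2).sum(axis=-1))
    # Mean distance of each training compound to all the others
    mean_d = d_train.sum(axis=1) / (len(X_train) - 1)
    # Threshold: a high percentile of those in-domain mean distances
    threshold = np.percentile(mean_d, percentile)
    # Mean distance of the query compound to the training compounds
    d_query = np.sqrt(((X_train - x_query) ** 2).sum(axis=1)).mean()
    return d_query <= threshold

# Toy example: 30 "training compounds" described by 4 descriptors
rng = np.random.default_rng(0)
X = rng.normal(size=(30, 4))
print(in_applicability_domain(X, X.mean(axis=0)))    # a central query
print(in_applicability_domain(X, np.full(4, 10.0)))  # a distant outlier
```

In this sketch a query near the centre of the training cloud falls inside the AD, while a far-away outlier falls outside it; predictions for the latter would be treated as unreliable.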