Jump to main content
Jump to site search

Issue 3, 2016
Previous Article Next Article

Improving the prediction of organism-level toxicity through integration of chemical, protein target and cytotoxicity qHTS data

Author affiliations

Abstract

Prediction of compound toxicity is essential because covering the vast chemical space requiring safety assessment using traditional experimentally-based, resource-intensive techniques is impossible. However, such prediction is nontrivial due to the complex causal relationship between compound structure and in vivo harm. Protein target annotations and in vitro experimental outcomes encode relevant bioactivity information complementary to chemicals’ structures. This work tests the hypothesis that utilizing three complementary types of data will afford predictive models that outperform traditional models built using fewer data types. A tripartite, heterogeneous descriptor set for 367 compounds was comprised of (a) chemical descriptors, (b) protein target descriptors generated using an algorithm trained on 190 000 ligand–protein interactions from ChEMBL, and (c) descriptors derived from in vitro cell cytotoxicity dose–response data from a panel of human cell lines. 100 random forests classification models for predicting rat LD50 were built using every combination of descriptors. Successive integration of data types improved predictive performance; models built using the full dataset had an average external correct classification rate of 0.82, compared to 0.73–0.80 for models built using two data types and 0.67–0.78 for models built using one. Pairwise comparisons of models trained on the same data showed that including a third data domain on top of chemistry improved average correct classification rate by 1.4–2.4 points, with p-values <0.01. Additionally, the approach enhanced the models’ applicability domains and proved useful for generating novel mechanism hypotheses. The use of tripartite heterogeneous bioactivity datasets is a useful technique for improving toxicity prediction. Both protein target descriptors – which have the practical value of being derived in silico – and cytotoxicity descriptors derived from experiment are suitable contributors to such datasets.

Graphical abstract: Improving the prediction of organism-level toxicity through integration of chemical, protein target and cytotoxicity qHTS data

Back to tab navigation
Please wait while Download options loads

Supplementary files

Publication details

The article was received on 29 Oct 2015, accepted on 01 Mar 2016, published on 03 Mar 2016 and first published online on 03 Mar 2016


Article type: Paper
DOI: 10.1039/C5TX00406C
Citation: Toxicol. Res., 2016,5, 883-894
  • Open access: Creative Commons BY license
  •   Request permissions

    Improving the prediction of organism-level toxicity through integration of chemical, protein target and cytotoxicity qHTS data

    C. H. G. Allen, A. Koutsoukas, I. Cortés-Ciriano, D. S. Murrell, T. E. Malliavin, R. C. Glen and A. Bender, Toxicol. Res., 2016, 5, 883
    DOI: 10.1039/C5TX00406C

    This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. Material from this article can be used in other publications provided that the correct acknowledgement is given with the reproduced material.

    Reproduced material should be attributed as follows:

    • For reproduction of material from NJC:
      [Original citation] - Published by The Royal Society of Chemistry (RSC) on behalf of the Centre National de la Recherche Scientifique (CNRS) and the RSC.
    • For reproduction of material from PCCP:
      [Original citation] - Published by the PCCP Owner Societies.
    • For reproduction of material from PPS:
      [Original citation] - Published by The Royal Society of Chemistry (RSC) on behalf of the European Society for Photobiology, the European Photochemistry Association, and RSC.
    • For reproduction of material from all other RSC journals:
      [Original citation] - Published by The Royal Society of Chemistry.

    Information about reproducing material from RSC articles with different licences is available on our Permission Requests page.

Search articles by author