Identification of bioprivileged molecules: expansion of a computational approach to broader molecular space

Lauren M. Lopez; Brent H. Shanks; Linda J. Broadbelt

doi:10.1039/D1ME00013F

Lauren M. Lopez,^a Brent H. Shanks^b and Linda J. Broadbelt*^c

Author affiliations

* Corresponding authors

^a Department of Materials Science and Engineering, Northwestern University, 2220 Campus Drive, Evanston, Illinois 60208, USA

^b Department of Chemical and Biological Engineering, Iowa State University, 1140L BRL, Ames, Iowa 50011, USA

^c Department of Chemical and Biological Engineering, Northwestern University, 2145 Sheridan Road, Evanston, Illinois 60208, USA
E-mail: broadbelt@northwestern.edu

Abstract

As interest in biobased chemicals grows and their application space expands, computational tools that navigate molecular space and complement experimental approaches become imperative. This work builds on a previous study that identified candidate bioprivileged molecules from the C₆H_xO_y (C6) subspace. It improves the previously developed framework so that molecules are screened more effectively according to their biological origin, and applies it to three new subspaces of chemical structure: C₄H_xO_y (C4), C₅H_xO_y (C5), and C₇H_xO_y (C7). For C5 and C7, roughly the top 100 bioprivileged candidates were identified, and the enhanced framework was applied to slightly revise the previous list of the top 100 C6 molecules. In addition, all top candidates were analyzed for their key functional moieties using a random forest model, and this algorithm was applied to compare the functional group space occupied by bioprivileged molecules with that of various molecular databases, with a focus on evaluating how closely the molecules align with those known to biology. Furthermore, because the present work emphasizes automation and data science principles, the framework can be readily expanded to other chemical formulae to screen for bioprivileged candidates. This in turn facilitates the retrosynthesis process inherent in the framework, which identifies the bioprivileged intermediates in other subspaces that lead to target molecules.
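The abstract does not describe how molecules are assigned to a subspace, so the following is only a minimal sketch of the idea of grouping candidates by molecular formula into CₙH_xO_y subspaces. The names `subspace_label` and `screen`, the regular expression, and the assumption that formulas are written in C, H, O order are all hypothetical and are not taken from the paper.

```python
import re
from typing import Dict, Iterable, List, Optional

# Hypothetical helper, not from the paper. Matches formulas written in
# C, H, O order that contain only carbon, hydrogen and (optionally) oxygen,
# e.g. "C6H12O6" or "C5H8". Formulas without hydrogen are not matched.
_CHO = re.compile(r"^C(\d*)H(\d*)(?:O(\d*))?$")


def subspace_label(formula: str) -> Optional[str]:
    """Return 'C<n>' for a CnHxOy formula, or None for anything else."""
    match = _CHO.match(formula)
    if match is None:
        return None
    n_carbon = int(match.group(1) or 1)  # a bare "C" means one carbon
    return f"C{n_carbon}"


def screen(formulas: Iterable[str],
           carbon_counts: Iterable[int] = (4, 5, 6, 7)) -> Dict[str, List[str]]:
    """Group formulas by subspace, keeping only the requested carbon counts."""
    wanted = {f"C{n}" for n in carbon_counts}
    groups: Dict[str, List[str]] = {label: [] for label in sorted(wanted)}
    for formula in formulas:
        label = subspace_label(formula)
        if label in wanted:
            groups[label].append(formula)
    return groups


if __name__ == "__main__":
    print(screen(["C6H12O6", "C5H4O2", "C2H6O", "C6H5Cl"]))
```

Under these assumptions, extending the screen to another subspace only requires passing a different `carbon_counts` tuple, which mirrors the extensibility the abstract describes without claiming to reproduce the authors' implementation.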

This article is part of the themed collection: 2021 MSDE Symposium Collection

Article information

DOI: https://doi.org/10.1039/D1ME00013F
Article type: Paper
Submitted: 22 Feb 2021
Accepted: 23 Apr 2021
First published: 24 Apr 2021

Mol. Syst. Des. Eng., 2021, 6, 445-460

L. M. Lopez, B. H. Shanks and L. J. Broadbelt, Mol. Syst. Des. Eng., 2021, 6, 445 DOI: 10.1039/D1ME00013F

Molecular Systems Design & Engineering
