Molecular similarity: a key technique in molecular informatics

Andreas Bender; Robert C. Glen

doi:10.1039/B409813G

Molecular similarity: a key technique in molecular informatics†

Andreas Bender^a and Robert C. Glen*^a

* Corresponding authors

^a Unilever Centre for Molecular Science Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, United Kingdom
E-mail: rcg28@cam.ac.uk
Tel: +44 (1223) 336 432

Abstract

Molecular Informatics utilises many ideas and concepts to find relationships between molecules. The concept of similarity, where molecules may be grouped according to their biological effects or physicochemical properties has found extensive use in drug discovery. Some areas of particular interest have been in lead discovery and compound optimisation. For example, in designing libraries of compounds for lead generation, one approach is to design sets of compounds ‘similar’ to known active compounds in the hope that alternative molecular structures are found that maintain the properties required while enhancing e.g. patentability, medicinal chemistry opportunities or even in achieving optimised pharmacokinetic profiles. Thus the practical importance of the concept of molecular similarity has grown dramatically in recent years. The predominant users are pharmaceutical companies, employing similarity methods in a wide range of applications e.g. virtual screening, estimation of absorption, distribution, metabolism, excretion and toxicity (ADME/Tox) and prediction of physicochemical properties (solubility, partitioning etc.). In this perspective, we discuss the representation of molecular structure (descriptors), methods of comparing structures and how these relate to measured properties. This leads to the concept of molecular similarity, its various definitions and uses and how these have evolved in recent years. Here, we wish to evaluate and in some cases challenge accepted views and uses of molecular similarity. Molecular similarity, as a paradigm, contains many implicit and explicit assumptions in particular with respect to the prediction of the binding and efficacy of molecules at biological receptors. The fundamental observation is that molecular similarity has a context which both defines and limits its use. The key issues of solvation effects, heterogeneity of binding sites and the fundamental problem of the form of similarity measure to use are addressed.

Article information

https://doi.org/10.1039/B409813G

Article type

Perspective

Submitted

28 Jun 2004

Accepted

09 Sep 2004

First published

14 Oct 2004

Download Citation

Org. Biomol. Chem., 2004,2, 3204-3218

Permissions

Request permissions

Molecular similarity: a key technique in molecular informatics

A. Bender and R. C. Glen, Org. Biomol. Chem., 2004, 2, 3204 DOI: 10.1039/B409813G

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Organic & Biomolecular Chemistry

Molecular similarity: a key technique in molecular informatics†

Abstract

Article information

Download Citation

Permissions

Molecular similarity: a key technique in molecular informatics

Social activity

Search articles by author

Spotlight

Advertisements