Issue 13, 1969

Organisation of large collections of chemical structures for computer searching

Abstract

New techniques of partitioning large files of chemical structures are described which allow searches for whole structures to be concentrated on a small part of the file. Using these techniques, recognition of compounds new to the system may be accomplished without generating a unique representation of the chemical structure.

The chemical structure file is arranged into molecular formula groups and the larger groups are subdivided by comparing ordered lists of the atom-bond-atom pairs present in each compound. Where a finer division is required augmented versions of the atom-bond-atom pairs are used. Analysis of a number of molecular formula groups has shown that even the largest groups can be divided into small subgroups by the augmented pair technique. In the majority of cases the subgroups contain only one or two compounds.

Article information

Article type
Paper

J. Chem. Soc. C, 1969, 1732-1736

Organisation of large collections of chemical structures for computer searching

M. F. Lynch, J. Orton and W. G. Town, J. Chem. Soc. C, 1969, 1732 DOI: 10.1039/J39690001732

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Spotlight

Advertisements