Chemical documents: machine understanding and automated information extraction

Joe A. Townsend; Sam E. Adams; Christopher A. Waudby; Vanessa K. de Souza; Jonathan M. Goodman; Peter Murray-Rust

doi:10.1039/B411033A

You do not have JavaScript enabled. Please enable JavaScript to access the full features of the site or access our non-JavaScript page.

Chemical documents: machine understanding and automated information extraction†

Joe A. Townsend,^a Sam E. Adams,^a Christopher A. Waudby,^a Vanessa K. de Souza,^a Jonathan M. Goodman*^a and Peter Murray-Rust^a

Author affiliations

* Corresponding authors

^a Unilever Centre for Molecular Science Informatics, Department of Chemistry, Lensfield Road, Cambridge, UK
E-mail: pm286@cam.ac.uk
Fax: +44 1223 763076
Tel: +44 1223 763069

Abstract

Automatically extracting chemical information from documents is a challenging task, but an essential one for dealing with the vast quantity of data that is available. The task is least difficult for structured documents, such as chemistry department web pages or the output of computational chemistry programs, but requires increasingly sophisticated approaches for less structured documents, such as chemical papers. The identification of key units of information, such as chemical names, makes the extraction of useful information from unstructured documents possible.

Download options Please wait...

Article information

DOI: https://doi.org/10.1039/B411033A
Article type: Paper
Submitted: 21 Jul 2004
Accepted: 08 Oct 2004
First published: 20 Oct 2004

Download Citation

Org. Biomol. Chem., 2004,2, 3294-3300

Permissions

Request permissions

Chemical documents: machine understanding and automated information extraction

J. A. Townsend, S. E. Adams, C. A. Waudby, V. K. de Souza, J. M. Goodman and P. Murray-Rust, Org. Biomol. Chem., 2004, 2, 3294 DOI: 10.1039/B411033A

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Social activity

Fetching data from CrossRef.
This may take some time to load.

Organic & Biomolecular Chemistry

Chemical documents: machine understanding and automated information extraction†

Abstract

Article information

Download Citation

Permissions

Chemical documents: machine understanding and automated information extraction

Social activity

Search articles by author

Spotlight

Advertisements