Jump to main content
Jump to site search

Issue 22, 2004
Previous Article Next Article

Chemical documents: machine understanding and automated information extraction

Author affiliations

Abstract

Automatically extracting chemical information from documents is a challenging task, but an essential one for dealing with the vast quantity of data that is available. The task is least difficult for structured documents, such as chemistry department web pages or the output of computational chemistry programs, but requires increasingly sophisticated approaches for less structured documents, such as chemical papers. The identification of key units of information, such as chemical names, makes the extraction of useful information from unstructured documents possible.

Graphical abstract: Chemical documents: machine understanding and automated information extraction

Back to tab navigation

Publication details

The article was received on 21 Jul 2004, accepted on 08 Oct 2004 and first published on 20 Oct 2004


Article type: Paper
DOI: 10.1039/B411033A
Citation: Org. Biomol. Chem., 2004,2, 3294-3300
  •   Request permissions

    Chemical documents: machine understanding and automated information extraction

    J. A. Townsend, S. E. Adams, C. A. Waudby, V. K. de Souza, J. M. Goodman and P. Murray-Rust, Org. Biomol. Chem., 2004, 2, 3294
    DOI: 10.1039/B411033A

Search articles by author

Spotlight

Advertisements