Issue 71, 2024

A chemical reaction entity recognition method based on a natural language data augmentation strategy

Abstract

Impressive applications of artificial intelligence in the field of chemical reaction prediction heavily depend on abundant reliable datasets. The automated extraction of reaction procedures to build structured chemical databases is of growing importance. Here, we propose a novel model named DACRER for large-scale reaction extraction, in which transfer learning and a data augmentation strategy were employed. This model was evaluated for chemical datasets and shows good performance in identifying and processing chemical texts.

Graphical abstract: A chemical reaction entity recognition method based on a natural language data augmentation strategy

Supplementary files

Article information

Article type
Communication
Submitted
31 Mar 2024
Accepted
29 Jul 2024
First published
09 Aug 2024

Chem. Commun., 2024,60, 9610-9613

A chemical reaction entity recognition method based on a natural language data augmentation strategy

X. Zhang, Y. Li, C. Li, J. Zhu, Z. Gan, L. Wang, X. Sun and H. You, Chem. Commun., 2024, 60, 9610 DOI: 10.1039/D4CC01471E

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements