Issue 11, 2022

Prediction of the taxonomical classification of the Ranunculaceae family using a machine learning method

Abstract

Ranunculaceae is a botanical source for various pharmaceutically active compounds, which has been commonly utilized in traditional Chinese medicine. Increasing interest in Ranunculaceae pharmaceutical resources has led to a taxonomical study of this family, which might provide new insight to understand its diversification, relationship and phylogenetic position, and further to find new medicinal resources and promising compounds. In this study, we used the machine learning method to explore the classification of the medicinal Ranunculaceae family. 204 species representing 17 genera of the Ranunculaceae family were collected from the TCMID with their 1280 active compounds composed of structure-based fingerprints. After the construction of species-compound and genus-compound matrices, CNNs and Ext fingerprints were determined as the best machine learning method and fingerprint type using ACC and F-score as clustering criteria, respectively. We found that taxonomical classification within the Ranunculaceae family could be accurately predicted, especially at the genus level with a top ACC of 0.86 and an F-score of 0.85. The top features of compounds that were important for the classification of 17 genera were also identified, and thus some genera with high medicinal values were associated with characteristic cis and (or) trans features. As far as we know, this is the first time that some genera are found to be associated with the structural features of compounds.

Graphical abstract: Prediction of the taxonomical classification of the Ranunculaceae family using a machine learning method

Supplementary files

Article information

Article type
Paper
Submitted
30 Jul 2021
Accepted
04 Feb 2022
First published
24 Feb 2022

New J. Chem., 2022,46, 5150-5161

Prediction of the taxonomical classification of the Ranunculaceae family using a machine learning method

J. Chen, W. Yang, G. Tan, C. Tian, H. Wang, J. Zhou and H. Liao, New J. Chem., 2022, 46, 5150 DOI: 10.1039/D1NJ03632G

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements