Issue 47, 2022

Contrastive representation learning of inorganic materials to overcome lack of training datasets

Abstract

Data representation forms a feature space where forms data distribution that is one of the key factors determining the prediction accuracy of machine learning (ML). In particular, the data representation is crucial to handle small and biased training datasets, which is the main challenge of ML in chemical applications. In this paper, we propose a data-agnostic representation method that automatically and universally generates a vector-shaped and target-specified representation of crystal structures. By employing the new materials representation of the proposed method, the prediction capabilities of ML algorithms were highly improved on small training datasets and transfer learning tasks. Moreover, the prediction accuracies of ML algorithms were improved by 28.89–30.87% in extrapolation problems to predict the physical properties of the materials in unknown material groups. The source code of EMRL is publicly available at https://github.com/ngs00/emrl/tree/master/EMRL.

Graphical abstract: Contrastive representation learning of inorganic materials to overcome lack of training datasets

Supplementary files

Article information

Article type
Communication
Submitted
28 Mar 2022
Accepted
09 Apr 2022
First published
11 May 2022

Chem. Commun., 2022,58, 6729-6732

Contrastive representation learning of inorganic materials to overcome lack of training datasets

G. S. Na and H. W. Kim, Chem. Commun., 2022, 58, 6729 DOI: 10.1039/D2CC01764D

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements