Issue 2, 2026

Joint embedding predictive architecture for self-supervised pretraining on polymer molecular graphs

Abstract

Recent advances in machine learning (ML) have shown promise in accelerating the discovery of polymers with desired properties by aiding in tasks such as virtual screening via property prediction. However, progress in polymer ML is hampered by the scarcity of high-quality labeled datasets, which are necessary for training supervised ML models. In this work, we study the use of the very recent ‘Joint Embedding Predictive Architecture’ (JEPA), a type of architecture developed for self-supervised learning (SSL), on polymer molecular graphs to understand whether pretraining with the proposed SSL strategy improves downstream performance when labeled data is scarce. We first pretrain our polymer-JEPA model on a large dataset of conjugated copolymer photocatalysts. The pretrained model is then fine-tuned on two distinct downstream tasks: predicting electron affinity in the same chemical space and classifying phase behavior in diblock copolymers, a different chemical space. Our results indicate that JEPA-based self-supervised pretraining enhances downstream performance, particularly when labeled data is very scarce, achieving improvements across both tested datasets. The method provides performance gains in cross-domain fine-tuning, highlighting its potential to extract general knowledge across different classes of polymers. By leveraging large amounts of unlabeled polymer structures for pretraining, the proposed strategy can further reduce the dependence on extensive labeled datasets.

Graphical abstract: Joint embedding predictive architecture for self-supervised pretraining on polymer molecular graphs

Article information

Article type
Paper
Submitted
13 Jul 2025
Accepted
11 Dec 2025
First published
23 Jan 2026
This article is Open Access
Creative Commons BY license

Digital Discovery, 2026,5, 819-834

Joint embedding predictive architecture for self-supervised pretraining on polymer molecular graphs

F. Piccoli, G. Vogel and J. M. Weber, Digital Discovery, 2026, 5, 819 DOI: 10.1039/D5DD00308C

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements