Issue 10, 2025

“Twisting” the data: a universal machine-learning approach to classify single-molecule curves and beyond

Abstract

We present a new automated supervised procedure trained to classify both conductance-voltage (G(V)) curves and conductance-distance (G(z)) traces generated in single-molecule junctions to a high degree of confidence. Compared to unsupervised methods, our approach, involving a convolutional neural network (CNN), is vastly superior as it allows core shapes to be recognised by ignoring differences in scale and is relatively insensitive to conductance jumps. A key aspect is the transformation of curves into a spiral image map, which allows us to separate various fundamental G(V) and G(z) shapes from datasets containing tens of thousands of curves. Moreover, by using transfer learning, training requires little input data compared to other approaches. This is extremely advantageous as it reduces training time by many orders of magnitude and means the model can be trained on user-selected shapes, including rare types. This contrasts with arbitrary class-assignment, instead basing classification on a sound physical understanding of the system. Furthermore, as there is no minimum class population requirement, our method can be used to find rare events with a high degree of confidence. As an example, we used our procedure to find, with a minimum 66% confidence level, a class of G(V) curves which are parabolic at low bias but flat at high bias. Such curves make up just 4% of the total, and would be very difficult to detect cleanly with unsupervised methods. This gives insights into the electron transport behaviour at high-bias because we can now easily quantify the types of curves present. Thanks to its universality, this opens up new possibilities in general signal processing and the identification of rare and important events.

Graphical abstract: “Twisting” the data: a universal machine-learning approach to classify single-molecule curves and beyond

Supplementary files

Article information

Article type
Paper
Submitted
19 May 2025
Accepted
18 Aug 2025
First published
15 Sep 2025
This article is Open Access
Creative Commons BY license

Digital Discovery, 2025,4, 3043-3052

“Twisting” the data: a universal machine-learning approach to classify single-molecule curves and beyond

C. Roldán-Piñero, M. T. González, P. M. Olmos, L. A. Zotti and E. Leary, Digital Discovery, 2025, 4, 3043 DOI: 10.1039/D5DD00207A

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements