Classification of recycled plastics using sparse and imbalanced spectral data and data augmentation by the generative adversarial network

Abstract

Accurate identification of post-consumer plastics is essential to establishing high-performance recycling processes and to enabling a circular and sustainable economy and environment through effective recycling and remanufacturing. However, Fourier transform infrared (FTIR) spectra of recycled materials often exhibit noise, baseline shifts, and overlapping signatures from additives or contaminants, resulting in datasets that are both sparse and severely imbalanced. Together, this data complexity, sparsity, and class imbalance can degrade conventional machine-learning classifiers and raise the rate at which plastics are misclassified. To address these challenges, we investigated whether data augmentation using generative adversarial networks (GANs) could enhance polymer classification performance. We implemented a GAN framework that integrates adversarial training with a classifier-guided feedback loop to synthesize realistic, class-discriminative FTIR spectra for six commonly recycled polymers: polyethylene (PE), polypropylene (PP), polystyrene (PS), polycarbonate (PC), polyethylene terephthalate (PET), and acrylonitrile butadiene styrene (ABS). We then trained multilayer perceptron classifiers on datasets with varying ratios of synthetic data. The optimal balanced accuracy of 96.2% was achieved when synthetic spectra accounted for 50% of the training set, whereas including more than 90% synthetic data degraded generalization. Synthetic data augmentation using a GAN with the optimal augmentation ratio improved ABS classification accuracy, precision, and recall by 43%, 50%, and 33%, respectively, compared with no augmentation and replicate experimental measurements. These results demonstrate that GAN-based data augmentation can effectively mitigate data sparsity and class imbalance in spectral classification of common plastics, providing a practical foundation for creating robust online polymer classification systems.
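As a minimal illustration of two quantities named in the abstract, the sketch below computes balanced accuracy using its standard definition (the mean of per-class recall) and the number of synthetic spectra needed for synthetic data to make up a target fraction of a training set. The function names, the toy labels, and the sample counts are hypothetical and are not taken from the paper; the authors' actual pipeline may differ.

```python
from collections import defaultdict


def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall, so every class counts equally regardless of size."""
    total = defaultdict(int)
    correct = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    return sum(correct[c] / total[c] for c in total) / len(total)


def synthetic_count(n_real, synthetic_fraction):
    """Synthetic samples needed so they form `synthetic_fraction` of the training set.

    Solves n_syn / (n_real + n_syn) = synthetic_fraction for n_syn.
    """
    if not 0 <= synthetic_fraction < 1:
        raise ValueError("synthetic_fraction must be in [0, 1)")
    return round(n_real * synthetic_fraction / (1 - synthetic_fraction))


# Made-up, imbalanced toy labels: 8 PE spectra and 2 ABS spectra.
y_true = ["PE"] * 8 + ["ABS"] * 2
y_pred = ["PE"] * 8 + ["PE", "ABS"]  # one ABS spectrum is mislabeled as PE

plain_accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(plain_accuracy)                     # 0.9
print(balanced_accuracy(y_true, y_pred))  # 0.75
print(synthetic_count(120, 0.5))          # 120
```

In this toy case plain accuracy hides the weak minority-class recall, which is why balanced accuracy is a common choice for imbalanced datasets. A 50% synthetic share corresponds to one synthetic spectrum per real spectrum.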



Article information

Article type: Paper
Submitted: 29 Sep 2025
Accepted: 10 Mar 2026
First published: 17 Mar 2026
This article is Open Access
Creative Commons BY license

Analyst, 2026, Advance Article


X. Liu, X. Song, Y. Sulub, D. Zoller, Z. (James) Kong and B. N. Johnson, Analyst, 2026, Advance Article, DOI: 10.1039/D5AN01042J

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.
