Decoding the Unseen: Unsupervised Anomaly Detection in Metal-Organic Frameworks for Discovery Beyond the Norm

Abstract

The discovery of chemically novel or structurally anomalous metal–organic frameworks (MOFs) is essential for expanding reticular design space and enhancing dataset reliability. We present \textbf{CHEM-AD} (Chemically Unusual Metal–organic Frameworks via Autoencoder-based Detection), a label-free, CPU-efficient pipeline that detects anomalous MOFs using 81 engineered descriptors (32 geometric/chemical/topological scalars plus a 49-dimensional metal-composition encoding). A compact symmetric autoencoder (\(\sim 1.8\times10^{5}\) trainable parameters) learns the latent distribution of typical MOFs and assigns anomaly scores based on reconstruction error. Applied to 26{,}025 entries from MOFxDB, CHEM-AD identifies 488 outliers (\(\sim\)1.87\%) featuring distinctive topologies, unusual pore metrics (PLD: 2.56–29.48~\AA; LCD: 4.89–63.59~\AA), and extreme densities (0.057–4.27~g\,cm\(^{-3}\)). These anomalies consistently occupy peripheral clusters in PCA embeddings and exhibit substantial Mahalanobis distances from normal MOFs, indicating multivariate deviation. Feature attribution reveals \textit{connectivity} (e.g., edge/node counts, degree dispersion) as the primary driver of anomalies, followed by window-limited geometry and linker–metal composition. We categorize results into three groups: (A) topologically unusual yet plausible candidates, (B) anomalies with chemically resolvable issues, and (C) likely structural artifacts. The full pipeline executes in under six minutes on standard CPUs and does not require 3D structure fitting or graph parsing. CHEM-AD generalizes to other porous materials, providing a scalable framework for discovery, database curation, and robust preprocessing in materials informatics.

Supplementary files

Transparent peer review

To support increased transparency, we offer authors the option to publish the peer review history alongside their article.

View this article’s peer review history

Article information

Article type
Edge Article
Submitted
22 Aug 2025
Accepted
20 Feb 2026
First published
24 Feb 2026
This article is Open Access

All publication charges for this article have been paid for by the Royal Society of Chemistry
Creative Commons BY license

Chem. Sci., 2026, Accepted Manuscript

Decoding the Unseen: Unsupervised Anomaly Detection in Metal-Organic Frameworks for Discovery Beyond the Norm

H. Alimardani, S. Abaei and M. Asgari, Chem. Sci., 2026, Accepted Manuscript , DOI: 10.1039/D5SC06431G

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements