Evaluating large language models for inverse semiconductor design

Abstract

Large Language Models (LLMs) with generative capabilities have garnered significant attention in various domains, including materials science. However, systematically evaluating their performance for structure generation tasks remains a major challenge. In this study, we fine-tune multiple LLMs on various density functional theory (DFT) datasets (including superconducting and semiconducting materials at different levels of DFT theory) and apply quantitative metrics to benchmark their effectiveness. Among the models evaluated, the Mistral 7 billion parameter model demonstrated excellent performance across several metrics. Leveraging this model, we generated candidate semiconductors and further screened them using a graph neural network property prediction model and validated them with DFT. Starting from nearly 100 000 generated candidates, we identified six semiconductor materials near the convex hull with DFT that were not present in any known datasets, one of which was found to be dynamically stable (Na3S2). This study demonstrates the effectiveness of a pipeline spanning fine-tuning, evaluation, generation, and validation for accelerating inverse design and discovery in computational materials science.

Graphical abstract: Evaluating large language models for inverse semiconductor design

Supplementary files

Article information

Article type
Paper
Submitted
08 Dec 2025
Accepted
20 Dec 2025
First published
16 Jan 2026
This article is Open Access
Creative Commons BY license

Digital Discovery, 2026, Advance Article

Evaluating large language models for inverse semiconductor design

M. N. T. Kilic, D. Wines, K. Choudhary, V. Gupta, Y. Li, S. Chakrabarty, W. Liao, A. Choudhary and A. Agrawal, Digital Discovery, 2026, Advance Article , DOI: 10.1039/D5DD00544B

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements