From one building to many: transferability of a deep reinforcement learning agent for optimizing pollutant exposure and energy consumption

Nishchaya Kumar Mishra; Sameer Patel

doi:10.1039/D5VA00438A

From one building to many: transferability of a deep reinforcement learning agent for optimizing pollutant exposure and energy consumption

Nishchaya Kumar Mishra

^a and Sameer Patel

*^abc

Author affiliations

* Corresponding authors

^a Department of Civil Engineering, India
E-mail: sameer.patel@iitgn.ac.in

^b Department of Chemical Engineering, India

^c Kiran C. Patel Centre of Sustainable Development, Indian Institute of Technology Gandhinagar, Palaj, Gandhinagar, Gujarat 382355, India

Abstract

Minimizing indoor pollutant exposure while conserving energy is essential for protecting human health and the environment. Deep reinforcement learning (DRL) has emerged as a promising approach for optimizing residential ventilation and air conditioning systems. While DRL deployment is simpler than fully physics-driven strategies like dynamic optimization (DynOpt), its generalizability across diverse buildings and ambient conditions remains challenging. Although researchers have studied transfer and imitation learning techniques to address these challenges, they still require house characteristics and field measurements to adaptively train an agent. Therefore, the large-scale deployment of DRL agents can still be potentially challenging. This study assesses the performance of a trained DRL agent against the DynOpt (benchmark) when transferred to houses with varying characteristics and environmental conditions using digital twins. When varying house characteristics one at a time, the agent's performance remained comparable to DynOpt, with particulate matter (PM) exposure and energy ratios near unity (1.05 ± 0.03). Similarly, under simultaneous variations in house characteristics, the exposure (1.03 ± 0.07) and energy (1.09 ± 0.06) ratios remained close to one. However, the agent's performance declines in houses with high PM infiltration under high ambient parameters. The results indicate that the agent can still be integrated into different houses under varying ambient conditions by restricting the infiltration of PM, as evident by lower exposure and energy ratios in houses with lower infiltration. Moving forward, uncertainty quantification and benchmarking of the agent's performance are critical for enhancing confidence in predictions.

This article is part of the themed collection: HOT articles from Environmental Science: Advances

Supplementary files

Article information

DOI: https://doi.org/10.1039/D5VA00438A
Article type: Paper
Submitted: 27 Nov 2025
Accepted: 13 Mar 2026
First published: 08 Apr 2026
This article is Open Access

Download Citation

Environ. Sci.: Adv., 2026,5, 1292-1305

Permissions

Request permissions

From one building to many: transferability of a deep reinforcement learning agent for optimizing pollutant exposure and energy consumption

N. K. Mishra and S. Patel, Environ. Sci.: Adv., 2026, 5, 1292 DOI: 10.1039/D5VA00438A

This article is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported Licence. You can use material from this article in other publications, without requesting further permission from the RSC, provided that the correct acknowledgement is given and it is not used for commercial purposes.

To request permission to reproduce material from this article in a commercial publication, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party commercial publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Environmental Science: Advances

From one building to many: transferability of a deep reinforcement learning agent for optimizing pollutant exposure and energy consumption

Abstract

Supplementary files

Article information

Download Citation

Permissions

From one building to many: transferability of a deep reinforcement learning agent for optimizing pollutant exposure and energy consumption

Social activity

Search articles by author

Spotlight

Advertisements