Open Access Article
Indranil Nandi
* and
Dilip Ganguly
Centre for Atmospheric Sciences, Indian Institute of Technology Delhi, India. E-mail: indranilnandi2011@gmail.com; Indranil.Nandi@cas.iitd.ac.in
First published on 19th March 2026
Air pollution in India remains a severe environmental and public health challenge, intensified by meteorological conditions, particularly during winter and the post-monsoon season. This study conducts a systematic, multi-component sensitivity evaluation of the Weather Research and Forecasting model coupled with Chemistry (WRF-Chem) to identify an optimal configuration for simulating air quality over India. We assess the influence of meteorological initial and boundary conditions, physical and chemical parameterization schemes, emission inventories, and grid-nudging strengths through simulations for two contrasting periods, the monsoon season (August 2018) and winter (January 2019). Model performance is strongly configuration dependent, with ECMWF Reanalysis 5th Generation (ERA5) meteorology substantially improving the representation of surface temperature, relative humidity, wind speed, and vertical thermodynamic structure. Incorporating moderate grid nudging (0.0005 s−1) reduced relative humidity biases by 13% during January 2019 and improved temperature inversion strength and planetary boundary layer (PBL) representation. The Yonsei University PBL (YSU) and Lin microphysics schemes produced the most realistic PBL heights, rainfall fields, and wet-deposition-driven pollutant removal. For atmospheric composition, the Model for Ozone And Related chemical Tracers coupled with Model for Simulating Aerosol Interactions and Chemistry (MOZART-MOSAIC) mechanism, combined with the high-resolution Speciated Multi-pollutant Generator – Coordinated Aerosol and Pollution Source Emissions for Chemical Evaluation (SMoG-COALESCE) emission inventory, yielded the most accurate simulations of PM2.5, SO2, O3, and CO across urban and rural regions. The integrated optimal configuration improved model skill relative to Central Pollution Control Board (CPCB) surface measurements, reanalysis products, and satellite datasets. These results demonstrate the need for coordinated evaluation of meteorological and chemical components in WRF-Chem and provide a robust modelling framework for improving air-quality prediction and supporting policy-relevant assessments of pollution mitigation strategies over India.
Environmental significanceAir-quality models are essential for understanding pollution and guiding clean-air policies in India, but their accuracy depends strongly on model configuration. This study reduces uncertainty in air-quality simulations by systematically testing different WRF-Chem settings, including weather inputs, atmospheric processes, chemical mechanisms, emissions, and meteorology data-nudging, across monsoon and winter seasons. Reliable modelling is crucial because air pollution in India causes serious health and environmental impacts. Our findings show that jointly improving meteorological and chemical components significantly enhances model performance. The optimized framework enables more accurate air-quality forecasting, pollution assessment, and evaluation of mitigation strategies, supporting effective environmental planning and public-health protection across India. |
The complexity of India's air pollution arises from the confluence of diverse emission sources and highly heterogeneous meteorological conditions. Numerical weather prediction (NWP) models, particularly the Weather Research and Forecasting model coupled with Chemistry (WRF-Chem), play a central role in simulating these coupled processes.7 WRF-Chem resolves emissions, transport, chemical transformation, deposition, and meteorology within a unified modelling framework, enabling process-level understanding of pollutant behaviour. However, the accuracy of WRF-Chem simulations is highly sensitive to choices in meteorological initial and boundary conditions, physical and chemical parameterization schemes, and emission inventories.8 These sensitivities are especially consequential in India, where monsoonal circulations, winter stagnation, pre-monsoon dust activity, and steep spatial gradients in terrain, from coastal regions to the Himalayas, create complex meteorological regimes that strongly modulate pollutant dispersion and transformation.9
Accurate meteorological simulation is particularly vital in India, where extreme events such as heatwaves, lightning, cold surges, and heavy precipitation are key drivers of socioeconomic impacts, with 25% of accidental deaths between 2001 and 2014 linked to such events.5,10 WRF-Chem applications in the region have expanded rapidly, driven by its ability to represent monsoon dynamics, moisture fluxes, and convective processes that shape rainfall patterns and pollutant distribution.11,12 The selection of planetary boundary layer (PBL) schemes, microphysics schemes, and radiation parameterizations critically influences simulations of temperature, wind speed, turbulence, and pollutant concentrations.13 PBL schemes are especially important for resolving diurnal mixing and vertical transport of pollutants, with substantial impacts on surface ozone, PM, and other regulated species.2,14–16
Despite major advances, WRF-Chem performance over India continues to exhibit substantial uncertainties, driven by the region's meteorological diversity, complex chemistry, and emission heterogeneity.17 Sensitivity studies have shown that improved model accuracy requires careful combination of parameterization schemes, high-quality meteorological inputs, and region-specific emissions.18 Data assimilation approaches such as Four-Dimensional Data Assimilation (FDDA) or grid nudging can enhance large-scale meteorological realism by constraining model fields toward observations, thereby improving the representation of temperature gradients, wind profiles, and pollutant transport.19 Prior studies have also demonstrated that microphysics choices significantly influence rainfall, wet deposition, and cloud–aerosol interactions,20,21 while aerosol schemes such as MADE-SORGAM (Modal Aerosol Dynamics Model for Europe – Secondary Organic Aerosol Model)22 and MOSAIC (Model for Simulating Aerosol Interactions and Chemistry)23 can lead to divergent predictions of PM2.5.24 Boundary layer and land surface schemes similarly modulate surface fluxes and turbulence, affecting pollutant mixing depth and vertical structure.25
Although numerous studies have examined individual components of WRF-Chem have over India, such as boundary layer schemes, chemical mechanisms, or emission inventories, few have performed a comprehensive, multi-component sensitivity assessment spanning contrasting meteorological regimes. This gap limits our understanding of how different modelling choices interact to influence pollutant simulations across India's diverse atmospheric environment. To address this need, the present study systematically evaluates WRF-Chem sensitivity to meteorological and chemical configurations during two distinct periods: the monsoon season (August 2018), characterized by strong convection, high humidity, and efficient wet deposition; and the winter season (January 2019), dominated by stagnant conditions, shallow PBLs, and pollution buildup.2,6 Through coordinated perturbations to meteorological initial and boundary conditions, PBL schemes, microphysics schemes, chemical mechanisms, emission inventories, and grid-nudging strengths, we identify an optimal WRF-Chem configuration capable of more accurately reproducing observed meteorological fields and air pollutant concentrations across India.
This work advances regional air-quality modelling by evaluating the combined effects of physical and chemical modelling choices rather than assessing individual components in isolation. The resulting optimized configuration has substantial implications for air-quality forecasting, long-term air pollution assessments, and policy design. Although the present analysis focuses on August (peak monsoon) and January (peak winter), we recognize that India also experiences distinct pre-monsoon and post-monsoon regimes. Pre-monsoon conditions are characterized by high temperatures, deep daytime boundary layers, and frequent dust transport from western arid regions, while post-monsoon conditions over northern India are often influenced by enhanced stagnation and episodic agricultural biomass burning. These regimes introduce additional sensitivities related to dust emission and transport, aerosol optical properties and mixing state, and the temporal allocation and vertical injection of biomass-burning emissions. A full extension of the present multi-component sensitivity matrix to these seasons would require a substantial additional set of simulations and is therefore beyond the scope of this work. Instead, we use the two most meteorologically contrasting regimes, peak monsoon and peak winter, to provide a stringent test of WRF-Chem meteorology–chemistry coupling and to identify a robust baseline configuration intended for broad air-quality applications across India. The paper is organized as follows: Section 2 describes the datasets, model configuration, sensitivity experiments, and validation methodology. Section 3 presents the results, identifying the optimal WRF-Chem configuration and discussing its strengths and limitations. Section 4 synthesizes key findings and proposes a modelling framework for improving air-quality simulations across India and other regions with complex meteorology and diverse emission sources. SI includes additional methodological details, supplementary figures (Fig. S1–S14), and tables (e.g., Tables S1–S5) supporting the main results.
The model domain spans 60°E to 106°E and 1°N to 46°N, encompassing India and surrounding regions to account for long-range transport influences. The domain is centred over Bilaspur (22.38°N and 82.13°E), providing balanced spatial coverage of India's diverse climatic zones while maintaining computational efficiency. Simulations were conducted at a horizontal resolution of 27 × 27 km using a grid of 321 × 331 points, with 39 vertical levels extending from the surface to 50 hPa (Fig. S1). This vertical configuration provides enhanced resolution within the planetary boundary layer (PBL), where pollutant mixing and removal processes dominate, while also allowing realistic representation of free-tropospheric and upper-level dynamical features.27,28
Two simulation periods were selected to represent contrasting meteorological regimes. August 2018 corresponds to the monsoon season, characterized by deep convection, strong moisture transport, high humidity, and frequent wet deposition, whereas January 2019 represents winter conditions marked by weak winds, strong temperature inversions, shallow boundary layers, and severe pollutant accumulation. A 10 day spin-up period was applied prior to each simulation to minimize initialization effects, and spin-up output was excluded from the analyses.
To systematically assess model sensitivity and identify an optimal configuration for air-quality applications over India, a comprehensive suite of sensitivity experiments was conducted. Seven distinct sensitivity experiment groups were designed to isolate the influence of individual modelling components, including meteorological initial and boundary conditions, physical parameterization schemes, chemical mechanisms, emission inventories, and grid-nudging strategies. These groups collectively comprise a total of 18 WRF-Chem simulations, following a controlled, stepwise experimental design in which one model component is varied at a time while all other settings are held constant. This approach enables a clear attribution of model performance differences to specific configuration choices.
Specifically, the first set of simulations (control, CFSWRF, and GDASWRF) evaluated sensitivity to meteorological initial and boundary conditions. This was followed by experiments assessing planetary boundary layer and microphysics parameterizations (MYJWRF, MYNWRF, THOMWRF, and MDMWRF) to identify optimal representations of boundary-layer mixing and precipitation processes. Subsequently, Four-Dimensional Data Assimilation (FDDA) was introduced, and model performance was evaluated using two chemical mechanisms (CBMZWRF and Nudged03). Using the best-performing chemical configuration, four anthropogenic emission inventories (CMIPWRF, ED5WRF, ED6WRF, and HTAPWRF) were then tested for their ability to reproduce pollutant concentrations at the surface and 850 hPa. Finally, five additional simulations (Nudged05, Nudged1, NoNudPBL03, NoNudPBL05, and NoNudPBL1) were conducted to examine the sensitivity of model performance to grid-nudging strength and boundary-layer-specific nudging strategies. Details of all simulations, including their physical and chemical configurations, emission inventories, meteorological forcings, and nudging settings, are summarized in SI Table S1, which serves as a comprehensive reference for the experimental design and supports reproducibility of the study.
Unless otherwise stated, all sensitivity experiments are evaluated relative to a common baseline (control) configuration. The control simulation employs ERA5 meteorological initial and boundary conditions, the Yonsei University (YSU) planetary boundary layer scheme, Lin microphysics, the MOZART-MOSAIC chemical mechanism, the SMoG-COALESCE anthropogenic emission inventory, and no grid nudging. In each sensitivity experiment, only one model component is modified relative to this baseline, while all other physical, chemical, emission, and meteorological settings are held fixed.
ECMWF Reanalysis 5th Generation reanalysis (ERA5), with a horizontal resolution of 31 km, provides high-quality fields capable of resolving synoptic and mesoscale features critical for representing pollutant dispersion and meteorological variability across the Indian region.30 Its higher resolution and improved representation of monsoonal dynamics, moisture fluxes, and temperature structure offer potential advantages for both monsoon and winter simulations.
The National Centers for Environmental Prediction – Global Data Assimilation System (NCEP-GDAS) Final Analysis (0.25° resolution) offers high-frequency updates that capture rapid temporal variability but is limited by its coarser horizontal resolution. This can weaken the representation of fine-scale meteorological features such as localized convergence, orographic lifting, or nocturnal low-level jets, which influence pollutant transport and dispersion.31
National Centers for Environmental Prediction Climate Forecast System Version 2 (NCEP CFSv2, 0.5° resolution) is widely used for climate applications, but its relatively coarse horizontal resolution and weaker representation of regional-scale features make it less suitable for resolving important processes such as shallow winter inversions, weak wind regimes, and humidity stratification, which are essential for pollution buildup and persistence.32
The performance of these datasets was assessed by comparing simulated surface temperature, relative humidity, rainfall, and wind fields against observations, satellite products, and reanalysis datasets, as shown in Fig. 1, S2–S4 and Table S2 within the SI.
The Yonsei University PBL (YSU),34 a nonlocal first-order closure scheme, is known for effectively representing convective boundary-layer dynamics and has demonstrated strong performance over Indian conditions during both monsoon and winter. Its ability to capture entrainment processes and the diurnal evolution of the PBL makes it a strong candidate for simulating pollutant mixing under varying atmospheric stability. The Mellor–Yamada–Janjic PBL scheme (MYJ)35 employs a local 1.5-order turbulence kinetic energy closure and is better suited to representing stable boundary layers. This makes it particularly relevant for winter conditions over regions such as the Indo-Gangetic Plain, where persistent temperature inversions and weak winds trap pollutants near the surface. The Mellor–Yamada–Nakanishi–Niino Level 3 PBL scheme (MYNN3)36 offers an advanced turbulence parameterization designed for complex terrain and urban environments. It attempts to capture a broader spectrum of turbulent processes and may, in principle, improve pollutant dispersion simulations in heterogeneous landscapes.
These three schemes were evaluated using PBL height (PBLH), vertical temperature profiles, and surface pollutant concentrations. Fig. 2, S5 and Table S2 provide detailed comparisons illustrating how each scheme influences pollutant accumulation and vertical mixing across regions and seasons.
The Lin scheme38 is a widely used single-moment scheme that offers computational efficiency while providing reasonable performance for regional-scale meteorological and air-quality applications. Its representation of cloud microphysical processes and precipitation formation is often suitable for capturing large-scale monsoon rainfall patterns. The Thompson scheme39 is a more advanced double-moment formulation that improves the representation of cloud–aerosol interactions by predicting both mass and number concentrations for hydrometeors. Its enhanced microphysical detail may improve the simulation of cloud fraction, liquid water path, and aerosol–cloud interactions. The Morrison scheme,40 also a double-moment scheme, is designed to capture a broad spectrum of hydrometeor species and is well suited for convective environments. Its ability to represent convective microphysical processes may contribute to improved precipitation fields and enhanced removal of pollutants.
Rainfall distributions, cloud fraction fields, and wet deposition estimate from these schemes were compared using India Meteorological Department (IMD) rainfall observations, Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA2) datasets, and satellite measurements. Fig. S6, S7 and Table S3 summarize their performance during the simulation periods.
The Carbon Bond Mechanism version Z – Model for Simulating Aerosol Interactions and Chemistry (CBMZ-MOSAIC) mechanism42 is computationally efficient and widely implemented in regional air-quality studies. However, its simplified representation of secondary chemistry may limit its ability to capture detailed oxidant chemistry and multiphase processes, especially under low-photolysis winter conditions or in regions with complex Volatile Organic Compounds (VOC) and Nitrogen Oxides (NOx) mixtures. The Model for OZone And Related chemical Tracers – Model for Simulating Aerosol Interactions and Chemistry (MOZART-MOSAIC) mechanism43 includes more comprehensive representations of ozone–NOx–VOC chemistry, aerosol thermodynamics, heterogeneous reactions, and the Volatility Basis Set (VBS) approach for secondary organic aerosol formation.44 This richer framework enables improved simulation of secondary pollutant formation, making it more suitable for regions such as northern India where wintertime stagnation and complex emissions contribute to severe pollution episodes.
Chemical mechanism performance was evaluated using pollutant concentrations from CPCB monitoring stations, MERRA2 and satellite observations for O3, SO4 and NOx, as shown in Fig. 3, S8, S12, S13 and Table S5.
The Coupled Model Intercomparison Project Phase 6 (CMIP6) inventory from Community Emissions Data System,46 available at a coarser 50 km resolution, is optimized for climate modelling applications rather than urban air-quality forecasting. Its coarse spatial representation tends to smooth strong emission gradients, which can lead to systematic underestimation of pollutant concentrations in densely populated or industrially active regions. Emissions Database for Global Atmospheric Research, Version 5 (EDGARv5),47 with a resolution of 10 km, supplies global emission coverage and updated sectoral contributions, but its reliance on generalized global activity data limits its ability to capture India's fine-scale urban and industrial emission structures. Emissions Database for Global Atmospheric Research, Version 6.1 (EDGARv6.1),48 also provided at 10 km resolution, incorporates more recent emission factors and activity data, yet still lacks the spatial detail required to characterize the intense but localized sources common in Indian metropolitan regions. Emissions Database for Global Atmospheric Research – Hemispheric Transport of Air Pollution, Version 3 (EDGAR-HTAPv3)49 provides updated global anthropogenic emissions intended for hemispheric transport assessments but remains limited in its ability to represent the complex emission heterogeneity characteristic of India.
The Speciated Multi-pOllutant Generator – COordinated Aerosol and oLution SOurce Emissions for Chemical Evaluation (SMoG-COALESCE) inventory50 is a recent high-resolution, India-specific dataset (5 km) that provides detailed sectoral emissions and offers the spatial granularity needed to resolve intra-urban gradients and localized emission hotspots. Anthropogenic emissions of PM2.5 and its precursors are primarily taken from the Speciated MultipOllutant Generator-India (SMoG-India, 2019),51 which provides sectorally resolved emissions for residential combustion, transportation, industry, energy production, agriculture, waste burning, and several other categories. This comprehensive sectoral breakdown makes SMoG-COALESCE particularly suited for simulating the diverse and spatially heterogeneous emission landscape of India.
Emission inventories that were directly available for the study period were used without modification. For inventories not available for the exact simulation months (August 2018 and January 2019), the most recent year available in the respective database was adopted as a representative proxy. Specifically, when emissions for August 2018 or January 2019 were available, those datasets were used directly; otherwise, the latest available August and January emissions from that inventory were used to represent the corresponding simulation months. This approach ensures temporal consistency while relying on the most up-to-date emission information available for each inventory. The capability of each inventory to reproduce observed spatial and temporal pollution patterns, including regional gradients and urban pollution hotspots across the Indo-Gangetic Plain and major metropolitan centres, was systematically evaluated using Fig. 4 and S9, S12, S13. The monthly total and sectoral emission comparisons over India from different emission inventories, used in the WRF-Chem emission inventory sensitivity simulations for major primary anthropogenic species during August and January, have been analyzed and are presented in Fig. S14 of the SI. The figure also illustrates the relative contribution of different emission sectors to the total emissions over the Indian region for the corresponding species and months. These comparisons reveal substantial differences in how inventories represent sectoral contributions and spatial variability, which ultimately influence the accuracy of WRF-Chem simulations under contrasting seasonal meteorological conditions.
The effects of nudging on meteorological fields were examined using surface variables (Fig. 5 and S10), surface temperature and wind speed (Fig. S3), and vertical temperature profiles (Fig. 2). Particular attention was given to winter simulations, where accurate representation of temperature inversions and stable boundary layers is essential for realistic pollutant accumulation.
Wind speed and wind direction, which are critical determinants of horizontal pollutant transport and dispersion, also exhibited strong sensitivity to the choice of meteorological initial and boundary dataset. Winter simulations forced with GDAS and CFSv2 tended to underestimate weak-wind and stagnation events in the Indo-Gangetic Plain (IGP), whereas ERA5-driven simulations more accurately captured the magnitude, timing, and diurnal cycle of low-level winds. Fig. S3 demonstrates ERA5's ability to resolve diurnal wind variability, while Fig. S4 shows improved spatial patterns of surface wind speed, both of which are crucial for simulating pollutant accumulation and dispersion during winter.29
Comparison with MERRA2 850 hPa wind speed showed that ERA5 exhibits the best performance over the Indian landmass during January, with an RMSE of 1.72 m s−1 and a correlation coefficient of 0.61. In contrast, GDAS and CFSv2 showed higher RMSE values of 1.81 m s−1 and 1.89 m s−1, with lower correlation coefficients of 0.58 and 0.55, respectively. ERA5 also provided a markedly better representation of relative humidity and moisture fields, which influence cloud formation, fog and haze occurrence, and wet deposition. In August, the improved moisture structure led to more realistic rainfall patterns, as evidenced by Fig. S2 and S6 (accumulated rainfall), enabling a more accurate simulation of aerosol wet scavenging processes. In January, realistic humidity fields improved the representation of fog, haze, and aerosol hygroscopic growth, all of which have strong implications for visibility and particulate matter concentrations. A detailed statistical analysis of ERA5 performance for various meteorological variables has been summarized in Table S2.
The combined improvements in temperature, wind, and humidity fields under ERA5 translated directly into enhanced air-quality simulations. More realistic dispersion and boundary-layer structure led to improved PM2.5, NOx, and CO concentrations. These results confirm ERA5 as the preferred meteorological initial and boundary dataset for WRF-Chem simulations over India, especially in regions affected by complex interactions among synoptic forcing, monsoonal dynamics, inversions, and fog formation. Overall, the use of ERA5 forcing reduced surface temperature RMSE by approximately 20–25% relative to GDAS and CFSv2 during August 2018, while increasing correlation coefficients by up to 0.08. During January 2019, ERA5 improved 850 hPa wind speed skill by reducing RMSE by ∼0.1–0.2 m s−1 and increasing correlation by ∼0.05–0.06. These quantitative improvements highlight the strong dependence of air-quality simulations on the accuracy of large-scale meteorological forcing.
Boundary-layer dynamics exert particularly strong control over pollutant accumulation by determining the depth of vertical mixing and the extent of surface pollutant dilution. Fig. 2 shows that the YSU PBL scheme produced the most realistic PBL heights during both monsoon and winter seasons. During January 2019, the ability of YSU to capture shallow nocturnal PBLs and strong inversion layers was essential for simulating wintertime pollution buildup over the IGP. In contrast, MYJ and MYNN3 systematically overestimated PBL height, resulting in excessive vertical mixing and underestimation of PM2.5, NOx, and CO concentrations across the IGP and other polluted urban regions.
Evaluation against MERRA2 PBL height indicates that the YSU scheme performs better over the Indian landmass during January, with an RMSE of 0.18 km and a correlation coefficient of 0.49. In comparison, MYJ and MYNN3 show higher RMSE values (0.21 km each) and lower correlations (0.38 and 0.44, respectively). Consistent behaviour is found for August when compared with Aqua/AIRS 850 hPa temperature, where YSU again shows superior performance (RMSE: 2.58 °C; correlation: 0.58), while MYJ and MYNN3 exhibit larger errors (3.29 °C and 3.19 °C) and weaker correlations (0.52 and 0.51, respectively). These findings are consistent with earlier studies demonstrating that YSU effectively represents diurnal boundary-layer dynamics and nonlocal mixing under stable conditions over India.15,34 Spatial distributions of PM2.5 (Fig. 4), together with vertical structure diagnostics (Fig. S5), further highlight the YSU scheme's ability to capture pollutant accumulation under stable winter conditions and to improve simulations of surface PM2.5, NOx, and O3.
During August, the YSU scheme again performed best by realistically representing deep convective mixing and the diurnal evolution of the boundary layer under monsoon conditions. Accurate simulation of daytime PBL growth is essential for representing pollutant lofting into the free troposphere, while nighttime collapse strongly influences surface accumulation and nocturnal chemistry. The robustness of YSU across both monsoon and winter therefore highlights its suitability for simulating boundary-layer processes and pollutant dispersion in regions characterized by strong seasonal contrasts in stability and pollution levels. A detailed statistical analysis of YSU PBL scheme performance for various meteorological variables has been summarized in Table S2.
Microphysics schemes introduced additional sensitivity, particularly during the monsoon when rainfall dominates pollutant removal. Microphysics governs cloud processes, precipitation formation, and wet deposition, making it central to both meteorological and air-quality performance.33 Fig. S6 compares accumulated rainfall for the Lin, Thompson, and Morrison schemes. The Lin scheme38 produced the most realistic rainfall patterns, especially over central and southern India where convective precipitation dominates during the monsoon season (with IMD RMSE: 12.16 mm and correlation: 0.37). Accurate representation of both stratiform and convective rainfall structures is critical because rainfall intensity, frequency, and spatial distribution determine the efficiency and timing of wet deposition, the primary mechanism for pollutant removal during this period.
The realistic rainfall fields simulated by the Lin scheme led to improved predictions of pollutant concentrations, particularly for SO2 and PM10, as demonstrated in Fig. 4 and S9. In contrast, the Thompson and Morrison schemes tended to overestimate rainfall in several regions, leading to excessive wet scavenging and unrealistically low concentrations of aerosols and trace gases. Furthermore, cloud fraction representation, crucial for radiative transfer, boundary-layer evolution, and photolysis rates, was most accurate in the Lin scheme (Fig. S7), enhancing the reliability of pollutant concentration predictions during monsoon conditions.60
A comprehensive statistical comparison of surface temperature, wind speed, relative humidity, cloud fraction, and accumulated rainfall against Aqua/AIRS observations, MERRA2 reanalysis, and IMD measurements59 confirms that simulations using ERA5 meteorology coupled with the YSU PBL and Lin microphysics schemes achieved the best overall performance across the Indian region (Table S3). The sensitivity analysis therefore highlights the superior performance of the YSU scheme for simulating boundary-layer processes and the Lin scheme for capturing precipitation and pollutant removal mechanisms. Their combined application offers a robust framework for improving meteorological and air-pollution simulations over India, and these schemes are recommended for air-quality modelling in both winter and monsoon periods to support more accurate forecasts and policy-relevant assessments. Relative to MYJ and MYNN3, the YSU scheme reduced wintertime PBL height RMSE by ∼15–20%, resulting in systematically higher near-surface pollutant concentrations that are more consistent with CPCB observations. For microphysics, the Lin scheme reduced monsoon rainfall RMSE by ∼30–40% compared to Thompson and Morrison, leading to more realistic wet scavenging and avoiding excessive removal of aerosols and trace gases.
Emission inventory choice introduced an additional major source of sensitivity. The SMoG-COALESCE inventory, with its high spatial resolution (5 km) and India-specific sectoral detail,50 provided the most accurate representation of pollution distributions during both monsoon and winter. Comparison with MERRA2 surface PM2.5 showed that SMoG-COALESCE exhibits better performance over the Indian landmass during January, with an RMSE of 44.46 µg m−3 and a correlation coefficient of 0.87. In contrast, EDGARv6.1 and EDGAR-HTAPv3 showed higher RMSE values of ∼70 µg m−3 for both, with lower correlation coefficients of 0.80 and 0.41, respectively. Urban regions such as Delhi, Kolkata, and Mumbai showed substantial improvements in simulated pollutant concentrations when SMoG-COALESCE was used, owing to its realistic representation of industrial, transport, and residential emissions (Fig. 4). In contrast, coarser global inventories such as EDGARv5, EDGARv6.1, EDGAR-HTAPv3, and CEDS-CMIP6 failed to capture localized emission hotspots and intra-urban gradients, leading to systematic underprediction of pollutant concentrations in densely populated regions and along industrial corridors (Fig. 4 and S9). A detailed statistical analysis of the performance of all emission inventories for various air pollutants is summarized in Table S4.
Overall, the combination of the MOZART-MOSAIC chemical mechanism with the SMoG-COALESCE emission inventory provided the most reliable predictions for air quality over India, particularly in urban and industrial regions with complex emission signatures. These results highlight the importance of using both a chemically comprehensive mechanism and a high-resolution, region-specific emission inventory for accurate air-quality simulations (Fig. 3, 4 and S8, S9, S12, S13). The use of the SMoG-COALESCE inventory reduced PM2.5 RMSE by approximately 35–40% relative to global inventories during winter and increased correlation coefficients by up to 0.46 in some regions (Fig. S12 and S13). These differences highlight the dominant role of emission resolution and sectoral representation in controlling simulated pollution levels over India, particularly under stagnant winter conditions.
Although MOZART-MOSAIC consistently outperforms CBMZ-MOSAIC for the present configurations and evaluation periods, this result should be interpreted in a process-based context rather than as an indication of universal superiority. The improved performance of MOZART-MOSAIC during January 2019 is physically consistent with its more comprehensive treatment of ozone–NOx–VOC chemistry, heterogeneous reactions, aerosol thermodynamics, and volatility-basis-set (VBS) secondary organic aerosol formation. These processes become increasingly important under wintertime conditions characterized by weak photolysis, shallow boundary layers, and enhanced multiphase chemistry, which are prevalent over northern India during severe pollution episodes. At the same time, it is important to recognize that apparent improvements in model skill may partially reflect compensating errors among meteorology, emissions, and chemistry. For example, uncertainties in emission magnitudes, spatial allocation, or vertical mixing can offset biases in chemical production or loss pathways, leading to improved agreement with observations without necessarily implying optimal representation of individual processes (Fig. S14). Under different regimes, such as pre-monsoon dust-dominated conditions or post-monsoon biomass-burning periods, the relative importance of mineral dust chemistry, aerosol optical properties, fire emission injection heights, and plume aging may alter the comparative performance of chemical mechanisms. Consequently, the advantages identified here for MOZART-MOSAIC should be viewed as regime-dependent, and further evaluation across additional seasons, years, and emission scenarios is required to assess broader transferability.
Excessive nudging at 0.001 s−1 weakened boundary-layer turbulence, sea-breeze circulations, and urban heat-island effects, all of which are crucial for pollutant dispersion. Conversely, weaker nudging at 0.0003 s−1 was insufficient to correct synoptic-scale wind and temperature biases, allowing errors in the reanalysis boundary conditions to propagate through the domain (Table S5).
Fig. 1 and 2, reinforced by vertical structure diagnostics in Fig. S3 and Table S5, show that moderate nudging (0.0005 s−1) significantly improved the representation of vertical temperature profiles during January 2019. The enhanced simulation of temperature inversions and boundary-layer stability strengthened the model's ability to capture conditions that trap pollutants near the surface over northern India. Improved inversion strength was accompanied by better reproduction of low-level wind speed and direction, which play a critical role in determining wintertime PM2.5 and NOx accumulation (Table S5). In addition, moderate nudging improved moisture and relative humidity fields, enhancing the simulation of fog, haze, cloud microphysics, and wet deposition processes features especially important during strongly stratified winter conditions.
These meteorological improvements translated directly into enhanced pollutant simulations. Fig. 5 and 6 demonstrate that the 0.0005 s−1 nudging configuration produced substantially more accurate PM2.5 and NOx concentrations, particularly during stagnant cold-season episodes when small errors in stability and wind fields can lead to large discrepancies in pollutant predictions (Table S5). Thus, the analyses clearly indicate that moderate grid nudging at 0.0005 s−1, applied above the PBL, provides the optimal configuration for improving both meteorological and air-quality predictions over India. It corrects large-scale biases without eroding mesoscale variability essential for pollutant dispersion. These findings further support its inclusion in the preferred WRF-Chem configuration identified in this study, alongside ERA5 initial and boundary conditions, the YSU PBL scheme, the Lin microphysics scheme, the MOZART-MOSAIC chemical mechanism, and the SMoG-COALESCE emission inventory (see Fig. S8 and S9).
Finally, when evaluated as part of the optimized model setup, this nudging configuration contributed substantially to improved agreement with CPCB ground observations across major Indian states (Fig. 6). Coefficients of determination ranged from 0.41 for surface PM2.5 in Rajasthan (January 2019) to 0.73 for surface CO in Delhi (August 2018), while meteorological parameters such as wind speed and relative humidity showed r2 values of 0.42 (Karnataka, August 2018) and 0.67 (Andhra Pradesh, January 2019), respectively (Fig. S11). These results highlight the importance of grid nudging in enhancing the fidelity of meteorological and chemical simulations over India's complex atmospheric environment and emphasise its value in generating reliable, policy-relevant air-quality forecasts.
Therefore, the sensitivity experiments demonstrate that differences in model configuration can lead to changes of 20–50% in key performance metrics, emphasizing that physically consistent choices of meteorological forcing, PBL and microphysics schemes, chemical mechanisms, emission inventories, and nudging strategies are essential for reliable air-quality simulations over India.
We note that the present evaluation does not explicitly include pre-monsoon dust-dominated conditions or post-monsoon biomass-burning episodes. Consequently, while the optimized configuration is robust across monsoon and winter extremes, additional uncertainties may arise in seasons where mineral dust emissions, aerosol mixing state and optical properties, and biomass-burning emission timing/injection heights play a dominant role. Extending the optimized configuration to these regimes is a logical next step to further refine season-specific aerosol and emission process representations.
Across all experiments, the selection of meteorological initial and boundary conditions emerged as the dominant control on model performance, establishing the baseline realism upon which chemistry and aerosol processes operate. Simulations driven by ERA5 consistently outperformed those forced by NCEP GDAS and NCEP CFSv2 for key variables, including near surface temperature, relative humidity, wind fields, cloud characteristics, and vertical thermodynamic structure. ERA5's superior resolution and improved representation of mesoscale circulation and moisture transport enhanced the simulation of monsoonal convection and rainfall-related processes, while also better capturing wintertime stagnation, inversion strength, and shallow boundary-layer evolution. These meteorological improvements translated directly into more realistic pollutant dispersion, accumulation, and removal, underscoring that credible air-quality predictions over India require meteorological forcing that robustly represents both convective and stable-regime dynamics.
Sensitivity to boundary layer physics further demonstrated that accurate representation of vertical mixing and stability is essential for reproducing observed pollution levels. Among the tested schemes, the YSU PBL scheme provided the most realistic PBL structure in both seasons. Its ability to represent shallow nocturnal PBLs and strong inversions during winter improved the simulation of near-surface pollutant build-up over northern India, while its performance under convective conditions supported more realistic daytime mixing during the monsoon. Microphysics choices introduced additional sensitivity, particularly during August when wet scavenging is the principal removal pathway for aerosols and soluble gases. The Lin microphysics scheme produced the most realistic rainfall distribution and cloud characteristics among the tested options, contributing to improved representation of monsoon-season wet deposition and associated pollutant variability. Together, these results highlight that physically consistent boundary-layer and cloud-precipitation processes are prerequisite for reliable air-quality simulation across India's seasonal extremes.
Chemical mechanisms and emission inventories introduced substantial additional uncertainty by controlling secondary formation pathways and the magnitude and structure of emitted precursors. Within the tested configurations, MOZART-MOSAIC performed better than CBMZ-MOSAIC for multiple pollutants, consistent with its more comprehensive treatment of ozone–NOx–VOC chemistry, aerosol thermodynamics, and secondary pollutant formation processes that are especially relevant under wintertime stagnant conditions. Emission inventory choice exerted a similarly strong influence on simulated pollution magnitudes and spatial gradients. The SMoG-COALESCE inventory, with its India-specific sectoral detail and high spatial resolution, best captured observed urban-industrial hotspots and regional gradients, whereas coarser global inventories (e.g., EDGAR and CEDS-CMIP6) tended to smooth localized emissions and systematically underpredict concentrations over densely populated and industrial corridors, particularly across the Indo-Gangetic Plain. The combined results reinforce that both chemically appropriate mechanism complexity and regionally realistic, high-resolution emissions are necessary to reproduce India's highly heterogeneous pollution environment.
Grid nudging experiments further emphasized the need to balance synoptic-scale realism with preservation of mesoscale and boundary-layer variability. A moderate nudging coefficient of 0.0005 s−1 applied above the PBL provided the best overall performance, improving large-scale circulation patterns, vertical thermal structure, and winter inversion strength without excessively damping local circulations and mixing processes that govern pollutant dispersion. Stronger nudging suppressed mesoscale variability relevant to air quality, whereas weaker nudging did not adequately constrain synoptic-scale biases. These findings support the use of moderate, above-PBL nudging as a pragmatic approach for extended regional air-quality simulations over India.
Based on the combined evaluation, the preferred WRF-Chem configuration for India in this study consists of ERA5 meteorological initial and boundary conditions, the YSU PBL scheme, Lin microphysics, the MOZART-MOSAIC chemical mechanism, the SMoG-COALESCE anthropogenic emission inventory, and grid nudging (0.0005 s−1) applied above the boundary layer. This configuration improves the simulation of near-surface meteorology, pollutant concentrations, and their spatiotemporal patterns across diverse Indian regions under both monsoon and winter regimes. Agreement with CPCB observations and satellite/reanalysis products confirms that the optimized setup provides a robust foundation for applications requiring physically consistent meteorology–chemistry coupling, including air-quality forecasting, retrospective assessments, and scenario-based analyses relevant to mitigation planning.
At the same time, the conclusions of this work should be interpreted with appropriate caution. The optimized configuration demonstrates robustness within the evaluated months and tested sensitivity matrix, but it should not be interpreted as universally optimal across all seasons, years, or emission scenarios. In regional air-quality modelling, improved agreement with observations can arise partly from compensating errors among meteorological forcing, emissions, and chemistry. Accordingly, the stronger performance of MOZART-MOSAIC relative to CBMZ-MOSAIC here should be viewed as regime-dependent and linked to the specific seasonal conditions examined, rather than as unconditional superiority. A further limitation is that the sensitivity experiments were conducted for peak monsoon (August) and peak winter (January) only, and therefore do not explicitly evaluate pre-monsoon dust-dominated conditions or post-monsoon biomass-burning episodes. Although these processes are represented in WRF-Chem, key uncertainties such as dust emission parameterizations, aerosol optical properties, and biomass-burning emission magnitude, temporal allocation, and plume injection height may influence model skill in those regimes and therefore warrant targeted evaluation. Extending the optimized configuration to the pre- and post-monsoon periods, as well as testing its transferability across additional years and emission scenarios, constitutes an important next step toward refining season-specific process representations and strengthening the generality of the framework developed here.
| This journal is © The Royal Society of Chemistry 2026 |