Improving deterministic forecasts of maximum and minimum temperature using machine learning

Harvir Singh; Anumeha Dube; Prashant Kumar Srivastava; Raghavendra Ashrit; John P. George; V. S. Prasad

doi:10.1039/D6VA00077K

Harvir Singh,*^ab Anumeha Dube,^a Prashant Kumar Srivastava,^b Raghavendra Ashrit,^a John P. George^a and V. S. Prasad^a

Author affiliations

* Corresponding authors

^a National Centre for Medium-Range Weather Forecasting, Ministry of Earth Sciences, India
E-mail: harvir@ncmrwf.gov.in, harvir.ncmrwf@gmail.com

^b Institute of Environmental and Sustainable Development, Banaras Hindu University, India

Abstract

Accurate forecasts of near-surface temperature are essential for heatwave and cold-wave warnings and for impact-based decision support over India. Deterministic numerical weather prediction (NWP) models show systematic, regionally varying biases that increase with lead time, so bias correction is essential for improving forecast reliability. This study applies a multivariate machine-learning (ML) bias-correction framework to location-specific 2 m maximum (T_max) and minimum (T_min) temperature forecasts from the operational NWP model at the National Centre for Medium-Range Weather Forecasting (NCMRWF), using data from 179 India Meteorological Department (IMD) stations for the period 2019–2024. Four ML methods were used to bias-correct the forecasts at these stations: Random Forest (RF), eXtreme Gradient Boosting (XGB), Long Short-Term Memory (LSTM), and Convolutional Neural Networks (CNNs). The ML models were assessed using continuous metrics, namely mean error (ME), root mean square error (RMSE), and correlation/Taylor diagnostics. Categorical skill for extremes was assessed using the equitable threat score (ETS) and Heidke Skill Score (HSS), for T_max ≥ 30/35 °C in MAMJ (March–June) and T_min ≤ 10/15 °C in DJF (December–February), together with Relative Economic Value (REV). ML post-processing substantially reduces bias and error across stations and lead times. For T_max, the RMSE improvement increases with lead time: typically ∼10–15% at Day-1, ∼20–30% by Day-5, and frequently >30–40% (locally reaching ∼50–60%) by Day-9, especially for XGB/LSTM. For T_min, the improvements are largest: XGB improves RMSE by ∼25–40% at Day-1, increasing to ∼40–60% by Day-7 to Day-9 across many stations. Categorical verification shows consistently higher ETS/HSS values after bias correction at most stations. Winter T_min shows large gains for both thresholds, particularly for T_min ≤ 15 °C. REV analysis indicates that ML-corrected forecasts remain economically useful over a wider range of cost–loss ratios and retain value at longer lead times than the raw model. Overall, XGB provides the most consistent improvement across regions and metrics, RF is generally second-best, LSTM is competitive, particularly for T_max and at longer lead times, and CNN performs worst. Analysis based on SHapley Additive exPlanations (SHAP) links the corrections to physically meaningful drivers, with T_max corrections dominated by boundary-layer/land-surface predictors and T_min corrections dominated by radiative and synoptic controls.
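
The abstract names the verification metrics but does not give their formulas. As a reading aid, the following minimal Python sketch implements the standard textbook forms of ME, RMSE, percentage RMSE improvement, ETS, HSS, and REV (the cost-loss form based on hit rate and false-alarm rate). These definitions are assumed rather than taken from the paper, which may use different conventions. All function names are hypothetical, and the six temperature values are made-up toy numbers, not station data.

```python
import math

def mean_error(forecast, observed):
    """Mean error (bias): average of forecast minus observation."""
    return sum(f - o for f, o in zip(forecast, observed)) / len(observed)

def rmse(forecast, observed):
    """Root mean square error."""
    return math.sqrt(
        sum((f - o) ** 2 for f, o in zip(forecast, observed)) / len(observed)
    )

def rmse_improvement_pct(raw, corrected, observed):
    """Percentage RMSE reduction of the corrected forecast relative to raw."""
    r = rmse(raw, observed)
    return 100.0 * (r - rmse(corrected, observed)) / r

def contingency(forecast, observed, threshold, above=True):
    """2x2 table: hits a, false alarms b, misses c, correct negatives d.
    above=True tests value >= threshold (e.g. T_max >= 35);
    above=False tests value <= threshold (e.g. T_min <= 10)."""
    a = b = c = d = 0
    for f, o in zip(forecast, observed):
        f_event = f >= threshold if above else f <= threshold
        o_event = o >= threshold if above else o <= threshold
        if f_event and o_event:
            a += 1
        elif f_event:
            b += 1
        elif o_event:
            c += 1
        else:
            d += 1
    return a, b, c, d

def ets(a, b, c, d):
    """Equitable threat score; a_r is the number of hits expected by chance."""
    n = a + b + c + d
    a_r = (a + b) * (a + c) / n
    denom = a + b + c - a_r
    return (a - a_r) / denom if denom else float("nan")

def hss(a, b, c, d):
    """Heidke skill score."""
    denom = (a + c) * (c + d) + (a + b) * (b + d)
    return 2.0 * (a * d - b * c) / denom if denom else float("nan")

def relative_economic_value(a, b, c, d, cost_loss):
    """REV for a user with cost-loss ratio cost_loss (0 < cost_loss < 1)."""
    n = a + b + c + d
    s = (a + c) / n                      # observed base rate
    if s == 0.0 or s == 1.0:
        return float("nan")              # REV undefined without both outcomes
    hit_rate = a / (a + c)
    false_alarm_rate = b / (b + d)
    climate = min(cost_loss, s)          # expense using climatology only
    perfect = s * cost_loss              # expense with perfect forecasts
    actual = (false_alarm_rate * cost_loss * (1 - s)
              - hit_rate * s * (1 - cost_loss) + s)
    return (climate - actual) / (climate - perfect)

# Toy T_max values in degrees C (illustrative only).
observed = [36.0, 34.0, 37.0, 33.0, 35.5, 32.0]
raw = [34.0, 33.0, 34.5, 34.0, 33.0, 31.0]
corrected = [35.5, 34.5, 36.0, 33.5, 35.0, 32.5]

print("ME raw / corrected:", mean_error(raw, observed), mean_error(corrected, observed))
print("RMSE improvement (%):", rmse_improvement_pct(raw, corrected, observed))
for name, fc in (("raw", raw), ("corrected", corrected)):
    table = contingency(fc, observed, threshold=35.0)
    print(name, "ETS:", ets(*table), "HSS:", hss(*table),
          "REV at C/L = 0.3:", relative_economic_value(*table, cost_loss=0.3))
```

In this sketch, REV equals 1 for a perfect forecast and 0 for a forecast no more useful than climatology. Evaluating it over a range of cost-loss ratios gives the kind of value curve that the REV analysis in the abstract refers to.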

Environmental Science: Advances
