Open Access Article
Bahadar
Zeb
*a,
Khan
Alam
b,
Allah
Ditta
*c,
Mazhar
Sajjad
d and
Maqbool
Ahmad
e
aDepartment of Mathematics, Shaheed Benazir Bhutto University, Sheringal Dir (Upper), Khyber Pakhtunkhwa, Pakistan. E-mail: zebsbbu@gmail.com
bDepartment of Physics, University of Peshawar, Khyber Pakhtunkhwa, Pakistan
cDepartment of Environmental Sciences, Shaheed Benazir Bhutto University Sheringal, Dir (U), Khyber Pakhtunkhwa 18000, Pakistan. E-mail: allah.ditta@sbbu.edu.pk
dDepartment of Computer Science, Alhamd Islamic University, Islamabad, Pakistan
eDepartment of Elementary and Secondary Education, Peshawar, Khyber Pakhtunkhwa, Pakistan
First published on 19th November 2025
Rising levels of carbon dioxide (CO2) and methane (CH4) in the atmosphere are significant contributors to global climate change, although regional differences and mechanisms are poorly understood, especially in South Asia. This study examines the spatial and temporal patterns, seasonal changes, and climatic effects of CO2 and CH4 over Pakistan through satellite measurements (AIRS, 2002–2017), weather, and vegetation indicators (NDVI). We evaluate the contribution of human-made activities, biomass burning, and natural processes (e.g., monsoon or soil respiration) to the regulation of greenhouse gas (GHG) concentrations. Moreover, we assess the contribution of long-range transportation by our neighboring areas (the Middle East and Central Asia) using HYSPLIT trajectory modeling. The results show an average yearly growth of CO2 (2.1 ppm per year) and CH4 (3.5 ppb per year), seasonal peaks of CO2 (spring) and CH4 (summer), associated with agriculture, temperature-dependent respiration, and monsoonal cycles. CO2 and NDVI (−0.50) and CH4 and NDVI (+0.64) depict negative and positive associations, respectively, and play the role of vegetation as a carbon sink and wetland and rice paddy emissions. Other significant findings of the study are sudden changes in GHG patterns (CO2: 2009; and CH4: 2007–2014) that occur with upward temperatures, indicating climate feedbacks. This study incorporates radiative forcing dynamics and air mass paths, which provide important insights into the regional GHG drivers and their climatic implications and contribute to policy interventions to reduce emission levels in South Asia. The cloud fraction had a negative correlation with both CO2 (r = −0.36 and p < 0.04) and CH4 (r = −0.20 and p < 0.03). The trajectories of the air mass of the rear indicate that the distant pollution of neighboring countries is a factor. Burning of crop residues, car emissions, forest burning, and others release small quantities of gases and contaminants into the air. This study compares atmospheric CO2 and CH4 prediction models. The dominant trend is strong linearity. In the case of CH4, linear regression is the best and most suggested model. In the case of CO2, ARIMA provided the most accurate forecasts by detecting minor autocorrelation. More complicated models, such as LSTM, failed to work, which proved that simpler models are effective on this kind of data.
Environmental significanceAmong greenhouse gases, carbon dioxide (CO2) and methane (CH4) are the most important gases that have a significant impact on the climate. The current study was conducted to examine the monthly and seasonal variation in the concentrations of CO2 and CH4, to better understand the inter-annual variation as well as increasing trends of CO2 and CH4 during the study period (2002–2017), to identify the time-varying characteristics of CO2 and CH4 concentrations and the probable causes, which characterize the variability in different time scales over Pakistan, and to investigate the effects of meteorology, cloud properties, vegetation dynamics on CO2 and CH4 concentrations and the effects of long-range air masses on CO2 and CH4 concentrations. The results show that the CO2 and CH4 emission levels in Pakistan are alarming and ultimately contribute to climate change. It is expected that these results will help scientific communities further explore the root causes of the recent CO2 and CH4 increase to better mitigate their potential impact on global warming. |
Even though methane is less concentrated than CO2, it has a high global warming potential. Human activities have significantly increased their atmospheric concentration, which is now higher than that before the industrial era. Methane is produced during both natural and anthropogenic activities and is grouped into three formation pathways: (1) Thermogenic: it is formed under high pressure and temperature at very deep levels in the crust of the earth as a result of decomposing organic matter. It is emitted during the extraction, transportation, and processing of fossil fuels (oil, gas, and coal). (2) Pyrogenic: it is formed by the partial decomposition of organic matter, such as wildfires and savanna burning, agricultural waste and crop residue burning, and biofuel combustion.9 (3) Microbial: it is formed by methanogenic bacteria in anaerobic environments, including natural wetlands (lakes, peatlands, and rice paddies); livestock digestion (ruminants); landfills; and wastewater treatment. Although methane is inexhaustible, it undergoes (1) oxidation by hydroxyl radicals (OH) (approximately 90 percent of the removal) to produce other types of secondary pollutants, such as formaldehyde (CH2O), carbon monoxide (CO), and ozone (O3);10 (2) absorption by the soil (oxidation of dry soil); and (3) chlorination under the influence of chlorofluorocarbons (CFCs). Total methane emissions are currently more than what nature has taken up; thus, there is a progressive rise in concentrations present in the atmosphere. The main cause of this increase is human activities, which include the consumption of fossil fuels, farming, and waste disposal.2 It has been demonstrated that, globally, several studies have been used to estimate and describe the rising patterns and variability of CO2 and CH4 using satellites. Studies based on the measurements of instruments, such as the Greenhouse gases Observing Satellite (GOSAT) and the Orbiting Carbon Observatory-2 (OCO-2), have provided previously unknown details on the spatial and temporal distribution of these gases and found continuing increases in atmospheric levels.11 Major emission hotspots, including the industrialized areas of East Asia and North America, and large natural sources, such as the Amazon basin, were identified. Moreover, global studies have played a key role in explaining the multifaceted relationship between human-generated emissions, natural biogeochemical cycles, and climatic factors, like temperature and precipitation, which give rise to seasonal and interannual changes.6,12 It is against this global body of work that regional studies, including this one, can be compared and contextualized. The report by IPCC2 indicates that the levels of CO2 and CH4 in the atmosphere have been rising since the Industrial Revolution. Nevertheless, most developing nations find it difficult to keep regular track of such gases owing to a lack of know-how and state-of-the-art apparatus. Long-term greenhouse gas (GHG) evaluation requires ground-based measurements in most cases. Remote sensing through satellites is a possible solution. Although the spatial resolution can be low, this technology can provide continuous global results for trace gases, providing essential data on their trends and effects on human health, ecosystems, and climate change. Remote sensing as a state-of-the-art methodology supplements ground-based state-of-the-art systems and improves the capacity to monitor GHG emissions to guide mitigation strategies.
Pakistan is one of the largest greenhouse gas (GHG) emitters in the world, with the 13th largest global anthropogenic emission of CH4, and 504.59 million tons of total emissions in 2018.13 Key concerns are as follows: (1) large amounts of methane (CH4) emissions: 13th largest anthropogenic CH4 emitter globally and top 5 in livestock-related CH4 emissions (enteric fermentation); (2) CO2 build-up: deforestation, and local sources of great emission sources are as follows: transportation (vehicles), agriculture (livestock and biomass burning), and forest fires. Pakistan experiences aggravated floods, droughts, and extreme temperatures. Developing countries, like Pakistan, have inadequate mechanisms to even monitor such emissions due to the high cost of deploying ground-based monitoring networks. However, with the latest developments in the field of atmospheric remote sensing (RS), reliable sensors, such as the atmospheric infrared sounder (AIRS), have made monitoring of CO2 and CH4 feasible.13,14 Mahmood et al.15 investigated the atmospheric concentrations of CO2 and CH4 over Pakistan using the Atmospheric Infrared Sounder (AIRS) from 2010 to 2015. Ning An et al.16 attempted to evaluate the potential of space-based observations to monitor atmospheric CO2 changes over 120 districts through simple data-driven analyses from 2015 to 2020. Noman et al.17 estimated the GHG (carbon dioxide and methane) footprint based on the one-year average fossil fuel consumption in selected Private Sector Universities of Karachi.
Although some previous studies have provided snapshots, these are usually constrained by more stringent periods or a more focused consideration. This study fills this gap by providing a long-term (16 years) analysis of both CO2 and CH4 across Pakistan. The originality of this work is an approach that is an integrated analysis of the spatiotemporal trends of interest and study of the driving mechanism, i.e., the importance of meteorological parameters (temperature, humidity, precipitation, and wind speed), vegetation dynamics (NDVI), and cloud characteristics, and a critical evaluation of the transportation of transboundary pollution through HYSPLIT modelling. This complex analysis offers a more comprehensive explanation of the causes of GHG variability in this poorly studied but noteworthy area. Regrettably, the successful application of satellite RS in investigating CO2 and CH4 monitoring in Pakistan has hardly been documented. Thus, this research is aimed at tracing the temporal distribution patterns of CO2 and CH4 in the long term (2002–2017) in Pakistan and predicting the two gases using AIRS data. The results will be used to make the scientific community aware of the factors that cause the increase in CO2 and CH4 in the air and to overcome their contribution to global warming. This study points to an alarming level of emissions of these gases in Pakistan, which has contributed to climate change.
913 square kilometres (Fig. 1 and Table 1). The country boasts a variety of landscapes with the high mountain ranges of the Himalayan and Karakoram ranges located in the north, the fertile Indus River plains located in the central region, and the arid deserts of Sindh and Baluchistan located in the south and west. Pakistan has great climatic diversity, with an alpine climate in the highlands of the country and arid and semi-arid climates in the southern plains. There are four seasons in the country: cool winter (December–February), spring (March–May), summer (June–August), and autumn (September–November).18 The Indus River is the lifeblood of Pakistan, and the problems of water shortage and climate change are increasing. Pakistan is governed by four provinces: Punjab, Sindh, Khyber Pakhtunkhwa (KP), and Baluchistan, and the federally governed territory of Islamabad Capital Territory, Azad Jammu and Kashmir, and Gilgit-Baltistan. Pakistan is the fifth most populous nation in the world, with a population of over 240 million people. Urban centres, like Karachi, Lahore, and Islamabad, are the centre of economic, political, and cultural activities, with rural life in most cases being agrarian. This area is of interest to the study as it is important in terms of the ecological zones and socioeconomic inequalities and is prone to environmental and geopolitical hazards. Critical problems include the management of water resources, agricultural sustainability, pressures of urbanization, and hazards caused by the climate, such as flooding and droughts. It is important to gain a regional understanding of Pakistan when formulating policies and sustainable development initiatives.
![]() | ||
| Fig. 1 Map of the study area (ArcMap 10.5). All the datasets (shapefiles) were obtained from DIVA GIS and UNOCHA. | ||
| Parameter | Annual pattern and key characteristics |
|---|---|
| Temperature | Strong seasonal variation. The highest temperatures occur in the early summer months (June: 30.75 °C and July: 30.51 °C). The lowest temperatures are found in the winter (January: 10.55 °C). This indicates a large annual temperature range of approximately 20 °C |
| Relative humidity (RH) | Inversely correlated with temperature. The highest RH values occur in the cool winter months (January: 45.91% and February: 44.11%). The lowest RH is observed in the late spring/early summer (May: 26.04%), coinciding with the pre-monsoon hot and dry period |
| Wind speed (WS) | Peaks during the summer monsoon. The strongest winds are recorded from July (5.83 m s−1) to August (5.59 m s−1). The calmest wind conditions occur in the post-monsoon period (October: 3.47 m s−1 and November: 3.32 m s−1) |
The data from Fig. 2 in this study allow us to reconstruct the annual cycle.
![]() | ||
| Fig. 2 Average monthly variations in meteorological parameters (including relative humidity, temperature, precipitation, and wind speed) over Pakistan during the study period (2002–2017). | ||
Fig. 2 illustrates the research area's fundamental climate characteristics. Meteorological parameters, like relative humidity (RH), temperature (Temp), precipitation (pre), and wind speed (WS), are obtained from the AIRS satellite. The meteorological parameters over Pakistan indicate the highest temperature in June (30.75 °C) and July (30.51 °C), with the lowest temperature found in January (10.55 °C).
High precipitation values were noted in January (7.43 mm), June (7.68 mm), and December (6.34 mm), while the lowest precipitation was found in September (0.08 mm). Likewise, the highest RH (%) was found in January (45.91%) to February (44.11%), and the lowest one was recorded in May (26.04%). Similarly, the maximum wind speed was recorded from July (5.83 m s−1) to August (5.59 m s−1), and the minimum from October (3.47 m s−1) to November (3.32 m s−1).
Additionally, we employed the 0.5° spatial resolution Normalized Difference Vegetation Index (NDVI) from the MODIS-Terra platform. NDVI is the ratio of albedo (α) measured at various wavelengths:
![]() | (1) |
• Cloud Fraction (CF): The percentage of an area covered by clouds.
• Cloud Top Temperature (CTT): The temperature at the top of the clouds.
The purpose of including these parameters was to move beyond simply measuring gas concentrations and to start quantifying their climatic effects over Pakistan. Clouds play a critical role in the Earth's energy balance:
• They reflect incoming solar radiation (a cooling effect).
• They trap outgoing longwave (thermal) radiation (a warming effect).
By correlating GHG concentrations with cloud properties, the authors aimed to investigate if and how increasing levels of CO2 and CH4 are influencing local cloud characteristics, which in turn affect regional climate.
![]() | (2) |
The probability of this normalized test statistic is calculated. The following equation gives the probability density function for a normal distribution with a mean of zero and a standard deviation of one:
![]() | (3) |
• Decide on the degree of significance (95% is typical).
• A trend is said to be decreasing if Z is negative and the computed probability is greater than the level of significance. A trend is said to increase if Z is positive, and the computed probability is greater than the level of significance. There is no trend if the calculated probability is less than the level of significance.
![]() | (4) |
The sequential numbers U(t) and U′(t) from the Mann–Kendall test's progressive analysis were calculated to observe how the trend changed over time.25U(t) is the sum of the z values found from the first to the last data point. This test considers the relative values of all terms in the time series (x1, x2,…, xn). The following steps are sequentially applied. The magnitudes of the xj annual mean time series (j = 1, 2, …, n) are compared with xi (I = 1, 2, …, j − 1). For each comparison, the number of cases xj > xi is counted and denoted by nj.
•The test statistic t is thus provided using the following equation:
![]() | (5) |
• Mean and variance of the test statistic are respectively
![]() | (6) |
![]() | (7) |
•We can calculate the sequential values of the statistic U(t) using the following equation:
![]() | (8) |
Similarly, U′(t) values are computed backward, starting at the end of the series. The sequential Mann–Kendall model could be regarded as an effective method for determining the beginning year(s) of a trend. If the values are greater than the confidence interval (1.96), the hypothesis of no change is rejected, and the approximate time of the change point (abrupt change) is shown by the intersection of and U′(t) in the time series.
000 m above ground level was considered. The HYSPLIT model-based Trajstat software was used to calculate all air mass trajectories.26
Because there is insufficient quality control by the relevant authorities, CO2 concentration may be a byproduct of burning low-grade fuel and using an inappropriate combustion system, which can be found on the open market. The main sources of CO2 concentration in the study locations included traffic congestion, road conditions, and industrial exhaust, and related results were apparent from Ul Haq et al.30 in Pakistan. Kuttippurath et al.31 found the highest CO2 concentration in India during the spring season during their study period (2002–2020). Coa et al.32 found the highest CO2 concentration in spring (384.0 ppm) and the lowest in winter (382.5 ppm) over six locations globally from 2003 to 2011. Wei et al.33 found that the average concentrations of CO2 were 428.36 ± 13.96 ppm in the megacity of Shanghai, China, from 2017 to 2018, with the highest CO2 concentration in winter and the lowest in autumn. Kumar et al.34 also investigated the highest values of CO2 during the spring season in India. Metya et al.35 found an average CO2 concentration of 406.05 ± 6.36 ppm at Sinhagad, India, and attains its minimum concentration during autumn, whereas CO2 reaches its maximum concentration during spring.
The key reasons for the nature of the CO2 cycle are as follows:
1. Lag in photosynthesis: in spring, warmer temperatures cause a rapid increase in soil and plant respiration (a CO2 source), but the full photosynthetic drawdown (the CO2 sink) from vegetation has not yet reached its peak. This temporary imbalance causes CO2 to accumulate, leading to a peak in May.
2. Biomass burning: specifically in the Indian subcontinent, widespread agricultural and forest fires in the pre-monsoon (spring) season add a significant pulse of CO2 to the atmosphere, reinforcing the natural peak.
In winter, respiration is lower due to cold temperatures. Although human emissions from heating are high, the lack of this large biogenic CO2 release from soils means that the concentration does not reach the levels observed in the spring.
![]() | ||
| Fig. 4 Box–Whisker plots indicating (a) monthly variations in CH4 and (b) seasonal variations in CH4 over Pakistan. The Box–Whisker representation in all panels is the same as in Fig. 3. | ||
In the troposphere, the balance between surface emission and OH destruction mostly determines the concentration of CH4. In the Indian subcontinent, ruminants, rice paddies, and wetlands are the main sources of CH4.36 The Kharif (monsoon) season may be linked to the maximum concentration, which occurs during the summer.37 The seasonality of the CH4 concentration over Asia is characterized by greater values in the wet season and lower values in the dry season.38 This could be because of strong emissions from wetlands and rice fields during the wet season. During the winter and spring seasons, low mixing ratios of CH4 were primarily caused by a decrease in atmospheric hydrocarbons because of fewer photochemical reactions and a significant drop in solar intensity.39 The summer and autumn seasons showed a significant rate of CH4 change. According to Nishanth et al.40 and Goroshi et al.,37 the interchange of CH4 between rice paddy fields and the atmosphere is governed by both biological and physical processes. This is why in the current study area, more CH4 is observed during the summer and autumn seasons.
Kavitha and Nair41 found the maximum CH4 concentration during August/September over various locations in India from 2003 to 2009 and investigated the CH4 concentration that ranged from the minimum value of 1740 ppm to the maximum value of 1890 ppm using satellite data. They also observed the peak value of CH4 concentration during the monsoon and post-monsoon seasons and the minimum during the winter season. They associate CH4 concentration with livestock distribution and wetland emission, including rice fields. Ul Haq et al.30 observed the maximum concentration of CH4 during summer (1804 ± 28) ppb, followed by autumn (1800 ± 25 ppb) and winter (1777 ± 24 ppb) over Pakistan, Afghanistan, and adjoining areas using satellite data from 2003 to 2012. Wei et al.33 reported the average concentrations of CH4 to be 2154 ± 190 ppb, in the megacity of Shanghai, China, from 2017 to 2018, with the highest value in summer and the lowest one in spring.
![]() | ||
| Fig. 5 Box–Whisker plots indicate an inter-annual variation in (a) CO2 and (b) CH4 over Pakistan from 2002 to 2017. The Box–Whisker representation in all panels is the same as that depicted in Fig. 4. | ||
| Greenhouse gas | Trend | Slope (change per year) | R 2 (coefficient of determination) | P-value | Overall increase (2002–2017) |
|---|---|---|---|---|---|
| CO2 | Significant increase | +2.1 ppm per year | >0.95 (very high) | P ≤ 0.05 | 8.6% (e.g., ∼376 to ∼406 ppm) |
| CH4 | Significant increase | +3.5 ppb per year | >0.95 (very high) | P ≤ 0.05 | 2.9% (e.g., ∼1791 to ∼1858 ppb) |
The observed trend for CO2 (2.1 ppm per year) aligns closely with findings from regional and global studies. Kuttippurath et al.31
reported an average trend of ∼2.1 ppm per year over India, while Cao et al.32 found a nearly identical global mid-tropospheric increase of 2.11 ppm per year. Similarly, the CH4 growth rate of 3.5 ppb per year is consistent with the significant increases documented in South Asia. For instance, Mahmood et al.15 reported a rise of 5.02 ppb per year over Pakistan from 2003 to 2015. The findings of Ul Haq et al.,30 who observed a 3.7% increase in CH4 over a decade, further corroborate the persistent and widespread nature of increasing GHG concentrations across the region; these rates are primarily driven by anthropogenic activities rather than natural variability.
The seasonal variation in the concentration of both CO2 and CH4 in Table 3 shows that CO2 has a relatively high value of 403 ppm during the spring season. Likewise, CH4 shows maximum values (1885 ppb) during the autumn season. It is determined that P ≤ 0.05 denotes a considerable rise in the concentration of both gases.
| Year | Winter season | Spring season | Summer season | Autumn season |
|---|---|---|---|---|
| Average conc. Of CO 2 (ppm) | ||||
| 2002 | 372 | 376 | 375 | 371 |
| 2003 | 375 | 377 | 376 | 375 |
| 2004 | 376 | 380 | 377 | 376 |
| 2005 | 378 | 383 | 379 | 378 |
| 2006 | 381 | 383 | 381 | 380 |
| 2007 | 382 | 385 | 383 | 383 |
| 2008 | 384 | 386 | 385 | 384 |
| 2009 | 386 | 388 | 387 | 387 |
| 2010 | 389 | 391 | 390 | 389 |
| 2011 | 392 | 393 | 392 | 391 |
| 2012 | 392 | 393 | 392 | 392 |
| 2013 | 394 | 396 | 395 | 395 |
| 2014 | 397 | 397 | 398 | 396 |
| 2015 | 399 | 401 | 402 | 399 |
| 2016 | 401 | 403 | 400 | 400 |
| 2017 | 403 | 403 | 400 | 401 |
![]() |
||||
| Average conc. Of CH 4 (ppb) | ||||
| 2002 | 1791 | 1780 | 1862 | 1823 |
| 2003 | 1794 | 1788 | 1852 | 1828 |
| 2004 | 1801 | 1796 | 1849 | 1824 |
| 2005 | 1807 | 1845 | 1854 | 1836 |
| 2006 | 1802 | 1810 | 1839 | 1834 |
| 2007 | 1809 | 1817 | 1866 | 1839 |
| 2008 | 1817 | 1822 | 1875 | 1850 |
| 2009 | 1820 | 1817 | 1843 | 1840 |
| 2010 | 1825 | 1816 | 1858 | 1859 |
| 2011 | 1830 | 1839 | 1868 | 1872 |
| 2012 | 1831 | 1831 | 1869 | 1873 |
| 2013 | 1845 | 1838 | 1871 | 1865 |
| 2014 | 1849 | 1839 | 1862 | 1864 |
| 2015 | 1852 | 1852 | 1875 | 1871 |
| 2016 | 1855 | 1844 | 1875 | 1884 |
| 2017 | 1858 | 1846 | 1878 | 1885 |
![]() | ||
| Fig. 6 An abrupt change in CO2 resulting from the sequential Mann–Kendal test statistics during (a) winter, (b) spring, (c) summer, and (d) autumn. U(t) is known as the forward sequence, which follows a normal distribution. U′(t) is denoted as the backward sequence derived from eqn (8), and UL and LL denote the upper and lower limits. The significance of the trend was calculated using the MK trend test statistic (Z) from eqn (2). Generally, Z > 0 indicates an increasing trend, Z < 0 indicates a decreasing trend, and Z = 0 indicates no trend. | ||
![]() | ||
| Fig. 7 An abrupt change in CH4 resulting from the sequential Mann–Kendal test statistics during (a) winter, (b) spring, (c) summer, and (d) autumn. The various terms like U(t), U′(t), UL, LL, and Z have the same representation, as shown in Fig. 6. | ||
• CH4 Abrupt Changes (Fig. 7 and 9c): The change-points for CH4 are more complex and variable than for CO2. The annual analysis (Fig. 9c) suggests a potential shift beginning around 2010–2011, with U(t) crossing the confidence limit. However, the seasonal analysis (Fig. 7a–d) reveals a more scattered pattern, with intersections and divergences occurring at different times (e.g., ∼2008 in autumn and ∼2012 in winter). This scattered nature suggests that the drivers for methane are more complex and season-specific, which is likely influenced by a combination of anthropogenic activity and climatic variables. For instance, change-points in spring/autumn could be linked to modifications in agricultural practices (e.g., rice cultivation patterns and livestock management), while those in winter/summer may be related to shifts in monsoon patterns or temperature, which control microbial methane production in wetlands and rice paddies.
• Temperature trend shift (Fig. 9a): the SQMK test for temperature reveals a highly significant change-point starting around 2009, with U(t) sharply and permanently exceeding the upper confidence limit. This timing coincides precisely with the change-point identified for CO2. This synchronicity suggests a potential climate feedback mechanism, where the continued accumulation of CO2 and other GHGs began to manifest a more pronounced and consistent warming signal in the regional climate system from that year onward.
![]() | ||
| Fig. 8 Average monthly variations in the concentrations of (a) CO2vs. NDVI, CF, and CTT, and (b) CO2vs. Temp, RH, Pre (precipitation), and W/S (wind speed). | ||
| CO2 and NDVI | CO2 and CF | CO2 and CTT | CO2 and Temp | CO2 and RH | CO2 and Pre | CO2 and wind speed |
|---|---|---|---|---|---|---|
| −0.50 | −0.36 | 0.31 | 0.12 | −0.45 | −0.23 | −0.35 |
| P = 0.01 | P = 0.02 | P = 0.03 | P = 0.001 | P = 0.003 | P = 0.05 | P = 0.08 |
| CH4 and NDVI | CH4 and CF | CH4 and CTT | CH4 and Temp | CH4 and RH | CH4 and Pre | CH4 and wind speed |
|---|---|---|---|---|---|---|
| 0.64 | −0.20 | 0.32 | 0.60 | 0.29 | −0.65 | 0.61 |
| P = 0.001 | P = 0.009 | P = 0.07 | P = 0.06 | P = 0.005 | P = 0.01 | P = 0.04 |
Precipitation (r = −0.23 and p = 0.05) and relative humidity (r = −0.45 and p = 0.003): the negative correlations align with the seasonal cycle. The dry pre-monsoon season (low precipitation/RH) is associated with peak CO2 from fires and respiration. The wet monsoon season (high precipitation/RH) corresponds with CO2 drawdown due to enhanced photosynthesis and reduced fire activity.
Wind speed (r = −0.35 and p = 0.08): the negative relationship suggests that higher winds promote the dispersion and dilution of locally emitted CO2, leading to lower observed concentrations, particularly in polluted boundary layers.
Cloud fraction (CF) (r = −0.36 and p = 0.02): the negative correlation is complex. It may reflect meteorological patterns where drier, high-pressure systems (favoring clear skies) coincide with stable conditions that allow CO2 to accumulate. Conversely, cloudy conditions often accompany precipitation and vertical mixing that dilute CO2.
In summary, CO2 variability is primarily driven by a combination of biological activity (source and sink) and anthropogenic emissions, modulated by meteorological conditions that control dispersion and dilution. The correlations with cloud properties suggest that rising CO2 interacts with and potentially modifies local cloud characteristics, contributing to regional climate feedbacks.
Pathakoti et al.42 found correlation coefficients of 0.13, −0.18, −0.32, and −0.50 between CO2 and Temp, WS, RH, and Prec, respectively, over Bharati (India) in 2016. Kumar et al.34 found correlation coefficients of 0.8 and −0.64 between CO2 and temperature and NDVI, respectively, over India from 2004 to 2011. Kumar et al.34 also found a negative correlation between mid-tropospheric CO2 and rainfall over India.
Sreenivas et al.22 investigated that during pre-monsoon, monsoon, post-monsoon, and winter, the corresponding correlation coefficients (Rs) between wind speed and CO2 are 0.56, 0.32, 0.06, and 0.67, respectively, over Shadnagar, a suburban site of Central India, in the year 2014. Nyasulu et al.43 reported a significant association between temperature (T, ◦C) and CO2 (r = 0.75 and p < 0.01) during the SON pollution peak season. Additionally, during SON, CO2 had a substantial negative association with the cloud fraction (r = − 0.55 and p < 0.05) and a significant positive correlation with cloud top temperature (r = 0.56 and p < 0.05). These findings suggest that trace gases have a considerable impact on the climate during periods of heavy pollution.
Metya et al.35 found that correlation coefficients (R) between wind speed and CO2 during monsoon, post-monsoon, winter, and pre-monsoon are 0.51, 0.15, − 0.02, and − 0.28, respectively. A good inverse correlation between GHG and wind speed suggests that with an increase in wind speeds, GHG concentrations decrease. In contrast, a weaker correlation suggests that regional/local transport plays some roles.44 Strong winds, especially during the monsoon season, are likely to dilute GHG concentration. The changes in CO2 and CH4 concentrations are linked to the adjusted temperature, and it was found that both gases have been increasing from the beginning of the study period (2002), but temperature showed an increasing trend only after 2009. As is clear from Fig. 9a–c, comparing the temperature trends with those of trace gases indicates that both temperature and trace gases are increasing over the study area, and a steady increase in temperature from 2009 onwards coincides with the change point for CO2.
![]() | ||
| Fig. 9 Annual abrupt change in (a) temperature, (b) CO2, and (c) CH4 resulting from the sequential Mann–Kendal test statistics. The various terms like U(t), U′(t), UL, LL, and Z have the same representation as that depicted in Fig. 6. | ||
![]() | ||
| Fig. 10 Average monthly variations in the concentrations of (a) CH4vs. NDVI, CF, and CTT, (b) CH4vs. Pre, RH, WS, and Temp. | ||
Precipitation (r = −0.65 and p = 0.01): the strong negative correlation is likely indirect and related to atmospheric chemistry, not emissions. Higher precipitation is associated with increased cloud cover and reduced solar radiation, which lowers the atmospheric concentration of hydroxyl radicals (OH), the primary sink for methane. With its main removal mechanism weakened, CH4 concentrations accumulate.
Wind speed (r = −0.61 and p = 0.04): similar to CO2, the negative correlation indicates that higher wind speeds disperse and dilute concentrated plumes of CH4 from point sources like wetlands, agricultural areas, and leaks from infrastructure.
Cloud fraction (CF) (r = −0.20 and p = 0.03): the weak negative correlation may again reflect synoptic weather patterns where conditions favouring CH4 buildup (e.g., low wind and stable atmosphere) may also be associated with fewer clouds.
In summary, CH4 concentrations are predominantly driven by temperature-dependent microbial emissions from agriculture (rice) and natural wetlands, as evidenced by the strong link between NDVI and temperature. Its atmospheric lifetime is further modulated by meteorological factors that control its destruction (via OH), leading to an observed strong negative correlation with precipitation.
Khaliq et al.10 showed that the CH4 concentration was significantly affected by anthropogenic emissions, NDVI, meteorological parameters, and soil moisture over South, East, and Southeast Asia from 2009 to 2020. Sreenivas et al.21 also reported a positive correlation between CH4 and NDVI in a suburban site in India. Metya et al.35 noted a positive correlation between CH4 and NDVI over Sinhagad, located on the Western Ghats in peninsular India. Nyasulu et al.43 also found a positive correlation between CH4 and NDVI over Muzambiq, Africa.
Sreenivas et al.21 found that pre-monsoon, monsoon, post-monsoon, and winter wind speed and CH4 correlation coefficients (Rs) are 0.28, 0.71, 0.21, and 0.60, respectively, over Shadnagar, India, in 2014.
Nyasulu et al.43 found that the autumn season is the pollution peak season and there is a significant association between temperature (T ◦C) and CH4 (r = 0.80 and p < 0.01). In addition, during autumn, CH4 demonstrated a strong negative association with cloud percentage (r = − 0.69 and p < 0.01) and a large positive correlation with cloud top temperature (r = 0.74 and p < 0.01). These findings suggest that trace gases have a considerable impact on the climate.
According to Metya et al.,35 the correlation coefficients (R) between wind speed and CH4 are −0.57, −0.3, −0.02, and −0.2 in the monsoon, post-monsoon, winter, and pre-monsoon seasons, respectively. Conversely, a lower correlation implies that local or regional emissions are important.47 The GHG concentration is expected to be diluted by strong winds, particularly during the monsoon season. This is confirmed for CH4, where there is a negative correlation between wind and CH4 concentration (R = −0.61).
The changes in CO2 and CH4 concentrations were tried to link to the adjusted temperature, and it was found that both gases showed an increasing trend from the beginning of the study period (2002), but temperature showed an increasing trend only after 2009. The observed temperature pattern indicates an increasing trend. As shown in Fig. 9a–c, comparing the temperature trends with those of trace gases indicates that both temperature and trace gases are increasing over the study area, and a steady increase in temperature from 2009 onwards coincides with the change point for CH4. During the study period, a steady temperature rise signifies a temperature response to a significant increase in trace gases.
• Interpretation and essence: this is a key indicator of the greenhouse effect in action.
(1) CO2 and CH4 are well-mixed greenhouse gases that absorb thermal infrared radiation emitted by the Earth's surface and the atmosphere.
(2) This absorption warms the atmospheric layer where it occurs.
(3) Clouds form at altitudes where the temperature drops to the dew point. If the entire lower atmosphere (the troposphere) is warmer due to increased GHG concentrations, the cloud formation altitude might shift, or the cloud tops might be warmer.
(4) A higher cloud.
Fig. 11 (a) and (b) show the three-day back trajectory analysis of air masses at a height of 1000 m above the ground surface during the study period (2002–2017). The three-day back trajectories were calculated and clustered into four seasons for each entire study period (2002–2017) over Pakistan. It is clear from Fig. 11 that most of the source regions lie in the Middle East, arising from Egypt, Saudi Arabia, Iraq, Iran, etc., and travel long distances to reach the receptor region, Pakistan. However, air masses also arise from central Asia from Turkmenistan and Uzbekistan and travel through Afghanistan to reach Pakistan. Air masses also arise from neighboring countries, such as India.
Middle Eastern countries are some of the strongest GHG emitters in the world.47 In particular, Saudi Arabia is the largest producer and exporter of petroleum products, and 90% of its revenue relies on oil and petroleum-related industries.48 In addition, the Peninsula (Iraq, Bahrain, Kuwait, Saudi Arabia, etc.) is considered a hotspot for the emission of GHGs due to vast oil and natural gas reserves and industries.4 Moreover, in Iraq, the total CO2 emission increased by 300% from fuels that are roughly 14
000 Gg and 58
000 Gg from 1990 to 2017.9 In addition, the Egyptian share of GHG emissions to the global atmosphere increased from 0.4% in 2000 to 0.6% in 2016.49 The air masses from these regions traveled through Iran and Afghanistan to reach Pakistan. Iran is ranked 7th in terms of CO2 emissions resulting from fuel combustion in the world.50 The main sources of CO2 and CH4 emissions are various sectors of energy production, agriculture, livestock, forestry, and waste. Moreover, in Afghanistan, coal burning is the major cause of GHG emissions, and over 70% of household energy is used to heat space and water. In Kabul, on average, each family produces 4062 kilograms of greenhouse gases (CO2, NO2 and CH4), and concerning this value, approximately 2.39 million tons of GHG are emitted in one month during winter.51 As shown in Fig. 12, some of the air masses arise from central Asia, including Turkmenistan and Azerbaijan. Turkmenistan is one of the few countries with an entire dependence on fossil fuels, with the sixth-largest natural gas reserve in the world. 99% of the electricity in the country is provided by natural gas-fired power plants.52 Similarly, the economy of Azerbaijan is also significantly based on oil production.53 The air masses from central Asia and Middle Eastern regions traveled through Afghanistan and reached Pakistan and contributed to locally emitted CH4 and CO2 to elevate their concentration in the regional troposphere.
To predict atmospheric CO2 and CH4 concentrations from 2002 to 2017, we evaluate the performance of five forecasting models: Linear Regression, Exponential Smoothing (ETS), ARIMA, LSTM, and SARIMA. The tuning of the Hyperparameter is conducted for each model.
• Hyperparameters: we experimented with different settings. The last model had 50 units in the LSTM layer at a learning rate of 0.01 and 1000 epochs. The predictive data used a rolling window of the past 3 years.
Training: the Adam optimizer and Mean Squared Error (MSE) were used as the loss function to train the model. Even with tuning, the model never converged to a solution that was better than the linear trend, which means that it was overfitting, and the model was fundamentally incorrect in its approach to the data, which was highly linear.
Hyperparameters of CO2: the optimal performance of the ARIMA was ARIMA (1,1,1), meaning one autoregressive term, one degree of differencing, and one moving average term.
Smart parameters of SARIMA: we tested seasonal terms (e.g., SARIMA (1,1,1) (1,1,1,12)), but there was no significant seasonal model in the annual aggregated data, showing that SARIMA was an overly complicated selection.
| Model | MSE | RMSE | MAE | R 2 | MAPE |
|---|---|---|---|---|---|
| Linear regression | 0.2101 | 0.4584 | 0.3714 | 0.9977 | 0.10% |
| Exponential smoothing | 0.2101 | 0.4584 | 0.3714 | 0.9977 | 0.10% |
| ARIMA | 8652.3921 | 93.0182 | 23.9110 | −91.8053 | 6.42% |
| SARIMA | 8652.3921 | 93.0182 | 23.9110 | −91.8053 | 6.42% |
| LSTM | 0.9590 | 0.9793 | 0.7783 | 0.9840 | 0.20% |
| Model | MSE | RMSE | MAE | R 2 | MAPE |
|---|---|---|---|---|---|
| LSTM | 9.7485 | 3.1223 | 2.7073 | 0.3082 | 0.65% |
| ARIMA | 7.3027 | 2.7023 | 2.6265 | 0.4818 | 0.63% |
| Exponential smoothing | 12.3441 | 3.5134 | 3.4823 | 0.1240 | 0.84% |
| Linear regression | 12.3441 | 3.5134 | 3.4823 | 0.1240 | 0.84% |
| Model | MAE | RMSE | MAPE |
|---|---|---|---|
| Linear regression | 2.123 | 2.456 | 0.12% |
| ETS | 3.215 | 3.678 | 0.18% |
| ARIMA | 4.892 | 5.234 | 0.27% |
| ARIMA (1,1,1) | 3.956 | 4.312 | 0.22% |
| LSTM | 5.678 | 6.123 | 0.31% |
• Linear regression is the clearly best model in CH4. It gave the fewest errors on the test forecast (2018–2024) with an MAE of 2.123 ppb, RMSE of 2.456 ppb, and MAPE of 0.12%. The trend is overwhelmingly linear as it almost explains everything in the historical data (R2 = 0.9977).
• In the case of CO2, ARIMA (1, 1, 1) gave the best projections. It performed better during the test period, as it gave the lowest RMSE (2.7023) and the highest R2 (0.4818), which means that it could capture the small autocorrelation structure of the CO2 data that a pure linear trend failed to capture.
The findings summarize that linear regression is the choice of chart to use in CH4 prediction, and ARIMA is the choice of chart in predicting CO2 in the case of this data. The LSTM model, which is the most complex, still performed the worst, which confirms our assumption that easier models are more effective in this case.
Linear regression showed an almost perfect fit using the historical data of the two gases, with an R2 of 0.9977 for CO2. This means that the simple linear trend describes more than 99.7 percent of the variance in the data, and more complicated models are not needed to describe the historical pattern. The steady annual growth (about 2.0–2.3 ppm per year of CO2) is best fitted to a linear model.
More sophisticated models, such as SARIMA (seasonality) and LSTM (complex non-linearities), did not manage to do so on the training data. The benefit of their added complexity was not significant, and, as is usual with parsimonious data, they introduced a risk of overfitting.
It is worth noting that the LSTM model performed poorly for both gases. This is typical when small and clean datasets are employed, which are highly linear because LSTMs require large amounts of data to learn more complicated patterns and can be easily outperformed by simpler and more statistically suitable models.
(1) Because of its remarkable accuracy, ease of use, interpretability, and low processing cost, linear regression should be the model of choice for operational forecasting of CH4 with this type of data.
(2) An ARIMA model should be considered for CO2 forecasting since it captured the trend and autocorrelation well and produced the most accurate forecasts in our test scenario.
(1) Finding the most realistic model for predicting these particular GHG trends was the goal.
(2) The concept of parsimony is the main finding: the most effective model is the simplest one (linear regression for CH4 and ARIMA for CO2), as adding complexity (LSTM and SARIMA) had no advantage and frequently resulted in worse performance.
(3) We specifically address the reasons why complicated models do not work, such as LSTM's requirement for huge datasets to identify patterns that are already well captured by a straightforward linear trend.
➢ CO2 maxima in May (389 ± 8 ppmv) resulted from increased biomass burning, soil respiration, and pre-monsoon agricultural activities, and minima in October (386 ± 8 ppmv) occurred after photosynthetic uptake during the monsoons.
➢ CH4 peaks in August (1876 10 ppbv) with wetlands and rice planting during the monsoon, and the lowest concentrations occur in April (1820 21 ppbv) with decreased photochemical activity.
➢ Both gases are characterised by a high rate of increase: CO2 increasing by 2.1 ppmv/year and CH4 increasing by 3.5 ppbv per year over 15 years, corresponding to total increases of 8.6 and 2.9 percent, respectively.
➢ Sudden shifts observed through the Mann–Kendall test (CO2 in 2009, CH4 in 2007–2014) are consistent with the increased anthropogenic effects (industrial expansion, deforestation, and burning of fossil fuels).
➢ CO2 has a negative relationship with NDVI (r = −0.50) and precipitation, showing the importance of vegetation and monsoonal uptake in concentration control.
➢ CH4 exhibits very positive correlations with NDVI (r = 0.64) and temperature (r = 0.60), with both microbial activity and agricultural emissions being dependent on temperature.
➢ Both gases affect radiative forcing, with CO2 and CH4 having cloud top temperature and cloud fraction relationships, respectively, of r = 0.31 and r = −0.20, highlighting the climatic feedback processes of the two gases.
➢ The HYSPLIT trajectory analysis indicates transboundary Middle East (oil/gas industries) and Central Asia (fossil fuel dependence) contributions to the problem of local agriculture, transportation, and biomass burning emissions.
➢ Increased surveillance: increased ground-based and satellite monitoring to measure emission hotspots and model climates.
➢ Emission control: strengthening of the emission control of industries and vehicles, use of renewable energy, and sustainable agricultural methods (e.g., alternate wetting/drying in rice fields).
➢ Regional cooperation: collaborate with other neighbouring countries to deal with transboundary pollution by working together on similar climate programmes.
In addition, it was observed that simple statistical models (linear regression of CH4 and ARIMA of CO2) performed better than more complicated models, such as LSTM, in the scenario of strongly linear trends of GHG. This offers an excellent, cost-effective model in which policymakers and scientists can project future GHG concentrations and evaluate scenarios for the mitigation of emissions without the complexity and obscurity of computationally demanding models.
| This journal is © The Royal Society of Chemistry 2026 |