Machine learning-based prediction of ambient CO2 and CH4 concentrations with high temporal resolution in Seoul metropolitan area

Seongjun Park, Kwang Joo Moon, Hyo Jin Eom, Seung Muk Yi, Youngkwon Kim, Moonkyung Kim, Donghyun Rim, Young Su Lee

Research output: Contribution to journalArticlepeer-review

Abstract

Machine learning has the potential to support the growing need for high-resolution greenhouse gas monitoring in urban and industrial environments, where deploying extensive sensor networks is often limited by cost and operational challenges. This study presents a novel approach for estimating greenhouse gas (GHG) concentrations using routinely collected air quality and meteorological data from existing monitoring stations. Focusing on the Seoul metropolitan area in the Republic of Korea, we developed and evaluated three machine learning models - Random Forest, Long Short-Term Memory (LSTM), and an ensemble learning approach - to predict CO2 and CH4 concentrations without relying on additional GHG monitoring equipment. Among these, the ensemble learning model outperformed the individual models, consistently achieving lower error metrics, even in data-limited scenarios. Feature importance analysis identifies NO2, CO, O3, and temperature as key predictors of CO2 and CH4 level variations, highlighting the influence of combustion-related pollutants and photochemical processes. Cross-validation results confirm the model's out-of-sample capabilities; however, local factors, such as traffic density, industrial activities, and meteorology, can affect performance. Consequently, model retraining or transfer learning may be required when applying the model to new locations with comparable emission profiles or atmospheric conditions. These findings emphasize the importance of localized context in model application while also demonstrating the broader applicability of the approach. By utilizing data already available through urban monitoring networks, this study offers a scalable and cost-effective strategy to support high-resolution GHG monitoring and inform targeted climate policies in complex urban-industrial regions.

Original languageEnglish (US)
Article number126362
JournalEnvironmental Pollution
Volume376
DOIs
StatePublished - Jul 1 2025

All Science Journal Classification (ASJC) codes

  • Toxicology
  • Pollution
  • Health, Toxicology and Mutagenesis

Fingerprint

Dive into the research topics of 'Machine learning-based prediction of ambient CO2 and CH4 concentrations with high temporal resolution in Seoul metropolitan area'. Together they form a unique fingerprint.

Cite this