TY - JOUR
T1 - Machine learning-based prediction of ambient CO2 and CH4 concentrations with high temporal resolution in Seoul metropolitan area
AU - Park, Seongjun
AU - Moon, Kwang Joo
AU - Eom, Hyo Jin
AU - Yi, Seung Muk
AU - Kim, Youngkwon
AU - Kim, Moonkyung
AU - Rim, Donghyun
AU - Lee, Young Su
N1 - Publisher Copyright:
© 2025 Elsevier Ltd
PY - 2025/7/1
Y1 - 2025/7/1
N2 - Machine learning has the potential to support the growing need for high-resolution greenhouse gas monitoring in urban and industrial environments, where deploying extensive sensor networks is often limited by cost and operational challenges. This study presents a novel approach for estimating greenhouse gas (GHG) concentrations using routinely collected air quality and meteorological data from existing monitoring stations. Focusing on the Seoul metropolitan area in the Republic of Korea, we developed and evaluated three machine learning models - Random Forest, Long Short-Term Memory (LSTM), and an ensemble learning approach - to predict CO2 and CH4 concentrations without relying on additional GHG monitoring equipment. Among these, the ensemble learning model outperformed the individual models, consistently achieving lower error metrics, even in data-limited scenarios. Feature importance analysis identifies NO2, CO, O3, and temperature as key predictors of CO2 and CH4 level variations, highlighting the influence of combustion-related pollutants and photochemical processes. Cross-validation results confirm the model's out-of-sample capabilities; however, local factors, such as traffic density, industrial activities, and meteorology, can affect performance. Consequently, model retraining or transfer learning may be required when applying the model to new locations with comparable emission profiles or atmospheric conditions. These findings emphasize the importance of localized context in model application while also demonstrating the broader applicability of the approach. By utilizing data already available through urban monitoring networks, this study offers a scalable and cost-effective strategy to support high-resolution GHG monitoring and inform targeted climate policies in complex urban-industrial regions.
AB - Machine learning has the potential to support the growing need for high-resolution greenhouse gas monitoring in urban and industrial environments, where deploying extensive sensor networks is often limited by cost and operational challenges. This study presents a novel approach for estimating greenhouse gas (GHG) concentrations using routinely collected air quality and meteorological data from existing monitoring stations. Focusing on the Seoul metropolitan area in the Republic of Korea, we developed and evaluated three machine learning models - Random Forest, Long Short-Term Memory (LSTM), and an ensemble learning approach - to predict CO2 and CH4 concentrations without relying on additional GHG monitoring equipment. Among these, the ensemble learning model outperformed the individual models, consistently achieving lower error metrics, even in data-limited scenarios. Feature importance analysis identifies NO2, CO, O3, and temperature as key predictors of CO2 and CH4 level variations, highlighting the influence of combustion-related pollutants and photochemical processes. Cross-validation results confirm the model's out-of-sample capabilities; however, local factors, such as traffic density, industrial activities, and meteorology, can affect performance. Consequently, model retraining or transfer learning may be required when applying the model to new locations with comparable emission profiles or atmospheric conditions. These findings emphasize the importance of localized context in model application while also demonstrating the broader applicability of the approach. By utilizing data already available through urban monitoring networks, this study offers a scalable and cost-effective strategy to support high-resolution GHG monitoring and inform targeted climate policies in complex urban-industrial regions.
UR - http://www.scopus.com/inward/record.url?scp=105004373245&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=105004373245&partnerID=8YFLogxK
U2 - 10.1016/j.envpol.2025.126362
DO - 10.1016/j.envpol.2025.126362
M3 - Article
C2 - 40320126
AN - SCOPUS:105004373245
SN - 0269-7491
VL - 376
JO - Environmental Pollution
JF - Environmental Pollution
M1 - 126362
ER -