Comparative Assessment of Linear Regression and Machine Learning for Analyzing the Spatial Distribution of Ground-level NO2 Concentrations: A Case Study for Seoul, Korea

Kang, Eunjin; Yoo, Cheolhee; Shin, Yeji; Cho, Dongjin; Im, Jungho

doi:10.7780/kjrs.2021.37.6.1.21

Scholarworks@UNIST

UNIST Library

File Download

서울 지역 지상~.pdf.pdf

SFX Link

Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)

Related Researcher

임정호

Im, Jungho: Intelligent Remote sensing and geospatial Information Science Lab.

Read More

Views & Downloads

Detailed Information

Cited time in webofscience

Cited time in scopus

Metadata Downloads

Comparative Assessment of Linear Regression and Machine Learning for Analyzing the Spatial Distribution of Ground-level NO2 Concentrations: A Case Study for Seoul, Korea

Alternative Title: 서울 지역 지상 NO2 농도 공간 분포 분석을 위한 회귀 모델 및 기계학습 기법 비교

Author(s): Kang, Eunjin, Yoo, Cheolhee, Shin, Yeji, Cho, Dongjin, Im, Jungho

Issued Date: 2021-12

DOI: 10.7780/kjrs.2021.37.6.1.21

URI: https://scholarworks.unist.ac.kr/handle/201301/55657

Citation: Korean Journal of Remote Sensing, v.37, no.6-1, pp.1739 - 1756

Abstract: Atmospheric nitrogen dioxide (NO2) is mainly caused by anthropogenic emissions. It contributes to the formation of secondary pollutants and ozone through chemical reactions, and adversely affects human health. Although ground stations to monitor NO2 concentrations in real time are operated in Korea, they have a limitation that it is difficult to analyze the spatial distribution of NO2 concentrations, especially over the areas with no stations. Therefore, thisstudy conducted a comparative experiment ofspatial interpolation of NO2 concentrations based on two linear-regression methods(i.e., multi linear regression (MLR), and regression kriging (RK)), and two machine learning approaches(i.e., random forest (RF), and support vector regression (SVR)) for the year of 2020. Four approaches were compared using leave-one-out-cross validation (LOOCV). The daily LOOCV resultsshowed that MLR, RK, and SVR produced the average daily index of agreement (IOA) of 0.57, which was higher than that of RF (0.50). The average daily normalized root mean square error of RK was 0.9483%, which was slightly lower than those of the other models. MLR, RK and SVR showed similar seasonal distribution patterns, and the dynamic range of the resultant NO2 concentrationsfrom these three models was similar while that from RF was relatively small. The multivariate linear regression approaches are expected to be a promising method for spatial interpolation of ground-level NO2 concentrations and other parameters in urban areas.

Publisher: 대한원격탐사학회

ISSN: 1225-6161

Keyword (Author): random forest,support vector regression, regression kriging, multi linear regression, Spatial Interpolation, gap-filling, ground-level NO2 concentration

Show Full Item Record

qrcode

RSS 1.0 RSS 2.0

UNIST | Library

Tel : 052-217-1404 / Email : scholarworks@unist.ac.kr

ScholarWorks@UNIST was established as an OAK Project for the National Library of Korea.