An Interactive Clustering-Based Visualization Tool for Air Quality Data Analysis

被引:0
作者
Ashouri, Mahsa [1 ]
Phoa, Frederick Kin Hing [2 ]
Chen, Chun-Houh [2 ]
Shmueli, Galit [3 ]
机构
[1] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[2] Acad Sinica, Inst Stat Sci, Taipei 11529, Taiwan
[3] Natl Tsing Hua Univ, Inst Serv Sci, Hsinchu 30013, Taiwan
关键词
Time series; Clustering; Web-based tool; Air quality; Environmental protection agencies; PM2.5; POLLUTION; PATTERNS;
D O I
10.4209/aaqr.230124
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Examining PM2.5 (atmospheric particulate matter with a maximum diameter of 2.5 micrometers), seasonal patterns is an important research area for environmental scientists. An improved understanding of PM2.5 seasonal patterns can help environmental protection agencies (EPAs) make decisions and develop complex models for controlling the concentration of PM2.5 in different regions. This work proposes an R Shiny App web-based interactive tool, namely a "model-based time series clustering" (MTSC) tool, for clustering PM2.5 time series using spatial and population variables and their temporal features, like seasonality. Our tool allows stakeholders to visualize important characteristics of PM2.5 time series, including temporal patterns and missing values, and cluster series by attribute groupings. We apply the MTSC tool to cluster Taiwan's PM2.5 time series based on air quality zones and types of monitoring stations. The tool clusters the series into four clusters that reveal several phenomena, including an improvement in Taiwan's air quality since 2017 in all regions, although at varying rates, an increasing pattern of PM2.5 concentration when moving from northern towards southern regions, winter/summer seasonal patterns that are more pronounced in certain types of areas (e.g., industrial), and unusual behavior in the southernmost region. The tool provides cluster-specific quantitative figures, like seasonal variations in PM2.5 concentration in different air quality zones of Taiwan, and identifies, for example, an annual peak in early January and February (maximum value around 120 mu g m(-3)). Our analysis identifies a region in southernmost Taiwan as different from other zones that are currently grouped together with it by Taiwan EPA (TEPA), and a northern region that behaves differently from its TEPA grouping. All these cluster-based insights help EPA experts implement short-term zone-specific air quality policies (e.g., fireworks and traffic regulations, school closures) as well as longer-term decision-making (e.g., transport control stations, fuel permits, old vehicle replacement, fuel type).
引用
收藏
页数:18
相关论文
共 45 条
[1]   Assessment of PM2.5 Patterns in Malaysia Using the Clustering Method [J].
Ab Rahman, Ezahtulsyahreen ;
Hamzah, Firdaus Mohamad ;
Latif, Mohd Talib ;
Dominick, Doreena .
AEROSOL AND AIR QUALITY RESEARCH, 2022, 22 (01)
[2]  
Akaike H., 1973, 2 INT S INF THEOR, P267, DOI [DOI 10.1007/978-1-4612-1694-0_15, 10.1007/978-1-4612-1694-015, DOI 10.1007/978-1-4612-1694-015]
[3]   Visualization Support to Interactive Cluster Analysis [J].
Andrienko, Gennady ;
Andrienko, Natalia .
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III, 2015, 9286 :337-340
[4]  
[Anonymous], 2013, Shiny: web application framework for R
[5]   Tree-based methods for clustering time series using domain-relevant attributes [J].
Ashouri, Mahsa ;
Shmueli, Galit ;
Sin, Chor-Yiu .
JOURNAL OF BUSINESS ANALYTICS, 2019, 2 (01) :1-23
[6]   Fast Forecast Reconciliation Using Linear Models [J].
Ashouri, Mahsa ;
Hyndman, Rob J. ;
Shmueli, Galit .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2022, 31 (01) :263-282
[7]  
Blazquez C. A., 2020, 18 LACCEI VIRT INT M, P27, DOI [10.18687/LACCEI2020.1.1.528, DOI 10.18687/LACCEI2020.1.1.528]
[8]   Bipeline: A Web-Based Visualization Tool for Biclustering of Multivariate Time Series [J].
Cachucho, Ricardo ;
Liu, Kaihua ;
Nijssen, Siegfried ;
Knobbe, Arno .
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2016, PT III, 2016, 9853 :12-16
[9]   Risk Assessment for People Exposed to PM2.5 and Constituents at Different Vertical Heights in an Urban Area of Taiwan [J].
Chen, Hsiu-Ling ;
Li, Chi-Pei ;
Tang, Chin-Sheng ;
Lung, Shih-Chun Candice ;
Chuang, Hsiao-Chi ;
Chou, Da-Wei ;
Chang, Li-Te .
ATMOSPHERE, 2020, 11 (11)
[10]   A Big Data Analysis of PM2.5 and PM10 from Low Cost Air Quality Sensors near Traffic Areas [J].
Chen, Shida ;
Cui, Kangping ;
Yu, Tai-Yi ;
Chao, How-Ran ;
Hsu, Yi-Chyun ;
Lu, I-Cheng ;
Arcega, Rachelle D. ;
Tsai, Ming-Hsien ;
Lin, Sheng-Lun ;
Chao, Wan-Chun ;
Chen, Chunneng ;
Yu, Kwong-Leung J. .
AEROSOL AND AIR QUALITY RESEARCH, 2019, 19 (08) :1721-1733