An open-source software package for multivariate modeling and clustering: applications to air quality management

被引:20
作者
Wang, Xiuquan [1 ]
Huang, Guohe [2 ,3 ]
Zhao, Shan [1 ]
Guo, Junhong [3 ]
机构
[1] Univ Regina, Inst Energy Environm & Sustainable Commun, Regina, SK S4S 0A2, Canada
[2] Univ Regina, Inst Energy Environm & Sustainabil Res, UR NCEPU, Regina, SK S4S 0A2, Canada
[3] North China Elect Power Univ, Inst Energy Environm & Sustainabil Res, UR NCEPU, Beijing 102206, Peoples R China
关键词
Multivariate modeling; Multivariate clustering; Stepwise cluster analysis; Cluster tree; Air quality management; SYSTEM; REMEDIATION; IMPACT;
D O I
10.1007/s11356-015-4664-7
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This paper presents an open-source software package, rSCA, which is developed based upon a stepwise cluster analysis method and serves as a statistical tool for modeling the relationships between multiple dependent and independent variables. The rSCA package is efficient in dealing with both continuous and discrete variables, as well as nonlinear relationships between the variables. It divides the sample sets of dependent variables into different subsets (or subclusters) through a series of cutting and merging operations based upon the theory of multivariate analysis of variance (MANOVA). The modeling results are given by a cluster tree, which includes both intermediate and leaf subclusters as well as the flow paths from the root of the tree to each leaf subcluster specified by a series of cutting and merging actions. The rSCA package is a handy and easy-to-use tool and is freely available at http://cran.r-project.org/package=rSCA. By applying the developed package to air quality management in an urban environment, we demonstrate its effectiveness in dealing with the complicated relationships among multiple variables in real-world problems.
引用
收藏
页码:14220 / 14233
页数:14
相关论文
共 50 条
  • [1] Asymptotic statistical theory of overtraining and cross-validation
    Amari, S
    Murata, N
    Muller, KR
    Finke, M
    Yang, HH
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (05): : 985 - 996
  • [2] [Anonymous], TOXICOL ENV CHEM
  • [3] [Anonymous], POPULATION PERHAPS B
  • [4] [Anonymous], ENV QUALITY REPORTS
  • [5] [Anonymous], CHEM ENG
  • [6] [Anonymous], 1993, Advanced Methods in Neural Computing
  • [7] [Anonymous], CHINA ENV SCI
  • [8] [Anonymous], 1971, Multivariate Data Analysis
  • [9] [Anonymous], Q J R METEOROL SOC
  • [10] HIERARCHICAL CLUSTER-ANALYSIS WITH STOPPING RULES BUILT ON AKAIKES INFORMATION CRITERION FOR AEROSOL-PARTICLE CLASSIFICATION BASED ON ELECTRON-PROBE X-RAY-MICROANALYSIS
    BONDARENKO, I
    VANMALDEREN, H
    TREIGER, B
    VANESPEN, P
    VANGRIEKEN, R
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1994, 22 (01) : 87 - 95