Missing value imputation method for disaster decision-making using K nearest neighbor

被引:6
作者
Ma, Xiaofei [1 ]
Zhong, Qiuyan [1 ]
机构
[1] Dalian Univ Technol, Inst Informat Management & Informat Syst, Dalian, Peoples R China
关键词
disaster; missing values; trapezoidal fuzzy numbers; K nearest neighbor; MULTIPLE IMPUTATION; REGRESSION;
D O I
10.1080/02664763.2015.1077377
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Due to destructiveness of natural disasters, restriction of disaster scenarios and some human causes, missing data usually occur in disaster decision-making problems. In order to estimate missing values of alternatives, this paper focuses on imputing heterogeneous attribute values of disaster based on an improved K nearest neighbor imputation (KNNI) method. Firstly, some definitions of trapezoidal fuzzy numbers (TFNs) are introduced and three types of attributes (i.e. linguistic term sets, intervals and real numbers) are converted to TFNs. Then the correlated degree model is utilized to extract related attributes to form instances that will be used in K nearest neighbor algorithm, and a novel KNNI method merging with correlated degree model is presented. Finally, an illustrative example is given to verify the proposed method and to demonstrate its feasibility and effectiveness.
引用
收藏
页码:767 / 781
页数:15
相关论文
共 24 条
  • [1] K nearest neighbor reinforced expectation maximization method
    Aci, Mehmet
    Avci, Mutlu
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (10) : 12585 - 12591
  • [2] [Anonymous], 1992, Fuzzy Multiple Attribute Decision Making: Methods and Applications
  • [3] [Anonymous], 1987, STAT ANAL
  • [4] A hybrid method for imputation of missing values using optimized fuzzy c-means with support vector regression and a genetic algorithm
    Aydilek, Ibrahim Berkan
    Arslan, Ahmet
    [J]. INFORMATION SCIENCES, 2013, 233 : 25 - 35
  • [5] Relative efficiency measurement: The problem of a missing output in a subset of decision making units
    Cook, Wade D.
    Harrison, Julie
    Rouse, Paul
    Zhu, Joe
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2012, 220 (01) : 79 - 84
  • [6] Recursive partitioning for missing data imputation in the presence of interaction effects
    Doove, L. L.
    Van Buuren, S.
    Dusseldorp, E.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 72 : 92 - 104
  • [7] K nearest neighbours with mutual information for simultaneous classification and missing data imputation
    Garcia-Laencina, Pedro J.
    Sancho-Gomez, Jose-Luis
    Figueiras-Vidal, Anibal R.
    Verleysen, Michel
    [J]. NEUROCOMPUTING, 2009, 72 (7-9) : 1483 - 1493
  • [8] Variable selection using random forests
    Genuer, Robin
    Poggi, Jean-Michel
    Tuleau-Malot, Christine
    [J]. PATTERN RECOGNITION LETTERS, 2010, 31 (14) : 2225 - 2236
  • [9] A practical comparison of single and multiple imputation methods to handle complex missing data in air quality datasets
    Gomez-Carracedo, M. P.
    Andrade, J. M.
    Lopez-Mahia, P.
    Muniategui, S.
    Prada, D.
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2014, 134 : 23 - 33
  • [10] Multiple imputation: Review of theory, implementation and software
    Harel, Ofer
    Zhou, Xiao-Hua
    [J]. STATISTICS IN MEDICINE, 2007, 26 (16) : 3057 - 3077