A novel kNN algorithm with data-driven k parameter computation
被引:136
作者:
Zhang, Shichao
论文数: 0引用数: 0
h-index: 0
机构:
Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin, Guangxi, Peoples R ChinaGuangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin, Guangxi, Peoples R China
Zhang, Shichao
[1
]
Cheng, Debo
论文数: 0引用数: 0
h-index: 0
机构:
Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin, Guangxi, Peoples R China
Univ South Australia, Informat Technol & Math Sci, Adelaide, SA, AustraliaGuangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin, Guangxi, Peoples R China
Cheng, Debo
[1
,2
]
Deng, Zhenyun
论文数: 0引用数: 0
h-index: 0
机构:
Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin, Guangxi, Peoples R ChinaGuangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin, Guangxi, Peoples R China
Deng, Zhenyun
[1
]
Zong, Ming
论文数: 0引用数: 0
h-index: 0
机构:
Massey Univ, Inst Nat & Math Sci, Auckland, New ZealandGuangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin, Guangxi, Peoples R China
Zong, Ming
[3
]
Deng, Xuelian
论文数: 0引用数: 0
h-index: 0
机构:
Guangxi Univ Chinese Med, Coll Publ Hlth & Management, Nanning, Guangxi, Peoples R ChinaGuangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin, Guangxi, Peoples R China
Deng, Xuelian
[4
]
机构:
[1] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin, Guangxi, Peoples R China
[2] Univ South Australia, Informat Technol & Math Sci, Adelaide, SA, Australia
[3] Massey Univ, Inst Nat & Math Sci, Auckland, New Zealand
[4] Guangxi Univ Chinese Med, Coll Publ Hlth & Management, Nanning, Guangxi, Peoples R China
This paper studies an example-driven k-parameter computation that identifies different k values for different test samples in kNN prediction applications, such as classification, regression and missing data imputation. This is carried out with reconstructing a sparse coefficient matrix between test samples and training data. In the reconstruction process, an l(1)-norm regularization is employed to generate an element-wise sparsity coefficient matrix, and an LPP (Locality Preserving Projection) regularization is adopted to keep the local structures of data for achieving the efficiency. Further, with the learnt k value, k NN approach is applied to classification, regression and missing data imputation. We experimentally evaluate the proposed approach with 20 real datasets, and show that our algorithm is much better than previous k NN algorithms in terms of data mining tasks, such as classification, regression and missing value imputation. (C) 2017 Elsevier B.V. All rights reserved.