Attribute reduction based on overlap degree and k-nearest-neighbor rough sets in decision information systems
被引:43
作者:
Hu, Meng
论文数: 0引用数: 0
h-index: 0
机构:
Macau Univ Sci & Technol, Fac Informat Technol, Taipa, Macao, Peoples R ChinaMacau Univ Sci & Technol, Fac Informat Technol, Taipa, Macao, Peoples R China
Hu, Meng
[1
]
Tsang, Eric C. C.
论文数: 0引用数: 0
h-index: 0
机构:
Macau Univ Sci & Technol, Fac Informat Technol, Taipa, Macao, Peoples R ChinaMacau Univ Sci & Technol, Fac Informat Technol, Taipa, Macao, Peoples R China
Tsang, Eric C. C.
[1
]
Guo, Yanting
论文数: 0引用数: 0
h-index: 0
机构:
Macau Univ Sci & Technol, Fac Informat Technol, Taipa, Macao, Peoples R ChinaMacau Univ Sci & Technol, Fac Informat Technol, Taipa, Macao, Peoples R China
Guo, Yanting
[1
]
Chen, Degang
论文数: 0引用数: 0
h-index: 0
机构:
North China Elect Power Univ, Dept Math & Phys, Beijing 102206, Peoples R ChinaMacau Univ Sci & Technol, Fac Informat Technol, Taipa, Macao, Peoples R China
Chen, Degang
[2
]
Xu, Weihua
论文数: 0引用数: 0
h-index: 0
机构:
Southwest Univ, Coll Artificial Intelligence, Chongqing 400715, Peoples R ChinaMacau Univ Sci & Technol, Fac Informat Technol, Taipa, Macao, Peoples R China
Xu, Weihua
[3
]
机构:
[1] Macau Univ Sci & Technol, Fac Informat Technol, Taipa, Macao, Peoples R China
[2] North China Elect Power Univ, Dept Math & Phys, Beijing 102206, Peoples R China
[3] Southwest Univ, Coll Artificial Intelligence, Chongqing 400715, Peoples R China
The k-nearest-neighbor rule is a popular classification technique, and rough set theory is an effective mathematical tool to deal with the uncertainty of data. Rough set models based on k-nearest-neighbor relations have a strong ability to approximate decisions, but the cal-culation is very time-consuming. In this paper, we model the overlap degree of objects from different categories in advance to accelerate the attribute reduction and improve the classification performance of the selected attributes. Firstly, we define the coincidence degree (CD) and distance (DIS) of objects from different categories to measure the coverage and distance of between-class objects. Secondly, we combine CD and DIS to define the over-lap degree (OD) to pre-sort attributes, then use k-nearest-neighbor rough sets to filter inconsistent and redundant attributes. The pre-sort operation based on OD can greatly reduce the number of searches for attributes and ensure that the attributes with high sep-arability should be selected first. Furthermore, we design a fast reduction algorithm (OD&KNN) to obtain a reduct with the ability to approximate decisions as well as the orig-inal attributes but with lower OD. Comparing experimental results and time complexity of OD&KNN with state-of-the-art algorithms, OD&KNN is more efficient for high-dimensional data while ensuring classification accuracy. (c) 2021 Elsevier Inc. All rights reserved.