NEC: A nested equivalence class-based dependency calculation approach for fast feature selection using rough set theory
被引:13
|
作者:
Zhao, Jie
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R China
Cornell Univ, Sch Elect & Comp Engn, New York, NY 14850 USAGuangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R China
Zhao, Jie
[1
,2
]
Liang, Jia-Ming
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R ChinaGuangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R China
Liang, Jia-Ming
[1
]
Dong, Zhen-Ning
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R ChinaGuangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R China
Dong, Zhen-Ning
[1
]
Tang, De-Yu
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Pharmaceut Univ, Sch Med Informat & Engn, Guangzhou 510006, Peoples R ChinaGuangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R China
Tang, De-Yu
[3
]
Liu, Zhen
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Pharmaceut Univ, Sch Med Informat & Engn, Guangzhou 510006, Peoples R ChinaGuangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R China
Liu, Zhen
[3
]
机构:
[1] Guangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R China
[2] Cornell Univ, Sch Elect & Comp Engn, New York, NY 14850 USA
[3] Guangdong Pharmaceut Univ, Sch Med Informat & Engn, Guangzhou 510006, Peoples R China
Feature selection plays an important role in data mining and machine learning tasks. As one of the most effective methods for feature selection, rough set theory provides a systematic theoretical framework for consistency-based feature selection, in which positive region-based dependency calculation is the most important step. However, it is time-consuming, and although many improved algorithms have been proposed, they are still computationally time-consuming. Therefore, to overcome this shortcoming, in this study, a nested equivalence class (NEC) approach is introduced to calculate dependency. The proposed method starts from the finest partition of the universe, and then extracts and uses the known knowledge of reducts in a decision table to construct an NEC. The proposed method not only simplifies dependency calculation but also reduces the universe correspondingly, in most cases. Using the proposed NEC-based approach, a number of representative heuristic- and swarm intelligence-based feature selection algorithms that apply rough set theory were enhanced. Note that the feature subset selected by each modified algorithm and that selected by the original algorithm were the same. Experiments conducted using 33 datasets from the UCI repository and KDD Cup competition, which included large-scale and high-dimensional datasets, demonstrated the efficiency and effectiveness of the proposed method. (C) 2020 Elsevier Inc. All rights reserved.
机构:
Guangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R China
Cornell Univ, Sch Elect & Comp Engn, New York, NY 14850 USAGuangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R China
Zhao, Jie
Liang, Jia-ming
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R ChinaGuangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R China
Liang, Jia-ming
Dong, Zhen-ning
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R ChinaGuangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R China
Dong, Zhen-ning
Tang, De-yu
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Pharmaceut Univ, Sch Med Informat & Engn, Guangzhou 510006, Peoples R ChinaGuangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R China
Tang, De-yu
Liu, Zhen
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Pharmaceut Univ, Sch Med Informat & Engn, Guangzhou 510006, Peoples R ChinaGuangdong Univ Technol, Sch Management, Guangzhou 510006, Peoples R China