Data reduction is an important step that helps ease the computational intractability for learning techniques when data are large. This is particularly true for the huge datasets that have become commonplace in recent times. The main problem facing both data preprocessors and learning techniques is that data are expanding both in terms of dimensionality and also in terms of the number of data instances. Approaches based on fuzzy-rough sets offer many advantages for both feature selection and classification, particularly for real-valued and noisy data; however, the majority of recent approaches tend to address the task of data reduction in terms of either dimensionality or training data size in isolation. This paper demonstrates how the notion of fuzzy-rough bireducts can be used for the simultaneous reduction of data size and dimensionality. It also shows how bireducts and, therefore, reduced subtables of data can be used not only as a preprocessing tool but also for the learning of compact and robust classifiers. Furthermore, the ideas can also be extended to the unsupervised domain when dealing with unlabeled data. Experimental evaluation of various techniques demonstrate that high levels of simultaneous reduction of both dimensionality and data size can be achieved whilst maintaining robust performance.
机构:
Indian Inst Technol, Dept Elect Engn, Control Grp, New Delhi 110016, IndiaIndian Inst Technol, Dept Elect Engn, Control Grp, New Delhi 110016, India
Bhatt, RB
Gopal, M
论文数: 0引用数: 0
h-index: 0
机构:
Indian Inst Technol, Dept Elect Engn, Control Grp, New Delhi 110016, IndiaIndian Inst Technol, Dept Elect Engn, Control Grp, New Delhi 110016, India
机构:
North China Elect Power Univ, Dept Math & Phys, Beijing 102206, Peoples R China
Xi An Jiao Tong Univ, Xian 710049, Peoples R China
Tsinghua Univ, Tsinghua, Peoples R China
North China Elect Power Univ, Beijing 102206, Peoples R ChinaNorth China Elect Power Univ, Dept Math & Phys, Beijing 102206, Peoples R China
Chen, Degang
Yang, Yanyan
论文数: 0引用数: 0
h-index: 0
机构:
North China Elect Power Univ, Sch Control & Compute Engn, Beijing 102206, Peoples R ChinaNorth China Elect Power Univ, Dept Math & Phys, Beijing 102206, Peoples R China
机构:
Xi An Jiao Tong Univ, Sch Energy & Power Engn, State Key Lab Multiphase Flow Power Engn, Xian 710049, Peoples R ChinaXi An Jiao Tong Univ, Sch Energy & Power Engn, State Key Lab Multiphase Flow Power Engn, Xian 710049, Peoples R China
Wu, Haoyang
Wu, Yuyuan
论文数: 0引用数: 0
h-index: 0
机构:
Xi An Jiao Tong Univ, Sch Energy & Power Engn, State Key Lab Multiphase Flow Power Engn, Xian 710049, Peoples R ChinaXi An Jiao Tong Univ, Sch Energy & Power Engn, State Key Lab Multiphase Flow Power Engn, Xian 710049, Peoples R China
Wu, Yuyuan
Luo, Jinping
论文数: 0引用数: 0
h-index: 0
机构:
Kunlun Technol Ind Corp, Hangzhou 310012, Zhejiang, Peoples R ChinaXi An Jiao Tong Univ, Sch Energy & Power Engn, State Key Lab Multiphase Flow Power Engn, Xian 710049, Peoples R China