Fuzzy-Rough Set Bireducts for Data Reduction

被引:20
|
作者
Parthalain, Neil Mac [1 ]
Jensen, Richard [1 ]
Diao, Ren [2 ]
机构
[1] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3DB, Dyfed, Wales
[2] Candela Shenzhen Technol Innovate Co Ltd, Shenzhen 815000, Peoples R China
关键词
Rough sets; Tools; Uncertainty; Feature extraction; Noise measurement; Training data; Dimensionality reduction; Bireducts; feature selection (FS); fuzzy-rough sets; instance selection;
D O I
10.1109/TFUZZ.2019.2921935
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data reduction is an important step that helps ease the computational intractability for learning techniques when data are large. This is particularly true for the huge datasets that have become commonplace in recent times. The main problem facing both data preprocessors and learning techniques is that data are expanding both in terms of dimensionality and also in terms of the number of data instances. Approaches based on fuzzy-rough sets offer many advantages for both feature selection and classification, particularly for real-valued and noisy data; however, the majority of recent approaches tend to address the task of data reduction in terms of either dimensionality or training data size in isolation. This paper demonstrates how the notion of fuzzy-rough bireducts can be used for the simultaneous reduction of data size and dimensionality. It also shows how bireducts and, therefore, reduced subtables of data can be used not only as a preprocessing tool but also for the learning of compact and robust classifiers. Furthermore, the ideas can also be extended to the unsupervised domain when dealing with unlabeled data. Experimental evaluation of various techniques demonstrate that high levels of simultaneous reduction of both dimensionality and data size can be achieved whilst maintaining robust performance.
引用
收藏
页码:1840 / 1850
页数:11
相关论文
共 50 条
  • [41] Fuzzy rough set attribute reduction based on decision ball model
    Ji, Xia
    Duan, Wanyu
    Peng, Jianhua
    Yao, Sheng
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2025, 179
  • [42] Fuzzy-rough nearest neighbour classification and prediction
    Jensen, Richard
    Cornelis, Chris
    THEORETICAL COMPUTER SCIENCE, 2011, 412 (42) : 5871 - 5884
  • [43] Towards scalable fuzzy-rough feature selection
    Jensen, Richard
    Mac Parthalain, Neil
    INFORMATION SCIENCES, 2015, 323 : 1 - 15
  • [44] Application of fuzzy-rough control in deoxidation system
    Qu, YB
    Su, JY
    Feng, LG
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 2444 - 2447
  • [45] On the compact computational domain of fuzzy-rough sets
    Bhatt, RB
    Gopal, M
    PATTERN RECOGNITION LETTERS, 2005, 26 (11) : 1632 - 1640
  • [46] Attribute Reduction for Heterogeneous Data Based on the Combination of Classical and Fuzzy Rough Set Models
    Chen, Degang
    Yang, Yanyan
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2014, 22 (05) : 1325 - 1334
  • [47] Fuzzy Binary Rough Set
    Syau, Yu-Ru
    Lin, En-Bing
    Liau, Churn-Jung
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2020, 28 (02) : 317 - 329
  • [48] A fuzzy rough set-based undersampling approach for imbalanced data
    Zhang, Xiao
    He, Zhaoqian
    Yang, Yanyan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (07) : 2799 - 2810
  • [49] An Interval Type-2 Fuzzy Rough Set Model for Attribute Reduction
    Wu, Haoyang
    Wu, Yuyuan
    Luo, Jinping
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2009, 17 (02) : 301 - 315
  • [50] Representation of spatial data in an OODB using rough and fuzzy set modeling
    Beaubouef, T
    Petry, FE
    SOFT COMPUTING, 2005, 9 (05) : 364 - 373