A Rough Set System for Mining from Streaming Data

被引:1
作者
Wei, Yidong [1 ]
Leung, Carson K. [1 ]
Li, Cheng [1 ]
机构
[1] Univ Manitoba, Dept Comp Sci, Winnipeg, MB, Canada
来源
2022 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE) | 2022年
基金
加拿大自然科学与工程研究理事会;
关键词
rough set; classification; data streams; decision rules; prediction; batch; aggregate; landmark; sliding window; time-fading; INCREMENTAL ATTRIBUTE REDUCTION;
D O I
10.1109/FUZZ-IEEE55066.2022.9882664
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the era of big data, dynamic data have become more popular than static data because high volumes of data can be generated and collected at a rapid rate. Although rough set theory has been widely used as a framework to mine decision rules from information system, most of the existing algorithms were not designed to handle streaming data. Hence, in this paper, we present a system based on rough set theory to mine decision rules from streaming data. In particular, our rough set system processes data streams on two bases (namely, batch-based, and aggregated-based) with three models (namely, landmark, sliding window, and time-fading models) for a total of six combinations of stream processing and mining models (e.g., batch-based landmark model). Evaluation results on comparisons with existing works on several benchmark datasets show the benefits in terms of both accuracy improvements and runtime reduction and the practicality of our rough set system in mining data streams.
引用
收藏
页数:8
相关论文
共 47 条
  • [1] Chi-BD-DRF: Design of Scalable Fuzzy Classifiers for Big Data via A Dynamic Rule Filtering Approach
    Aghaeipoor, Fatemeh
    Javidi, Mohammad Masoud
    Triguero, Isaac
    Fernandez, Alberto
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [2] Ahn S., 2019, 2019 IEEE INT C FUZZ, P1259
  • [3] Mining Frequent Patterns from Hypergraph Databases
    Alam, Md Tanvir
    Ahmed, Chowdhury Farhan
    Samiullah, Md
    Leung, Carson K.
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT II, 2021, 12713 : 3 - 15
  • [4] Discriminating Frequent Pattern Based Supervised Graph Embedding for Classification
    Alam, Md Tanvir
    Ahmed, Chowdhury Farhan
    Samiullah, Md
    Leung, Carson K.
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT II, 2021, 12713 : 16 - 28
  • [5] Arora Nidhi R., 2012, Database and Expert Systems Applications. Proceedings of the 23rd International Conference, DEXA 2012, P502, DOI 10.1007/978-3-642-32600-4_38
  • [6] An Intelligent Predictive Analytics System for Transportation Analytics on Open Data Towards the Development of a Smart City
    Audu, Abdul-Rasheed A.
    Cuzzocrea, Alfredo
    Leung, Carson K.
    MacLeod, Keaton A.
    Ohin, Nibrasul, I
    Pulgar-Vidal, Nadege C.
    [J]. COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS (CISIS 2019), 2020, 993 : 224 - 236
  • [7] Comparative accuracies of artificial neural networks and discriminant analysis in predicting forest cover types from cartographic variables
    Blackard, JA
    Dean, DJ
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 1999, 24 (03) : 131 - 151
  • [8] Camara R.C., 2018, FUZZ-IEEE, P576
  • [9] A Decision-Theoretic Rough Set Approach for Dynamic Data Mining
    Chen, Hongmei
    Li, Tianrui
    Luo, Chuan
    Horng, Shi-Jinn
    Wang, Guoyin
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2015, 23 (06) : 1958 - 1970
  • [10] Cristiani A.L., FUZZ IEEE 2021, P684