A novel approach using incremental oversampling for data stream mining

被引:5
作者
Anupama, N. [1 ]
Jena, Sudarson [2 ]
机构
[1] GITAM Univ, Hyderabad, India
[2] Sambalpur Univ, Inst Informat Technol, Sambalpur, India
关键词
Knowledge discovery; Data streams; Imbalanced data; Oversampling; Increment over sampling for data streams (IOSDS); CLASSIFICATION;
D O I
10.1007/s12530-018-9249-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data stream mining is very popular in recent years with advanced electronic devices generating continuous data streams. The performance of standard learning algorithms is been compromised with imbalance nature present in real world data streams. In this paper we propose a novel algorithm dubbed as increment over sampling for data streams (IOSDS) which uses an unique over sampling technique to almost balance the data sets to minimize the effect of imbalance in stream mining process. The experimental analysis is conducted on 15 data chunks of data streams with varied sizes and different imbalance ratios. The results suggests that the proposed IOSDS algorithm improves the knowledge discovery over benchmark algorithms like C4.5 and Hoeffding tree in terms of standard performance measures namely accuracy, AUC, precision, recall and F-measure.
引用
收藏
页码:351 / 362
页数:12
相关论文
共 50 条
  • [31] A Novel Approach for Horizontal Privacy Preserving Data Mining
    Jalla, Hanumantha Rao
    Girija, P. N.
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 2, INDIA 2016, 2016, 434 : 101 - 111
  • [32] Clustering Models for Data Stream Mining
    Mythily, R.
    Banu, Aisha
    Raghunathan, Shriram
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES, ICICT 2014, 2015, 46 : 619 - 626
  • [33] Data Stream Analytics and Mining in the Cloud
    Ari, Ismail
    Olmezogullari, Erdi
    Celebi, Omer Faruk
    2012 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), 2012,
  • [34] Research and Evolvement of Data Stream Mining
    Sun Yafeng
    Yang Xiaopin
    Huang Zhiping
    ISTM/2009: 8TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-6, 2009, : 1438 - 1441
  • [35] Data Stream Mining: Challenges and Techniques
    Khan, Latifur
    22ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2010), PROCEEDINGS, VOL 2, 2010, : 295 - 295
  • [36] New Approach in Data Stream Association Rule Mining Based on Graph Structure
    Mojaveri, Samad Ganderi
    Mirzaeian, Esmaeil
    Bornaee, Zarrintaj
    Ayat, Saeed
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, 2010, 6171 : 158 - +
  • [37] Intelligent Adaptive Ensembles for Data Stream Mining: A High Return on Investment Approach
    Olorunnimbe, M. Kehinde
    Viktor, Herna L.
    Paquet, Eric
    NEW FRONTIERS IN MINING COMPLEX PATTERNS, 2016, 9607 : 61 - 75
  • [38] IoT Big Data Stream Mining
    Morales, Gianmarco De Francisci
    Bifet, Albert
    Khan, Latifur
    Gama, Joao
    Fan, Wei
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 2119 - 2120
  • [39] Mining Infrequent Patterns in Data Stream
    Lakshmi, R.
    Hemalatha, C. Sweetlin
    Vaidehi, V.
    2014 INTERNATIONAL CONFERENCE ON RECENT TRENDS IN INFORMATION TECHNOLOGY (ICRTIT), 2014,
  • [40] An Approach for Mining Imbalanced Datasets Combining Specialized Oversampling and Undersampling Methods
    Jedrzejowicz, Joanna
    Jedrzejowicz, Piotr
    IEEE ACCESS, 2023, 11 : 136782 - 136792