Applying Map-Reduce to Imbalanced Data Classification

被引:0
作者
Jedrzejowicz, Joanna [1 ]
Neumann, Jakub [1 ]
Synowczyk, Piotr [1 ]
Zakrzewska, Magdalena [1 ]
机构
[1] Univ Gdansk, Fac Math Phys & Informat, Inst Informat, PL-80308 Gdansk, Poland
来源
2017 IEEE INTERNATIONAL CONFERENCE ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA) | 2017年
关键词
imbalanced data; classification; parallelization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The aim of the paper was to apply MapReduce paradigm to the algorithm SplitBal which classifies imbalanced datasets and perform the evaluation of results for different parameters. Parallelization of time consuming operations allows to classify larger datasets, in perspective Big Data.
引用
收藏
页码:29 / 33
页数:5
相关论文
共 8 条
  • [1] KEEL: a software tool to assess evolutionary algorithms for data mining problems
    Alcala-Fdez, J.
    Sanchez, L.
    Garcia, S.
    del Jesus, M. J.
    Ventura, S.
    Garrell, J. M.
    Otero, J.
    Romero, C.
    Bacardit, J.
    Rivas, V. M.
    Fernandez, J. C.
    Herrera, F.
    [J]. SOFT COMPUTING, 2009, 13 (03) : 307 - 318
  • [2] Learning from class-imbalanced data: Review of methods and applications
    Guo Haixiang
    Li Yijing
    Shang, Jennifer
    Gu Mingyun
    Huang Yuanyue
    Bing, Gong
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 73 : 220 - 239
  • [3] Handling imbalanced classification problem: A case study on social media datasets
    Nguyen, Tuong Tri
    Hwang, Dosam
    Jung, Jason J.
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2017, 32 (02) : 1437 - 1448
  • [4] A novel ensemble method for classifying imbalanced data
    Sun, Zhongbin
    Song, Qinbao
    Zhu, Xiaoyan
    Sun, Heli
    Xu, Baowen
    Zhou, Yuming
    [J]. PATTERN RECOGNITION, 2015, 48 (05) : 1623 - 1637
  • [5] Synowczyk P., 2016, THESIS
  • [6] A Parallel Implementation of Relief Algorithm Using Mapreduce Paradigm
    Yazidi, Jamila
    Bouaguel, Waad
    Essoussi, Nadia
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2016, PT II, 2016, 9876 : 418 - 425
  • [7] KRNN: k Rare-class Nearest Neighbour classification
    Zhang, Xiuzhen
    Li, Yuxuan
    Kotagiri, Ramamohanarao
    Wu, Lifang
    Tari, Zahir
    Cheriet, Mohamed
    [J]. PATTERN RECOGNITION, 2017, 62 : 33 - 44
  • [8] A dissimilarity-based imbalance data classification algorithm
    Zhang, Xueying
    Song, Qinbao
    Wang, Guangtao
    Zhang, Kaiyuan
    He, Liang
    Jia, Xiaolin
    [J]. APPLIED INTELLIGENCE, 2015, 42 (03) : 544 - 565