Ensemble based adaptive over-sampling method for imbalanced data learning in computer aided detection of microaneurysm

被引:45
|
作者
Ren, Fulong [1 ,2 ]
Cao, Peng [1 ,2 ]
Li, Wei [2 ]
Zhao, Dazhe [1 ,2 ]
Zaiane, Osmar [3 ]
机构
[1] Northeastern Univ, Coll Comp Sci & Engn, Shenyang, Peoples R China
[2] Northeastern Univ, Key Lab Med Image Comp, Minist Educ, Shenyang, Peoples R China
[3] Univ Alberta, Comp Sci, Edmonton, AB, Canada
基金
中国国家自然科学基金;
关键词
Microaneurysm detection; Classification; False positive reduction; Imbalanced data learning; Ensemble learning; AUTOMATIC DETECTION; MACHINE;
D O I
10.1016/j.compmedimag.2016.07.011
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Diabetic retinopathy (DR) is a progressive disease, and its detection at an early stage is crucial for saving a patient's vision. An automated screening system for DR can help in reduce the chances of complete blindness due to DR along with lowering the work load on ophthalmologists. Among the earliest signs of DR are microaneurysms (MAs). However, current schemes for MA detection appear to report many false positives because detection algorithms have high sensitivity. Inevitably some non-MAs structures are labeled as MAs in the initial MAs identification step. This is a typical "class imbalance problem". Class imbalanced data has detrimental effects on the performance of conventional classifiers. In this work, we propose an ensemble based adaptive over-sampling algorithm for overcoming the class imbalance problem in the false positive reduction, and we use Boosting, Bagging, Random subspace as the ensemble framework to improve microaneurysm detection. The ensemble based over-sampling methods we proposed combine the strength of adaptive over-sampling and ensemble. The objective of the amalgamation of ensemble and adaptive over-sampling is to reduce the induction biases introduced from imbalanced data and to enhance the generalization classification performance of extreme learning machines (ELM). Experimental results show that our ASOBoost method has higher area under the ROC curve (AUC) and G-mean values than many existing class imbalance learning methods. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:54 / 67
页数:14
相关论文
共 50 条
  • [31] Evolutionary under-sampling based bagging ensemble method for imbalanced data classification
    Bo Sun
    Haiyan Chen
    Jiandong Wang
    Hua Xie
    Frontiers of Computer Science, 2018, 12 : 331 - 350
  • [32] A new sampling method for classifying imbalanced data based on support vector machine ensemble
    Jian, Chuanxia
    Gao, Jian
    Ao, Yinhui
    NEUROCOMPUTING, 2016, 193 : 115 - 122
  • [33] Evolutionary under-sampling based bagging ensemble method for imbalanced data classification
    Sun, Bo
    Chen, Haiyan
    Wang, Jiandong
    Xie, Hua
    FRONTIERS OF COMPUTER SCIENCE, 2018, 12 (02) : 331 - 350
  • [34] PDFOS: PDF estimation based over-sampling for imbalanced two-class problems
    Gao, Ming
    Hong, Xia
    Chen, Sheng
    Harris, Chris J.
    Khalaf, Emad
    NEUROCOMPUTING, 2014, 138 : 248 - 259
  • [35] Feature selection and its combination with data over-sampling for multi-class imbalanced datasets
    Tsai, Chih-Fong
    Chen, Kuan-Chen
    Lin, Wei -Chao
    APPLIED SOFT COMPUTING, 2024, 153
  • [36] The study of under- and over-sampling methods' utility in analysis of highly imbalanced data on osteoporosis
    Bach, M.
    Werner, A.
    Zywiec, J.
    Pluskiewicz, W.
    INFORMATION SCIENCES, 2017, 384 : 174 - 190
  • [37] Ensemble of Classifiers Based on Multiobjective Genetic Sampling for Imbalanced Data
    Fernandes, Everlandio R. Q.
    de Carvalho, Andre C. P. L. F.
    Yao, Xin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (06) : 1104 - 1115
  • [38] A Robust Enhanced Ensemble Learning Method for Breast Cancer Data Diagnosis on Imbalanced Data
    Wang, Zhenzhen
    Xie, Junde
    Zhang, Jia
    IEEE ACCESS, 2024, 12 : 189776 - 189788
  • [39] Adaptive Fusion Based Method for Imbalanced Data Classification
    Liang, Zefeng
    Wang, Huan
    Yang, Kaixiang
    Shi, Yifan
    FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [40] A modified adaptive synthetic sampling method for learning imbalanced datasets
    Hussein, Ahmed Saad
    Li, Tianrui
    Abd Ali, Doaa Mohsin
    Bashir, Kamal
    Yohannese, Chubato Wondaferaw
    DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 76 - 83