Classifier ensemble generation and selection with multiple feature representations for classification applications in computer-aided detection and diagnosis on mammography

被引:41
作者
Choi, Jae Young [1 ]
Kim, Dae Hoe [1 ]
Plataniotis, Konstantinos N. [2 ]
Ro, Yong Man [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Elect Engn, Image & Video Syst Lab, Daejeon 305701, South Korea
[2] Univ Toronto, Dept Elect & Comp Engn, Knowledge Media Design Inst, Toronto, ON M5S 3GA, Canada
基金
新加坡国家研究基金会;
关键词
Ensemble learning; Ensemble selection; Classification; Mammographic masses; Multiple feature representations; Computer-aided Detection (CADe); Computer-aided Diagnosis (CADx); FALSE-POSITIVE REDUCTION; NORMAL BREAST-TISSUE; FINITE-SAMPLE SIZE; DIGITAL MAMMOGRAMS; TEXTURE ANALYSIS; DECISION FUSION; MASSES; SEGMENTATION; ALGORITHM; ADABOOST;
D O I
10.1016/j.eswa.2015.10.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel ensemble classifier framework for improved classification of mammographic lesions in Computer-aided Detection (CADe) and Diagnosis (CADx) systems. Compared to previously developed classification techniques in mammography, the main novelty of proposed method is twofold: (1) the "combined use" of different feature representations (of the same instance) and data resampling to generate more diverse and accurate base classifiers as ensemble members and (2) the incorporation of a novel "ensemble selection" mechanism to further maximize the overall classification performance. In addition, as opposed to conventional ensemble learning, our proposed ensemble framework has the advantage of working well with both weak and strong classifiers, extensively used in mammography CADe and/or CADx systems. Extensive experiments have been performed using benchmark mammogram dataset to test the proposed method on two classification applications: (1) false-positive (FP) reduction using classification between masses and normal tissues, and (2) diagnosis using classification between malignant and benign masses. Results showed promising results that the proposed method (area under the ROC curve (AUC) of 0.932 and 0.878, each obtained for the aforementioned two classification applications, respectively) impressively outperforms (by an order of magnitude) the most commonly used single neural network (AUC = 0.819 and AUC = 0.754) and support vector machine (AUC = 0.849 and AUC = 0.773) based classification approaches. In addition, the feasibility of our method has been successfully demonstrated by comparing other state-of-the-art ensemble classification techniques such as Gentle AdaBoost and Random Forest learning algorithms. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:106 / 121
页数:16
相关论文
共 69 条
[1]  
Alimoglu F., 2001, Turkish Journal Electrical Engineering and Computer Sciences, Elektrik, V9, P1
[2]  
[Anonymous], 2002, Statistical Methods in Diagnostic Medicine
[3]  
[Anonymous], P IEEE INT C IM PROC
[4]   Gene expression profile classification: A review [J].
Asyali, Musa H. ;
Colak, Dilek ;
Demirkaya, Omer ;
Inan, Mehmet S. .
CURRENT BIOINFORMATICS, 2006, 1 (01) :55-73
[5]   Tolerance to missing data using a likelihood ratio based classifier for computer-aided classification of breast cancer [J].
Bilska-Wolak, AO ;
Floyd, CE .
PHYSICS IN MEDICINE AND BIOLOGY, 2004, 49 (18) :4219-4237
[6]   A mass classification using spatial diversity approaches in mammography images for false positive reduction [J].
Braz Junior, Geraldo ;
da Rocha, Simara Vieira ;
Gattass, Marcelo ;
Silva, Aristofanes Correa ;
de Paiva, Anselmo Cardoso .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (18) :7534-7543
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]   Learning from unbalanced data: A cascade-based approach for detecting clustered microcalcifications [J].
Bria, A. ;
Karssemeijer, N. ;
Tortorella, F. .
MEDICAL IMAGE ANALYSIS, 2014, 18 (02) :241-252
[9]   Incorporation of an iterative, linear segmentation routine into a mammographic mass CAD system [J].
Catarious, DM ;
Baydush, AH ;
Floyd, CE .
MEDICAL PHYSICS, 2004, 31 (06) :1512-1520
[10]   Classifier design for computer-aided diagnosis: Effects of finite sample size on the mean performance of classical and neural network classifiers [J].
Chan, HP ;
Sahiner, B ;
Wagner, RF ;
Petrick, N .
MEDICAL PHYSICS, 1999, 26 (12) :2654-2668