Classifier ensemble generation and selection with multiple feature representations for classification applications in computer-aided detection and diagnosis on mammography

被引:41
作者
Choi, Jae Young [1 ]
Kim, Dae Hoe [1 ]
Plataniotis, Konstantinos N. [2 ]
Ro, Yong Man [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Elect Engn, Image & Video Syst Lab, Daejeon 305701, South Korea
[2] Univ Toronto, Dept Elect & Comp Engn, Knowledge Media Design Inst, Toronto, ON M5S 3GA, Canada
基金
新加坡国家研究基金会;
关键词
Ensemble learning; Ensemble selection; Classification; Mammographic masses; Multiple feature representations; Computer-aided Detection (CADe); Computer-aided Diagnosis (CADx); FALSE-POSITIVE REDUCTION; NORMAL BREAST-TISSUE; FINITE-SAMPLE SIZE; DIGITAL MAMMOGRAMS; TEXTURE ANALYSIS; DECISION FUSION; MASSES; SEGMENTATION; ALGORITHM; ADABOOST;
D O I
10.1016/j.eswa.2015.10.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel ensemble classifier framework for improved classification of mammographic lesions in Computer-aided Detection (CADe) and Diagnosis (CADx) systems. Compared to previously developed classification techniques in mammography, the main novelty of proposed method is twofold: (1) the "combined use" of different feature representations (of the same instance) and data resampling to generate more diverse and accurate base classifiers as ensemble members and (2) the incorporation of a novel "ensemble selection" mechanism to further maximize the overall classification performance. In addition, as opposed to conventional ensemble learning, our proposed ensemble framework has the advantage of working well with both weak and strong classifiers, extensively used in mammography CADe and/or CADx systems. Extensive experiments have been performed using benchmark mammogram dataset to test the proposed method on two classification applications: (1) false-positive (FP) reduction using classification between masses and normal tissues, and (2) diagnosis using classification between malignant and benign masses. Results showed promising results that the proposed method (area under the ROC curve (AUC) of 0.932 and 0.878, each obtained for the aforementioned two classification applications, respectively) impressively outperforms (by an order of magnitude) the most commonly used single neural network (AUC = 0.819 and AUC = 0.754) and support vector machine (AUC = 0.849 and AUC = 0.773) based classification approaches. In addition, the feasibility of our method has been successfully demonstrated by comparing other state-of-the-art ensemble classification techniques such as Gentle AdaBoost and Random Forest learning algorithms. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:106 / 121
页数:16
相关论文
共 69 条
[21]   A concentric morphology model for the detection of masses in mammography [J].
Eltonsy, Nevine H. ;
Tourassi, Georgia D. ;
Elmaghraby, Adel S. .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2007, 26 (06) :880-889
[22]   A decision-theoretic generalization of on-line learning and an application to boosting [J].
Freund, Y ;
Schapire, RE .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) :119-139
[23]   Additive logistic regression: A statistical view of boosting - Rejoinder [J].
Friedman, J ;
Hastie, T ;
Tibshirani, R .
ANNALS OF STATISTICS, 2000, 28 (02) :400-407
[24]  
Galtier V, 2007, ICSPC: 2007 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1-3, PROCEEDINGS, P165
[25]   Multi-scaled morphological features for the characterization of mammographic masses using statistical classification schemes [J].
Georgiou, Harris ;
Mavroforakis, Michael ;
Dimitropoulos, Nikos ;
Cavouras, Dionisis ;
Theodoridis, Sergios .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2007, 41 (01) :39-55
[26]  
Heath M, 2001, IWDM 2000: 5TH INTERNATIONAL WORKSHOP ON DIGITAL MAMMOGRAPHY, P212
[27]   Segmentation of Regions of Interest in Mammograms in a Topographic Approach [J].
Hong, Byung-Woo ;
Sohn, Bong-Soo .
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2010, 14 (01) :129-139
[28]   Score normalization in multimodal biometric systems [J].
Jain, A ;
Nandakumar, K ;
Ross, A .
PATTERN RECOGNITION, 2005, 38 (12) :2270-2285
[29]   Optimized approach to decision fusion of heterogeneous data for breast cancer diagnosis [J].
Jesneck, Jonathan L. ;
Nolte, Loren W. ;
Baker, Jay A. ;
Floyd, Carey E. ;
Lo, Joseph Y. .
MEDICAL PHYSICS, 2006, 33 (08) :2945-2954
[30]   Computer-Aided Diagnosis of Mammographic Masses Using Scalable Image Retrieval [J].
Jiang, Menglin ;
Zhang, Shaoting ;
Li, Hongsheng ;
Metaxas, Dimitris N. .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2015, 62 (02) :783-792