An analysis of robust cost functions for CNN in computer-aided diagnosis

被引:14
作者
Barbu, Adrian [1 ]
Lu, Le [2 ]
Roth, Holger [2 ]
Seff, Ari [2 ]
Summers, Ronald M. [2 ]
机构
[1] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA
[2] NIH, Imaging Biomarkers & Comp Aided Diag Lab, Radiol & Imaging Sci, Clin Ctr, Bldg 10, Bethesda, MD 20892 USA
基金
美国国家卫生研究院;
关键词
Lymph node detection; convolutional neural networks; computer aided diagnosis;
D O I
10.1080/21681163.2016.1138240
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Deep convolutional neural networks (CNNs) have proven to be powerful and flexible tools that advance the state-of-the-art in many fields, e.g. speech recognition, computer vision and medical imaging. Usually deep CNN models employ the logistic (soft-max) loss function in the training process of classification tasks. Recent evidence on a computer vision benchmark data-set indicates that the hinge (SVM) loss might give smaller misclassification errors on the test set compared to the logistic loss (i.e. offer better generality). In this paper, we study and compare four different loss functions for deep CNNs in the context of computer-aided abdominal and mediastinal lymph node detection and diagnosis (CAD) using CT images. Besides the logistic loss, we compare three other CNN losses that have not been previously studied for CAD problems. The experiments confirm that the logistic loss performs the worst among the four losses, and an additional 3% increase in detection rate at 3 false positives/volume can be obtained by just replacing it with Lorenz loss. The free-receiver operating characteristic curves of two of the three loss functions consistently outperform the logistic loss in testing.
引用
收藏
页码:253 / 258
页数:6
相关论文
共 16 条
[1]  
Barbu A, 2013, ARXIV13102880
[2]   Training a support vector machine in the primal [J].
Chapelle, Olivier .
NEURAL COMPUTATION, 2007, 19 (05) :1155-1178
[3]  
Cherry KM, 2014, SPIE MED IMAGING
[4]   On the algorithmic implementation of multiclass kernel-based vector machines [J].
Crammer, K ;
Singer, Y .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (02) :265-292
[5]   Caffe: Convolutional Architecture for Fast Feature Embedding [J].
Jia, Yangqing ;
Shelhamer, Evan ;
Donahue, Jeff ;
Karayev, Sergey ;
Long, Jonathan ;
Girshick, Ross ;
Guadarrama, Sergio ;
Darrell, Trevor .
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :675-678
[6]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[7]   Gradient-based learning applied to document recognition [J].
Lecun, Y ;
Bottou, L ;
Bengio, Y ;
Haffner, P .
PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324
[8]  
Lee CY, 2014, AISTATS 2015
[9]   Multicategory support vector machines: Theory and application to the classification of microarray data and satellite radiance data [J].
Lee, YK ;
Lin, Y ;
Wahba, G .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2004, 99 (465) :67-81
[10]  
Liu J, 2014, SPIE MED IMAGING