THE FEATURE SELECTION PROBLEM IN COMPUTER-ASSISTED CYTOLOGY

被引:17
作者
Kowal, Marek [1 ]
Skobel, Marcin [1 ]
Nowicki, Norbert [1 ,2 ]
机构
[1] Univ Zielona Gora, Inst Control & Computat Engn, Ul Szafrana 2, PL-65516 Zielona Gora, Poland
[2] Univ Hosp Zielona Gora, Dept Med Phys, Ul Zyty 26, PL-65516 Zielona Gora, Poland
关键词
nuclei segmentation; feature selection; classification; breast cancer; convolutional neural network; SEGMENTATION; IMAGES;
D O I
10.2478/amcs-2018-0058
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Modern cancer diagnostics is based heavily on cytological examinations. Unfortunately, visual inspection of cytological preparations under the microscope is a tedious and time-consuming process. Moreover, intra-and inter-observer variations in cytological diagnosis are substantial. Cytological diagnostics can be facilitated and objectified by using automatic image analysis and machine learning methods. Computerized systems usually preprocess cytological images, segment and detect nuclei, extract and select features, and finally classify the sample. In spite of the fact that a lot of different computerized methods and systems have already been proposed for cytology, they are still not routinely used because there is a need for improvement in their accuracy. This contribution focuses on computerized breast cancer classification. The task at hand is to classify cellular samples coming from fine-needle biopsy as either benign or malignant. For this purpose, we compare 5 methods of nuclei segmentation and detection, 4 methods of feature selection and 4 methods of classification. Nuclei detection and segmentation methods are compared with respect to recall and the F1 score based on the Jaccard index. Feature selection and classification methods are compared with respect to classification accuracy. Nevertheless, the main contribution of our study is to determine which features of nuclei indicate reliably the type of cancer. We also check whether the quality of nuclei segmentation/detection significantly affects the accuracy of cancer classification. It is verified using the test set that the average accuracy of cancer classification is around 76%. Spearman's correlation and chi-square test allow us to determine significantly better features than the feature forward selection method.
引用
收藏
页码:759 / 770
页数:12
相关论文
共 29 条
[1]  
[Anonymous], 2018, FEATURE SELECTION LI
[2]  
[Anonymous], NUCL WATERSHED SEPAR
[3]  
Breiman L., 1984, CLASSIFICATION REGRE, V40, P582, DOI [http://doi.org/10.2307/2530946, DOI 10.1201/9781315139470, 10.2307/2530946]
[4]   Segmentation of Clustered Nuclei With Shape Markers and Marking Function [J].
Cheng, Jierong ;
Rajapakse, Jagath C. .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2009, 56 (03) :741-748
[5]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[6]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[7]   Computer-Aided Breast Cancer Diagnosis Based on the Analysis of Cytological Images of Fine Needle Biopsies [J].
Filipczuk, Pawel ;
Fevens, Thomas ;
Krzyzak, Adam ;
Monczak, Roman .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2013, 32 (12) :2169-2178
[8]   TEXTURAL FEATURES FOR IMAGE CLASSIFICATION [J].
HARALICK, RM ;
SHANMUGAM, K ;
DINSTEIN, I .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1973, SMC3 (06) :610-621
[9]   Methods for nuclei detection, segmentation, and classification in digital histopathology: A review-current status and future potential [J].
Irshad, Humayun ;
Veillard, Antoine ;
Roux, Ludovic ;
Racoceanu, Daniel .
IEEE Reviews in Biomedical Engineering, 2014, 7 :97-114
[10]   Classification of breast cancer malignancy using cytological images of fine needle aspiration biopsies [J].
Jelen, Lukasz ;
Fevens, Thomas ;
Krzyzak, Adam .
INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2008, 18 (01) :75-83