Lung sounds classification using convolutional neural networks

被引:122
作者
Bardou, Dalal [1 ]
Zhang, Kun [1 ]
Ahmad, Sayed Mohammad [2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Jiangsu, Peoples R China
[2] Lareb Technol, Delhi, India
关键词
Convolutional neural network; Lung sounds classification; Handcrafted features extraction; Deep learning; Models ensembling; Support vector machines; REAL-TIME ANALYSIS; DATA AUGMENTATION; CRACKLE; FREQUENCY; MODEL;
D O I
10.1016/j.artmed.2018.04.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lung sounds convey relevant information related to pulmonary disorders, and to evaluate patients with pulmonary conditions, the physician or the doctor uses the traditional auscultation technique. However, this technique suffers from limitations. For example, if the physician is not well trained, this may lead to a wrong diagnosis. Moreover, lung sounds are non-stationary, complicating the tasks of analysis, recognition, and distinction. This is why developing automatic recognition systems can help to deal with these limitations. In this paper, we compare three machine learning approaches for lung sounds classification. The first two approaches are based on the extraction of a set of handcrafted features trained by three different classifiers (support vector machines, k-nearest neighbor, and Gaussian mixture models) while the third approach is based on the design of convolutional neural networks (CNN). In the first approach, we extracted the 12 MFCC coefficients from the audio files then calculated six MFCCs statistics. We also experimented normalization using zero mean and unity variance to enhance accuracy. In the second approach, the local binary pattern (LBP) features are extracted from the visual representation of the audio files (spectrograms). The features are normalized using whitening. The dataset used in this work consists of seven classes (normal, coarse crackle, fine crackle, monophonic wheeze, polyphonic wheeze, squawk, and stridor). We have also experimentally tested dataset augmentation techniques on the spectrograms to enhance the ultimate accuracy of the CNN. The results show that CNN outperformed the handcrafted feature based classifiers. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:58 / 69
页数:12
相关论文
共 63 条
[21]  
[Anonymous], HLTH CARE EXCHANGES
[22]  
[Anonymous], P 22 ACM INT C MULT
[23]  
[Anonymous], ADV MAT RES
[24]  
[Anonymous], IND INF SYST ICIIS 2
[25]  
[Anonymous], P ICML WORKSH DEEP L
[26]   Pattern recognition methods applied to respiratory sounds classification into normal and wheeze classes [J].
Bahoura, Mohammed .
COMPUTERS IN BIOLOGY AND MEDICINE, 2009, 39 (09) :824-843
[27]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[28]   Performance evaluation and enhancement of lung sound recognition system in two real noisy environments [J].
Chang, Gwo-Ching ;
Lai, Yung-Fa .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2010, 97 (02) :141-150
[29]   Support vector machines for histogram-based image classification [J].
Chapelle, O ;
Haffner, P ;
Vapnik, VN .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05) :1055-1064
[30]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411