Deep Neural Network for Respiratory Sound Classification in Wearable Devices Enabled by Patient Specific Model Tuning

被引:148
作者
Acharya, Jyotibdha [1 ]
Basu, Arindam [2 ]
机构
[1] Nanyang Technol Univ, HealthTech NTU, Interdisciplinary Grad Program, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
关键词
Data models; Biological system modeling; Deep learning; Quantization (signal); Feature extraction; Training; Diseases; CNN; LSTM; patient specific model; respiratory audio analysis; weight quantization; RECOGNITION;
D O I
10.1109/TBCAS.2020.2981172
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
The primary objective of this paper is to build classification models and strategies to identify breathing sound anomalies (wheeze, crackle) for automated diagnosis of respiratory and pulmonary diseases. In this work we propose a deep CNN-RNN model that classifies respiratory sounds based on Mel-spectrograms. We also implement a patient specific model tuning strategy that first screens respiratory patients and then builds patient specific classification models using limited patient data for reliable anomaly detection. Moreover, we devise a local log quantization strategy for model weights to reduce the memory footprint for deployment in memory constrained systems such as wearable devices. The proposed hybrid CNN-RNN model achieves a score of 66.31% on four-class classification of breathing cycles for ICBHI'17 scientific challenge respiratory sound database. When the model is re-trained with patient specific data, it produces a score of 71.81% for leave-one-out validation. The proposed weight quantization technique achieves approximate to 4x reduction in total memory cost without loss of performance. The main contribution of the paper is as follows: Firstly, the proposed model is able to achieve state of the art score on the ICBHI'17 dataset. Secondly, deep learning models are shown to successfully learn domain specific knowledge when pre-trained with breathing data and produce significantly superior performance compared to generalized models. Finally, local log quantization of trained weights is shown to be able to reduce the memory requirement significantly. This type of patient-specific re-training strategy can be very useful in developing reliable long-term automated patient monitoring systems particularly in wearable healthcare solutions.
引用
收藏
页码:535 / 544
页数:10
相关论文
共 58 条
[1]   A Comparison of Low-Complexity Real-Time Feature Extraction for Neuromorphic Speech Recognition [J].
Acharya, Jyotibdha ;
Patil, Aakash ;
Li, Xiaoya ;
Chen, Yi ;
Liu, Shih-Chii ;
Basu, Arindam .
FRONTIERS IN NEUROSCIENCE, 2018, 12
[2]  
Acharya J, 2017, IEEE ENG MED BIO, P4574, DOI 10.1109/EMBC.2017.8037874
[3]   Low-Power Computer Vision: Status, Challenges, and Opportunities [J].
Alyamkin, Sergei ;
Ardi, Matthew ;
Berg, Alexander C. ;
Brighton, Achille ;
Chen, Bo ;
Chen, Yiran ;
Cheng, Hsin-Pai ;
Fan, Zichen ;
Feng, Chen ;
Fu, Bo ;
Gauen, Kent ;
Goel, Abhinav ;
Goncharenko, Alexander ;
Guo, Xuyang ;
Ha, Soonhoi ;
Howard, Andrew ;
Hu, Xiao ;
Huang, Yuanjun ;
Kim, Jaeyoun ;
Ko, Jong Gook ;
Kondratyev, Alexander ;
Lee, Junhyeok ;
Lee, Seungjae ;
Lee, Suwoong ;
Li, Zichao ;
Liang, Zhiyu ;
Liu, Juzheng ;
Liu, Xin ;
Lu, Yang ;
Lu, Yung-Hsiang ;
Malik, Deeptanshu ;
Nguyen, Hong Hanh ;
Park, Eunbyung ;
Repin, Denis ;
Shen, Liang ;
Sheng, Tao ;
Sun, Fei ;
Svitov, David ;
Thiruvathukal, George K. ;
Zhang, Baiwu ;
Zhang, Jingchi ;
Zhang, Xiaopeng ;
Zhuo, Shaojie ;
Kang, D. .
IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2019, 9 (02) :411-421
[4]   Deep Neural Networks for Identifying Cough Sounds [J].
Amoh, Justice ;
Odame, Kofi .
IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2016, 10 (05) :1003-1011
[5]  
[Anonymous], 2015, P INT C PATT REC APP
[6]  
[Anonymous], 2004, P IEEE ELECT COMPUTE
[7]  
[Anonymous], CORR
[8]  
[Anonymous], FUNDAMENTALS EMBED 3
[9]  
[Anonymous], 2018, CORR
[10]  
[Anonymous], 2015, PROC 3 INT C LEARNIN