Stethoscope-Sensed Speech and Breath-Sounds for Person Identification With Sparse Training Data

Cited by: 17

Authors:

Van-Thuan Tran [1]

Tsai, Wei-Ho [1]

Affiliations:

[1] Natl Taipei Univ Technol, Dept Elect Engn, Taipei 10608, Taiwan

Source:

IEEE SENSORS JOURNAL | 2020 / Vol. 20 / Issue 02

Keywords:

Artificial neural networks; bronchial breath sounds; audio data augmentation; feature engineering; person identification; stethoscope; support vector machines; i-vector; CONVOLUTIONAL NEURAL-NETWORKS; DATA AUGMENTATION; CLASSIFICATION

DOI:

10.1109/JSEN.2019.2945364

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronics and communication technology]

Subject classification code:

0808; 0809

Abstract:

This study develops a novel person identification (PID) technique that exploits a new biometric, the bronchial breath sound, together with the speech signal, both acquired by a stethoscope. In addition to investigating the acoustic characteristics of breath sounds for PID, we evaluate three identification methods: support vector machines (SVM), artificial neural networks (ANN), and the i-vector approach. Because the amount of sound data collected from each person should be as small as possible, this work studies data augmentation (DA) techniques that keep system training from overfitting when the training sound data are insufficient. In addition, we apply feature engineering techniques to find the informative subset of breath sound features that benefits PID. Our experiments were conducted on a dataset of 16 subjects, with equal numbers of male and female participants. In the test phase, both the SVM combined with feature selection and the ANN approach yielded accuracies of 98%.
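As a rough illustration of how audio data augmentation can enlarge a sparse training set, the following minimal sketch produces extra copies of each recording by circular time shifting and additive Gaussian noise. The abstract does not say which DA techniques the authors used, so these two transformations, the function names `augment` and `augment_dataset`, and all parameter values are assumptions made for illustration only.

```python
import numpy as np


def augment(signal, rng, noise_snr_db=20.0, max_shift=0.1):
    """Return a noisy, circularly shifted copy of a 1-D audio signal.

    Hypothetical helper: the transformations and defaults are assumptions,
    not the augmentation methods of the paper.
    """
    signal = np.asarray(signal, dtype=np.float64)
    # Circular time shift by at most max_shift of the signal length.
    limit = int(max_shift * signal.size)
    shift = int(rng.integers(-limit, limit + 1))
    shifted = np.roll(signal, shift)
    # Additive white Gaussian noise at the requested signal-to-noise ratio.
    signal_power = np.mean(shifted ** 2)
    noise_power = signal_power / (10.0 ** (noise_snr_db / 10.0))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=shifted.shape)
    return shifted + noise


def augment_dataset(recordings, labels, copies=3, seed=0):
    """Keep every original recording and add `copies` augmented variants."""
    rng = np.random.default_rng(seed)
    out_x, out_y = [], []
    for x, y in zip(recordings, labels):
        out_x.append(np.asarray(x, dtype=np.float64))
        out_y.append(y)
        for _ in range(copies):
            out_x.append(augment(x, rng))
            out_y.append(y)
    return out_x, out_y
```

With `copies=3`, each subject's recording contributes four training examples instead of one. Any classifier trained on the result should still be evaluated on recordings that were never augmented.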


Pages: 848-859

Page count: 12

References: 47 in total

