A Speech Obfuscation System to Preserve Data Privacy in 24-Hour Ambulatory Cough Monitoring

被引:2
作者
Taylor, Terence E. [1 ]
Keane, Frank [1 ]
Zigel, Yaniv [1 ,2 ]
机构
[1] Vitalograph Ireland Ltd, Ennis V95 HFT4, Ireland
[2] Ben Gurion Univ Negev, Dept Biomed Engn, IL-8410501 Beer Sheva, Israel
关键词
Feature extraction; Audio recording; Monitoring; Microphones; Pulmonary diseases; Voice activity detection; Training; Cough sound; data privacy; cough monitor; speech obfuscation; audio signal processing; RECOGNITION; INTELLIGIBILITY;
D O I
10.1109/JSTSP.2021.3134560
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Audio analysis of cough sounds can provide objective measures of respiratory clinical features such as cough frequency. Audio-based 24-hour ambulatory cough monitoring systems currently lead the way in providing these objective measures across a range of respiratory diseases. However, to preserve data privacy in cough audio recordings, there is interest to remove any identifiable information contained within patient and third-party speech. In this study we employed real-life patient audio recordings from the VitaloJAK 24-hour ambulatory cough monitoring device. We developed an audio-based speech obfuscation system that specifically detects and obfuscates intelligible speech while retaining cough events. An algorithm was developed to detect vowel sounds since most intelligible information is contained here. The detection algorithm employed audio features including energy, spectral centroid and an adaptive voiced speech feature. The detected vowel sounds were obfuscated by replacing the original audio signal with a synthetic version generated using the original energy and pitch but without formants information. The system was designed using seven hours of audio recordings from seven different patients with respiratory disease. The system was then evaluated on five 24-hour real-life patient audio recordings (120 hours in total) which consisted of 21.6 hours of intelligible speech along with 3,376 coughs. The system obfuscated 99.3% (21.5 hours) of intelligible speech while retaining 99.6% (3,362) of coughs. This speech obfuscation system can preserve data privacy while using 24-hour ambulatory cough monitors. Furthermore, it can retain cough events and other aspects of 24-hour cough recordings which may be of clinical interest.
引用
收藏
页码:188 / 196
页数:9
相关论文
共 24 条
  • [1] EFFECT OF VOICED SPEECH PARAMETERS ON INTELLIGIBILITY OF PB WORDS
    AGRAWAL, A
    LIN, WC
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1975, 57 (01) : 217 - 222
  • [2] Deep Neural Networks for Identifying Cough Sounds
    Amoh, Justice
    Odame, Kofi
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2016, 10 (05) : 1003 - 1011
  • [3] Speaker recognition based on deep learning: An overview
    Bai, Zhongxin
    Zhang, Xiao-Lei
    [J]. NEURAL NETWORKS, 2021, 140 : 65 - 99
  • [4] Biometric Recognition Using Multimodal Physiological Signals
    Bianco, Simone
    Napoletano, Paolo
    [J]. IEEE ACCESS, 2019, 7 : 83581 - 83588
  • [5] ECG analysis: A new approach in human identification
    Biel, L
    Pettersson, O
    Philipson, L
    Wide, P
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2001, 50 (03) : 808 - 812
  • [6] Chen Francine, 2008, P 16 ACM INT C MULT, P733
  • [7] Cole RA, 1996, INT CONF ACOUST SPEE, P853, DOI 10.1109/ICASSP.1996.543255
  • [8] THE FEASIBILITY AND VALIDITY OF OBJECTIVE COUGH MONITORING IN CHILDREN USING AN ADULT COUGH DETECTION SYSTEM
    Elghamoudi, D. Deblej
    Sumner, H.
    McGuiness, K.
    Smith, J.
    Murray, C. S.
    [J]. THORAX, 2015, 70 : A198 - A198
  • [9] Kadambi P, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P2161, DOI 10.1109/ICASSP.2018.8461394
  • [10] Contribution of consonant versus vowel information to sentence intelligibility for young normal-hearing and elderly hearing-impaired listeners
    Kewley-Port, Diane
    Burkle, T. Zachary
    Lee, Jae Hee
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 122 (04) : 2365 - 2375