Effective pre-processing of long term noisy audio recordings: An aid to clinical monitoring

被引:15
作者
Orlandi, S. [1 ,2 ,3 ]
Dejonckere, P. H. [3 ,4 ]
Schoentgen, J. [5 ]
Lebacq, J. [6 ]
Rruqja, N. [1 ]
Manfredi, C. [1 ]
机构
[1] Univ Florence, Dept Informat Engn, Via S Marta 3, I-50139 Florence, Italy
[2] Univ Bologna, Dept Elect Energy & Informat Engn, Bologna, Italy
[3] Catholic Univ Louvain, B-3000 Louvain, Belgium
[4] Fed Inst Occupat Dis, Brussels, Belgium
[5] Univ Bruxelles, Fac Sci Appl, Labs Images Signals & Telecommun Devices, Brussels, Belgium
[6] Catholic Univ Louvain, Inst Neurosci CEMO, B-1200 Brussels, Belgium
关键词
Voice analysis; Biomedical signal processing; Long duration recordings; Otsu method; Voiced/unvoiced selection; Synthetic signals; Newborn infant cry; FUNDAMENTAL-FREQUENCY; JITTER MEASURES; VOICE; PERTURBATION; VALIDITY; PRETERM; CRIES;
D O I
10.1016/j.bspc.2013.07.009
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Nowadays, great attention is devoted to minimizing the discomfort caused by connection of patients to sensors for long-term monitoring of physiological parameters. Hence, the need for contact-less monitoring systems is increasingly recognized in clinical investigation. To this aim, audio signals recorded by ambient microphones are an appealing and increasing field of research: in the biomedical field, application of contact-less audio recording of long duration may concern obstructive apnoea syndrome, preterm newborns in Intensive Care Units, daily monitoring in occupational dysphonia, speech therapy, Parkinson and Alzheimer disease, monitoring of psychiatric and autistic subjects, etc. However, a significant amount of ambient noise is inevitably included in the records. Especially in the case of recordings that take a longtime, manual extraction of clinically useful information from a whole record is a time-consuming operator-dependent task, the length of a whole recording (even several hours) being prohibitive both for perceptual analysis made by listening to it and for visual inspection of signal patterns. Moreover, objective measures of signal characteristics may serve clinicians as a common ground for diagnosis. Hence, automatic methods are needed to speed up and objectify the analysis task. The present work describes a new, automatic, fast and reliable method for extracting "voiced candidates" from audio recordings of long duration for both clinical and home applications. To demonstrate its effectiveness, the method is compared to existing software tools commonly used in biomedical applications using synthetic signals. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:799 / 810
页数:12
相关论文
共 37 条
[1]   Automatic and Unsupervised Snore Sound Extraction From Respiratory Sound Signals [J].
Azarbarzin, Ali ;
Moussavi, Zahra M. K. .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2011, 58 (05) :1156-1162
[2]   Longitudinal study of the fundamental frequency of hunger cries along the first 6 months of healthy babies [J].
Baeck, Heidi Elisabeth ;
de Souza, Marcio Nogueira .
JOURNAL OF VOICE, 2007, 21 (05) :551-559
[3]   The professional voice [J].
Benninger, M. S. .
JOURNAL OF LARYNGOLOGY AND OTOLOGY, 2011, 125 (02) :111-116
[4]   Objective measurement of vocal fatigue in classical singers: A vocal dosimetry pilot study [J].
Carroll, Thomas ;
Nix, John ;
Hunter, Eric ;
Emerich, Kate ;
Titze, Ingo ;
Abaza, Mona .
OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2006, 135 (04) :595-602
[5]   To what degree of voice perturbation are jitter measurements valid? A novel approach with synthesized vowels and visuo-perceptual pattern recognition [J].
Dejonckere, P. H. ;
Giordano, A. ;
Schoentgen, J. ;
Fraj, S. ;
Bocchi, L. ;
Manfredi, C. .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2012, 7 (01) :37-42
[6]  
Dejonckere PH, 2001, OCCUPATIONAL VOICE C
[7]   Validity of jitter measures in non-quasi-periodic voices. Part I: Perceptual and computer performances in cycle pattern recognition [J].
Dejonckere, Philippe ;
Schoentgen, Jean ;
Giordano, Andrea ;
Fraj, Samia ;
Bocchi, Leonardo ;
Manfredi, Claudia .
LOGOPEDICS PHONIATRICS VOCOLOGY, 2011, 36 (02) :70-77
[8]  
Deller J. R., 1993, DISCRETE TIME PROCES, P724
[9]   Automatic detection, segmentation and assessment of snoring from ambient acoustic data [J].
Duckitt, W. D. ;
Tuomi, S. K. ;
Niesler, T. R. .
PHYSIOLOGICAL MEASUREMENT, 2006, 27 (10) :1047-1056
[10]   Development and perceptual assessment of a synthesizer of disordered voices [J].
Fraj, Samia ;
Schoentgen, Jean ;
Grenez, Francis .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (04) :2603-2615