Speech Signal and Facial Image Processing for Obstructive Sleep Apnea Assessment

被引:37
作者
Espinoza-Cuadros, Fernando [1 ]
Fernandez-Pozo, Ruben [1 ]
Toledano, Doroteo T. [2 ]
Alcazar-Ramirez, Jose D. [3 ]
Lopez-Gonzalo, Eduardo [1 ]
Hernandez-Gomez, Luis A. [1 ]
机构
[1] Univ Politecn Madrid, GAPS Signal Proc Applicat Grp, E-28040 Madrid, Spain
[2] Univ Autonoma Madrid, ATVS Biometr Recognit Grp, Madrid, Spain
[3] Hosp Quiron, Resp Dept, Sleep Unit, Malaga, Spain
关键词
CEPHALOMETRIC ANALYSIS; APPEARANCE; TUTORIAL; FACE;
D O I
10.1155/2015/489761
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Obstructive sleep apnea (OSA) is a common sleep disorder characterized by recurring breathing pauses during sleep caused by a blockage of the upper airway (UA). OSA is generally diagnosed through a costly procedure requiring an overnight stay of the patient at the hospital. This has led to proposing less costly procedures based on the analysis of patients' facial images and voice recordings to help in OSA detection and severity assessment. In this paper we investigate the use of both image and speech processing to estimate the apnea-hypopnea index, AHI (which describes the severity of the condition), over a population of 285 male Spanish subjects suspected to suffer from OSA and referred to a Sleep Disorders Unit. Photographs and voice recordings were collected in a supervised but not highly controlled way trying to test a scenario close to an OSA assessment application running on a mobile device (i.e., smartphones or tablets). Spectral information in speech utterances is modeled by a state-of-the-art low-dimensional acoustic representation, called i-vector. A set of local craniofacial features related to OSA are extracted from images after detecting facial landmarks using Active Appearance Models (AAMs). Support vector regression (SVR) is applied on facial features and i-vectors to estimate the AHI.
引用
收藏
页数:13
相关论文
共 53 条
[1]  
[Anonymous], 1995, Respir Care, V40, P1336
[2]   Speaker age estimation using i-vectors [J].
Bahari, Mohamad Hasan ;
McLaren, Mitchell ;
Hugo Van Hamme ;
van Leeuwen, David A. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 34 :99-108
[3]  
Bahari MH, 2013, INT CONF ACOUST SPEE, P7344, DOI 10.1109/ICASSP.2013.6639089
[4]   A REVIEW OF FACE RECOGNITION METHODS [J].
Beham, M. Parisa ;
Roomi, S. Mohamed Mansoor .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2013, 27 (04)
[5]  
Benavides A. M., 2015, J VOICE
[6]   A tutorial on text-independent speaker verification [J].
Bimbot, F ;
Bonastre, JF ;
Fredouille, C ;
Gravier, G ;
Magrin-Chagnolleau, I ;
Meignier, S ;
Merlin, T ;
Ortega-García, J ;
Petrovska-Delacrétaz, D ;
Reynolds, DA .
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (04) :430-451
[7]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]   The Face of Sleepiness: Improvement in Appearance after Treatment of Sleep Apnea [J].
Chervin, Ronald D. ;
Ruzicka, Deborah L. ;
Vahabzadeh, Arshia ;
Burns, Margaret C. ;
Burns, Joseph W. ;
Buchman, Steven R. .
JOURNAL OF CLINICAL SLEEP MEDICINE, 2013, 9 (09) :845-852
[9]   Identification of craniofacial risk factors for obstructive sleep apnoea using three-dimensional MRI [J].
Chi, L. ;
Comyn, F-L. ;
Mitra, N. ;
Reilly, M. P. ;
Wan, F. ;
Maislin, G. ;
Chmiewski, L. ;
Thorne-FitzGerald, M. D. ;
Victor, U. N. ;
Pack, A. I. ;
Schwab, R. J. .
EUROPEAN RESPIRATORY JOURNAL, 2011, 38 (02) :348-358
[10]  
Cootes T., AAM TOOLS SOFTWARE