Automated Segmentation of the Vocal Folds in Laryngeal Endoscopy Videos using Deep Convolutional Regression Networks

被引：22

作者：

Hamad, Ali ^{[1
]}

Haney, Megan ^{[2
]}

Lever, Teresa E. ^{[3
]}

Bunyak, Filiz ^{[1
]}

机构：

[1] Univ Missouri, Dept Elect Engn & Comp Sci, Columbia, MO 65211 USA

[2] Univ Missouri, Dept Vet Pathobiol, Columbia, MO USA

[3] Univ Missouri, Sch Med, Dept Otolaryngol Head & Neck Surg, Columbia, MO USA

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019) | 2019年

关键词：

ANATOMY;

D O I：

10.1109/CVPRW.2019.00023

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Swallowing and breathing are vital, life-sustaining upper airway functions that require precise, reciprocal coordination of the vocal folds (VFs). During swallowing, the VFs mustfully close to prevent aspiration offood/liquid into the lungs, whereas during breathing, the VFs must remain open to prevent obstruction of airflow into and out of the lungs. This coordination may become impaired by a variety of neurological conditions and diseases. Clinical evaluation relies on transnasal endoscopy to visualize the VFs within the larynx, and subjective interpretation of VF function by clinicians. However, objective, quantitative, and high-throughput analysis of VF function is important for early diagnosis, monitoring disease progression, treatment monitoring, and treatment discovery. In this paper we propose a fully automated, deep learning based VF segmentation system for the analysis of VF motion behavior captured using flexible endoscopes with low-speed capability. Experimental results on human laryngeal videos showed promising results that were robust to many challenges caused by imaging, anatomical, and behavioral variations. The proposed segmentation and tracking system will be used to compute quantitative outcome measures describing VF motion behavior in order to help clinical practice and scientific discovery.

引用

页码：140 / 148

页数：9

共 21 条

[1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].

Badrinarayanan, Vijay ;

Kendall, Alex ;

Cipolla, Roberto .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495

[2] Etiology of vocal cord paralysis [J].

Chen, Hsin-Chien ;

Jen, Yee-Min ;

Wang, Chih-Hung ;

Lee, Jih-Chin ;

Lin, Yaoh-Shiang .

ORL-JOURNAL FOR OTO-RHINO-LARYNGOLOGY AND ITS RELATED SPECIALTIES, 2007, 69 (03) :167-171

[3] Accuracy of endoscopic and videofluoroscopic evaluations of swallowing for oropharyngeal dysphagia [J].

Fernando Giraldo-Cadavid, Luis ;

Renata Leal-Leano, Lorena ;

Alfredo Leon-Basantes, Guillermo ;

Rodrigo Bastidas, Alirio ;

Garcia, Rafael ;

Ovalle, Sergio ;

Abondano-Garavito, Jorge E. .

LARYNGOSCOPE, 2017, 127 (09) :2002-2010

[4] Fully Automated Glottis Segmentation in Endoscopic Videos Using Local Color and Shape Features of Glottal Regions [J].

Gloger, Oliver ;

Lehnert, Bernhard ;

Schrade, Andreas ;

Voelzke, Henry .

IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2015, 62 (03) :795-806

[5]

Haney MM., 2018, Laryngoscope

[6] An Automatic Detection System of Lung Nodule Based on Multigroup Patch-Based Deep Learning Network [J].

Jiang, Hongyang ;

Ma, He ;

Qian, Wei ;

Gao, Mengdi ;

Li, Yan .

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2018, 22 (04) :1227-1237

[7]

Langmore S. E., 2006, ENDOSCOPIC EVALUATIO

[8] Clinically evaluated procedure for the reconstruction of vocal fold vibrations from endoscopic digital high-speed videos [J].

Lohscheller, Joerg ;

Toy, Hikmet ;

Rosanowski, Frank ;

Eysholdt, Ulrich ;

Doellinger, Michael .

MEDICAL IMAGE ANALYSIS, 2007, 11 (04) :400-413

[9]

MATLAB, 2017, DEEP LEARN TOOLB 201

[10] Anatomy and Physiology of Feeding and Swallowing: Normal and Abnormal [J].

Matsuo, Koichiro ;

Palmer, Jeffrey B. .

PHYSICAL MEDICINE AND REHABILITATION CLINICS OF NORTH AMERICA, 2008, 19 (04) :691-+

← 1 2 3 →