Automatic tracing of vocal-fold motion from high-speed digital images

被引:79
作者
Yan, Yuling
Chen, Xin
Bless, Diane
机构
[1] Univ Hawaii Manoa, Dept Mech Engn, Honolulu, HI 96822 USA
[2] Univ Wisconsin, Dept Surg, Madison, WI 53792 USA
基金
美国国家科学基金会;
关键词
high-speed digital imaging; Rayleigh histogram thresholding; region-growing; vocal-fold motion;
D O I
10.1109/TBME.2006.873751
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Defining characteristics of the phonatory vocal fold vibration is essential for studies that aim to understand the mechanism of voice production and for clinical diagnosis of voice disorders. The application of high-speed digital imaging techniques to these studies makes it possible to capture sequences of images of the vibrating vocal folds at a frequency that can resolve the actual vocal fold vibrations of a patient. The objective of this study is to introduce a new approach for automatic tracing of vocal fold motion from image sequences acquired from high-speed digital imaging of the larynx. The approach involves three process steps. 1) Global thresholding-the threshold value is selected on the basis of the histogram of the image, which is assumed to follow Rayleigh distribution; 2) applying a morphology operator to remove the isolated object regions; 3) using region-growing to delineate the object, or the vocal fold opening region, and to obtain the area of-the glottis; the segmented object obtained after global threshold and the morphological operation is used as a seed region for the final region-growing operation. The performance, effectiveness and validation of our approach is demonstrated using representative, highspeed imaging recordings of subjects having normal and pathological voices.
引用
收藏
页码:1394 / 1400
页数:7
相关论文
共 21 条
[1]  
ALLIN S, 2004, IEEE INT S BIOM IM W
[2]  
Baken R. J., 2000, Clinical Measurement of Speech and Voice, V2nd ed
[3]   AUTOMATED-ANALYSIS OF ULTRA HIGH-SPEED LARYNGEAL FILMS [J].
BOOTH, JR ;
CHILDERS, DG .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 1979, 26 (04) :185-192
[4]  
COLTON R H, 1989, Journal of Voice, V3, P132, DOI 10.1016/S0892-1997(89)80139-0
[5]   PIXEL CLASSIFICATION BASED ON GRAY LEVEL AND LOCAL BUSYNESS [J].
DONDES, PA ;
ROSENFELD, A .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1982, 4 (01) :79-84
[6]  
Gonzales R. C., 2001, Digital Image Processing, V2nd
[7]   IMAGE SEGMENTATION TECHNIQUES [J].
HARALICK, RM ;
SHAPIRO, LG .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1985, 29 (01) :100-132
[8]   IMAGE-ANALYSIS USING MATHEMATICAL MORPHOLOGY [J].
HARALICK, RM ;
STERNBERG, SR ;
ZHUANG, XH .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1987, 9 (04) :532-550
[9]  
HONDA K, 1987, LARYNGEAL FUNCTION P, P485
[10]  
KIRITANI S, 2000, VOICE QUALITY MEASUR