Contextual maximum entropy model for edit disfluency detection of spontaneous speech

被引:0
作者
Yeh, Jui-Feng [1 ]
Wu, Chung-Hsien [2 ]
Wu, Wei-Yen [2 ]
机构
[1] Far East Univ, Dept Comp Sci & Informat Engn, No. 49,Chung Hua Rd, Hsin Shih 744, Taiwan
[2] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, No.1, Ta-Hsueh Road, Tainan 701, Taiwan
来源
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS | 2006年 / 4274卷
关键词
disfluency; maximum entropy; contextual feature; spontaneous speech;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study describes an approach to edit disfluency detection based on maximum entropy (ME) using contextual features for rich transcription of spontaneous speech. The contextual features contain word-level, chunk-level and sentence-level features for edit distluency modeling. Due to the problem of data sparsity, word-level features are determined according to the taxonomy of the primary features of the words defined in Hownet. Chunk-level features are extracted based on mutual information of the words. Sentence-level feature are identified according to verbs and their corresponding features. The Improved Iterative Scaling (IIS) algorithm is employed to estimate the optimal weights in the maximum entropy models. Performance on edit disfluency detection and interruption point detection are conducted for evaluation. Experimental results show that the proposed method outperforms the DF-gram approach.
引用
收藏
页码:578 / +
页数:4
相关论文
共 33 条
[1]  
[Anonymous], 2005, INTERSPEECH
[2]  
[Anonymous], 2004, P HLT NAACL 2004 SHO
[3]  
Bangalore S, 2004, HLT-NAACL 2004: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, P33
[4]  
Bear J., 1992, P ACL, P56
[5]  
Berger AL, 1996, COMPUT LINGUIST, V22, P39
[6]  
Charniak E, 2001, 2ND MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, P118
[7]  
CKIP, 1993, 9305 CKIP
[8]  
COQUOZ S, 2004, BRAODCAST NEWS SEGME
[9]   Analysis and recognition of spontaneous speech using Corpus of Spontaneous Japanese [J].
Furui, S ;
Nakamura, M ;
Ichiba, T ;
Iwano, K .
SPEECH COMMUNICATION, 2005, 47 (1-2) :208-219
[10]  
Gregory ML, 2004, HLT-NAACL 2004: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, P81