The contribution of mutual information in the intonational phrase prediction in Chinese text

Cited by: 0
Authors
Hu, GP [1]
Chen, BF [1]
Fan, M [1]
Wang, RH [1]
Affiliation
[1] Univ Sci & Technol China, Hefei 230027, Peoples R China
Source
2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS | 2003
Keywords
intonational phrase; mutual information; prosody boundary; part-of-speech
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The contribution of Mutual Information (MI) to Intonational Phrase (IP) prediction is analyzed and verified in this paper. The basic idea of employing MI in IP prediction is that people are likely to pause between less correlated words, where the MI value is low. The paper first presents a decision-tree-based predictor that adopts part-of-speech (POS) as its main feature, which serves as the baseline, and then analyzes the correlation between MI and IP. An approach that predicts the IP boundary based only on MI is demonstrated, and three methods of combining MI and POS in the predictor are also presented. The MI-based approach achieves considerable performance (F-score: 64.2%), and combining MI and POS yields a 3.4% improvement over the baseline in our experiment. All our work indicates that MI is an effective feature for predicting prosodic phrase boundaries in Chinese text, and that combining MI and POS in the predictor is valuable.
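The core idea in the abstract, that a pause is likely between adjacent words whose MI is low, can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not state how MI was estimated or how boundaries were chosen, so the sketch assumes pointwise MI computed from raw unigram and bigram counts and a hand-picked threshold. The function names (`train_counts`, `pmi`, `predict_boundaries`) and the `threshold` parameter are hypothetical.

```python
import math
from collections import Counter


def train_counts(sentences):
    """Count unigrams and adjacent-word bigrams in pre-segmented sentences."""
    unigrams = Counter()
    bigrams = Counter()
    for words in sentences:
        unigrams.update(words)
        bigrams.update(zip(words, words[1:]))
    return unigrams, bigrams


def pmi(w1, w2, unigrams, bigrams):
    """Pointwise mutual information (base 2) of the adjacent pair (w1, w2).

    Assumption: probabilities are plain relative frequencies with no
    smoothing, so an unseen pair gets -inf (treated as least correlated).
    """
    n_uni = sum(unigrams.values())
    n_bi = sum(bigrams.values())
    joint = bigrams[(w1, w2)]
    if joint == 0 or n_bi == 0:
        return float("-inf")
    p_xy = joint / n_bi
    p_x = unigrams[w1] / n_uni
    p_y = unigrams[w2] / n_uni
    return math.log2(p_xy / (p_x * p_y))


def predict_boundaries(words, unigrams, bigrams, threshold):
    """Return positions i such that a boundary is predicted before words[i].

    A boundary is predicted wherever the MI of the adjacent pair falls
    below the (hypothetical, hand-tuned) threshold.
    """
    return [
        i + 1
        for i, (a, b) in enumerate(zip(words, words[1:]))
        if pmi(a, b, unigrams, bigrams) < threshold
    ]
```

With counts gathered from a word-segmented corpus, `predict_boundaries(words, unigrams, bigrams, threshold)` marks the low-MI gaps in a sentence. A combined predictor of the kind the abstract describes could instead pass the MI value to a decision tree as one feature alongside POS; that variant is not sketched here because the record gives no detail on the three combination methods.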
Pages: 407-412
Page count: 6