A conjoint analysis framework for evaluating user preferences in machine translation

Cited by: 7
Authors
Kirchhoff, Katrin [1]
Capurro, Daniel [2]
Turner, Anne M. [3, 4]
Affiliations
[1] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA
[2] Pontificia Univ Catolica Chile, Dept Internal Med, Santiago, Chile
[3] Univ Washington, Dept Hlth Serv, Seattle, WA 98195 USA
[4] Univ Washington, Dept Biomed Informat & Med Educ, Seattle, WA 98195 USA
Keywords
Machine translation; Evaluation; User modeling; Preference elicitation
DOI
10.1007/s10590-013-9140-x
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Despite much research on machine translation (MT) evaluation, there is surprisingly little work that directly measures users' intuitive or emotional preferences regarding different types of MT errors. However, the elicitation and modeling of user preferences is an important prerequisite for research on user adaptation and customization of MT engines. In this paper we explore the use of conjoint analysis as a formal quantitative framework to assess users' relative preferences for different types of translation errors. We apply our approach to the analysis of MT output from translating public health documents from English into Spanish. Our results indicate that word order errors are clearly the most dispreferred error type, followed by word sense, morphological, and function word errors. The conjoint analysis-based model is able to predict user preferences more accurately than a baseline model that chooses the translation with the fewest errors overall. Additionally, we analyze the effect of using a crowd-sourced respondent population versus a sample of domain experts and observe that main preference effects are remarkably stable across the two samples.
Pages: 1-17
Page count: 17