Effects of Automated Transcription Quality on Non-native Speakers' Comprehension in Real-time Computer-mediated Communication

被引:0
作者
Pan, Ying-xin [1 ]
Jiang, Dan-ning [1 ]
Yao, Lin [2 ]
Picheny, Michael [3 ]
Qin, Yong [1 ]
机构
[1] IBM Res China, Bldg 19,Zhongguancun Software Pk, Beijing, Peoples R China
[2] Chinese Acad Sci, Insitute Psychol, Beijing 100101, Peoples R China
[3] IBM TJ Watson Res Ctr, Yorktown Hts, NY USA
来源
CHI2010: PROCEEDINGS OF THE 28TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, VOLS 1-4 | 2010年
关键词
Real-time Transcription; Automated Speech Recognition; Non-native Speakers; Experiment; CMC;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real-time transcription has been shown to be valuable in facilitating non-native speakers' comprehension in real-time communication. Automated speech recognition (ASR) technology is a critical ingredient for its practical deployment. This paper presents a series of studies investigating how the quality of transcripts generated by an ASR system impacts user comprehension and subjective evaluation. Experiments are first presented comparing performance across three different transcription conditions: no transcript, a perfect transcript, and a transcript with Word Error Rate (WER) =20%. We found 20% WER was the most likely critical point for transcripts to be just acceptable and useful. Then we further examined a lower WER of 10% (a lower bound for today's state-of-the-art systems) employing the same experimental design. The results indicated that at 10% WER comprehension performance was significantly improved compared to the no-transcript condition. Finally, implications for further. system development and design are discussed.
引用
收藏
页码:1725 / +
页数:3
相关论文
共 22 条
  • [1] Advances in speech transcription at IBM under the DARPA EARS program
    Chen, Stanley F.
    Kingsbury, Brian
    Mangu, Lidia
    Povey, Daniel
    Saon, George
    Soltau, Hagen
    Zweig, Geoffrey
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (05): : 1596 - 1608
  • [2] Cui XD, 2008, INT CONF ACOUST SPEE, P5121
  • [3] Maximum likelihood linear transformations for HMM-based speech recognition
    Gales, MJF
    [J]. COMPUTER SPEECH AND LANGUAGE, 1998, 12 (02) : 75 - 98
  • [4] HAMON O, P 12 C EUR CHAPT ACL, P353
  • [5] Jin Y., 2005, Psychological measurement, V2nd
  • [6] KAISER EC, 2007, P ACM CHI 07 SAN JOS, P1009
  • [7] Kheir R, 2007, ITICSE 2007: 12TH ANNUAL CONFERENCE ON INNOVATION & TECHNOLOGY IN COMPUTER SCIENCE EDUCATION, P261
  • [8] LAU R, P ARPA WORKSH HUM LA, P108
  • [9] LEITH D, 2003, YEAR 3 REPORT
  • [10] Munteanu C., 2006, PROC CHI 06, P493