IMPROVING MULTIPLE-CROWD-SOURCED TRANSCRIPTIONS USING A SPEECH RECOGNISER

被引:0
|
作者
van Dalen, R. C. [1 ]
Knill, K. M. [1 ]
Tsiakoulis, P. [1 ]
Gales, M. J. F. [1 ]
机构
[1] Univ Cambridge, Dept Engn, Trumpington St, Cambridge CB2 1PZ, England
来源
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) | 2015年
关键词
Automatic speech recognition; crowd-sourcing; transcription combination;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper introduces a method to produce high-quality transcriptions of speech data from only two crowd-sourced transcriptions. These transcriptions, produced cheaply by people on the Internet, for example through Amazon Mechanical Turk, are often of low quality. Often, multiple crowd-sourced transcriptions are combined to form one transcription of higher quality. However, the state of the art is to use essentially a form of majority voting, which requires at least three transcriptions for each utterance. This paper shows how to refine this approach to work with only two transcriptions. It then introduces a method that uses a speech recogniser (bootstrapped on a simple combination scheme) to combine transcriptions. When only two crowd-sourced transcriptions are available, on a noisy data set this improves the word error rate to gold-standard transcriptions by 21% relative.
引用
收藏
页码:4709 / 4713
页数:5
相关论文
共 39 条
  • [1] ANALYZING QUALITY OF CROWD-SOURCED SPEECH TRANSCRIPTIONS OF NOISY AUDIO FOR ACOUSTIC MODEL ADAPTATION
    Audhkhasi, Kartik
    Georgiou, Panayiotis G.
    Narayanan, Shrikanth S.
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4137 - 4140
  • [2] Reliability-Weighted Acoustic Model Adaptation Using Crowd Sourced Transcriptions
    Audhkhasi, Kartik
    Georgiou, Panayiotis G.
    Narayanan, Shrikanth S.
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 3052 - 3055
  • [3] Echo: A Crowd-sourced Romanian Speech Dataset
    Ungureanu, Remus-Dan
    Dascalu, Mihai
    INTERACTION DESIGN AND ARCHITECTURES, 2024, (62) : 141 - 152
  • [4] Using Selected Peers to Improve the Accuracy of Crowd Sourced Forecasts
    Feng, Ye
    Budescu, David V.
    DECISION-WASHINGTON, 2024, 11 (01): : 86 - 107
  • [5] Crowd-Sourced Annotation of ECG Signals Using Contextual Information
    Zhu, Tingting
    Johnson, Alistair E. W.
    Behar, Joachim
    Clifford, Gari D.
    ANNALS OF BIOMEDICAL ENGINEERING, 2014, 42 (04) : 871 - 884
  • [6] Crowd-Sourced Annotation of ECG Signals Using Contextual Information
    Tingting Zhu
    Alistair E. W. Johnson
    Joachim Behar
    Gari D. Clifford
    Annals of Biomedical Engineering, 2014, 42 : 871 - 884
  • [7] Assessing Workers Reliability in Crowd-sourced Computing using Bayesian Rules
    Hussin, Masnida
    Rozlan, Nur Aliya
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON APPLIED SCIENCE AND TECHNOLOGY (ICAST'18), 2018, 2016
  • [8] Fuzzy Integrals of Crowd-Sourced Intervals Using A Measure of Generalized Accord
    Havens, Timothy C.
    Anderson, Derek T.
    Wagner, Christian
    Deilamsalehy, Hanieh
    Wonnacott, Dereck
    2013 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ - IEEE 2013), 2013,
  • [9] Decision Support System for Agriculture Industry using Crowd Sourced Predictive Analytics
    Remya, S.
    Sasikala, R.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (11) : 310 - 318
  • [10] Worker Selection in Crowd-sourced Platforms using Non-dominated Sorting
    Mishra, Sumit
    Yadav, Akash
    Sairam, Ashok Singh
    PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 41 - 45