System for Automatic Transcription of Sessions of the Polish Senate

被引：7

作者：

Marasek, Krzysztof ^{[1
]}

Korzinek, Danijel ^{[1
]}

Brocki, Lukasz ^{[1
]}

机构：

[1] Polish Japanese Inst Informat Technol, PL-02008 Warsaw, Poland

来源：

ARCHIVES OF ACOUSTICS | 2014年 / 39卷 / 04期

关键词：

large vocabulary speech recognition; language modelling; transcription; transliteration; sub titles;

D O I：

10.2478/aoa-2014-0054

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper describes research behind a Large-Vocabulary Continuous Speech Recognition (LVCSR) system for the transcription of Senate speeches for the Polish language. The system utilizes several components: a phonetic transcription system, language and acoustic model training systems, a Voice Activity Detector (VAD), a LVCSR decoder, and a subtitle generator and presentation system. Some of the modules relied on already available tools and some had to be made from the beginning but the authors ensured that they used the most advanced techniques they had available at the time. Finally, several experiments were performed to compare the performance of both more modern and more conventional technologies.

引用

页码：501 / 509

页数：9

共 40 条

[1]

[Anonymous], 2002, CAMBRIDGE U ENG DEP

[2]

[Anonymous], 1997, Statistical methods for speech recognition

[3]

[Anonymous], 2012, Foundations of Intelligent Systems, DOI DOI 10.1007/978-3-642-34624-8_17

[4]

[Anonymous], 2013, SEQUENCE DISCRIMINAT

[5]

[Anonymous], 2002, INTERSPEECH

[6]

[Anonymous], 2011, WORKSH AUT SPEECH RE

[7]

Brocki L., 2010, THESIS POLISH JAPANE

[8]

Brocki L., 2014, LECT NOTES COMPUTER, V8502, P355

[9]

Brocki L., 2008, TELEPHONY BASED VOIC

[10]

Brocki L., 2010, 12 INT PHD WORKSH OW

← 1 2 3 4 →