Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments - Newest Part of the CENSREC Series -

Cited by: 0

Authors:

Nishiura, Takanobu

Nakayama, Masato

Denda, Yuki

Kitaoka, Norihide

Yamamoto, Kazumasa

Yamada, Takeshi

Tsuge, Satoru

Miyajima, Chiyomi

Fujimoto, Masakiyo

Takiguchi, Tetsuya

Tamura, Satoshi

Kuroiwa, Shingo

Takeda, Kazuya

Nakamura, Satoshi

Source:

Sixth International Conference on Language Resources and Evaluation, LREC 2008 | 2008

DOI:

Not available

Chinese Library Classification number:

H0 [Linguistics];

Subject classification codes:

030303 ; 0501 ; 050102 ;

Abstract:

Recently, speech recognition performance has been drastically improved by statistical methods and huge speech databases. Attention has now shifted to improving performance in realistic environments such as noisy conditions. Since October 2001, our working group of the Information Processing Society of Japan has been developing evaluation methodologies and frameworks for Japanese noisy speech recognition. We have released frameworks, each comprising databases and evaluation tools: CENSREC-1 (Corpus and Environment for Noisy Speech RECognition 1; formerly AURORA-2J), CENSREC-2 (in-car connected digits recognition), CENSREC-3 (in-car isolated word recognition), and CENSREC-1-C (voice activity detection under noisy conditions). In this paper, we introduce a new collection of databases and evaluation tools named CENSREC-4, an evaluation framework for distant-talking speech under hands-free conditions. Distant-talking speech recognition is crucial for a hands-free speech interface. We therefore measured room impulse responses to investigate reverberant speech recognition. In evaluation experiments, a traditional dereverberation process had difficulty sufficiently improving recognition performance, which indicates that CENSREC-4 is an effective database for evaluating new dereverberation methods. The framework was released in March 2008, and many studies are being conducted with it in Japan.
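
As an illustration of how measured room impulse responses are commonly used, reverberant speech can be simulated by convolving clean speech with an impulse response. The sketch below is a minimal pure-Python example under stated assumptions: the abstract does not describe the CENSREC-4 tooling, the function names `convolve` and `synthetic_rir` are hypothetical, and the toy exponentially decaying response stands in for a measured one.

```python
def convolve(signal, impulse_response):
    """Full linear convolution of two sequences of floats."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


def synthetic_rir(length, decay):
    """Hypothetical toy impulse response: a direct path followed by an
    exponentially decaying tail. A real evaluation would load a measured
    room impulse response instead."""
    return [decay ** n for n in range(length)]


# Toy "clean speech" samples (assumption: real data would be audio samples).
clean = [1.0, 0.0, 0.0, 0.5]
rir = synthetic_rir(3, 0.5)          # [1.0, 0.5, 0.25]
reverberant = convolve(clean, rir)   # each sample is smeared by the decaying tail
print(reverberant)
```

Each input sample spreads into later output samples according to the impulse response, which is the smearing effect that dereverberation methods attempt to undo.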

Pages: 1828-1834

Page count: 7
