Speech recognition based on unified model of acoustic and language aspects of speech

被引：0

作者：

机构：

[1] Kubo, Yotaro

[2] Ogawa, Atsunori

[3] Hori, Takaaki

[4] Nakamura, Atsushi

来源：

| 1600年 / Nippon Telegraph and Telephone Corp.卷 / 11期

关键词：

Deep learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Automatic speech recognition has been attracting a lot of attention recently and is considered an important technique to achieve natural interaction between humans and machines. However, recognizing spontaneous speech is still considered to be difficult owing to the wide variety of patterns in spontaneous speech. We have been researching ways to overcome this problem and have developed a method to express both the acoustic and linguistic aspects of speech recognizers in a unified representation by integrating powerful frameworks of deep learning and a weighted finite-state transducer. We evaluated the proposed method ill an experiment to recognize a lecture speech dataset, which is coilsidered as a spontaneous speech dataset, and confirmed that the proposed method is promising for recognizing spontaneous speech.

引用

共 50 条

[1] A unified language model architecture for web-based speech recognition grammars
Holland, Wesley
May, Daniel
Baca, Julie
Lazarou, Georgios
Picone, Joseph
2006 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2006, : 294 - +
[2] A class based language model for speech recognition
Ward, W
Issar, S
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 416 - 418
[3] A unified language model for large vocabulary continuous speech recognition of Turkish
Arisoy, Ebru
Dutagaci, Helin
Arslan, Levent M.
SIGNAL PROCESSING, 2006, 86 (10) : 2844 - 2862
[4] Speech Emotion Recognition Based on Acoustic Segment Model
Zheng, Siyuan
Du, Jun
Zhou, Hengshun
Bai, Xue
Lee, Chin-Hui
Li, Shipeng
2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
[5] (Speech recognition based on Spanish accent acoustic model)
Plaza, Johanna
Sanchez-Zhunio, Cristina
Acosta-Uriguen, Maria-Ines
Orellana, Marcos
Cedillo, Priscila
Zambrano-Martinez, Jorge Luis
ENFOQUE UTE, 2022, 13 (03): : 45 - 57
[6] ACOUSTIC AND LANGUAGE PROCESSING TECHNOLOGY FOR SPEECH RECOGNITION
MATSUOKA, T
MINAMI, Y
NTT REVIEW, 1995, 7 (02): : 30 - 39
[7] Joint acoustic and language modeling for speech recognition
Chien, Jen-Tzung
Chueh, Chuang-Hua
SPEECH COMMUNICATION, 2010, 52 (03) : 223 - 235
[8] DISCRIMINATIVELY ESTIMATED JOINT ACOUSTIC, DURATION, AND LANGUAGE MODEL FOR SPEECH RECOGNITION
Lehr, Maider
Shafran, Izhak
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5542 - 5545
[9] A unified system for multilingual speech recognition and language identification
Liu, Danyang
Xu, Ji
Zhang, Pengyuan
Yan, Yonghong
SPEECH COMMUNICATION, 2021, 127 : 17 - 28
[10] A CACHE-BASED LANGUAGE MODEL FOR SPEECH RECOGNITION
KUHN, R
DEMORI, R
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1992, 14 (06) : 691 - 692

← 1 2 3 4 5 →