Speech recognition based on unified model of acoustic and language aspects of speech

被引:0
|
作者
机构
[1] Kubo, Yotaro
[2] Ogawa, Atsunori
[3] Hori, Takaaki
[4] Nakamura, Atsushi
来源
| 1600年 / Nippon Telegraph and Telephone Corp.卷 / 11期
关键词
Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
Automatic speech recognition has been attracting a lot of attention recently and is considered an important technique to achieve natural interaction between humans and machines. However, recognizing spontaneous speech is still considered to be difficult owing to the wide variety of patterns in spontaneous speech. We have been researching ways to overcome this problem and have developed a method to express both the acoustic and linguistic aspects of speech recognizers in a unified representation by integrating powerful frameworks of deep learning and a weighted finite-state transducer. We evaluated the proposed method ill an experiment to recognize a lecture speech dataset, which is coilsidered as a spontaneous speech dataset, and confirmed that the proposed method is promising for recognizing spontaneous speech.
引用
收藏
相关论文
共 50 条
  • [1] A unified language model architecture for web-based speech recognition grammars
    Holland, Wesley
    May, Daniel
    Baca, Julie
    Lazarou, Georgios
    Picone, Joseph
    2006 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2006, : 294 - +
  • [2] A class based language model for speech recognition
    Ward, W
    Issar, S
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 416 - 418
  • [3] A unified language model for large vocabulary continuous speech recognition of Turkish
    Arisoy, Ebru
    Dutagaci, Helin
    Arslan, Levent M.
    SIGNAL PROCESSING, 2006, 86 (10) : 2844 - 2862
  • [4] Speech Emotion Recognition Based on Acoustic Segment Model
    Zheng, Siyuan
    Du, Jun
    Zhou, Hengshun
    Bai, Xue
    Lee, Chin-Hui
    Li, Shipeng
    2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [5] (Speech recognition based on Spanish accent acoustic model)
    Plaza, Johanna
    Sanchez-Zhunio, Cristina
    Acosta-Uriguen, Maria-Ines
    Orellana, Marcos
    Cedillo, Priscila
    Zambrano-Martinez, Jorge Luis
    ENFOQUE UTE, 2022, 13 (03): : 45 - 57
  • [6] ACOUSTIC AND LANGUAGE PROCESSING TECHNOLOGY FOR SPEECH RECOGNITION
    MATSUOKA, T
    MINAMI, Y
    NTT REVIEW, 1995, 7 (02): : 30 - 39
  • [7] Joint acoustic and language modeling for speech recognition
    Chien, Jen-Tzung
    Chueh, Chuang-Hua
    SPEECH COMMUNICATION, 2010, 52 (03) : 223 - 235
  • [8] DISCRIMINATIVELY ESTIMATED JOINT ACOUSTIC, DURATION, AND LANGUAGE MODEL FOR SPEECH RECOGNITION
    Lehr, Maider
    Shafran, Izhak
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5542 - 5545
  • [9] A unified system for multilingual speech recognition and language identification
    Liu, Danyang
    Xu, Ji
    Zhang, Pengyuan
    Yan, Yonghong
    SPEECH COMMUNICATION, 2021, 127 : 17 - 28
  • [10] A CACHE-BASED LANGUAGE MODEL FOR SPEECH RECOGNITION
    KUHN, R
    DEMORI, R
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1992, 14 (06) : 691 - 692