Automatic Pronunciation Generation by Utilizing a Semi-supervised Deep Neural Networks

被引:0
|
作者
Takahashi, Naoya [1 ]
Naghibi, Tofigh [2 ]
Pfister, Beat [2 ]
机构
[1] Sony Corp, Tokyo, Japan
[2] Swiss Fed Inst Technol, Speech Proc Grp, Zurich, Switzerland
来源
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年
关键词
speech recognition; deep neural networks; semi-supervised learning; dictionary; sub-word unit; k-dimensional Viterbi; SPEECH RECOGNITION;
D O I
10.21437/Interspeech.2016-761
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Phonemic or phonetic sub-word units are the most commonly used atomic elements to represent speech signals in modern ASRs. However they are not the optimal choice due to several reasons such as: large amount of effort required to handcraft a pronunciation dictionary, pronunciation variations, human mistakes and under-resourced dialects and languages. Here, we propose a data-driven pronunciation estimation and acoustic modeling method which only takes the orthographic transcription to jointly estimate a set of sub-word units and a reliable dictionary. Experimental results show that the proposed method which is based on semi-supervised training of a deep neural network largely outperforms phoneme based continuous speech recognition on the TIMIT dataset.
引用
收藏
页码:1141 / 1145
页数:5
相关论文
共 50 条
  • [21] Deep Neural Backdoor in Semi-Supervised Learning: Threats and Countermeasures
    Yan, Zhicong
    Wu, Jun
    Li, Gaolei
    Li, Shenghong
    Guizani, Mohsen
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 4827 - 4842
  • [22] ACOUSTIC CLASSIFICATION USING SEMI-SUPERVISED DEEP NEURAL NETWORKS AND STOCHASTIC ENTROPY-REGULARIZATION OVER NEAREST-NEIGHBOR GRAPHS
    Thulasidasan, Sunil
    Bilmes, Jeffrey
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2731 - 2735
  • [23] DEEP CONTEXTUALIZED ACOUSTIC REPRESENTATIONS FOR SEMI-SUPERVISED SPEECH RECOGNITION
    Ling, Shaoshi
    Liu, Yuzong
    Salazar, Julian
    Kirchhoff, Katrin
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6429 - 6433
  • [24] Active and Semi-Supervised Graph Neural Networks for Graph Classification
    Xie, Yu
    Lv, Shengze
    Qian, Yuhua
    Wen, Chao
    Liang, Jiye
    IEEE TRANSACTIONS ON BIG DATA, 2022, 8 (04) : 920 - 932
  • [25] IMPROVING SMALL CONVOLUTIONAL NEURAL NETWORKS WITH SEMI-SUPERVISED LEARNING
    Badea, Mihai
    Vertan, Constantin
    Florea, Corneliu
    Florea, Laura
    Racoviteanu, Andrei
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2022, 84 (03): : 107 - 118
  • [26] Semi-supervised distillation: Personalizing deep neural networks in activity recognition using inertial sensors
    Iwasawa Y.
    Yairi I.E.
    Matsuo Y.
    Transactions of the Japanese Society for Artificial Intelligence, 2017, 32 (03)
  • [27] Assessment of Semi-supervised Approaches Applied to Convolutional Neural Networks
    Bassani, Cristiano N. de O.
    Saito, Prisicla T. M.
    Bugatti, Pedro H.
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2022, PT II, 2023, 13589 : 195 - 205
  • [28] Multi-softmax Deep Neural Network for Semi-supervised Training
    Su, Hang
    Xu, Haihua
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3239 - 3243
  • [29] 3D spatial priors for semi-supervised organ segmentation with deep convolutional neural networks
    Petit, Olivier
    Thome, Nicolas
    Soler, Luc
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2022, 17 (01) : 129 - 139
  • [30] Shift Quality Classifier Using Deep Neural Networks on Small Data with Dropout and Semi-Supervised Learning
    Kawakami, Takefumi
    Ide, Takanori
    Hoki, Kunihito
    Muramatsu, Masakazu
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (12) : 2078 - 2084