MULTI-STYLE MLP FEATURES FOR BN TRANSCRIPTION

被引:6
|
作者
Le, Viet-Bac [1 ]
Lamel, Lori [1 ]
Gauvain, Jean-Luc [1 ]
机构
[1] LIMSI CNRS, Spoken Language Proc Grp, F-91403 Orsay, France
来源
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年
关键词
MLP features; condition-specific adaptation; BN transcription;
D O I
10.1109/ICASSP.2010.5495116
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
It has become common practice to adapt acoustic models to specific-conditions (gender, accent, bandwidth) in order to improve the performance of speech-to-text (STT) transcription systems. With the growing interest in the use of discriminative features produced by a multi layer perceptron (MLP) in such systems, the question arise of whether it is necessary to specialize the MLP to particular conditions, and if so, how to incorporate the condition-specific MLP features in the system. This paper explores three approaches (adaptation, full training, and feature merging) to use condition-specific MLP features in a state-of-the-art BN STT system for French. The third approach without condition-specific adaptation was found to outperform the original models with condition-specific adaptation, and was found to perform almost as well as full training of multiple condition-specific HMMs.
引用
收藏
页码:4866 / 4869
页数:4
相关论文
共 50 条
  • [1] Multi-Style Migration QR Code
    You, Fucheng
    Lai, Shuren
    Gong, Hechen
    Zhao, Yangze
    3RD ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI2018), 2018, 1069
  • [2] Fast Video Multi-Style Transfer
    Gao, Wei
    Lie, Yijun
    Yin, Yihang
    Yang, Ming-Hsuan
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 3211 - 3219
  • [3] The Communication Value of Multi-style Subtitles
    Zeng, Guangyu
    PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON EDUCATION, SPORTS, ARTS AND MANAGEMENT ENGINEERING (ICESAME 2017), 2017, 123 : 685 - 690
  • [4] Multi-Style Generative Reading Comprehension
    Nishida, Kyosuke
    Saito, Itsumi
    Nishida, Kosuke
    Shinoda, Kazutoshi
    Otsuka, Atsushi
    Asano, Hisako
    Tomita, Junji
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2273 - 2284
  • [5] Interactive Artistic Multi-style Transfer
    Wang, Xiaohui
    Lyu, Yiran
    Huang, Junfeng
    Wang, Ziying
    Qin, Jingyan
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01)
  • [6] Interactive Artistic Multi-style Transfer
    Xiaohui Wang
    Yiran Lyu
    Junfeng Huang
    Ziying Wang
    Jingyan Qin
    International Journal of Computational Intelligence Systems, 14
  • [7] Design of a Multi-Style and Multi-Frequency FPGA
    Manoranjan, Jotham Vaddaboina
    Sajjan, Solomon Surya Tej Mano
    Gujari, Vivek B.
    Stevens, Kenneth S.
    2016 IFIP/IEEE INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION (VLSI-SOC), 2016,
  • [8] Image Style Transfer via Multi-Style Geometry Warping
    Alexandru, Ioana
    Nicula, Constantin
    Prodan, Cristian
    Rotaru, Razvan-Paul
    Tarba, Nicolae
    Boiangiu, Costin-Anton
    APPLIED SCIENCES-BASEL, 2022, 12 (12):
  • [9] FPGA architecture for multi-style asynchronous logic
    Huot, N
    Dubreuil, H
    Fesquet, L
    Renaudin, M
    DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 32 - 33
  • [10] MSN: Multi-Style Network for Trajectory Prediction
    Wong, Conghao
    Xia, Beihao
    Peng, Qinmu
    Yuan, Wei
    You, Xinge
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (09) : 9751 - 9766