HIERARCHICAL MULTITASK LEARNING WITH CTC

被引:0
|
作者
Sanabria, Ramon [1 ]
Metze, Florian [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Language Technol Inst, Pittsburgh, PA 15213 USA
关键词
hierarchical multitask learning; ASR; CTC;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In Automatic Speech Recognition, it is still challenging to learn useful intermediate representations when using high-level (or abstract) target units such as words. For that reason, when only a few hundreds of hours of training data are available, character or phoneme-based systems tend to outperform word-based systems. In this paper, we show how Hierarchical Multitask Learning can encourage the formation of useful intermediate representations. We achieve this by performing Connectionist Temporal Classification at different levels of the network with targets of different granularity. Our model thus performs predictions in multiple scales for the same input. On the standard 300h Switchboard training setup, our hierarchical multitask architecture demonstrates improvements over singletask architectures with the same number of parameters. Our model obtains 14.0% Word Error Rate on the Switchboard subset of the Eval2000 test set without any decoder or language model, outperforming the current state-of-the-art on non-autoregressive Acoustic-to-Word models.
引用
收藏
页码:485 / 490
页数:6
相关论文
共 50 条
  • [31] Sparse and Structured Function-on-Function Quality Predictive Modeling by Hierarchical Variable Selection and Multitask Learning
    Wang, Kai
    Tsung, Fugee
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (10) : 6720 - 6730
  • [32] Collaborative Online Multitask Learning
    Li, Guangxia
    Hoi, Steven C. H.
    Chang, Kuiyu
    Liu, Wenting
    Jain, Ramesh
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (08) : 1866 - 1876
  • [33] Online Federated Multitask Learning
    Li, Rui
    Ma, Fenglong
    Jiang, Wenjun
    Gao, Jing
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 215 - 220
  • [34] A Hierarchical Multitask Learning Approach for the Recognition of Activities of Daily Living Using Data from Wearable Sensors
    Nisar, Muhammad Adeel
    Shirahama, Kimiaki
    Irshad, Muhammad Tausif
    Huang, Xinyu
    Grzegorzek, Marcin
    SENSORS, 2023, 23 (19)
  • [35] NMDA-driven dendritic modulation enables multitask representation learning in hierarchical sensory processing pathways
    Wybo, Willem A. M.
    Tsai, Matthias C.
    Tran, Viet Anh Khoa
    Illing, Bernd
    Jordan, Jakob
    Morrison, Abigail
    Senn, Walter
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (32)
  • [36] Embodied Multimodal Multitask Learning
    Chaplot, Devendra Singh
    Lee, Lisa
    Salakhutdinov, Ruslan
    Parikh, Devi
    Batra, Dhruv
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2442 - 2448
  • [37] Exploring Multitask Learning for Steganalysis
    Makelberge, Julie
    Ker, Andrew D.
    MEDIA WATERMARKING, SECURITY, AND FORENSICS 2013, 2013, 8665
  • [38] Multitask Soft Option Learning
    Igl, Maximilian
    Gambardella, Andrew
    He, Jinke
    Nardelli, Nantas
    Siddharth, N.
    Bohmer, Wendelin
    Whiteson, Shimon
    CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 969 - 978
  • [39] A dozen tricks with multitask learning
    Caruana, R
    NEURAL NETWORKS: TRICKS OF THE TRADE, 1998, 1524 : 165 - 191
  • [40] Multitask Learning With Enhanced Modules
    Zheng, Zishuo
    Wei, Yadong
    Zhao, Zixu
    Wu, Xindi
    Li, Zhengcheng
    Ren, Pengju
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,