Improving Speech-Based Dysarthria Detection using Multi-task Learning with Gradient Projection

被引:0
作者
Xiang, Yan [1 ]
Berisha, Visar [1 ,2 ]
Liss, Julie [2 ]
Chakrabarti, Chaitali [1 ]
机构
[1] Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ 85281 USA
[2] Arizona State Univ, Coll Hlth Solut, Tempe, AZ USA
来源
INTERSPEECH 2024 | 2024年
关键词
Dysarthria detection; speech processing; deep neural network; multi-task learning; DISEASE;
D O I
10.21437/Interspeech.2024-1563
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech analytic models based on deep learning are popular in clinical diagnostics. However, constraints on clinical data collection and sharing place limits on available dataset sizes, which adversely impacts trained model performance. Multi-task learning (MTL) has been utilized to mitigate the effect of limited sample size by jointly training on multiple tasks that are considered to be related. However, discrepancies between clinical and non-clinical tasks can reduce MTL efficiency and can even cause it to fail, especially when there are gradient conflicts. In this paper, we enhance the performance of dysarthria detection by using MTL with an auxiliary task of learning speaker embeddings. We propose a task-specific gradient projection method to overcome gradient conflicts. Our evaluation shows that the proposed MTL paradigm outperforms both single-task learning and conventional MTL under different data availability settings.
引用
收藏
页码:902 / 906
页数:5
相关论文
共 50 条
  • [41] Improved Accented Speech Recognition Using Accent Embeddings and Multi-task Learning
    Jain, Abhinav
    Upreti, Minali
    Jyothi, Preethi
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2454 - 2458
  • [42] GDOD: Effective Gradient Descent using Orthogonal Decomposition for Multi-Task Learning
    Dong, Xin
    Wu, Ruize
    Xiong, Chao
    Li, Hai
    Cheng, Lei
    He, Yong
    Qian, Shiyou
    Cao, Jian
    Mo, Linjian
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 386 - 395
  • [43] Automatic Cataract Detection with Multi-Task Learning
    Wu, Hongjie
    Lv, Jiancheng
    Wang, Jian
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [44] MULTI-OBJECTIVE MULTI-TASK LEARNING ON RNNLM FOR SPEECH RECOGNITION
    Song, Minguang
    Zhao, Yunxin
    Wang, Shaojun
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 197 - 203
  • [45] Speech based suicide risk recognition for crisis intervention hotlines using explainable multi-task learning
    Ding, Zhong
    Zhou, Yang
    Dai, An-Jie
    Qian, Chen
    Zhong, Bao-Liang
    Liu, Chen-Ling
    Liu, Zhen-Tao
    JOURNAL OF AFFECTIVE DISORDERS, 2025, 370 : 392 - 400
  • [46] Joint Disaster Classification and Victim Detection using Multi-Task Learning
    Tham, Mau-Luen
    Wong, Yi Jie
    Kwan, Ban Hoe
    Owada, Yasunori
    Sein, Myint Myint
    Chang, Yoong Choon
    2021 IEEE 12TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2021, : 407 - 412
  • [47] NEURAL MOS PREDICTION FOR SYNTHESIZED SPEECH USING MULTI-TASK LEARNING WITH SPOOFING DETECTION AND SPOOFING TYPE CLASSIFICATION
    Choi, Yeunju
    Jung, Youngmoon
    Kim, Hoirin
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 462 - 469
  • [48] Sentiment Analysis and Sarcasm Detection using Deep Multi-Task Learning
    Tan, Yik Yang
    Chow, Chee-Onn
    Kanesan, Jeevan
    Chuah, Joon Huang
    Lim, YongLiang
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 129 (03) : 2213 - 2237
  • [49] STATISTICAL PARAMETRIC SPEECH SYNTHESIS USING GENERATIVE ADVERSARIAL NETWORKS UNDER A MULTI-TASK LEARNING FRAMEWORK
    Yang, Shan
    Xie, Lei
    Chen, Xiao
    Lou, Xiaoyan
    Zhu, Xuan
    Huang, Dongyan
    Li, Haizhou
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 685 - 691
  • [50] Accelerated Gradient Method for Multi-Task Sparse Learning Problem
    Chen, Xi
    Pan, Weike
    Kwok, James T.
    Carbonell, Jaime G.
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 746 - +