Improving Speech-Based Dysarthria Detection using Multi-task Learning with Gradient Projection

Cited by: 0
Authors
Xiang, Yan [1]
Berisha, Visar [1, 2]
Liss, Julie [2]
Chakrabarti, Chaitali [1]
Affiliations
[1] Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ 85281 USA
[2] Arizona State Univ, Coll Hlth Solut, Tempe, AZ USA
Source
INTERSPEECH 2024 | 2024
Keywords
Dysarthria detection; speech processing; deep neural network; multi-task learning; disease
DOI
10.21437/Interspeech.2024-1563
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Speech analytic models based on deep learning are popular in clinical diagnostics. However, constraints on clinical data collection and sharing place limits on available dataset sizes, which adversely impacts trained model performance. Multi-task learning (MTL) has been utilized to mitigate the effect of limited sample size by jointly training on multiple tasks that are considered to be related. However, discrepancies between clinical and non-clinical tasks can reduce MTL efficiency and can even cause it to fail, especially when there are gradient conflicts. In this paper, we enhance the performance of dysarthria detection by using MTL with an auxiliary task of learning speaker embeddings. We propose a task-specific gradient projection method to overcome gradient conflicts. Our evaluation shows that the proposed MTL paradigm outperforms both single-task learning and conventional MTL under different data availability settings.
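The abstract does not spell out the projection rule, so the following is only a minimal sketch of one common way to resolve a gradient conflict, not the authors' method. It assumes that a conflict means a negative dot product between the two task gradients, and that only the auxiliary (speaker-embedding) gradient is altered, by removing its component that opposes the main (dysarthria-detection) gradient. The function name `project_conflicting` and the toy vectors are hypothetical.

```python
import numpy as np


def project_conflicting(aux_grad, main_grad, eps=1e-12):
    """Remove from aux_grad the component that opposes main_grad.

    Assumption (not taken from the paper): the gradients conflict when
    their dot product is negative, and only the auxiliary gradient is
    modified.
    """
    dot = float(np.dot(aux_grad, main_grad))
    if dot >= 0.0:
        # No conflict: leave the auxiliary gradient unchanged.
        return aux_grad.copy()
    # Conflict: subtract the projection of aux_grad onto main_grad.
    scale = dot / (float(np.dot(main_grad, main_grad)) + eps)
    return aux_grad - scale * main_grad


# Toy flattened gradients of the shared parameters (hypothetical values).
main_grad = np.array([1.0, 0.0])
aux_grad = np.array([-1.0, 1.0])

projected = project_conflicting(aux_grad, main_grad)
combined = main_grad + projected  # update direction for the shared layers
```

In this toy case the projected auxiliary gradient is approximately orthogonal to the main-task gradient, so the combined update no longer moves against the main task.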
Pages: 902-906
Page count: 5
Related papers
50 in total
  • [21] Multi-Task Learning for Mispronunciation Detection on Singapore Children's Mandarin Speech
    Tong, Rong
    Chen, Nancy F.
    Ma, Bin
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2193 - 2197
  • [22] Improving Weakly Supervised Lesion Segmentation using Multi-Task Learning
    Chu, Tianshu
    Li, Xinmeng
    Vo, Huy V.
    Summers, Ronald M.
    Sizikova, Elena
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 143, 2021, 143 : 60 - 73
  • [23] Deep Chessboard Corner Detection Using Multi-task Learning
    Yoon, Hyunse
    Lee, Seongmin
    Kang, Jiwoo
    Lee, Sanghoon
    IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021
  • [24] Fetal Cardiac Structure Detection Using Multi-task Learning
    He, Jie
    Yang, Lei
    Zhu, Yunping
    Li, Donglian
    Ding, Zhixing
    Lu, Yuhuan
    Liang, Bocheng
    Li, Shengli
    ADVANCED INTELLIGENT COMPUTING IN BIOINFORMATICS, PT II, ICIC 2024, 2024, 14882 : 405 - 419
  • [25] Generalizing Hate Speech Detection Using Multi-Task Learning: A Case Study of Political Public Figures
    Yuan, Lanqin
    Rizoiu, Marian-Andrei
    COMPUTER SPEECH AND LANGUAGE, 2025, 89
  • [26] Towards multi-task learning of speech and speaker recognition
    Vaessen, Nik
    van Leeuwen, David A.
    INTERSPEECH 2023, 2023, : 4898 - 4902
  • [27] Arabic Offensive and Hate Speech Detection Using a Cross-Corpora Multi-Task Learning Model
    Aldjanabi, Wassen
    Dahou, Abdelghani
    Al-qaness, Mohammed A. A.
    Abd Elaziz, Mohamed
    Helmi, Ahmed Mohamed
    Damasevicius, Robertas
    INFORMATICS-BASEL, 2021, 8 (04)
  • [28] Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning
    Wen, Zhengqi
    Li, Kehuang
    Huang, Zhen
    Lee, Chin-Hui
    Tao, Jianhua
    JOURNAL OF SIGNAL PROCESSING SYSTEMS, 2018, 90 : 1025 - 1037
  • [29] A multi-task based deep learning approach for intrusion detection
    Liu, Qigang
    Wang, Deming
    Jia, Yuhang
    Luo, Suyuan
    Wang, Chongren
    KNOWLEDGE-BASED SYSTEMS, 2022, 238
  • [30] Improving Low-Resource Chinese Event Detection with Multi-task Learning
    Tong, Meihan
    Xu, Bin
    Wang, Shuai
    Hou, Lei
    Li, Juanzi
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2020), PT I, 2020, 12274 : 421 - 433