PRIVACY SENSITIVE SPEECH ANALYSIS USING FEDERATED LEARNING TO ASSESS DEPRESSION

被引:14
作者
Suhas, B. N. [1 ]
Abdullah, Saeed [1 ]
机构
[1] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年
关键词
speech classification; depression; privacy; paralinguistics; mHealth;
D O I
10.1109/ICASSP43922.2022.9746827
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recent studies have used speech signals to assess depression. However, speech features can lead to serious privacy concerns. To address these concerns, prior work has used privacy-preserving speech features. However, using a subset of features can lead to information loss and, consequently, non-optimal model performance. Furthermore, prior work relies on a centralized approach to support continuous model updates, posing privacy risks. This paper proposes to use Federated Learning (FL) to enable decentralized, privacy-preserving speech analysis to assess depression. Using an existing dataset (DAIC-WOZ), we show that FL models enable a robust assessment of depression with only 4-6% accuracy loss compared to a centralized approach. These models also outperform prior work using the same dataset. Furthermore, the FL models have short inference latency and small memory footprints while being energy-efficient. These models, thus, can be deployed on mobile devices for real-time, continuous, and privacy-preserving depression assessment at scale.
引用
收藏
页码:6272 / 6276
页数:5
相关论文
共 24 条
[21]  
WHO, 2017, Depression and other common mental disorders: global health estimates
[22]   Inferring Colocation and Conversation Networks from Privacy-Sensitive Audio with Implications for Computational Social Science [J].
Wyatt, Danny ;
Choudhury, Tanzeem ;
Bilmes, Jeff ;
Kitts, James A. .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (01)
[23]  
Yao X, 2018, IEEE I C VI COM I PR
[24]  
You Kaichao, 2019, How does learning rate decay help modern neural networks?