Unsupervised Speaker Diarization in Distributed IoT Networks Using Federated Learning

被引:0
|
作者
Bhuyan, Amit Kumar [1 ]
Dutta, Hrishikesh [1 ]
Biswas, Subir [1 ]
机构
[1] Michigan State Univ, Dept Elect & Comp Engn, E Lansing, MI 48823 USA
来源
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2025年 / 9卷 / 02期
关键词
Mel frequency cepstral coefficient; Computational modeling; Accuracy; Oral communication; Training; Bayes methods; Feature extraction; Data models; Computational intelligence; Unsupervised Learning; Bayesian methods; federated learning; distributed processing; Hotelling's t-squared statistic; Bayesian information criterion; cepstral analysis; SEGMENTATION;
D O I
10.1109/TETCI.2024.3482855
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a computationally efficient and distributed speaker diarization framework for networked IoT-style audio devices. The work proposes a Federated Learning model which can identify the participants in a conversation without the requirement of a large audio database for training. An unsupervised online update mechanism is proposed for the Federated Learning model which depends on cosine similarity of speaker embeddings. Moreover, the proposed diarization system solves the problem of speaker change detection via. unsupervised segmentation techniques using Hotelling's t-squared Statistic and Bayesian Information Criterion. In this new approach, speaker change detection is biased around detected quasi-silences, which reduces the severity of the trade-off between the missed detection and false detection rates. Additionally, the computational overhead due to frame-by-frame identification of speakers is reduced via. unsupervised clustering of speech segments. The results demonstrate the effectiveness of the proposed training method in the presence of non-IID speech data. It also shows a considerable improvement in the reduction of false and missed detection at the segmentation stage, while reducing the computational overhead. Improved accuracy and reduced computational cost makes the mechanism suitable for real-time speaker diarization across a distributed IoT audio network.
引用
收藏
页码:1934 / 1946
页数:13
相关论文
共 50 条
  • [1] Federated Feature Selection for Horizontal Federated Learning in IoT Networks
    Zhang, Xunzheng
    Mavromatis, Alex
    Vafeas, Antonis
    Nejabati, Reza
    Simeonidou, Dimitra
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (11) : 10095 - 10112
  • [2] Low-Latency Federated Learning With DNN Partition in Distributed Industrial IoT Networks
    Deng, Xiumei
    Li, Jun
    Ma, Chuan
    Wei, Kang
    Shi, Long
    Ding, Ming
    Chen, Wen
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (03) : 755 - 775
  • [3] Unsupervised Data Splitting Scheme for Federated Edge Learning in IoT Networks
    Nour, Boubakr
    Cherkaoui, Soumaya
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022,
  • [4] Incremental Unsupervised Adversarial Domain Adaptation for Federated Learning in IoT Networks
    Huang, Yan
    Du, Mengxuan
    Zheng, Haifeng
    Feng, Xinxin
    2022 18TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING, MSN, 2022, : 186 - 190
  • [5] HT-FL: Hybrid Training Federated Learning for Heterogeneous Edge-Based IoT Networks
    Gu, Yixun
    Wang, Jie
    Zhao, Shengjie
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (04) : 2817 - 2831
  • [6] FedCL: An Efficient Federated Unsupervised Learning for Model Sharing in IoT
    Zhao, Chen
    Gao, Zhipeng
    Wang, Qian
    Mo, Zijia
    Yu, Xinlei
    COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, COLLABORATECOM 2022, PT I, 2022, 460 : 115 - 134
  • [7] Lightweight Federated-Learning-Driven Traffic Prediction for Heterogeneous IoT Networks
    Wang, Ying
    Zhang, Qianqian
    Wei, Tongyan
    Cong, Lin
    Yu, Peng
    Guo, Shaoyong
    Qiu, Xuesong
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (24): : 40656 - 40669
  • [8] Accountable and Verifiable Secure Aggregation for Federated Learning in IoT Networks
    Yang, Xiaoyi
    Zhao, Yanqi
    Chen, Dian
    Yu, Yong
    Du, Xiaojiang
    Guizani, Mohsen
    IEEE NETWORK, 2022, 36 (05): : 173 - 179
  • [9] Compressive-Learning-Based Federated Learning for Intelligent IoT With CloudEdge Collaboration
    Gao, Xianwei
    Hou, Lingyu
    Chen, Bi
    Yao, Xiang
    Suo, Zhufeng
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (02): : 2291 - 2294
  • [10] Federated Learning for Privacy-Preserving Speaker Recognition
    Woubie, Abraham
    Backstrom, Tom
    IEEE ACCESS, 2021, 9 : 149477 - 149485