Unsupervised Speaker Diarization in Distributed IoT Networks Using Federated Learning

Cited by: 0
Authors
Bhuyan, Amit Kumar [1]
Dutta, Hrishikesh [1]
Biswas, Subir [1]
Affiliation
[1] Michigan State Univ, Dept Elect & Comp Engn, E Lansing, MI 48823 USA
Source
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2025 / Vol. 9 / No. 02
Keywords
Mel frequency cepstral coefficient; Computational modeling; Accuracy; Oral communication; Training; Bayes methods; Feature extraction; Data models; Computational intelligence; Unsupervised Learning; Bayesian methods; federated learning; distributed processing; Hotelling's t-squared statistic; Bayesian information criterion; cepstral analysis; SEGMENTATION;
DOI
10.1109/TETCI.2024.3482855
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a computationally efficient and distributed speaker diarization framework for networked IoT-style audio devices. The work proposes a Federated Learning model that can identify the participants in a conversation without requiring a large audio database for training. An unsupervised online update mechanism, which depends on the cosine similarity of speaker embeddings, is proposed for the Federated Learning model. Moreover, the proposed diarization system solves the problem of speaker change detection via unsupervised segmentation techniques using Hotelling's t-squared statistic and the Bayesian Information Criterion. In this new approach, speaker change detection is biased around detected quasi-silences, which reduces the severity of the trade-off between the missed detection and false detection rates. Additionally, the computational overhead due to frame-by-frame identification of speakers is reduced via unsupervised clustering of speech segments. The results demonstrate the effectiveness of the proposed training method in the presence of non-IID speech data. They also show a considerable reduction in false and missed detections at the segmentation stage, along with reduced computational overhead. Improved accuracy and reduced computational cost make the mechanism suitable for real-time speaker diarization across a distributed IoT audio network.
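The abstract names two standard building blocks: cosine similarity between speaker embeddings and a Bayesian Information Criterion test for speaker change. The sketch below illustrates the textbook forms of both and is not taken from the paper. The function names `cosine_similarity` and `delta_bic`, the penalty weight `lam`, and the full-covariance Gaussian model are assumptions made for illustration.

```python
import numpy as np


def cosine_similarity(a, b):
    """Cosine of the angle between two embedding vectors (assumed non-zero)."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))


def _logdet_cov(frames):
    """Log-determinant of the maximum-likelihood covariance of (frames, dims) data."""
    cov = np.atleast_2d(np.cov(frames, rowvar=False, bias=True))
    _, logdet = np.linalg.slogdet(cov)
    return logdet


def delta_bic(x, y, lam=1.0):
    """Textbook delta-BIC between two adjacent feature windows.

    x, y: arrays of shape (frames, dims), e.g. cepstral features.
    lam:  penalty weight (hypothetical default; usually tuned).
    A positive value favours modelling the windows with two separate
    Gaussians, which suggests a speaker change at the boundary.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    z = np.vstack([x, y])
    n1, n2, n = len(x), len(y), len(z)
    d = z.shape[1]
    # Gain in log-likelihood from splitting one Gaussian into two.
    gain = 0.5 * (n * _logdet_cov(z) - n1 * _logdet_cov(x) - n2 * _logdet_cov(y))
    # Penalty for the extra mean and covariance parameters.
    penalty = 0.5 * lam * (d + 0.5 * d * (d + 1)) * np.log(n)
    return gain - penalty
```

In a pipeline of the kind the abstract describes, a test like `delta_bic` would presumably be evaluated only at candidate boundaries near detected quasi-silences rather than at every frame. That placement is an inference from the abstract, not a detail confirmed by this record.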
Pages: 1934 - 1946
Number of pages: 13
Related papers
50 in total
  • [41] Optimizing Federated Learning in Distributed Industrial IoT: A Multi-Agent Approach
    Zhang, Weiting
    Yang, Dong
    Wu, Wen
    Peng, Haixia
    Zhang, Ning
    Zhang, Hongke
    Shen, Xuemin
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (12) : 3688 - 3703
  • [42] Speaker2Vec: Unsupervised Learning and Adaptation of a Speaker Manifold using Deep Neural Networks with an Evaluation on Speaker Segmentation
    Jati, Arindam
    Georgiou, Panayiotis
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3567 - 3571
  • [43] Collaborative DDoS Detection in Distributed Multi-Tenant IoT using Federated Learning
    Neto, Euclides Carlos Pinto
    Dadkhah, Sajjad
    Ghorbani, Ali A.
    2022 19TH ANNUAL INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY & TRUST (PST), 2022,
  • [44] Prototype Similarity Distillation for Communication-Efficient Federated Unsupervised Representation Learning
    Zhang, Chen
    Xie, Yu
    Chen, Tingbin
    Mao, Wenjie
    Yu, Bin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (11) : 6865 - 6876
  • [45] uitAnDiNeFed: android malware classification on distributed networks by using federated learning
    Cam, Nguyen Tan
    Vuong, Vo Quoc
    WIRELESS NETWORKS, 2025, 31 (03) : 2667 - 2683
  • [46] Energy-Efficient Federated Learning in IoT Networks
    Kong, Deyi
    You, Zehua
    Chen, Qimei
    Wang, Juanjuan
    Hu, Jiwei
    Xiong, Yunfei
    Wu, Jing
    SMART COMPUTING AND COMMUNICATION, 2022, 13202 : 26 - 36
  • [47] Federated Learning for IoT Networks: Enhancing Efficiency and Privacy
    Zahri, Sofia
    Bennouri, Hajar
    Chehri, Abdellah
    Abdelmoniem, Ahmed M.
    2023 IEEE 9TH WORLD FORUM ON INTERNET OF THINGS, WF-IOT, 2023,
  • [48] Federated Deep Learning for Intrusion Detection in IoT Networks
    Belarbi, Othmane
    Spyridopoulos, Theodoros
    Anthi, Eirini
    Mavromatis, Ioannis
    Carnelli, Pietro
    Khan, Aftab
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 237 - 242
  • [49] Explainable Federated Learning for Botnet Detection in IoT Networks
    Kalakoti, Rajesh
    Bahsi, Hayretdin
    Nomm, Sven
    2024 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE, CSR, 2024, : 22 - 29
  • [50] Federated Learning Meets Human Emotions: A Decentralized Framework for Human-Computer Interaction for IoT Applications
    Chhikara, Prateek
    Singh, Prabhjot
    Tekchandani, Rajkumar
    Kumar, Neeraj
    Guizani, Mohsen
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (08) : 6949 - 6962