Anomaly Detection from Distributed Data Sources via Federated Learning

被引：3

作者：

Cavallin, Florencia ^{[1
]}

Mayer, Rudolf ^{[1
,2
]}

机构：

[1] SBA Res, Vienna, Austria

[2] Vienna Univ Technol, Vienna, Austria

来源：

ADVANCED INFORMATION NETWORKING AND APPLICATIONS, AINA-2022, VOL 2 | 2022年 / 450卷

基金：

欧盟地平线“2020”;

关键词：

Federated Machine Learning; Anomaly detection;

D O I：

10.1007/978-3-030-99587-4_27

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Anomaly detection is an important task to identify rare events such as fraud, intrusions, or medical diseases. However, it often needs to be applied on personal or otherwise sensitive data, e.g. business data. This gives rise to concerns regarding the protection of the sensitive data, especially if it is to be analysed by third parties, e.g. in collaborative settings, where data is collected by different entities, but shall be analysed together to benefit from more effective models. Besides various approaches for e.g. data anonymisation, one approach for privacy-preserving data mining is Federated Learning - especially in settings where data is collected in several distributed locations. A common, global model is obtained by aggregating models trained locally on each data source, while the training data remains at the source. Therefore, data privacy and machine learning can coexist in a decentralised system. While Federated Learning has been studied for several machine learning settings, such as classification, it is still rather unexplored for anomaly detection tasks. As anomalies are rare, they are not picked up easily by a detection method, and the representation in the model dedicated to recognise them might be lost during model aggregation. In this paper, we thus study anomaly detection task on two different benchmark datasets, in supervised, semi-supervised, and unsupervised settings. We federate Multi-Layer Perceptrons, Gaussian Mixture Models, and Isolation Forests, and compare them to a centralised approach.

引用

页码：317 / 328

页数：12