Coincident learning for unsupervised anomaly detection of scientific instruments

被引:1
作者
Humble, Ryan [1 ]
Zhang, Zhe [2 ]
OShea, Finn [2 ]
Darve, Eric [1 ,3 ]
Ratner, Daniel [2 ]
机构
[1] Stanford Univ, Inst Comptuat & Math Engn, Stanford, CA 94305 USA
[2] SLAC Natl Accelerator Lab, Machine Learning Dept, Menlo Pk, CA 94025 USA
[3] Stanford Univ, Dept Mech Engn, Stanford, CA 94305 USA
来源
MACHINE LEARNING-SCIENCE AND TECHNOLOGY | 2024年 / 5卷 / 03期
关键词
anomaly detection; machine learning for physical sciences; unsupervised learning; OUTLIER DETECTION; NETWORK;
D O I
10.1088/2632-2153/ad64a6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anomaly detection is an important task for complex scientific experiments and other complex systems (e.g. industrial facilities, manufacturing), where failures in a sub-system can lead to lost data, poor performance, or even damage to components. While scientific facilities generate a wealth of data, labeled anomalies may be rare (or even nonexistent), and expensive to acquire. Unsupervised approaches are therefore common and typically search for anomalies either by distance or density of examples in the input feature space (or some associated low-dimensional representation). This paper presents a novel approach called coincident learning for anomaly detection (CoAD), which is specifically designed for multi-modal tasks and identifies anomalies based on coincident behavior across two different slices of the feature space. We define an unsupervised metric, F<^>beta, out of analogy to the supervised classification F beta statistic. CoAD uses F<^>beta to train an anomaly detection algorithm on unlabeled data, based on the expectation that anomalous behavior in one feature slice is coincident with anomalous behavior in the other. The method is illustrated using a synthetic outlier data set and a MNIST-based image data set, and is compared to prior state-of-the-art on two real-world tasks: a metal milling data set and our motivating task of identifying RF station anomalies in a particle accelerator.
引用
收藏
页数:26
相关论文
共 65 条
[1]  
Agogino A., 2007, NASA Ames Prognostics Data Repository
[2]  
Akaho S., 2001, Int. Meet. of Psychometric Soc
[3]   A Review of Local Outlier Factor Algorithms for Outlier Detection in Big Data Streams [J].
Alghushairy, Omar ;
Alsini, Raed ;
Soule, Terence ;
Ma, Xiaogang .
BIG DATA AND COGNITIVE COMPUTING, 2021, 5 (01) :1-24
[4]  
Andrew G., 2013, P 30 INT C MACH LEAR, P1247
[5]  
[Anonymous], 1999, Statistical analysis in climate research
[6]  
Bachman P, 2019, ADV NEUR IN, V32
[7]  
BRETHERTON CS, 1992, J CLIMATE, V5, P541, DOI 10.1175/1520-0442(1992)005<0541:AIOMFF>2.0.CO
[8]  
2
[9]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[10]  
Challu C, 2022, PR MACH LEARN RES, V151