Can Large Language Models be Anomaly Detectors for Time Series?

被引:1
作者
Alnegheimish, Sarah [1 ]
Nguyen, Linh [1 ]
Berti-Equille, Laure [2 ]
Veeramachaneni, Kalyan [1 ]
机构
[1] MIT, LIDS, Cambridge, MA 02139 USA
[2] IRD ESPACE DEV, Marseille, France
来源
2024 IEEE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, DSAA 2024 | 2024年
关键词
anomaly detection; time series; large language models;
D O I
10.1109/DSAA61799.2024.10722786
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The flexible nature of large language models allows them to be used for diverse applications. Recent studies have showcased numerous abilities of these models, including performing time series forecasting. In this paper, we present a novel study of large language models used for the challenging task of time series anomaly detection. This problem entails two novel aspects for LLMs specifically: first, the model needs to be able to identify part of an input sequence (or multiple parts) as anomalous; and second, the model needs to work with time series data rather than with text input. We introduce SIGLLM, a framework for time series anomaly detection using large language models. Our framework includes a time-series-to-text conversion module, as well as end-to-end pipelines that prompt language models to perform time series anomaly detection. We investigate two paradigms for testing the abilities of large language models to perform the detection task. First, we present a prompt-based detection method that directly asks a language model to indicate which elements of the input are anomalies. Second, we leverage the forecasting capability of a large language model to guide the anomaly detection process. We evaluated our framework on 11 datasets spanning various sources and 10 pipelines. We show that the forecasting method significantly outperformed the prompting method in all 11 datasets with respect to the F1 score. Moreover, while large language models are capable of finding anomalies, state-of-the-art deep learning models are still superior in performance, achieving 30% improvement.
引用
收藏
页码:218 / 227
页数:10
相关论文
共 34 条
[11]  
Daehyung Park, 2018, IEEE Robotics and Automation Letters, V3, P1544, DOI [10.1109/lra.2018.2801475, 10.1109/LRA.2018.2801475]
[12]  
Dooley S, 2023, Arxiv, DOI arXiv:2311.01933
[13]   TadGAN: Time Series Anomaly Detection Using Generative Adversarial Networks [J].
Geiger, Alexander ;
Liu, Dongyu ;
Alnegheimish, Sarah ;
Cuesta-Infante, Alfredo ;
Veeramachaneni, Kalyan .
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, :33-43
[14]  
Gruver Nate, 2023, ADV NEURAL INFORM PR
[15]  
Howard J, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, P328
[16]   Detecting Spacecraft Anomalies Using LSTMs and Nonparametric Dynamic Thresholding [J].
Hundman, Kyle ;
Constantinou, Valentino ;
Laporte, Christopher ;
Colwell, Ian ;
Soderstrom, Tom .
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :387-395
[17]  
Jiang AQ, 2023, Arxiv, DOI [arXiv:2310.06825, DOI 10.48550/ARXIV.2310.06825]
[18]  
Koh J. Y., 2023, Advances in Neural Information Processing Systems, V36, p21 487
[19]  
Liu TD, 2023, Arxiv, DOI arXiv:2305.14201
[20]  
Malhotra P, 2016, Arxiv, DOI arXiv:1607.00148