Contextual anomaly detection on time series: a case study of metro ridership analysis

Cited by: 10
Authors
Pasini, Kevin [1, 2]
Khouadjia, Mostepha [1]
Same, Allou [2]
Trepanier, Martin [3]
Oukhellou, Latifa [2]
Affiliations
[1] Inst Rech Technol IRT SystemX, Paris, France
[2] Univ Gustave Eiffel, Cosys Grettia, Champs Sur Marne, France
[3] Polytech Montreal, Ctr Interuniv Rech Sur Reseaux Entreprise Logist, Montreal, PQ, Canada
Keywords
Contextual anomaly detection; Forecasting; Machine learning; Multivariate time series; Recurrent neural network; Regression; Model
DOI
10.1007/s00521-021-06455-z
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The growing volume of data collected in the transport domain can greatly benefit mobility studies and yield high-value mobility information for passengers, data analysts, and transport operators. This work addresses the detection of the impact of disturbances on a transport network. Using smart card data, it aims to quantify precisely the impact of known disturbances on network usage and to reveal unexplained statistical anomalies that may be related to unknown disturbances. The mobility data take the form of a multivariate time series evolving in a dynamic environment, with additional contextual attributes. The research focuses on contextual anomaly detection using machine learning models. The main goal is to build a robust anomaly score that highlights statistical anomalies (contextual extrema) while accounting for the variability that the dynamic context induces in the time series. The score is built from forecasting residuals normalized by the estimated contextual variance. This normalization matters because the flexible transportation schedule, the variability in transport demand, and contextual factors such as station location and calendar information induce complex dynamics in both the mean and the variance of the ridership series, so an anomaly detection approach must account for both to produce a reliable score. We investigate several prediction models (including an LSTM encoder-decoder, a recurrent neural network architecture) and several variance estimators, either obtained through dedicated models or extracted from the prediction models. The proposed approaches are evaluated on synthetic data and on real smart card ridership data from the Quebec Metro network, which include a record of events and disturbances that have affected the transport network. The experiments show the relevance of normalizing prediction residuals by the variance to build a robust anomaly score under a dynamic context.
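The idea of a variance-normalized residual can be illustrated with a minimal sketch. It is not the authors' method: the abstract mentions learned prediction models and variance estimators, whereas this example assumes a single hypothetical context variable (hour of day) and swaps in a per-context median and a MAD-based scale as stand-ins for the forecast and the contextual standard deviation. All names and the synthetic data are assumptions made for illustration.

```python
import numpy as np


def contextual_anomaly_score(y, context, eps=1e-8):
    """Residuals divided by a per-context scale estimate.

    y       : 1-D array of observations (e.g. ridership counts)
    context : 1-D integer array, same length (hypothetical: hour of day)
    The per-context median stands in for a forecast, and 1.4826 * MAD
    stands in for the estimated contextual standard deviation.
    """
    y = np.asarray(y, dtype=float)
    context = np.asarray(context)
    score = np.zeros_like(y)
    for c in np.unique(context):
        mask = context == c
        center = np.median(y[mask])
        scale = 1.4826 * np.median(np.abs(y[mask] - center))
        score[mask] = (y[mask] - center) / (scale + eps)
    return score


# Synthetic series: 60 days of hourly data whose mean AND variance
# both depend on the hour of day (assumed shapes, illustration only).
rng = np.random.default_rng(0)
hours = np.tile(np.arange(24), 60)
mean_by_hour = 200 + 150 * np.sin(2 * np.pi * (np.arange(24) - 6) / 24)
std_by_hour = 2 + 0.2 * (mean_by_hour - mean_by_hour.min())
y = mean_by_hour[hours] + std_by_hour[hours] * rng.standard_normal(hours.size)

# Inject one anomaly in the quietest hour: large relative to that hour's
# variance, small relative to the noise seen at peak hours.
idx = 24 * 30  # day 30, hour 0
y[idx] = mean_by_hour[0] + 12 * std_by_hour[0]

raw = y - mean_by_hour[hours]          # unnormalized residuals
score = contextual_anomaly_score(y, hours)

print("largest |raw residual| at index:", int(np.argmax(np.abs(raw))))
print("largest |normalized score| at index:", int(np.argmax(np.abs(score))))
```

In this toy setting the injected point is hidden among the raw residuals, because peak-hour noise is much larger in absolute terms, but it stands out once each residual is divided by the scale estimated for its own context.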
Pages: 1483-1507
Page count: 25