Normalizing Flows for Human Pose Anomaly Detection

Cited by: 16
Authors
Hirschorn, Or [1]
Avidan, Shai [1]
Affiliations
[1] Tel Aviv Univ, IL-69978 Tel Aviv, Israel
Source
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023
DOI
10.1109/ICCV51070.2023.01246
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Video anomaly detection is an ill-posed problem because it relies on many parameters such as appearance, pose, camera angle, background, and more. We distill the problem to anomaly detection of human pose, thus decreasing the risk of nuisance parameters such as appearance affecting the result. Focusing on pose alone also has the side benefit of reducing bias against distinct minority groups. Our model works directly on human pose graph sequences and is exceptionally lightweight (~1K parameters), capable of running on any machine able to run the pose estimation with negligible additional resources. We leverage the highly compact pose representation in a normalizing flows framework, which we extend to tackle the unique characteristics of spatio-temporal pose data and show its advantages in this use case. The algorithm is quite general and can handle training data of only normal examples as well as a supervised setting that consists of labeled normal and abnormal examples. We report state-of-the-art results on two anomaly detection benchmarks: the unsupervised ShanghaiTech dataset and the recent supervised UBnormal dataset. Code is available at https://github.com/orhir/STG-NF.
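Illustrative sketch (not the authors' implementation): the abstract describes scoring pose data with a normalizing flow. The general idea can be shown with a single RealNVP-style affine coupling layer that maps an input to a latent vector, evaluates a standard normal density there, and adds the log-determinant of the Jacobian. The negative log-likelihood then serves as an anomaly score. Everything below is an assumption made for illustration: the function names, the 8-dimensional flattened pose feature, and the random linear maps standing in for learned networks. The paper's model operates on spatio-temporal pose graphs, which this sketch does not attempt to reproduce.

```python
import numpy as np

def coupling_forward(x, w_s, w_t):
    """One affine coupling step: returns the latent z and log|det J|."""
    d = x.shape[-1] // 2
    x_a, x_b = x[..., :d], x[..., d:]
    s = np.tanh(x_a @ w_s)           # bounded log-scale, depends on x_a only
    t = x_a @ w_t                    # shift, depends on x_a only
    z_b = x_b * np.exp(s) + t        # transform the second half
    log_det = s.sum(axis=-1)         # Jacobian is triangular, so this is exact
    return np.concatenate([x_a, z_b], axis=-1), log_det

def coupling_inverse(z, w_s, w_t):
    """Exact inverse of coupling_forward."""
    d = z.shape[-1] // 2
    z_a, z_b = z[..., :d], z[..., d:]
    s = np.tanh(z_a @ w_s)
    t = z_a @ w_t
    x_b = (z_b - t) * np.exp(-s)
    return np.concatenate([z_a, x_b], axis=-1)

def log_likelihood(x, w_s, w_t):
    """log p(x) = log N(z; 0, I) + log|det J| (change of variables)."""
    z, log_det = coupling_forward(x, w_s, w_t)
    dim = z.shape[-1]
    log_pz = -0.5 * (z ** 2).sum(axis=-1) - 0.5 * dim * np.log(2 * np.pi)
    return log_pz + log_det

rng = np.random.default_rng(0)
dim = 8                                   # hypothetical flattened pose feature size
w_s = 0.1 * rng.standard_normal((dim // 2, dim // 2))   # stand-in for learned weights
w_t = 0.1 * rng.standard_normal((dim // 2, dim // 2))   # stand-in for learned weights
poses = rng.standard_normal((5, dim))     # five synthetic "pose" vectors
scores = -log_likelihood(poses, w_s, w_t) # higher score = less likely = more anomalous
```

In a trained model the weights would be fit by maximizing the log-likelihood of normal examples, so that unusual inputs receive low likelihood and therefore high scores.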
Pages: 13499-13508
Page count: 10