Normalizing Flows for Human Pose Anomaly Detection

被引：16

作者：

Hirschorn, Or ^{[1
]}

Avidan, Shai ^{[1
]}

机构：

[1] Tel Aviv Univ, IL-69978 Tel Aviv, Israel

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年

关键词：

D O I：

10.1109/ICCV51070.2023.01246

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Video anomaly detection is an ill-posed problem because it relies on many parameters such as appearance, pose, camera angle, background, and more. We distill the problem to anomaly detection of human pose, thus decreasing the risk of nuisance parameters such as appearance affecting the result. Focusing on pose alone also has the side benefit of reducing bias against distinct minority groups. Our model works directly on human pose graph sequences and is exceptionally lightweight (similar to 1K parameters), capable of running on any machine able to run the pose estimation with negligible additional resources. We leverage the highly compact pose representation in a normalizing flows framework, which we extend to tackle the unique characteristics of spatio-temporal pose data and show its advantages in this use case. The algorithm is quite general and can handle training data of only normal examples as well as a supervised setting that consists of labeled normal and abnormal examples. We report state-of-the-art results on two anomaly detection benchmarks - the unsupervised ShanghaiTech dataset and the recent supervised UBnormal dataset. Code available at https://github.com/orhir/STG-NF.

引用

页码：13499 / 13508

页数：10

共 45 条

[1] Acsintoae Andra, 2022, P IEEE CVF C COMP VI
[2] Barbalau Antonio, 2022, SSMTL REVISITING SEL
[3] Racial categories in machine learning
Benthall, Sebastian
Haynes, Bruce D.
[J]. FAT*'19: PROCEEDINGS OF THE 2019 CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, 2019, : 289 - 298
[4] Bergman L., 2020, ARXIV200210445
[5] Bertasius G, 2021, PR MACH LEARN RES, V139
[6] Buet-golfouse Francois, 2022, FAccT '22: 2022 ACM Conference on Fairness, Accountability, and Transparency, P1399, DOI 10.1145/3531146.3533197
[7] Skeleton-Based Action Recognition with Shift Graph Convolutional Network
Cheng, Ke
Zhang, Yifan
He, Xiangyu
Chen, Weihan
Cheng, Jian
Lu, Hanqing
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 180 - 189
[8] Unsupervised video anomaly detection via normalizing flows with implicit latent features
Cho, MyeongAh
Kim, Taeoh
Kim, Woo Jin
Cho, Suhwan
Lee, Sangyoun
[J]. PATTERN RECOGNITION, 2022, 129
[9] Where are we with Human Pose Estimation in Real-World Surveillance?
Cormier, Mickael
Clepe, Aris
Specker, Andreas
Beyerer, Juergen
[J]. 2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, : 591 - 601
[10] Dinh L., 2016, Density estimation using real nvp

← 1 2 3 4 5 →