SPAct: Self-supervised Privacy Preservation for Action Recognition

被引:39
作者
Dave, Ishan Rajendrakumar [1 ]
Chen, Chen [1 ]
Shah, Mubarak [1 ]
机构
[1] Univ Cent Florida, Ctr Res Comp Vis, Orlando, FL 32816 USA
来源
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年
关键词
D O I
10.1109/CVPR52688.2022.01953
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual private information leakage is an emerging key issue for the fast growing applications of video understanding like activity recognition. Existing approaches for mitigating privacy leakage in action recognition require privacy labels along with the action labels from the video dataset. However, annotating frames of video dataset for privacy labels is not feasible. Recent developments of self-supervised learning (SSL) have unleashed the untapped potential of the unlabeled data. For the first time, we present a novel training framework which removes privacy information from input video in a self-supervised manner without requiring privacy labels. Our training framework consists of three main components: anonymization function, self-supervised privacy removal branch, and action recognition branch. We train our framework using a minimax optimization strategy to minimize the action recognition cost function and maximize the privacy cost function through a contrastive self-supervised loss. Employing existing protocols of knownaction and privacy attributes, our framework achieves a competitive action-privacy trade-off to the existing state-of-the-art supervised methods. In addition, we introduce a new protocol to evaluate the generalization of learned the anonymization function to novel-action and privacy attributes and show that our self-supervised framework outperforms existing supervised methods.
引用
收藏
页码:20132 / 20141
页数:10
相关论文
共 47 条
[1]  
[Anonymous], 2021, ICML
[2]   The Privacy-Utility Tradeoff for Remotely Teleoperated Robots [J].
Butler, Daniel J. ;
Huang, Justin ;
Roesner, Franziska ;
Cakmak, Maya .
PROCEEDINGS OF THE 2015 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION (HRI'15), 2015, :27-34
[3]   A Vision-Based System for Monitoring Elderly People at Home [J].
Buzzelli, Marco ;
Albe, Alessio ;
Ciocca, Gianluigi .
APPLIED SCIENCES-BASEL, 2020, 10 (01)
[4]  
Chen T., 2020, ICML
[5]  
Chou E., 2018, ARXIV181109950
[6]  
Dai J, 2015, IEEE IMAGE PROC, P4238, DOI 10.1109/ICIP.2015.7351605
[7]   GabriellaV2: Towards better generalization in surveillance videos for Action Detection [J].
Dave, Ishan ;
Scheffer, Zacchaeus ;
Kumar, Akash ;
Shiraz, Sarah ;
Rawat, Yogesh Singh ;
Shah, Mubarak .
2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, :122-132
[8]   TCLR: Temporal contrastive learning for video representation [J].
Dave, Ishan ;
Gupta, Rohit ;
Rizve, Mamshad Nayeem ;
Shah, Mubarak .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 219
[9]   Large Scale Holistic Video Understanding [J].
Diba, Ali ;
Fayyaz, Mohsen ;
Sharma, Vivek ;
Paluri, Manohar ;
Gall, Jurgen ;
Stiefelhagen, Rainer ;
Van Gool, Luc .
COMPUTER VISION - ECCV 2020, PT V, 2020, 12350 :593-610
[10]   A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning [J].
Feichtenhofer, Christoph ;
Fan, Haoqi ;
Xiong, Bo ;
Girshick, Ross ;
He, Kaiming .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :3298-3308