共 54 条
[1]
Afouras Triantafyllos, 2020, Self-supervised learning of audio-visual objects from video
[2]
Alwassel Humam, 2020, ARXIV201111479
[3]
Heilbron FC, 2015, PROC CVPR IEEE, P961, DOI 10.1109/CVPR.2015.7298698
[4]
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:4724-4733
[5]
Rethinking the Faster R-CNN Architecture for Temporal Action Localization
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:1130-1139
[6]
Attention-based Dropout Layer for Weakly Supervised Object Localization
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:2214-2223
[7]
Deng Cheng, 2018, TIP
[8]
MIST: Multiple Instance Self-Training Framework for Video Anomaly Detection
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:14004-14013
[9]
Gong G., 2020, CVPR
[10]
Hong Fa-Ting, 2020, ECCV