共 52 条
- [1] MS-TCN: Multi-Stage Temporal Convolutional Network for Action Segmentation [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3570 - 3579
- [2] Diagnosing Error in Temporal Action Detectors [J]. COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 264 - 280
- [3] VQA: Visual Question Answering [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2425 - 2433
- [4] Soft-NMS - Improving Object Detection With One Line of Code [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5562 - 5570
- [5] Heilbron FC, 2015, PROC CVPR IEEE, P961, DOI 10.1109/CVPR.2015.7298698
- [6] Carion N., 2020, EUROPEAN C COMPUTER, V12346, P213, DOI 10.1007/978-3-030-58452-8_13
- [7] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4724 - 4733
- [8] Dynamic Convolution: Attention over Convolution Kernels [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 11027 - 11036
- [9] Cheng B., 2021, Per-pixel classification is not all you need for semantic segmentation, V34
- [10] Dosovitskiy A., 2021, P INT C LEARN REPR, P11929