共 76 条
- [51] Video Modeling with Correlation Networks [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 349 - 358
- [52] Wang J., 2020, arXiv
- [53] TDN: Temporal Difference Networks for Efficient Action Recognition [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1895 - 1904
- [54] Temporal Segment Networks: Towards Good Practices for Deep Action Recognition [J]. COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 20 - 36
- [55] Appearance-and-Relation Networks for Video Classification [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1430 - 1439
- [56] Videos as Space-Time Region Graphs [J]. COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 413 - 431
- [57] Non-local Neural Networks [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7794 - 7803
- [58] Wang Z., 2022, WACV, P1819
- [59] Structured Triplet Learning with POS-tag Guided Attention for Visual Question Answering [J]. 2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 1888 - 1896
- [60] ACTION-Net: Multipath Excitation for Action Recognition [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13209 - 13218