共 82 条
[21]
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6546-6555
[22]
Align and Attend: Multimodal Summarization with Dual Contrastive Losses
[J].
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2023,
:14867-14878
[23]
Masked Autoencoders Are Scalable Vision Learners
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022),
2022,
:15979-15988
[24]
Momentum Contrast for Unsupervised Visual Representation Learning
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020),
2020,
:9726-9735
[25]
Deep Residual Learning for Image Recognition
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:770-778
[26]
He X., 2019, P ACM INT C MULT ACM
[27]
Deep attentive and semantic preserving video summarization
[J].
NEUROCOMPUTING,
2020, 405
:200-207
[29]
Jung Y., 2020, P EUR C COMP VIS ECC
[30]
Jung Y., 2019, P C ART INT AAAI HON