57 entries in total
[31]
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 7834-7843
[33]
Morgado Pedro, 2018, NIPS
[34]
Moryossef Amit, 2020, YOUR FINGERTIPS AUTO
[35]
Nagrani Arsha, 2018, ARXIV
[36]
Audio-Visual Scene Analysis with Self-Supervised Multisensory Features [J]. COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210: 639-658
[37]
Ambient Sound Provides Supervision for Visual Learning [J]. COMPUTER VISION - ECCV 2016, PT I, 2016, 9905: 801-816
[38]
Perez Ethan, 2017, ARXIV
[39]
Raffel C., 2014, ISMIR
[40]
U-Net: Convolutional Networks for Biomedical Image Segmentation [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351: 234-241