共 50 条
[21]
MAVT-FG: Multimodal Audio-Visual Transformer for Weakly-supervised Fine-Grained Recognition
[J].
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022,
2022,
:3811-3819
[22]
Audio-Visual Weakly Supervised Approach for Apathy Detection in the Elderly
[J].
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN),
2020,
[24]
Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception
[J].
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2023,
:18827-18836
[25]
Exploring Cross-Video and Cross-Modality Signals for Weakly-Supervised Audio-Visual Video Parsing
[J].
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021),
2021, 34
[27]
SELF-SUPERVISED AUDIO-VISUAL CO-SEGMENTATION
[J].
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),
2019,
:2357-2361
[28]
Modality-Aware Contrastive Instance Learning with Self-Distillation for Weakly-Supervised Audio-Visual Violence Detection
[J].
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022,
2022,
:6278-6287
[30]
Weakly-Supervised Text Instance Segmentation
[J].
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023,
2023,
:1915-1923