50 entries in total
- [1] Cross-modal Manifold Cutmix for Self-supervised Video Representation Learning 2023 18TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, MVA, 2023,
- [2] Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9723 - 9732
- [4] SELF-SUPERVISED LEARNING WITH CROSS-MODAL TRANSFORMERS FOR EMOTION RECOGNITION 2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 381 - 388
- [6] Self-Supervised Visual Representations for Cross-Modal Retrieval ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 182 - 186
- [7] POSITIVE AND NEGATIVE SAMPLING STRATEGIES FOR SELF-SUPERVISED LEARNING ON AUDIO-VIDEO DATA 2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 545 - 549
- [8] CCMA: CapsNet for audio-video sentiment analysis using cross-modal attention VISUAL COMPUTER, 2025, 41 (03): : 1609 - 1620
- [9] Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15470 - 15479
- [10] Learning Mutual Modulation for Self-supervised Cross-Modal Super-Resolution COMPUTER VISION, ECCV 2022, PT XIX, 2022, 13679 : 1 - 18