50 results in total
- [1] Cross-modal Embeddings for Video and Audio Retrieval. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132: 711-716
- [3] Cross-Modal Audio-Text Retrieval via Sequential Feature Augmentation. 2023 2ND ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING, CACML 2023, 2023: 298-304
- [4] Synchronising audio and ultrasound by learning cross-modal embeddings. INTERSPEECH 2019, 2019: 4100-4104
- [5] Cross-modal retrieval of scripted speech audio. MULTIMEDIA COMPUTING AND NETWORKING 1998, 1997, 3310: 226-235
- [6] Speaker identification based text to audio alignment for an audio retrieval system. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997: 1099-1102
- [7] Improving Audio-Text Retrieval via Hierarchical Cross-Modal Interaction and Auxiliary Captions. INTERSPEECH 2023, 2023: 341-345
- [10] Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021: 596-600