共 56 条
[1]
Afouras T, 2018, INTERSPEECH, P3244
[3]
Look, Listen and Learn
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:609-617
[4]
Monoaural Audio Source Separation Using Deep Convolutional Neural Networks
[J].
LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2017),
2017, 10169
:258-266
[5]
Chen HL, 2020, INT CONF ACOUST SPEE, P721, DOI [10.1109/icassp40776.2020.9053174, 10.1109/ICASSP40776.2020.9053174]
[6]
Music Gesture for Visual Sound Separation
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020),
2020,
:10475-10484
[7]
Looking to Listen at the Cocktail Party: A Speaker-Independent Audio-Visual Model for Speech Separation
[J].
ACM TRANSACTIONS ON GRAPHICS,
2018, 37 (04)
[8]
Gabbay A, 2018, INTERSPEECH, P1170
[9]
Co-Separating Sounds of Visual Objects
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:3878-3887
[10]
Learning to Separate Object Sounds by Watching Unlabeled Video
[J].
COMPUTER VISION - ECCV 2018, PT III,
2018, 11207
:36-54