共 34 条
- [1] Audio Visual Scene-Aware Dialog [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7550 - 7559
- [2] Andrew G., 2013, PMLR, P1247
- [3] [Anonymous], 2016, NIPS
- [4] Look, Listen and Learn [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 609 - 617
- [5] Aytar Y., 2017, ARXIV170600932
- [6] Aytar Y, 2016, ADV NEUR IN, V29
- [7] Chen Z, 2017, INT CONF ACOUST SPEE, P246, DOI 10.1109/ICASSP.2017.7952155
- [8] Cho K., 2014, P SSST8 8 WORKSH SYN, P103, DOI 10.3115/v1/w14-4012
- [9] Learning to Separate Object Sounds by Watching Unlabeled Video [J]. COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 36 - 54
- [10] Gemmeke JF, 2017, INT CONF ACOUST SPEE, P776, DOI 10.1109/ICASSP.2017.7952261