共 39 条
[2]
My lips are concealed: Audio-visual speech enhancement through obstructions
[J].
INTERSPEECH 2019,
2019,
:4295-4299
[3]
Deep Lip Reading: a comparison of models and an online application
[J].
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES,
2018,
:3514-3518
[4]
Acoustic beamforming for speaker diarization of meetings
[J].
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING,
2007, 15 (07)
:2011-2022
[5]
[Anonymous], 2019, ASRU
[6]
A comprehensive study of speech separation: spectrogram vs waveform separation
[J].
INTERSPEECH 2019,
2019,
:4574-4578
[7]
Chang XK, 2020, INT CONF ACOUST SPEE, P6134, DOI [10.1109/ICASSP40776.2020.9054029, 10.1109/icassp40776.2020.9054029]
[8]
Chang XK, 2019, 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), P237, DOI [10.1109/asru46091.2019.9003986, 10.1109/ASRU46091.2019.9003986]
[9]
Speaker-Targeted Audio-Visual Models for Speech Recognition in Cocktail-Party Environments
[J].
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES,
2016,
:2120-2124
[10]
Chen LW, 2019, INT CONF ACOUST SPEE, P705, DOI [10.1109/ICASSP.2019.8682470, 10.1109/icassp.2019.8682470]