共 22 条
[1]
Cooke M(2006)An audio-visual corpus for speech perception and automatic speech recognition J Acoust Soc Am 120 2421-2424
[2]
Barker J(2012)On dynamic stream weighting for audio-visual speech recognition IEEE Trans Audio Speech Lang Process 20 1145-1157
[3]
Cunningham S(2015)Tcd-timit: an audio-visual corpus of continuous speech IEEE Trans Multimed 17 603-615
[4]
Shao X(2005)Integration strategies for audio-visual speech processing: applied to text-dependent speaker recognition IEEE Trans Multimed 7 495-506
[5]
Estellers V(1976)Hearing lips and seeing voices Nature 264 746-748
[6]
Gurban M(2009)Lipreading with local spatiotemporal descriptors IEEE Trans Multimed 11 1254-1265
[7]
Thiran J(2014)A review of recent advances in visual speech decoding Image Vis Comput 32 590-605
[8]
Harte N(undefined)undefined undefined undefined undefined-undefined
[9]
Gillen E(undefined)undefined undefined undefined undefined-undefined
[10]
Lucey S(undefined)undefined undefined undefined undefined-undefined