共 114 条
[21]
Fazel-Zarandi M., 2023, P ICASSP, P1
[22]
Fiscus JG, 2008, LECT NOTES COMPUT SC, V4625, P373
[25]
Graves A., 2012, Sequence transduction with recurrent neural networks
[26]
Graves Alex, 2006, P 23 INT C MACH LEAR, P369, DOI [10.1145/1143844.1143891, DOI 10.1145/1143844.1143891]
[27]
Conformer: Convolution-augmented Transformer for Speech Recognition
[J].
INTERSPEECH 2020,
2020,
:5036-5040
[28]
Multi-channel multi-speaker transformer for speech recognition
[J].
INTERSPEECH 2023,
2023,
:4918-4922
[29]
Hershey JR, 2016, INT CONF ACOUST SPEE, P31, DOI 10.1109/ICASSP.2016.7471631
[30]
Heymann J, 2017, INT CONF ACOUST SPEE, P5325, DOI 10.1109/ICASSP.2017.7953173