66 entries in total
- [1] Barkani F., Hamidi M., Laaidi N., Zealouk O., Satori H., Satori K., Amazigh speech recognition based on the Kaldi ASR toolkit, Int J Inf Technol, pp. 1-8, (2023)
- [2] Hwang I., Chang J.H., End-to-end speech endpoint detection utilizing acoustic and language modeling knowledge for online low-latency speech recognition, IEEE Access, 8, pp. 161109-161123, (2020)
- [3] Aytar Y., Vondrick C., SoundNet: Learning Sound Representations from Unlabeled Video, (2016)
- [4] Basbug A.M., Sert M., Analysis of deep neural network models for acoustic scene classification, 27th Signal Processing and Communications Applications Conference, SIU 2019, (2019)
- [5] Chen L., Zheng X., Zhang C., Guo L., Yu B., Multi-scale temporal-frequency attention for music source separation, Proceedings - IEEE International Conference on Multimedia and Expo, 2022-July, (2022)
- [6] Mak M.W., Yu H.B., A study of voice activity detection techniques for NIST speaker recognition evaluations, Comput Speech Lang, 28, pp. 295-313, (2014)
- [7] Mousazadeh S., Cohen I., Voice activity detection in presence of transient noise using spectral clustering, IEEE Trans Audio Speech Lang Process, 21, pp. 1261-1271, (2013)
- [8] Liu B., Hoffmeister B., Rastrow A., Accurate Endpointing with Expected Pause Duration, (2015)
- [9] Maas R., Rastrow A., Goehner K., Tiwari G., Joseph S., Domain-Specific Utterance End-Point Detection for Speech Recognition, (2017)
- [10] Maas R., Rastrow A., Ma C., Lan G., Goehner K., Tiwari G., Joseph S., Hoffmeister B., Combining acoustic embeddings and decoding features for end-of-utterance detection in real-time far-field speech recognition systems, ieeexplore.ieee.org