共 18 条
- [1] Benyassine A., Shlomot E., Su H.Y., Et al., A robust low complexity voice activity detection algorithm for speech communication systems, Speech Coding for Telecommunications Proceeding. Pocono Manor, USA: IEEE, pp. 97-98, (1997)
- [2] Cho N., Kim E.K., Enhanced voice activity detection using acoustic event detection and classification, IEEE Transactions on Consumer Electronics, 57, 1, pp. 196-202, (2011)
- [3] Chang J.H., Kim N.S., Voice activity detection based on complex Laplacian model, Electronics Letters, 39, 7, pp. 632-634, (2003)
- [4] Ramirez J., Yelamos P., Gorriz J.M., Et al., SVM-based speech endpoint detection using contextual speech features, Institution of Engineering and Technology, 42, 7, pp. 426-428, (2006)
- [5] Zhang X.L., Wu J., Deep belief network based voice activity detection, Audio, Speech, and Language Processing, 21, 4, pp. 691-710, (2013)
- [6] Ghosh P.K., Tsiartas A., Narayanan S., Robust voice activity detection using long-term signal variability, IEEE Transactions on Audio Speech & Language Processing, 19, 3, pp. 600-613, (2011)
- [7] Salishev S., Barabanov A., Kocharov D., Et al., Voice activity detector (VAD) based on long-term Mel frequency band features, International Conference on Text, Speech, and Dialogue., pp. 352-358, (2016)
- [8] Zhou Q., Ma L., Zheng Z., Et al., Recurrent neural word segmentation with tag inference, (2016)
- [9] Has I.M., Sak, Senior A., Rao K., Et al., Learning acoustic frame labeling for speech recognition with recurrent neural networks, International Conference on Acoustics, Speech and Signal Processing. Brisbane, Australia: IEEE, pp. 4280-4284, (2015)
- [10] Zhang X.L., Wang D., Boosted deep neural networks and multi-resolution cochleagram features for voice activity detection, Speech and Signal Processing, pp. 6645-6649, (2014)