Bilateral Waveform Similarity Overlap-and-Add Based Packet Loss Concealment for Voice over IP

被引:4
作者
Yeh, J. F. [1 ]
Lin, P. C. [2 ]
Kuo, M. D. [1 ,3 ]
Hsu, Z. H. [1 ]
机构
[1] Natl Chiayi Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
[2] Far East Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
[3] Far East Univ, Dept Digital Design & Management, Taipei, Taiwan
关键词
Packet loss concealment; waveform similarity overlap-and-add; VoIP; speech communication; TIME-SCALE MODIFICATION; RECOVERY TECHNIQUES; SPEECH; ALGORITHM;
D O I
10.1016/S1665-6423(13)71563-3
中图分类号
学科分类号
摘要
This paper invested a bilateral waveform similarity overlap-and-add algorithm for voice packet lost. Since Packet lost will cause the semantic misunderstanding, it has become one of the most essential problems in speech communication. This investment is based on waveform similarity measure using overlap-and-Add algorithm and provides the bilateral information to enhance the speech signal reconstruction. Traditionally, it has been improved that waveform similarity overlap-and-add (WSOLA) technique is an effective algorithm to deal with packet loss concealment (PLC) for real-time time communication. WSOLA algorithm is widely applied to deal with the length adaptation and packet loss concealment of speech signal. Time scale modification of audio signal is one of the most essential research topics in data communication, especially in voice of IP (VoIP). Herein, the proposed the bilateral WSOLA (BWSOLA) that is derived from WSOLA. Instead of only exploitation one direction speech data, the proposed method will reconstruct the lost voice data according to the preceding and cascading data. The related algorithms have been developed to achieve the optimal reconstructing estimation. The experimental results show that the quality of the reconstructed speech signal of the bilateral WSOLA is much better compared to the standard WSOLA and GWSOLA on different packet loss rate and length using the metrics PESQ and MOS. The significant improvement is obtained by bilateral information and proposed method. The proposed bilateral waveform similarity overlap-and-add (BWSOLA) outperforms the traditional approaches especially in the long duration data loss.
引用
收藏
页码:559 / 567
页数:9
相关论文
共 26 条
  • [1] PACKET LOSS CONCEALMENT BASED ON EXTRAPOLATION OF SPEECH WAVEFORM
    Chen, Juin-Hwey
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4129 - 4132
  • [2] Time-scale modification of audio signals using enhanced WSOLA with management of transients
    Grofit, Shahaf
    Lavner, Yizhar
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (01): : 106 - 115
  • [3] Gu H. -Y., 2006, 18 C COMP LING SPEEC
  • [4] Adaptive time scale modification of speech for graceful degrading voice quality in congested networks for VoIP applications
    Ilk, HG
    Güler, S
    [J]. SIGNAL PROCESSING, 2006, 86 (01) : 127 - 139
  • [5] Ito A., 2012, 15 INT S WIR PERS MU, P489
  • [6] Sample-based engine noise synthesis using an enhanced pitch-synchronous overlap-and-add method
    Jagla, Jan
    Maillard, Julien
    Martin, Nadine
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (05) : 3098 - 3108
  • [7] A speech packet loss concealment method using linear prediction
    Kondo, K
    Nakagawa, K
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (02): : 806 - 813
  • [8] Li M., 2009, INT C WIR COMM NETW
  • [9] Liao WT, 2001, IEEE INFOCOM SER, P815, DOI 10.1109/INFCOM.2001.916272
  • [10] Packet Loss Concealment Using Adaptive Lattice Modeling
    Linenberg, Nadav
    Shallom, Ilan D.
    Wulich, Dov
    [J]. MELECON 2010: THE 15TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, 2010, : 378 - 382