Segmentation and its real-world applications in speech processing

被引:0
|
作者
Sattar, Farook [1 ]
Nilsson, Mikael [2 ]
Claesson, Ingvar [2 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Nanyang Ave, Singapore 639798, Singapore
[2] Blekinge Inst Technol, Sch Engn, SE-37225 Ronneby, Sweden
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The speech segmentation problem can be formulated as estimating the locations and durations of speech and non-speech components of the measured speech data. In this paper, a new time-scale transform based segmentation method and one of its important application in speech processing, are presented. The proposed scheme is tested on a number of recorded speech data. The preliminary results are shown quite promising. It is found that the method is able to extract the speech components(i.e. active intervals) from non-speech components (i.e. inactive intervals) effectively. The method is, therefore, successfully applied to insert selective pauses in the speech before delivering in the reverberant environment and improve the quality/intelligibility of the delivered speech.
引用
收藏
页码:788 / +
页数:2
相关论文
共 50 条
  • [1] Special section on advances in modeling for real-world speech information processing and its application
    Yamashita, Yoichi
    IEICE Transactions on Information and Systems, 2014, e97 (06)
  • [3] A speech translation system applied to a real-world task/domain and its evaluation using real-world speech data
    Nakamura, A
    Naito, M
    Tsukada, H
    Gruhn, R
    Sumita, E
    Kashioka, N
    Nakajima, H
    Shimizu, T
    Sagisaka, Y
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (01): : 142 - 154
  • [4] Special Section on Advances in Modeling for Real-world Speech Information Processing and its Application FOREWORD
    Yamashita, Yoichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (06): : 1402 - 1402
  • [5] Foreword: Special section on advances in modeling for real-world speech information processing and its application
    Yamashita, Y., 1600, Institute of Electronics, Information and Communication, Engineers, IEICE (E97-D):
  • [6] Auditory processing of speech signals for robust speech recognition in real-world noisy environments
    Kim, DS
    Lee, SY
    Kil, RM
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (01): : 55 - 69
  • [7] REAL-WORLD APPLICATIONS
    SMITH, M
    COMMUNICATIONS OF THE ACM, 1992, 35 (07) : 20 - &
  • [8] Advances in Smart Hangar and Its Real-world Applications
    Shin, Hye-Jin
    Hong, Seung-Chan
    Truong, Chung Thanh
    Lee, Jung-Ryul
    STRUCTURAL HEALTH MONITORING 2015: SYSTEM RELIABILITY FOR VERIFICATION AND IMPLEMENTATION, VOLS. 1 AND 2, 2015, : 2505 - 2512
  • [9] ITS A REAL REAL REAL-WORLD
    EVANS, RA
    IEEE TRANSACTIONS ON RELIABILITY, 1994, 43 (04) : 550 - 550
  • [10] Scikit-talk: A toolkit for processing real-world conversational speech data
    Liesenfeld, Andreas
    Parti, Gabor
    Huang, Chu-Ren
    SIGDIAL 2021: 22ND ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2021), 2021, : 252 - 256