A Novel Real-Time Speech Summarizer System for the Learning of Sustainability

被引:1
|
作者
Wang, Hsiu-Wen [1 ]
Cheng, Ding-Yuan [2 ]
Chen, Chi-Hua [1 ,3 ,4 ]
Wu, Yu-Rou [1 ]
Lo, Chi-Chun [1 ]
Lin, Hui-Fei [4 ]
机构
[1] Natl Chiao Tung Univ, Dept Informat Management & Finance, Hsinchu 300, Taiwan
[2] Hwa Hsia Inst Technol, Dept Informat Management, Zhonghe Dist 235, New Taipei, Taiwan
[3] Chunghwa Telecom Co Ltd, Telecommun Labs, Yangmei City 326, Taoyuan County, Taiwan
[4] Natl Chiao Tung Univ, Dept Commun & Technol, Hsinchu 300, Taiwan
关键词
SERVICE SYSTEM; INFORMATION; DESIGN; EXTRACTION; RETRIEVAL; TEXT;
D O I
10.3390/su7043885
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
As the number of speech and video documents increases on the Internet and portable devices proliferate, speech summarization becomes increasingly essential. Relevant research in this domain has typically focused on broadcasts and news; however, the automatic summarization methods used in the past may not apply to other speech domains (e.g., speech in lectures). Therefore, this study explores the lecture speech domain. The features used in previous research were analyzed and suitable features were selected following experimentation; subsequently, a three-phase real-time speech summarizer for the learning of sustainability (RTSSLS) was proposed. Phase One involved selecting independent features (e.g., centrality, resemblance to the title, sentence length, term frequency, and thematic words) and calculating the independent feature scores; Phase Two involved calculating the dependent features, such as the position compared with the independent feature scores; and Phase Three involved comparing these feature scores to obtain weighted averages of the function-scores, determine the highest-scoring sentence, and provide a summary. In practical results, the accuracies of macro-average and micro-average for the RTSSLS were 70% and 73%, respectively. Therefore, using a RTSSLS can enable users to acquire key speech information for the learning of sustainability.
引用
收藏
页码:3885 / 3899
页数:15
相关论文
共 50 条
  • [21] Real-time pitch modification system for speech and singing voice
    Azarov, Elias
    Vashkevich, Maxim
    Likhachov, Denis
    Petrovsky, Alexander
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1070 - 1071
  • [22] DESIGN OF A REAL-TIME FRENCH TEXT-TO-SPEECH SYSTEM
    OSHAUGHNESSY, D
    SPEECH COMMUNICATION, 1984, 3 (03) : 233 - 243
  • [23] Real-Time Communication Aid System for Korean Dysarthric Speech
    Park, Kwanghyun
    Hong, Jungpyo
    APPLIED SCIENCES-BASEL, 2025, 15 (03):
  • [24] MINI-BASED SYSTEM SYNTHESIZES SPEECH IN REAL-TIME
    不详
    ELECTRONICS, 1978, 51 (18): : 71 - 72
  • [25] Real-time robot audition system that recognizes simultaneous speech in the real world
    Yamamoto, Shun'ichi
    Nakadai, Kazuhiro
    Nakano, Mikio
    Tsujino, Hiroshi
    Valin, Jean-Marc
    Komatani, Kazunori
    Ogata, Tetsuya
    Okuno, Hiroshi G.
    2006 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-12, 2006, : 5333 - +
  • [26] Learning Continuous Facial Actions From Speech for Real-Time Animation
    Pham, Hai X.
    Wang, Yuting
    Pavlovic, Vladimir
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (03) : 1567 - 1580
  • [27] A Novel Approach to Noise Reduction and Real-Time Enhancement of Speech Synthesis
    Rafieee, M. Saadeq
    Khazaei, Ali Akbar
    2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, COMMUNICATION SYSTEMS AND NETWORKS (CICSYN), 2010, : 250 - 255
  • [28] A Novel Real-Time Fall Detection System Based on Real-Time Video and Mobile Phones
    Tong, Chao
    Lian, Yu
    Zhang, Yang
    Xie, Zhongyu
    Long, Xiang
    Niu, Jianwei
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2017, 26 (04)
  • [29] A Novel Real-Time Text-to-Speech System Using Raspberry Pi for Assisting the Visually Impaired
    Ben Atitallah, Ahmed
    Kammoun, Manel
    Atitallah, Mohamed Amin Ben
    Albekairi, Mohammed
    Said, Yahia
    Boudabous, Anis
    Kaaniche, Khaled
    Atri, Mohamed
    TRAITEMENT DU SIGNAL, 2024, 41 (06) : 3183 - 3192
  • [30] Real-time speech-to-speech translation for PDAs
    Prasad, R.
    Krstovski, K.
    Choi, F.
    Saleem, S.
    Natarajan, P.
    Decerbo, M.
    Stallard, D.
    2007 IEEE INTERNATIONAL CONFERENCE ON PORTABLE INFORMATION DEVICES, 2007, : 95 - 99