A Novel Real-Time Speech Summarizer System for the Learning of Sustainability

被引:1
|
作者
Wang, Hsiu-Wen [1 ]
Cheng, Ding-Yuan [2 ]
Chen, Chi-Hua [1 ,3 ,4 ]
Wu, Yu-Rou [1 ]
Lo, Chi-Chun [1 ]
Lin, Hui-Fei [4 ]
机构
[1] Natl Chiao Tung Univ, Dept Informat Management & Finance, Hsinchu 300, Taiwan
[2] Hwa Hsia Inst Technol, Dept Informat Management, Zhonghe Dist 235, New Taipei, Taiwan
[3] Chunghwa Telecom Co Ltd, Telecommun Labs, Yangmei City 326, Taoyuan County, Taiwan
[4] Natl Chiao Tung Univ, Dept Commun & Technol, Hsinchu 300, Taiwan
关键词
SERVICE SYSTEM; INFORMATION; DESIGN; EXTRACTION; RETRIEVAL; TEXT;
D O I
10.3390/su7043885
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
As the number of speech and video documents increases on the Internet and portable devices proliferate, speech summarization becomes increasingly essential. Relevant research in this domain has typically focused on broadcasts and news; however, the automatic summarization methods used in the past may not apply to other speech domains (e.g., speech in lectures). Therefore, this study explores the lecture speech domain. The features used in previous research were analyzed and suitable features were selected following experimentation; subsequently, a three-phase real-time speech summarizer for the learning of sustainability (RTSSLS) was proposed. Phase One involved selecting independent features (e.g., centrality, resemblance to the title, sentence length, term frequency, and thematic words) and calculating the independent feature scores; Phase Two involved calculating the dependent features, such as the position compared with the independent feature scores; and Phase Three involved comparing these feature scores to obtain weighted averages of the function-scores, determine the highest-scoring sentence, and provide a summary. In practical results, the accuracies of macro-average and micro-average for the RTSSLS were 70% and 73%, respectively. Therefore, using a RTSSLS can enable users to acquire key speech information for the learning of sustainability.
引用
收藏
页码:3885 / 3899
页数:15
相关论文
共 50 条
  • [1] Designing and Implementing a Real-Time Speech Summarizer System
    Cheng, Ding-Yuan
    Chen, Chi-Hua
    Wu, Yu-Rou
    Lo, Chi-Chun
    Lin, Hui-Fei
    2014 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2014), 2014, : 725 - 728
  • [2] REAL-TIME SPEECH SYNTHESIS SYSTEM
    AINSWORTH, WA
    IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1972, AU20 (05): : 397 - +
  • [3] A real-time speech quality improvement system
    Zhao, HA
    ETFA 2003: IEEE CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION, VOL 1, PROCEEDINGS, 2003, : 491 - 495
  • [4] A Real-Time Scene Text to Speech System
    Neumann, Lukas
    Matas, Jiri
    COMPUTER VISION - ECCV 2012, PT III, 2012, 7585 : 619 - 622
  • [5] Real-time speech synthesis system driven by visual speech
    Li, G
    Xie, GM
    Lin, L
    PROCEEDINGS OF THE THIRD INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION SCIENCE AND TECHNOLOGY, VOL 2, 2004, : 397 - 402
  • [6] Design of Japanese Speech Recognition and Real-Time Translation System Based on Deep Learning
    Zhang, Xuanxuan
    Lecture Notes in Electrical Engineering, 1243 LNEE : 227 - 235
  • [7] A REAL-TIME SPEECH DIALOG SYSTEM USING SPONTANEOUS SPEECH UNDERSTANDING
    TAKEBAYASHI, Y
    TSUBOI, H
    KANAZAWA, H
    SADAMOTO, Y
    HASHIMOTO, H
    SHINCHI, H
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1993, E76D (01) : 112 - 120
  • [8] Design and Evaluation of a Real-Time Speech Recognition System
    Shruthi, S.
    Yashaswi, G.
    Shruti, V
    Manikandan, J.
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 425 - 430
  • [9] VOXCOM - A SYSTEM FOR ANALYZING NATURAL SPEECH IN REAL-TIME
    ALPERT, M
    MEREWETHER, F
    HOMEL, P
    MARTZ, J
    LOMASK, M
    BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 1986, 18 (02): : 267 - 272
  • [10] Speech Recognition System for Embedded Real-time Applications
    Cheng, Octavian
    Abdulla, Waleed
    Salcic, Zoran
    2009 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2009), 2009, : 118 - 122