Interactive Evolutionary Computation Improving Voice Impressions with Keeping Speaker Personality for Real-Time Speech

Authors
Fukumoto, Makoto [1]
Fukushima, Yuta [1]
Miyamoto, Taichi [2]
Affiliations
[1] Fukuoka Inst Technol, Fukuoka 8110295, Japan
[2] Fukuoka Inst Technol, Grad Sch Engn, Fukuoka 8110295, Japan
Source
COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT, CISIM 2024 | 2024 / Vol. 14902
Keywords
Interactive Evolutionary Computation; Genetic Algorithm; Voice Filter; Impression
DOI
10.1007/978-3-031-71115-2_24
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Meetings are now commonly held over the Internet. In such meetings, participants use background displays to improve the impression they make on other members. To similarly improve the impression of a user's voice over the Internet, this study proposes an Interactive Evolutionary Computation (IEC) method that adjusts a voice filter based on real-time speech while preserving the user's personality. The system was built using a Genetic Algorithm and Koigoe, a software voice filter. Listening experiments were conducted to evaluate the proposed IEC in terms of increasing fitness values and preserving the speaker's personality. The results showed that the proposed IEC has the potential to find a good parameter set for the voice filter; however, its performance needs improvement, because the best filter obtained did not surpass the impression of the original, unfiltered voice. Furthermore, the results of the evaluation experiment suggest that the proposed IEC preserves the user's personality.
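The general idea of an IEC built on a Genetic Algorithm can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not state which Koigoe parameters were tuned, the population size, or the genetic operators, so the genome of normalised filter parameters, the `rate` callback (standing in for a human listener's score), and all default values are assumptions made for this example.

```python
import random
from typing import Callable, List

# Hypothetical genome: voice-filter parameters, each normalised to [0, 1].
Genome = List[float]


def interactive_ga(rate: Callable[[Genome], float],
                   n_params: int = 3,
                   pop_size: int = 8,
                   generations: int = 5,
                   mutation_rate: float = 0.1,
                   seed: int = 0) -> Genome:
    """Return the best-rated genome seen; `rate` plays the human listener.

    Requires n_params >= 2 so that one-point crossover has a valid cut.
    """
    rng = random.Random(seed)
    population = [[rng.random() for _ in range(n_params)]
                  for _ in range(pop_size)]
    best: Genome = population[0][:]
    best_score = float("-inf")

    for _ in range(generations):
        # In a real IEC the listener hears each filtered voice and scores it.
        scored = sorted(((rate(g), g) for g in population),
                        key=lambda sg: sg[0], reverse=True)
        if scored[0][0] > best_score:
            best_score, best = scored[0][0], scored[0][1][:]

        # Truncation selection: the better-rated half become parents.
        parents = [g for _, g in scored[: pop_size // 2]]
        children = [scored[0][1][:]]  # elitism: carry the top genome over
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_params)  # one-point crossover
            child = a[:cut] + b[cut:]
            # Mutation: occasionally resample a parameter uniformly.
            child = [rng.random() if rng.random() < mutation_rate else x
                     for x in child]
            children.append(child)
        population = children

    return best


def simulated_listener(genome: Genome) -> float:
    """Stand-in for a human rating: prefers parameters near 0.5 (assumption)."""
    return -sum((x - 0.5) ** 2 for x in genome)


if __name__ == "__main__":
    print(interactive_ga(simulated_listener))
```

In an actual interactive setting, `rate` would play the filtered voice to the listener and return their subjective score, which is why the population size and generation count are kept small to limit listener fatigue. Preserving the speaker's personality is evaluated separately in the study and is not modelled in this sketch.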
Pages: 347-358
Page count: 12