Interactive Evolutionary Computation Improving Voice Impressions with Keeping Speaker Personality for Real-Time Speech

被引：0

作者：

Fukumoto, Makoto ^{[1
]}

Fukushima, Yuta ^{[1
]}

Miyamoto, Taichi ^{[2
]}

机构：

[1] Fukuoka Inst Technol, Fukuoka 8110295, Japan

[2] Fukuoka Inst Technol, Grad Sch Engn, Fukuoka 8110295, Japan

来源：

COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT, CISIM 2024 | 2024年 / 14902卷

关键词：

Interactive Evolutionary Computation; Genetic Algorithm; Voice Filter; Impression;

D O I：

10.1007/978-3-031-71115-2_24

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently, we generally have meetings via the Internet. In this situation, we use background display to improve our impression of other members of the meetings. To improve the users' voice via the Internet, this study proposes an Interactive Evolutionary Computation (IEC) that adjusts the voice filter based on real-time pronunciations while keeping user's personality. The concrete system was constructed by employing a Genetic Algorithm and Koigoe, a software voice filter. The listening experiments were conducted to investigate the efficiencies of the proposed IEC from perspectives of increasing the fitness values and keeping the speaker's personality. The results showed that the proposed IEC has enough possibility to find a good parameter set of the voice filter; however, we need to improve its performance because the obtained best filter did not overcome the impression of the original voice without any filter. Furthermore, the proposed IEC could be considered to keep the user's personality based on the result of the evaluation experiment.

引用

页码：347 / 358

页数：12

共 50 条

[21] Use of real-time interactive voice response in a study of stress and alcohol consumption
Andersson, Claes
Gordh, Anna H. V. Soederpalm
Berglund, Mats
ALCOHOLISM-CLINICAL AND EXPERIMENTAL RESEARCH, 2007, 31 (11) : 1908 - 1912
[22] A real-time voice cloning system with multiple algorithms for speech quality improvement
Hu, Weixin
Zhu, Xianyou
PLOS ONE, 2023, 18 (04):
[23] Real-Time Semi-Blind Speech Extraction with Speaker Direction Tracking on Kinect
Onuma, Yuji
Kamado, Noriyoshi
Saruwatari, Hiroshi
Shikano, Kiyohiro
2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
[24] Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation
Lyu, Ke-Ming
Lyu, Ren-yuan
Chang, Hsien-Tsung
PEERJ COMPUTER SCIENCE, 2024, 10
[25] Real-time End-to-End Monaural Multi-speaker Speech Recognition
Li, Song
Ouyang, Beibei
Tong, Fuchuan
Liao, Dexin
Li, Lin
Hong, Qingyang
INTERSPEECH 2021, 2021, : 3750 - 3754
[26] Improving evolutionary real-time testing by seeding structural test data
Tlili, Marouane
Sthamer, Harmen
Wappler, Stefan
Wegener, Joachim
2006 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-6, 2006, : 885 - 891
[27] Real-time Evolutionary Computation in the Control of Mobile Cyber-Physics System
Rogachev, Gennady
Patkin, Mikhail
Rogachev, Nikolai
2017 SEMINAR ON SYSTEMS ANALYSIS, 2017, 10
[28] Neural networks and evolutionary computation for real-time quality control of complex processes
Patro, S
Kolarik, WJ
ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM - 1997 PROCEEDINGS: THE INTERNATIONAL SYMPOSIUM ON PRODUCT QUALITY & INTEGRITY, 1997, : 327 - 332
[29] A Near Real-Time Automatic Speaker Recognition Architecture for Voice-Based User Interface
Dhakal, Parashar
Damacharla, Praveen
Javaid, Ahmad Y.
Devabhaktuni, Vijay
MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2019, 1 (01): : 504 - 520
[30] Augmenting the Social Presence of Interactive Characters Using Real-time Speech Recognition
Yamano, Mizuki
Song, Zhihao
Hoshino, Junichi
2022 NICOGRAPH INTERNATIONAL, NICOINT 2022, 2022, : 85 - 88

← 1 2 3 4 5 →