Audio technology for improving social interaction in extended reality

被引:0
作者
Luberadzka, Joanna [1 ]
Munoz, Enric Guso [1 ,2 ]
Sayin, Umut [1 ]
Garriga, Adan [1 ]
机构
[1] Tecnol Multimedia, Ctr Tecnol Catalunya, Eurecat, Barcelona, Spain
[2] Univ Pompeu Fabra, Mus Technol Grp, Barcelona, Spain
来源
FRONTIERS IN VIRTUAL REALITY | 2025年 / 5卷
基金
欧盟地平线“2020”;
关键词
social interaction; extended reality; virtual acoustic simulation; acoustic matching; speech enhancement; SPEECH-INTELLIGIBILITY; BLIND ESTIMATION; BEHAVIOR; COMMUNICATION; COHERENCE; MASKING; NOISE;
D O I
10.3389/frvir.2024.1442774
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In recent years, extended reality (XR) has gained interest as a platform for human communication, with the emergence of the "Metaverse" promising to reshape social interactions. At the same time, concerns about harmful behavior and criminal activities in virtual environments have increased. This paper explores the potential of technology to support social harmony within XR, focusing specifically on audio aspects. We introduce the concept of acoustic coherence and discuss why it is crucial for smooth interaction. We further explain the challenges of speech communication in XR, including noise and reverberation, and review sound processing methods to enhance the auditory experience. We also comment on the potential of using virtual reality as a tool for the development and evaluation of audio algorithms aimed at enhancing communication. Finally, we present the results of a pilot study comparing several audio enhancement techniques inside a virtual environment.
引用
收藏
页数:9
相关论文
共 89 条
[1]  
Alshehri SA, 2024, J BIOMOL STRUCT DYN, V42, P12596, DOI [10.1080/07391102.2023.2270756, 10.1044/2023_JSLHR-23-00063]
[2]   Virtual (Zoom) Interactions Alter Conversational Behavior and Interbrain Coherence [J].
Balters, Stephanie ;
Miller, Jonas G. ;
Li, Rihui ;
Hawthorne, Grace ;
Reiss, Allan L. .
JOURNAL OF NEUROSCIENCE, 2023, 43 (14) :2568-2578
[3]   Hearing Impairment Increases Communication Effort During Conversations in Noise [J].
Beechey, Timothy ;
Buchholz, Joerg M. ;
Keidser, Gitte .
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2020, 63 (01) :305-320
[4]  
Billinghurst M., 2024, Dagstuhl Reports, V13, P167, DOI [10.4230/DagRep.13.11.167, DOI 10.4230/DAGREP.13.11.167]
[5]   The effect of audio on the experience in virtual reality: a scoping review [J].
Bosman, Isak de Villiers ;
Buruk, Oguz 'Oz' ;
Jorgensen, Kristine ;
Hamari, Juho .
BEHAVIOUR & INFORMATION TECHNOLOGY, 2024, 43 (01) :165-199
[6]   PREDICTORS OF SPEECH-INTELLIGIBILITY IN ROOMS [J].
BRADLEY, JS .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1986, 80 (03) :837-845
[7]  
Bronkhorst AW, 2000, ACUSTICA, V86, P117
[8]   Isolating the energetic com ponent of speech-on-speech masking with ideal time-frequency segregation [J].
Brungart, Douglas S. ;
Chang, Peter S. ;
Simpson, Brian D. ;
Wang, DeLiang .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (06) :4007-4018
[9]  
Chen Changan, 2022, Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, P18858, DOI 10.48550/arXiv.2202.06875