A PHYSICS-INFORMED NEURAL NETWORK-BASED APPROACH FOR THE SPATIAL UPSAMPLING OF SPHERICAL MICROPHONE ARRAYS

被引：0

作者：

Miotello, Federico ^{[1
]}

Terminiello, Ferdinando ^{[1
]}

Pezzoli, Mirco ^{[1
]}

Bemardini, Alberto ^{[1
]}

Antonacci, Fabio ^{[1
]}

Sarti, Augusto ^{[1
]}

机构：

[1] Politecn Milan, Dipartimento Elettron Informaz & Bioingn, Via Ponzio 34-5, I-20133 Milan, Italy

来源：

2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024 | 2024年

关键词：

physics-informed neural network; spherical microphone array; space-time audio signal processing; SOUND FIELD;

D O I：

10.1109/IWAENC61483.2024.10694489

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Spherical microphone arrays are convenient tools for capturing the spatial characteristics of a sound field. However, achieving superior spatial resolution requires arrays with numerous capsules, consequently leading to expensive devices. To address this issue, we present a method for spatially upsampling spherical microphone arrays with a limited number of capsules. Our approach exploits a physics-informed neural network with Rowdy activation functions, leveraging physical constraints to provide high-order microphone array signals, starting from low-order devices. Results show that, within its domain of application, our approach outperforms a state of the art method based on signal processing for spherical microphone arrays upsampling.

引用

页码：215 / 219

页数：5

共 40 条

[1] Abhayapala TD, 2002, INT CONF ACOUST SPEE, P1949
[2] Capturing and Reproducing Spatial Audio Based on a Circular Microphone Array
Alexandridis, Anastasios
Griffin, Anthony
Mouchtaris, Athanasios
[J]. JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2013, 2013
[3] Bernschutz B., 2012, AUDIO ENG SOC CONVEN
[4] Sound Field Estimation around a Rigid Sphere with Physics-informed Neural Network
Chen, Xingyu
Ma, Fei
Bastine, Amy
Samarasinghe, Prasanga
Sun, Huiyuan
[J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1984 - 1989
[5] Cobos M., 2023, INT C AC SPEECH SIGN, P1
[6] An overview of machine learning and other data-based methods for spatial audio capture, processing, and reproduction
Cobos, Maximo
Ahrens, Jens
Kowalczyk, Konrad
Politis, Archontis
[J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2022, 2022 (01)
[7] A Compressive Sensing Approach for the Reconstruction of the Soundfield Produced by Directive Sources in Reverberant Rooms
Damiano, Stefano
Borra, Federico
Bernardini, Alberto
Antonacci, Fabio
Sarti, Augusto
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 2667 - 2679
[8] Fahim A, 2017, 2017 HANDS-FREE SPEECH COMMUNICATIONS AND MICROPHONE ARRAYS (HSCMA 2017), P151, DOI 10.1109/HSCMA.2017.7895580
[9] Hu Y., 2023, INT C AC SPEECH SIGN, P1
[10] Deep Kronecker neural networks: A general framework for neural networks with adaptive activation functions
Jagtap, Ameya D.
Shin, Yeonjong
Kawaguchi, Kenji
Karniadakis, George Em
[J]. NEUROCOMPUTING, 2022, 468 (165-180) : 165 - 180

← 1 2 3 4 →