A PHYSICS-INFORMED NEURAL NETWORK-BASED APPROACH FOR THE SPATIAL UPSAMPLING OF SPHERICAL MICROPHONE ARRAYS

被引:0
作者
Miotello, Federico [1 ]
Terminiello, Ferdinando [1 ]
Pezzoli, Mirco [1 ]
Bemardini, Alberto [1 ]
Antonacci, Fabio [1 ]
Sarti, Augusto [1 ]
机构
[1] Politecn Milan, Dipartimento Elettron Informaz & Bioingn, Via Ponzio 34-5, I-20133 Milan, Italy
来源
2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024 | 2024年
关键词
physics-informed neural network; spherical microphone array; space-time audio signal processing; SOUND FIELD;
D O I
10.1109/IWAENC61483.2024.10694489
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Spherical microphone arrays are convenient tools for capturing the spatial characteristics of a sound field. However, achieving superior spatial resolution requires arrays with numerous capsules, consequently leading to expensive devices. To address this issue, we present a method for spatially upsampling spherical microphone arrays with a limited number of capsules. Our approach exploits a physics-informed neural network with Rowdy activation functions, leveraging physical constraints to provide high-order microphone array signals, starting from low-order devices. Results show that, within its domain of application, our approach outperforms a state of the art method based on signal processing for spherical microphone arrays upsampling.
引用
收藏
页码:215 / 219
页数:5
相关论文
共 40 条
  • [1] Abhayapala TD, 2002, INT CONF ACOUST SPEE, P1949
  • [2] Capturing and Reproducing Spatial Audio Based on a Circular Microphone Array
    Alexandridis, Anastasios
    Griffin, Anthony
    Mouchtaris, Athanasios
    [J]. JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2013, 2013
  • [3] Bernschutz B., 2012, AUDIO ENG SOC CONVEN
  • [4] Sound Field Estimation around a Rigid Sphere with Physics-informed Neural Network
    Chen, Xingyu
    Ma, Fei
    Bastine, Amy
    Samarasinghe, Prasanga
    Sun, Huiyuan
    [J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1984 - 1989
  • [5] Cobos M., 2023, INT C AC SPEECH SIGN, P1
  • [6] An overview of machine learning and other data-based methods for spatial audio capture, processing, and reproduction
    Cobos, Maximo
    Ahrens, Jens
    Kowalczyk, Konrad
    Politis, Archontis
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2022, 2022 (01)
  • [7] A Compressive Sensing Approach for the Reconstruction of the Soundfield Produced by Directive Sources in Reverberant Rooms
    Damiano, Stefano
    Borra, Federico
    Bernardini, Alberto
    Antonacci, Fabio
    Sarti, Augusto
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 2667 - 2679
  • [8] Fahim A, 2017, 2017 HANDS-FREE SPEECH COMMUNICATIONS AND MICROPHONE ARRAYS (HSCMA 2017), P151, DOI 10.1109/HSCMA.2017.7895580
  • [9] Hu Y., 2023, INT C AC SPEECH SIGN, P1
  • [10] Deep Kronecker neural networks: A general framework for neural networks with adaptive activation functions
    Jagtap, Ameya D.
    Shin, Yeonjong
    Kawaguchi, Kenji
    Karniadakis, George Em
    [J]. NEUROCOMPUTING, 2022, 468 (165-180) : 165 - 180