DEEP TENSOR FACTORIZATION FOR SPATIALLY-AWARE SCENE DECOMPOSITION

被引:0
|
作者
Casebeer, Jonah [1 ]
Colomb, Michael [1 ]
Smaragdis, Paris [1 ]
机构
[1] Univ Illinois, Comp Sci Dept, Urbana, IL 61801 USA
来源
2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA) | 2019年
基金
美国国家科学基金会;
关键词
nonnegative tensor factorization; source separation; unsupervised learning; scene understanding; deep learning; NONNEGATIVE MATRIX FACTORIZATION; MIXTURES;
D O I
10.1109/waspaa.2019.8937263
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose a completely unsupervised method to understand audio scenes observed with random microphone arrangements by decomposing the scene into its constituent sources and their relative presence in each microphone. To this end, we formulate a neural network architecture that can be interpreted as a nonnegative tensor factorization of a multi-channel audio recording. By clustering on the learned network parameters corresponding to channel content, we can learn sources' individual spectral dictionaries and their activation patterns over time. Our method allows us to leverage deep learning advances like end-to-end training, while also allowing stochastic minibatch training so that we can feasibly decompose realistic audio scenes that are intractable to decompose using standard methods. This neural network architecture is easily extensible to other kinds of tensor factorizations.
引用
收藏
页码:180 / 184
页数:5
相关论文
共 50 条
  • [21] On the interplay between educational attainment and nutrition: a spatially-aware perspective
    Narges Azizi Fard
    Gianmarco De Francisci Morales
    Yelena Mejova
    Rossano Schifanella
    EPJ Data Science, 10
  • [22] On the interplay between educational attainment and nutrition: a spatially-aware perspective
    Azizi Fard, Narges
    De Francisci Morales, Gianmarco
    Mejova, Yelena
    Schifanella, Rossano
    EPJ DATA SCIENCE, 2021, 10 (01)
  • [23] Spatially-aware dialogue control using hierarchical reinforcement learning
    Cuayáhuitl H.
    Dethlefs N.
    ACM Transactions on Speech and Language Processing, 2011, 7 (03):
  • [24] Improving Few-shot Learning by Spatially-aware Matching and CrossTransformer
    Zhang, Hongguang
    Torr, Philip H. S.
    Koniusz, Piotr
    COMPUTER VISION - ACCV 2022, PT V, 2023, 13845 : 3 - 20
  • [25] SPATIALLY-AWARE OPTIMIZATION OF ENERGY CONSUMPTION IN CONSOLIDATED DATA CENTER SYSTEMS
    Chen, Hui
    Kesavan, Mukil
    Schwan, Karsten
    Gavrilovska, Ada
    Kumar, Pramod
    Joshi, Yogendra
    PROCEEDINGS OF THE ASME PACIFIC RIM TECHNICAL CONFERENCE AND EXHIBITION ON PACKAGING AND INTEGRATION OF ELECTRONIC AND PHOTONIC SYSTEMS, MEMS AND NEMS 2011, VOL 2, 2012, : 461 - +
  • [26] A lightweight spatially-aware classification model for breast cancer pathology images
    Jiang, Liang
    Zhang, Cheng
    Zhang, Huan
    Cao, Hui
    BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2024, 44 (03) : 586 - 608
  • [27] ProObjAR: Prototyping Spatially-aware Interactions of Smart Objects with AR-HMD
    Ye, Hui
    Leng, Jiaye
    Xiao, Chufeng
    Wang, Lili
    Fu, Hongbo
    PROCEEDINGS OF THE 2023 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2023, 2023,
  • [28] Spatially-Aware Autoencoders for Detecting Contextual Anomalies in Geo-Distributed Data
    Corizzo, Roberto
    Ceci, Michelangelo
    Pio, Gianvito
    Mignone, Paolo
    Japkowicz, Nathalie
    DISCOVERY SCIENCE (DS 2021), 2021, 12986 : 461 - 471
  • [29] XSpace: An Augmented Reality Toolkit for Enabling Spatially-Aware Distributed Collaboration
    Herskovitz J.
    Cheng Y.F.
    Guo A.
    Sample A.P.
    Nebeling M.
    Proceedings of the ACM on Human-Computer Interaction, 2022, 6 (ISS)
  • [30] SAMA: Spatially-Aware Multimodal Network with Attention For Early Lung Cancer Diagnosis
    Roa, Mafe
    Daza, Laura
    Escobar, Maria
    Castillo, Angela
    Arbelaez, Pablo
    MULTIMODAL LEARNING FOR CLINICAL DECISION SUPPORT, 2021, 13050 : 48 - 58