Investigating the Effective Dynamic Information of Spectral Shapes for Audio Classification

被引:0
作者
Chen, Liangwei [1 ]
Zhou, Xiren [2 ]
Chen, Qiuju [3 ]
Xiong, Fang [4 ]
Chen, Huanhuan [2 ]
机构
[1] Univ Sci & Technol China, Sch Artificial Intelligence & Data Sci, Hefei 230027, Peoples R China
[2] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Peoples R China
[3] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230027, Peoples R China
[4] Cent South Univ, Xiangya Hosp, Natl Clin Res Ctr Geriatr Dis, Dept Otolaryngol Head & Neck Surg, Changsha 410078, Peoples R China
基金
国家重点研发计划;
关键词
Mel frequency cepstral coefficient; Data models; Spectral shape; Computational modeling; Feature extraction; Fitting; Music; Classification algorithms; Training; Multiple signal classification; Learning in the model space; dynamic information of the spectral shape; audio classification; mel-frequency cepstral coefficients; echo state network; MUSICAL GENRE CLASSIFICATION; FAULT-DIAGNOSIS; MODEL SPACE; RECOGNITION; ALGORITHM;
D O I
10.1109/TMM.2024.3521837
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The spectral shape holds crucial information for Audio Classification (AC), encompassing the spectrum's envelope, details, and dynamic changes over time. Conventional methods utilize cepstral coefficients for spectral shape description but overlook its variation details. Deep-learning approaches capture some dynamics but demand substantial training or fine-tuning resources. The Learning in the Model Space (LMS) framework precisely captures the dynamic information of temporal data by utilizing model fitting, even when computational resources and data are limited. However, applying LMS to audio faces challenges: 1) The high sampling rate of audio hinders efficient data fitting and capturing of dynamic information. 2) The Dynamic Information of Partial Spectral Shapes (DIPSS) may enhance classification, as only specific spectral shapes are relevant for AC. This paper extends an AC framework called Effective Dynamic Information Capture (EDIC) to tackle the above issues. EDIC constructs Mel-Frequency Cepstral Coefficients (MFCC) sequences within different dimensional intervals as the fitted data, which not only reduces the number of sequence sampling points but can also describe the change of the spectral shape in different parts over time. EDIC enables us to implement a topology-based selection algorithm in the model space, selecting effective DIPSS for the current AC task. The performance on three tasks confirms the effectiveness of EDIC.
引用
收藏
页码:1114 / 1126
页数:13
相关论文
共 44 条
  • [21] Effective XML Classification using Content and Structural Information via Rule Learning
    Costa, Gianni
    Ortale, Riccardo
    Ritacco, Ettore
    2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 102 - 109
  • [22] Hyperspectral Image Classification via Exploring Spectral-Spatial Information of Saliency Profiles
    Lu, Qikai
    Hu, Xuan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 3291 - 3303
  • [23] Approximate sparse spectral clustering based on local information maintenance for hyperspectral image classification
    Yan, Qing
    Ding, Yun
    Zhang, Jing-Jing
    Xun, Li-Na
    Zheng, Chun-Hou
    PLOS ONE, 2018, 13 (08):
  • [24] HYPERSPECTRAL IMAGE CLASSIFICATION USING SPECTRAL AND SPATIAL INFORMATION BASED LINEAR DISCRIMINANT ANALYSIS
    Li, Cheng-Hsuan
    Chu, Hui-Shan
    Kuo, Bor-Chen
    Lin, Chin-Teng
    2011 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2011, : 1716 - 1719
  • [25] An overview on spectral and spatial information fusion for hyperspectral image classification: Current trends and challenges
    Imani, Maryam
    Ghassemian, Hassan
    INFORMATION FUSION, 2020, 59 (59) : 59 - 83
  • [26] Artificial Intelligence-based Echocardiogram Video Classification by Aggregating Dynamic Information
    Ye, Zi
    Kumar, Yogan J.
    Sing, Goh O.
    Song, Fengyan
    Ni, Xianda
    Wang, Jin
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (02): : 500 - 521
  • [27] On the Utility of Power Spectral Techniques With Feature Selection Techniques for Effective Mental Task Classification in Noninvasive BCI
    Gupta, Akshansh
    Agrawal, Ramesh Kumar
    Kirar, Jyoti Singh
    Andreu-Perez, Javier
    Ding, Wei-Ping
    Lin, Chin-Teng
    Prasad, Mukesh
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (05): : 3080 - 3092
  • [28] Cross-Channel Dynamic Spatial-Spectral Fusion Transformer for Hyperspectral Image Classification
    Qiu, Zhao
    Xu, Jie
    Peng, Jiangtao
    Sun, Weiwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [29] Spatial-Spectral Fusion BiFormer: A Novel Dynamic Routing Approach for Hyperspectral Image Classification
    Wang, Yiqun
    Yang, Lina
    Wu, Thomas Xinzhang
    Tang, Kaiwen
    Zha, Wanxing
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [30] Spectral-spatial classification of hyperspectral data with mutual information based segmented stacked autoencoder approach
    Paul, Subir
    Kumar, D. Nagesh
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 138 : 265 - 280