Deep Learning in Audio Classification

Cited by: 2
Authors
Wang, Yaqin [1]
Wei-Kocsis, Jin [1]
Springer, John A. [1]
Matson, Eric T. [1]
Affiliations
[1] Purdue Univ, W Lafayette, IN 47907 USA
Source
INFORMATION AND SOFTWARE TECHNOLOGIES, ICIST 2022 | 2022 / Vol. 1665
Keywords
Audio classification; Machine learning; Deep learning; Deep reinforcement learning; Convolutional neural networks
DOI
10.1007/978-3-031-16302-9_5
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Audio processing technology is everywhere in our lives. We ask our car to make a call for us while driving, or we let Alexa turn off the light when we don't want to get out of bed before going to sleep. In all of these audio-based applications and research, it is artificial intelligence (AI) and machine learning (ML) that make the computer or smartphone understand us through our voice [1]. Deep learning (DL), an important part of AI and especially of ML, has had great influence on many areas of AI- and ML-based research and applications. This paper focuses on deep learning structures and applications for audio classification. We conduct a detailed review of the literature on audio-based DL and deep reinforcement learning (DRL) approaches and applications. We also discuss the limitations of audio-based DL approaches and possible future work.
Pages: 64-77
Number of pages: 14
Related papers
65 in total
[61] Wiering M, 2012, ADAPT LEARN OPTIM, V12, P1, DOI 10.1007/978-3-642-27645-3
[62] Wu YZ, 2019, INT CONF ACOUST SPEE, P815, DOI 10.1109/ICASSP.2019.8683490
[63] Xie, Jie; Zhu, Mingying. Handcrafted features and late fusion with deep learning for bird sound classification [J]. ECOLOGICAL INFORMATICS, 2019, 52: 74-81
[64] Ying, Xue. An Overview of Overfitting and its Solutions [J]. 2018 INTERNATIONAL CONFERENCE ON COMPUTER INFORMATION SCIENCE AND APPLICATION TECHNOLOGY, 2019, 1168
[65] Zhang SC, 2003, APPL ARTIF INTELL, V17, P375, DOI [10.1080/713827180, 10.1080/08839510390219264]