Audio Signal Processing in the 21st Century: The important outcomes of the past 25 years

被引：6

作者：

Richard, Gael ^{[1
]}

Smaragdis, Paris ^{[2
]}

Gannot, Sharon ^{[3
]}

Naylor, Patrick A. ^{[4
]}

Makino, Shoji ^{[5
]}

Kellermann, Walter ^{[6
]}

Sugiyama, Akihiko ^{[7
]}

机构：

[1] Telecom Paris, Inst Polytech Paris, F-91120 Palaiseau, France

[2] Univ Illinois, Comp Sci, Champaign, IL 61801 USA

[3] Bar Ilan Univ, Fac Engn, IL-5290002 Ramat Gan, Israel

[4] Imperial Coll London, Speech & Acoust Signal Proc, London SW7 2AZ, England

[5] Waseda Univ, Kitakyushu 8080135, Japan

[6] Univ Erlangen Nurnberg, D-91058 Erlangen, Germany

[7] Yahoo Japan Corp, Tokyo 1028282, Japan

来源：

IEEE SIGNAL PROCESSING MAGAZINE | 2023年 / 40卷 / 05期

基金：

日本学术振兴会;

关键词：

Spatial audio; Signal processing; Telephony; Motion pictures; BLIND SOURCE SEPARATION; SOUND FIELD REPRODUCTION; SPEECH DEREVERBERATION; CONVOLUTIVE MIXTURES; ENHANCEMENT; QUALITY; ALGORITHMS; PREDICTION;

D O I：

10.1109/MSP.2023.3276171

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Audio signal processing has passed many landmarks in its development as a research topic. Many are well known, such as the development of the phonograph in the second half of the 19th century and technology associated with digital telephony that burgeoned in the late 20th century and is still a hot topic in multiple guises. Interestingly, the development of audio technology has been fueled not only by advancements in the capabilities of technology but also by high consumer expectations and customer engagement. From surround sound movie theaters to the latest in-ear devices, people love sound and soon build new audio technology into their daily lives as an essential and expected feature.

引用

页码：12 / 26

页数：15

共 73 条

[41] Jarrett D.P., 2017, Theory and Applications of Spherical Microphone Array Processing
[42] Supervised Determined Source Separation with Multichannel Variational Autoencoder
Kameoka, Hirokazu
Li, Li
Inoue, Shota
Makino, Shoji
[J]. NEURAL COMPUTATION, 2019, 31 (09) : 1891 - 1914
[43] Determined Blind Source Separation Unifying Independent Vector Analysis and Nonnegative Matrix Factorization
Kitamura, Daichi
Ono, Nobutaka
Sawada, Hiroshi
Kameoka, Hirokazu
Saruwatari, Hiroshi
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (09) : 1626 - 1641
[44] Konečny J, 2016, Arxiv, DOI [arXiv:1610.02527, DOI 10.48550/ARXIV.1610.02527]
[45] Loizou P. C., 2007, Speech enhancement: theory and practice
[46] Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Luo, Yi
Mesgarani, Nima
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (08) : 1256 - 1266
[47] Makino S, 2018, SIGNALS COMMUN TECHN, P1, DOI 10.1007/978-3-319-73031-8
[48] Makino S, 2018, SIGNALS COMMUN TECHN, pV
[49] A connectionist approach to automatic transcription of polyphonic piano music
Marolt, M
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2004, 6 (03) : 439 - 449
[50] Speech Dereverberation Based on Variance-Normalized Delayed Linear Prediction
Nakatani, Tomohiro
Yoshioka, Takuya
Kinoshita, Keisuke
Miyoshi, Masato
Juang, Biing-Hwang
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1717 - 1731

← 1 2 3 4 5 6 7 8 →