Audio Signal Processing in the 21st Century: The important outcomes of the past 25 years

Cited by: 6
Authors
Richard, Gael [1]
Smaragdis, Paris [2]
Gannot, Sharon [3]
Naylor, Patrick A. [4]
Makino, Shoji [5]
Kellermann, Walter [6]
Sugiyama, Akihiko [7]
Affiliations
[1] Telecom Paris, Inst Polytech Paris, F-91120 Palaiseau, France
[2] Univ Illinois, Comp Sci, Champaign, IL 61801 USA
[3] Bar Ilan Univ, Fac Engn, IL-5290002 Ramat Gan, Israel
[4] Imperial Coll London, Speech & Acoust Signal Proc, London SW7 2AZ, England
[5] Waseda Univ, Kitakyushu 8080135, Japan
[6] Univ Erlangen Nurnberg, D-91058 Erlangen, Germany
[7] Yahoo Japan Corp, Tokyo 1028282, Japan
Funding
Japan Society for the Promotion of Science
Keywords
Spatial audio; Signal processing; Telephony; Motion pictures; BLIND SOURCE SEPARATION; SOUND FIELD REPRODUCTION; SPEECH DEREVERBERATION; CONVOLUTIVE MIXTURES; ENHANCEMENT; QUALITY; ALGORITHMS; PREDICTION;
DOI
10.1109/MSP.2023.3276171
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Subject classification codes
0808; 0809
Abstract
Audio signal processing has passed many landmarks in its development as a research topic. Many are well known, such as the development of the phonograph in the second half of the 19th century and technology associated with digital telephony that burgeoned in the late 20th century and is still a hot topic in multiple guises. Interestingly, the development of audio technology has been fueled not only by advancements in the capabilities of technology but also by high consumer expectations and customer engagement. From surround sound movie theaters to the latest in-ear devices, people love sound and soon build new audio technology into their daily lives as an essential and expected feature.
Pages: 12-26
Page count: 15