Audio Signal Processing in the 21st Century: The important outcomes of the past 25 years

被引:6
作者
Richard, Gael [1 ]
Smaragdis, Paris [2 ]
Gannot, Sharon [3 ]
Naylor, Patrick A. [4 ]
Makino, Shoji [5 ]
Kellermann, Walter [6 ]
Sugiyama, Akihiko [7 ]
机构
[1] Telecom Paris, Inst Polytech Paris, F-91120 Palaiseau, France
[2] Univ Illinois, Comp Sci, Champaign, IL 61801 USA
[3] Bar Ilan Univ, Fac Engn, IL-5290002 Ramat Gan, Israel
[4] Imperial Coll London, Speech & Acoust Signal Proc, London SW7 2AZ, England
[5] Waseda Univ, Kitakyushu 8080135, Japan
[6] Univ Erlangen Nurnberg, D-91058 Erlangen, Germany
[7] Yahoo Japan Corp, Tokyo 1028282, Japan
基金
日本学术振兴会;
关键词
Spatial audio; Signal processing; Telephony; Motion pictures; BLIND SOURCE SEPARATION; SOUND FIELD REPRODUCTION; SPEECH DEREVERBERATION; CONVOLUTIVE MIXTURES; ENHANCEMENT; QUALITY; ALGORITHMS; PREDICTION;
D O I
10.1109/MSP.2023.3276171
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Audio signal processing has passed many landmarks in its development as a research topic. Many are well known, such as the development of the phonograph in the second half of the 19th century and technology associated with digital telephony that burgeoned in the late 20th century and is still a hot topic in multiple guises. Interestingly, the development of audio technology has been fueled not only by advancements in the capabilities of technology but also by high consumer expectations and customer engagement. From surround sound movie theaters to the latest in-ear devices, people love sound and soon build new audio technology into their daily lives as an essential and expected feature.
引用
收藏
页码:12 / 26
页数:15
相关论文
共 73 条
  • [1] A signal subspace tracking algorithm for microphone array processing of speech
    Affes, S
    Grenier, Y
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (05): : 425 - 437
  • [2] Sound Field Reproduction Using Planar and Linear Arrays of Loudspeakers
    Ahrens, Jens
    Spors, Sascha
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (08): : 2038 - 2050
  • [3] IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS
    ALLEN, JB
    BERKLEY, DA
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) : 943 - 950
  • [4] The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech
    Araki, S
    Mukai, R
    Makino, S
    Nishikawa, T
    Saruwatari, H
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (02): : 109 - 116
  • [5] Avila AR, 2019, INT CONF ACOUST SPEE, P631, DOI 10.1109/ICASSP.2019.8683175
  • [6] Late Reverberation Synthesis: From Radiance Transfer to Feedback Delay Networks
    Bai, Hequn
    Richard, Gael
    Daudet, Laurent
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (12) : 2260 - 2271
  • [7] Binaural cue coding - Part I. Psychoacoustic fundamentals and design principles
    Baumgarte, F
    Faller, C
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (06): : 509 - 519
  • [8] Begault D. R, 2000, NASA/TM-2000-209606
  • [9] Benesty J., 2008, Springer Handbook of Speech Processing, DOI DOI 10.1007/978-3-540-49127-9
  • [10] Benesty J., 2012, Study and design of differential microphone arrays, V6