Auditory-visual scenes for hearing research

被引:6
作者
van de Par, Steven [1 ]
Ewert, Stephan D. [1 ]
Hladek, Lubos [2 ]
Kirsch, Christoph [1 ]
Schuetze, Julia [1 ]
Llorca-Bofi, Josep [3 ]
Grimm, Giso [1 ]
Hendrikse, Maartje M. E. [1 ,4 ]
Kollmeier, Birger [1 ]
Seeber, Bernhard U. [2 ]
机构
[1] Carl von Ossietzky Univ Oldenburg, Dept Med Phys & Acoust, Cluster Excellence Hearing4a11, Carl von Ossietzky Str 9 11, D-26129 Oldenburg, Germany
[2] Tech Univ Munich, Dept Elect & Comp Engn, Audio Informat Proc, Theresienstr 90, D-80333 Munich, Germany
[3] Rhein Westfal TH Aachen, Inst Hearing Technol & Acoust, Kopernikusstr 5, D-52074 Aachen, Germany
[4] Erasmus MC, Dept Otorhinolaryngol & Head & Neck Surg, Burgemeester Oudlaan 50, NL-3062 PA Rotterdam, Netherlands
来源
ACTA ACUSTICA | 2022年 / 6卷
关键词
Complex acoustic environments; Speech intelligibility; Room acoustics; Ecological validity; SPEECH-INTELLIGIBILITY; PREDICTION; NOISE; REVERBERATION; SENTENCES; ENVELOPE; RELEASE; MASKING; BENEFIT;
D O I
10.1051/aacus/2022032
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
While experimentation with synthetic stimuli in abstracted listening situations has a long standing and successful history in hearing research, an increased interest exists on closing the remaining gap towards real-life listening by replicating situations with high ecological validity in the lab. This is important for understanding the underlying auditory mechanisms and their relevance in real-life situations as well as for developing and evaluating increasingly sophisticated algorithms for hearing assistance. A range of 'classical' stimuli and paradigms have evolved to de-facto standards in psychoacoustics, which are simplistic and can be easily reproduced across laboratories. While they ideally allow for across laboratory comparisons and reproducible research, they, however, lack the acoustic stimulus complexity and the availability of visual information as observed in everyday life communication and listening situations. This contribution aims to provide and establish an extendable set of complex auditory-visual scenes for hearing research that allow for ecologically valid testing in realistic scenes while also supporting reproducibility and comparability of scientific results. Three virtual environments are provided (underground station, pub, living room), consisting of a detailed visual model, an acoustic geometry model with acoustic surface properties as well as a set of acoustic measurements in the respective real-world environments. The current data set enables i) audio-visual research in a reproducible set of environments, ii) comparison of room acoustic simulation methods with "ground truth" acoustic measurements, iii) a condensation point for future extensions and contributions for developments towards standardized test cases for ecologically valid hearing research in complex scenes.
引用
收藏
页数:14
相关论文
共 68 条
  • [1] ANSI, 1997, S35R2007 ANSI AC SOC
  • [2] Bentler Ruth A, 2005, J Am Acad Audiol, V16, P473, DOI 10.3766/jaaa.16.7.7
  • [3] Visually-guided attention enhances target identification in a complex auditory scene
    Best, Virginia
    Ozmeral, Erol J.
    Shinn-Cunningham, Barbara G.
    [J]. JARO-JOURNAL OF THE ASSOCIATION FOR RESEARCH IN OTOLARYNGOLOGY, 2007, 8 (02): : 294 - 304
  • [4] Better-ear glimpsing in hearing-impaired listeners
    Best, Virginia
    Mason, Christine R.
    Kidd, Gerald, Jr.
    Iyer, Nandini
    Brungart, Douglas S.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2015, 137 (02) : EL213 - EL219
  • [5] Prediction of speech intelligibility in spatial noise and reverberation for normal-hearing and hearing-impaired listeners
    Beutelmann, Rainer
    Brand, Thomas
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (01) : 331 - 342
  • [6] Prediction of binaural speech intelligibility with frequency-dependent interaural phase differences
    Beutelmann, Rainer
    Brand, Thomas
    Kollmeier, Birger
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 126 (03) : 1359 - 1368
  • [7] The effect of room acoustical parameters on speech reception thresholds and spatial release from masking
    Biberger, Thomas
    Ewert, Stephan D.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 146 (04) : 2188 - 2200
  • [8] Envelope and intensity based prediction of psychoacoustic masking and speech intelligibility
    Biberger, Thomas
    Ewert, Stephan D.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (02) : 1023 - 1038
  • [9] Toward realistic binaural auralizations - perceptual comparison between measurement and simulation-based auralizations and the real room for a classroom scenario
    Blau, Matthias
    Budnik, Armin
    Fallahi, Mina
    Steffens, Henning
    Ewert, Stephan D.
    van de Par, Steven
    [J]. ACTA ACUSTICA, 2021, 5
  • [10] A round robin on room acoustical simulation and auralization
    Brinkmann, Fabian
    Aspoeck, Lukas
    Ackermann, David
    Lepa, Steffen
    Vorlaender, Michael
    Weinzierl, Stefan
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 145 (04) : 2746 - 2760