Monkeys and Humans Share a Common Computation for Face/Voice Integration

Cited by: 39
Authors
Chandrasekaran, Chandramouli [1, 2]
Lemus, Luis [1, 2]
Trubanova, Andrea [2, 3]
Gondan, Matthias [4, 5]
Ghazanfar, Asif A. [1, 2, 6]
Affiliations
[1] Princeton Univ, Neurosci Inst, Princeton, NJ 08544 USA
[2] Princeton Univ, Dept Psychol, Princeton, NJ 08544 USA
[3] Emory Univ, Sch Med, Marcus Autism Ctr, Atlanta, GA USA
[4] Univ Regensburg, Dept Psychol, Regensburg, Germany
[5] Heidelberg Univ, Heidelberg, Germany
[6] Princeton Univ, Dept Ecol & Evolut Biol, Princeton, NJ 08544 USA
Funding
US National Science Foundation
Keywords
AUDIOVISUAL SPEECH-PERCEPTION; SUPERIOR TEMPORAL SULCUS; OROFACIAL MOTOR REPRESENTATION; AUDITORY-VISUAL INTERACTIONS; CHIMPANZEE PAN-TROGLODYTES; BIMODAL DIVIDED ATTENTION; MACAQUES MACACA-MULATTA; SACCADIC EYE-MOVEMENTS; OLD-WORLD MONKEYS; MULTISENSORY INTEGRATION;
DOI
10.1371/journal.pcbi.1002165
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010 ; 081704 ;
Abstract
Speech production involves the movement of the mouth and other regions of the face resulting in visual motion cues. These visual cues enhance intelligibility and detection of auditory speech. As such, face-to-face speech is fundamentally a multisensory phenomenon. If speech is fundamentally multisensory, it should be reflected in the evolution of vocal communication: similar behavioral effects should be observed in other primates. Old World monkeys share with humans vocal production biomechanics and communicate face-to-face with vocalizations. It is unknown, however, if they, too, combine faces and voices to enhance their perception of vocalizations. We show that they do: monkeys combine faces and voices in noisy environments to enhance their detection of vocalizations. Their behavior parallels that of humans performing an identical task. We explored what common computational mechanism(s) could explain the pattern of results we observed across species. Standard explanations or models such as the principle of inverse effectiveness and a "race" model failed to account for their behavior patterns. Conversely, a "superposition model", positing the linear summation of activity patterns in response to visual and auditory components of vocalizations, served as a straightforward but powerful explanatory mechanism for the observed behaviors in both species. As such, it represents a putative homologous mechanism for integrating faces and voices across primates.
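The abstract does not say how the "race" model was evaluated. A common check in the multisensory literature is the race-model inequality, which bounds the audiovisual reaction-time distribution by the sum of the unisensory ones: F_AV(t) <= F_A(t) + F_V(t). The sketch below illustrates that inequality only and should not be read as the authors' analysis. The function names and the reaction times are hypothetical, invented for illustration.

```python
from bisect import bisect_right


def ecdf(samples, t):
    """Fraction of reaction times in `samples` that are <= t."""
    ordered = sorted(samples)
    return bisect_right(ordered, t) / len(ordered)


def race_model_violations(rt_audio, rt_visual, rt_av, time_points):
    """Return the time points where the audiovisual CDF exceeds the bound.

    The bound at time t is min(1, F_A(t) + F_V(t)). Any t at which
    F_AV(t) is larger counts as a violation of the race model.
    """
    violations = []
    for t in time_points:
        bound = min(1.0, ecdf(rt_audio, t) + ecdf(rt_visual, t))
        if ecdf(rt_av, t) > bound:
            violations.append(t)
    return violations


# Hypothetical reaction times in milliseconds (not data from the paper).
rt_audio = [420, 450, 480, 510, 540]
rt_visual = [460, 490, 520, 550, 580]
rt_av = [330, 350, 370, 390, 410]

print(race_model_violations(rt_audio, rt_visual, rt_av, [350, 400, 450, 500]))
```

Violations at early time points mean that audiovisual responses are faster than two independently racing unisensory processes could produce, which is the usual motivation for considering an integrative account such as the superposition model named in the abstract.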
Pages: 20