Learning phonetic categories by tracking movements

被引:34
作者
Gauthier, Bruno
Shi, Rushen
Xu, Yi
机构
[1] Univ Quebec, Dept Psychol, Montreal, PQ H3C 3P8, Canada
[2] UCL, Dept Phonet & Linguist, London WC1E 6BT, England
关键词
category formation; infant speech perception; language acquisition; unsupervised learning; self-organizing maps; target approximation; lexical tone; contextual tonal variation; theories of speech production and perception; LANGUAGE SPEECH-PERCEPTION; STOP-CONSONANT PERCEPTION; TONE PERCEPTION; MANDARIN CHINESE; MAXIMUM SPEED; LEXICAL TONE; PITCH; COARTICULATION; INFORMATION; REALIZATION;
D O I
10.1016/j.cognition.2006.03.002
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
We explore in this study how infants may derive phonetic categories from adult input that are highly variable. Neural networks in the form of self-organizing maps (SOMs; Kohonen, 1989, 1995) were used to simulate unsupervised learning of Mandarin tones. In Simulation 1, we trained the SOMs with syllable-sized continuous F-0 contours, produced by multiple speakers in connected speech, and with the corresponding velocity profiles (D1). No attempt was made to reduce the large amount of variability in the input or to add to the input any abstract features such as height and slope of the F-0 contours. In the testing phase, reasonably high categorization rate was achieved with F-0 profiles, but D1 profiles yielded almost perfect categorization of the four tones. Close inspection of the learned prototypical D1 profile clusters revealed that they had effectively eliminated surface variability and directly reflected articulatory movements toward the underlying targets of the four tones as proposed by Xu and Wang (2001). Additional simulations indicated that a further learning step was possible through which D1 prototypes with one-to-one correspondence to the tones were derived from the prototype clusters learned in Simulation 1. Implications of these findings for theories of language acquisition, speech perception and speech production are discussed. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:80 / 106
页数:27
相关论文
共 50 条
  • [21] Orienting attention during phonetic training facilitates learning
    Pederson, Eric
    Guion-Anderson, Susan
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (02) : EL54 - EL59
  • [22] Neural signatures of phonetic learning in adulthood: A magnetoencephalography study
    Zhang, Yang
    Kuhl, Patricia K.
    Imada, Toshiaki
    Iverson, Paul
    Pruitt, John
    Stevens, Erica B.
    Kawakatsu, Masaki
    Tohkura, Yoh'ichi
    Nemoto, Iku
    NEUROIMAGE, 2009, 46 (01) : 226 - 240
  • [23] Infant Phonetic Learning as Perceptual Space Learning: A Crosslinguistic Evaluation of Computational Models
    Matusevych, Yevgen
    Schatz, Thomas
    Kamper, Herman
    Feldman, Naomi H.
    Goldwater, Sharon
    COGNITIVE SCIENCE, 2023, 47 (07)
  • [24] Computational modeling of phonetic and lexical learning in early language acquisition: Existing models and future directions
    Rasanen, Okko
    SPEECH COMMUNICATION, 2012, 54 (09) : 975 - 997
  • [25] Eye movements in repeated multiple object tracking
    Lukavsky, Jiri
    JOURNAL OF VISION, 2013, 13 (07):
  • [26] Position tracking and identity tracking are separate systems: Evidence from eye movements
    Oksama, Lauri
    Hyona, Jukka
    COGNITION, 2016, 146 : 393 - 409
  • [27] Word learning and phonetic processing in preschool-age children
    Havy, Melanie
    Bertoncini, Josiane
    Nazzi, Thierry
    JOURNAL OF EXPERIMENTAL CHILD PSYCHOLOGY, 2011, 108 (01) : 25 - 43
  • [28] Does Infant-Directed Speech Help Phonetic Learning? A Machine Learning Investigation
    Ludusan, Bogdan
    Mazuka, Reiko
    Dupoux, Emmanuel
    COGNITIVE SCIENCE, 2021, 45 (05)
  • [29] Prosodic pitch processing is represented in delta-band EEG and is dissociable from the cortical tracking of other acoustic and phonetic features
    Teoh, Emily S.
    Cappelloni, Madeline S.
    Lalor, Edmund C.
    EUROPEAN JOURNAL OF NEUROSCIENCE, 2019, 50 (11) : 3831 - 3842
  • [30] Unsupervised Learning of Functional Categories in Video Scenes
    Turek, Matthew W.
    Hoogs, Anthony
    Collins, Roderic
    COMPUTER VISION-ECCV 2010, PT II, 2010, 6312 : 664 - 677