Does Infant-Directed Speech Help Phonetic Learning? A Machine Learning Investigation

被引:12
作者
Ludusan, Bogdan [1 ,2 ]
Mazuka, Reiko [1 ,3 ]
Dupoux, Emmanuel [4 ]
机构
[1] RIKEN, Ctr Brain Sci, Lab Language Dev, 2-1 Hirosawa, Wako, Saitama 3510198, Japan
[2] Bielefeld Univ, Fac Linguist & Literary Studies, Phonet Workgrp, Bielefeld, Germany
[3] Duke Univ, Dept Psychol & Neurosci, Durham, NC 27706 USA
[4] CNRS, INRIA, EHESS, Lab Sci Cognit & Psycholiguist,ENS,PSL, F-75700 Paris, France
基金
欧洲研究理事会;
关键词
Phonetic learning; Speech variability; Hyperarticulation; Infant-directed speech; Adult-directed speech; Read speech; CROSS-LANGUAGE; MOTHERS SPEECH; MATERNAL SPEECH; BABY TALK; VARIABILITY; CATEGORIES; JAPANESE; AGE; EXAGGERATION; INFORMATION;
D O I
10.1111/cogs.12946
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
A prominent hypothesis holds that by speaking to infants in infant-directed speech (IDS) as opposed to adult-directed speech (ADS), parents help them learn phonetic categories. Specifically, two characteristics of IDS have been claimed to facilitate learning: hyperarticulation, which makes the categories more separable, and variability, which makes the generalization more robust. Here, we test the separability and robustness of vowel category learning on acoustic representations of speech uttered by Japanese adults in ADS, IDS (addressed to 18- to 24-month olds), or read speech (RS). Separability is determined by means of a distance measure computed between the five short vowel categories of Japanese, while robustness is assessed by testing the ability of six different machine learning algorithms trained to classify vowels to generalize on stimuli spoken by a novel speaker in ADS. Using two different speech representations, we find that hyperarticulated speech, in the case of RS, can yield better separability, and that increased between-speaker variability in ADS can yield, for some algorithms, more robust categories. However, these conclusions do not apply to IDS, which turned out to yield neither more separable nor more robust categories compared to ADS inputs. We discuss the usefulness of machine learning algorithms run on real data to test hypotheses about the functional role of IDS.
引用
收藏
页数:31
相关论文
共 83 条
  • [1] Adriaans F., 2012, P 34 ANN M COGN SCI, P72
  • [2] Prosodic exaggeration within infant-directed speech: Consequences for vowel learnability
    Adriaans, Frans
    Swingley, Daniel
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (05) : 3070 - 3078
  • [3] Fundamental frequency of infants' and parents' utterances in longitudinal recordings
    Amano, S
    Nakatani, T
    Kondo, T
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 119 (03) : 1636 - 1647
  • [4] Andruski J., 1999, P 14 INT C PHON SCI, P2177, DOI [10.1109/ICSLP.1996.607913, DOI 10.1109/ICSLP.1996.607913]
  • [5] [Anonymous], 2018, SOMBRERO SOM BOUND R
  • [6] [Anonymous], 2006, IEIC TECHNICAL REPOR
  • [7] Barriuso T.A., 2018, The CATESOL Journal, V30, P177
  • [8] Mommy is only happy! Dutch mothers' realisation of speech sounds in infant-directed speech expresses emotion, not didactic intent
    Benders, Titia
    [J]. INFANT BEHAVIOR & DEVELOPMENT, 2013, 36 (04) : 847 - 862
  • [9] WordSeg: Standardizing unsupervised word form segmentation from text
    Bernard, Mathieu
    Thiolliere, Roland
    Saksida, Amanda
    Loukatou, Georgia R.
    Larsen, Elin
    Johnson, Mark
    Fibla, Laia
    Dupoux, Emmanuel
    Daland, Robert
    Cao, Xuan Nga
    Cristia, Alejandrina
    [J]. BEHAVIOR RESEARCH METHODS, 2020, 52 (01) : 264 - 278
  • [10] Boersma P., 2021, Praat: Doing phonetics by computer (6.1.40) Computer software