Rhythm May Be Key to Linking Language and Cognition in Young Infants: Evidence From Machine Learning

Cited by: 3
Authors
Lau, Joseph C. Y. [1 ,2 ,3 ]
Fyshe, Alona [4 ]
Waxman, Sandra R. [1 ,2 ]
Affiliations
[1] Northwestern Univ, Dept Psychol, Evanston, IL 60208 USA
[2] Northwestern Univ, Inst Policy Res, Evanston, IL 60208 USA
[3] Northwestern Univ, Roxelyn & Richard Pepper Dept Commun Sci & Disorde, Evanston, IL 60208 USA
[4] Univ Alberta, Dept Comp Sci & Psychol, Edmonton, AB, Canada
Funding
US National Institutes of Health (NIH); UK Research and Innovation (UKRI);
Keywords
infant cognition; language; rhythm; machine learning; non-human vocalizations; SPEECH RHYTHM; DISCRIMINATION; NEWBORNS; CATEGORIZATION; ACQUISITION; PERCEPTION; ATTENTION; PATTERNS; WORDS;
DOI
10.3389/fpsyg.2022.894405
Chinese Library Classification (CLC) number
B84 [Psychology];
Discipline classification code
04 ; 0402 ;
Abstract
Rhythm is key to language acquisition. Across languages, rhythmic features highlight fundamental linguistic elements of the sound stream and structural relations among them. A sensitivity to rhythmic features, which begins in utero, is evident at birth. What is less clear is whether rhythm supports infants' earliest links between language and cognition. Prior evidence has documented that for infants as young as 3 and 4 months, listening to their native language (English) supports the core cognitive capacity of object categorization. This precocious link is initially part of a broader template: listening to a non-native language from the same rhythmic class as English (e.g., German, but not Cantonese) and to vocalizations of non-human primates (e.g., lemur, Eulemur macaco flavifrons, but not birds, e.g., zebra-finches, Taeniopygia guttata) provide English-acquiring infants the same cognitive advantage as does listening to their native language. Here, we implement a machine-learning (ML) approach to ask whether there are acoustic properties, available on the surface of these vocalizations, that permit infants to identify which vocalizations are candidate links to cognition. We provided the model with a robust sample of vocalizations that, from the vantage point of English-acquiring 4-month-olds, either support object categorization (English, German, lemur vocalizations) or fail to do so (Cantonese, zebra-finch vocalizations). We assess (a) whether supervised ML classification models can distinguish those vocalizations that support cognition from those that do not, and (b) which class(es) of acoustic features (including rhythmic, spectral envelope, and pitch features) best support that classification. Our analysis reveals that principal components derived from rhythm-relevant acoustic features were among the most robust in supporting the classification. Classifications performed using temporal envelope components were also robust. These new findings provide in-principle evidence that infants' earliest links between vocalizations and cognition may be subserved by their perceptual sensitivity to rhythmic and spectral elements available on the surface of these vocalizations, and that these may guide infants' identification of candidate links to cognition.
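The two questions posed in the abstract, (a) whether a supervised classifier can separate the two groups of vocalizations and (b) which feature class supports that separation best, can be illustrated with a minimal sketch. This is not the authors' pipeline: it swaps in a nearest-centroid classifier with leave-one-out cross-validation, omits the principal-component step, and runs on synthetic Gaussian data. The names `nearest_centroid_loo`, `make_synthetic`, and the feature-class labels `informative` and `uninformative` are hypothetical, and the separation values are arbitrary.

```python
import random

def nearest_centroid_loo(features, labels):
    """Leave-one-out accuracy of a nearest-centroid classifier."""
    correct = 0
    for i in range(len(features)):
        # Build class centroids from every sample except the held-out one.
        sums, counts = {}, {}
        for j, (x, y) in enumerate(zip(features, labels)):
            if j == i:
                continue
            acc = sums.setdefault(y, [0.0] * len(x))
            for k, v in enumerate(x):
                acc[k] += v
            counts[y] = counts.get(y, 0) + 1
        centroids = {y: [s / counts[y] for s in sums[y]] for y in sums}

        # Predict the label whose centroid is closest (squared Euclidean distance).
        def dist(c):
            return sum((a - b) ** 2 for a, b in zip(features[i], c))

        predicted = min(centroids, key=lambda y: dist(centroids[y]))
        correct += predicted == labels[i]
    return correct / len(features)

def make_synthetic(rng, n_per_class, separation, n_dims=3):
    """Toy data (assumption): two unit-variance Gaussian clouds whose means
    differ by `separation` along every dimension."""
    features, labels = [], []
    for label, mean in ((1, separation), (0, 0.0)):
        for _ in range(n_per_class):
            features.append([rng.gauss(mean, 1.0) for _ in range(n_dims)])
            labels.append(label)
    return features, labels

rng = random.Random(0)  # fixed seed so the sketch is reproducible

# Hypothetical feature classes: label 1 = "supports categorization",
# label 0 = "does not". Separation values are arbitrary, not from the paper.
feature_classes = {"informative": 3.0, "uninformative": 0.0}

accuracy = {}
for name, separation in feature_classes.items():
    X, y = make_synthetic(rng, 40, separation)
    accuracy[name] = nearest_centroid_loo(X, y)

for name, value in accuracy.items():
    print(f"{name}: leave-one-out accuracy = {value:.2f}")
```

Comparing held-out accuracy across feature classes is one simple way to ask which class carries the distinguishing signal; a feature class with no real group difference should score near chance.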
Pages: 10