Local similarity and global variability characterize the semantic space of human languages

被引:5
|
作者
Lewis, Molly [1 ]
Cahill, Aoife [2 ]
Madnani, Nitin [3 ]
Evans, James [4 ,5 ]
机构
[1] Carnegie Mellon Univ, Psychol & Social & Decis Sci, Pittsburgh, PA 15213 USA
[2] Dataminr Inc, New York, NY 10016 USA
[3] Educ Testing Serv, Princeton, NJ 08541 USA
[4] Univ Chicago, Sociol & Data Sci, Chicago, IL 60637 USA
[5] Santa Fe Inst, Santa Fe, NM 87501 USA
关键词
human cognition; language; semantics; culture; communication; BODY; CATEGORIES; ENGLISH; COLOR; SPECIFICITY; SENSITIVITY; EVOLUTION; PATTERNS; MEANINGS; REFLECTS;
D O I
10.1073/pnas.2300986120
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
How does meaning vary across the world's languages? Scholars recognize the existence of substantial variability within specific domains, ranging from nature and color to kinship. The emergence of large language models enables a systems-level approach that directly characterizes this variability through comparison of word organization across semantic domains. Here, we show that meanings across languages manifest lower variability within semantic domains and greater variability between them, using models trained on both 1) large corpora of native language text comprising Wikipedia articles in 35 languages and also 2) Test of English as a Foreign Language (TOEFL) essays written by 38,500 speakers from the same native languages, which cluster into semantic domains. Concrete meanings vary less across languages than abstract meanings, but all vary with geographical, environmental, and cultural distance. By simultaneously examining local similarity and global difference, we harmonize these findings and provide a description of general principles that govern variability in semantic space across languages. In this way, the structure of a speaker's semantic space influences the comparisons cognitively salient to them, as shaped by their native language, and suggests that even successful bilingual communicators likely think with "semantic accents" driven by associations from their native language while writing English. These findings have dramatic implications for language education, cross-cultural communication, and literal translations, which are impossible not because the objects of reference are uncertain, but because associations, metaphors, and narratives interlink meanings in different, predictable ways from one language to another.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Local and global semantic integration in an argument structure: ERP evidence from Korean
    Nam, Yunju
    Hong, Upyong
    BRAIN RESEARCH, 2016, 1642 : 590 - 602
  • [22] Human and climate drivers of global biomass burning variability
    Chuvieco, Emilio
    Pettinari, M. Lucrecia
    Koutsias, Nikos
    Forkel, Matthias
    Hantson, Stijn
    Turco, Marco
    SCIENCE OF THE TOTAL ENVIRONMENT, 2021, 779
  • [23] A Local-Global Feature Fusing Method for Point Clouds Semantic Segmentation
    Bi, Yuanwei
    Zhang, Lujian
    Liu, Yaowen
    Huang, Yansen
    Liu, Hao
    IEEE ACCESS, 2023, 11 : 68776 - 68790
  • [24] Local conditions for global stability in the space of codons of the genetic code
    Salinas, Dino G.
    Gallardo, Mauricio O.
    Osorio, Manuel I.
    BIOSYSTEMS, 2016, 150 : 73 - 77
  • [25] Contrast preserving image decolorization combining global features and local semantic features
    Zhang, Xiaoli
    Liu, Shiguang
    VISUAL COMPUTER, 2018, 34 (6-8) : 1099 - 1108
  • [26] A Weighted Topic Model Learned From Local Semantic Space for Automatic Image Annotation
    Song, Haiyu
    Wang, Pengjie
    Yun, Jian
    Li, Wei
    Xue, Bo
    Wu, Gang
    IEEE ACCESS, 2020, 8 : 76411 - 76422
  • [27] Local-Global Multiscale Fusion Network for Semantic Segmentation of Buildings in SAR Imagery
    Zhou, Xuanyu
    Zhou, Lifan
    Zhang, Haizhen
    Ji, Wei
    Zhou, Bei
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 7410 - 7421
  • [28] A Novel Local-Global Graph Convolutional Method for Point Cloud Semantic Segmentation
    Du, Zijin
    Ye, Hailiang
    Cao, Feilong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 4798 - 4812
  • [29] Spatiotemporal variability of urban growth factors: A global and local perspective on the megacity of Mumbai
    Shafizadeh-Moghadam, Hossein
    Helbich, Marco
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2015, 35 : 187 - 198
  • [30] Global similarity with local differences in linkage disequilibrium between the Dutch and HapMap-CEU populations
    Pardo, Luba
    Bochdanovits, Zoltan
    de Geus, Eco
    Hottenga, Jouke J.
    Sullivan, Patrick
    Posthuma, Danielle
    Penninx, Brenda W. J. H.
    Boomsma, Dorret
    Heutink, Peter
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2009, 17 (06) : 802 - 810