Local similarity and global variability characterize the semantic space of human languages

Cited by: 5
Authors
Lewis, Molly [1]
Cahill, Aoife [2]
Madnani, Nitin [3]
Evans, James [4, 5]
Affiliations
[1] Carnegie Mellon Univ, Psychol & Social & Decis Sci, Pittsburgh, PA 15213 USA
[2] Dataminr Inc, New York, NY 10016 USA
[3] Educ Testing Serv, Princeton, NJ 08541 USA
[4] Univ Chicago, Sociol & Data Sci, Chicago, IL 60637 USA
[5] Santa Fe Inst, Santa Fe, NM 87501 USA
Keywords
human cognition; language; semantics; culture; communication; BODY; CATEGORIES; ENGLISH; COLOR; SPECIFICITY; SENSITIVITY; EVOLUTION; PATTERNS; MEANINGS; REFLECTS;
DOI
10.1073/pnas.2300986120
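Chinese Library Classification number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract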
How does meaning vary across the world's languages? Scholars recognize the existence of substantial variability within specific domains, ranging from nature and color to kinship. The emergence of large language models enables a systems-level approach that directly characterizes this variability through comparison of word organization across semantic domains. Here, we show that meanings across languages manifest lower variability within semantic domains and greater variability between them, using models trained on both 1) large corpora of native language text comprising Wikipedia articles in 35 languages and also 2) Test of English as a Foreign Language (TOEFL) essays written by 38,500 speakers from the same native languages, which cluster into semantic domains. Concrete meanings vary less across languages than abstract meanings, but all vary with geographical, environmental, and cultural distance. By simultaneously examining local similarity and global difference, we harmonize these findings and provide a description of general principles that govern variability in semantic space across languages. In this way, the structure of a speaker's semantic space influences the comparisons cognitively salient to them, as shaped by their native language, and suggests that even successful bilingual communicators likely think with "semantic accents" driven by associations from their native language while writing English. These findings have dramatic implications for language education, cross-cultural communication, and literal translations, which are impossible not because the objects of reference are uncertain, but because associations, metaphors, and narratives interlink meanings in different, predictable ways from one language to another.
Pages: 10