Construction and Application of a Large-Scale Chinese Abstractness Lexicon Based on Word Similarity

Authors
Xu, Huidan [1]
Yang, Lijiao [1]
Affiliations
[1] Beijing Normal Univ, Beijing 100875, Peoples R China
Source
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II | 2022, Vol. 13552
Keywords
Word abstractness; Chinese lexicon; Word similarity-based; CONCRETENESS; NORMS; RATINGS;
DOI
10.1007/978-3-031-17189-5_10
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As an important semantic feature, abstractness has been widely studied in linguistics, psychology, cognitive science, and other fields. Abstractness lexicons have been constructed for many languages, but no large-scale, high-quality abstractness lexicon exists for Chinese. Because manual construction is time-consuming and costly, we use existing resources with human abstractness scores as source data and adopt a word similarity-based approach to automatically construct a large-scale Chinese abstractness lexicon. We evaluate the quality of the constructed lexicon by comparing it with expert knowledge and previous work. The comparison verifies that the lexicon is roughly consistent with human cognition and can provide reliable abstractness ratings for words. Finally, the performance of the lexicon in two research tasks, cross-language comparison and automatic evaluation of Chinese text readability, shows that word abstractness is an important feature for investigating cognitive differences and text complexity. The large-scale Chinese abstractness lexicon constructed in this paper has important application value.
Pages: 122-130
Page count: 9