Capturing the structures in association knowledge: Application of network analyses to large-scale databases of Japanese word associations

Cited by: 2
Authors
Joyce, Terry [1]
Miyake, Maki [2]
Affiliations
[1] Tama Univ, Sch Global Studies, 802 Engyo, Kanagawa 2520805, Japan
[2] Osaka Univ, Grad Sch Language & Culture, Osaka 5600043, Japan
Source
LARGE-SCALE KNOWLEDGE RESOURCES: CONSTRUCTION AND APPLICATION | 2008 / Vol. 4938
Keywords
association knowledge; lexical knowledge; network analyses; large-scale databases of Japanese word associations; Associative Concept Dictionary (ACD); Japanese Word Association Database (JWAD); association network representations; graph clustering; Markov clustering (MCL); recurrent Markov clustering (RMCL); modularity
DOI
10.1007/978-3-540-78159-2_12
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Within the general enterprise of probing into the complexities of lexical knowledge, one particularly promising research focus is on word association knowledge. Given Deese's [1] and Cramer's [2] convictions that word associations closely mirror the structured patterns of relations that exist among concepts, as largely echoed in Hirst's [3] more recent comments about the close relationships between lexicons and ontologies, as well as Firth's [4] remarks about finding a word's meaning in the company it keeps, efforts to capture and unravel the rich networks of associations that connect words together are likely to yield interesting insights into the nature of lexical knowledge. Adopting such an approach, this paper applies a range of network analysis techniques in order to investigate the characteristics of network representations of word association knowledge in Japanese. Specifically, two separate association networks are constructed from two different large-scale databases of Japanese word associations: the Associative Concept Dictionary (ACD) by Okamoto and Ishizaki [5] and the Japanese Word Association Database (JWAD) by Joyce [6] [7] [8]. Results of basic statistical analyses of the association networks indicate that both are scale-free with small-world properties and that both exhibit hierarchical organization. As effective methods of discerning associative structures within networks, some graph clustering algorithms are also applied. In addition to the basic Markov Clustering (MCL) algorithm proposed by van Dongen [9], the present study also employs a recently proposed combination of the enhanced Recurrent Markov Cluster algorithm (RMCL) [10] with an index of modularity [11]. Clustering results show that the RMCL and modularity combination provides effective control over cluster sizes.
The results also demonstrate the effectiveness of graph clustering approaches to capturing the structures within large-scale association knowledge resources, such as the two constructed networks of Japanese word associations.
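
For readers unfamiliar with the basic Markov Clustering step named in the abstract, the following is a minimal pure-Python sketch of the general MCL idea (alternating expansion and inflation of a column-stochastic matrix). It is not the authors' implementation and does not cover RMCL or the modularity index. The function names, the inflation value of 2.0, the self-loop weight, the pruning threshold, and the toy graph are all illustrative assumptions.

```python
def normalize_columns(matrix):
    """Rescale each column so that it sums to 1 (column-stochastic)."""
    n = len(matrix)
    out = [row[:] for row in matrix]
    for j in range(n):
        total = sum(matrix[i][j] for i in range(n))
        if total > 0:
            for i in range(n):
                out[i][j] = matrix[i][j] / total
    return out


def matmul(a, b):
    """Plain dense matrix product for small square matrices."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def markov_cluster(adjacency, inflation=2.0, max_iter=100, tol=1e-9):
    """Basic MCL sketch: returns clusters as sorted lists of node indices.

    `inflation`, `max_iter`, and `tol` are assumed defaults, not values
    taken from the paper.
    """
    n = len(adjacency)
    # Add self-loops (weight 1.0 is an assumption), then normalize columns.
    m = [[float(adjacency[i][j]) + (1.0 if i == j else 0.0) for j in range(n)]
         for i in range(n)]
    m = normalize_columns(m)
    for _ in range(max_iter):
        expanded = matmul(m, m)  # expansion: two-step random walks
        inflated = normalize_columns(
            [[v ** inflation for v in row] for row in expanded]
        )  # inflation: strengthen strong flows, weaken weak ones
        delta = max(abs(inflated[i][j] - m[i][j])
                    for i in range(n) for j in range(n))
        m = inflated
        if delta < tol:
            break
    # Each row that retains flow defines a cluster: the columns it attracts.
    clusters = set()
    for i in range(n):
        members = frozenset(j for j in range(n) if m[i][j] > 1e-6)
        if members:
            clusters.add(members)
    return sorted(sorted(c) for c in clusters)


# Toy undirected graph (hypothetical): two disconnected triangles.
example = [
    [0, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [1, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 1],
    [0, 0, 0, 1, 0, 1],
    [0, 0, 0, 1, 1, 0],
]
print(markov_cluster(example))  # [[0, 1, 2], [3, 4, 5]]
```

In this sketch, a larger inflation value tends to produce smaller, more numerous clusters, which is the cluster-size sensitivity that the abstract says the RMCL and modularity combination helps to control.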
Pages: 116 / +
Number of pages: 3