Automatic Metadata Generation Using Associative Networks

Cited by: 19
Authors
Rodriguez, Marko A. [1]
Bollen, Johan
Van de Sompel, Herbert
Affiliations
[1] Los Alamos Natl Lab, Digital Lib Res, Los Alamos, NM 87545 USA
Keywords
Algorithms; Experimentation; Associative networks; particle-swarms; metadata generation; spreading activation; retrieval
DOI
10.1145/1462198.1462199
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In spite of its tremendous value, metadata is generally sparse and incomplete, thereby hampering the effectiveness of digital information services. Many of the existing mechanisms for the automated creation of metadata rely primarily on content analysis, which can be costly and inefficient. The automatic metadata generation system proposed in this article leverages resource relationships generated from existing metadata as a medium for propagation from metadata-rich to metadata-poor resources. Because of its independence from content analysis, it can be applied to a wide variety of resource media types and is shown to be computationally inexpensive. The proposed method operates through two distinct phases. First, occurrence and co-occurrence algorithms generate an associative network of repository resources by leveraging existing repository metadata. Second, using the associative network as a substrate, metadata associated with metadata-rich resources is propagated to metadata-poor resources by means of a discrete-form spreading activation algorithm. This article discusses the general framework for building associative networks, an algorithm for disseminating metadata through such networks, and the results of an experiment and validation of the proposed method using a standard bibliographic dataset.
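The two phases described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify edge weights, decay, or stopping rules, so the shared-value edge weight, the `decay` and `steps` parameters, the function names, and the toy records are all assumptions. It also spreads energy deterministically in proportion to edge weight, as a simplified stand-in for the discrete, particle-based spreading activation the article names.

```python
from collections import defaultdict
from itertools import combinations


def build_cooccurrence_network(records, field):
    """Phase 1 (assumed form): link resources that share a value in `field`.

    The edge weight is the number of shared values.
    """
    by_value = defaultdict(set)
    for rid, meta in records.items():
        for value in meta.get(field, ()):
            by_value[value].add(rid)
    weights = defaultdict(lambda: defaultdict(float))
    for rids in by_value.values():
        for a, b in combinations(sorted(rids), 2):
            weights[a][b] += 1.0
            weights[b][a] += 1.0
    return {a: dict(nbrs) for a, nbrs in weights.items()}


def propagate(weights, records, field, steps=2, decay=0.5):
    """Phase 2 (assumed form): push each resource's `field` values outward.

    Energy starts at 1.0 on the source, is split across neighbours in
    proportion to edge weight, and is multiplied by `decay` on every hop.
    """
    scores = defaultdict(lambda: defaultdict(float))
    for source, meta in records.items():
        values = meta.get(field, ())
        if not values:
            continue  # metadata-poor resources have nothing to emit
        frontier = {source: 1.0}
        for _ in range(steps):
            next_frontier = defaultdict(float)
            for node, energy in frontier.items():
                nbrs = weights.get(node, {})
                total = sum(nbrs.values())
                if total == 0:
                    continue
                for nbr, w in nbrs.items():
                    next_frontier[nbr] += decay * energy * w / total
            for node, energy in next_frontier.items():
                if node != source:
                    for value in values:
                        scores[node][value] += energy
            frontier = next_frontier
    return {node: dict(vals) for node, vals in scores.items()}


# Hypothetical toy repository: r3 has authors but no keywords.
records = {
    "r1": {"authors": ["A", "B"], "keywords": ["metadata"]},
    "r2": {"authors": ["B", "C"], "keywords": ["retrieval"]},
    "r3": {"authors": ["C"], "keywords": []},
}
network = build_cooccurrence_network(records, "authors")
scores = propagate(network, records, "keywords")
ranked = sorted(scores["r3"].items(), key=lambda kv: -kv[1])
print(ranked)
```

In this toy run, the keyword-poor resource `r3` receives candidate keywords from `r2` (a direct co-author neighbour) and, more weakly, from `r1` (two hops away), so the ranking reflects network distance without any content analysis.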
Pages: 20