Structure clustering for Chinese patent documents

Cited by: 19
Authors
Huang, Su-Hsien [1, 5]
Ke, Hao-Ren [2, 3]
Yang, Wei-Pang [4]
Affiliations
[1] Natl Chiao Tung Univ, Inst Comp Sci & Engn, Hsinchu, Taiwan
[2] Natl Chiao Tung Univ, Lib Informat Management, Hsinchu, Taiwan
[3] Natl Chiao Tung Univ, Inst Informat Management, Hsinchu, Taiwan
[4] Natl Dong Hwa Univ, Dept Informat Management, Shoufeng, Hualien, Taiwan
[5] Minghsin Univ Sci & Technol, Dept Informat Management, Hsinchu, Taiwan
Keywords
structure clustering; Chinese patent; structure expression; metadata
DOI
10.1016/j.eswa.2007.03.012
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper aims to cluster Chinese patent documents by their structure. Both the explicit and implicit structures of a patent are analyzed and represented with the proposed structure expression. An unsupervised clustering algorithm, the structured self-organizing map (SOM), is then adopted to cluster Chinese patent documents that are similar in both content and structure. Structured SOM clusters the similar content of each sub-part of the structure and then propagates the similarity to the upper levels. Experimental results showed that computing time is proportional to the map size and the number of patents, which implies that the width and depth of the structure affect the performance of structured SOM. Structured clustering of patents is helpful in many applications. In a copyright lawsuit, a company can more easily find conflicting claims in existing patents to rebut the accusation. Moreover, a company's decision-makers can be advised to avoid hot-spot areas of patents, which can save considerable R&D effort. (c) 2007 Elsevier Ltd. All rights reserved.
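The two-level idea in the abstract (cluster each sub-part, then pass the result up to the document level) can be illustrated with a minimal sketch. This is not the authors' structured SOM: it is a simplified stand-in that trains one small SOM per sub-part and feeds the winning grid coordinates of each sub-part into an upper-level SOM. The sub-part names (`abstract`, `claims`), the toy vectors, the grid size, and the function names `train_som`, `best_matching_unit`, and `structured_cluster` are all assumptions made for illustration.

```python
import math
import random


def best_matching_unit(weights, v):
    """Return the grid coordinate whose weight vector is closest to v."""
    return min(weights,
               key=lambda n: sum((a - b) ** 2 for a, b in zip(weights[n], v)))


def train_som(vectors, rows, cols, epochs=50, seed=0):
    """Train a small rectangular SOM; returns {(row, col): weight vector}."""
    rng = random.Random(seed)
    dim = len(vectors[0])
    weights = {(r, c): [rng.random() for _ in range(dim)]
               for r in range(rows) for c in range(cols)}
    for epoch in range(epochs):
        frac = 1.0 - epoch / epochs              # decays from 1 towards 0
        lr = 0.5 * frac + 0.01                   # learning rate
        radius = max(rows, cols) / 2.0 * frac + 0.5
        for v in vectors:
            bmu = best_matching_unit(weights, v)
            for node, w in weights.items():
                d2 = (node[0] - bmu[0]) ** 2 + (node[1] - bmu[1]) ** 2
                h = math.exp(-d2 / (2.0 * radius ** 2))  # neighbourhood weight
                for i in range(dim):
                    w[i] += lr * h * (v[i] - w[i])
    return weights


def structured_cluster(docs, rows=2, cols=2):
    """docs: list of dicts mapping a sub-part name to its content vector."""
    parts = sorted(docs[0])
    # Lower level: one SOM per sub-part (assumed sub-parts, not the paper's).
    part_soms = {p: train_som([d[p] for d in docs], rows, cols) for p in parts}
    # Propagation: a document is described by where its sub-parts landed.
    upper_inputs = []
    for d in docs:
        vec = []
        for p in parts:
            r, c = best_matching_unit(part_soms[p], d[p])
            vec += [r / max(rows - 1, 1), c / max(cols - 1, 1)]
        upper_inputs.append(vec)
    # Upper level: cluster documents by the positions of their sub-parts.
    top = train_som(upper_inputs, rows, cols)
    return [best_matching_unit(top, v) for v in upper_inputs]


# Toy data (hypothetical): four "patents" with two sub-parts each.
docs = [
    {"abstract": [1.0, 0.0], "claims": [1.0, 0.0, 0.0]},
    {"abstract": [1.0, 0.0], "claims": [1.0, 0.0, 0.0]},
    {"abstract": [0.0, 1.0], "claims": [0.0, 0.0, 1.0]},
    {"abstract": [0.1, 0.9], "claims": [0.0, 0.1, 0.9]},
]
labels = structured_cluster(docs)
print(labels)  # one upper-level grid cell per document
```

Documents whose sub-parts are identical always receive the same upper-level cell in this sketch, because every step is deterministic for a fixed seed. The cost grows with the grid size, the number of documents, and the number of sub-parts, which is in line with the scaling behaviour the abstract reports.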
Pages: 2290-2297
Number of pages: 8