Cluster-based patent retrieval using international patent classification system

被引:0
作者
Kim, Jungi [1 ]
Kang, In-Su [2 ]
Lee, Jong-Hyeok [1 ]
机构
[1] Pohang Univ Sci & Technol, Div Elect & Comp Engn, Adv Informat Technol Res Ctr, Pohang, South Korea
[2] KISTI, Informat Syst Res Lab, Seoul, South Korea
来源
COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD | 2006年 / 4285卷
关键词
cluster-based retrieval; patent retrieval; invalidity search; international patent classification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A patent collection provides a great test-bed for cluster-based information retrieval. International Patent Classification (IPC) system provides a hierarchical taxonomy with 5 levels of specificity. We regard IPC codes of patent applications as cluster information, manually assigned by patent officers according to their subjects. Such manual cluster provides advantages over automatically built clusters using document term similarities. There are previous researches that successfully apply cluster-based retrieval models using language modeling. We develop cluster-based language models that employ advantages of having manually clustered documents.
引用
收藏
页码:205 / +
页数:2
相关论文
共 15 条
  • [1] [Anonymous], P 27 INT ACM SIGIR C
  • [2] [Anonymous], P INT ACM SIGIR C RE
  • [3] [Anonymous], P 24 ANN INT ACM SIG, DOI DOI 10.1145/383952.384019
  • [4] [Anonymous], THESIS U TWENTE
  • [5] A MODEL OF CLUSTER-SEARCHING BASED ON CLASSIFICATION
    CROFT, WB
    [J]. INFORMATION SYSTEMS, 1980, 5 (03) : 189 - 195
  • [6] COMPARISON OF HIERARCHIC AGGLOMERATIVE CLUSTERING METHODS FOR DOCUMENT-RETRIEVAL
    ELHAMDOUCHI, A
    WILLETT, P
    [J]. COMPUTER JOURNAL, 1989, 32 (03) : 220 - 227
  • [7] Fall CJ., 2003, SIGIR FORUM, V37, P10, DOI [10.1145/945546.945547, DOI 10.1145/945546.945547]
  • [8] FUJII A, 2004, 4 NTCIR WORKSH M, P225
  • [9] KANG I, 2006, CLUSTER BASED PATENT
  • [10] KURLAND O, 2004, P 27 ANN INT ACM SIG