Using query logs of USPTO patent examiners for automatic query expansion in patent searching

被引:1
作者
Wolfgang Tannebaum
Andreas Rauber
机构
[1] Vienna University of Technology,Institute of Software Technology and Interactive Systems
来源
Information Retrieval | 2014年 / 17卷
关键词
Patent searching; Query expansion; Query log analysis;
D O I
暂无
中图分类号
学科分类号
摘要
In the patent domain significant efforts are invested to assist researchers in formulating better queries, preferably via automated query expansion. Currently, automatic query expansion in patent search is mostly limited to computing co-occurring terms for the searchable features of the invention. Additional query terms are extracted automatically from patent documents based on entropy measures. Learning synonyms in the patent domain for automatic query expansion has been a difficult task. No dedicated sources providing synonyms for the patent domain, such as patent domain specific lexica or thesauri, are available. In this paper we focus on the highly professional search setting of patent examiners. In particular, we use query logs to learn synonyms for the patent domain. For automatic query expansion, we create term networks based on the query logs specifically for several USPTO patent classes. Experiments show good performance in automatic query expansion using these automatically generated term networks. Specifically, with a larger number of query logs for a specific patent US class available the performance of the learned term networks increases.
引用
收藏
页码:452 / 470
页数:18
相关论文
共 11 条
  • [1] Clough P(2009)Report on the treble CLEF query log analysis workshop 2009 SIGIR Forum 43 71-77
  • [2] Berendt B(2007)Technology survey and invalidity search: An comparative study of different tasks for Japanese patent document retrieval Information Processing and Management, An International Journal 42 1154-1172
  • [3] Fujita S(2013)When do people use query suggestion? A query suggestion log analysis Information Retrieval 16 1-22
  • [4] Kato M.(2009)A new query expansion method based on query logs mining International Journal on Asian Language Processing 19 1-12
  • [5] Sakai T.(1995)WordNet: A lexical database for english Communications of the ACM 38 39-41
  • [6] Tanaka K.(2010)Mining query logs: Turning search usage data into knowledge Foundations and Trends in Information Retrieval 4 1-174
  • [7] Kunpeng Z(undefined)undefined undefined undefined undefined-undefined
  • [8] Xiaolong W(undefined)undefined undefined undefined undefined-undefined
  • [9] Yuanchao L(undefined)undefined undefined undefined undefined-undefined
  • [10] Miller G(undefined)undefined undefined undefined undefined-undefined