Completing keyword patent search with semantic patent search: introducing a semiautomatic iterative method for patent near search based on semantic similarities

被引:0
作者
Ansgar Moeller
Martin G. Moehrle
机构
[1] University of Bremen,Institute of Project Management and Innovation
来源
Scientometrics | 2015年 / 102卷
关键词
Patent search; Keyword search; Semantic search; Text mining; Similarity measurement; -grams;
D O I
暂无
中图分类号
学科分类号
摘要
Patent search is a substantial basis for many operational questions and scientometric evaluations. We consider it as a sequence of distinct stages. The “patent wide search” involves a definition of system boundaries by means of classifications and a keyword search producing a patent set with a high recall level (see Schmitz in Patentinformetrie: Analyse und Verdichtung von technischen Schutzrechtsinformationen, DGI, Frankfurt (Main), 2010 with an overview of searchable patent meta data). In this set of patents a “patent near search” takes place, producing a patent set with high(er) precision. Hence, the question arises how the researcher has to operate within this patent set to efficiently identify patents that contain paraphrased descriptions of the sought inventive elements in contextual information and whether this produces different results compared to a conventional search. We present a semiautomatic iterative method for the identification of such patents, based on semantic similarity. In order to test our method we generate an initial dataset in the course of a patent wide search. This dataset is then analyzed by means of the semiautomatic iterative method as well as by an alternative method emulating the conventional process of keyword refinement. It thus becomes obvious that both methods have their particular “raison d’être”, and that the semiautomatic iterative method seems to be able to support a conventional patent search very effectively.
引用
收藏
页码:77 / 96
页数:19
相关论文
共 67 条
  • [1] Aamodt A(1994)Case-based reasoning: foundational issues, methodological variations, and system approaches AI Communications 7 39-59
  • [2] Plaza E(2012)A study of scientometric methods to identify emerging technologies via modeling of milestones Scientometrics 91 327-342
  • [3] Abercrombie RK(2013)A hybrid keyword and patent class methodology for selecting relevant sets of patents for a technological field Scientometrics 96 69-82
  • [4] Udoeyop AW(2002)Using patent citation analysis to target/value M&A candidates Research Technology Management 45 28-36
  • [5] Schlicher BG(2012)A survey of automatic query expansion in information retrieval ACM Computing Surveys 44 1-50
  • [6] Benson CL(2004)Natural language processing of patents and technical documentation Lecture Notes in Computer Science 3163 508-520
  • [7] Magee CL(2008)Measuring patent similarity by comparing inventions functional trees Computer-Aided Innovation (CAI) 277 31-42
  • [8] Breitzmann A(2011)SAO network analysis of patents for technology trends identification: A case study of polymer electrolyte membrane technology in proton exchange membrane fuel cells Scientometrics 88 863-883
  • [9] Thomas P(2011)A guide to efficient keyword, sequence and classification search strategies for biopharmaceutical drug-centric patent landscape searches—A human recombinant insulin patent landscape case study World Patent Information 33 128-143
  • [10] Carpineto C(2011)A context-aware semantic similarity model for ontology environments Concurrency and Computation: Practice and Experience 23 505-524