Automated patent landscaping

被引:36
作者
Abood, Aaron [1 ]
Feltenberger, Dave [1 ]
机构
[1] Google Inc, Mountain View, CA 94043 USA
关键词
Patent landscape; Classification; Text analytics; Semi-supervised machine learning;
D O I
10.1007/s10506-018-9222-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Patent landscaping is the process of finding patents related to a particular topic. It is important for companies, investors, governments, and academics seeking to gauge innovation and assess risk. However, there is no broadly recognized best approach to landscaping. Frequently, patent landscaping is a bespoke human-driven process that relies heavily on complex queries over bibliographic patent databases. In this paper, we present Automated Patent Landscaping, an approach that jointly leverages human domain expertise, heuristics based on patent metadata, and machine learning to generate high-quality patent landscapes with minimal effort. In particular, this paper describes a flexible automated methodology to construct a patent landscape for a topic based on an initial seed set of patents. This approach takes human-selected seed patents that are representative of a topic, such as operating systems, and uses structure inherent in patent data such as references and class codes to "expand'' the seed set to a set of "probably-related'' patents and anti-seed "probably-unrelated'' patents. The expanded set of patents is then pruned with a semi-supervised machine learning model trained on seed and anti-seed patents. This removes patents from the expanded set that are unrelated to the topic and ensures a comprehensive and accurate landscape.
引用
收藏
页码:103 / 125
页数:23
相关论文
共 27 条
[1]  
[Anonymous], 2012, Technical Report
[2]  
[Anonymous], DISRUPTIVE COMPETITI
[3]  
[Anonymous], 2015, arXiv: Learning
[4]   Augmented implicitly restarted Lanczos bidiagonalization methods [J].
Baglama, J ;
Reichel, L .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2005, 27 (01) :19-42
[5]  
Blum A, 2006, LECT NOTES COMPUT SC, V3940, P52
[6]  
Cheng H.-T., 2016, P 1 WORKSH DEEP LEAR
[7]   Patents, Thickets and the Financing of Early-Stage Firms: Evidence from the Software Industry [J].
Cockburn, Iain M. ;
MacGarvie, Megan J. .
JOURNAL OF ECONOMICS & MANAGEMENT STRATEGY, 2009, 18 (03) :729-773
[8]   A multilinear singular value decomposition [J].
De Lathauwer, L ;
De Moor, B ;
Vandewalle, J .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2000, 21 (04) :1253-1278
[9]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[10]  
DEERWESTER S, 1990, J AM SOC INFORM SCI, V41, P391, DOI 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO