Automatic users extraction from patents

Cited by: 15
Authors
Chiarello, Filippo [1]
Cimino, Andrea [2]
Fantoni, Gualtiero [3]
Dell'Orletta, Felice [2]
Affiliations
[1] Univ Pisa, Dept Energy Syst Terr & Construct Engn, Largo Lucio Lazzarino 2, I-56126 Pisa, Italy
[2] Italian Natl Res Council ILC, CNR, Inst Computat Linguist, Via G Moruzzi 1, Pisa, Italy
[3] Univ Pisa, Dept Mech Nucl & Prod Engn, Largo Lucio Lazzarino 2, I-56126 Pisa, Italy
Keywords
Patent analysis; Deep learning; Text mining; User of an invention
DOI
10.1016/j.wpi.2018.07.006
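Chinese Library Classification number
G25 [Library science, librarianship]; G35 [Information science, information work]
Discipline classification code
1205; 120501
Abstract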
Patents contain a large quantity of information that is usually neglected. This information is hidden beneath technical and juridical jargon, so many potential readers cannot take advantage of it. State-of-the-art natural language processing tools, and in particular named entity recognition tools, could be used to detect valuable concepts in patent documents. The purpose of the present research is to design a method capable of automatically detecting and extracting one of the multiple entities hidden in patents: the users of the invention. The method is based on a new approach tailored for user extraction that integrates state-of-the-art computational linguistics tools with a large knowledge base. Furthermore, the paper compares different machine learning algorithms with the twofold aim of achieving the highest recall and evaluating performance in terms of precision and computational effort. Finally, a case study on two patent sets was conducted to evaluate the effectiveness and the output of the entire tool-chain.
Pages: 28-38
Number of pages: 11