Assessment of Pharmaceutical Patent Novelty with Siamese Neural Networks

Times cited: 2
Authors
El-Shimy, Heba [1]
Zantout, Hind [1]
Hassen, Hani Ragab [1]
Affiliations
[1] Heriot Watt Univ, Dubai, U Arab Emirates
Source
ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, ANNPR 2022 | 2023 / Vol. 13739
Keywords
Document analysis; Siamese neural networks; CNN; LSTM; Optical character recognition; Pharmaceutical patents; Chemical structure extraction; SYSTEM; INDUSTRY
DOI
10.1007/978-3-031-20650-4_12
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Patents in the pharmaceutical field fulfil an important role as they contain details of the final product that is the culmination of years of research and possibly millions of dollars of investment. It is crucial that both patent producers and consumers are able to assess the novelty of such patents and perform basic processing on them. In this work, we review approaches in the literature on patent analysis and novelty assessment, ranging from basic digitisation to deep learning-based approaches including natural language processing, image processing and chemical structure extraction. We propose a system that automates patent novelty assessment using Siamese neural networks for similarity detection. Our system showed promising results and has the potential to improve upon current patent analysis methods, specifically in the pharmaceutical field, by approaching the task not only from a natural language processing perspective but also by adding image analysis and adaptations for chemical structure extraction.
Pages: 140-155
Number of pages: 16