A Review on the Determination of Semantic Similarity of Patent Documents

被引:0
作者
Kayakoku, Ahmet [1 ,2 ]
Tufekci, Aslihan [1 ]
机构
[1] Gazi Univ, Yonetim Bilisim Sistemleri, Bilisim Enstitusu, Ankara, Turkiye
[2] Turk Patent & Marka Kurumu, Ankara, Turkiye
来源
JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI | 2025年 / 28卷 / 02期
关键词
patent; patent mining; text mining; semantic similarity; patentability; deep learning;
D O I
10.2339/politeknik.1294789
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Patent mining studies are gaining importance in the processing of patent data, which contains the most up-to-date technical information, has a great potential in terms of information discovery with its high volume, and plays a key role in the field of technology management. Although all the structured or unstructured data in the patent data are important, the most critical goal of patent mining studies is to determine the semantic similarity of patent documents. With the semantic similarity detection of patents, it will be possible to automatically determine the patentability criteria, which is the most difficult and time-consuming phase of the patent application process. In this article, the studies carried out in the literature about determination the semantic similarity of patent documents were examined and the methods used were categorized into two groups: text mining and deep learning -based methods. Set theory approaches, vector space model approaches or ontology etc. are used to determine how similar patent texts are to each other by structuralizing them with text mining methods. There are approaches that make use of information sources. However, due to the complex structure and unique terminology of patent texts, the targeted efficiency cannot be obtained from these methods. In order to overcome this deficiency, artificial intelligence methods, which have shown great success in every field they are used, should also be utilized in the semantic comparison of patent texts. Especially with the high capacity of deep learning methods, a significant distance can be achieved in determining the patentability criteria, which is a complex problem. Although studies have been carried out in this area, models that can effectively detect patentability are still in their infancy. The objective of this study is to demonstrate that developing a qualified model capable of conducting patentability assessments and automatically generating patent search reports will be a significant step towards fulfilling the substantial needs in the field of technology management.
引用
收藏
页数:21
相关论文
共 55 条
[1]   An improved patent similarity measurement based on entities and semantic relations [J].
An, Xin ;
Li, Jinghong ;
Xu, Shuo ;
Chen, Liang ;
Sun, Wei .
JOURNAL OF INFORMETRICS, 2021, 15 (02)
[2]  
Aras H., 2018, P INT C SEM SYST SEM, P1
[3]   The state-of-the-art on Intellectual Property Analytics (IPA): A literature review on artificial intelligence, machine learning and deep learning methods for analysing intellectual property (IP) data [J].
Aristodemou, Leonidas ;
Tietze, Frank .
WORLD PATENT INFORMATION, 2018, 55 :37-51
[4]   Text matching to measure patent similarity [J].
Arts, Sam ;
Cassiman, Bruno ;
Carlos Gomez, Juan .
STRATEGIC MANAGEMENT JOURNAL, 2018, 39 (01) :62-84
[5]   Towards a general theory of similarity and association measures: Similarity, dissimilarity and correlation functions [J].
Batyrshin, Ildar .
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (04) :2977-3004
[6]   Evaluating the risk of patent infringement by means of semantic patent analysis: the case of DNA chips [J].
Bergmann, Isumo ;
Butzke, Daniel ;
Walter, Lothar ;
Fuerste, Jens P. ;
Moehrle, Martin G. ;
Erdmann, Volker A. .
R & D MANAGEMENT, 2008, 38 (05) :550-562
[7]   Review of the state-of-the-art in patent information and forthcoming evolutions in intelligent patent informatics [J].
Bonino, Dario ;
Ciaramella, Alberto ;
Corno, Fulvio .
WORLD PATENT INFORMATION, 2010, 32 (01) :30-38
[8]   Evolution of Semantic Similarity-A Survey [J].
Chandrasekaran, Dhivya ;
Mago, Vijay .
ACM COMPUTING SURVEYS, 2021, 54 (02)
[9]   A deep learning based method for extracting semantic information from patent documents [J].
Chen, Liang ;
Xu, Shuo ;
Zhu, Lijun ;
Zhang, Jing ;
Lei, Xiaoping ;
Yang, Guancan .
SCIENTOMETRICS, 2020, 125 (01) :289-312
[10]   A two-stage deep learning-based system for patent citation recommendation [J].
Choi, Jaewoong ;
Lee, Jiho ;
Yoon, Janghyeok ;
Jang, Sion ;
Kim, Jaeyoung ;
Choi, Sungchul .
SCIENTOMETRICS, 2022, 127 (11) :6615-6636