Large language model for patent concept generation

Cited by: 0

Authors:

Ren, Runtao [1]

Ma, Jian [1]

Luo, Jianxi [2]

Affiliations:

[1] City Univ Hong Kong, Dept Informat Syst, Kowloon Tong, Hong Kong, Peoples R China

[2] City Univ Hong Kong, Dept Syst Engn, Kowloon Tong, Hong Kong, Peoples R China

Source:

ADVANCED ENGINEERING INFORMATICS | 2025 / Vol. 65

Keywords:

Generative AI; Large language model; Finetuning; Patent;

DOI:

10.1016/j.aei.2025.103301

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In traditional innovation practices, concept and IP generation are often iteratively integrated. Both processes demand an intricate understanding of advanced technical domain knowledge. Existing large language models (LLMs), while possessing massive pre-trained knowledge, often fall short in innovative concept generation because they lack the specialized knowledge it requires. To bridge this critical gap, we propose a novel knowledge finetuning (KFT) framework to endow LLM-based AI with the ability to autonomously mine, understand, and apply domain-specific knowledge and concepts for invention generation, i.e., concept and patent generation together. Our proposed PatentGPT integrates knowledge injection pre-training (KPT), domain-specific supervised finetuning (SFT), and reinforcement learning from human feedback (RLHF). Extensive evaluation shows that PatentGPT significantly outperforms the state-of-the-art models on patent-related benchmark tests. Our method not only provides new insights into data-driven innovation but also paves a new path to fine-tune LLMs for applications in the context of technology. We also discuss the managerial and policy implications of AI-generated inventions in the future.

Pages: 16