Large language model for patent concept generation

被引:0
作者
Ren, Runtao [1 ]
Ma, Jian [1 ]
Luo, Jianxi [2 ]
机构
[1] City Univ Hong Kong, Dept Informat Syst, Kowloon Tong, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Syst Engn, Kowloon Tong, Hong Kong, Peoples R China
关键词
Generative AI; Large language model; Finetuning; Patent;
D O I
10.1016/j.aei.2025.103301
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In traditional innovation practices, concept and IP generation are often iteratively integrated. Both processes demand an intricate understanding of advanced technical domain knowledge. Existing large language models (LLMs), while possessing massive pre-trained knowledge, often fall short in the innovative concept generation due to a lack of specialized knowledge necessary for the generation. To bridge this critical gap, we propose a novel knowledge finetuning (KFT) framework to endow LLM-based AI with the ability to autonomously mine, understand, and apply domain-specific knowledge and concepts for invention generation, i.e., concept and patent generation together. Our proposed PatentGPT integrates knowledge injection pre-training (KPT), domainspecific supervised finetuning (SFT), and reinforcement learning from human feedback (RLHF). Extensive evaluation shows that PatentGPT significantly outperforms the state-of-the-art models on patent-related benchmark tests. Our method not only provides new insights into data-driven innovation but also paves a new path to fine-tune LLMs for applications in the context of technology. We also discuss the managerial and policy implications of AI-generating inventions in the future.
引用
收藏
页数:16
相关论文
共 49 条
  • [1] [Anonymous], About us
  • [2] [Anonymous], 2011, Geographical Indication Protection in the United States
  • [3] Infusing internalized knowledge of language models into hybrid prompts for knowledgeable dialogue generation
    Bai, Jiaqi
    Yan, Zhao
    Zhang, Shun
    Yang, Jian
    Guo, Hongcheng
    Li, Zhoujun
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 296
  • [4] PatentSBERTa: A deep NLP based hybrid model for patent distance and classification using augmented SBERT
    Bekamiri, Hamid
    Hain, Daniel S.
    Jurowetzki, Roman
    [J]. TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2024, 206
  • [5] From PARIS to LE-PARIS: toward patent response automation with recommender systems and collaborative large language models
    Chu, Jung-Mei
    Lo, Hao-Cheng
    Hsiang, Jieh
    Cho, Chun-Chieh
    [J]. ARTIFICIAL INTELLIGENCE AND LAW, 2024,
  • [6] claude, About us
  • [7] Structured information extraction from scientific text with large language models
    Dagdelen, John
    Dunn, Alexander
    Lee, Sanghoon
    Walker, Nicholas
    Rosen, Andrew S.
    Ceder, Gerbrand
    Persson, Kristin A.
    Jain, Anubhav
    [J]. NATURE COMMUNICATIONS, 2024, 15 (01)
  • [8] Drexl J., 2021, A Position Statement, V7, P21
  • [9] Toward understanding the impact of artificial intelligence on labor
    Frank, Morgan R.
    Autor, David
    Bessen, James E.
    Brynjolfsson, Erik
    Cebrian, Manuel
    Deming, David J.
    Feldman, Maryann
    Groh, Matthew
    Lobo, Jose
    Moro, Esteban
    Wang, Dashun
    Youn, Hyejin
    Rahwan, Iyad
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (14) : 6531 - 6539
  • [10] Text and Dynamic Network Analysis for Measuring Technological Convergence: A Case Study on Defense Patent Data
    Giordano, Vito
    Chiarello, Filippo
    Melluso, Nicola
    Fantoni, Gualtiero
    Bonaccorsi, Andrea
    [J]. IEEE TRANSACTIONS ON ENGINEERING MANAGEMENT, 2023, 70 (04) : 1490 - 1503