Quo Vadis ChatGPT? From large language models to Large Knowledge Models

被引:6
作者
Venkatasubramanian, Venkat [1 ]
Chakraborty, Arijit [1 ]
机构
[1] Columbia Univ, Dept Chem Engn, Complex Resilient Intelligent Syst Lab, New York, NY 10027 USA
关键词
Artificial intelligence; Large language model; Knowledge graph; Domain-specific language processing; Machine learning; Hybrid AI; PHARMACEUTICAL PRODUCT DEVELOPMENT; HYBRID EXPERT SYSTEM; HAZOP ANALYSIS TOOL; CHEMICAL-PROCESSES; ARTIFICIAL-INTELLIGENCE; CATALYST SELECTION; PART I; FRAMEWORK; DESIGN; IDENTIFICATION;
D O I
10.1016/j.compchemeng.2024.108895
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The startling success of ChatGPT and other large language models (LLMs) using transformer-based generative neural network architecture in applications such as natural language processing and image synthesis has many researchers excited about potential opportunities in process systems engineering (PSE). The almost human-like performance of LLMs in these areas is indeed very impressive, surprising, and a major breakthrough. Their capabilities are very useful in certain tasks, such as writing first drafts of documents, code writing assistance, text summarization, etc. However, their success is limited in highly scientific domains as they cannot yet reason, plan, or explain due to their lack of in-depth mechanistic domain knowledge. This is a problem in domains such as chemical engineering as they are governed by fundamental laws of physics and chemistry (and biology), constitutive relations, and highly technical knowledge about materials, processes, and systems. Although purely data-driven machine learning has its immediate uses, the long-term success of AI in scientific and engineering domains would depend on developing hybrid AI systems that combine first principles and technical knowledge effectively. We call these hybrid AI systems Large Knowledge Models (LKMs), as they will not be limited to only NLP-based techniques or NLP-like applications. In this paper, we discuss the challenges and opportunities in developing such systems in chemical engineering.
引用
收藏
页数:12
相关论文
共 136 条
[1]  
Abdin M, 2024, Arxiv, DOI arXiv:2404.14219
[2]   APPLICATIONS OF MATRIX MATHEMATICS TO CHEMICAL ENGINEERING PROBLEMS [J].
ACRIVOS, A ;
AMUNDSON, NR .
INDUSTRIAL AND ENGINEERING CHEMISTRY, 1955, 47 (08) :1533-1541
[3]  
AIChE, 2019, Venkat Venkatasubramanian on Artificial Intelligence in Chemical Engineering
[4]  
Aldea A., 2003, IIWEB, P177
[5]   MORE IS DIFFERENT - BROKEN SYMMETRY AND NATURE OF HIERARCHICAL STRUCTURE OF SCIENCE [J].
ANDERSON, PW .
SCIENCE, 1972, 177 (4047) :393-&
[6]  
[Anonymous], 2024, Llama 3 model card
[7]  
[Anonymous], 2023, Stability AI launches the first of its stablelm suite of language models
[8]  
[Anonymous], 2022, BLOOM 176B PARAMETER, DOI DOI 10.48550/ARXIV.2211.05100
[9]   MolGPT: Molecular Generation Using a Transformer-Decoder Model [J].
Bagal, Viraj ;
Aggarwal, Rishal ;
Vinod, P. K. ;
Priyakumar, U. Deva .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (09) :2064-2076
[10]  
BANARESALCANTARA R, 1985, CHEM ENG PROG, V81, P25