A Comparative Study on Machine Learning algorithms for Knowledge Discovery

被引:2
作者
Suseela, Siddesh Sambasivam [1 ]
Feng, Yang [2 ]
Mao, Kezhi [3 ]
机构
[1] Nanyang Technol Univ, IHPC, A STAR, Singapore, Singapore
[2] ASTAR, IHPC, Singapore, Singapore
[3] Nanyang Technol Univ, Singapore, Singapore
来源
2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV) | 2022年
关键词
Knowledge discovery; symbolic regression; Sparse regression; Machine Learning;
D O I
10.1109/ICARCV57592.2022.10004302
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For centuries, the process of formulating new knowledge from observations has driven scientific discoveries. With rapid advancements in machine learning, it is natural to question the possibility of automating knowledge discovery in the scientific field. A benchmark task for automated knowledge discovery is called symbolic regression. The task aims to predict a mathematical equation that best describes the observational data. The advancements in symbolic regression have significant potential to aid research in understanding unexplored systems' dynamics and governing properties. However, the combinatorial nature of the problem makes it an expensive and challenging problem to solve efficiently. Several types of symbolic regression algorithms exist, from genetic programming and sparse regression to deep generative models. However, no survey collates these prominent algorithms. Therefore, this paper aims to summarize key research works in symbolic regression and perform a comparative study to understand the strength and limitations of each method. Finally, we highlight the challenges in the current methods and future research directions in the application of machine learning in knowledge discovery.
引用
收藏
页码:131 / 136
页数:6
相关论文
共 20 条
[1]  
Biggio L, 2021, PR MACH LEARN RES, V139
[2]   Automated reverse engineering of nonlinear dynamical systems [J].
Bongard, Josh ;
Lipson, Hod .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (24) :9943-9948
[3]   Discovering governing equations from data by sparse identification of nonlinear dynamical systems [J].
Brunton, Steven L. ;
Proctor, Joshua L. ;
Kutz, J. Nathan .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (15) :3932-3937
[4]  
Chen Z, 2021, NAT COMMUN, V12, DOI [10.1038/s41467-021-26434-1, 10.1038/s41467-021-27250-3]
[5]  
Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1
[6]   SINDy-PI: a robust algorithm for parallel implicit sparse identification of nonlinear dynamics [J].
Kaheman, Kadierdan ;
Kutz, J. Nathan ;
Brunton, Steven L. .
PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2020, 476 (2242)
[7]  
Kamienny PA, 2022, ADV NEUR IN
[8]  
Kim JT, 2020, PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P5261
[9]   Integration of Neural Network-Based Symbolic Regression in Deep Learning for Scientific Discovery [J].
Kim, Samuel ;
Lu, Peter Y. ;
Mukherjee, Srijon ;
Gilbert, Michael ;
Jing, Li ;
Ceperic, Vladimir ;
Soljacic, Marin .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (09) :4166-4177
[10]  
Koza J., 2005, Search Methodologies, P127, DOI [DOI 10.1007/0-387-28356-05, DOI 10.1007/0-387-28356-0_5]