Automatic Feature Engineering Through Monte Carlo Tree Search

被引:2
作者
Huang, Yiran [1 ]
Zhou, Yexu [1 ]
Hefenbrock, Michael [1 ]
Riedel, Till [1 ]
Fang, Likun [1 ]
Beigl, Michael [1 ]
机构
[1] Karlsruhe Inst Technol, Telecooperat Off, Karlsruhe, Germany
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT III | 2023年 / 13715卷
关键词
Data mining; Feature engineering; Monte Carlo tree search; Reinforce learning;
D O I
10.1007/978-3-031-26409-2_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The performance of machine learning models depends heavily on the feature space and feature engineering. Although neural networks have made significant progress in learning latent feature spaces from data, compositional feature engineering through nested feature transformations can reduce model complexity and can be particularly desirable for interpretability. To find suitable transformations automatically, state-of-the-art methods model the feature transformation space by graph structures and use heuristics such as epsilon-greedy to search for them. Such search strategies tend to become less efficient over time because they do not consider the sequential information of the candidate sequences and cannot dynamically adjust the heuristic strategy. To address these shortcomings, we propose a reinforcement learning-based automatic feature engineering method, which we call Monte Carlo tree search Automatic Feature Engineering (mCAFE). We employ a surrogate model that can capture the sequential information contained in the transformation sequence and thus can dynamically adjust the exploration strategy. It balances exploration and exploitation by Thompson sampling and uses a Long Short Term Memory (LSTM) based surrogate model to estimate sequences of promising transformations. In our experiments, mCAFE outperformed state-of-the-art automatic feature engineering methods on most common benchmark datasets.
引用
收藏
页码:581 / 598
页数:18
相关论文
共 25 条
  • [1] Auer P., 2003, Journal of Machine Learning Research, V3, P397, DOI 10.1162/153244303321897663
  • [2] Batista GEAPA., 2009, ARGENTINE S ARTIFICI, P1, DOI DOI 10.1145/1553374.1553495
  • [3] Coquelin P.A., 2007, ARXIV, DOI [10.48550/arXiv.cs/0703062, DOI 10.48550/ARXIV.CS/0703062]
  • [4] Coulom R, 2007, LECT NOTES COMPUT SC, V4630, P72
  • [5] Strengthening learning algorithms by feature discovery
    Dor, Ofer
    Reich, Yoram
    [J]. INFORMATION SCIENCES, 2012, 189 : 176 - 190
  • [6] Fan Wei, 2010, Proc SIAM Int Conf Data Min, V2010, P629
  • [7] Fernández-Delgado M, 2014, J MACH LEARN RES, V15, P3133
  • [8] Gaudel R., 2010, INTERNAT C ML, P359, DOI DOI 10.5555/3104322.3104369
  • [9] Monte-Carlo tree search and rapid action value estimation in computer Go
    Gelly, Sylvain
    Silver, David
    [J]. ARTIFICIAL INTELLIGENCE, 2011, 175 (11) : 1856 - 1875
  • [10] Gijsbers P, 2019, Arxiv, DOI arXiv:1907.00909