Neural Feature Search: A Neural Architecture for Automated Feature Engineering

被引:36
作者
Chen, Xiangning [1 ]
Lin, Qingwei [2 ]
Luo, Chuan [2 ]
Li, Xudong [3 ]
Zhang, Hongyu [4 ]
Xu, Yong [2 ]
Dang, Yingnong [5 ]
Sui, Kaixin [2 ]
Zhang, Xu [6 ]
Qiao, Bo [2 ]
Zhang, Weiyi [2 ]
Wu, Wei [7 ]
Chintalapati, Murali [5 ]
Zhang, Dongmei [2 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Microsoft Res, Beijing, Peoples R China
[3] Univ Calif Los Angeles, Los Angeles, CA USA
[4] Univ Newcastle, Callaghan, NSW, Australia
[5] Microsoft Azure, New York, NY USA
[6] Nanjing Univ, Nanjing, Peoples R China
[7] Univ Technol Sydney, Sydney, NSW, Australia
来源
2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019) | 2019年
关键词
Feature Engineering; Automated Feature Engineering; Neural Architecture; ALGORITHMS; SELECTION;
D O I
10.1109/ICDM.2019.00017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature engineering is a crucial step for developing effective machine learning models. Traditionally, feature engineering is performed manually, which requires much domain knowledge and is time-consuming. In recent years, many automated feature engineering methods have been proposed. These methods improve the accuracy of a machine learning model by automatically transforming the original features into a set of new features. However, existing methods either lack ability to perform high-order transformations or suffer from the feature space explosion problem. In this paper, we present Neural Feature Search (NFS), a novel neural architecture for automated feature engineering. We utilize a recurrent neural network based controller to transform each raw feature through a series of transformation functions. The controller is trained through reinforcement learning to maximize the expected performance of the machine learning algorithm. Extensive experiments on public datasets illustrate that our neural architecture is effective and outperforms the existing state-of-the-art automated feature engineering methods. Our architecture can efficiently capture potentially valuable high-order transformations and mitigate the feature explosion problem.
引用
收藏
页码:71 / 80
页数:10
相关论文
共 32 条
[1]  
[Anonymous], 2000, WILEY PS TX, DOI 10.1002/0471722146
[2]  
[Anonymous], 2016, KDD16 P 22 ACM, DOI DOI 10.1145/2939672.2939785
[3]   Mixed-Initiative Feature Engineering Using Knowledge Graphs [J].
Atzmueller, Martin ;
Sternberg, Eric .
K-CAP 2017: PROCEEDINGS OF THE KNOWLEDGE CAPTURE CONFERENCE, 2017,
[4]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[5]  
Cui Y, 2011, IEEE INFOCOM SER, P1395, DOI 10.1109/INFCOM.2011.5934925
[6]  
Dong G., 2018, Feature engineering for machine learning and data analytics
[7]   Strengthening learning algorithms by feature discovery [J].
Dor, Ofer ;
Reich, Yoram .
INFORMATION SCIENCES, 2012, 189 :176-190
[8]  
Ho TK, 1998, IEEE T PATTERN ANAL, V20, P832, DOI 10.1109/34.709601
[9]   Programming by Optimization [J].
Hoos, Holger H. .
COMMUNICATIONS OF THE ACM, 2012, 55 (02) :70-80
[10]  
Kanter JM, 2015, PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (IEEE DSAA 2015), P717