Unsupervised Entropy-Based Selection of Data Sets for Improved Model Fitting

被引:0
作者
Ferreira, Pedro M. [1 ]
机构
[1] Univ Lisbon, Fac Ciencias, LaSIGE, Lisbon, Portugal
来源
2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2016年
关键词
NEURAL-NETWORKS; INFORMATION; REGRESSION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A method based on the information theory concept of entropy is presented for selecting subsets of data for off-line model identification. By using entropy-based data selection instead of random equiprobable sampling before training models, significant improvements are achieved in parameter convergence, accuracy and generalisation ability. Furthermore, model evaluation metrics exhibit less variance, therefore allowing faster convergence when multiple modelling trials have to be executed. These features are experimentally demonstrated by the results of an extensive number of neural network predictive modelling experiments, where the single difference in the identification of pairs of models was the data set used to tune model parameters. Unlike most active learning and instance selection procedures, the method is not iterative, does not rely on an existing model, and does not require a specific modelling technique. Instead, it selects data points in one unsupervised step relying solely on Shannon's information measure.
引用
收藏
页码:3330 / 3337
页数:8
相关论文
共 30 条
[1]  
[Anonymous], 1996, J ARTIFICIAL INTELLI
[2]   A review of instance selection methods [J].
Arturo Olvera-Lopez, J. ;
Ariel Carrasco-Ochoa, J. ;
Francisco Martinez-Trinidad, J. ;
Kittler, Josef .
ARTIFICIAL INTELLIGENCE REVIEW, 2010, 34 (02) :133-143
[3]  
Baker J. E., 1987, Genetic Algorithms and their Applications: Proceedings of the Second International Conference on Genetic Algorithms, P14
[4]   Support vector regression based on optimal training subset and adaptive particle swarm optimization algorithm [J].
Che, JinXing .
APPLIED SOFT COMPUTING, 2013, 13 (08) :3473-3481
[5]  
Cover T. M., 2012, ELEMENTS INFORM THEO
[6]   Kernel ridge regression with active learning for wind speed prediction [J].
Douak, Fouzi ;
Melgani, Farid ;
Benoudjit, Nabil .
APPLIED ENERGY, 2013, 103 :328-340
[7]  
Ferreira P. M., 2000, IEEE S 2000 AD SYST
[8]  
Ferreira PM, 2011, STUD COMPUT INTELL, V372, P21
[9]   Neural network models in greenhouse air temperature prediction [J].
Ferreira, PM ;
Faria, EA ;
Ruano, AE .
NEUROCOMPUTING, 2002, 43 :51-75
[10]  
Haykin S., 2009, NEURAL NETWORKS LEAR