Two-layer random forests model for case reuse in case-based reasoning

被引:26
作者
Zhong, Shisheng [1 ]
Xie, Xiaolong [1 ]
Lin, Lin [1 ]
机构
[1] Harbin Inst Technol, Sch Mechatron Engn, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Two-layer model; Random forests; Case-based reasoning; Case reuse; Ensemble learning; FEATURE-SELECTION; CBR SYSTEM; ENSEMBLE; PREDICTION; KNOWLEDGE;
D O I
10.1016/j.eswa.2015.08.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Case reuse is important for case-based reasoning (CBR) because without it, a CBR system degrades to a case retrieval system, and a retrieved case generally cannot solve a problem directly. To improve the accuracy of case reuse, a two-layer random forests model is proposed and the framework of the corresponding CBR system is presented. First, clustering analysis is used to organize the cases in the case base, and gravitational self-organizing mapping algorithm is adopted to automatically detect the cluster number on the basis of the structure of the data. Then, a two-layer model scheme is proposed to model the mapping in every cluster. In this scheme, the first layer model obtains the pre-estimate of the output feature value of the query case, and the second layer model models the error of the pre-estimate, which is used to decrease the error generated by the first layer model. Random forests algorithm, which is a popular ensemble learning model, is adopted as a model in the two layers to improve accuracy and stability. Several benchmark datasets are used to validate the proposed two-layer model scheme, and the results demonstrate that it can improve the case reuse accuracy and stability. The proposed two-layer random forests model is applied to hydraulic generator design, and the results confirm that the proposed model is effective for case reuse. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:9412 / 9425
页数:14
相关论文
共 44 条
[1]  
AAMODT A, 1994, AI COMMUN, V7, P39
[2]  
An AJ, 1997, LECT NOTES ARTIF INT, V1266, P499
[3]  
[Anonymous], 2014, Case-based reasoning
[4]   Data mining for case-based reasoning in high-dimensional biological domains [J].
Arshadi, N ;
Jurisica, I .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (08) :1127-1137
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]   A framework for an automotive body assembly process design system [J].
Chen, Guanlong ;
Zhou, Jiangqi ;
Cai, Wayne ;
Lai, Xinmin ;
Lin, Zhongqin ;
Menassa, Roland .
COMPUTER-AIDED DESIGN, 2006, 38 (05) :531-539
[8]   Dynamic adaptive ensemble case-based reasoning: application to stock market prediction [J].
Chun, SH ;
Park, YJ .
EXPERT SYSTEMS WITH APPLICATIONS, 2005, 28 (03) :435-443
[9]  
Cooper D, 2007, P 7 AIM ATIO C BELF
[10]   Using decision trees for determining attribute weights in a case-based model of early cost prediction [J].
Dogan, Sevgi Zeynep ;
Arditi, David ;
Guenaydin, H. Murat .
JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2008, 134 (02) :146-152