Visualizing Random Forest with Self-Organising Map

被引:0
作者
Plonski, Piotr [1 ]
Zaremba, Krzysztof [1 ]
机构
[1] Warsaw Univ Technol, Inst Radioelect, Nowowiejska 15-19, PL-00665 Warsaw, Poland
来源
ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2014, PT II | 2014年 / 8468卷
关键词
Random Forest; Self-Organising Maps; visualization; classification; proximity matrix; CLASSIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Random Forest (RF) is a powerful ensemble method for classification and regression tasks. It consists of decision trees set. Although, a single tree is well interpretable for human, the ensemble of trees is a black-box model. The popular technique to look inside the RF model is to visualize a RF proximity matrix obtained on data samples with Multidimensional Scaling (MDS) method. Herein, we present a novel method based on Self-Organising Maps (SOM) for revealing intrinsic relationships in data that lay inside the RF used for classification tasks. We propose an algorithm to learn the SOM with the proximity matrix obtained from the RF. The visualization of RF proximity matrix with MDS and SOM is compared. What is more, the SOM learned with the RF proximity matrix has better classification accuracy in comparison to SOM learned with Euclidean distance. Presented approach enables better understanding of the RF and additionally improves accuracy of the SOM.
引用
收藏
页码:63 / 71
页数:9
相关论文
共 19 条
[1]  
[Anonymous], 2007, Uci machine learning repository
[2]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[3]  
Czajkowski M, 2012, LECT NOTES COMPUT SC, V7053, P154, DOI 10.1007/978-3-642-25261-7_12
[4]   Variable selection using random forests [J].
Genuer, Robin ;
Poggi, Jean-Michel ;
Tuleau-Malot, Christine .
PATTERN RECOGNITION LETTERS, 2010, 31 (14) :2225-2236
[5]  
Gray KR, 2011, LECT NOTES COMPUT SC, V7009, P159, DOI 10.1007/978-3-642-24319-6_20
[6]  
Kästner M, 2012, LECT NOTES ARTIF INT, V7267, P256, DOI 10.1007/978-3-642-29347-4_30
[7]   THE SELF-ORGANIZING MAP [J].
KOHONEN, T .
PROCEEDINGS OF THE IEEE, 1990, 78 (09) :1464-1480
[8]   LEARNING ASSOCIATIONS BY SELF-ORGANIZATION - THE LASSO MODEL [J].
MIDENET, S ;
GRUMBACH, A .
NEUROCOMPUTING, 1994, 6 (03) :343-361
[9]   Randomized clustering forests for image classification [J].
Moosmann, Frank ;
Nowak, Eric ;
Jurie, Frederic .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (09) :1632-1646
[10]  
Olszewski D, 2013, LECT NOTES ARTIF INT, V7894, P150, DOI 10.1007/978-3-642-38658-9_14