An estimating method for missing data by using multiple self-organizing maps

被引:0
作者
机构
[1] Kyushu Univ., Dept. of Mechanical Engineering, Nishi-ku, Fukuoka, 819-0395
来源
| 1600年 / Japan Society of Mechanical Engineers卷 / 79期
关键词
Data processing; Learning; Missing value; Neural network; Self-organizing map;
D O I
10.1299/kikaic.79.3465
中图分类号
学科分类号
摘要
In this paper, we propose a new method that uses multiple SOMs for estimating values lacking in data analysis. Recently, development of information technology grows the importance of data analysis. In actual data, however, some values will be sometimes missing, and then dealing with such insufficient data has become one of the important subjects in data analysis. Estimating and completing the empty values are required to applying various data analysis techniques. Such an estimation method is also applicable to data prediction problems. In the former methods that use SOM, many empty values would have caused the lack of data for learning process. Our system can achieve effective learning by using multiple SOMs even for data that includes many missing values. Moreover, the system is still available for nonlinear data because of using SOMs. We performed some numerical simulation using the proposed and other methods. By the simulation results, we showed the advantages of our method over some traditional techniques including a technique that uses single SOM. © 2013 The Japan Society of Mechanical Engineers.
引用
收藏
页码:3465 / 3473
页数:8
相关论文
共 4 条
[1]  
Edgar A., Caroline R., The treatment of missing values and its effect on classifier accuracy, Classification, Clustering, and Data Mining Applications, 0, 7, pp. 639-647, (2004)
[2]  
Pablo T., Donna S., Jill M., Qing Z., Sutisak K., Ethan D., Eric S.L., Todd R.G., Interpreting patterns of gene expression with self-organizing maps: Methods and application to hematopoietic differentiation, Proceedings of the National Academy of Sciences of the United States of America, 96, 6, pp. 2907-2912, (1999)
[3]  
Iris Data Set
[4]  
Wine Data Set