Imputation Method Based on Collaborative Filtering and Clustering for the Missing Data of the Squeeze Casting Process Parameters

被引:0
作者
Jianxin Deng
Zhixing Ye
Lubao Shan
Dongdong You
Guangming Liu
机构
[1] Guangxi University,Guangxi Key Lab of Manufacturing System and Advanced Manufacturing Technology
[2] Guangxi University,School of Mechanical Engineering
[3] South China University of Technology,National Engineering Research Center of Near
来源
Integrating Materials and Manufacturing Innovation | 2022年 / 11卷
关键词
Squeeze casting; Data-driven materials manufacturing; Missing data; Imputation method; Clustering collaborative filtering; Process data;
D O I
暂无
中图分类号
学科分类号
摘要
The development of a highly efficient methodology for establishing squeeze casting process parameters from past data is essential. However, designing squeeze casting process parameters based on past data is difficult when there are many missing values. Conventional missing data approaches are fraught with additional computational challenges when applied to high-dimensional multivariable missing data, especially material process data with correlation. As the relationship between material composition and process parameters has similar characteristics with that between users and information of interest, this paper proposes a method for missing data imputation based on a clustering-based collaborative filtering (ClubCF) algorithm to address this challenge. Data samples with and without missing values were divided into two groups. K-means clustering based on a canopy algorithm was applied to the data samples without missing values to obtain k subclass data, whose values were then selected to fill data samples with missing values via a collaborative filtering theory based on Pearson similarity user filling. The missing squeeze casting process parameters data of aluminum alloys were used to evaluate the method, and more comparative experiments were carried out to understand their performance and features. Two different indicators, including the mean absolute error and the standard deviation, were utilized to quantify the imputation performance, which was compared with those of three conventional methods (mean interpolation, regression interpolation, and the expectation maximization algorithm). The results indicate that the proposed approach is effective and outperforms conventional methods in processing high-dimensional correlated data.
引用
收藏
页码:95 / 108
页数:13
相关论文
共 201 条
  • [1] Jain A(2013)Commentary: the materials project: a materials genome approach to accelerating materials innovation APL Mater 1 011002-158
  • [2] Ong SP(2015)Laminate squeeze casting of carbon fiber reinforced aluminum matrix composites Mater Des 67 154-1084
  • [3] Hautier G(2019)New frontiers for the materials genome initiative npj Comput Mater 41 1076-164
  • [4] Chen W(2008)Modeling and analysis of the effects of processing parameters on the performance characteristics in the high pressure die casting process of Al–SI alloys Int J Adv Manuf Technol 14 157-10
  • [5] Richards WD(2014)Optimization of squeeze cast process parameters using Taguchi and Grey relational analysis Procedia Technol 2014 1-2077
  • [6] Dacek S(2014)Parametric optimization of squeeze cast AC2A-Ni coated SiCp composite using Taguchi technique Adv Mater Sci Eng 78 2069-3561
  • [7] Cholia S(2015)Improvement of ductility for squeeze cast 2017 A wrought aluminum alloy using the Taguchi method Int J Adv Manuf Technol 89 3547-773
  • [8] Gunter D(2016)Investigating the effects of as-casted and in situ heat-treated squeeze casting of Al–3.5% Cu alloy Int J Adv Manuf Technol 102 759-54
  • [9] Skinner D(2019)Multi-response parametric optimization of squeeze casting process for fabricating Al 6061-SiC composite Int J Adv Manuf Technol 4 053208-36
  • [10] Ceder G(2016)Perspective: materials informatics and big data: realization of the “fourth paradigm” of science in materials science APL Mater 155 48-102