Imputation Method Based on Collaborative Filtering and Clustering for the Missing Data of the Squeeze Casting Process Parameters

被引:3
|
作者
Deng, Jianxin [1 ,2 ]
Ye, Zhixing [1 ]
Shan, Lubao [1 ]
You, Dongdong [3 ]
Liu, Guangming [2 ]
机构
[1] Guangxi Univ, Guangxi Key Lab Mfg Syst & Adv Mfg Technol, Nanning 530003, Peoples R China
[2] Guangxi Univ, Sch Mech Engn, Nanning 530003, Peoples R China
[3] South China Univ Technol, Natl Engn Res Ctr Near Net Shape Forming Metall M, Guangzhou 510640, Peoples R China
基金
中国国家自然科学基金;
关键词
Squeeze casting; Data-driven materials manufacturing; Missing data; Imputation method; Clustering collaborative filtering; Process data; MULTIPLE IMPUTATION; REGRESSION-MODELS; OPTIMIZATION; VALIDATION; SYSTEM;
D O I
10.1007/s40192-021-00248-x
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The development of a highly efficient methodology for establishing squeeze casting process parameters from past data is essential. However, designing squeeze casting process parameters based on past data is difficult when there are many missing values. Conventional missing data approaches are fraught with additional computational challenges when applied to high-dimensional multivariable missing data, especially material process data with correlation. As the relationship between material composition and process parameters has similar characteristics with that between users and information of interest, this paper proposes a method for missing data imputation based on a clustering-based collaborative filtering (ClubCF) algorithm to address this challenge. Data samples with and without missing values were divided into two groups. K-means clustering based on a canopy algorithm was applied to the data samples without missing values to obtain k subclass data, whose values were then selected to fill data samples with missing values via a collaborative filtering theory based on Pearson similarity user filling. The missing squeeze casting process parameters data of aluminum alloys were used to evaluate the method, and more comparative experiments were carried out to understand their performance and features. Two different indicators, including the mean absolute error and the standard deviation, were utilized to quantify the imputation performance, which was compared with those of three conventional methods (mean interpolation, regression interpolation, and the expectation maximization algorithm). The results indicate that the proposed approach is effective and outperforms conventional methods in processing high-dimensional correlated data.
引用
收藏
页码:95 / 108
页数:14
相关论文
共 50 条
  • [1] Imputation Method Based on Collaborative Filtering and Clustering for the Missing Data of the Squeeze Casting Process Parameters
    Jianxin Deng
    Zhixing Ye
    Lubao Shan
    Dongdong You
    Guangming Liu
    Integrating Materials and Manufacturing Innovation, 2022, 11 : 95 - 108
  • [2] Cooperative Clustering Missing Data Imputation
    Wan, Daoming
    Razavi-Far, Roozbeh
    Saif, Mehrdad
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 1039 - 1045
  • [3] Review of Design of Process Parameters for Squeeze Casting
    Deng, Jianxin
    Xie, Bin
    You, Dongdong
    Huang, Haibin
    CHINESE JOURNAL OF MECHANICAL ENGINEERING, 2023, 36 (01)
  • [4] A novel approach of shot peening process parameters prediction with missing surface integrity data based on imputation method
    Li, Yang
    Wei, Peitang
    Zhao, Xinhao
    Zhu, Rupeng
    Wu, Jizhan
    Liu, Huaiju
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2023, 127 (1-2) : 81 - 92
  • [5] Multiple Imputation based Clustering Validation (MIV) for Big Longitudinal Trial Data with Missing Values in eHealth
    Zhaoyang Zhang
    Hua Fang
    Honggang Wang
    Journal of Medical Systems, 2016, 40
  • [6] Multiple Imputation based Clustering Validation (MIV) for Big Longitudinal Trial Data with Missing Values in eHealth
    Zhang, Zhaoyang
    Fang, Hua
    Wang, Honggang
    JOURNAL OF MEDICAL SYSTEMS, 2016, 40 (06)
  • [7] Reliability evaluation method for squeeze casting process parameter data
    Jianxin Deng
    Zhixing Ye
    Rui Tang
    Dongdong You
    Bin Xie
    The International Journal of Advanced Manufacturing Technology, 2021, 117 : 1303 - 1325
  • [8] Reliability evaluation method for squeeze casting process parameter data
    Deng, Jianxin
    Ye, Zhixing
    Tang, Rui
    You, Dongdong
    Xie, Bin
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2021, 117 (3-4) : 1303 - 1325
  • [9] Partial distance evidential clustering for missing data with multiple imputation
    Tian, Hong-Peng
    Zhang, Zhen
    KNOWLEDGE-BASED SYSTEMS, 2025, 310
  • [10] Review of Design of Process Parameters for Squeeze Casting
    Jianxin Deng
    Bin Xie
    Dongdong You
    Haibin Huang
    Chinese Journal of Mechanical Engineering, 36