An Engineering Domain Knowledge-Based Framework for Modelling Highly Incomplete Industrial Data

被引:1
作者
Li, Han [1 ]
Liu, Zhao [2 ]
Zhu, Ping [3 ]
机构
[1] Shanghai Jiao Tong Univ, Mech Engn, Sch Mech Engn, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Design, Shanghai, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Mech Engn, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Data Mining; Data-Driven Engineering; Feature Combination; Feature Extraction; Industrial Data; Local Imputation Model; Missing Data Imputation; Neural Network Applications; Occupant Protection; MISSING VALUES; OPTIMIZATION; IMPUTATION;
D O I
10.4018/IJDWM.2021100103
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The missing values in industrial data restrict the applications. Although this incomplete data contains enough information for engineers to support subsequent development, there are still too many missing values for algorithms to establish precise models. This is because the engineering domain knowledge is not considered, and valuable information is not fully captured. Therefore, this article proposes an engineering domain knowledge-based framework for modelling incomplete industrial data. The raw datasets are partitioned and processed at different scales. Firstly, the hierarchical features are combined to decrease the missing ratio. In order to fill the missing values in special data, which is identified for classifying the samples, samples with only part of the features presented are fully utilized instead of being removed to establish local imputation model. Then samples are divided into different groups to transfer the information. A series of industrial data is analyzed for verifying the feasibility of the proposed method.
引用
收藏
页码:48 / 66
页数:19
相关论文
共 23 条
  • [1] Hierarchical k-nearest neighbours classification and binary differential evolution for fault diagnostics of automotive bearings operating under variable conditions
    Baraldi, Piero
    Cannarile, Francesco
    Di Maio, Francesco
    Zio, Enrico
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 56 : 1 - 13
  • [2] Bishop Christopher M, 2006, PATTERN RECOGN, V128, P1, DOI [10.1117/1.2819119, DOI 10.1117/1]
  • [3] An Integrative Machine Learning Method to Improve Fault Detection and Productivity Performance in a Cyber-Physical System
    Chiu, Ming-Chuan
    Tsai, Chien-De
    Li, Tung-Lung
    [J]. JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2020, 20 (02)
  • [4] A novel hybrid model for short-term wind power forecasting
    Du, Pei
    Wang, Jianzhou
    Yang, Wendong
    Niu, Tong
    [J]. APPLIED SOFT COMPUTING, 2019, 80 : 93 - 106
  • [5] On design optimization for structural crashworthiness and its state of the art
    Fang, Jianguang
    Sun, Guangyong
    Qiu, Na
    Kim, Nam H.
    Li, Qing
    [J]. STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2017, 55 (03) : 1091 - 1119
  • [6] Tehran driving cycle development using the k-means clustering method
    Fotouhi, A.
    Montazeri-Gh, M.
    [J]. SCIENTIA IRANICA, 2013, 20 (02) : 286 - 293
  • [7] Imputations of missing values using a tracking-removed autoencoder trained with incomplete data
    Lai, Xiaochen
    Wu, Xia
    Zhang, Liyong
    Lu, Wei
    Zhong, Chongquan
    [J]. NEUROCOMPUTING, 2019, 366 (54-65) : 54 - 65
  • [8] Diversity enhanced particle swarm optimization algorithm and its application in vehicle lightweight design
    Liu, Zhao
    Li, Han
    Zhu, Ping
    [J]. JOURNAL OF MECHANICAL SCIENCE AND TECHNOLOGY, 2019, 33 (02) : 695 - 709
  • [9] Adaptive imputation of missing values for incomplete pattern classification
    Liu, Zhun-ga
    Pan, Quan
    Dezert, Jean
    Martin, Arnaud
    [J]. PATTERN RECOGNITION, 2016, 52 : 85 - 95
  • [10] A survey of data mining and knowledge discovery process models and methodologies
    Mariscal, Gonzalo
    Marban, Oscar
    Fernandez, Covadonga
    [J]. KNOWLEDGE ENGINEERING REVIEW, 2010, 25 (02) : 137 - 166