Estimation of missing values is an essential step in data pre-processing to increase the data quality for further data mining approaches. The significance of estimation of missing values in industrial data sets is that different operational situations cannot be describe properly while data sets includes missing values. In this paper, Expectation Conditional Maximization is used to find an approximated model over the data based on Gaussian distribution. Then, in the Expectation step, Sweep operation is used to obtain the regression model of missing values on observable values and estimate the missing values based on observable data. In order to evaluate the results a process data set for a real industrial production system is considered. The missing values are simulated by randomly removing the data from variables. Finally, the accuracy of the proposed method in estimation of missing values is discussed as well as the effect of imputation of missing values on further data analysis.
机构:
Bournemouth Univ, Computat Intelligence Res Grp, Smart Technol Res Ctr, Poole BH12 5BB, Dorset, EnglandBournemouth Univ, Computat Intelligence Res Grp, Smart Technol Res Ctr, Poole BH12 5BB, Dorset, England
Kadlec, Petr
;
Gabrys, Bogdan
论文数: 0引用数: 0
h-index: 0
机构:
Bournemouth Univ, Computat Intelligence Res Grp, Smart Technol Res Ctr, Poole BH12 5BB, Dorset, EnglandBournemouth Univ, Computat Intelligence Res Grp, Smart Technol Res Ctr, Poole BH12 5BB, Dorset, England
Gabrys, Bogdan
;
Strandt, Sibylle
论文数: 0引用数: 0
h-index: 0
机构:
Evon Degussa AG, D-45128 Essen, GermanyBournemouth Univ, Computat Intelligence Res Grp, Smart Technol Res Ctr, Poole BH12 5BB, Dorset, England
机构:
Univ Michigan, Dept Biostat, Ann Arbor, MI 48105 USAUniv Michigan, Dept Biostat, Ann Arbor, MI 48105 USA
Little, Roderick J.
;
论文数: 引用数:
h-index:
机构:
Rubin, Donald B.
;
Zangeneh, Sahar Z.
论文数: 0引用数: 0
h-index: 0
机构:
Fred Hutchinson Canc Res Ctr, Vaccine & Infect Dis Div, 1124 Columbia St, Seattle, WA 98104 USAUniv Michigan, Dept Biostat, Ann Arbor, MI 48105 USA
机构:
Bournemouth Univ, Computat Intelligence Res Grp, Smart Technol Res Ctr, Poole BH12 5BB, Dorset, EnglandBournemouth Univ, Computat Intelligence Res Grp, Smart Technol Res Ctr, Poole BH12 5BB, Dorset, England
Kadlec, Petr
;
Gabrys, Bogdan
论文数: 0引用数: 0
h-index: 0
机构:
Bournemouth Univ, Computat Intelligence Res Grp, Smart Technol Res Ctr, Poole BH12 5BB, Dorset, EnglandBournemouth Univ, Computat Intelligence Res Grp, Smart Technol Res Ctr, Poole BH12 5BB, Dorset, England
Gabrys, Bogdan
;
Strandt, Sibylle
论文数: 0引用数: 0
h-index: 0
机构:
Evon Degussa AG, D-45128 Essen, GermanyBournemouth Univ, Computat Intelligence Res Grp, Smart Technol Res Ctr, Poole BH12 5BB, Dorset, England
机构:
Univ Michigan, Dept Biostat, Ann Arbor, MI 48105 USAUniv Michigan, Dept Biostat, Ann Arbor, MI 48105 USA
Little, Roderick J.
;
论文数: 引用数:
h-index:
机构:
Rubin, Donald B.
;
Zangeneh, Sahar Z.
论文数: 0引用数: 0
h-index: 0
机构:
Fred Hutchinson Canc Res Ctr, Vaccine & Infect Dis Div, 1124 Columbia St, Seattle, WA 98104 USAUniv Michigan, Dept Biostat, Ann Arbor, MI 48105 USA