Imputing Missing Data in One-Shot Devices Using Unsupervised Learning Approach

被引:0
作者
So, Hon Yiu [1 ]
Ling, Man Ho [2 ]
Balakrishnan, Narayanaswamy [3 ]
机构
[1] Oakland Univ, Dept Math & Stat, Rochester, MI 48309 USA
[2] Educ Univ Hong Kong, Dept Math & Informat Technol, Hong Kong, Peoples R China
[3] McMaster Univ, Dept Math & Stat, Hamilton, ON L8S 4K1, Canada
关键词
one-shot devices; missing data; clustering; imputation; inverse probability weighting; unsupervised learning; k-prototype; DBSCAN; MULTIPLE IMPUTATION; CHAINED EQUATIONS; ALGORITHM; INFERENCE;
D O I
10.3390/math12182884
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
One-shot devices are products that can only be used once. Typical one-shot devices include airbags, fire extinguishers, inflatable life vests, ammo, and handheld flares. Most of them are life-saving products and should be highly reliable in an emergency. Quality control of those productions and predicting their reliabilities over time is critically important. To assess the reliability of the products, manufacturers usually test them in controlled conditions rather than user conditions. We may rely on public datasets that reflect their reliability in actual use, but the datasets often come with missing observations. The experimenter may lose information on covariate readings due to human errors. Traditional missing-data-handling methods may not work well in handling one-shot device data as they only contain their survival statuses. In this research, we propose Multiple Imputation with Unsupervised Learning (MIUL) to impute the missing data using Hierarchical Clustering, k-prototype, and density-based spatial clustering of applications with noise (DBSCAN). Our simulation study shows that MIUL algorithms have superior performance. We also illustrate the method using datasets from the Crash Report Sampling System (CRSS) of the National Highway Traffic Safety Administration (NHTSA).
引用
收藏
页数:33
相关论文
共 50 条
  • [41] Time-aware Subgroup Matrix Decomposition: Imputing Missing Data Using Forecasting Events
    Yang, Xi
    Zhang, Yuan
    Chi, Min
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1524 - 1533
  • [42] iVAR: A program for imputing missing data in multivariate time series using vector autoregressive models
    Siwei Liu
    Peter C. M. Molenaar
    Behavior Research Methods, 2014, 46 : 1138 - 1148
  • [43] A matrix completion-based multiview learning method for imputing missing values in buoy monitoring data
    Qin, Mengjiao
    Du, Zhenhong
    Zhang, Feng
    Liu, Renyi
    INFORMATION SCIENCES, 2019, 487 : 18 - 30
  • [44] TUMK-ELM: A Fast Unsupervised Heterogeneous Data Learning Approach
    Xiang, Lingyun
    Zhao, Guohan
    Li, Qian
    Hao, Wei
    Li, Feng
    IEEE ACCESS, 2018, 6 : 35305 - 35315
  • [45] Multiple-Stress Model for One-Shot Device Testing Data Under Exponential Distribution
    Balakrishnan, Narayanaswamy
    Ling, Man Ho
    IEEE TRANSACTIONS ON RELIABILITY, 2012, 61 (03) : 809 - 821
  • [46] Inference for One-Shot Devices with Dependent k-Out-of-M Structured Components under Gamma Frailty
    Ling, Man-Ho
    Balakrishnan, Narayanaswamy
    Yu, Chenxi
    So, Hon Yiu
    MATHEMATICS, 2021, 9 (23)
  • [47] PERFORMANCE OF MACHINE LEARNING FOR IMPUTING MISSING DAILY RAINFALL DATA IN EAST JAVA']JAVA UNDER MULTIPLE SATELLITE DATA MODELS
    Sriwahyuni, Lilis
    Nurdiati, Sri
    Nugrahani, Endar Hasafah
    Najib, Mohamad Khoirun
    GEOGRAPHIA TECHNICA, 2025, 20 (01): : 346 - 368
  • [48] Extreme learning machine for missing data using multiple imputations
    Sovilj, Dusan
    Eirola, Emil
    Miche, Yoan
    Bjork, Kaj-Mikael
    Nian, Rui
    Akusok, Anton
    Lendasse, Amaury
    NEUROCOMPUTING, 2016, 174 : 220 - 231
  • [49] A Bayesian unsupervised learning approach for identifying soil stratification using cone penetration data
    Wang, Hui
    Wang, Xiangrong
    Wellmann, J. Florian
    Liang, Robert Y.
    CANADIAN GEOTECHNICAL JOURNAL, 2019, 56 (08) : 1184 - 1205
  • [50] Robust estimators for one-shot device testing data under gamma lifetime model with an application to a tumor toxicological data
    N. Balakrishnan
    E. Castilla
    N. Martín
    L. Pardo
    Metrika, 2019, 82 : 991 - 1019