A Lightweight-Window-Portion-Based Multiple Imputation for Extreme Missing Gaps in IoT Systems

被引:7
作者
Adhikari, Deepak [1 ]
Jiang, Wei [1 ]
Zhan, Jinyu [1 ]
Assefa, Maregu [1 ]
Khorshidi, Hadi A. [2 ]
Aickelin, Uwe [2 ]
Rawat, Danda B. [3 ]
机构
[1] Univ Elect Sci & Technol China, Dept Informat & Software Engn, Chengdu 610056, Peoples R China
[2] Univ Melbourne, Sch Comp & Informat Syst, Melbourne, Vic 3010, Australia
[3] Howard Univ, Dept Elect Engn & Comp Sci, Washington, DC 20059 USA
基金
中国国家自然科学基金;
关键词
Imputation in extreme missing gaps; imputation in Internet of Things (IoT); lightweight-window-portion-based multiple imputation (LWPMI); missing data; VALUES;
D O I
10.1109/JIOT.2023.3315137
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Intelligent techniques, including artificial intelligence and deep learning, normally perform on complete data without missing data. Multiple imputation is indispensable for addressing missing data resulting in unbiased estimates and dealing with uncertainty by providing more valid results. Most state-of-the-art techniques focus on high-missing rates (around 50%-60%) and short missing gaps, while imputation for extreme missing gaps and missing rates is an important challenge for multivariate time-series data generated through the Internet of Things (IoT). Hence, we propose an lightweight-window-portion-based multiple imputation (LWPMI) based on multivariate variables, correlation, data fusion, regression, and multiple imputations. We conduct extensive experiments by generating extreme missing gaps and high-missing rates ranging from 10% to 90% on data generated by sensors. We also investigate different sets of feature to examine how LWPMI works when features have high, weak, or a mixture of high and weak correlation. All the obtained results prove LWPMI outperforms baseline techniques in preserving pattern, structure, and trend in both 90% extreme missing gap and missing rates.
引用
收藏
页码:3676 / 3689
页数:14
相关论文
共 41 条
[1]   A Comprehensive Survey on Imputation of Missing Data in Internet of Things [J].
Adhikari, Deepak ;
Jiang, Wei ;
Zhan, Jinyu ;
He, Zhiyuan ;
Rawat, Danda B. ;
Aickelin, Uwe ;
Khorshidi, Hadi A. .
ACM COMPUTING SURVEYS, 2023, 55 (07)
[2]  
Adhikari W., 2021, INT C INTELLTECHNOL, P1
[3]  
Adhikari W. Jiang, 2021, Microprocess. Microsyst.
[4]  
Bansal P, 2021, Arxiv, DOI [arXiv:2103.01600, DOI 10.48550/ARXIV.2103.01600]
[5]  
Cao W, 2018, ADV NEUR IN, V31
[6]   A neural network approach for traffic prediction and routing with missing data imputation for intelligent transportation system [J].
Chan, Robin Kuok Cheong ;
Lim, Joanne Mun-Yee ;
Parthiban, Rajendran .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 171
[7]   Multiple imputation for analysis of incomplete data in distributed health data networks [J].
Chang, Changgee ;
Deng, Yi ;
Jiang, Xiaoqian ;
Long, Qi .
NATURE COMMUNICATIONS, 2020, 11 (01)
[8]  
Dua D., 2017, UCI MACHINE LEARNING
[9]  
Ghosh D. B., 2020, Internet ofThings and Secure Smart Environments: Successes and Pitfalls
[10]   Missing Data Imputation With Bayesian Maximum Entropy for Internet of Things Applications [J].
Gonzalez-Vidal, Aurora ;
Rathore, Punit ;
Rao, Aravinda S. ;
Mendoza-Bernal, Jose ;
Palaniswami, Marimuthu ;
Skarmeta-Gomez, Antonio F. .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (21) :16108-16120