A Metric and Visualization of Completeness in Multi-Dimensional Data Sets of Sensor and Actuator Data Applied to a Condition Monitoring Use Case

被引:1
作者
Weiss, Iris [1 ]
Vogel-Heuser, Birgit [1 ]
机构
[1] Tech Univ Munich, Inst Automat & Informat Syst, D-85748 Garching, Germany
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 11期
关键词
data quality assessment; completeness; data quality metric; sensor and actuator data; automated production systems; condition monitoring; control valves; industrie; 4; 0; DATA QUALITY; FRAMEWORK; ERROR;
D O I
10.3390/app11115022
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Featured Application The proposed data quality metric and visualization is applicable to sets of independent, numeric variables, which are provided as input variables for regression models. The metric measures a task-dependent aspect of data quality requiring an assessment by experts of the data mining task. In this paper, condition monitoring of control valves is uses as running example. The so-called 'Industrie 4.0' provides high potential for data-driven methods in automated production systems. However, sensor and actuator data gathered during normal operation of the system is often limited to a narrow range of single, specific operating points. This limitation also restricts the significance of condition-based maintenance models, which are trained to the narrow data. In order to reveal the structure of such multi-dimensional data sets and detect deficiencies, this paper derives a data quality metric and visualization. The metric observes the feature space and evaluates the completeness of data. In the best case, the observations utilize the whole feature space, meaning all different combinations of the variables are present in the data. Low metric values indicate missing combinations, reducing the representativeness of the data. In this way, appropriate countermeasures can be taken if relevant data is missing. For evaluation, a data set of an industrial test bed for condition monitoring of control valves is used. It is shown that the state-of-the-art metrics and visualizations cannot detect deficiencies of completeness in multi-dimensional data sets. In contrast, the proposed heat map enables the expert to locate limitations in multi-dimensional data sets.
引用
收藏
页数:30
相关论文
共 59 条
[1]  
Ahlborn K., 2019, TECHNOLOGIESZENARIO
[2]  
[Anonymous], 2015, ISO 90002015
[3]  
[Anonymous], 2012, Prevention and control of noncommunicable diseases: guidelines for primary health care in low-resource settings, DOI DOI 10.22514/SV.2023.071
[4]  
[Anonymous], 2009, Journal of Data and Information Quality, DOI DOI 10.1145/1577840.1577845
[5]  
Ayodeji Abiodun, 2019, 2019 IEEE 5th International Conference on Computer and Communications (ICCC), P948, DOI 10.1109/ICCC47050.2019.9064354
[6]  
Ballou DP, 2003, IEEE T KNOWL DATA EN, V15, P240, DOI 10.1109/TKDE.2003.1161595
[7]   MODELING DATA AND PROCESS QUALITY IN MULTI-INPUT, MULTI-OUTPUT INFORMATION-SYSTEMS [J].
BALLOU, DP ;
PAZER, HL .
MANAGEMENT SCIENCE, 1985, 31 (02) :150-162
[8]   Towards Modelling and Reasoning about Uncertain Data of Sensor Measurements for Decision Support in Smart Spaces [J].
Bamgboye, Oluwaseun ;
Liu, Xiaodong ;
Cruickshank, Peter .
2018 IEEE 42ND ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC 2018), VOL 2, 2018, :744-749
[9]  
Batini C, 2016, DATA CENTRIC SYST AP, P1, DOI 10.1007/978-3-319-24106-7
[10]  
Bicevskis J, 2019, 2019 SIXTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), P303, DOI [10.1109/snams.2019.8931867, 10.1109/SNAMS.2019.8931867]