A Practical Data Quality Assessment Method for Raw Data in Vessel Operations

被引:3
作者
Chen, Gang [1 ]
Cai, Jie [2 ]
Rytter, Niels Gorm Maly [2 ]
Lutzen, Marie [3 ]
机构
[1] World Maritime Univ, Malmo, Sweden
[2] Univ Southern Denmark, Dept Technol & Innovat, Campusvej 55, DK-5230 Odense M, Denmark
[3] Univ Southern Denmark, Dept Mech & Elect Engn, Campusvej 55, DK-5230 Odense M, Denmark
关键词
Data quality; Vessel operations; Shipping; Validation rules; Noon reports; BIG DATA; INFORMATION; SHIP; SYSTEMS; IMPACT;
D O I
10.1007/s11804-023-00326-w
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
With the current revolution in Shipping 4.0, a tremendous amount of data is accumulated during vessel operations. Data quality (DQ) is becoming more and more important for the further digitalization and effective decision-making in shipping industry. In this study, a practical DQ assessment method for raw data in vessel operations is proposed. In this method, specific data categories and data dimensions are developed based on engineering practice and existing literature. Concrete validation rules are then formed, which can be used to properly divide raw datasets. Afterwards, a scoring method is used for the assessment of the data quality. Three levels, namely good, warning and alarm, are adopted to reflect the final data quality. The root causes of bad data quality could be revealed once the internal dependency among rules has been built, which will facilitate the further improvement of DQ in practice. A case study based on the datasets from a Danish shipping company is conducted, where the DQ variation is monitored, assessed and compared. The results indicate that the proposed method is effective to help shipping industry improve the quality of raw data in practice. This innovation research can facilitate shipping industry to set a solid foundation at the early stage of their digitalization journeys.
引用
收藏
页码:370 / 380
页数:11
相关论文
共 39 条
[1]  
Ahn K., 2008, DATA QUALITY WHITE P
[2]   Assessing information quality of e-learning systems: a web mining approach [J].
Alkhattabi, Mona ;
Neagu, Daniel ;
Cullen, Andrea .
COMPUTERS IN HUMAN BEHAVIOR, 2011, 27 (02) :862-873
[3]  
Bates MJ, 2019, UNDERSTANDING INFORM
[4]  
Blake RH., 2011, ACM J Data Inf Qual, V2, P8, DOI DOI 10.1145/1891879.1891881
[5]  
Burn J, 2005, INFORM SCI
[6]   Data-driven Methods to Predict the Burst Strength of Corroded Line Pipelines Subjected to Internal Pressure [J].
Cai, Jie ;
Jiang, Xiaoli ;
Yang, Yazhou ;
Lodewijks, Gabriel ;
Wang, Minchang .
JOURNAL OF MARINE SCIENCE AND APPLICATION, 2022, 21 (02) :115-132
[7]   A practical AIS-based route library for voyage planning at the pre-fixture stage [J].
Cai, Jie ;
Chen, Gang ;
Lutzen, Marie ;
Rytter, Niels Gorm Maly .
OCEAN ENGINEERING, 2021, 236
[8]   A proposal for a set of attributes relevant for Web portal data quality [J].
Caro, Angelica ;
Calero, Coral ;
Caballero, Ismael ;
Piattini, Mario .
SOFTWARE QUALITY JOURNAL, 2008, 16 (04) :513-542
[9]   The impact of data quality information on decision making: An exploratory analysis [J].
Chengalur-Smith, IN ;
Ballou, DP ;
Pazer, HL .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1999, 11 (06) :853-864
[10]   Improving data quality using a cross layer protocol in wireless sensor networks [J].
Coen-Porisini, Alberto ;
Sicari, Sabrina .
COMPUTER NETWORKS, 2012, 56 (17) :3655-3665