Treatment of Bad Big Data in Research Data Management (RDM) Systems

被引:2
作者
Azeroual, Otmane [1 ]
机构
[1] German Ctr Higher Educ Res & Sci Studies DZHW, Schutzenstr 6a, D-10117 Berlin, Germany
关键词
research data management systems (RDMS); research data life cycle; big data; poor quality of information; data integrity and quality; institutional decision making; DATA QUALITY;
D O I
10.3390/bdcc4040029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Databases such as research data management systems (RDMS) provide the research data in which information is to be searched for. They provide techniques with which even large amounts of data can be evaluated efficiently. This includes the management of research data and the optimization of access to this data, especially if it cannot be fully loaded into the main memory. They also provide methods for grouping and sorting and optimize requests that are made to them so that they can be processed efficiently even when accessing large amounts of data. Research data offer one thing above all: the opportunity to generate valuable knowledge. The quality of research data is of primary importance for this. Only flawless research data can deliver reliable, beneficial results and enable sound decision-making. Correct, complete and up-to-date research data are therefore essential for successful operational processes. Wrong decisions and inefficiencies in day-to-day operations are only the tip of the iceberg, since the problems with poor data quality span various areas and weaken entire university processes. Therefore, this paper addresses the problems of data quality in the context of RDMS and tries to shed light on the solution for ensuring data quality and to show a way to fix the dirty research data that arise during its integration before it has a negative impact on business success.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 20 条
[1]   Metadata-based data quality assessment [J].
Aljumaili, Mustafa ;
Karim, Ramin ;
Tretten, Phillip .
VINE JOURNAL OF INFORMATION AND KNOWLEDGE MANAGEMENT SYSTEMS, 2016, 46 (02) :232-250
[2]   A comparison of research data management platforms: architecture, flexible metadata and interoperability [J].
Amorim, Ricardo Carvalho ;
Castro, Joao Aguiar ;
da Silva, Joao Rocha ;
Ribeiro, Cristina .
UNIVERSAL ACCESS IN THE INFORMATION SOCIETY, 2017, 16 (04) :851-862
[3]  
Azeroual O., 2018, J DIGITAL INFORM MAN, V16, P12
[4]   How to Inspect and Measure Data Quality about Scientific Publications: Use Case of Wikipedia and CRIS Databases [J].
Azeroual, Otmane ;
Lewoniewski, Wlodzimierz .
ALGORITHMS, 2020, 13 (05)
[5]   Data Wrangling in Database Systems: Purging of Dirty Data [J].
Azeroual, Otmane .
DATA, 2020, 5 (02) :1-9
[6]   Analyzing data quality issues in research information systems via data profiling [J].
Azeroual, Otmane ;
Saake, Gunter ;
Schallehn, Eike .
INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2018, 41 :50-56
[7]  
Batini Carlo, 2007, INT C INF QUAL, P333
[8]   Developments in Research Data Management in Academic Libraries: Towards an Understanding of Research Data Service Maturity [J].
Cox, Andrew M. ;
Kennan, Mary Anne ;
Lyon, Liz ;
Pinfield, Stephen .
JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2017, 68 (09) :2182-2200
[9]   A theoretical framework to improve the quality of manually acquired data [J].
Haegemans, Tom ;
Snoeck, Monique ;
Lemahieu, Wilfried .
INFORMATION & MANAGEMENT, 2019, 56 (01) :1-14
[10]   Research Data Management [J].
Heuer, Andreas .
IT-INFORMATION TECHNOLOGY, 2020, 62 (01) :1-5