Text and Data Quality Mining in CRIS

被引:6
作者
Azeroual, Otmane [1 ]
机构
[1] German Ctr Higher Educ Res & Sci Studies DZHW, Schutzenstr 6a, D-10117 Berlin, Germany
关键词
current research information systems (CRIS); research information; text and data mining (TDM); data quality; knowledge exploration; knowledge transfer; decision making; user acceptance; RESEARCH INFORMATION-SYSTEMS;
D O I
10.3390/info10120374
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To provide scientific institutions with comprehensive and well-maintained documentation of their research information in a current research information system (CRIS), they have the best prerequisites for the implementation of text and data mining (TDM) methods. Using TDM helps to better identify and eliminate errors, improve the process, develop the business, and make informed decisions. In addition, TDM increases understanding of the data and its context. This not only improves the quality of the data itself, but also the institution's handling of the data and consequently the analyses. This present paper deploys TDM in CRIS to analyze, quantify, and correct the unstructured data and its quality issues. Bad data leads to increased costs or wrong decisions. Ensuring high data quality is an essential requirement when creating a CRIS project. User acceptance in a CRIS depends, among other things, on data quality. Not only is the objective data quality the decisive criterion, but also the subjective quality that the individual user assigns to the data.
引用
收藏
页数:15
相关论文
共 36 条
[1]  
Aggarwal Charu C, 2012, Mining text data, P77, DOI [10.1007/978-1-4614-3223-4, DOI 10.1007/978-1-4614-3223-4]
[2]  
[Anonymous], 2019, P 36 INT C MACHINE L
[3]  
[Anonymous], 2002, P AAAI 2002 SPRING S
[4]  
[Anonymous], 2013, DATA DRIVEN PROFITIN
[5]  
Asahara M, 2003, HLT-NAACL 2003: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, P8
[6]  
Azeroual O., 2018, 3 INT C OP ACC RAB, P29
[7]  
Azeroual O., 2018, J. Digit. Inf. Manag, V16, P12
[8]  
Azeroual O., 2019, Computer and Information Science, V12, P84, DOI [10.5539/cis.v12n4p84, DOI 10.5539/CIS.V12N4P84]
[9]  
Azeroual O., 2017, INT J COMPUTER SCI I, V15, P82
[10]   Quality of Research Information in RIS Databases: A Multidimensional Approach [J].
Azeroual, Otmane ;
Saake, Gunter ;
Abuosba, Mohammad ;
Schopfel, Joachim .
BUSINESS INFORMATION SYSTEMS, PT I, 2019, 353 :337-349