Development of Usability Enhancement Model for Unstructured Big Data Using SLR

被引:9
作者
Adnan, Kiran [1 ]
Akbar, Rehan [1 ]
Wang, Khor Siak [1 ]
机构
[1] Univ Tunku Abdul Rahman, Fac Informat & Commun Technol, Dept Informat Syst, Kampar 31900, Malaysia
来源
IEEE ACCESS | 2021年 / 9卷
关键词
Usability; Big Data; Data models; Data mining; Bibliographies; Protocols; Systematics; Big data; data transformation; data usability; text data; unstructured data; usability enhancement; LITERATURE-REVIEWS; QUALITY; METHODOLOGY; EXTRACTION; CHALLENGES;
D O I
10.1109/ACCESS.2021.3089100
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unstructured text contains valuable information for a range of enterprise applications and informed decision making. Text analytics is used to extract valuable insights from unstructured big data. Among the most significant challenges of text analytics, quality and usability are critical in affecting the outcome of the analytical process. The enhancement in usability is important for the exploitation of unstructured data. Most of the existing literature focuses on the usability of structured data as compared to unstructured data whereas big data usability has been discussed merely in the context of its assessment. The existing approaches do not provide proper guidelines on the usability enhancement of unstructured data. In this study, a rigorous systematic literature review using PRISMA framework has been conducted to develop a model enhancing the usability of unstructured data bridging the research gap. The recent approaches and solutions for text analytics have been investigated thoroughly. The usability issues of unstructured text data and their consequences on data preparation for analytics have been identified. Defining the usability dimensions for unstructured big data, identification of the usability determinants, and developing a relationship between usability dimension and determinants to derive usability rules are the significant contributions of this research and are integrated to formulate the usability enhancement model. The proposed model is the major outcome of the research. It contributes to make unstructured data usable and facilitates the data preparation activities with more valuable data that eventually improve the analytical process.
引用
收藏
页码:87391 / 87409
页数:19
相关论文
共 104 条
[1]   Multi-domain evaluation framework for named entity recognition tools [J].
Abdallah, Zahraa S. ;
Carman, Mark ;
Haffari, Gholamreza .
COMPUTER SPEECH AND LANGUAGE, 2017, 43 :34-55
[2]  
Abdullah N., 2015, INT J ADV SOFT COMPU, V7, P16
[3]  
Abedjan Ziawasch., 2019, Data Science for Healthcare, P3, DOI [DOI 10.1007/978-3-030-05249-2_1, 10.1007/978-3-030-05249-2_1, 10.1007/978-3-030-05249-2_1/COVER]
[4]  
Adnan K., 2019, International Journal of Recent Technology and Engineering (IJRTE), V8, P1398, DOI [https://doi.org/10.35940/ijrte.B1074.0882S819, DOI 10.35940/IJRTE.B1074.0882S819]
[5]   Role and Challenges of Unstructured Big Data in Healthcare [J].
Adnan, Kiran ;
Akbar, Rehan ;
Khor, Siak Wang ;
Ali, Adnan Bin Amanat .
DATA MANAGEMENT, ANALYTICS AND INNOVATION, ICDMAI 2019, VOL 1, 2020, 1042 :301-323
[6]   An analytical study of information extraction from unstructured and multidimensional big data [J].
Adnan, Kiran ;
Akbar, Rehan .
JOURNAL OF BIG DATA, 2019, 6 (01)
[7]   Limitations of information extraction methods and techniques for heterogeneous unstructured big data [J].
Adnan, Kiran ;
Akbar, Rehan .
INTERNATIONAL JOURNAL OF ENGINEERING BUSINESS MANAGEMENT, 2019, 11
[8]  
Akerkar R, 2016, INTELLIGENT TECHNIQU, P211
[9]  
Akerkar R., 2016, INTELLIGENT TECHNIQU, P31
[10]   TAQIH, a tool for tabular data quality assessment and improvement in the context of health data [J].
Alvarez Sanchez, Roberto ;
Beristain Iraola, Andoni ;
Epelde Unanue, Gorka ;
Carlin, Paul .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2019, 181