Application of Text Mining Techniques on Scholarly Research Articles: Methods and Tools

被引:24
作者
Thakur, Khusbu [1 ]
Kumar, Vinit [1 ]
机构
[1] Babasaheb Bhimrao Ambedkar Univ, Dept Lib & Informat Sci, Lucknow, Uttar Pradesh, India
关键词
knowledge discovery; Latent Dirichlet Allocation; research trends analysis; text mining; topic modelling; KNOWLEDGE DISCOVERY; RESEARCH TOPICS; TRENDS; LIBRARY; SCIENCE; TECHNOLOGY; MANAGEMENT; DOMAIN;
D O I
10.1080/13614533.2021.1918190
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
A vast amount of published scholarly literature is generated every day. Today, it is one of the biggest challenges for organisations to extract knowledge embedded in published scholarly literature for business and research applications. Application of text mining is gaining popularity among researchers and applications are growing exponentially in different research areas. This study investigates the variety of text mining tools, techniques, sample sizes, domains and sections of the documents preferred by the text mining researchers through a systematic and structured literature review of conceptual and empirical studies. The significant findings depict that LDA and R package is the most extensively used tool and technique among the authors, most of the researchers prefer the sample size of 1000 articles for analysis, literature belonging to the domain of ICT, and related disciplines are frequently analysed in the text mining studies and abstracts constitute the corpus of the majority of text mining studies.
引用
收藏
页码:279 / 302
页数:24
相关论文
共 115 条
  • [1] E-government research insights: Text mining analysis
    Abu-Shanab, Emad
    Harb, Yousra
    [J]. ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2019, 38
  • [2] Aich S, 2017, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTING AND INFORMATICS (ICICI 2017), P481, DOI 10.1109/ICICI.2017.8365398
  • [3] Allahyari M, 2017, Arxiv, DOI arXiv:1707.02919
  • [4] Research trends on Big Data in Marketing: A text mining and topic modeling based literature analysis
    Amado, Alexandra
    Cortez, Paulo
    Rita, Paulo
    Moro, Sergio
    [J]. EUROPEAN RESEARCH ON MANAGEMENT AND BUSINESS ECONOMICS, 2018, 24 (01) : 1 - 7
  • [5] Institutional repository research 2005-2015: a trend analysis using bibliometrics and text mining
    Ammarukleart S.
    Kim J.
    [J]. Digital Library Perspectives, 2017, 33 (03) : 264 - 278
  • [6] Smart literature review: a practical topic modelling approach to exploratory literature review
    Asmussen, Claus Boye
    Moller, Charles
    [J]. JOURNAL OF BIG DATA, 2019, 6 (01)
  • [7] Behera S., 2015, International Journal of Engineering Research and Development, V12, pe2278
  • [8] Bhanot N., 2019, arXiv Preprint arXiv, V1911, P08271
  • [9] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [10] Support Vector Machines for classification and regression
    Brereton, Richard G.
    Lloyd, Gavin R.
    [J]. ANALYST, 2010, 135 (02) : 230 - 267