Topic Modeling: Perspectives From a Literature Review

被引:32
作者
Grisales, A. Andres M. [1 ]
Robledo, Sebastian [1 ]
Zuluaga, Martha [2 ]
机构
[1] Univ Catol Luis Amigo, Fac Adm Econ & Accounting Sci, Medellin 050004, Colombia
[2] Univ Nacl Abierta & Distancia UNAD, Dosquebradas 661007, Colombia
关键词
Natural language processing; Bibliometrics; Databases; Codes; Bibliographies; Data models; Systematics; Machine learning; Literature review; machine learning; natural language processing; scientometrics; topic modeling; SHORT TEXT; WORDS; FRAMEWORK; NETWORK; SCIENCE;
D O I
10.1109/ACCESS.2022.3232939
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Topic modeling is a Natural Language Processing technique that has gained popularity over the last ten years, with applications in multiple fields of knowledge. However, there is insufficient empirical evidence to show how this field of study has developed over the years, as well as the main models that have been applied in different contexts. The objective of this paper is to analyze the evolution of the topic modeling technique, the main areas in which it has been applied, and the models that are recommended for specific types of data. The methodology applied is based on bibliometric analysis. First, we searched the Web of Science and the Scopus databases. We then used scientometric techniques and a Tree of Science methodology, which allowed us to analyze the search results from the perspectives of classics, structure, and trends. The results show that the USA and China are among the most productive countries in this field and the applications have been mainly in the identification of sub-topics in short texts, such as social networks and blogs. The main conclusion of this work is that topic modeling is a versatile technique that can complement systematic literature reviews and that has been well-received in different academic and research contexts. The results of this study will help researchers and academics to recognize the importance of these techniques for reviewing large volumes of unstructured information, such as research articles, and in general, for systematic literature reviews.
引用
收藏
页码:4066 / 4078
页数:13
相关论文
共 111 条
[21]   Fifty years of British Journal of Educational Technology: A topic modeling based bibliometric perspective [J].
Chen, Xieling ;
Zou, Di ;
Xie, Haoran .
BRITISH JOURNAL OF EDUCATIONAL TECHNOLOGY, 2020, 51 (03) :692-708
[22]   Does cross-field influence regional and field-specific distributions of highly cited researchers? [J].
Chen, Xinyi .
SCIENTOMETRICS, 2023, 128 (01) :825-840
[23]   BTM: Topic Modeling over Short Texts [J].
Cheng, Xueqi ;
Yan, Xiaohui ;
Lan, Yanyan ;
Guo, Jiafeng .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (12) :2928-2941
[24]   Topic representation: Finding more representative words in topic models [J].
Chi, Jinjin ;
Ouyang, Jihong ;
Li, Changchun ;
Dong, Xueyang ;
Li, Ximing ;
Wang, Xinhua .
PATTERN RECOGNITION LETTERS, 2019, 123 :53-60
[25]   UTOPIAN: User-Driven Topic Modeling Based on Interactive Nonnegative Matrix Factorization [J].
Choo, Jaegul ;
Lee, Changhyun ;
Reddy, Chandan K. ;
Park, Haesun .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2013, 19 (12) :1992-2001
[26]   Temporal expert finding through generalized time topic modeling [J].
Daud, Ali ;
Li, Juanzi ;
Zhou, Lizhu ;
Muhammad, Faqir .
KNOWLEDGE-BASED SYSTEMS, 2010, 23 (06) :615-625
[27]  
DEERWESTER S, 1990, J AM SOC INFORM SCI, V41, P391, DOI 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO
[28]  
2-9
[29]   Emerging Topics in Brexit Debate on Twitter Around the Deadlines A Probabilistic Topic Modelling Approach [J].
del Gobbo, Emiliano ;
Fontanella, Sara ;
Sarra, Annalina ;
Fontanella, Lara .
SOCIAL INDICATORS RESEARCH, 2021, 156 (2-3) :669-688
[30]   Social Economy and Solidarity Economy: a bibliometric analysis and literature review [J].
Duque, Pedro ;
Meza, Oscar Eduardo ;
Giraldo, David ;
Barreto, Karol .
REVESCO-REVISTA DE ESTUDIOS COOPERATIVOS, 2021, (138) :1-25