Latent Dirichlet Allocation (LDA) for improving the topic modeling of the official bulletin of the spanish state (BOE)

被引:4
作者
Bailon-Elvira, J. C. [1 ]
Cobo, M. J. [2 ]
Herrera-Viedma, E. [1 ]
Lopez-Herrera, A. G. [1 ]
机构
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Calle Daniel Saucedo Aranda S-N, E-18071 Granada, Spain
[2] Univ Cadiz, Dept Comp Sci & Engn, Ave Ramon Puyol, Cadiz 11202, Spain
来源
7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT (ITQM 2019): INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT BASED ON ARTIFICIAL INTELLIGENCE | 2019年 / 162卷
关键词
Recommender systems; BOE; LDA; Alerts; RECOMMENDER SYSTEM; HYBRID;
D O I
10.1016/j.procs.2019.11.277
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since Internet was born most people can access fully free to a lot sources of information. Every day a lot of web pages are created and new content is uploaded and shared. Never in the history the humans has been more informed but also uninformed due the huge amount of information that can be access. When we are looking for something in any search engine the results are too many for reading and filtering one by one. Recommended Systems (RS) was created to help us to discriminate and filter these information according to ours preferences. This contribution analyses the RS of the official agency of publications in Spain (BOE), which is known as "Mi BOE'. The way this RS works was analysed, and all the meta-data of the published documents were analysed in order to know the coverage of the system. The results of our analysis show that more than 89% of the documents cannot be recommended, because they are not well described at the documentary level, some of their key meta-data are empty. So, this contribution proposes a method to label documents automatically based on Latent Dirichlet Allocation (LDA). The results are that using this approach the system could recommend (at a theoretical point of view) more than twice of documents that it now does, 11% vs 23% after applied this approach. (C) 2020 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-ne-nd/4.0/) Peer-review under responsibility of the scientific committee of the 7th International Conference on Information Technology and Quantitative Management (ITQM 2019)
引用
收藏
页码:207 / 214
页数:8
相关论文
共 25 条
[1]   SOS: A multimedia recommender System for Online Social networks [J].
Amato, Flora ;
Moscato, Vincenzo ;
Picariello, Antonio ;
Piccialli, Francesco .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 93 :914-923
[2]  
Amer-Yahia S., 2009, Proceedings of the VLDB Endowment, V2, P754, DOI DOI 10.14778/1687627.1687713
[3]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[4]   A hybrid system of pedagogical pattern recommendations based on singular value decomposition and variable data attributes [J].
Cobos, Carlos ;
Rodriguez, Orlando ;
Rivera, Jarvein ;
Betancourt, John ;
Mendoza, Martha ;
Leon, Elizabeth ;
Herrera-Viedma, Enrique .
INFORMATION PROCESSING & MANAGEMENT, 2013, 49 (03) :607-625
[5]  
Crossen A., 2002, IUI 02. 2002 International Conference on Intelligent User Interfaces, P184
[6]  
El Fazazi H, 2018, INT J COMPUT SCI NET, V18, P173
[7]   TPLUFIB-WEB: A fuzzy linguistic Web system to help in the treatment of low back pain problems [J].
Esteban, Bernabe ;
Tejeda-Lorente, Alvaro ;
Porcel, Carlos ;
Arroyo, Manolo ;
Herrera-Viedma, Enrique .
KNOWLEDGE-BASED SYSTEMS, 2014, 67 :429-438
[8]   The Netflix Recommender System: Algorithms, Business Value, and Innovation [J].
Gomez-Uribe, Carlos A. ;
Hunt, Neil .
ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2016, 6 (04)
[9]   Open-source machine learning: R meets Weka [J].
Hornik, Kurt ;
Buchta, Christian ;
Zeileis, Achim .
COMPUTATIONAL STATISTICS, 2009, 24 (02) :225-232
[10]   Development of a recommender system for dental care using machine learning [J].
Hung, Man ;
Xu, Julie ;
Lauren, Evelyn ;
Voss, Maren W. ;
Rosales, Megan N. ;
Su, Weicong ;
Ruiz-Negron, Bianca ;
He, Yao ;
Li, Wei ;
Licari, Frank W. .
SN APPLIED SCIENCES, 2019, 1 (07)