Latent Dirichlet Allocation (LDA) for improving the topic modeling of the official bulletin of the spanish state (BOE)

被引:4
作者
Bailon-Elvira, J. C. [1 ]
Cobo, M. J. [2 ]
Herrera-Viedma, E. [1 ]
Lopez-Herrera, A. G. [1 ]
机构
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Calle Daniel Saucedo Aranda S-N, E-18071 Granada, Spain
[2] Univ Cadiz, Dept Comp Sci & Engn, Ave Ramon Puyol, Cadiz 11202, Spain
来源
7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT (ITQM 2019): INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT BASED ON ARTIFICIAL INTELLIGENCE | 2019年 / 162卷
关键词
Recommender systems; BOE; LDA; Alerts; RECOMMENDER SYSTEM; HYBRID;
D O I
10.1016/j.procs.2019.11.277
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since Internet was born most people can access fully free to a lot sources of information. Every day a lot of web pages are created and new content is uploaded and shared. Never in the history the humans has been more informed but also uninformed due the huge amount of information that can be access. When we are looking for something in any search engine the results are too many for reading and filtering one by one. Recommended Systems (RS) was created to help us to discriminate and filter these information according to ours preferences. This contribution analyses the RS of the official agency of publications in Spain (BOE), which is known as "Mi BOE'. The way this RS works was analysed, and all the meta-data of the published documents were analysed in order to know the coverage of the system. The results of our analysis show that more than 89% of the documents cannot be recommended, because they are not well described at the documentary level, some of their key meta-data are empty. So, this contribution proposes a method to label documents automatically based on Latent Dirichlet Allocation (LDA). The results are that using this approach the system could recommend (at a theoretical point of view) more than twice of documents that it now does, 11% vs 23% after applied this approach. (C) 2020 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-ne-nd/4.0/) Peer-review under responsibility of the scientific committee of the 7th International Conference on Information Technology and Quantitative Management (ITQM 2019)
引用
收藏
页码:207 / 214
页数:8
相关论文
共 25 条
[11]   Adaptive Recommender System for an Intelligent Classroom Teaching Model [J].
Lin, Hanhui ;
Xie, Shaoqun ;
Xiao, Zhiguo ;
Deng, Xinxin ;
Yue, Hongwei ;
Cai, Ken .
INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2019, 14 (05) :51-63
[12]   Amazon.com recommendation - Item-to-item collaborative filtering [J].
Linden, G ;
Smith, B ;
York, J .
IEEE INTERNET COMPUTING, 2003, 7 (01) :76-80
[13]   A hybrid of sequential rules and collaborative filtering for product recommendation [J].
Liu, Duen-Ren ;
Lai, Chin-Hui ;
Lee, Wang-Jung .
INFORMATION SCIENCES, 2009, 179 (20) :3505-3519
[14]   A hybrid recommendation approach for a tourism system [J].
Lucas, Joel P. ;
Luz, Nuno ;
Moreno, Maria N. ;
Anacleto, Ricardo ;
Figueiredo, Ana Almeida ;
Martins, Constantino .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (09) :3532-3550
[15]  
Masthoff J, 2004, HUM-COMPUT INT-SPRIN, P93
[16]  
McCarthy Kevin, 2006, FLOR ART INT RES SOC, P86
[17]   Recommending Biomedical Resources A Fuzzy Linguistic Approach Based on Semantic Web [J].
Morales-del-Castillo, J. M. ;
Peis, Eduardo ;
Ruiz, Antonio A. ;
Herrera-Viedma, E. .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2010, 25 (12) :1143-1157
[18]  
O'Connor M, 2001, ECSCW 2001: PROCEEDINGS OF THE SEVENTH EUROPEAN CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK, P199
[19]   A recommender system for research resources based on fuzzy linguistic modeling [J].
Porcel, C. ;
Lopez-Herrera, A. G. ;
Herrera-Viedma, E. .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) :5173-5183
[20]  
Pournelle G. H., 1953, Journal of Mammalogy, V34, P133