A Multi-Dimensional Source Selection Based on Topic Modelling

被引:3
作者
Lebib, Fatma Zohra [1 ,2 ]
Mellah, Hakima [1 ]
Meziane, Abdelkrim [1 ]
机构
[1] Res Ctr Sci & Tech Informat, Informat Syst & Multimedia Syst Div, Algiers 16028, Algeria
[2] Univ Sci & Technol Houari Boumediene, Dept Comp Sci, Algiers 16111, Algeria
关键词
multisource environment; social tagging; source selection; genetic algorithm; LDA; QUALITY;
D O I
10.6688/JISE.202205_38(3).0008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Access to information in multisource environments is facing many problems. One of them is the source selection problem. As more and more sources become available on the internet, how to select the relevant sources that meet the user needs is a big challenge. In this paper, we propose a multi-dimensional source selection approach based on topic modelling, which integrates both the social dimension and the intelligent dimension in order to optimize the source selection according to different user interests. Social tagging data is analyzed to discover relevant topics of user interests and latent relationships between users and sources based on topic modelling. By intelligently exploring a large search space of possible solutions, an (optimal) selection of sources is found using an intelligent method (a genetic algorithm). The proposed approach is evaluated on real data sources. The experimental results demonstrate that the proposed approach outperforms state-of-the-art source selection algorithms.
引用
收藏
页码:619 / 644
页数:26
相关论文
共 74 条
[1]   User driven multi-criteria source selection [J].
Abel, Edward ;
Keane, John ;
Paton, Norman W. ;
Fernandes, Alvaro A. A. ;
Koehler, Martin ;
Konstantinou, Nikolaos ;
Rios, Julio Cesar Cortes ;
Azuan, Nurzety A. ;
Embury, Suzanne M. .
INFORMATION SCIENCES, 2018, 430 :179-199
[2]  
Aliannejadi M, 2018, ACM T INFORM SYST, V36, P1, DOI [10.1145/3231933, 10.1145/ 3231933, DOI 10.1145/3231933]
[3]  
Aly R, 2013, SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, P673
[4]  
[Anonymous], 2011, P 20 INT WORLD WIDE
[5]  
[Anonymous], 2016, P 19 INT C EXTENDING
[6]  
Arguello Jaime., 2009, CIKM 09, P1277
[7]   A multi-collection latent topic model for federated search [J].
Baillie, Mark ;
Carman, Mark ;
Crestani, Fabio .
INFORMATION RETRIEVAL, 2011, 14 (04) :390-412
[8]   Assessing Relevance and Trust of the Deep Web Sources and Results Based on Inter-Source Agreement [J].
Balakrishnan, Raju ;
Kambhampati, Subbarao ;
Jha, Manishkumar .
ACM TRANSACTIONS ON THE WEB, 2013, 7 (02)
[9]   Probabilistic Topic Models [J].
Blei, David M. .
COMMUNICATIONS OF THE ACM, 2012, 55 (04) :77-84
[10]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022