Bi-Labeled LDA: Inferring Interest Tags for Non-famous Users in Social Network

被引:24
作者
He, Jun [1 ]
Liu, Hongyan [2 ]
Zheng, Yiqing [1 ]
Tang, Shu [1 ]
He, Wei [1 ]
Du, Xiaoyong [1 ]
机构
[1] Renmin Univ China, Key Lab Data Engn & Knowledge Engn, MOE, Beijing 100872, Peoples R China
[2] Tsinghua Univ, Dept Management Sci & Engn, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Topic model; LDA; Labeled LDA; Social network; Social tagging; Random walk;
D O I
10.1007/s41019-019-00113-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
User tags in social network are valuable information for many applications such as Web search, recommender systems and online advertising. Thus, extracting high quality tags to capture user interest has attracted many researchers' study in recent years. Most previous studies inferred users' interest based on text posted in social network. In some cases, ordinary users usually only publish a small number of text posts and text information is not related to their interest very much. Compared with famous user, it is more challenging to find non-famous (ordinary) user's interest. In this paper, we propose a probabilistic topic model, Bi-Labeled LDA, to automatically find interest tags for non-famous users in social network such as Twitter. Instead of extracting tags from text posts, tags of non-famous users are inferred from interest topics of famous users. With the proposed model, the formulation of social relationship between non-famous users and famous user is simulated and interest tags of famous users are exploited to supervise the training of the model and to make use of latent relation among famous users. Furthermore, the influence of popularity of famous user and popular tags are considered, and tags of non-famous users are ranked based on random walk model. Experiments were conducted on Twitter real datasets. Comparison with state-of-the-art methods shows that our method is more superior in terms of both ranking and quality of the tagging results.
引用
收藏
页码:27 / 47
页数:21
相关论文
共 23 条
  • [1] [Anonymous], 2014, P 8 INT AAAI C WEBL
  • [2] [Anonymous], 2014, AAAI ICWSM
  • [3] Inferring User Interests in the Twitter Social Network
    Bhattacharya, Parantapa
    Zafar, Muhammad Bilal
    Ganguly, Niloy
    Ghosh, Saptarshi
    Gummadi, Krishna P.
    [J]. PROCEEDINGS OF THE 8TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'14), 2014, : 357 - 360
  • [4] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [5] Cha Y, 2013, SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, P223
  • [6] Extracting interest tags from twitter user biographies
    Ding, Ying
    Jiang, Jing
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8870 : 268 - 279
  • [7] Ghosh S, 2012, SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, P575, DOI 10.1145/2348283.2348361
  • [8] Kwak H., WWW'10, DOI DOI 10.1145/1772690.1772751
  • [9] Lappas T, 2011, PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), P195
  • [10] Lim Kwan Hui, 2013, P 9 INT S OP COLL AC, P22