Neural Demographic Prediction in Social Media with Deep Multi-view Multi-task Learning

Cited by: 0
Authors
Lai, Yantong [1, 2]
Su, Yijun [3 ]
Xue, Cong [2 ]
Zha, Daren [2 ]
Affiliations
[1] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[3] JD Com, Beijing, Peoples R China
Source
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT II | 2021 / Vol. 12682
Keywords
Demographic prediction; Context; Sentiment and topic views; Multi-task learning
DOI
10.1007/978-3-030-73197-7_18
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Demographic information about social media users is essential for personalized online services. However, such information is difficult to collect in most realistic scenarios. Fortunately, the reviews that users post provide rich clues for inferring their demographics, since users who differ in attributes such as gender and age usually differ in the content and style of what they write. In this paper, we propose a neural approach to demographic prediction based on user reviews. The core of our approach is a deep multi-view multi-task learning model. The model first learns context representations from reviews using a context encoder that takes both semantics and syntax into account. In parallel, it learns sentiment and topic representations from selected sentiment and topic words, separately, using a word encoder that consists of a convolutional neural network capturing the local context of reviews at the word level. It then learns a unified user representation from the context, sentiment, and topic representations and applies multi-task learning to infer a user's gender and age simultaneously. Experimental results on three real-world datasets validate the effectiveness of our approach. To facilitate future research, we release the code and datasets at https://github.com/icmpnorequest/DASFAA2021_DMVMT.
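The final step described in the abstract, one shared user representation feeding two prediction tasks, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation (their code is at the repository linked above). Everything specific here is an assumption made for illustration: the three views are random stand-in vectors rather than encoder outputs, the views are fused by plain concatenation, age is treated as three classes, and the joint loss is an unweighted sum of the two cross-entropy terms. All function names are hypothetical.

```python
import math
import random


def linear(x, weights, bias):
    """Dense layer: one output per row of `weights`."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, label):
    """Negative log-likelihood of the true class."""
    return -math.log(probs[label])


def init_head(rng, n_classes, n_features):
    """Small random weights and zero biases for one task head."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_features)]
               for _ in range(n_classes)]
    return weights, [0.0] * n_classes


def fuse_views(context, sentiment, topic):
    """Unified user representation. Concatenation is an assumption;
    the abstract does not say how the views are combined."""
    return context + sentiment + topic


def multi_task_loss(user_repr, gender_head, age_head, gender_label, age_label):
    """Both heads read the same shared representation; the joint loss
    is an unweighted sum of the two task losses (an assumption)."""
    gender_probs = softmax(linear(user_repr, *gender_head))
    age_probs = softmax(linear(user_repr, *age_head))
    loss = (cross_entropy(gender_probs, gender_label)
            + cross_entropy(age_probs, age_label))
    return loss, gender_probs, age_probs


rng = random.Random(0)

# Stand-ins for the outputs of the context, sentiment, and topic encoders.
context = [rng.random() for _ in range(4)]
sentiment = [rng.random() for _ in range(4)]
topic = [rng.random() for _ in range(4)]

user_repr = fuse_views(context, sentiment, topic)
gender_head = init_head(rng, 2, len(user_repr))  # 2 gender classes
age_head = init_head(rng, 3, len(user_repr))     # 3 age groups (assumed)

loss, gender_probs, age_probs = multi_task_loss(
    user_repr, gender_head, age_head, gender_label=1, age_label=2)
```

In a trained model the gradient of this single loss would flow through both heads into the shared representation, which is what lets the gender and age tasks inform each other.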
Pages: 271-279
Page count: 9