Embedding User Behavioral Aspect in TF-IDF like Representation

被引:3
|
作者
Pradhan, Ligaj [1 ]
Zhang, Chengcui [1 ]
Bethard, Steven [2 ]
Chen, Xin [3 ]
机构
[1] Univ Alabama Birmingham, Dept Comp Sci, Birmingham, AL 35294 USA
[2] Univ Arizona, Sch Informat, Tucson, AZ USA
[3] Governors State Univ, Div Sci Math & Tech, Chicago, IL USA
来源
IEEE 1ST CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2018) | 2018年
关键词
TF-IDF; topic modeling; user-concerns; user behavior; rating prediction;
D O I
10.1109/MIPR.2018.00061
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Term Frequency - Inverse Document Frequency (TF-IDF) computes weight for each word in a document which increases proportionally to the number of times the word appears in a specific document but is counterbalanced by the number of times it occurs in the collection of documents. TF-IDF is the state-of-the-art for computing relevancy scores between documents. However, it is based on statistical learning alone and doesn't directly capture the conceptual contents of the text or the behavioral aspects of the writer. Hence, in this work we show how relatively low dimensional user behavioral vectors extracted from the same text, from which TF-IDF vectors are extracted, can be used to enrich the performance of TF-IDF. We extract User-Concerns embedded in user reviews and append them to TF-IDF vectors to train a deep rating prediction model. Our experiments show that adding such conceptual knowledge to TF-IDF vectors can significantly enhance the performance of TF-IDF vectors by only adding very little complexity.
引用
收藏
页码:262 / 267
页数:6
相关论文
共 50 条
  • [1] Mining microblog user interests based on TextRank with TF-IDF factor
    Tu Shouzhong
    Huang Minlie
    The Journal of China Universities of Posts and Telecommunications, 2016, (05) : 40 - 46
  • [2] Collaborative Filtering Recommendation Algorithm Based on TF-IDF and User Characteristics
    Ni, Jianjun
    Cai, Yu
    Tang, Guangyi
    Xie, Yingjuan
    APPLIED SCIENCES-BASEL, 2021, 11 (20):
  • [3] Mining microblog user interests based on TextRank with TF-IDF factor
    Tu Shouzhong
    Huang Minlie
    The Journal of China Universities of Posts and Telecommunications, 2016, 23 (05) : 40 - 46
  • [4] Topological Data Analysis In Text Classification Based On Word Embedding And TF-IDF
    Wen, Xiaoyang
    2020 3RD INTERNATIONAL CONFERENCE ON COMPUTER INFORMATION SCIENCE AND APPLICATION TECHNOLOGY (CISAT) 2020, 2020, 1634
  • [5] Document Clustering: TF-IDF approach
    Bafna, Prafulla
    Pramod, Dhanya
    Vaidya, Anagha
    2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), 2016, : 61 - 66
  • [6] Deriving TF-IDF as a Fisher kernel
    Elkan, Charles
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2005, 3772 : 295 - 300
  • [7] Sentiment Enhanced Hybrid TF-IDF for Microblogs
    Simsek, Atakan
    Karagoz, Pinar
    2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, : 311 - 317
  • [8] Using TF-IDF to hide sensitive itemsets
    Tzung-Pei Hong
    Chun-Wei Lin
    Kuo-Tung Yang
    Shyue-Liang Wang
    Applied Intelligence, 2013, 38 : 502 - 510
  • [9] Using TF-IDF to hide sensitive itemsets
    Hong, Tzung-Pei
    Lin, Chun-Wei
    Yang, Kuo-Tung
    Wang, Shyue-Liang
    APPLIED INTELLIGENCE, 2013, 38 (04) : 502 - 510
  • [10] A Code Classification Method Based on TF-IDF
    Wang, Ke
    Jiang, Jian-Hong
    Ma, Rui-Yun
    2018 INTERNATIONAL CONFERENCE ON E-COMMERCE AND CONTEMPORARY ECONOMIC DEVELOPMENT (ECED 2018), 2018, : 13 - 17