Enhancing Recommender Systems by Fusing Diverse Information Sources through Data Transformation and Feature Selection

被引:2
作者
Ho, Thi-Linh [1 ]
Le, Anh-Cuong [1 ]
Vu, Dinh-Hong [1 ]
机构
[1] Ton Duc Thang Univ, Fac Informat & Technol, Nat Language Proc & Knowledge Discovery Lab, Dist 7, Ho Chi Minh City, Vietnam
来源
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS | 2023年 / 17卷 / 05期
关键词
Recommender systems; data transformation; multi-modal fusion; recommendation model; deep neural network recommender systems; MATRIX FACTORIZATION;
D O I
10.3837/tiis.2023.05.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recommender systems aim to recommend items to users by taking into account their probable interests. This study focuses on creating a model that utilizes multiple sources of information about users and items by employing a multimodality approach. The study addresses the task of how to gather information from different sources (modalities) and transform them into a uniform format, resulting in a multi-modal feature description for users and items. This work also aims to transform and represent the features extracted from different modalities so that the information is in a compatible format for integration and contains important, useful information for the prediction model. To achieve this goal, we propose a novel multi-modal recommendation model, which involves extracting latent features of users and items from a utility matrix using matrix factorization techniques. Various transformation techniques are utilized to extract features from other sources of information such as user reviews, item descriptions, and item categories. We also proposed the use of Principal Component Analysis (PCA) and Feature Selection techniques to reduce the data dimension and extract important features as well as remove noisy features to increase the accuracy of the model. We conducted several different experimental models based on different subsets of modalities on the MovieLens and Amazon sub-category datasets. According to the experimental results, the proposed model significantly enhances the accuracy of recommendations when compared to SVD, which is acknowledged as one of the most effective models for recommender systems. Specifically, the proposed model reduces the RMSE by a range of 4.8% to 21.43% and increases the Precision by a range of 2.07% to 26.49% for the Amazon datasets. Similarly, for the MovieLens dataset, the proposed model reduces the RMSE by 45.61% and increases the Precision by 14.06%. Additionally, the experimental results on both datasets demonstrate that combining information from multiple modalities in the proposed model leads to superior outcomes compared to relying on a single type of information.
引用
收藏
页码:1413 / 1432
页数:20
相关论文
共 44 条
  • [1] Review rating prediction framework using deep learning
    Ahmed, Basem H.
    Ghabayen, Ayman S.
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 13 (7) : 3423 - 3432
  • [2] Alolama Y., 2020, RECOMMENDER SYSTEMS
  • [3] Christie D., 2022, COMPREHENSIVE RENEWA, P149, DOI DOI 10.1016/B978-0-12-819727-1.00083-2
  • [4] Multimodal data fusion framework based on autoencoders for top-N recommender systems
    Conceicao, Felipe L. A.
    Padua, Flavio L. C.
    Lacerda, Anisio
    Machado, Adriano C.
    Dalip, Daniel H.
    [J]. APPLIED INTELLIGENCE, 2019, 49 (09) : 3267 - 3282
  • [5] Devlin J, 2019, Arxiv, DOI [arXiv:1810.04805, 10.48550/arXiv.1810.04805]
  • [6] Ebersbach M., 2017, CLEF WORKING NOTES
  • [7] Eklund M., 2018, THESIS KTH ROYAL INT
  • [8] A fusion collaborative filtering method for sparse data in recommender systems
    Feng, Chenjiao
    Liang, Jiye
    Song, Peng
    Wang, Zhiqiang
    [J]. INFORMATION SCIENCES, 2020, 521 : 365 - 379
  • [9] A Hybrid Approach using Collaborative filtering and Content based Filtering for Recommender System
    Geetha, G.
    Safa, M.
    Fancy, C.
    Saranya, D.
    [J]. PROCEEDINGS OF THE 10TH NATIONAL CONFERENCE ON MATHEMATICAL TECHNIQUES AND ITS APPLICATIONS (NCMTA 18), 2018, 1000
  • [10] Gezici B, 2019, 2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), P629, DOI [10.1109/ubmk.2019.8907234, 10.1109/UBMK.2019.8907234]