A reinforcement learning recommender system using bi-clustering and Markov Decision Process

被引:8
|
作者
Iftikhar, Arta [1 ]
Ghazanfar, Mustansar Ali [2 ]
Ayub, Mubbashir [1 ]
Alahmari, Saad Ali [3 ]
Qazi, Nadeem [2 ]
Wall, Julie [2 ]
机构
[1] Univ Engn & Technol, Dept Software Engn, Taxila, Pakistan
[2] Univ East London, Dept Comp Sci & Digital Technol, London, England
[3] AL Imam Mohammad Ibn Saud Islamic Univ, Dept Comp Sci, Riyadh, Saudi Arabia
关键词
Reinforcement learning; Markov Decision Process; Bi-clustering; Q-learning; Policy; ALGORITHM; PERSONALIZATION; ACCURACY; IMPROVE;
D O I
10.1016/j.eswa.2023.121541
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Collaborative filtering (CF) recommender systems are static in nature and does not adapt well with changing user preferences. User preferences may change after interaction with a system or after buying a product. Conventional CF clustering algorithms only identifies the distribution of patterns and hidden correlations globally. However, the impossibility of discovering local patterns by these algorithms, headed to the popularization of bi-clustering algorithms. Bi-clustering algorithms can analyze all dataset dimensions simultaneously and consequently, discover local patterns that deliver a better understanding of the underlying hidden correlations. In this paper, we modelled the recommendation problem as a sequential decision-making problem using Markov Decision Processes (MDP). To perform state representation for MDP, we first converted user-item votings matrix to a binary matrix. Then we performed bi-clustering on this binary matrix to determine a subset of similar rows and columns. A bi-cluster merging algorithm is designed to merge similar and overlapping bi-clusters. These biclusters are then mapped to a squared grid (SG). RL is applied on this SG to determine best policy to give recommendation to users. Start state is determined using Improved Triangle Similarity (ITR similarity measure. Reward function is computed as grid state overlapping in terms of users and items in current and prospective next state. A thorough comparative analysis was conducted, encompassing a diverse array of methodologies, including RL-based, pure Collaborative Filtering (CF), and clustering methods. The results demonstrate that our proposed method outperforms its competitors in terms of precision, recall, and optimal policy learning.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Online Reinforcement Learning in Markov Decision Process Using Linear Programming
    Leon, Vincent
    Etesami, S. Rasoul
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 1973 - 1978
  • [2] Reinforcement Learning to Rank with Markov Decision Process
    Wei, Zeng
    Xu, Jun
    Lan, Yanyan
    Guo, Jiafeng
    Cheng, Xueqi
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 945 - 948
  • [3] A Study of Recommender Systems using Markov Decision Process
    Gupta, Garima
    Katarya, Rahul
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 1279 - 1283
  • [4] Jaccard Coefficient-Based Bi-clustering and Fusion Recommender System for Solving Data Sparsity
    Cheng, Jiangfei
    Zhang, Li
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT II, 2019, 11440 : 369 - 380
  • [5] Joint Manufacturing and Onsite Microgrid System Control Using Markov Decision Process and Neural Network Integrated Reinforcement Learning
    Hu, Wenqing
    Sun, Zeyi
    Zhang, Yunchao
    Li, Yu
    25TH INTERNATIONAL CONFERENCE ON PRODUCTION RESEARCH MANUFACTURING INNOVATION: CYBER PHYSICAL MANUFACTURING, 2019, 39 : 1242 - 1249
  • [6] eNB Selection for Machine Type Communications Using Reinforcement Learning Based Markov Decision Process
    Liu, Yu-Jui
    Cheng, Shin-Ming
    Hsueh, Yu-Lin
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2017, 66 (12) : 11330 - 11338
  • [7] Recommender System using Reinforcement Learning: A Survey
    Rezaei, Mehrdad
    Tabrizi, Nasseh
    DELTA: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON DEEP LEARNING THEORY AND APPLICATIONS, 2022, : 148 - 159
  • [8] A Finite Horizon Markov Decision Process Based Reinforcement Learning Control of a Rapid Thermal Processing system
    Pradeep, D. John
    Noel, Mathew Mithra
    JOURNAL OF PROCESS CONTROL, 2018, 68 : 218 - 225
  • [9] A New Bi-clustering Approach Using Topological Maps
    Chaibi, Amine
    Lebbah, Mustapha
    Azzag, Hanane
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [10] AspectAnalyzer-Distributed System for Bi-clustering Analysis
    Foszner, Pawel
    Polanski, Andrzej
    MAN-MACHINE INTERACTIONS 4, ICMMI 2015, 2016, 391 : 411 - 420