Skip to main content
SearchLoginLogin or Signup

Sample Efficiency in Deep Reinforcement Learning based Recommender Systems with Imitation Learning

Published onMay 27, 2022
Sample Efficiency in Deep Reinforcement Learning based Recommender Systems with Imitation Learning
·

Abstract

Recently, there has been an unprecedented interest in using reinforcement learning (RL) for recommender systems (RSs), due to its unique ability in taking into account the dynamic and long-term user engagement. However, sample ineciency is a major challenge in applying RL to problems with very dynamic environments and huge actions spaces. In this paper, we present Imitation, Reinforcement learning based Recommender System (IR2S) to combine RL with imitation learning to alleviate this problem. More
specically, by utilizing demonstrations (available user ratings), we show that IR2S can optimize its behavior faster and more eciently. The proposed IR2S, built on top of Deep Q Network (DQN), shows superior performance compared to baselines in experiments.


Article ID: 2022L8

Month: May

Year: 2022

Address: Online

Venue: Canadian Conference on Artificial Intelligence

Publisher: Canadian Artificial Intelligence Association

URL: https://caiac.pubpub.org/pub/d0jo6snj

Comments
0
comment
No comments here
Why not start the discussion?