Capacity-aware Sequential Recommendations

Conference paper (2018)

Authors

F. de Nijs (Algorithmics)

Georgios Theocharous (Adobe Systems)

Nikos Vlassis (Netflix)

M.M. de Weerdt (Algorithmics)

M.T.J. Spaan (Algorithmics)

Research Group

Algorithmics (TU Delft)

To reference this document use:

http://resolver.tudelft.nl/uuid:5485afd2-9bf6-4066-87e6-0732802ddde3

Published Date

10-07-2018

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Department

Software Technology

Abstract

Personalized recommendations are increasingly important to engage users and guide them through large systems, for example when recommending points of interest to tourists visiting a popular city. To maximize long-term user experience, the system should consider issuing recommendations sequentially, since by observing the user's response to a recommendation, the system can update its estimate of the user's (latent) interests. However, as traditional recommender systems target individuals, their effect on a collective of users can unintentionally overload capacity. Therefore, recommender systems should not only consider the users' interests, but also the effect of recommendations on the available capacity.
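The belief update mentioned above, in which the system revises its estimate of a user's latent interests after observing a response, can be illustrated with a small Bayes-rule sketch. This is not the paper's algorithm: the user types (`museum_lover`, `foodie`), the acceptance probabilities, and the function name `update_belief` are all hypothetical and chosen only for illustration.

```python
from typing import Dict


def update_belief(belief: Dict[str, float],
                  accept_prob: Dict[str, float],
                  accepted: bool) -> Dict[str, float]:
    """Bayes update of a belief over latent user types after one response.

    belief:      prior probability of each (hypothetical) user type
    accept_prob: assumed probability that each type accepts the recommendation
    accepted:    whether the user followed the recommendation
    """
    unnormalized = {}
    for user_type, prior in belief.items():
        p_accept = accept_prob[user_type]
        # Likelihood of the observed response under this user type.
        likelihood = p_accept if accepted else 1.0 - p_accept
        unnormalized[user_type] = prior * likelihood

    total = sum(unnormalized.values())
    if total == 0.0:
        # The observation is impossible under every type; keep the prior.
        return dict(belief)
    return {t: p / total for t, p in unnormalized.items()}


# Hypothetical example: a tourist who is equally likely to be either type
# accepts a recommendation to visit a museum.
belief = {"museum_lover": 0.5, "foodie": 0.5}
accept_museum = {"museum_lover": 0.8, "foodie": 0.2}
posterior = update_belief(belief, accept_museum, accepted=True)
```

Under these assumed numbers, accepting the museum recommendation shifts the belief toward `museum_lover`, which is the information a sequential recommender can use when choosing its next recommendation.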

The structure in such a constrained, multi-agent, partially observable decision problem can be exploited by a novel belief-space sampling algorithm which bounds the size of the state space by a limit on regret. By exploiting the stationary structure of the problem, our algorithm is significantly more scalable than existing approximate solvers. Moreover, by explicitly considering the information value of actions, this algorithm significantly improves the quality of recommendations over an extension of posterior sampling reinforcement learning to the constrained multi-agent case. We show how to decouple constraint satisfaction from sequential recommendation policies, resulting in algorithms which issue recommendations to thousands of agents while respecting constraints.

Files

De_Nijs_et_al._2018_Capacity_a... (.pdf)

(.pdf | 0.805 Mb)

P416.pdf

(.pdf | 1.38 Mb)

Embargo expired on 01-07-2019