Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Interactive recommendation with user-specific deep reinforcement learning
Lei Y., Li W. ACM Transactions on Knowledge Discovery from Data13 (6):1-15,2019.Type:Article
Date Reviewed: Mar 2 2020

Recommender systems are widely used, especially by online applications with a view to enhancing user experience. In most conventional systems, past history of a user’s implicit online behavior is used to derive a new recommendation. By enabling an explicit feedback mechanism with the user, would it be possible to design a reinforcement learning model that could lead to better recommendations? This paper tests this hypothesis, and the authors suggest a new solution and validate their findings on real-world datasets.

Inputs from a user’s interactive system are used to model a Markov decision process (MDP), which the paper labels as a T-step interactive recommendation--each step denoting response to a recommendation from the user. The responses are used in a reinforcement learning model, which uses it to learn a global policy by maximizing the cumulative reward it receives. A user-specific deep Q-learning method (christened UDQN) and a bias-incorporated UDQN (christened BUDQN) are formulated, where the existing latent state is used as input and user responses to recommendations are used as output.

Two different MovieLens datasets and a Yahoo! music dataset are used as benchmarking datasets to validate the experimental results. Cross-validation aspects are taken care of by using tenfold cross-validation in randomly selecting different samples for training and testing datasets to minimize the effects of overlapping data in test sets. Both of the proposed UDQN and BUDQN methods are seen to achieve better results as a recommender system.

Reviewer:  CK Raju Review #: CR146914 (2007-0172)
Bookmark and Share
 
Retrieval Models (H.3.3 ... )
 
 
Markov Processes (G.3 ... )
 
 
Learning (I.2.6 )
 
Would you recommend this review?
yes
no
Other reviews under "Retrieval Models": Date
Evaluation of an inference network-based retrieval model
Turtle H., Croft W. (ed) ACM Transactions on Information Systems 9(3): 187-222, 1991. Type: Article
May 1 1993
On a model of distributed information retrieval systems based on thesauri
Mazur Z. Information Processing and Management: an International Journal 20(4): 499-505, 1984. Type: Article
Sep 1 1985
Information processing in linear vector space
Kunz M. Information Processing and Management: an International Journal 20(4): 519-525, 1984. Type: Article
Mar 1 1985
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy