An Optimal Greedy Heuristic with Minimal Learning Regret for the Markov Chain Choice Model

An Optimal Greedy Heuristic with Minimal Learning Regret for the Markov Chain Choice Model PDF Author: Guillermo Gallego
Publisher:
ISBN:
Category :
Languages : en
Pages : 0

Get Book Here

Book Description
We study the assortment optimization problem and show that local optima are global optima for all discrete choice models that can be represented by the Markov Chain model. We develop a forward greedy heuristic that finds an optimal assortment for the Markov Chain model and runs in $O(n^2)$ iterations. The heuristic has performance bound $1/n$ for any regular choice model which is best possible among polynomial heuristics. We also propose a backward greedy heuristic that is optimal for Markov chain model and requires fewer iterations. Numerical results show that our heuristics performs significantly better than the estimate then optimize method and the revenue-ordered assortment heuristic when the ground truth is a latent class multinomial logit choice model. Based on the greedy heuristics, we develop a learning algorithm that enjoys asymptotic optimal regret for the Markov chain choice model and avoids parameter estimations, focusing instead on binary comparisons of revenues.

Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems

Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems PDF Author: Sébastien Bubeck
Publisher: Now Pub
ISBN: 9781601986269
Category : Computers
Languages : en
Pages : 138

Get Book Here

Book Description
In this monograph, the focus is on two extreme cases in which the analysis of regret is particularly simple and elegant: independent and identically distributed payoffs and adversarial payoffs. Besides the basic setting of finitely many actions, it analyzes some of the most important variants and extensions, such as the contextual bandit model.

Algorithms for Reinforcement Learning

Algorithms for Reinforcement Learning PDF Author: Csaba Grossi
Publisher: Springer Nature
ISBN: 3031015517
Category : Computers
Languages : en
Pages : 89

Get Book Here

Book Description
Reinforcement learning is a learning paradigm concerned with learning to control a system so as to maximize a numerical performance measure that expresses a long-term objective. What distinguishes reinforcement learning from supervised learning is that only partial feedback is given to the learner about the learner's predictions. Further, the predictions may have long term effects through influencing the future state of the controlled system. Thus, time plays a special role. The goal in reinforcement learning is to develop efficient learning algorithms, as well as to understand the algorithms' merits and limitations. Reinforcement learning is of great interest because of the large number of practical applications that it can be used to address, ranging from problems in artificial intelligence to operations research or control engineering. In this book, we focus on those algorithms of reinforcement learning that build on the powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning problems, describe the core ideas, note a large number of state of the art algorithms, followed by the discussion of their theoretical properties and limitations. Table of Contents: Markov Decision Processes / Value Prediction Problems / Control / For Further Exploration

Bandit Algorithms

Bandit Algorithms PDF Author: Tor Lattimore
Publisher: Cambridge University Press
ISBN: 1108486827
Category : Business & Economics
Languages : en
Pages : 537

Get Book Here

Book Description
A comprehensive and rigorous introduction for graduate students and researchers, with applications in sequential decision-making problems.

Prediction, Learning, and Games

Prediction, Learning, and Games PDF Author: Nicolo Cesa-Bianchi
Publisher: Cambridge University Press
ISBN: 113945482X
Category : Computers
Languages : en
Pages : 4

Get Book Here

Book Description
This important text and reference for researchers and students in machine learning, game theory, statistics and information theory offers a comprehensive treatment of the problem of predicting individual sequences. Unlike standard statistical approaches to forecasting, prediction of individual sequences does not impose any probabilistic assumption on the data-generating mechanism. Yet, prediction algorithms can be constructed that work well for all possible sequences, in the sense that their performance is always nearly as good as the best forecasting strategy in a given reference class. The central theme is the model of prediction using expert advice, a general framework within which many related problems can be cast and discussed. Repeated game playing, adaptive data compression, sequential investment in the stock market, sequential pattern analysis, and several other problems are viewed as instances of the experts' framework and analyzed from a common nonstochastic standpoint that often reveals new and intriguing connections.

Bandit Algorithms

Bandit Algorithms PDF Author: Tor Lattimore
Publisher: Cambridge University Press
ISBN: 1108687490
Category : Computers
Languages : en
Pages : 538

Get Book Here

Book Description
Decision-making in the face of uncertainty is a significant challenge in machine learning, and the multi-armed bandit model is a commonly used framework to address it. This comprehensive and rigorous introduction to the multi-armed bandit problem examines all the major settings, including stochastic, adversarial, and Bayesian frameworks. A focus on both mathematical intuition and carefully worked proofs makes this an excellent reference for established researchers and a helpful resource for graduate students in computer science, engineering, statistics, applied mathematics and economics. Linear bandits receive special attention as one of the most useful models in applications, while other chapters are dedicated to combinatorial bandits, ranking, non-stationary problems, Thompson sampling and pure exploration. The book ends with a peek into the world beyond bandits with an introduction to partial monitoring and learning in Markov decision processes.

Understanding Machine Learning

Understanding Machine Learning PDF Author: Shai Shalev-Shwartz
Publisher: Cambridge University Press
ISBN: 1107057132
Category : Computers
Languages : en
Pages : 415

Get Book Here

Book Description
Introduces machine learning and its algorithmic paradigms, explaining the principles behind automated learning approaches and the considerations underlying their usage.

The Theory of Learning in Games

The Theory of Learning in Games PDF Author: Drew Fudenberg
Publisher: MIT Press
ISBN: 9780262061940
Category : Business & Economics
Languages : en
Pages : 304

Get Book Here

Book Description
This work explains that equilibrium is the long-run outcome of a process in which non-fully rational players search for optimality over time. The models they e×plore provide a foundation for equilibrium theory and suggest ways for economists to evaluate and modify traditional equilibrium concepts.

Twenty Lectures on Algorithmic Game Theory

Twenty Lectures on Algorithmic Game Theory PDF Author: Tim Roughgarden
Publisher: Cambridge University Press
ISBN: 1316781178
Category : Computers
Languages : en
Pages : 356

Get Book Here

Book Description
Computer science and economics have engaged in a lively interaction over the past fifteen years, resulting in the new field of algorithmic game theory. Many problems that are central to modern computer science, ranging from resource allocation in large networks to online advertising, involve interactions between multiple self-interested parties. Economics and game theory offer a host of useful models and definitions to reason about such problems. The flow of ideas also travels in the other direction, and concepts from computer science are increasingly important in economics. This book grew out of the author's Stanford University course on algorithmic game theory, and aims to give students and other newcomers a quick and accessible introduction to many of the most important concepts in the field. The book also includes case studies on online advertising, wireless spectrum auctions, kidney exchange, and network management.

Introduction to Multi-Armed Bandits

Introduction to Multi-Armed Bandits PDF Author: Aleksandrs Slivkins
Publisher:
ISBN: 9781680836202
Category : Computers
Languages : en
Pages : 306

Get Book Here

Book Description
Multi-armed bandits is a rich, multi-disciplinary area that has been studied since 1933, with a surge of activity in the past 10-15 years. This is the first book to provide a textbook like treatment of the subject.