Two-Armed Restless Bandits with Imperfect Information: Stochastic Control and Indexability

Roland G. Fryer; Jr.; Philipp Harms

doi:10.3386/w19043

Two-Armed Restless Bandits with Imperfect Information: Stochastic Control and Indexability

Roland G. Fryer, Jr. & Philipp Harms

Working Paper 19043

DOI 10.3386/w19043

Issue Date May 2013

We present a two-armed bandit model of decision making under uncertainty where the expected return to investing in the "risky arm'' increases when choosing that arm and decreases when choosing the "safe'' arm. These dynamics are natural in applications such as human capital development, job search, and occupational choice. Using new insights from stochastic control, along with a monotonicity condition on the payoff dynamics, we show that optimal strategies in our model are stopping rules that can be characterized by an index which formally coincides with Gittins' index. Our result implies the indexability of a new class of "restless'' bandit models.

We are grateful to Richard Holden, Peter Michor, Derek Neal, Ariel Pakes, Yuliy Sannikov, Mete Soner, Josef Teichmann and seminar participants at Barcelona GSE and Harvard University for helpful comments and suggestions. Financial support from the Education Innovation Laboratory at Harvard University is gratefully acknowledged. Correspondence can be addressed to the authors by e-mail: rfryer@fas.harvard.edu [Fryer] or pharms@edlabs.harvard.edu [Harms]. The usual caveat applies. The views expressed herein are those of the authors and do not necessarily reflect the views of the National Bureau of Economic Research.
Copy Citation

Roland G. Fryer, Jr. and Philipp Harms, "Two-Armed Restless Bandits with Imperfect Information: Stochastic Control and Indexability," NBER Working Paper 19043 (2013), https://doi.org/10.3386/w19043.

Download Citation

MARC RIS BibTeΧ

Two-Armed Restless Bandits with Imperfect Information: Stochastic Control and Indexability

Related

Topics

Programs

More from the NBER