Response-Adaptive Resource Allocation With Many Treatments

Limited Access
This item is unavailable until:
2026-04-13

Date

2025

Journal Title

Journal ISSN

Volume Title

Repository Usage Stats

8
views
0
downloads

Attention Stats

Abstract

We study a sequential decision-making problem in which a decision maker allocates shared resources across multiple treatments over a finite horizon based on the decision maker’s beliefs about each treatment’s performance, with the goal of maximizing the expected total reward. The belief about each treatment’s performance is updated with the realized reward in each period, fitting the problem into a generalized multi-armed bandit model. Some applications of this problem include clinical trials, training of artificial intelligence models, online marketing, and production mix optimization, where efficient resource allocation is crucial for maximizing rewards while learning about treatment effectiveness. Such problems are naturally formulated as weakly coupled stochastic dynamic programs (DPs) that are difficult to solve as the state space grows exponentially in the number of treatments. In this dissertation, we use Lagrangian relaxation to characterize an optimal relaxed policy and its feasible version under assumptions on the time frame, prior beliefs, and available resources. We then bound the relative gap between the value of the feasible relaxed policy and the optimal value function, showing that the feasible policy proposed is asymptotically optimal.

Description

Provenance

Subjects

Operations research, Industrial engineering, dynamic resource allocation, Lagrangian relaxation, weakly coupled stochastic dynamic programs

Citation

Citation

Yan, Jingyi (2025). Response-Adaptive Resource Allocation With Many Treatments. Dissertation, Duke University. Retrieved from https://hdl.handle.net/10161/33379.

Collections


Except where otherwise noted, student scholarship that was shared on DukeSpace after 2009 is made available to the public under a Creative Commons Attribution / Non-commercial / No derivatives (CC-BY-NC-ND) license. All rights in student work shared on DukeSpace before 2009 remain with the author and/or their designee, whose permission may be required for reuse.