Dzhoha A. Sequential resource allocation in a stochastic environment

Українська версія

Thesis for the degree of Doctor of Philosophy (PhD)

State registration number

0824U002038

Applicant for

Specialization

  • 124 - Системний аналіз

18-06-2024

Specialized Academic Board

5403

Taras Shevchenko National University of Kyiv

Essay

The focus of this dissertation study lies in the asymptotic analysis of policies within the context of sequential resource allocation tasks in a stochastic environment. This environment is characterized by a collection of beta distributions with unknown parameters, resembling a stochastic multi-armed bandit model. The central aim of this study is twofold: firstly, to conduct an asymptotic analysis of policies and adapt algorithms tailored to the specific environment; and secondly, to explore the impact of contextual information on the effectiveness of these strategies. The problem at hand pertains to decision-making under uncertainty, involving a sequential interaction between a decision-maker (referred to as the agent) and the stochastic environment. This interaction unfolds over a finite time horizon. At each step, the agent selects an action from a predefined set and, in return, receives a reward from the environment. The agent’s objective is to consistently select actions that maximize the cumulative reward over the entire horizon. The primary challenge in this task is the lack of prior knowledge regarding the parameters governing the reward process.

Research papers

Dzhoha A. S. Multi-armed bandit problem under delayed feedback // Bulletin of Taras Shevchenko National University of Kyiv. Series: Physics and Mathematics. 2021. no. 1, P. 20–26.

Dzhoha A. S. Sequential resource allocation in a stochastic environment: an overview and numerical experiments // Bulletin of Taras Shevchenko National University of Kyiv. Series: Physics and Mathematics. 2021. no. 3, P. 13–25.

Dzhoha A. S, Rozora I. V. Multi-armed bandit problem with online clustering as side information // Journal of Computational and Applied Mathematics. 2023. Vol. 427, P. 115–132.

Dzhoha A. S., Rozora I. V. Beta upper confidence bound policy for the design of clinical trials // Austrian Journal of Statistics. 2023. Vol. 52, no. SI, P. 26–39.

Files

Similar theses