BAI in Exponential Family: Efficiency and Optimality
2024 IEEE International Symposium on Information Theory (ISIT)(2024)
Abstract
This paper investigates a hitherto unaddressed as-pect of best arm identification (BAI) in stochastic multi-armed bandits in the fixed -confidence setting. Two essential metrics for assessing bandit algorithms are computational efficiency and performance optimality (e.g., in sample complexity). In stochastic BAI literature, there have been advances in designing algorithms to achieve optimal performance, but they are generally computationally expensive to implement (e.g., optimization-based methods). There also exist approaches that have high computationally efficiency but do not achieve the optimal performance (e.g., UCB-based methods) or achieve it up to a gap (e.g., the
$\beta-$
optimal approaches in top-two methods). This paper introduces a framework and an algorithm for BAI that achieves optimal performance with a computationally efficient set of decision rules. The central process that facilitates this is a routine for sequentially estimating the optimal allocations up to sufficient fidelity. Specifically, these estimates are accurate enough for identifying the best arm (hence, achieving optimality) but not excessively accurate to an unnecessary extent (hence, maintaining efficiency). Numerical evaluations are provided to (i) establish the optimality and efficiency of the algorithm, (ii) showcase the implicit estimation property of the proposed allocation rules, and (ii) demonstrate the superior performance of the proposed algorithms compared to the existing ones.
MoreTranslated text
Key words
Exponential Family,Computational Efficiency,Optimal Allocation,Multi-armed Bandit,Complex Problems,Transportation Costs,Kullback-Leibler,Weight Vector,Error Probability,Sample Space,Stopping Rule,Optimal Resource Allocation,Decision Confidence,Optimal Proportion,Additional Penalty,Bayesian Setting,Balancing Algorithm,Sample Arm,Optimal Guaranteed,Asymptotic Optimality
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined