Lectures in Active Sequential Hypothesis Testing and Adaptive Exploration in Reinforcement Learning - Lecture 1
- Starts: 4:00 pm on Friday, November 14, 2025
- Ends: 6:00 pm on Friday, November 14, 2025
Lecture 1: Sample complexity lower bounds for iid models
We formalize the pure exploration problem for Multi-Armed bandit models. We formalize the fixed-confidence Best Arm Identification (BAI) problem and derive information-theoretic lower bounds on the sample complexity via change-of-measure arguments. You’ll see how KL divergences between candidate instances yield the characteristic time and the optimal allocation vector. The characteristic time characterizes the complexity of learning, and we will see how to compute it for simple models.
Lecture notes will be provided in advance.
- Registration:
- https://forms.gle/ohC8KJPPbBt6jrQH8