Lectures in Active Sequential Hypothesis Testing and Adaptive Exploration in Reinforcement Learning - Lecture 1

Starts: 4:00 pm on Friday, November 14, 2025
Ends: 6:00 pm on Friday, November 14, 2025

Lecture 1: Sample complexity lower bounds for iid models We formalize the pure exploration problem for Multi-Armed bandit models. We formalize the fixed-confidence Best Arm Identification (BAI) problem and derive information-theoretic lower bounds on the sample complexity via change-of-measure arguments. You’ll see how KL divergences between candidate instances yield the characteristic time and the optimal allocation vector. The characteristic time characterizes the complexity of learning, and we will see how to compute it for simple models. Lecture notes will be provided in advance.

Registration:: https://forms.gle/ohC8KJPPbBt6jrQH8

Back to Calendar