Dynamic Programming and Reinforcement Learning

ENG EC 710

Undergraduate Prerequisites: (ENGEK500) ENGEK381 and ENG EC402 or ENG EC501. - This course covers the fundamentals of sequential decision making in both known and unknown environments through dynamic programming and reinforcement learning. The first part of the course delves into selecting optimal permissible actions upon observing a system state with established system evolution rules. This section examines finite and infinite horizon stochastic dynamic systems, introducing methods like value iteration, policy iteration, and linear programming solution approaches. Subsequently the course shifts to strategies for optimal action selection under uncertain stochastic system dynamics, covering techniques such as temporal differences, Q-learning, policy gradient, actor-critic, neural network/deep-learning-based reinforcement learning, and federated learning. This course is cross-listed as ENG ME 710 and ENG SE 710. Note: Credit is granted for only one of these courses.

SPRG 2026 Schedule

Section Instructor Location Schedule Notes
A1 Caramanis CGS 421 MW 10:10 am-11:55 am Mts w/ENG ME710 Mts w/ENG SE710

Note that this information may change at any time. Please visit the MyBU Student Portal for the most up-to-date course information.