{"id":29283,"date":"2024-08-16T11:10:30","date_gmt":"2024-08-16T15:10:30","guid":{"rendered":"http:\/\/www.bu.edu\/csmet\/?page_id=29283"},"modified":"2026-01-28T13:10:45","modified_gmt":"2026-01-28T18:10:45","slug":"cs766","status":"publish","type":"page","link":"https:\/\/www.bu.edu\/csmet\/academic-programs\/courses\/cs766\/","title":{"rendered":"Deep Reinforcement Learning"},"content":{"rendered":"<style><span data-mce-type=\"bookmark\" style=\"display: inline-block; width: 0px; overflow: hidden; line-height: 0;\" class=\"mce_SELRES_start\">\ufeff<\/span><span data-mce-type=\"bookmark\" style=\"display: inline-block; width: 0px; overflow: hidden; line-height: 0;\" class=\"mce_SELRES_start\">\ufeff<\/span> p {margin-bottom:0px;} .course-feed .cf-course h4 {display: none; white-space: wrap;} .button{width:125px;}<\/style>\n<hr \/>\n<div class=\"course-feed\"><div class=\"cf-course\">\n\t<h4>Deep Reinforcement Learning<\/h4>\n\t<p class=\"meta\">MET CS 766 (4 credits)<\/p>\n\t\n\t<p>Prerequisites: MET CS 577 or consent of instructor. - This course focuses on reinforcement learning, covering fundamental concepts and advanced techniques. It begins with an introduction to reinforcement learning and key concepts, such as exploitation versus exploration and Markov Decision Processes. As the course progresses, it delves into state transition diagrams, the Bellman equation, and solutions to the Multi-Armed Bandit problem. Students explore challenges and methods related to control and prediction, then learn tabular methods, including Monte Carlo, Dynamic Programming, Temporal Difference Learning, SARSA, and Q-Learning. The course then reviews neural network concepts, covering convolutional and recurrent neural networks, and moves on to approximation methods for both discrete and continuous spaces, including DQN and its variants, followed by policy gradient and actor-critic methods.
Finally, ethical considerations in AI and safety issues are also discussed.<\/p>\n\t\n\t<p class=\"\"><em>2026SPRGMETCS766A1, Jan 20th to Apr 30th 2026<\/em><\/p>\n<div class=\"responsive-table\">\n<table>\n\t<tr>\n\t\t<th>Days<\/th>\n\t\t<th>Start<\/th>\n\t\t<th>End<\/th>\n\t\t<th>Type<\/th>\n\t\t<th>Bldg<\/th>\n\t\t<th>Room<\/th>\n\t<\/tr>\n\t<tr>\n\t<td>W<\/td>\n\t<td>06:00 PM<\/td>\n\t<td>08:45 PM<\/td>\n\t<td><\/td>\n\t<td>PHO<\/td>\n\t<td>201<\/td>\n<\/tr>\n<\/table>\n<\/div>\n<\/div><\/div>\n<p><strong>Format &amp; Syllabus:<\/strong><\/p>\n<div class=\"btn-group\">\n<div class=\"dropdown\">\n<p><button class=\"button\">On Campus<\/button><\/p>\n<div class=\"dropdown-content\"><a href=\"\/csmet\/files\/2025\/08\/Syllabus-CS766-Deep-Reinforcement-Learning-Syllabus.pdf\"><\/a><a href=\"\/csmet\/files\/2026\/01\/CS_766_Syllabus_2026.pdf\">766 A1 SPRG26<\/a><a href=\"\/csmet\/files\/2025\/08\/Syllabus-CS766-Deep-Reinforcement-Learning-Syllabus.pdf\">766 A1 FALL25<\/a><\/div>\n<\/div>\n<div class=\"dropdown\">\n<p><button class=\"button\" disabled=\"disabled\">Online<\/button><\/p>\n<div class=\"dropdown-content\"><\/div>\n<\/div>\n<div class=\"dropdown\"><button class=\"button\" disabled=\"disabled\">Blended<\/button><\/div>\n<\/div>\n<div class=\"clearfloat\"><\/div>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Format &amp; Syllabus: On Campus 766 A1 SPRG26766 A1 FALL25 Online Blended 
&nbsp;<\/p>\n","protected":false},"author":22903,"featured_media":0,"parent":7301,"menu_order":12,"comment_status":"closed","ping_status":"closed","template":"page-templates\/no-sidebars.php","meta":[],"_links":{"self":[{"href":"https:\/\/www.bu.edu\/csmet\/wp-json\/wp\/v2\/pages\/29283"}],"collection":[{"href":"https:\/\/www.bu.edu\/csmet\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.bu.edu\/csmet\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.bu.edu\/csmet\/wp-json\/wp\/v2\/users\/22903"}],"replies":[{"embeddable":true,"href":"https:\/\/www.bu.edu\/csmet\/wp-json\/wp\/v2\/comments?post=29283"}],"version-history":[{"count":6,"href":"https:\/\/www.bu.edu\/csmet\/wp-json\/wp\/v2\/pages\/29283\/revisions"}],"predecessor-version":[{"id":31281,"href":"https:\/\/www.bu.edu\/csmet\/wp-json\/wp\/v2\/pages\/29283\/revisions\/31281"}],"up":[{"embeddable":true,"href":"https:\/\/www.bu.edu\/csmet\/wp-json\/wp\/v2\/pages\/7301"}],"wp:attachment":[{"href":"https:\/\/www.bu.edu\/csmet\/wp-json\/wp\/v2\/media?parent=29283"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}