{"id":36635,"date":"2022-04-25T12:51:48","date_gmt":"2022-04-25T16:51:48","guid":{"rendered":"https:\/\/www.bu.edu\/cise\/?p=36635"},"modified":"2022-06-09T14:04:41","modified_gmt":"2022-06-09T18:04:41","slug":"rui-liu","status":"publish","type":"post","link":"https:\/\/www.bu.edu\/cise\/rui-liu\/","title":{"rendered":"Rui Liu Wins CISE 2022 Best Student Paper Award"},"content":{"rendered":"<p>Rui Liu, 5th year Ph.D. Candidate in Systems Engineering, won the 2022 CISE Best Paper Award for her paper titled <a href=\"https:\/\/proceedings.mlr.press\/v139\/liu21q.html\" rel=\"noopener noreferrer\" target=\"_blank\">Temporal Difference Learning as Gradient Splitting<\/a>. Liu\u2019s interests consist of reinforcement learning, multi-agent systems, and optimization.<\/p>\n<p><img loading=\"lazy\" src=\"\/cise\/files\/2022\/04\/Rui-636x636.png\" alt=\"\" width=\"239\" height=\"239\" class=\" wp-image-36442 alignleft\" srcset=\"https:\/\/www.bu.edu\/cise\/files\/2022\/04\/Rui-636x636.png 636w, https:\/\/www.bu.edu\/cise\/files\/2022\/04\/Rui-150x150.png 150w, https:\/\/www.bu.edu\/cise\/files\/2022\/04\/Rui-768x768.png 768w, https:\/\/www.bu.edu\/cise\/files\/2022\/04\/Rui-550x550.png 550w, https:\/\/www.bu.edu\/cise\/files\/2022\/04\/Rui-710x710.png 710w, https:\/\/www.bu.edu\/cise\/files\/2022\/04\/Rui-300x300.png 300w, https:\/\/www.bu.edu\/cise\/files\/2022\/04\/Rui-600x600.png 600w, https:\/\/www.bu.edu\/cise\/files\/2022\/04\/Rui-100x100.png 100w, https:\/\/www.bu.edu\/cise\/files\/2022\/04\/Rui.png 1000w\" sizes=\"(max-width: 239px) 100vw, 239px\" \/>Her paper gives a fuller explanation for why a common class of algorithms in reinforcement learning work as well as they do. Reinforcement learning is a type of machine learning that has been applied to autonomous driving, robotics, bidding and advertising, and games. The goal of reinforcement learning is to find an optimal policy in a situation where actions taken by an agent affect the agent\u2019s state and ability to take future actions . This is where temporal difference learning comes in. Algorithms using temporal difference learning as a subroutine are widely used in reinforcement learning.<\/p>\n<p>\u201cRui&#8217;s research has shed new light on a canonical algorithm in reinforcement learning. We are still thinking through all of its implications, but it is likely to have repercussions for many other methods in reinforcement learning, and could lead to the development of entirely new algorithms,\u201d advisor Alex Olshevsky said.<\/p>\n<p>The key new idea in the paper is to interpret temporal difference learning through the lens of a new concept called a \u201cgradient splitting,\u201d introduced in the paper. This leads to a clearer and sharper analysis of temporal difference methods.<\/p>\n<p>\u201cThis work is theoretical. So it explains why this algorithm works and it gives you a better convergence time. When doing work, the proof results may not be as good as this algorithm is in the practice, but our analysis improves the current theoretical results,\u201d Liu said.<\/p>\n<p>Liu hopes her work will improve the state of the art in many robotics applications where temporal difference learning is used.<\/p>\n<p>Liu got her Masters from the Chinese Academy of Sciences and said reinforcement learning interests her because it has many real world applications.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Rui Liu, 5th year Ph.D. Candidate in Systems Engineering, won the 2022 CISE Best Paper Award for her paper titled Temporal Difference Learning as Gradient Splitting. Liu\u2019s interests consist of reinforcement learning, multi-agent systems, and optimization. Her paper gives a fuller explanation for why a common class of algorithms in reinforcement learning work as well [&hellip;]<\/p>\n","protected":false},"author":19737,"featured_media":36416,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[205],"tags":[],"_links":{"self":[{"href":"https:\/\/www.bu.edu\/cise\/wp-json\/wp\/v2\/posts\/36635"}],"collection":[{"href":"https:\/\/www.bu.edu\/cise\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.bu.edu\/cise\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.bu.edu\/cise\/wp-json\/wp\/v2\/users\/19737"}],"replies":[{"embeddable":true,"href":"https:\/\/www.bu.edu\/cise\/wp-json\/wp\/v2\/comments?post=36635"}],"version-history":[{"count":5,"href":"https:\/\/www.bu.edu\/cise\/wp-json\/wp\/v2\/posts\/36635\/revisions"}],"predecessor-version":[{"id":36641,"href":"https:\/\/www.bu.edu\/cise\/wp-json\/wp\/v2\/posts\/36635\/revisions\/36641"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.bu.edu\/cise\/wp-json\/wp\/v2\/media\/36416"}],"wp:attachment":[{"href":"https:\/\/www.bu.edu\/cise\/wp-json\/wp\/v2\/media?parent=36635"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.bu.edu\/cise\/wp-json\/wp\/v2\/categories?post=36635"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.bu.edu\/cise\/wp-json\/wp\/v2\/tags?post=36635"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}