{"671419":{"#nid":"671419","#data":{"type":"event","title":"ML@GT Seminar Series | Exploration vs. Exploitation from Adaptive Control to Reinforcement Learning ","body":[{"value":"\u003Cp\u003EFeaturing P. R. Kumar, Texas A\u0026amp;M University\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cstrong\u003EAbstract:\u003C\/strong\u003E\u0026nbsp;\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003EWe address the problem of exploration versus exploitation that lies at the heart of reinforcement learning of\u0026nbsp;\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003Edynamic systems. We describe the Biased Maximum Likelihood Method proposed to address this challenge.\u0026nbsp;\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003EWe present a comparative study of its regret performance in a variety of contexts ranging from Bandits to\u0026nbsp;\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003EMarkov Decision Processes to LQG systems. We also provide an account of regulation problems where\u0026nbsp;\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003Ethere is no intrinsic conflict between exploration and exploitation, and present a historical account of results\u0026nbsp;\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003Eon stability, asymptotic behavior and robustness.\u0026nbsp;\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E[Joint work with Akshay Mete, Rahul Singh, Ping-Chun Hsieh, Yu-Heng Hung, Xi Liu, and Anirban Bhattacharya].\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cstrong\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003EBio:\u0026nbsp;\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/strong\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003EP. R. Kumar, B. Tech (1973, IIT Madras) and D.Sc. (1977, Washington Univ., St. Louis), was a faculty member in the Math Dept at University of Maryland, Baltimore County (1977-84), ECE and CSL at the University of Illinois, Urbana-Champaign (1985-2011), and has been at Texas A\u0026amp;M University since 2011. He has worked on problems in game theory, adaptive control, simulated annealing, machine learning, queueing networks, manufacturing systems, scheduling wafer fabrication plants, wireless networks and network information theory. His current research focus includes renewable energy, power systems, security, automated transportation, unmanned aerial vehicle traffic management, millimeter wave 5G, and cyber-physical systems. He is a member of the U.S. National Academy of Engineering, The World Academy of Sciences, and Indian National Academy of Engineering. \u0026nbsp;He was awarded an honorary doctorate by ETH, Zurich. \u0026nbsp;He received the Alexander Graham Bell Medal of IEEE, the IEEE Field Award for Control Systems, the Donald Eckman Award of the American Automatic Control Council, the Ellersick Prize of IEEE Communication Society, the Outstanding Contribution Award of ACM SIGMOBILE, the Infocom Achievement Award, the ACM SIGMOBILE Test-of-Time Paper Award, and COMSNETS Outstanding Contribution Award. \u0026nbsp;He is a Fellow of IEEE, ACM and IFAC. He is an Honorary Professor at IIT Hyderabad. \u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n","summary":"","format":"limited_html"}],"field_subtitle":"","field_summary":[{"value":"\u003Cp\u003EMachine Learning Center Seminar Series is held bi-weekly on Wednesdays at 12pm.\u0026nbsp;\u003C\/p\u003E\r\n","format":"limited_html"}],"field_summary_sentence":[{"value":"Featuring P. R. Kumar, Texas A\u0026M University"}],"uid":"36518","created_gmt":"2023-12-05 15:14:48","changed_gmt":"2024-02-07 16:31:51","author":"shatcher8","boilerplate_text":"","field_publication":"","field_article_url":"","field_event_time":{"event_time_start":"2024-02-28T12:00:00-05:00","event_time_end":"2024-02-28T13:00:00-05:00","event_time_end_last":"2024-02-28T13:00:00-05:00","gmt_time_start":"2024-02-28 17:00:00","gmt_time_end":"2024-02-28 18:00:00","gmt_time_end_last":"2024-02-28 18:00:00","rrule":null,"timezone":"America\/New_York"},"location":"CODA 9th Floor Atrium","extras":["free_food"],"related_links":[{"url":"https:\/\/ml.gatech.edu\/","title":""}],"groups":[{"id":"576481","name":"ML@GT"}],"categories":[],"keywords":[{"id":"173555","name":"Center for Machine Learning"},{"id":"9167","name":"machine learning"}],"core_research_areas":[],"news_room_topics":[],"event_categories":[{"id":"1795","name":"Seminar\/Lecture\/Colloquium"}],"invited_audience":[{"id":"78761","name":"Faculty\/Staff"},{"id":"177814","name":"Postdoc"},{"id":"174045","name":"Graduate students"},{"id":"78751","name":"Undergraduate students"}],"affiliations":[],"classification":[],"areas_of_expertise":[],"news_and_recent_appearances":[],"phone":[],"contact":[{"value":"\u003Cp\u003EShelli Hatcher, Program and Operations Manager\u003C\/p\u003E\r\n","format":"limited_html"}],"email":[],"slides":[],"orientation":[],"userdata":""}}}