<node id="630874">
  <nid>630874</nid>
  <type>event</type>
  <uid>
    <user id="34773"><![CDATA[34773]]></user>
  </uid>
  <created>1578669709</created>
  <changed>1582818458</changed>
  <title><![CDATA[ML@GT Seminar: Daniel Russo, Columbia University]]></title>
  <body><![CDATA[<p>ML@GT invites you to a seminar by Daniel Russo, an assistant professor at Columbia Business School.</p>

<p><a href="https://docs.google.com/forms/d/e/1FAIpQLScO_LcGzGuE8reXCZ8v03hknbkweLf20TQ-H5VggfFK0LKB4w/viewform">RSVP here</a></p>

<h3>Talk Title</h3>

<p>Global Optimality Guarantees for Policy Gradient Methods</p>

<h3>Abstract</h3>

<p>Policy gradients methods are perhaps the most widely used class of reinforcement learning algorithms.&nbsp; These methods apply to complex, poorly understood, control problems by performing stochastic gradient descent over a parameterized class of polices. Unfortunately, due to the multi-period nature of the objective, policy gradient algorithms face non-convex optimization problems and can get stuck in suboptimal local minima even for extremely simple problems. This talk with discus structural properties &ndash; shared by several canonical control problems &ndash; that guarantee the policy gradient objective function has no suboptimal stationary points despite being non-convex. Time permitting, I&rsquo;ll also discuss (1) convergence rates that follow as a consequence of this theory and (2) consequences of this theory for policy gradient performed with highly expressive policy classes.</p>

<p>* This talk is based on ongoing joint work with Jalaj Bhandari.</p>

<h3>Bio</h3>

<p>Russo joined the&nbsp;<a href="https://www8.gsb.columbia.edu/faculty-research/divisions/decision-risk-operations">Decision, Risk, and Operations division</a>&nbsp;of the Columbia Business School as an assistant professor in Summer 2017. Prior to joining Columbia, he&nbsp;spent one great year as an assistant professor in the MEDS department at Northwestern&#39;s Kellogg School of Management and one year at Microsoft Research in New England as Postdoctoral Researcher. Russo recieved his&nbsp;Ph.D. from Stanford University in 2015, where he&nbsp;was advised by&nbsp;<a href="http://engineering.stanford.edu/profile/bvr">Benjamin Van Roy</a>. In 2011 Russo recieved his&nbsp;&nbsp;BS in Mathematics and Economics from the University of Michigan.</p>

<p>Russo&#39;s research lies at the intersection of statistical machine learning and sequential decision-making, and contributes to the fields of online optimization, reinforcement learning, and sequential design of experiments. He is&nbsp;interested in the design and analysis of algorithms that learn over time to make increasingly effective decisions through interacting with a poorly understood environment.</p>
]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[ML@GT invites you to a seminar by Daniel Russo, an assistant professor at Columbia University's business school. ]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2020-03-11T13:15:00-04:00]]></value>
      <value2><![CDATA[2020-03-11T14:15:00-04:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Faculty/Staff]]></value>
      </item>
          <item>
        <value><![CDATA[Postdoc]]></value>
      </item>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
          <item>
        <value><![CDATA[Graduate students]]></value>
      </item>
          <item>
        <value><![CDATA[Undergraduate students]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[<p>Kyla Hanson</p>

<p>khanson@cc.gatech.edu</p>
]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>47223</item>
          <item>37041</item>
          <item>1299</item>
          <item>589608</item>
          <item>576481</item>
          <item>431631</item>
          <item>50877</item>
          <item>50875</item>
          <item>50876</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[College of Computing]]></item>
          <item><![CDATA[Computational Science and Engineering]]></item>
          <item><![CDATA[GVU Center]]></item>
          <item><![CDATA[Machine Learning]]></item>
          <item><![CDATA[ML@GT]]></item>
          <item><![CDATA[OMS]]></item>
          <item><![CDATA[School of Computational Science and Engineering]]></item>
          <item><![CDATA[School of Computer Science]]></item>
          <item><![CDATA[School of Interactive Computing]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1795</tid>
        <value><![CDATA[Seminar/Lecture/Colloquium]]></value>
      </item>
      </field_categories>
  <field_keywords>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
