<node id="681886">
  <nid>681886</nid>
  <type>event</type>
  <uid>
    <user id="27707"><![CDATA[27707]]></user>
  </uid>
  <created>1744917231</created>
  <changed>1744917264</changed>
  <title><![CDATA[PhD Defense by Sheng Zhang ]]></title>
  <body><![CDATA[<p>Title: Finite-Sample Guarantees of Stochastic Approximation Algorithms in Average-Reward Reinforcement Learning and Decentralized Stochastic Optimization<br><br>Date: April 18th, 2025<br>Time: 1:00 PM – 3:00 PM (EDT)&nbsp;<br>Location: https://gatech.zoom.us/j/7545146010<br><br>Sheng Zhang<br>Machine Learning PhD Student&nbsp;<br>H. Milton Stewart School of Industrial and Systems Engineering<br>Georgia Institute of Technology&nbsp;<br><br>Committee<br>1.&nbsp;&nbsp;&nbsp;&nbsp;Prof. Justin Romberg (ECE, Advisor)<br>2.&nbsp;&nbsp;&nbsp;&nbsp;Prof. Ashwin Pananjady (ISYE &amp; ECE, Advisor)<br>3.&nbsp;&nbsp;&nbsp;&nbsp;Prof. Siva Maguluri (ISYE)<br>4.&nbsp;&nbsp;&nbsp;&nbsp;Prof. Guanghui (George) Lan (ISYE)<br>5.&nbsp;&nbsp;&nbsp;&nbsp;Prof. Thinh Doan (Department of Aerospace Engineering and Engineering Mechanics, University of Texas Austin)<br><br>Abstract<br>Many problems in operations research and machine learning can be formulated as root-finding or fixed-point equations, often solved using iterative algorithms such as stochastic approximation (SA). This thesis focuses on the theoretical analysis of SA methods, particularly in the context of reinforcement learning (RL) and decentralized optimization. We first study the finite-sample behavior of average-reward RL algorithms, addressing the unique challenges posed by the absence of contraction properties. We then develop a general non-asymptotic theory for SA under seminorm-contractive operators, extending classical stability results and offering sharper performance guarantees. Finally, we propose and analyze stochastic dual accelerated algorithms for decentralized stochastic optimization, showing optimal convergence under communication and data constraints. Together, these contributions advance the understanding and design of efficient SA algorithms for large-scale, data-driven decision-making problems.<br>&nbsp;<br>&nbsp;</p>]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Finite-Sample Guarantees of Stochastic Approximation Algorithms in Average-Reward Reinforcement Learning and Decentralized Stochastic Optimization]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p>Finite-Sample Guarantees of Stochastic Approximation Algorithms in Average-Reward Reinforcement Learning and Decentralized Stochastic Optimization</p>]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2025-04-18T13:00:00-04:00]]></value>
      <value2><![CDATA[2025-04-18T15:00:00-04:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[REMOTE]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>221981</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[Graduate Studies]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1788</tid>
        <value><![CDATA[Other/Miscellaneous]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>100811</tid>
        <value><![CDATA[Phd Defense]]></value>
      </item>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
