<node id="675266">
  <nid>675266</nid>
  <type>event</type>
  <uid>
    <user id="27707"><![CDATA[27707]]></user>
  </uid>
  <created>1719585269</created>
  <changed>1719585269</changed>
  <title><![CDATA[PhD Defense by Jiaojiao Fan]]></title>
  <body><![CDATA[<p><strong>Title: Learning and inference for distributions: from optimal transport to MCMC</strong></p><p>&nbsp;</p><p><strong>Date: Friday, July 5th, 2024</strong></p><p><strong>Time: 9:30 am - 12:00 pm EST (9:30 pm - 12:00 am&nbsp;Hong Kong Time)</strong></p><p>Location: Zoom meeting <a href="https://gatech.zoom.us/j/91660072023">https://gatech.zoom.us/j/91660072023</a></p><p>&nbsp;</p><p><strong>Jiaojiao Fan</strong></p><p>Machine Learning PhD Student</p><p>School of Aerospace Engineering<br>Georgia Institute of Technology</p><p>&nbsp;</p><p><strong>Committee</strong></p><p>1 Prof. Yongxin Chen (School of Aerospace Engineering, Georgia Tech; Advisor)</p><p>2 Prof. Molei Tao (School of Mathematics, Georgia Tech)</p><p>3 Prof. Peng Chen (School of Computational Science and Engineering, Georgia Tech)</p><p>4 Prof. Haomin&nbsp;Zhou (School of Mathematics, Georgia Tech)</p><p>5 Prof. Jiaming Liang (Goergen Institute for Data Science &amp; Department of Computer Science, University of Rochester)</p><p>&nbsp;</p><p><strong>Abstract</strong></p><p>In machine learning, distributional data are ubiquitous&nbsp;and pivotal across diverse fields. This dissertation tackles large-scale challenges associated with distributional data, which are often defined by either the volume of data or their high dimensionality, particularly in learning and inference contexts.</p><p>We explore two fundamental mathematical tools essential for the transformation and manipulation of distributional data: optimal transport (OT) and Markov Chain Monte Carlo (MCMC) sampling. OT, a centuries-old mathematical framework, is used for comparing probability distributions but is often hindered by high computational costs. We enhance the computational efficiency of OT by focusing on two aspects: improving multi-marginal OT with graph structures, and scaling OT to handle millions of samples through the introduction of neural OT solvers.</p><p>MCMC sampling, while being the primary method for drawing samples from a given probability density, faces scalability issues, especially in higher dimensions. To overcome these limitations, we introduce a novel algorithm based on the proximal sampler, a type of Gibbs sampling method, and rigorously demonstrate its superior computational efficiency in converging to the target distribution.</p><p>&nbsp;</p>]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Learning and inference for distributions: from optimal transport to MCMC]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p>See Below</p>]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2024-07-05T09:30:57-04:00]]></value>
      <value2><![CDATA[2024-07-05T12:00:00-04:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[Zoom meeting ]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>221981</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[Graduate Studies]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1788</tid>
        <value><![CDATA[Other/Miscellaneous]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>100811</tid>
        <value><![CDATA[Phd Defense]]></value>
      </item>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
