<node id="688808">
  <nid>688808</nid>
  <type>event</type>
  <uid>
    <user id="27707"><![CDATA[27707]]></user>
  </uid>
  <created>1773065619</created>
  <changed>1773065664</changed>
  <title><![CDATA[PhD Defense by Xinyuan Cao]]></title>
  <body><![CDATA[<p><strong>Title:&nbsp;</strong>Foundations of Efficient Representation Learning</p><p>&nbsp;</p><p><strong>Date:&nbsp;</strong>March 17, 2026</p><p><strong>Time:&nbsp;</strong>12:00 pm - 2:00 pm ET</p><p><strong>Location</strong>: Klaus 1212</p><p><strong>Zoom link</strong>: <a href="https://gatech.zoom.us/j/94992205747" title="https://gatech.zoom.us/j/94992205747">https://gatech.zoom.us/j/94992205747</a></p><p>&nbsp;</p><p><strong>Xinyuan Cao</strong></p><p>Machine Learning PhD Student</p><p>School of Computer Science<br>Georgia Institute of Technology</p><p>&nbsp;</p><p><strong>Committee</strong></p><p>1 Dr. Santosh Vempala&nbsp;(Advisor), School of Computer Science, Georgia Institute of Technology</p><p>2 Dr. Jacob Abernethy, School of Computer Science, Georgia Institute of Technology</p><p>3 Dr. Pan Li, School of Electrical and Computer Engineering, Georgia Institute of Technology</p><p>4 Dr. Sahil Singla, School of Computer Science, Georgia Institute of Technology</p><p>5 Dr. Freda Shi, School of Computer Science, University of Waterloo</p><p>&nbsp;</p><p><strong>Abstract</strong></p><p>Representation learning extracts lower-dimensional, structured features from complex, unstructured data and reuses them across tasks. Despite strong empirical performance, its theoretical foundations remain limited. This thesis bridges this gap by developing formal efficiency guarantees for representation learning.</p><p>&nbsp;</p><p>First, we study unsupervised identification of geometric structure and give a polynomial-time algorithm that recovers a halfspace with margin from unlabeled data under broad distributional conditions. Next, we analyze implicit structure in sequence modeling. By formalizing long-range structure using efficient distinguishers, we prove that minimizing next-token prediction loss over bounded-size networks yields an indistinguishable language model, with model size polynomial in the distinguisher parameters and independent of document length. Finally, we study how learned structures can be efficiently transferred in lifelong learning, where tasks arrive sequentially and the model continually refines the representation while maintaining performance on earlier tasks. We propose algorithms that dynamically learn, refine, and reuse features across sequential tasks, achieving near-optimal sample complexity.&nbsp;</p><p>&nbsp;</p><p>Together, these results provide a unified learning-theoretic foundation for efficient representation learning, spanning how structure can be identified, induced by training objectives, and transferred across tasks.</p><p>&nbsp;</p>]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Foundations of Efficient Representation Learning]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p>Foundations of Efficient Representation Learning</p>]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2026-03-17T12:00:00-04:00]]></value>
      <value2><![CDATA[2026-03-17T14:00:00-04:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[Klaus 1212]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>221981</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[Graduate Studies]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1788</tid>
        <value><![CDATA[Other/Miscellaneous]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>100811</tid>
        <value><![CDATA[Phd Defense]]></value>
      </item>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
