<node id="682735">
  <nid>682735</nid>
  <type>event</type>
  <uid>
    <user id="27707"><![CDATA[27707]]></user>
  </uid>
  <created>1749484422</created>
  <changed>1749484483</changed>
  <title><![CDATA[PhD Defense by  Alex Havrilla]]></title>
  <body><![CDATA[<p>Alex Havrilla</p><p>&nbsp;</p><p>Title: Towards a Theory and Practice of Open-ended Reasoning with Generative Models</p><p>&nbsp;</p><p>Date: 6/13/2025</p><p>Time: 1 PM</p><p>Location:</p><p>-&nbsp;In-person: Skiles 202</p><p>-&nbsp;Remote: <a href="https://gatech.zoom.us/j/5472104648">https://gatech.zoom.us/j/5472104648</a></p><p>&nbsp;</p><p>Alexander Havrilla</p><p>Machine Learning PhD Student</p><p>School of Mathematics</p><p>Georgia Institute of Technology</p><p>&nbsp;</p><p>Committee</p><p>1 Dr. Wenjing Liao, School of Mathematics (Advisor), Georgia Tech</p><p>2 Dr. Mark Riedl, School of Interactive Computing, Georgia Tech</p><p>3 Dr. Tuo Zhao, School of Industrial and Systems Engineering, Georgia Tech</p><p>4 Dr. Jacob Abernethy, School of Interactive Computing, Georgia Tech</p><p>5 Dr. David Alvarez-Melis, School of Engineering and Applied Sciences, Harvard</p><p>&nbsp;</p><p>Abstract</p><p>Driven by advancements in large language modeling (LLMs), the last several years have seen an explosion in AI reasoning capability. In this dissertation, we characterize two distinct types of reasoning: closed-ended reasoning versus open-ended reasoning. We define closed-ended reasoning as the systematic application of a defined set of rules to reach a desired outcome. In contrast, we describe open-ended reasoning as a less structured process, often requiring the creation or adaptation of new rule sets themselves, and characterized by a greater need for exploration and discovery. While LLMs increasingly excel at closed-ended reasoning, they struggle more with problems requiring the open-ended counterpart. We study both types of reasoning in three parts. First, by establishing novel approximation and statistical theory for LLMs. This theory elucidates data complexity as a driving factor behind scaling laws, which themselves have a strong downstream effect on reasoning ability. Then, to improve reasoning ability in practice, we develop a novel RL framework for LLMs, trlX, which is used to fine-tune LLMs on reasoning problems. Our analysis reveals the exploration ability of LLMs as a key bottleneck to future improvement via RL. This leads us to propose SPARQ: a self-improvement style synthetic data generation algorithm drawing on techniques from the quality-diversity (QD) literature to improve both the correctness and diversity of LLM reasoning. We conclude by discussing open problems and future directions for better open-ended AI reasoning.</p><p>&nbsp;</p>]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Towards a Theory and Practice of Open-ended Reasoning with Generative Models]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p>Towards a Theory and Practice of Open-ended Reasoning with Generative Models</p>]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2025-06-13T13:00:00-04:00]]></value>
      <value2><![CDATA[2025-06-16T15:00:00-04:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[Skiles 202]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>221981</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[Graduate Studies]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1788</tid>
        <value><![CDATA[Other/Miscellaneous]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>100811</tid>
        <value><![CDATA[Phd Defense]]></value>
      </item>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
