<node id="674043">
  <nid>674043</nid>
  <type>event</type>
  <uid>
    <user id="27707"><![CDATA[27707]]></user>
  </uid>
  <created>1712597597</created>
  <changed>1712597640</changed>
  <title><![CDATA[PhD Defense by  Ashutosh Baheti]]></title>
  <body><![CDATA[<p><span><span><span><strong><span>Title:</span></strong><span>&nbsp;Towards Fine-grained Multi-Attribute Control Using Language Models</span></span></span></span></p>

<p><span><span><span><strong><span>Date:</span></strong><span>&nbsp;Wednesday, 17th April,&nbsp;2024</span></span></span></span></p>

<p><span><span><span><strong><span>Time:</span></strong><span>&nbsp;2:30 PM to 4:15 PM ET (11:30 AM - 1:15 PM PT)</span></span></span></span></p>

<p><span><span><span><strong><span>Location:</span></strong><span>&nbsp;<a href="https://gatech.zoom.us/j/95160762750?pwd=NUl3UlhhSXlIc2xsY3lDbTZkbjhpUT09&amp;from=addon" title="https://gatech.zoom.us/j/95160762750?pwd=NUl3UlhhSXlIc2xsY3lDbTZkbjhpUT09&amp;from=addon">Virtual Zoom Link</a>&nbsp;</span></span></span></span></p>

<p><span><span><span><span>Meeting ID: 951&nbsp;6076 2750<br />
Passcode:   299490</span></span></span></span></p>

<p>&nbsp;</p>

<p><span><span><span><strong><span><span><span>Ashutosh Baheti</span></span></span></strong></span></span></span></p>

<p><span><span><span><span><span><span>Computer Science Ph.D. Candidate</span></span></span></span></span></span></p>

<p><span><span><span><span><span><span>School of Interactive Computing<br />
College of Computing</span></span></span></span></span></span></p>

<p><span><span><span><span><span><span>Georgia Institute of Technology</span></span></span></span></span></span></p>

<p><span><span><span><span><span><span><a href="https://abaheti95.github.io/">https://abaheti95.github.io/</a></span></span></span></span></span></span></p>

<p><span><span><span><strong><span>Committee:</span></strong></span></span></span></p>

<p><span><span><span><span>Prof. Mark Riedl (Advisor) -- School of Interactive Computing, Georgia Institute of Technology</span></span></span></span></p>

<p><span><span><span><span>Prof. Alan Ritter (Co-Advisor) -- School of Interactive Computing, Georgia Institute of Technology</span></span></span></span></p>

<p><span><span><span><span>Prof. Dhruv Batra -- School of Interactive Computing, Georgia Institute of Technology</span></span></span></span></p>

<p><span><span><span><span>Prof. Munmun de Choudhury -- School of Interactive Computing, Georgia Institute of Technology</span></span></span></span></p>

<p><span><span><span><span>Prof. Maarten Sap -- Language Technologies Institute, Carnegie Mellon University</span></span></span></span></p>

<p><span><span><span><strong><span>Abstract</span></strong></span></span></span></p>

<p><span><span><span><span>As we increasingly rely on powerful language models, ensuring their safe and effective operation necessitates extensive research in controllable text generation. Existing state-of-the-art language models struggle to generate the most accurate or desired output at the first attempt. Inspired by recent developments in self-correction in large language models and new reinforcement learning methods, we aim to train smaller language models as fine-grained editors, whereby they iteratively edit outputs to satisfy threshold constraints over multiple classifier-based attributes.</span></span></span></span></p>

<p>&nbsp;</p>

<p><span><span><span><span>In this thesis, I show a study of contextual offensive behavior of pretrained large language models and curate a high-quality dataset for toxicity detection. Next, I introduce a novel offline RL algorithm that can utilize arbitrary numeric scores as rewards during training to optimize any user-desired LM behavior by filtering out suboptimal data. Finally, I design an offline RL framework, I propose a fine-grained multi-attribute controllability task, where the goal is to guide the language model to generate output sequences that satisfy user-defined threshold-based attribute constraints. The LM model can take multiple edits to reach the desired attributes. Experiments on both languages and proteins demonstrate the versatility and effectiveness of our approach.</span></span></span></span></p>
]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Towards Fine-grained Multi-Attribute Control Using Language Models]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p><span><span><span>Towards Fine-grained Multi-Attribute Control Using Language Models</span></span></span></p>
]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2024-04-17T14:30:00-04:00]]></value>
      <value2><![CDATA[2024-04-17T16:15:05-04:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[ZOOM]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>221981</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[Graduate Studies]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1788</tid>
        <value><![CDATA[Other/Miscellaneous]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>100811</tid>
        <value><![CDATA[Phd Defense]]></value>
      </item>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
