<node id="125721">
  <nid>125721</nid>
  <type>event</type>
  <uid>
    <user id="1"><![CDATA[1]]></user>
  </uid>
  <created>1335179674</created>
  <changed>1475891925</changed>
  <title><![CDATA[Ph.D. Thesis Proposal: Christopher Simpkins]]></title>
  <body><![CDATA[<p>Ph.D. Thesis Proposal Announcement<br /><br />Title: <strong>Integrating Reinforcement Learning into a Programming Language</strong><br /><br />Christopher Simpkins<br />School of Interactive Computing<br />Georgia Institute of Technology</p><p>&nbsp;</p><p>Date:&nbsp;&nbsp;&nbsp;&nbsp;<strong> 8 May 2012</strong> (revised)<br />Time:&nbsp;&nbsp;&nbsp;&nbsp; <strong>1:00 - 3:00 pm</strong> (revised)<br />Location: <strong>Klaus 1116W</strong> (revised)</p><p><br /><strong>Committee:</strong></p><ul><li>Professor Charles Isbell, School of Interactive Computing (Advisor)</li><li>Dr. Douglas Bodner, Tennenbaum Institute Professor</li><li>Mark Riedl, School of Interactive Computing</li><li>Dr. Spencer Rugaber, School of Computer Science</li><li>Professor Andrea Thomaz, School of Interactive Computing</li></ul><p><br /><strong>Abstract:</strong><br />My Thesis: Integrating modular reinforcement learning (MRL) into a programming language supports adaptive agent software engineering. There are three claims implied in this thesis statement: (1) there is a such thing as MRL in a software engineering sense, (2) integrating MRL into a programming language is feasible, and (3) integrating MRL into a programming language is useful to software engineers writing adaptive software agents.<br /><br />Modular reinforcement learning decomposes a reinforcement learning agent into components that solve subproblems of the total problem faced by an agent.&nbsp; Hierarchical reinforcement learning (HRL), which decomposes problems temporally into subtasks, is well developed.&nbsp; MRL, which decomposes problems into concurrent subproblems, is still nascent.&nbsp; Existing approaches to MRL are not modular in a software engineering sense because inter-component reward coupling prevents reuse.&nbsp; This dissertation will demonstrate the reward coupling problem and contribute a solution in the form of a reformulation of MRL and an algorithm that implements it.<br /><br />Our goal is to support practical software engineering.&nbsp; The best way to support software engineering is with practical, usable programming languages.&nbsp; This dissertation will contribute a programming language, implemented as a Scala library and asosciated idioms and design patterns, called AFABL -- A {Friendly|Flexible} Adaptive Behavior Language -- that integrates MRL, making MRL useful to software engineers writing practical adaptive agent software.<br /><br />Finally, we will apply AFABL to non-player character (NPC) programming in games and agent simulations to demonstrate its usefulness to software engineers writing adaptive software agents.&nbsp; This application of AFABL to practical software engineering problems will distinguish AFABL from previous work in integrating RL into programming languages such as ALisp.</p>]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Integrating Reinforcement Learning into a Programming Language]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2012-05-08T18:00:00-04:00]]></value>
      <value2><![CDATA[2012-05-08T20:00:00-04:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[<p><a href="mailto:chris.simpkins@gatech.edu">Christopher Simpkins</a></p>]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>47223</item>
          <item>50876</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[College of Computing]]></item>
          <item><![CDATA[School of Interactive Computing]]></item>
      </og_groups_both>
  <field_categories>
      </field_categories>
  <field_keywords>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
