<node id="633197">
  <nid>633197</nid>
  <type>event</type>
  <uid>
    <user id="34868"><![CDATA[34868]]></user>
  </uid>
  <created>1583172638</created>
  <changed>1584063364</changed>
  <title><![CDATA[(POSTPONED) Warren Powell workshop on Stochastic Optimization and Machine Learning ]]></title>
  <body><![CDATA[<p>Workshop:</p>

<p>A Unified Framework on Sequential Decisions under Uncertainty</p>

<p>Georgia Tech, ISyE</p>

<p>Warren B Powell<br />
Princeton University<br />
April 3, 2020</p>

<p>Sequential decision problems arise in application areas that include engineering, the sciences, transportation, logistics, health services, medical decision making, energy, e-commerce and finance, along multiagent problems that arise with drones and robotics.&nbsp; These problems have been addressed in the academic literature using a variety of modeling and algorithmic frameworks, including dynamic programming, stochastic programming, optimal control, simulation optimization, approximate dynamic programming/reinforcement learning, and even multiarmed bandit problems.</p>

<p>In sharp contrast with the field of deterministic math programming which enjoys a canonical framework that is used around the world, sequential decision problems are described in the literature using at least eight fundamental modeling languages plus at least six more derivative dialects.&nbsp; Further, application communities have developed a wide range of solution methods that reflect the characteristics of specific problem classes.</p>

<p>In this workshop, I will introduce a single, canonical modeling framework that covers every sequential decision problem.&nbsp; The framework consists of five dimensions (drawing heavily on the approach used by stochastic control) that naturally represent real-world problems and map directly into software (and vice versa).&nbsp; The framework clearly lays out the elements of any sequential decision problem that have to be modeled, which helps to clarify the understanding of complex systems.</p>

<p>Special attention will be given to the modeling of state variables, which are a surprising source of confusion in the research literature.&nbsp; I will illustrate the different classes of state variables, including belief states, using a series of energy storage problems.&nbsp; Through proper handling of state variables, I show how we can model pure learning problems, pure resource allocation problems, hybrid learning/resource allocation problems, and contextual problems.&nbsp;</p>

<p>Our modeling strategy is centered on the challenge of optimizing over policies (functions for making decisions).&nbsp; I will describe four (meta) classes of policies, that can be divided into two broad groups: the policy search class (finding the best function that works well over time), and lookahead policies that approximate the downstream impact of making a decision now.&nbsp; I will claim that these four classes are universal: any method proposed in the literature (or used in practice) falls into one of these four classes, or a hybrid of two or more.&nbsp; I will illustrate parametric cost function approximations that are widely used in practice, but almost completely ignored by the academic community.&nbsp;</p>

<p>I will illustrate the four classes of policies in the context of pure learning problems, and the much richer class of state-dependent problems.&nbsp; In the process, I will highlight the strengths of policies in the policy search class (simplicity, ability to incorporate structure) and the weaknesses (tunable parameters).&nbsp; I will use an energy storage problem to show that we can make each of the four classes of policies (and possibly a hybrid) work best depending on the characteristics of the data.&nbsp;</p>
]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[A Unified Framework on Sequential Decisions under Uncertainty]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p>Sequential decision problems arise in application areas that include engineering, the sciences, transportation, logistics, health services, medical decision making, energy, e-commerce and finance, along multiagent problems that arise with drones and robotics.&nbsp; These problems have been addressed in the academic literature using a variety of modeling and algorithmic frameworks, including dynamic programming, stochastic programming, optimal control, simulation optimization, approximate dynamic programming/reinforcement learning, and even multiarmed bandit problems.</p>

<p>In sharp contrast with the field of deterministic math programming which enjoys a canonical framework that is used around the world, sequential decision problems are described in the literature using at least eight fundamental modeling languages plus at least six more derivative dialects.&nbsp; Further, application communities have developed a wide range of solution methods that reflect the characteristics of specific problem classes.</p>

<p>In this workshop, I will introduce a single, canonical modeling framework that covers every sequential decision problem.&nbsp; The framework consists of five dimensions (drawing heavily on the approach used by stochastic control) that naturally represent real-world problems and map directly into software (and vice versa).&nbsp; The framework clearly lays out the elements of any sequential decision problem that have to be modeled, which helps to clarify the understanding of complex systems.</p>

<p>Special attention will be given to the modeling of state variables, which are a surprising source of confusion in the research literature.&nbsp; I will illustrate the different classes of state variables, including belief states, using a series of energy storage problems.&nbsp; Through proper handling of state variables, I show how we can model pure learning problems, pure resource allocation problems, hybrid learning/resource allocation problems, and contextual problems.&nbsp;</p>

<p>Our modeling strategy is centered on the challenge of optimizing over policies (functions for making decisions).&nbsp; I will describe four (meta) classes of policies, that can be divided into two broad groups: the policy search class (finding the best function that works well over time), and lookahead policies that approximate the downstream impact of making a decision now.&nbsp; I will claim that these four classes are universal: any method proposed in the literature (or used in practice) falls into one of these four classes, or a hybrid of two or more.&nbsp; I will illustrate parametric cost function approximations that are widely used in practice, but almost completely ignored by the academic community.&nbsp;</p>

<p>I will illustrate the four classes of policies in the context of pure learning problems, and the much richer class of state-dependent problems.&nbsp; In the process, I will highlight the strengths of policies in the policy search class (simplicity, ability to incorporate structure) and the weaknesses (tunable parameters).&nbsp; I will use an energy storage problem to show that we can make each of the four classes of policies (and possibly a hybrid) work best depending on the characteristics of the data.&nbsp;</p>
]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2020-04-03T10:00:00-04:00]]></value>
      <value2><![CDATA[2020-04-03T18:00:00-04:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Faculty/Staff]]></value>
      </item>
          <item>
        <value><![CDATA[Postdoc]]></value>
      </item>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
          <item>
        <value><![CDATA[Graduate students]]></value>
      </item>
          <item>
        <value><![CDATA[Undergraduate students]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[https://www.isye.gatech.edu/about/maps-directions/isye-building-complex]]></url>
      <title><![CDATA[ISyE Building Complex]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>1242</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[School of Industrial and Systems Engineering (ISYE)]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1795</tid>
        <value><![CDATA[Seminar/Lecture/Colloquium]]></value>
      </item>
      </field_categories>
  <field_keywords>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
