<node id="686276">
  <nid>686276</nid>
  <type>event</type>
  <uid>
    <user id="27707"><![CDATA[27707]]></user>
  </uid>
  <created>1762459808</created>
  <changed>1762459863</changed>
  <title><![CDATA[PhD Proposal by Shuo Cheng]]></title>
  <body><![CDATA[<p><strong>Title: Robot Skill Representations for Long-Horizon Task Learning and Deployment</strong></p><p>&nbsp;</p><p>Shuo Cheng</p><p>Ph.D. Student in Computer Science</p><p>School of Interactive Computing&nbsp;</p><p>Georgia Institute of Technology&nbsp;</p><p><a href="https://sites.google.com/view/shuocheng">https://sites.google.com/view/shuocheng</a></p><p>&nbsp;</p><p>&nbsp;</p><p><strong>Date:</strong>&nbsp;Monday, Nov 17th, 2025</p><p><strong>Time:</strong>&nbsp;11:00 AM - &nbsp;12:30 PM EST</p><p><strong>Location:</strong>&nbsp;Klaus 1212,&nbsp;<a href="https://gatech.zoom.us/j/2301796303?pwd=QjFYTE5sTzhCeVJRNXdpbXViSjFMUT09&amp;omn=99157421529">Zoom Link</a></p><p>&nbsp;</p><p><strong>Committee:&nbsp;</strong></p><p>Dr. Danfei Xu (Advisor) – School of Interactive Computing, Georgia Institute of Technology</p><p>Dr. Sehoon Ha – School of Interactive Computing, Georgia Institute of Technology</p><p>Dr. Harish Ravichandar – School of Interactive Computing, Georgia Institute of Technology</p><p>&nbsp;</p><p><strong>Abstract</strong></p><p>Robots must acquire versatile and generalizable skills to operate effectively in complex, unstructured environments. This thesis investigates how to enable such capabilities through two complementary directions: (1) structural skill representations for scalable and generalizable learning, and (2) data generation and co-training frameworks for learning reactive, deployable policies. The first part develops representations that embed structural inductive biases to support efficient skill acquisition and reuse. LEAGUE, which progressively learns neuro-symbolic reinforcement learning policies within task and motion planning systems, facilitates compositional skill learning over long horizons and earned a Best Paper Honorable Mention at IEEE RA-L. NOD-TAMP enables one-shot skill adaptation across novel geometries and configurations through learned neural object descriptors. The second part introduces robotic data generation and policy distillation frameworks for large-scale, cross-domain learning. OT-Sim2Real leverages the learned skill representations to synthesize diverse simulation demonstrations and employs a co-training strategy that improves policy generalization beyond real-world data coverage—bridging the gap between simulation and deployment.</p><p>&nbsp;</p><p><strong>Proposed Work</strong>: Building on these foundations, future research will investigate how large-scale human videos can be leveraged to learn composable motion priors that enable scalable learning of robot manipulation systems that generalize across diverse tasks and embodiments. A factor graph representation is proposed to decompose the manipulation process into structured subcomponents, facilitating efficient learning of motion samplers from human videos. In addition, a diffusion-based steering strategy will be explored to guide the sampling process toward trajectories that satisfy task-specific constraints effectively and efficiently.</p>]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Robot Skill Representations for Long-Horizon Task Learning and Deployment]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p><strong>Robot Skill Representations for Long-Horizon Task Learning and Deployment</strong></p>]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2025-11-17T11:00:00-05:00]]></value>
      <value2><![CDATA[2025-11-17T12:30:42-05:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[Klaus 1212]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>221981</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[Graduate Studies]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1788</tid>
        <value><![CDATA[Other/Miscellaneous]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>102851</tid>
        <value><![CDATA[Phd proposal]]></value>
      </item>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
