<node id="688451">
  <nid>688451</nid>
  <type>event</type>
  <uid>
    <user id="36319"><![CDATA[36319]]></user>
  </uid>
  <created>1771612558</created>
  <changed>1771613966</changed>
  <title><![CDATA[School of CSE Seminar Series: Manling Li]]></title>
  <body><![CDATA[<p><strong>Speaker:</strong>&nbsp;Manling Li, assistant professor at Northwestern University<br><strong>Date and Time:</strong>&nbsp;February 24, 11:00 a.m. - 12:00 p.m.<br><strong>Location:</strong>&nbsp;Coda 114<br><strong>Host:</strong>&nbsp;Bo Dai</p><p><strong>Title:</strong>&nbsp;<em>Toward Foundation Agents: How Multimodal Models Learn (and Fail to Learn) the Physical World</em></p><p><strong>Abstract:</strong> Today’s multimodal models are often trained with a brute-force “align everything” recipe, yet it is still unclear how cross-modal intelligence can emerge. We argue the key question is mechanistic: how can models go beyond static alignment annotations to learn from physical-world interaction and support goal-directed decision making? We systematically study multimodal learning through the MDP agent loop: state estimation, world modeling for planning, and control for safety. First, we open up the black box, and intervene inside embeddings to reveal how geometry is lost, and design ways to retain geometric structure. Second, we inject world-model priors to teach dynamics through RAGEN/VAGEN, enabling multi-step planning rather than token matching. Third, we introduce ODE-Steer for safe agents, which steers internal activations into “safe zones” where reasoning stays reliable and controllable. Lastly, we lay out the future that true multimodal intelligence requires more than aligning tokens; it requires aligning the internal mechanisms of the model with the geometry of the world.</p><p><strong>Bio:</strong> Manling Li is an Assistant Professor at Northwestern University and an Amazon Scholar. She was a postdoc at Stanford University, and obtained Ph.D. degree in Computer Science at University of Illinois Urbana-Champaign in 2023. She works on Reasoning, Planning and Compositionality, in the intersection of Language, Vision, and Robotics. Her work has been recognized as ACL 2025 Inaugural Dissertation Award Honorable Mention, MIT Tech Review Innovators Under 35, ACL’24 Outstanding Paper Award, NAACL'21 Best Demo Paper Award, ACL'20 Best Demo Paper Award, Microsoft Research PhD Fellowship, EE CS Rising Star, etc. She served as virtual chairs of ACL 25, publication chairs at NAACL 25, demo chairs at EMNLP 24, etc. Additional information is available at <a href="https://limanling.github.io/">https://limanling.github.io/</a>.</p>]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[School of CSE hosts a seminar from Northwestern University Assistant Professor Manling Li]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p><strong>Speaker:</strong>&nbsp;Manling Li, assistant professor at Northwestern University<br><strong>Date and Time:</strong>&nbsp;February 24, 11:00 a.m. - 12:00 p.m.<br><strong>Location:</strong>&nbsp;Coda 114<br><strong>Host:</strong>&nbsp;Bo Dai</p><p><strong>Title:</strong>&nbsp;<em>Toward Foundation Agents: How Multimodal Models Learn (and Fail to Learn) the Physical World</em></p>]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2026-02-24T11:00:00-05:00]]></value>
      <value2><![CDATA[2026-02-24T12:00:00-05:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Faculty/Staff]]></value>
      </item>
          <item>
        <value><![CDATA[Postdoc]]></value>
      </item>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
          <item>
        <value><![CDATA[Graduate students]]></value>
      </item>
          <item>
        <value><![CDATA[Undergraduate students]]></value>
      </item>
      </field_audience>
  <field_media>
          <item>
        <nid>
          <node id="679384">
            <nid>679384</nid>
            <type>image</type>
            <title><![CDATA[Manling-Li.jpg]]></title>
            <body><![CDATA[]]></body>
                          <field_image>
                <item>
                  <fid>263538</fid>
                  <filename><![CDATA[Manling-Li.jpg]]></filename>
                  <filepath><![CDATA[/sites/default/files/2026/02/20/Manling-Li.jpg]]></filepath>
                  <file_full_path><![CDATA[http://hg.gatech.edu//sites/default/files/2026/02/20/Manling-Li.jpg]]></file_full_path>
                  <filemime>image/jpeg</filemime>
                  <image_740><![CDATA[]]></image_740>
                  <image_alt><![CDATA[CSE Seminar Manling Li]]></image_alt>
                </item>
              </field_image>
            
                      </node>
        </nid>
      </item>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[<p>Sophie McGivern &nbsp;<br>smcgivern3@gatech.edu</p>]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[Coda, Room 114]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>47223</item>
          <item>50877</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[College of Computing]]></item>
          <item><![CDATA[School of Computational Science and Engineering]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1795</tid>
        <value><![CDATA[Seminar/Lecture/Colloquium]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>166983</tid>
        <value><![CDATA[School of Computational Science and Engineering]]></value>
      </item>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
