<node id="688742">
  <nid>688742</nid>
  <type>event</type>
  <uid>
    <user id="27707"><![CDATA[27707]]></user>
  </uid>
  <created>1772722673</created>
  <changed>1772722711</changed>
  <title><![CDATA[PhD Defense by Amey Agrawal]]></title>
  <body><![CDATA[<p>&nbsp;</p><p><strong>Title:&nbsp;</strong>Towards Efficient and Predictable Large-Scale AI Systems</p><p><strong>Date:</strong> Friday, March 20, 2026 Time: 2:00 PM – 4:00 PM EST</p><p><strong>Location:</strong> Klaus Advanced Computing Building (KACB), Room 3100.</p><p>&nbsp;</p><p><strong>Candidate:</strong></p><p>Amey Agrawal, School of Computer Science, Georgia Tech</p><p>&nbsp;</p><p><strong>Committee:</strong></p><p>Dr. Alexey Tumanov (Advisor &amp; Chair), School of Computer Science, Georgia Tech</p><p>Dr. Vijay Ganesh, School of Computer Science, Georgia Tech</p><p>Dr. Tushar Krishna, School of Electrical and Computer Engineering, Georgia Tech</p><p>Dr. Ram Ramjee, Partner Research Manager, Microsoft Research</p><p>Dr. Srinivas Sridharan, Distinguished Engineer, NVIDIA</p><p>&nbsp;</p><p><strong>Abstract:</strong> Serving large AI models efficiently while guaranteeing low and predictable latency is the central systems challenge in deploying modern AI. This thesis addresses this challenge through two complementary thrusts. First, we build inference systems that maximize hardware utilization by exploiting the unique properties of these workloads — resolving latency-throughput tradeoffs, scaling to multi-million token contexts, optimizing across memory hierarchies, and enabling efficient data movement across distributed components. Second, we develop deployment optimization systems that identify optimal configurations by jointly reasoning about model architecture, hardware capabilities, workload characteristics, and user requirements for cost and latency. Together, these contributions achieve multi-fold improvements in serving capacity and latency, with core techniques adopted by major open-source inference frameworks serving millions of GPU-hours weekly.</p><p>&nbsp;</p>]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Towards Efficient and Predictable Large-Scale AI Systems]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p>Towards Efficient and Predictable Large-Scale AI Systems</p>]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2026-03-20T14:00:00-04:00]]></value>
      <value2><![CDATA[2026-03-20T16:00:00-04:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[Klaus Advanced Computing Building (KACB), Room 3100]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>221981</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[Graduate Studies]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1788</tid>
        <value><![CDATA[Other/Miscellaneous]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>100811</tid>
        <value><![CDATA[Phd Defense]]></value>
      </item>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
