<node id="673995">
  <nid>673995</nid>
  <type>event</type>
  <uid>
    <user id="27707"><![CDATA[27707]]></user>
  </uid>
  <created>1712245212</created>
  <changed>1712245248</changed>
  <title><![CDATA[PhD Defense by Daniel Bolya]]></title>
  <body><![CDATA[<p><span><span><span><strong><span><span><span><span>Title: </span></span></span></span></strong><span><span><span><span>Less is More: Accelerating Vision by Eliminating Redundancy</span></span></span></span></span></span></span></p>

<p>&nbsp;</p>

<p><span><span><span><strong><span><span><span>Date: </span></span></span></strong><span><span><span>April 12th, 2024</span></span></span></span></span></span></p>

<p><span><span><span><strong><span><span><span>Time: </span></span></span></strong><span><span><span>4:00pm - 5:30pm ET</span></span></span></span></span></span></p>

<p><span><span><span><strong><span><span><span>Location:</span></span></span></strong><span><span><span>&nbsp;CODA C1115 Druid Hills</span></span></span></span></span></span></p>

<p><span><span><span><strong><span><span><span>Zoom Link</span></span></span></strong><span><span><span>: </span></span></span><span><span><span><span><a href="https://gatech.zoom.us/j/96608837820?pwd=cGtSOXZMaHRVL0g0ZGN2aE9QeTNaZz09">https://gatech.zoom.us/j/96608837820?pwd=cGtSOXZMaHRVL0g0ZGN2aE9QeTNaZz09</a></span></span></span></span></span></span></span></p>

<p>&nbsp;</p>

<p><span><span><span><strong><span><span><span>Daniel Bolya</span></span></span></strong></span></span></span></p>

<p><span><span><span><span><span><span>Machine Learning PhD Student</span></span></span></span></span></span></p>

<p><span><span><span><span><span><span>Interactive Computing<br />
Georgia Institute of Technology</span></span></span></span></span></span></p>

<p>&nbsp;</p>

<p><span><span><span><strong><span><span><span>Committee</span></span></span></strong></span></span></span></p>

<p><span><span><span><span><span><span>1 Judy Hoffman (Advisor, IC, GT)</span></span></span></span></span></span></p>

<p><span><span><span><span><span><span>2 James Hays (IC, GT)</span></span></span></span></span></span></p>

<p><span><span><span><span><span><span>3 Zsolt Kira (IC, GT)</span></span></span></span></span></span></p>

<p><span><span><span><span><span><span>4 Dhruv Batra (IC, GT)</span></span></span></span></span></span></p>

<p><span><span><span><span><span><span>5 Christoph Feichtenhofer (FAIR, Meta)</span></span></span></span></span></span></p>

<p>&nbsp;</p>

<p><span><span><span><strong><span><span><span>Abstract</span></span></span></strong></span></span></span></p>

<p><span><span><span><span><span>The massive models that power today’s state-of-the-art in computer vision require trillions of floating-point operations to compute. But how much of these operations do we really need? Given how well techniques like pruning or quantization work, it’s clear that a lot of this computation is redundant. My work focuses on speeding up vision models by reducing redundancy with simple but powerful techniques. In this thesis defense, I’ll give a brief overview of all of my work and then hone in on discussing Token Merging to merge redundant tokens in vision transformers for classification and diffusion and Hiera, which removes redundant modules in modern vision architectures by explicitly teaching spatial bias. Then, I'll show that you can combine these and other approaches for a multiplicative effect (for e.g., ~10x speed-up on video).</span></span></span></span></span></p>

<p>&nbsp;</p>
]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Less is More: Accelerating Vision by Eliminating Redundancy]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p><span><span><span><span>Less is More: Accelerating Vision by Eliminating Redundancy</span></span></span></span></p>
]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2024-04-12T16:00:00-04:00]]></value>
      <value2><![CDATA[2024-04-12T17:30:00-04:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[CODA C1115 Druid Hills]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>221981</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[Graduate Studies]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1788</tid>
        <value><![CDATA[Other/Miscellaneous]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>100811</tid>
        <value><![CDATA[Phd Defense]]></value>
      </item>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
