<node id="653191">
  <nid>653191</nid>
  <type>event</type>
  <uid>
    <user id="27707"><![CDATA[27707]]></user>
  </uid>
  <created>1638216750</created>
  <changed>1638216750</changed>
  <title><![CDATA[PhD Proposal by Sana Damani]]></title>
  <body><![CDATA[<p><strong>Title: </strong>Instruction Reordering and Work Scheduling for Thread-Parallel Architectures</p>

<p>&nbsp;</p>

<p><strong>Sana Damani</strong></p>

<p>Ph.D. Student</p>

<p>School of Computer Science&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</p>

<p>Georgia Institute of Technology</p>

<p>&nbsp;</p>

<p><strong>Date:</strong> Wednesday, December 1, 2021</p>

<p><strong>Time: </strong>11:00 AM - 1:00 PM EST</p>

<p><strong>Location</strong>(Remote via BlueJeans): <a href="https://gatech.bluejeans.com/sdamani6">https://gatech.bluejeans.com/sdamani6</a></p>

<p>&nbsp;</p>

<p><strong>Committee:</strong></p>

<p>Dr. Vivek Sarkar (Advisor), School of Computer Science, Georgia Institute of Technology</p>

<p>Dr. Hyesoon Kim, School of Computer Science, Georgia Institute of Technology</p>

<p>Dr. Tom Conte, School of Computer Science, Georgia Institute of Technology</p>

<p>Dr. Santosh Pande, School of Computer Science, Georgia Institute of Technology</p>

<p>&nbsp;</p>

<p><strong>Abstract:</strong></p>

<p>While accelerators such as GPUs and near-memory processors show significant performance improvements for applications with high data parallelism and regular memory accesses, they experience synchronization and memory access overheads in applications with irregular control flow and memory access patterns resulting in reduced efficiency. Examples include graph applications, Monte Carlo simulations, ray tracing applications, and sparse matrix computations. This proposal aims at identifying inefficiencies in executing irregular programs on thread-parallel architectures, and recommends compiler transformations and architecture enhancements to address these inefficiencies. In particular, we describe instruction reordering and thread scheduling techniques that avoid serialization, reduce pipeline stalls and minimize redundant thread migrations, thereby reducing overall program latency and improving processor utilization.</p>

<p>&nbsp;</p>

<p>Contributions:</p>

<ol>
	<li>Common Subexpression Convergence, a compiler transformation that identifies and removes redundant code in divergent regions of GPU programs.</li>
	<li>Speculative Reconvergence, a compiler transformation that identifies new thread reconvergence points in divergent GPU programs to improve SIMT efficiency.</li>
	<li>Subwarp Interleaving, an architecture feature that schedules threads at a subwarp granularity on GPUs to reduce pipeline stalls in divergent regions of the program.</li>
	<li>Memory Access Scheduling, a software instruction scheduling approach that groups together co-located memory accesses to minimize thread migrations on migratory-thread architectures.</li>
</ol>
]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Instruction Reordering and Work Scheduling for Thread-Parallel Architectures]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2021-12-01T13:00:00-05:00]]></value>
      <value2><![CDATA[2021-12-01T15:00:00-05:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Faculty/Staff]]></value>
      </item>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
          <item>
        <value><![CDATA[Undergraduate students]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[https://gatech.bluejeans.com/sdamani6]]></url>
      <title><![CDATA[Bluejeans]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>221981</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[Graduate Studies]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1788</tid>
        <value><![CDATA[Other/Miscellaneous]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>102851</tid>
        <value><![CDATA[Phd proposal]]></value>
      </item>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
