PhD Defense by Ruobing Han

Title: Compiler-Runtime Co-Design for Performance-Portable GPU Programming on CPUs

Date: Tuesday, April 22, 2025

Time: 9:00 AM - 11:00 AM EDT

Location:

In-Person: Klaus 3126
Online: Microsoft Teams

Ruobing Han

Ph.D. Candidate

School of Computer Science

College of Computing

Georgia Institute of Technology

Committee:

Dr. Hyesoon Kim (advisor), School of Computer Science, Georgia Institute of Technology

Dr. Santosh Pande, School of Computer Science, Georgia Institute of Technology

Dr. Yingyan (Celine) Lin, School of Computer Science, Georgia Institute of Technology

Dr. Richard (Rich) Vuduc, School of Computational Science and Engineering, Georgia Institute of Technology

Dr. Jaewoong Sim, Department of Electrical and Computer Engineering, Seoul National University

Dr. Zongwei Zhou, Google Cloud, Google

Abstract

Most commercial and research data centers are CPU-GPU heterogeneous systems, offering both CPUs and GPUs to users. As a great number of advanced AI and HPC applications are implemented for GPUs, CPUs—despite providing substantial computational resources—are consistently underutilized. It is common for users to wait hours for GPUs to become available, even as many CPUs remain idle. This raises an important question: can CPUs be effectively utilized to run GPU applications?

To address the challenge of GPU-to-CPU migration, my dissertation focuses on two key aspects: coverage and performance. First, I introduce two projects, COX and CuPBoP, which provide broad support for correctly migrating various existing GPU programs to CPUs. We implement frameworks based on compiler and runtime co-design that support executing off-the-shelf GPU programs on CPUs without requiring manual pre- or post-processing. Compared to existing solutions, our frameworks achieve the highest coverage.

Second, I present a project that significantly enhances a single CPU’s ability to execute transformed GPU programs. We summarize the challenges of executing transformed GPU programs on CPUs and propose four compiler and runtime optimizations accordingly. Lastly, I introduce CuCC, a framework that extends CPU migration to distributed nodes, enabling the execution of GPU programs on CPU clusters. CuCC is the first solution to support GPU-to-CPU-cluster migration.

Media

No media selected

Summary

Compiler-Runtime Co-Design for Performance-Portable GPU Programming on CPUs

Details

Tuesday

Apr 22 2025

09:00am - 11:00am

Location: • In-Person: Klaus 3126

In campus calendar: No

Sidebar Content

No sidebar content

Groups

Graduate Studies

Status

Workflow Status:Published
Created By:Tatianna Richardson
Created:04/09/2025
Modified By:Tatianna Richardson
Modified:04/09/2025

Mercury (Hg)