Ph.D. Dissertation Defense - Zheyuan Wang

Title: Learning Dynamic Priority Scheduling Policies with Graph Attention Networks

Committee:

Dr. Matthew Gombolay, IC, Chair, Advisor

Dr. Matthieu Bloch, ECE, Co-Advisor

Dr. Sonia Chernova, IC

Dr. Magnus Egerstedt, ECE

Dr. Harish Ravichandar, IC

Dr. Elias Khalil, U of Toronto

Abstract: The aim of this thesis is to develop novel graph attention network-based models to automatically learn scheduling policies for effectively solving resource optimization problems, covering both deterministic and stochastic environments. The policy learning methods utilize both imitation learning, when expert demonstrations are accessible at low cost, and reinforcement learning, when otherwise reward engineering is feasible. By parameterizing the learner with graph attention networks, the framework is computationally efficient and results in scalable resource optimization schedulers that adapt to various problem structures. This thesis addresses the problem of multi-robot task allocation (MRTA) under temporospatial constraints. Initially, robots with deterministic and homogeneous task performance are considered with the development of the RoboGNN scheduler. Then, I develop ScheduleNet, a novel heterogeneous graph attention network model, to efficiently reason about coordinating teams of heterogeneous robots. Next, I address problems under the more challenging stochastic setting in two parts. Part 1) Scheduling with stochastic and dynamic task completion times. The MRTA problem is extended by introducing human coworkers with dynamic learning curves and stochastic task execution. HybridNet, a hybrid network structure, has been developed that utilizes a heterogeneous graph-based encoder and a recurrent schedule propagator, to carry out fast schedule generation in multi-round settings. Part 2) Scheduling with stochastic and dynamic task arrival and completion times. With an application in failure-predictive plane maintenance, I develop a heterogeneous graph-based policy optimization (HetGPO) approach to enable learning robust scheduling policies in highly stochastic environments. Through extensive experiments, the proposed framework has been shown to outperform prior state-of-the-art algorithms in different applications. My research contributes several key innovations regarding designing graph-based learning algorithms in operations research.

Media

No media selected

Summary

Details

Wednesday

Dec 7 2022

09:00am - 11:00am

In campus calendar: No

Sidebar Content

No sidebar content

Groups

ECE Ph.D. Dissertation Defenses

Status

Workflow Status:Published
Created By:Daniela Staiculescu
Created:11/23/2022
Modified By:Daniela Staiculescu
Modified:11/23/2022

Mercury (Hg)

Ph.D. Dissertation Defense - Zheyuan Wang

Log in

Georgia Institute of Technology

Ph.D. Dissertation Defense - Zheyuan Wang

Primary tabs

Log in

Georgia Institute of Technology