PhD Defense by Austin Xu

Title: Learning with and without human feedback

Date and time: April 15, 12:30-2pm

Zoom link: https://gatech.zoom.us/j/9026260477?omn=93570001864

Committee

Dr. Mark Davenport (Advisor)

Dr. Christopher Rozell

Dr. Ashwin Pananjady

Dr. Justin Romberg

Dr. Zsolt Kira

Abstract

Labels and feedback provided by humans play a central role in training contemporary machine learning models, offering models ground truth annotations from which to extract patterns. However, collecting such feedback from humans is a challenging and time-consuming task. As a result, practitioners must be intentional both in how they choose to query humans for feedback and in the problem settings for which they request feedback. This thesis explores learning from human feedback along two fundamental directions. The first part of the thesis focuses on how we can more effectively learn from human feedback from a mathematically grounded perspective. We first consider how to leverage paired comparisons, a simple mechanism for human feedback, for learning rich models of human preference. We then propose a new mechanism for collecting human feedback aimed at balancing informativeness and cognitive burden. The second part of the thesis focuses on how we can leverage pretrained models to avoid collecting additional human feedback. We consider two specific application settings: retrieval and synthetic dataset generation, and show that existing tools, such as large language models or image editing models, can be used to remove the need for collecting human feedback.

Media

No media selected

Summary

Learning with and without human feedback

Details

Monday

Apr 15 2024

12:30pm - 02:30pm

Location: ZOOM

In campus calendar: No

Sidebar Content

No sidebar content

Groups

Graduate Studies

Status

Workflow Status:Published
Created By:Tatianna Richardson
Created:04/08/2024
Modified By:Tatianna Richardson
Modified:04/08/2024

Mercury (Hg)

PhD Defense by Austin Xu

Log in

Georgia Institute of Technology

PhD Defense by Austin Xu

Primary tabs

Log in

Georgia Institute of Technology