event

PhD Proposal by Yunhai Han

Primary tabs

Title: Towards Efficient and Self-Sufficient Learning of In-Domain Dexterous Manipulation Skills from Limited Supervision

 

Date: Thursday, April 2nd, 2026

Time: 3:30PM to 5PM ET

Location: 1212 Conference Room Klaus or Zoom Link

 

Yunhai Han

Robotics Ph.D. Student

Woodruff School of Mechanical Engineering

Georgia Institute of Technology

 

Committee:

Dr. Harish Ravichandar (advisor) – School of Interactive Computing, Georgia Institute of Technology

Dr. Zsolt Kira – School of Interactive Computing, Georgia Institute of Technology

Dr. Danfei Xu –  School of Interactive Computing, Georgia Institute of Technology

Dr. Yunzhu Li – Department of Computer Science, Columbia University

Dr. Dinesh Jayaraman  – Department of Computer and Information Science, University of Pennsylvania

 

Abstract:

Recent advances in large-scale robot learning, that build upon successes from adjacent domains (e.g., Natural Language Processing and Computing Vision), have reinforced the view that scaling is the primary approach to achieving effective dexterous robotic manipulation. However, despite extensive efforts in massive data collection and pretrained foundation models for robotics, robot manipulation systems remain largely confined to settings that rely on bespoke demonstrations and well-resourced laboratory infrastructure. 

A key reason is the critical need for high-quality, in-domain robot data and expert-designed training recipes to fine-tune or distill large pretrained models into effective manipulation policies. Consequently, this reliance on curated datasets, rich supervision, and expert intervention during training makes it challenging to adapt robot policies across the diverse settings encountered in domains such as manufacturing and consumer robotics. Moreover, the lack of interpretability in these distilled policies offers little insight into their brittleness and idiosyncratic failure modes.

 

In this work, we propose a learning framework for robust robot dexterity by efficiently learning transparent robot policies from limited supervision while minimizing task-specific expert intervention. To achieve this, the learning system should: i) employ a transparent, task-agnostic policy architecture, ii) enable efficient, self-sufficient training and fine-tuning strategies, iii) learn from accessible in-the-wild data, and iv) integrate general-purpose reasoning modules with mechanisms that support reliable in-domain execution. In this way, our framework is complementary to general-purpose reasoning models but offers a principled alternative to monolithic generalist robot policies by learning lightweight specialists for specific dexterous skills in the target environment without significant reliance on expert-intervention and data curation. Our primary design objectives are to better support real-world diversity in tasks, objects, and environments by learning specialist policies tailored to the requirements of a particular setting.

Status

  • Workflow status: Published
  • Created by: Tatianna Richardson
  • Created: 03/23/2026
  • Modified By: Tatianna Richardson
  • Modified: 03/23/2026

Categories

Keywords

User Data

Target Audience