event
PhD Proposal by Yunhai Han
Primary tabs
Title: Towards Efficient and Self-Sufficient Learning of In-Domain Dexterous Manipulation Skills from Limited Supervision
Date: Thursday, April 2nd, 2026
Time: 3:30PM to 5PM ET
Location: 1212 Conference Room Klaus or Zoom Link
Yunhai Han
Robotics Ph.D. Student
Woodruff School of Mechanical Engineering
Georgia Institute of Technology
Committee:
Dr. Harish Ravichandar (advisor) – School of Interactive Computing, Georgia Institute of Technology
Dr. Zsolt Kira – School of Interactive Computing, Georgia Institute of Technology
Dr. Danfei Xu – School of Interactive Computing, Georgia Institute of Technology
Dr. Yunzhu Li – Department of Computer Science, Columbia University
Dr. Dinesh Jayaraman – Department of Computer and Information Science, University of Pennsylvania
Abstract:
Recent advances in large-scale robot learning, that build upon successes from adjacent domains (e.g., Natural Language Processing and Computing Vision), have reinforced the view that scaling is the primary approach to achieving effective dexterous robotic manipulation. However, despite extensive efforts in massive data collection and pretrained foundation models for robotics, robot manipulation systems remain largely confined to settings that rely on bespoke demonstrations and well-resourced laboratory infrastructure.
A key reason is the critical need for high-quality, in-domain robot data and expert-designed training recipes to fine-tune or distill large pretrained models into effective manipulation policies. Consequently, this reliance on curated datasets, rich supervision, and expert intervention during training makes it challenging to adapt robot policies across the diverse settings encountered in domains such as manufacturing and consumer robotics. Moreover, the lack of interpretability in these distilled policies offers little insight into their brittleness and idiosyncratic failure modes.
In this work, we propose a learning framework for robust robot dexterity by efficiently learning transparent robot policies from limited supervision while minimizing task-specific expert intervention. To achieve this, the learning system should: i) employ a transparent, task-agnostic policy architecture, ii) enable efficient, self-sufficient training and fine-tuning strategies, iii) learn from accessible in-the-wild data, and iv) integrate general-purpose reasoning modules with mechanisms that support reliable in-domain execution. In this way, our framework is complementary to general-purpose reasoning models but offers a principled alternative to monolithic generalist robot policies by learning lightweight specialists for specific dexterous skills in the target environment without significant reliance on expert-intervention and data curation. Our primary design objectives are to better support real-world diversity in tasks, objects, and environments by learning specialist policies tailored to the requirements of a particular setting.
Groups
Status
- Workflow status: Published
- Created by: Tatianna Richardson
- Created: 03/23/2026
- Modified By: Tatianna Richardson
- Modified: 03/23/2026
Categories
Keywords
User Data
Target Audience