event
PhD Proposal by Andrew Yarovoi
Primary tabs
Title: Generalized 3D Point Cloud Understanding via Cross-Domain Multi-Dataset Learning.
Date: January 22, 2026
Time: 1pm EST
Location: Van Leer 218
Virtual Link: Zoom
Meeting ID: 968 3730 8246
Passcode: 650576
Andrew Yarovoi
Ph.D. Student, Robotics
George W. Woodruff School of Mechanical Engineering
Georgia Institute of Technology
Committee:
Dr. Christopher R. Valenta (Advisor)
School of Electrical and Computer Engineering
Georgia Institute of Technology
Dr. Lu Gan
Guggenheim School of Aerospace Engineering
Georgia Institute of Technology
Dr. James Hays
School of Interactive Computing
Georgia Institute of Technology
Dr. Zsolt Kira
School of Interactive Computing
Georgia Institute of Technology
Dr. Patricio A. Vela
School of Electrical and Computer Engineering
Georgia Institute of Technology
Abstract:
The proposed research aims to advance 3D point cloud semantic understand by enabling effective multi-dataset training (MDT) across diverse domains and sensors. Current models trained on individual datasets often exhibit limited generalization due to restricted point cloud dataset sizes and sensor-specific biases. MDT offers a solution by providing more diverse training data from a variety of sensors and domains. However, implementing MDT for point cloud data remains challenging due to unresolved issues such as heterogeneous input features, insufficient contextual information, significant variations in scale and density, and inconsistent labeling across datasets. The proposed work aims to systematically address these challenges through architectural design. First, a modular training framework will be developed to accommodate heterogeneous sensor inputs and align synonymous class labels through an open-vocabulary approach. Second, novel expansion and contraction network blocks will be designed to efficiently generate multi-resolution features, enhancing scale and density invariance. Third, a new backbone model, termed UniMD3D, will integrate these modules and implement a unified label alignment mechanism to resolve class partitioning inconsistencies across datasets. The proposed research is expected to enable effective learning while training on mixed datasets spanning multiple domains, tasks, and sensors, leading to more generalized 3D perception systems for real-world applications.
Groups
Status
- Workflow status: Published
- Created by: Tatianna Richardson
- Created: 01/13/2026
- Modified By: Tatianna Richardson
- Modified: 01/13/2026
Categories
Keywords
Target Audience