PhD Defense by Nam Vo

Title: Image Retrieval and Geolocalization with Deep Learning

Nam Vo
Ph.D. Student

School of Interactive Computing
College of Computing
Georgia Institute of Technology

Date: Tuesday, Dec 11th, 2018
Time: 10:00 AM to 12:00 PM (EST)
Location: TBA, College of Computing Building

Committee:

---------------

Dr. James Hays (Advisor), School of Interactive Computing, Georgia Institute of Technology

Dr. Irfan Essa, School of Interactive Computing, Georgia Institute of Technology

Dr. James Rehg, School of Interactive Computing, Georgia Institute of Technology

Dr. Nathan Jacobs, Department of Computer Science, University of Kentucky

Dr. Aaron Bobick, School of Engineering and Applied Science, Washington University in St. Louis

Summary:

---------------

In this thesis, I study the image localization task and explore an image ranking/retrieval approach. Deep learning has advanced many computer vision tasks, including image retrieval; in addition, location-tagged image data has become increasingly abundant.

Our first contribution is a study of image geolocalization at planet scale (Im2GPS: predicting GPS coordinates from image data) comparing two deep learning approaches: image classification and image retrieval. We analyze the trade-off between localization accuracy at different granularity levels. The image retrieval approach has a great advantage for geolocalization at fine levels (street, city) and is still competitive at coarse levels (country, continent).

Next, we investigate different architectures for matching and retrieving cross-view images. The application is localization using an image retrieval approach where the query images are normal street-view images, but the reference images in the database are taken from an overhead viewpoint (satellite images).

Our third contribution is an exploration of state-of-the-art Deep Metric Learning (DML) techniques in image retrieval. We first look at them in the context of fine-grained image retrieval, which is well studied in the literature, and analyze generalization performance when switching the embedding layer. Lastly, we apply DML techniques to training deep networks for image retrieval and the Im2GPS geolocalization task. Our experiments show that DML-trained systems outperform a classification-trained system as feature extractors, resulting in better image retrieval and geolocalization performance.

Groups

Graduate Studies

Status

Workflow status: Published
Created by: Tatianna Richardson
Created: 12/05/2018
Modified By: Tatianna Richardson
Modified: 12/05/2018
