SCS Recruiting Seminar: Yuanzhi Li

TITLE: Towards Deeper Understandings of Deep Learning

ABSTRACT:

Recent breakthroughs in machine learning often involve learning highly non-convex models, especially deep neural networks. Although many empirical studies have demonstrated the success of these methods, the formal study of the principles behind them is far less established.

This talk will present several recent results toward developing such principles. In particular, we focus on over-parameterized neural networks for multi-class classification. We will show that stochastic gradient descent (SGD) on over-parameterized deep neural networks provably finds a global minimum of the training objective. Moreover, we prove that this perfect fit on the training data generalizes to the test set when the labels are generated by certain teacher networks.
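
As a concrete illustration (not from the talk itself), here is a minimal PyTorch sketch of the teacher-student setting described above; the network widths, learning rate, and step count are illustrative assumptions:

    import torch
    import torch.nn as nn

    torch.manual_seed(0)

    # A fixed "teacher" network generates the multi-class labels.
    teacher = nn.Sequential(nn.Linear(10, 16), nn.ReLU(), nn.Linear(16, 3))
    X = torch.randn(512, 10)
    with torch.no_grad():
        y = teacher(X).argmax(dim=1)

    # An over-parameterized "student", far wider than the teacher.
    student = nn.Sequential(nn.Linear(10, 4096), nn.ReLU(), nn.Linear(4096, 3))
    opt = torch.optim.SGD(student.parameters(), lr=0.1)
    loss_fn = nn.CrossEntropyLoss()

    # Plain SGD drives the training loss toward zero, i.e. it fits the
    # over-parameterized network to the teacher-generated labels.
    for step in range(2000):
        opt.zero_grad()
        loss = loss_fn(student(X), y)
        loss.backward()
        opt.step()
    print(loss.item())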

This talk will also show how the above results serve as a step toward establishing the theory behind the “magic” of learning rate decay in training neural networks, as well as how the identity mapping in ResNet aids the learning process.
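
For readers unfamiliar with the two ingredients mentioned above, the following sketch (again illustrative, not the talk's construction) combines a ResNet-style identity mapping with a standard step-decay learning rate schedule in PyTorch; the dimensions and schedule parameters are assumptions:

    import torch
    import torch.nn as nn

    class ResidualBlock(nn.Module):
        # The block's output is x + f(x); the "x +" term is the identity mapping.
        def __init__(self, dim):
            super().__init__()
            self.f = nn.Sequential(nn.Linear(dim, dim), nn.ReLU(), nn.Linear(dim, dim))

        def forward(self, x):
            return x + self.f(x)  # identity skip connection

    model = nn.Sequential(nn.Linear(10, 64), ResidualBlock(64), nn.Linear(64, 3))
    opt = torch.optim.SGD(model.parameters(), lr=0.1)
    # Learning rate decay: multiply the learning rate by 0.1 every 30 epochs.
    sched = torch.optim.lr_scheduler.StepLR(opt, step_size=30, gamma=0.1)

    X, y = torch.randn(64, 10), torch.randint(0, 3, (64,))
    for epoch in range(90):
        opt.zero_grad()
        nn.functional.cross_entropy(model(X), y).backward()
        opt.step()
        sched.step()  # decays the learning rate at epochs 30 and 60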

BIO:

Yuanzhi Li is a postdoctoral researcher in the Computer Science Department at Stanford University. Previously, he obtained his Ph.D. from Princeton University, advised by Sanjeev Arora. His research interests include deep learning, non-convex optimization, and online learning.
