event
Center for Signal and Information Processing Seminar
Primary tabs
Date: Friday, Oct. 29, 2021
Time: 3:00 pm to 4:00 pm
BlueJeans Link: https://bluejeans.com/4658604304
Speaker: Jinyu Li
Affiliation: Microsoft Corporation, Redmond, Washington
Title: Advances in end-to-end automatic speech recognition
Bio: Jinyu Li received a Ph.D. degree from the Georgia Institute of Technology, Atlanta, in 2008. Currently, he is a Partner Applied Scientist and Technical Lead in Microsoft Corporation, Redmond, Washington, USA. He leads a team to design and improve speech modeling algorithms and technologies that ensure industry state-of-the-art speech recognition accuracy for Microsoft. His major research interests cover several topics in speech recognition, including end-to-end modeling, deep learning, noise robustness, etc. He is the leading author of the book "Robust Automatic Speech Recognition -- A Bridge to Practical Applications", Academic Press, Oct, 2015. He is the member of IEEE Speech and Language Processing Technical Committee since 2017. He also served as the associate editor of IEEE/ACM Transactions on Audio, Speech, and Language Processing from 2015 to 2020.
Abstract: Recently, the speech community is seeing a significant trend of moving from deep neural network-based hybrid modeling to end-to-end (E2E) modeling for automatic speech recognition (ASR). While E2E models achieve the state-of-the-art results in most benchmarks in terms of ASR accuracy, hybrid models are still used in a large proportion of commercial ASR systems at the current time. There are lots of practical factors that affect the production model deployment decision. Traditional hybrid models, been optimized for production for decades, are usually good at these factors. Without providing excellent solutions to all these factors, it is hard for E2E models to be widely commercialized. In this talk, I will overview the recent advances in E2E models with the focus on technologies addressing those challenges from the perspective of the industry.
Status
- Workflow status: Published
- Created by: Jackie Nemeth
- Created: 10/20/2021
- Modified By: Jackie Nemeth
- Modified: 10/20/2021
Categories
Keywords
Target Audience