event

CSE Seminar: Kevyn Collins-Thompson

Primary tabs

Kevyn Collins-Thompson
Researcher, Microsoft Research (Redmond)

For more information please contact Dr. Guy Lebanon at lebanon@cc.gatech.edu

 

"Robust algorithms for information retrieval: effective tradeoffs between risk and reward"

Abstract:

Current search engines must typically act under great uncertainty, attempting to interpret the intent of few keywords from the user to match against billions of documents to find a few relevant results. A typical Web search involves a series of operations based on the user's query, from automatic spelling correction and identifying common word variants to the actual document ranking. Such operations tend to have a risk-reward tradeoff depending on how they "bet" on different solution hypotheses. In this talk I discuss new theoretical models, algorithms, and evaluation methods for estimating and accounting for uncertainty in retrieval algorithms to achieve effective risk-reward tradeoffs.

A prime example of a high-risk, high-reward operation is automatic query reformulation that adds related terms to a query - a process known as query expansion.  Query expansion can significantly improve ranking quality on average, but even state-of-the-art methods are highly unreliable and can significantly hurt result quality for some queries, which is one reason for their limited deployment in real-world scenarios. I discuss how casting query expansion as a
constrained optimization problem over a word graph provides a selective, highly effective modeling framework that reduces the number and magnitude of expansion failures with no loss in the strong average-case gain of the underlying expansion algorithm. I also discuss applications of such optimization frameworks to other problems in information retrieval, such as providing diversity in ranking.

Bio:

Kevyn Collins-Thompson is a Researcher in the Context, Learning and User Experience for Search (CLUES) group at Microsoft Research. He completed his PhD (2008) in Computer Science at the Language Technologies Institute, Carnegie Mellon University, where his advisor was Jamie Callan.  His research focuses on theoretical models, algorithms, and evaluation methods for effective, reliable information retrieval. Other research interests include user models and personalization, modeling evolutional dynamics of text,  educational applications of search technology and machine learning, and understanding how the brain acquires language skills.  Kevyn also has more than ten years of industry experience as a software engineer and project manager, responsible for shipping advanced features in Office, Windows, Tablet PC, Encarta, and many other products. Visit his website here.

~~~~~~~~~~~~~~

Please join us for a reception preceding the seminar outside Klaus 1324, beginning at 1:30 pm
 
To receive future announcements, please sign up to the cse-seminar email list:

https://mailman.cc.gatech.edu/mailman/listinfo/cse-seminar

Status

  • Workflow Status:Published
  • Created By:Louise Russo
  • Created:10/01/2009
  • Modified By:Fletcher Moore
  • Modified:10/07/2016