CSE Seminar: By Paul Bennett/MSR,

Event Details
  • Date/Time:
    • Tuesday November 6, 2012 - Wednesday November 7, 2012
      12:00 pm - 12:59 pm
  • Location: Klaus 1116 East
  • Phone:
  • URL:
  • Email:
  • Fee(s):
  • Extras:

Hongyuan Zha | College of Computing Georgia Institute of Technology http://www.cc.gatech.edu/~zha


Summary Sentence: Mining and Using Contextual Information from Large-Scale Web Search Logs

Full Summary: No summary paragraph submitted.


CSE Seminar

Speaker: Paul Bennett/MSR, 


Mining and Using Contextual Information from Large-Scale Web Search Logs


Information retrieval has made significant progress in returning relevant results for a single query. However, much search activity is conducted within a much richer context of a current task focus, recent search activities as well as longer-term preferences. For example, our ability to accurately interpret the current query can be informed by knowledge of the web pages a searcher was viewing when initiating the search or recent actions of the searcher such as queries issued, results clicked, and pages viewed. We develop a framework that enables representation of a broad variety of context including the searcher's long-term interests, recent activity, current focus, and other user characteristics. We then demonstrate how that can be used to improve the quality of search results.  We describe recent progress on three key challenges in this domain: mining contextual signals from large scale logs; understanding and modeling the combination of short-term and long-term behavior; and learning a more robust model that mitigates the risk of applying the contextual model when a simpler model would suffice.

This talk will present joint work with Filip Radlinski, Lidan Wang, Ryen White, Kevyn Collins-Thompson, Wei Chu, Susan Dumais, Peter Bailey, Emine Yilmaz, Fedor Borisyuk, and Xiaoyuan Cui.


Paul Bennett is a Researcher in the Context, Learning & User Experience for Search (CLUES) group at Microsoft Research where he works on using machine learning technology to improve information access and retrieval. His recent research has focused on classification-enhanced and contextual information retrieval, pairwise preferences, human computation, and text classification while his previous work focused primarily on ensemble methods, active learning, and obtaining reliable probability estimates, but also extended to machine translation, recommender systems, and knowledge bases. He completed his dissertation on combining text classifiers using reliability indicators in 2006 at Carnegie Mellon where he was advised by Profs. Jaime Carbonell and John Lafferty.


For more information, please contact


Hongyuan Zha | College of Computing

Georgia Institute of Technology



Additional Information

In Campus Calendar

High Performance Computing (HPC), College of Computing, School of Computer Science, School of Interactive Computing, School of Computational Science and Engineering

Invited Audience
No audiences were selected.
No keywords were submitted.
  • Created By: Lometa Mitchell
  • Workflow Status: Published
  • Created On: Oct 30, 2012 - 5:53am
  • Last Updated: Oct 7, 2016 - 10:00pm