{"412081":{"#nid":"412081","#data":{"type":"event","title":"PhD Defense by Christopher Berlind","body":[{"value":"\u003Cp\u003EPh.D.\u0026nbsp;Dissertation\u0026nbsp;Defense\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003E\u0026nbsp;\u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003ETitle: New Insights on the Power of Active Learning\u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003EChristopher Berlind\u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003EPh.D. Candidate in Computer Science\u003C\/p\u003E\u003Cp\u003ESchool of Computer Science\u003Cbr \/\u003E Georgia Institute of Technology\u003C\/p\u003E\u003Cp\u003E\u003Ca href=\u0022http:\/\/www.cc.gatech.edu\/~cberlind\u0022\u003Ehttp:\/\/www.cc.gatech.edu\/~cberlind\u003C\/a\u003E\u003C\/p\u003E\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\u003Cp\u003EDate: Tuesday, June 23, 2015\u003Cbr \/\u003E Time: 9:30 am\u003Cbr \/\u003E Location: KACB Room 2100\u003C\/p\u003E\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003ECommittee\u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003E---------------\u003C\/p\u003E\u003Cp\u003EProf. Maria-Florina Balcan (Co-advisor, School of Computer Science, Carnegie Mellon University)\u003C\/p\u003E\u003Cp\u003EProf. Le Song (Co-advisor,\u0026nbsp;School of Computational Science and Engineering,\u0026nbsp;Georgia Institute of Technology)\u003C\/p\u003E\u003Cp\u003EProf. Santosh Vempala (School of Computer Science,\u0026nbsp;Georgia Institute of Technology)\u003C\/p\u003E\u003Cp\u003EProf. Charles L. Isbell, Jr. (School of Interactive Computing,\u0026nbsp;Georgia Institute of Technology)\u003C\/p\u003E\u003Cp\u003EProf. Avrim Blum (School of Computer Science,\u0026nbsp;Carnegie Mellon University)\u003C\/p\u003E\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\u003Cp\u003EAbstract\u003C\/p\u003E\u003Cp\u003E---------------\u003C\/p\u003E\u003Cp\u003ESupervised machine learning is the process of algorithmically learning how to make future predictions by training on labeled examples of past occurrences. While traditionally a learning algorithm has access to a large corpus of labeled examples, the recent proliferation of data made possible by modern computing power and the Internet has made unlabeled data much easier to come by than accompanying labels. For example, billions of images are readily available for download on the Internet, but annotations of the objects present in an image are much more difficult to acquire.\u003C\/p\u003E\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\u003Cp\u003ETwo main methods have been\u0026nbsp;proposed\u0026nbsp;by the machine learning community for taking advantage of relatively low-cost unlabeled examples in an effort to reduce the number of expensive labeled examples needed for learning. One method is semi-supervised learning, which includes a large quantity of unlabeled examples into the training data in addition to a smaller number of labeled examples. Another is active learning, in which the algorithm itself can select which examples it would like labeled out of a large pool of unlabeled examples. Prior research on active learning has focused almost entirely on the issue of reducing labeling effort (over that of passive learning) through intelligent querying strategies.\u003C\/p\u003E\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\u003Cp\u003EIn this\u0026nbsp;dissertation, we show that the power to make adaptive label queries has benefits beyond reducing labeling effort over passive learning.\u0026nbsp;We develop and explore several novel methods for active learning that exemplify these new capabilities.\u0026nbsp;Some of these methods use active learning for a non-standard purpose, such as computational speedup, structure discovery, and domain adaptation. Others successfully apply active learning in situations where prior results have given evidence of its ineffectiveness.\u003C\/p\u003E\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\u003Cp\u003ESpecifically, we first give an active algorithm for learning disjunctions that is able to overcome a computational intractability present in the semi-supervised version of the same problem. This is the first known example of the computational advantages of active learning. Next, we investigate using active learning to determine structural properties (margins) of the data-generating distribution that can further improve learning rates. This is in contrast to most active learning algorithms which either assume or ignore structure rather than seeking to identify and exploit it. We then give an active nearest neighbors algorithm for domain adaptation, the task of learning a predictor for some target domain using mostly examples from a different source domain. This is the first formal analysis of the generalization and query behavior of an active domain adaptation algorithm. Finally, we show a situation where active learning can outperform passive learning on very noisy data, circumventing prior results that active learning cannot have a significant advantage over passive learning in high-noise regimes.\u003C\/p\u003E\u003Cp\u003E \u003C\/p\u003E","summary":null,"format":"limited_html"}],"field_subtitle":"","field_summary":"","field_summary_sentence":[{"value":"New Insights on the Power of Active Learning"}],"uid":"27707","created_gmt":"2015-06-09 07:50:47","changed_gmt":"2016-10-08 02:12:10","author":"Tatianna Richardson","boilerplate_text":"","field_publication":"","field_article_url":"","field_event_time":{"event_time_start":"2015-06-23T10:30:00-04:00","event_time_end":"2015-06-23T12:30:00-04:00","event_time_end_last":"2015-06-23T12:30:00-04:00","gmt_time_start":"2015-06-23 14:30:00","gmt_time_end":"2015-06-23 16:30:00","gmt_time_end_last":"2015-06-23 16:30:00","rrule":null,"timezone":"America\/New_York"},"extras":[],"groups":[{"id":"221981","name":"Graduate Studies"}],"categories":[],"keywords":[{"id":"208","name":"computing"},{"id":"1366","name":"defense"},{"id":"1808","name":"graduate students"},{"id":"913","name":"PhD"}],"core_research_areas":[],"news_room_topics":[],"event_categories":[{"id":"1788","name":"Other\/Miscellaneous"}],"invited_audience":[{"id":"78771","name":"Public"}],"affiliations":[],"classification":[],"areas_of_expertise":[],"news_and_recent_appearances":[],"phone":[],"contact":[],"email":[],"slides":[],"orientation":[],"userdata":""}}}