{"599056":{"#nid":"599056","#data":{"type":"event","title":"PhD Proposal by Shanmukha Ramakrishna Vedantam","body":[{"value":"\u003Cp\u003E\u003Cstrong\u003ETitle:\u003C\/strong\u003E Connecting Vision and Language for Interpretation, Grounding, and Imagination\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\r\n\r\n\u003Cp\u003EDate: Wednesday, November 29 2017\u003Cbr \/\u003E\r\nTime: 12:30PM - 02:30PM (EDT)\u003Cbr \/\u003E\r\nLocation: CCB 247\u003Cbr \/\u003E\r\n\u003Cbr \/\u003E\r\nShanmukha Ramakrishna Vedantam\u003Cbr \/\u003E\r\nPh.D. Student\u003Cbr \/\u003E\r\nSchool of Interactive Computing\u003Cbr \/\u003E\r\nCollege of Computing\u003Cbr \/\u003E\r\nGeorgia Institute of Technology\u003Cbr \/\u003E\r\n\u003Cbr \/\u003E\r\n\u003Cstrong\u003ECommittee:\u003C\/strong\u003E\u003Cbr \/\u003E\r\nDr. Devi Parikh (Advisor, School of Interactive Computing, Georgia Institute of Technology)\u003Cbr \/\u003E\r\nDr. Dhruv Batra (School of Interactive Computing, Georgia Institute of Technology)\u003Cbr \/\u003E\r\nDr. Jacob Eisenstein (School of Interactive Computing, Georgia Institute of Technology)\u003Cbr \/\u003E\r\nDr. Kevin P. Murphy (Research Scientist, Google Research)\u003Cbr \/\u003E\r\nDr. C. Lawrence Zitnick (Research Manager, Facebook AI Research)\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cstrong\u003EAbstract:\u003C\/strong\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003EUnderstanding how to model computer vision and natural language jointly is a long-standing challenge in artificial intelligence. In this thesis, I will study how modeling vision and language in meaningful ways can derive more human-like inferences from machine learning models. Specifically, I will consider three related problems: interpretation, grounding, and imagination.\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\r\n\r\n\u003Cp\u003EIn interpretation, the goal will be to get machine learning models to understand an image and describe its contents using natural language in a contextually relevant manner. In grounding, I will study how to connect natural language to referents in the physical world, and show how this can help learn common sense. Finally, in proposed work, I will study how to \u0026lsquo;imagine\u0026rsquo; visual concepts completely and accurately across the full range and (potentially unseen) compositions of their visual attributes. I will study these problems from computational as well as algorithmic perspectives and suggest exciting directions for future work.\u003C\/p\u003E\r\n","summary":null,"format":"limited_html"}],"field_subtitle":"","field_summary":"","field_summary_sentence":[{"value":": Connecting Vision and Language for Interpretation, Grounding, and Imagination"}],"uid":"27707","created_gmt":"2017-11-22 14:10:17","changed_gmt":"2017-11-22 14:10:17","author":"Tatianna Richardson","boilerplate_text":"","field_publication":"","field_article_url":"","field_event_time":{"event_time_start":"2017-11-29T12:30:00-05:00","event_time_end":"2017-11-29T14:30:00-05:00","event_time_end_last":"2017-11-29T14:30:00-05:00","gmt_time_start":"2017-11-29 17:30:00","gmt_time_end":"2017-11-29 19:30:00","gmt_time_end_last":"2017-11-29 19:30:00","rrule":null,"timezone":"America\/New_York"},"extras":[],"groups":[{"id":"221981","name":"Graduate Studies"}],"categories":[],"keywords":[{"id":"102851","name":"Phd proposal"}],"core_research_areas":[],"news_room_topics":[],"event_categories":[{"id":"1788","name":"Other\/Miscellaneous"}],"invited_audience":[{"id":"78761","name":"Faculty\/Staff"},{"id":"78771","name":"Public"},{"id":"174045","name":"Graduate students"},{"id":"78751","name":"Undergraduate students"}],"affiliations":[],"classification":[],"areas_of_expertise":[],"news_and_recent_appearances":[],"phone":[],"contact":[],"email":[],"slides":[],"orientation":[],"userdata":""}}}