<node id="599056">
  <nid>599056</nid>
  <type>event</type>
  <uid>
    <user id="27707"><![CDATA[27707]]></user>
  </uid>
  <created>1511359817</created>
  <changed>1511359817</changed>
  <title><![CDATA[PhD Proposal by Shanmukha Ramakrishna Vedantam]]></title>
  <body><![CDATA[<p><strong>Title:</strong> Connecting Vision and Language for Interpretation, Grounding, and Imagination</p>

<p>&nbsp;</p>

<p>Date: Wednesday, November 29, 2017<br />
Time: 12:30PM - 02:30PM (EST)<br />
Location: CCB 247<br />
<br />
Shanmukha Ramakrishna Vedantam<br />
Ph.D. Student<br />
School of Interactive Computing<br />
College of Computing<br />
Georgia Institute of Technology<br />
<br />
<strong>Committee:</strong><br />
Dr. Devi Parikh (Advisor, School of Interactive Computing, Georgia Institute of Technology)<br />
Dr. Dhruv Batra (School of Interactive Computing, Georgia Institute of Technology)<br />
Dr. Jacob Eisenstein (School of Interactive Computing, Georgia Institute of Technology)<br />
Dr. Kevin P. Murphy (Research Scientist, Google Research)<br />
Dr. C. Lawrence Zitnick (Research Manager, Facebook AI Research)</p>

<p>&nbsp;</p>

<p><strong>Abstract:</strong></p>

<p>Jointly modeling computer vision and natural language is a long-standing challenge in artificial intelligence. In this thesis, I will study how modeling vision and language in meaningful ways can yield more human-like inferences from machine learning models. Specifically, I will consider three related problems: interpretation, grounding, and imagination.</p>

<p>&nbsp;</p>

<p>In interpretation, the goal will be to get machine learning models to understand an image and describe its contents using natural language in a contextually relevant manner. In grounding, I will study how to connect natural language to referents in the physical world, and show how this can help learn common sense. Finally, in proposed work, I will study how to &lsquo;imagine&rsquo; visual concepts completely and accurately across the full range and (potentially unseen) compositions of their visual attributes. I will study these problems from computational as well as algorithmic perspectives and suggest exciting directions for future work.</p>
]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Connecting Vision and Language for Interpretation, Grounding, and Imagination]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2017-11-29T12:30:00-05:00]]></value>
      <value2><![CDATA[2017-11-29T14:30:00-05:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Faculty/Staff]]></value>
      </item>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
          <item>
        <value><![CDATA[Graduate students]]></value>
      </item>
          <item>
        <value><![CDATA[Undergraduate students]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>221981</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[Graduate Studies]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1788</tid>
        <value><![CDATA[Other/Miscellaneous]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>102851</tid>
        <value><![CDATA[Phd proposal]]></value>
      </item>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
