<node id="677167">
  <nid>677167</nid>
  <type>event</type>
  <uid>
    <user id="27707"><![CDATA[27707]]></user>
  </uid>
  <created>1727446380</created>
  <changed>1727446407</changed>
  <title><![CDATA[PhD Proposal by Alexander Bendeck]]></title>
  <body><![CDATA[<p><strong>Title:</strong>&nbsp;Large Language Models as Computational Engines and Virtual Domain Experts for Visual Data Analysis</p><p>&nbsp;</p><p><strong>Date:</strong>&nbsp;Thursday, October 10, 2024</p><p><strong>Time:</strong>&nbsp;9 a.m. - 11 a.m. ET (US)</p><p><strong>Location:</strong>&nbsp;Technology Square Research Building (TSRB) 334</p><p><strong>Virtual meeting (hybrid):</strong>&nbsp;<a href="https://gatech.zoom.us/j/5618662383?pwd=dTB2YjB5WnRiaHhFaHZITVNQeFJVUT09" title="https://gatech.zoom.us/j/5618662383?pwd=dTB2YjB5WnRiaHhFaHZITVNQeFJVUT09">Click here to join Zoom meeting</a></p><p>&nbsp;</p><p><strong>Alexander Bendeck</strong></p><p>Ph.D. Student in Computer Science&nbsp;</p><p>School of Interactive Computing&nbsp;</p><p>Georgia Institute of Technology&nbsp;</p><p>&nbsp;</p><p><strong>Committee</strong></p><p>Dr. John Stasko (Advisor) - School of Interactive Computing, Georgia Institute of Technology</p><p>Dr. Alex Endert - School of Interactive Computing, Georgia Institute of Technology</p><p>Dr. Clio Andris - School of City and Regional Planning, Georgia Institute of Technology</p><p>Dr. Cindy Xiong Bearfield - School of Interactive Computing, Georgia Institute of Technology</p><p>Dr. Ross Maciejewski&nbsp;- School of Computing and Augmented Intelligence, Arizona State University</p><p>&nbsp;</p><p><strong>Abstract</strong></p><p>Advances in generative artificial intelligence have led to the development of pre-trained large language models (LLMs) which are widely available and broadly useful. For data visualization researchers, LLMs have the promise to extend existing research threads in exciting directions, potentially super-charging visualization systems with their vast domain knowledge and computational power. However, LLM-powered systems pose new challenges for both visualization researchers and our intended system users. For instance, well-documented hallucination and inconsistency issues with LLMs can inhibit visualization system performance and erode user trust. We also have little formal understanding of LLMs’ ability to help data analysts with specific tasks.</p><p>&nbsp;</p><p>The aim of my thesis work is to study the potential use of LLMs as “virtual domain experts” during visual data analysis. This includes two main goals: First, to evaluate LLMs at applying their knowledge bases to data- and chart-centric tasks; and second, to study user task completion, satisfaction, and trust for LLM-powered visualization systems. I addressed the first goal through an empirical evaluation of the GPT-4V multimodal language model on a suite of visualization literacy tasks, demonstrating the state of the art in LLM performance at reading and understanding visualizations. I propose subsequent work to address both goals by assessing LLMs’ domain knowledge and generative capabilities on two specific tasks: question answering and data integration. For each task, I will conduct formative studies, empirical evaluations, and design probes using prototype visualization systems, exploring both technical and human-centered perspectives on the use of LLMs during visual data analysis.</p><p>&nbsp;</p>]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Large Language Models as Computational Engines and Virtual Domain Experts for Visual Data Analysis]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p>Large Language Models as Computational Engines and Virtual Domain Experts for Visual Data Analysis</p>]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2024-10-10T09:00:00-04:00]]></value>
      <value2><![CDATA[2024-10-10T11:00:00-04:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[Technology Square Research Building (TSRB) 334]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>221981</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[Graduate Studies]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1788</tid>
        <value><![CDATA[Other/Miscellaneous]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>102851</tid>
        <value><![CDATA[Phd proposal]]></value>
      </item>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
