<node id="613227">
  <nid>613227</nid>
  <type>event</type>
  <uid>
    <user id="27707"><![CDATA[27707]]></user>
  </uid>
  <created>1540404281</created>
  <changed>1540828546</changed>
  <title><![CDATA[PhD Defense by Xingyu Yang ]]></title>
  <body><![CDATA[<p>In partial fulfillment of the requirements for the degree of&nbsp;</p>

<p>Doctor of Philosophy in Bioinformatics</p>

<p>in the School of Biological Sciences</p>

<p><br />
<strong>Xingyu Yang</strong></p>

<p><br />
Defends his thesis:<br />
<strong>Computational approaches to intuitively analyze and visualize big data in biological research</strong></p>

<p>&nbsp;</p>

<p>Tuesday, November 6, 2018</p>

<p>2:00pm<br />
Materials Science and Engineering Building 3201a</p>

<p>&nbsp;</p>

<p><strong>Thesis Advisor:</strong></p>

<p>Dr. Peng Qiu<br />
Department of Biomedical Engineering<br />
Georgia Institute of Technology and Emory University</p>

<p><br />
<strong>Committee Members:</strong></p>

<p>Dr. Soojin Yi<br />
School of Biological Sciences<br />
Georgia Institute of Technology</p>

<p>&nbsp;</p>

<p>Dr. Gregory Gibson</p>

<p>School of Biological Sciences<br />
Georgia Institute of Technology</p>

<p>&nbsp;</p>

<p>Dr. David Archer</p>

<p>Department of Pediatrics</p>

<p>Emory University School of Medicine</p>

<p>&nbsp;</p>

<p>Dr. Ignacio Sanz<br />
Department of Medicine</p>

<p>Emory University</p>

<p>&nbsp;</p>

<p><strong>Abstract: </strong></p>

<p>To quantitatively understand the cell behavior in molecular level, scientists have developed technologies including high throughput sequencing and flow cytometry. High throughput sequencing can obtain the entire genome sequence and measure expression of large number of genes. Flow cytometry can measure multiple parameters of large number of cells. Both technologies generate large amount of data in high dimension. Therefore, efficient methods to analyze and interpret the data become in demand. In my thesis, I focus on developing computational methods that deliver intuitive and interpretable visualization of biological data. The first chapter describes a software named Cluster-to-Gate (C2G) that can visualize existing clustering results of flow/mass cytometry data in the format of 2D gating hierarchy. Though C2G presents a way to visualize and interpret clustering results, the visualization is still data-driven and no human-knowledge is incorporated. To overcome the limitation of C2G, the second chapter describes a framework that can learn gating approach from existing publications to build a knowledge-graph. This knowledge-graph can automatically suggest order of marker usage and gating hierarchy for new data set, which can be used to gate cell populations. The obtained cell populations are immediately matched to known cell types in the knowledge-graph, which makes them interpretable. The third chapter describe a novel algorithm (GLaMST) to reconstruct lineage tree of B cell receptor gene from high throughput sequencing data. This algorithm outperforms state-of-art in both accuracy and speed.</p>

<p>&nbsp;</p>
]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Computational approaches to intuitively analyze and visualize big data in biological research]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2018-11-06T14:00:00-05:00]]></value>
      <value2><![CDATA[2018-11-06T16:00:00-05:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
          <item>
        <value><![CDATA[Graduate students]]></value>
      </item>
          <item>
        <value><![CDATA[Undergraduate students]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>221981</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[Graduate Studies]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1788</tid>
        <value><![CDATA[Other/Miscellaneous]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>100811</tid>
        <value><![CDATA[Phd Defense]]></value>
      </item>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
