<node id="628783">
  <nid>628783</nid>
  <type>news</type>
  <uid>
    <user id="34773"><![CDATA[34773]]></user>
  </uid>
  <created>1573227186</created>
  <changed>1573229681</changed>
  <title><![CDATA[Making Sure Computing Machines Don’t Stereotype People]]></title>
  <body><![CDATA[<p>Machine learning algorithms increasingly shape society, from helping judges with courtroom decisions to influencing which applicants banks approve for loans. With decisions big and small potentially swayed by these mathematical models, researchers are working to make algorithms more transparent and fair.</p>

<p><strong>Uthaipon (Tao) Tantipongpipat</strong> and <strong>Samira Samadi</strong>, Georgia Tech Ph.D. students in the <a href="https://scs.gatech.edu/">School of Computer Science</a>, recently published a <a href="https://arxiv.org/pdf/1902.11281.pdf">new paper</a> on reducing the dimension of large data sets used for population analysis while preserving essential traits of the groups being analyzed. Algorithms can handle millions of records, but dimension reduction compresses information and can lose details. This, in turn, can lead to groups of people being unfairly associated with certain behaviors or characteristics.</p>

<p>Samadi and Tantipongpipat&rsquo;s <a href="https://www.cc.gatech.edu/news/615576/georgia-tech-researchers-working-improve-fairness-ml-pipeline">previous work</a> builds on principal component analysis (PCA), a dimension reduction technique that has long been the gold standard for analyzing large data sets more efficiently. Their own version, Fair-PCA, keeps the strengths of PCA while retaining more information, so that algorithms can, in theory, have better data for decision-making.</p>

<p>In their latest work, the duo is optimizing fair dimensionality reduction, allowing populations to be represented more accurately not only with PCA but also with a wider class of dimension reduction techniques.</p>

<p>The updated algorithm incorporates multiple equity measurements for populations &ndash; i.e., with respect to social and economic welfare &ndash; and takes into account multiple demographic attributes. For example, gender is usually analyzed as male and female, but this leaves transgender people and other non-binary people out of an algorithm&rsquo;s calculations, leading to unfair or biased assessments.</p>

<p>This new work is designed to allow machine learning researchers to analyze complex data sets more accurately, potentially leading to less bias.</p>

<p>&quot;I feel like if fairness and bias are not being taken seriously into account at this point, then our problems are only going to compound. Machine learning algorithms are dominating our lives every day and they learn to behave based on previous outcomes. If we just let this build up and if we don&#39;t take care of it now, it will have a huge impact, one that may not be as positive as we had hoped,&quot; said Samadi.</p>

<p>The team will present <a href="https://arxiv.org/pdf/1902.11281.pdf"><em>Multi-Criteria Dimensionality Reduction with Applications to Fairness</em></a> in December at the <a href="https://neurips.cc/">33<sup>rd</sup> Annual Conference on Neural Information Processing Systems (NeurIPS)</a> 2019 in Vancouver, British Columbia.</p>
]]></body>
  <field_subtitle>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_subtitle>
  <field_dateline>
    <item>
      <value>2019-11-08T00:00:00-05:00</value>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_dateline>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Georgia Tech researchers develop an algorithm that reduces bias across different populations.]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_summary>
  <field_media>
          <item>
        <nid>
          <node id="628782">
            <nid>628782</nid>
            <type>image</type>
            <title><![CDATA[This summer, Samira Samadi presented work at the International Conference on Machine Learning.]]></title>
            <body><![CDATA[]]></body>
                          <field_image>
                <item>
                  <fid>239466</fid>
                  <filename><![CDATA[-4936141608470894095_IMG_3150.jpg]]></filename>
                  <filepath><![CDATA[/sites/default/files/images/-4936141608470894095_IMG_3150.jpg]]></filepath>
                  <file_full_path><![CDATA[http://hg.gatech.edu//sites/default/files/images/-4936141608470894095_IMG_3150.jpg]]></file_full_path>
                  <filemime>image/jpeg</filemime>
                  <image_740><![CDATA[]]></image_740>
                  <image_alt><![CDATA[Samira Samadi]]></image_alt>
                </item>
              </field_image>
            
                      </node>
        </nid>
      </item>
      </field_media>
  <field_contact_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_contact_email>
  <field_location>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_location>
  <field_contact>
    <item>
      <value><![CDATA[<p>Allie McFadden</p>

<p>Communications Officer</p>

<p>allie.mcfadden@cc.gatech.edu</p>
]]></value>
    </item>
  </field_contact>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related> </links_related>
  <files> </files>
  <og_groups>
          <item>47223</item>
          <item>576481</item>
          <item>50875</item>
      </og_groups>
  <og_groups_both>
          <item>
        <![CDATA[Student and Faculty]]>
      </item>
          <item>
        <![CDATA[Student Research]]>
      </item>
          <item>
        <![CDATA[Computer Science/Information Technology and Security]]>
      </item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>134</tid>
        <value><![CDATA[Student and Faculty]]></value>
      </item>
          <item>
        <tid>8862</tid>
        <value><![CDATA[Student Research]]></value>
      </item>
          <item>
        <tid>153</tid>
        <value><![CDATA[Computer Science/Information Technology and Security]]></value>
      </item>
      </field_categories>
  <core_research_areas>
          <term tid="39501"><![CDATA[People and Technology]]></term>
      </core_research_areas>
  <field_news_room_topics>
      </field_news_room_topics>
  <og_groups_both>
          <item><![CDATA[College of Computing]]></item>
          <item><![CDATA[ML@GT]]></item>
          <item><![CDATA[School of Computer Science]]></item>
      </og_groups_both>
  <field_keywords>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
