<node id="678710">
  <nid>678710</nid>
  <type>event</type>
  <uid>
    <user id="36518"><![CDATA[36518]]></user>
  </uid>
  <created>1733150434</created>
  <changed>1743700498</changed>
  <title><![CDATA[ML@GT Seminar Series | Parameter-Efficient Training of Large Language Models]]></title>
  <body><![CDATA[<p>Featuring Anna Rumshisky,&nbsp;University of Massachusetts Lowell</p><p><strong>Abstract: </strong>Over the past few years, scaling large language models (LLMs) has become the go-to AI solution for solving progressively more complicated tasks. However, as the models scaled towards hundreds of billions of parameters, not just training them from scratch, but even fine-tuning to adapt to specific tasks has become computationally expensive and unmanageable for most practitioners. This has led to a crisis, in which only a few well-funded industry labs have the resources to develop high-quality LLMs. In response to these developments, numerous parameter-efficient techniques for fine-tuning large language models have been developed, revolutionizing the accessibility of fine-tuning LLMs. However, until recently, such methods were not available for pre-training.<br><br>In this talk, I will present our recent work on ReLoRA, the first parameter-efficient method for training models from scratch, which utilizes low-rank updates to train high-rank networks, with demonstrated results for models with up to 1.3 billion parameters. I will also include a brief overview and taxonomy of different parameter-efficient training (PET) methods that have been developed over the past two years. I will discuss the advantages and limitations of different approaches to PET with respect to parameter and memory efficiency, as well as training speed and inference throughput.<br><br><strong>Bio: </strong>Anna Rumshisky is an Associate Professor of Computer Science at the University of Massachusetts Lowell, where she leads the Text Machine Lab for NLP.&nbsp; Her primary research area is artificial intelligence and large language model (LLM) research, with a focus on efficient large model training and model analysis and interpretability. She holds a joint appointment as an Amazon Scholar at Amazon AGIF (Artificial General Intelligence Foundations), where she helps drive the scientific efforts behind large-scale foundational LLM training in an industry setting. She was previously a postdoctoral fellow at MIT CSAIL and a PhD in Computer Science from Brandeis University.&nbsp; She is a recipient of the NSF CAREER award in 2017 and the best thematic paper award at NAACL-HLT 2019. Her research has been funded by the NSF, NIH, Army Research Office, among others.</p>]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Featuring Anna Rumshisky, University of Massachusetts Lowell]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p>Machine Learning Center Seminar Series is held bi-weekly on Wednesdays at 12pm.&nbsp;</p>]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2025-04-16T12:00:00-04:00]]></value>
      <value2><![CDATA[2025-04-16T13:00:00-04:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
          <item>
        <value><![CDATA[free_food]]></value>
      </item>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Faculty/Staff]]></value>
      </item>
          <item>
        <value><![CDATA[Postdoc]]></value>
      </item>
          <item>
        <value><![CDATA[Graduate students]]></value>
      </item>
          <item>
        <value><![CDATA[Undergraduate students]]></value>
      </item>
      </field_audience>
  <field_media>
          <item>
        <nid>
          <node id="676746">
            <nid>676746</nid>
            <type>image</type>
            <title><![CDATA[2025.0416-ML-Seminar-Announcement-Anna-Rumshisky.jpg]]></title>
            <body><![CDATA[]]></body>
                          <field_image>
                <item>
                  <fid>260574</fid>
                  <filename><![CDATA[2025.0416-ML-Seminar-Announcement-Anna-Rumshisky.jpg]]></filename>
                  <filepath><![CDATA[/sites/default/files/2025/04/03/2025.0416-ML-Seminar-Announcement-Anna-Rumshisky.jpg]]></filepath>
                  <file_full_path><![CDATA[http://hg.gatech.edu//sites/default/files/2025/04/03/2025.0416-ML-Seminar-Announcement-Anna-Rumshisky.jpg]]></file_full_path>
                  <filemime>image/jpeg</filemime>
                  <image_740><![CDATA[]]></image_740>
                  <image_alt><![CDATA[ML@GT Seminar Series hosts Anna Rumshisky on Wednesday, April 16 at 12pm. ]]></image_alt>
                </item>
              </field_image>
            
                      </node>
        </nid>
      </item>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[<p>Shelli Hatcher, Program and Operations Manager</p>]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[CODA 9th Floor Atrium]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[https://coda.gatech.edu/]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>576481</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[ML@GT]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1795</tid>
        <value><![CDATA[Seminar/Lecture/Colloquium]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>9167</tid>
        <value><![CDATA[machine learning]]></value>
      </item>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
