{"680510":{"#nid":"680510","#data":{"type":"event","title":"MS Proposal by Kwangmin Cho","body":[{"value":"\u003Cp\u003E\u003Cstrong\u003EKwangmin Cho\u003C\/strong\u003E\u003Cbr\u003E\u003Cem\u003E(Advisor: Prof. Dimitri Mavris)\u003C\/em\u003E\u003C\/p\u003E\u003Cp\u003E\u003Cem\u003Ewill propose a master\u2019s thesis entitled,\u003C\/em\u003E\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003EImproving LLM Performance in Aerospace NER Task:\u003C\/strong\u003E\u003Cbr\u003E\u003Cstrong\u003EA Study on Data Augmentation and Fine-tuning Strategy\u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003E\u003Cem\u003EOn\u003C\/em\u003E\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003EThursday, February 27 at 10:00 a.m.\u0026nbsp;\u003C\/strong\u003E\u003Cbr\u003E\u003Cstrong\u003EWeber Space and Technology Building (SST II), Collaborative Visualization Environment (CoVE)\u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003EAnd on MS Teams\u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003E\u003Ca href=\u0022https:\/\/teams.microsoft.com\/l\/meetup-join\/19%3ameeting_NjcyNWVlZDYtZjYyMy00MTFhLWIwMDUtODdiMzZhOTIyYzg3%40thread.v2\/0?context=%7b%22Tid%22%3a%22482198bb-ae7b-4b25-8b7a-6d7f32faa083%22%2c%22Oid%22%3a%22ee1dabea-b513-4004-80ce-a321c9168962%22%7d\u0022\u003Ehttps:\/\/teams.microsoft.com\/l\/meetup-join\/19%3ameeting_NjcyNWVlZDYtZjYyMy00MTFhLWIwMDUtODdiMzZhOTIyYzg3%40thread.v2\/0?context=%7b%22Tid%22%3a%22482198bb-ae7b-4b25-8b7a-6d7f32faa083%22%2c%22Oid%22%3a%22ee1dabea-b513-4004-80ce-a321c9168962%22%7d\u003C\/a\u003E\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003EAbstract\u003C\/strong\u003E\u003Cbr\u003EAs digital transformation progresses across various sectors, systems engineering is also transitioning from document-based practices to Model-Based Systems Engineering (MBSE). This shift is anticipated to improve traceability, streamline verification and validation processes, and enable better integration across system components. 
In alignment with this transition, there is a growing need for Named Entity Recognition (NER) methods capable of extracting machine-readable entities from requirements written in natural language (NL). NER plays a critical role in identifying and classifying data belonging to target entity types. Among the various approaches for NER, fine-tuning Large Language Models (LLMs) has shown significant promise due to the rapid advancements in their capabilities.\u0026nbsp;\u003C\/p\u003E\u003Cp\u003EHowever, fine-tuning LLMs for domain-specific tasks presents significant challenges, particularly in low-resource domains where open-source data is scarce, and in labor-intensive pre-processing tasks such as NER, which requires every token in the training data to be paired with corresponding entity labels. Aerospace requirements exemplify both challenges: their confidential nature restricts data availability, and NER tasks demand not only extensive annotation efforts but also expert-level knowledge. Consequently, the NER task for aerospace requirements engineering remains underexplored compared to other NLP and fine-tuning applications.\u003C\/p\u003E\u003Cp\u003ETo address the challenges of low-resource domains and labor-intensive pre-processing, this study proposes a domain-entity adaptive data augmentation strategy aimed at improving the performance of fine-tuned LLMs without requiring extensive manual labeling efforts. This strategy employs Synonym Replacement (SR) and Label-wise Token Replacement (LwTR) adaptively, based on a detailed analysis of domain-specific entity characteristics. These characteristics are identified by evaluating entity-wise performance across varying replacement rates and augmentation methods. By tailoring the augmentation strategy to account for the desired levels of variability and method preferences for each entity type, this study explores optimal combinations of replacement rates and augmentation methods. 
The proposed approach seeks to enhance the overall performance of fine-tuned LLMs for NER tasks in aerospace domains, addressing key challenges in data scarcity and annotation costs, while contributing to advancements in requirements engineering.\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003ECommittee\u003C\/strong\u003E\u003C\/p\u003E\u003Cul\u003E\u003Cli\u003EProf. Dimitri Mavris \u2013 School of Aerospace Engineering (advisor)\u003C\/li\u003E\u003Cli\u003EDr. Olivia Fischer \u2013 School of Aerospace Engineering\u003C\/li\u003E\u003Cli\u003EDr. Woongje Sung \u2013 School of Aerospace Engineering\u003C\/li\u003E\u003C\/ul\u003E","summary":"","format":"limited_html"}],"field_subtitle":"","field_summary":[{"value":"\u003Cp\u003E\u003Cstrong\u003EImproving LLM Performance in Aerospace NER Task:\u003C\/strong\u003E\u003Cbr\u003E\u003Cstrong\u003EA Study on Data Augmentation and Fine-tuning Strategy\u003C\/strong\u003E\u003C\/p\u003E","format":"limited_html"}],"field_summary_sentence":[{"value":"Improving LLM Performance in Aerospace NER Task: A Study on Data Augmentation and Fine-tuning Strategy"}],"uid":"27707","created_gmt":"2025-02-14 19:53:09","changed_gmt":"2025-02-14 19:53:41","author":"Tatianna Richardson","boilerplate_text":"","field_publication":"","field_article_url":"","field_event_time":{"event_time_start":"2025-02-27T10:00:00-05:00","event_time_end":"2025-02-27T12:00:00-05:00","event_time_end_last":"2025-02-27T12:00:00-05:00","gmt_time_start":"2025-02-27 15:00:00","gmt_time_end":"2025-02-27 17:00:00","gmt_time_end_last":"2025-02-27 17:00:00","rrule":null,"timezone":"America\/New_York"},"location":"Weber Space and Technology Building (SST II), Collaborative Visualization Environment (CoVE)","extras":[],"groups":[{"id":"221981","name":"Graduate Studies"}],"categories":[],"keywords":[{"id":"166866","name":"MS 
Proposal"}],"core_research_areas":[],"news_room_topics":[],"event_categories":[{"id":"1788","name":"Other\/Miscellaneous"}],"invited_audience":[{"id":"78771","name":"Public"}],"affiliations":[],"classification":[],"areas_of_expertise":[],"news_and_recent_appearances":[],"phone":[],"contact":[],"email":[],"slides":[],"orientation":[],"userdata":""}}}