{"689922":{"#nid":"689922","#data":{"type":"event","title":"PhD Proposal by Kaige Xie","body":[{"value":"\u003Cp\u003E\u003Cstrong\u003ETitle:\u003C\/strong\u003E\u0026nbsp;Lifecycle-Oriented Optimization of Natural Language Generation Systems through Text Sub-Structures\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003EDate:\u003C\/strong\u003E\u0026nbsp;Monday, April 27th, 2026\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003ETime:\u003C\/strong\u003E\u0026nbsp;1:00\u20133:00 PM ET\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003ELocation:\u003C\/strong\u003E\u0026nbsp;online [\u003Ca href=\u0022https:\/\/teams.microsoft.com\/meet\/293794348175007?p=H5XD7DVYM8ekfduaF9\u0022 title=\u0022https:\/\/teams.microsoft.com\/meet\/293794348175007?p=H5XD7DVYM8ekfduaF9\u0022\u003ETeams link\u003C\/a\u003E]\u003C\/p\u003E\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003EKaige Xie\u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003EPh.D. Student in Computer Science\u003C\/p\u003E\u003Cp\u003ESchool of Interactive Computing\u003C\/p\u003E\u003Cp\u003EGeorgia Institute of Technology\u003C\/p\u003E\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003ECommittee:\u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003EDr. Pascal Van Hentenryck (advisor) - School of Industrial and Systems Engineering and School of Interactive Computing, Georgia Institute of Technology\u003C\/p\u003E\u003Cp\u003EDr. Thomas Ploetz - School of Interactive Computing, Georgia Institute of Technology\u003C\/p\u003E\u003Cp\u003EDr. Chao Zhang - School of Computational Science and Engineering, Georgia Institute of Technology\u003C\/p\u003E\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003EAbstract:\u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003EThis dissertation investigates how to optimize natural language generation (NLG) systems built on large language models (LLMs) from a holistic, lifecycle-oriented perspective. 
While recent advances in LLMs have led to substantial gains across a wide range of NLG tasks, prior research has largely focused on improving benchmark performance, often overlooking the broader challenges that arise across model training, inference, evaluation, and deployment. This dissertation argues that such a performance-centric view is insufficient for real-world NLG systems, whose success depends not only on output quality but also on efficiency, reasoning capability, evaluation fidelity, and user trust. To address this gap, the dissertation introduces a unified framework centered on text sub-structures\u2014semantically meaningful intermediate representations embedded in text\u2014and studies how their recognition and strategic utilization can improve NLG systems throughout their full lifecycle.\u0026nbsp;\u003C\/p\u003E\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\u003Cp\u003EThe dissertation develops this framework across four representative NLG tasks: dialogue summarization, story generation, action plan generation, and question answering. In dialogue summarization, it shows how dialogue skeletons can facilitate more effective few-shot learning and improve cross-task prompt transfer under limited supervision. In story generation, it demonstrates how outline-based planning structures can guide LLMs toward producing more coherent and engaging narratives. In action plan generation, it examines precondition-effect dependencies as a form of latent world knowledge that enables LLMs to better model action feasibility and environmental change. In question answering, it explores sub-questions as a versatile sub-structure for both fine-grained system evaluation and explanation generation, improving the assessment of open-ended retrieval-augmented generation systems and enhancing users\u2019 ability to judge model reliability. 
Collectively, these studies show that text sub-structures provide a general and effective semantic scaffold for improving learning efficiency, inference-time planning and reasoning, evaluation robustness, and deployment-time user experience.\u0026nbsp;\u003C\/p\u003E","summary":"","format":"limited_html"}],"field_subtitle":"","field_summary":[{"value":"\u003Cp\u003ELifecycle-Oriented Optimization of Natural Language Generation Systems through Text Sub-Structures\u003C\/p\u003E","format":"limited_html"}],"field_summary_sentence":[{"value":"Lifecycle-Oriented Optimization of Natural Language Generation Systems through Text Sub-Structures"}],"uid":"27707","created_gmt":"2026-04-21 17:14:05","changed_gmt":"2026-04-21 17:19:54","author":"Tatianna Richardson","boilerplate_text":"","field_publication":"","field_article_url":"","field_event_time":{"event_time_start":"2026-04-27T13:00:36-04:00","event_time_end":"2026-04-27T15:00:00-04:00","event_time_end_last":"2026-04-27T15:00:00-04:00","gmt_time_start":"2026-04-27 17:00:36","gmt_time_end":"2026-04-27 19:00:00","gmt_time_end_last":"2026-04-27 19:00:00","rrule":null,"timezone":"America\/New_York"},"location":"TEAMS","extras":[],"groups":[{"id":"221981","name":"Graduate Studies"}],"categories":[],"keywords":[{"id":"102851","name":"Phd proposal"}],"core_research_areas":[],"news_room_topics":[],"event_categories":[{"id":"1788","name":"Other\/Miscellaneous"}],"invited_audience":[{"id":"78771","name":"Public"}],"affiliations":[],"classification":[],"areas_of_expertise":[],"news_and_recent_appearances":[],"phone":[],"contact":[],"email":[],"slides":[],"orientation":[],"userdata":""}}}