event

PhD Defense by Sun Ju (Julie) Lee

Primary tabs

You are cordially invited to my thesis defense on Wednesday, July 2nd at 10AM EST.
Title: Optimization for Interpretable Predictive and Prescriptive Models in Healthcare
Date: July 2nd, 2025
Time: 10AM – 12PM EST
Meeting Link: Microsoft Teams Meeting

Sun Ju Lee
Ph.D. Candidate in Operations Research
School of Industrial and Systems Engineering
Georgia Institute of Technology

Committee:
Dr. Gian-Gabriel Garcia (Advisor), H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology
Dr. Nicoleta Serban, H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology
Dr. Weijun Xie, H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology
Dr. Lauren Steimle, H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology
Dr. Sheree Boulet, Department of Obstetrics & Gynecology, Boston University

Abstract:
As machine learning and artificial intelligence increasingly inform decision-making in high-stakes domains such as healthcare, there is a growing need for models that are not only accurate but also interpretable and trustworthy. This dissertation develops optimization-based methods for interpretable modeling in healthcare, addressing problems in disease prediction within maternal and perinatal health and medical decision-making in the context of chronic disease management.

Motivated by the challenge of accurate risk estimation in maternal and perinatal health, Chapter 2, builds sparse and interpretable logistic regression models to predict a composite adverse perinatal outcome at multiple points during pregnancy. Our findings enable clinicians to understand how risk factors for these adverse outcomes evolve over time, toward facilitating timely intervention.

Chapter 3 builds on the work in Chapter 2 by addressing the complexities and disadvantages of modeling a composite outcome. Chapter 3 proposes the Composite Clustered Multi-Task Learning (CCMTL) problem, an optimization framework that simultaneously clusters individual outcomes into composite outcomes and learns predictive models for each cluster. We prove that that CCMTL is NP-Hard and derive both an exact non-linear mixed integer program and a scalable greedy heuristic algorithm to solve the problem. Through numerical experiments and a case study on maternal and perinatal health, we demonstrate that CCMTL can generate composite outcomes which exhibit improved prediction performance and reveal meaningful clusters of outcomes sharing underlying similarity.

Lastly, Chapter 4 explores interpretability in sequential decision-making by studying the trade-off between one-size-fits-all treatment guidelines and personalized care. Specifically, we formulate the Treatment Guideline Design Problem (TGDP), which simultaneously partitions a patient population into a pre-determined number of groups and designs an optimal treatment policy for each group.  We characterize the structural properties of TGDP, proving that it is NP-hard and showing that population-level health outcomes are non-decreasing in the number of groups. To solve the problem, we develop exact mixed-integer linear programming formulations alongside heuristic solution methods. Our numerical experiments and case study on hypertension treatment planning demonstrate that TGDP can generate parsimonious treatment guidelines that achieve health outcomes within 1% of optimal personalized policies while enhancing ease of implementation. Chapter 5 concludes with an overview of future research directions.

 

Status

  • Workflow Status:Published
  • Created By:Tatianna Richardson
  • Created:06/23/2025
  • Modified By:Tatianna Richardson
  • Modified:06/23/2025

Categories

Keywords

Target Audience