Date of Graduation

5-2026

Document Type

Thesis

Degree Name

Bachelor of Science in Industrial Engineering and Operations Analytics

Degree Level

Undergraduate

Department

Industrial Engineering

Advisor/Mentor

Shengfan Zhang

Committee Member

Brandon Crisel

Abstract

Sepsis remains a leading cause of morbidity and mortality among young children, with infants experiencing the highest risk of adverse outcomes, yet they are often aggregated with older pediatric populations in research and clinical risk models. Using nationally representative inpatient data from the 2018–2020 Healthcare Cost and Utilization Project National Inpatient Sample, this study characterizes sepsis hospitalizations among infants aged 0–3 years, examines comorbidity structure using network-based methods, and evaluates mortality prediction under severe class imbalance. Comorbidity networks constructed from ICD-10 diagnosis co-occurrence revealed dense diagnostic relationships, with renal, respiratory, metabolic, and neurologic conditions occupying structurally central positions independent of sepsis itself. Infants who died during hospitalization exhibited substantially higher diagnostic burden and longer lengths of stay than survivors. Random forest mortality prediction models demonstrated marked trade-offs between accuracy and sensitivity: a baseline model achieved high overall accuracy (0.935) but low sensitivity (0.350), while a model trained with class-balanced downsampling improved sensitivity to 0.700 at the cost of increased false positives; both models achieved strong and comparable ROC-AUC values (0.862 and 0.854) using administrative variables alone. Post-hoc SHAP analysis confirmed that diagnostic burden was the dominant predictor across both models, with the rank ordering of features stable regardless of sampling strategy, providing convergent evidence across descriptive, network, and predictive analyses. These findings highlight the value of network-informed representations of comorbidity and imbalance-aware modeling for identifying high-risk infants, and demonstrate a scalable, interpretable framework for studying infant sepsis risk using administrative data.

Keywords

infant sepsis; comorbidity network analysis; class imbalance; random forest; administrative data; HCUP National Inpatient Sample

Available for download on Friday, April 23, 2027

Share

COinS