Predicting in-hospital death from derived EHR trajectory features
MetadataShow full item record
Medical histories of patients can provide insight into the immediate future of a patient. While most studies propose to predict survival from vital signs and hospital tests within one episode of care, we carry out selective feature engineering from longitudinal historical medical records in this study to develop a dataset with derived features. We then train multiple machine learning models for the binary prediction whether an episode of care will culminate in death among patients suspected of bloodstream infections. The machine learning classifier performance is evaluated and compared and the feature importance impacting the model output is explored. The findings indicated that the logistic regression model achieved the best performance for predicting death in the next hospital episode with an accuracy of 98% and an almost perfect area under the receiver operating characteristic curve. Exploring the feature importance reveals that time to and severity of the last episode and previous history of sepsis episodes were the most critical features.