Causal Graph Discovery From Self and Mutually Exciting Time Series | IEEE Journals & Magazine | IEEE Xplore

Causal Graph Discovery From Self and Mutually Exciting Time Series


Abstract:

We present a generalized linear structural causal model, coupled with a novel data-adaptive linear regularization, to recover causal directed acyclic graphs (DAGs) from t...Show More
Topic: Causality: Fundamental Limits and Applications

Abstract:

We present a generalized linear structural causal model, coupled with a novel data-adaptive linear regularization, to recover causal directed acyclic graphs (DAGs) from time series. By leveraging a recently developed stochastic monotone Variational Inequality (VI) formulation, we cast the causal discovery problem as a general convex optimization. Furthermore, we develop a non-asymptotic recovery guarantee and quantifiable uncertainty by solving a linear program to establish confidence intervals for a wide range of non-linear monotone link functions. We validate our theoretical results and show the competitive performance of our method via extensive numerical experiments. Most importantly, we demonstrate the effectiveness of our approach in recovering highly interpretable causal DAGs over Sepsis Associated Derangements (SADs) while achieving comparable prediction performance to powerful “black-box” models such as XGBoost.
Topic: Causality: Fundamental Limits and Applications
Page(s): 747 - 761
Date of Publication: 20 December 2023
Electronic ISSN: 2641-8770

Funding Agency:


I. Introduction

Continuous, automated surveillance systems incorporating machine learning models are becoming increasingly common in healthcare environments. These models can capture temporally dependent changes across multiple patient variables and enhance a clinician’s situational awareness by providing an early alarm of an impending adverse event. Among those adverse events, we are particularly interested in sepsis, which is a life-threatening medical condition contributing to one in five deaths globally [1] and stands as one of the most important cases for automated in-hospital surveillance. Recently, many machine learning methods have been developed to predict the onset of sepsis, utilizing electronic medical record (EMR) data [2]. A recent sepsis prediction competition [3] demonstrated the robust performance of XGBoost models [4], [5], [6]; meanwhile, Deep Neural Networks [7] are also commonly used. However, most approaches offer an alert adjudicator very little information pertaining to the reasons for the prediction, leading many to refer to them as “black box” models. Thus, model predictions related to disease identification, particularly for complex diseases, still need to be adjudicated (i.e., interpreted) by a clinician before further action (i.e., treatment) can be initiated. Among the aforementioned works, [6] provided one of the best attempts at identifying causality for their models’ predictions by reporting feature importance at a global level for all patients; still, this did not convey which features were most important in arriving at a given prediction for an individual patient. The common lack of interpretability of many clinical models, particularly those related to sepsis, suggests a strong need for principled methods to study the interactions among time series in medical settings.

Contact IEEE to Subscribe

References

References is not available for this document.