Algorithms for Prediction of Clinical Deterioration on the General Wards: A Scoping Review

Journal of Hospital Medicine 16(10). 2021 October;612-619. Published Online First June 25, 2021 | 10.12788/jhm.3630

June 23, 2021|Journal of Hospital Medicine

Roel V Peelen, MD;Yassin Eddahchouri, MD;Mats Koeneman, MSc;Tom H van de Belt, MSc, PhD;Harry van Goor, MD, PhD;Sebastian JH Bredie, MD, PhD

OBJECTIVE: The primary objective of this scoping review was to identify and describe state-of-the-art models that use vital sign monitoring to predict clinical deterioration on the general ward. The secondary objective was to identify facilitators, barriers, and effects of implementing these models.

DATA SOURCES: PubMed, Embase, and CINAHL databases until November 2020.

STUDY SELECTION: We selected studies that compared vital signs–based automated real-time predictive algorithms to current track-and-trace protocols in regard to the outcome of clinical deterioration in a general ward population.

DATA EXTRACTION: Study characteristics, predictive characteristics and barriers, facilitators, and effects.

RESULTS: We identified 1741 publications, 21 of which were included in our review. Two of the these were clinical trials, 2 were prospective observational studies, and the remaining 17 were retrospective studies. All of the studies focused on hospitalized adult patients. The reported area under the receiver operating characteristic curves ranged between 0.65 and 0.95 for the outcome of clinical deterioration. Positive predictive value and sensitivity ranged between 0.223 and 0.773 and from 7.2% to 84.0%, respectively. Input variables differed widely, and predicted endpoints were inconsistently defined. We identified 57 facilitators and 48 barriers to the implementation of these models. We found 68 reported effects, 57 of which were positive.

CONCLUSION: Predictive algorithms can detect clinical deterioration on the general ward earlier and more accurately than conventional protocols, which in one recent study led to lower mortality. Consensus is needed on input variables, predictive time horizons, and definitions of endpoints to better facilitate comparative research.

All of the studies that compared their proposed model with one of various warning systems (eg, EWS, National Early Warning Score [NEWS], Modified Early Warning Score [MEWS]) showed superior performance (based on AUROC and reported predictive values). In 17 studies, the authors reported their model as more useful or superior to the EWS.^{20-23,26-28,34,36-41} Four studies reported real-time detection of deterioration before regular EWS,^20,26,42 and three studies reported positive effects on patient-related outcomes.^26,35 Four negative effects were noted on the controllability, validity, and potential limitations.^27,42

There were 26 positive effects on the clinical process mentioned, 7 of which pointed out the effects of earlier, predictive alarming. Algorithms with higher PPVs reported greater rates of actionable alarms, less alarm fatigue, and improved workflow.^{21,22,24-27,30,32,33,35-38,40}Potential alarm fatigue was named as a barrier.^27,42 Smoother scoring instead of binned categories was mentioned positively.^24,26In the infrastructure domain, very few items were reported. The increased need for education on the used techniques was reported once as a negative effect.³⁴ One of the positive infrastructural effects noted was more efficient planning and use of resources.^24,37,40We identified 57 facilitators and 48 barriers for the clinical implementation and use of real-time predictive analytics (Appendix Figure). In the Technology domain, there were 18 facilitators and 20 barriers cited, and in the Organization domain, 25 and 14, respectively. They were equally present in the Professional and Physiology domains (6 vs 5, 8 vs 9).

Of the 38 remarks in the Technology domain, difficulty with implementation in daily practice was a commonly cited barrier.^22,24,40,42Difficulties included creating real-time data feeds out of the EMR, though there were mentions of some successful examples.^25,27,36 Difficulty in the interpretability of AI was also considered a potential barrier.^{30,32,33,35,39,41} There were remarks as to the applicability of the prolonged prediction horizon because of the associated decoupling from the clinical view.^39,42

Conservative attitudes toward new technologies and inadequate knowledge were mentioned as barriers.³⁹ Repeated remarks were made on the difficulty of interpreting and responding to a predicted escalation, as the clinical pattern might not be recognizable at such an early stage. On the other hand, it is expected that less invasive countermeasures would be adequate to avert further escalation. Earlier recognition of possible escalations also raised potential ethical questions, such as when to discuss palliative care.²⁴

The heterogeneity of the general ward population and the relatively low prevalence of deterioration were mentioned as barriers.^24,30,38,41There were also concerns that not all escalations are preventable and that some patient outcomes may not be modifiable.^24,38

Many investigators expected reductions in false alarms and associated alarm fatigue (reflected as higher PPVs). Furthermore, they expected workflow to improve and workload to decrease.^{21,23,27,31,33,35,38,41} Despite the capacity of modern EMRs to store large amounts of patient data, some investigators felt improvements to real-time access, data quality and validity, and data density are needed to ensure valid associated predictions.^{21,22,24,32,37}

DISCUSSION

As the complexity and comorbidity of hospitalized adults grow, predicting clinical deterioration is becoming more important. With an ever-increasing amount of available patient data, real-time algorithms can predict the patient’s clinical course with increasing accuracy, positively affecting outcomes.^4,21,25,43 The studies identified in this scoping review, as measured by higher AUROC scores and improved PPVs, show that predictive algorithms can outperform more conventional EWS, enable earlier and more efficient alarming, and be successfully implemented on the general wards. However, formal meta-analysis was made infeasible by differences in populations, use of different endpoint definitions, cut-off points, time-horizons to prediction, and other methodologic heterogeneity.