Loading…

Not just data: A method for improving prediction with knowledge

[Display omitted] •We focus on latent clinical factors that can only be indirectly observed.•We propose a methodology of developing BNs that reason with latent variables.•A series of expert reviews reveal the relation between data and latent variables.•The method is illustrated by a medical case stu...

Full description

Saved in:
Bibliographic Details
Published in:Journal of biomedical informatics 2014-04, Vol.48, p.28-37
Main Authors: Yet, Barbaros, Perkins, Zane, Fenton, Norman, Tai, Nigel, Marsh, William
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:[Display omitted] •We focus on latent clinical factors that can only be indirectly observed.•We propose a methodology of developing BNs that reason with latent variables.•A series of expert reviews reveal the relation between data and latent variables.•The method is illustrated by a medical case study on trauma care.•The case study displays significant predictive improvements from the expert reviews. Many medical conditions are only indirectly observed through symptoms and tests. Developing predictive models for such conditions is challenging since they can be thought of as ‘latent’ variables. They are not present in the data and often get confused with measurements. As a result, building a model that fits data well is not the same as making a prediction that is useful for decision makers. In this paper, we present a methodology for developing Bayesian network (BN) models that predict and reason with latent variables, using a combination of expert knowledge and available data. The method is illustrated by a case study into the prediction of acute traumatic coagulopathy (ATC), a disorder of blood clotting that significantly increases the risk of death following traumatic injuries. There are several measurements for ATC and previous models have predicted one of these measurements instead of the state of ATC itself. Our case study illustrates the advantages of models that distinguish between an underlying latent condition and its measurements, and of a continuing dialogue between the modeller and the domain experts as the model is developed using knowledge as well as data.
ISSN:1532-0464
1532-0480
DOI:10.1016/j.jbi.2013.10.012