Log in to save to my catalogue

Predicting population health with machine learning: a scoping review

Predicting population health with machine learning: a scoping review

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_doaj_primary_oai_doaj_org_article_0a7d838897804beba83efdfabec21d89

Predicting population health with machine learning: a scoping review

About this item

Full title

Predicting population health with machine learning: a scoping review

Publisher

England: British Medical Journal Publishing Group

Journal title

BMJ open, 2020-10, Vol.10 (10), p.e037860-e037860

Language

English

Formats

Publication information

Publisher

England: British Medical Journal Publishing Group

More information

Scope and Contents

Contents

ObjectiveTo determine how machine learning has been applied to prediction applications in population health contexts. Specifically, to describe which outcomes have been studied, the data sources most widely used and whether reporting of machine learning predictive models aligns with established reporting guidelines.DesignA scoping review.Data sourcesMEDLINE, EMBASE, CINAHL, ProQuest, Scopus, Web of Science, Cochrane Library, INSPEC and ACM Digital Library were searched on 18 July 2018.Eligibility criteriaWe included English articles published between 1980 and 2018 that used machine learning to predict population-health-related outcomes. We excluded studies that only used logistic regression or were restricted to a clinical context.Data extraction and synthesisWe summarised findings extracted from published reports, which included general study characteristics, aspects of model development, reporting of results and model discussion items.ResultsOf 22 618 articles found by our search, 231 were included in the review. The USA (n=71, 30.74%) and China (n=40, 17.32%) produced the most studies. Cardiovascular disease (n=22, 9.52%) was the most studied outcome. The median number of observations was 5414 (IQR=16 543.5) and the median number of features was 17 (IQR=31). Health records (n=126, 54.5%) and investigator-generated data (n=86, 37.2%) were the most common data sources. Many studies did not incorporate recommended guidelines on machine learning and predictive modelling. Predictive discrimination was commonly assessed using area under the receiver operator curve (n=98, 42.42%) and calibration was rarely assessed (n=22, 9.52%).ConclusionsMachine learning applications in population health have concentrated on regions and diseases well represented in traditional data sources, infrequently using big data. Important aspects of model development were under-reported. Greater use of big data and reporting guidelines for predictive modelling could improve machine learning applications in population health.Registration numberRegistered on the Open Science Framework on 17 July 2018 (available at https://osf.io/rnqe6/)....

Alternative Titles

Full title

Predicting population health with machine learning: a scoping review

Identifiers

Primary Identifiers

Record Identifier

TN_cdi_doaj_primary_oai_doaj_org_article_0a7d838897804beba83efdfabec21d89

Permalink

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_doaj_primary_oai_doaj_org_article_0a7d838897804beba83efdfabec21d89

Other Identifiers

ISSN

2044-6055

E-ISSN

2044-6055

DOI

10.1136/bmjopen-2020-037860

How to access this item