Log in to save to my catalogue

Statistical Methods for Detecting Differentially Abundant Features in Clinical Metagenomic Samples

Statistical Methods for Detecting Differentially Abundant Features in Clinical Metagenomic Samples

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_plos_journals_1312465606

Statistical Methods for Detecting Differentially Abundant Features in Clinical Metagenomic Samples

About this item

Full title

Statistical Methods for Detecting Differentially Abundant Features in Clinical Metagenomic Samples

Publisher

United States: Public Library of Science

Journal title

PLoS computational biology, 2009-04, Vol.5 (4), p.e1000352-e1000352

Language

English

Formats

Publication information

Publisher

United States: Public Library of Science

More information

Scope and Contents

Contents

Numerous studies are currently underway to characterize the microbial communities inhabiting our world. These studies aim to dramatically expand our understanding of the microbial biosphere and, more importantly, hope to reveal the secrets of the complex symbiotic relationship between us and our commensal bacterial microflora. An important prerequisite for such discoveries are computational tools that are able to rapidly and accurately compare large datasets generated from complex bacterial communities to identify features that distinguish them.We present a statistical method for comparing clinical metagenomic samples from two treatment populations on the basis of count data (e.g. as obtained through sequencing) to detect differentially abundant features. Our method, Metastats, employs the false discovery rate to improve specificity in high-complexity environments, and separately handles sparsely-sampled features using Fisher's exact test. Under a variety of simulations, we show that Metastats performs well compared to previously used methods, and significantly outperforms other methods for features with sparse counts. We demonstrate the utility of our method on several datasets including a 16S rRNA survey of obese and lean human gut microbiomes, COG functional profiles of infant and mature gut microbiomes, and bacterial and viral metabolic subsystem data inferred from random sequencing of 85 metagenomes. The application of our method to the obesity dataset reveals differences between obese and lean subjects not reported in the original study. For the COG and subsystem datasets, we provide the first statistically rigorous assessment of the differences between these populations. The methods described in this paper are the first to address clinical metagenomic datasets comprising samples from multiple subjects. Our methods are robust across datasets of varied complexity and sampling level. While designed for metagenomic applications, our software can also be applied to digital gene expression studies (e.g. SAGE). A web server implementation of our methods and freely available source code can be found at http://metastats.cbcb.umd.edu/....

Alternative Titles

Full title

Statistical Methods for Detecting Differentially Abundant Features in Clinical Metagenomic Samples

Authors, Artists and Contributors

Identifiers

Primary Identifiers

Record Identifier

TN_cdi_plos_journals_1312465606

Permalink

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_plos_journals_1312465606

Other Identifiers

ISSN

1553-7358,1553-734X

E-ISSN

1553-7358

DOI

10.1371/journal.pcbi.1000352

How to access this item