Tissue-aware RNA-Seq processing and normalization for heterogeneous and sparse data

About this item

Full title

Tissue-aware RNA-Seq processing and normalization for heterogeneous and sparse data

Publisher

England: BioMed Central Ltd

Journal title

BMC bioinformatics, 2017-10, Vol.18 (1), p.437-437, Article 437

Language

English

More information

Scope and Contents

Contents

Although ultrahigh-throughput RNA sequencing (RNA-Seq) has become the dominant technology for genome-wide transcriptional profiling, the vast majority of RNA-Seq studies profile only tens of samples, and most analytical pipelines are optimized for these smaller studies. However, projects are generating ever-larger data sets comprising RNA-Seq data from hundreds or thousands of samples, often collected at multiple centers and from diverse tissues. These complex data sets present significant analytical challenges due to batch and tissue effects, but they also provide the opportunity to revisit the assumptions and methods that we use to preprocess, normalize, and filter RNA-Seq data, which are critical first steps for any subsequent analysis.
We find that analysis of large RNA-Seq data sets requires both careful quality control and accounting for the sparsity that arises from the heterogeneity intrinsic to multi-group studies. We developed the Yet Another RNA Normalization software pipeline (YARN), which includes quality control and preprocessing, gene filtering, and normalization steps designed to facilitate downstream analysis of large, heterogeneous RNA-Seq data sets, and we demonstrate its use with data from the Genotype-Tissue Expression (GTEx) project.
An R package instantiating YARN is available at http://bioconductor.org/packages/yarn ....

Identifiers

Primary Identifiers

Record Identifier

TN_cdi_doaj_primary_oai_doaj_org_article_39492d2dc8214268be55238a0b71829a

Permalink

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_doaj_primary_oai_doaj_org_article_39492d2dc8214268be55238a0b71829a

Other Identifiers

ISSN

1471-2105

E-ISSN

1471-2105

DOI

10.1186/s12859-017-1847-x
