Annotation of genomics data using bidirectional hidden Markov models unveils variations in Pol II tr...
Annotation of genomics data using bidirectional hidden Markov models unveils variations in Pol II transcription cycle
About this item
Full title
Author / Creator
Publisher
London: Nature Publishing Group UK
Journal title
Language
English
Formats
Publication information
Publisher
London: Nature Publishing Group UK
Subjects
More information
Scope and Contents
Contents
DNA replication, transcription and repair involve the recruitment of protein complexes that change their composition as they progress along the genome in a directed or strand‐specific manner. Chromatin immunoprecipitation in conjunction with hidden Markov models (HMMs) has been instrumental in understanding these processes, as they segment the genome into discrete states that can be related to DNA‐associated protein complexes. However, current HMM‐based approaches are not able to assign forward or reverse direction to states or properly integrate strand‐specific (e.g., RNA expression) with non‐strand‐specific (e.g., ChIP) data, which is indispensable to accurately characterize directed processes. To overcome these limitations, we introduce bidirectional HMMs which infer directed genomic states from occupancy profiles
de novo
. Application to RNA polymerase II‐associated factors in yeast and chromatin modifications in human T cells recovers the majority of transcribed loci, reveals gene‐specific variations in the yeast transcription cycle and indicates the existence of directed chromatin state patterns at transcribed, but not at repressed, regions in the human genome. In yeast, we identify 32 new transcribed loci, a regulated initiation–elongation transition, the absence of elongation factors Ctk1 and Paf1 from a class of genes, a distinct transcription mechanism for highly expressed genes and novel DNA sequence motifs associated with transcription termination. We anticipate bidirectional HMMs to significantly improve the analyses of genome‐associated directed processes.
Synopsis
Bidirectional hidden Markov models improve the annotation of DNA‐associated processes from genomics data, reveal variations in the yeast Pol II transcription cycle and identify directed chromatin state patterns at transcribed regions in the human genome.
Genomic feature annotations derived from bidirectional hidden Markov models are up to twice as accurate compared to those from standard hidden Markov models.
Variations in the yeast Pol II transcription cycle fall into clusters of co‐regulated genes, whose functional categories range from housekeeping and cell cycle to stress response.
New insights into transcriptional regulation are obtained, indicating a regulated initiation–elongation transition and a distinct transcription mechanism for highly expressed genes.
An implementation of bidirectional hidden Markov models is freely available at the Bioconductor website:
http://www.bioconductor.org/packages/devel/bioc/html/STAN.html
.
Graphical Abstract
Bidirectional hidden Markov models improve the annotation of DNA‐associated processes from genomics data, reveal variations in the yeast Pol II transcription cycle and identify directed chromatin state patterns at transcribed regions in the human genome....
Alternative Titles
Full title
Annotation of genomics data using bidirectional hidden Markov models unveils variations in Pol II transcription cycle
Authors, Artists and Contributors
Author / Creator
Identifiers
Primary Identifiers
Record Identifier
TN_cdi_doaj_primary_oai_doaj_org_article_a4ac83f854274e27a23f1dcb3daf7ddd
Permalink
https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_doaj_primary_oai_doaj_org_article_a4ac83f854274e27a23f1dcb3daf7ddd
Other Identifiers
ISSN
1744-4292
E-ISSN
1744-4292
DOI
10.15252/msb.20145654