Log in to save to my catalogue

A method for achieving complete microbial genomes and improving bins from metagenomics data

A method for achieving complete microbial genomes and improving bins from metagenomics data

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_plos_journals_2541866480

A method for achieving complete microbial genomes and improving bins from metagenomics data

About this item

Full title

A method for achieving complete microbial genomes and improving bins from metagenomics data

Publisher

United States: Public Library of Science

Journal title

PLoS computational biology, 2021-05, Vol.17 (5), p.e1008972-e1008972

Language

English

Formats

Publication information

Publisher

United States: Public Library of Science

More information

Scope and Contents

Contents

Metagenomics facilitates the study of the genetic information from uncultured microbes and complex microbial communities. Assembling complete genomes from metagenomics data is difficult because most samples have high organismal complexity and strain diversity. Some studies have attempted to extract complete bacterial, archaeal, and viral genomes and often focus on species with circular genomes so they can help confirm completeness with circularity. However, less than 100 circularized bacterial and archaeal genomes have been assembled and published from metagenomics data despite the thousands of datasets that are available. Circularized genomes are important for (1) building a reference collection as scaffolds for future assemblies, (2) providing complete gene content of a genome, (3) confirming little or no contamination of a genome, (4) studying the genomic context and synteny of genes, and (5) linking protein coding genes to ribosomal RNA genes to aid metabolic inference in 16S rRNA gene sequencing studies. We developed a semi-automated method called Jorg to help circularize small bacterial, archaeal, and viral genomes using iterative assembly, binning, and read mapping. In addition, this method exposes potential misassemblies from k-mer based assemblies. We chose species of the Candidate Phyla Radiation (CPR) to focus our initial efforts because they have small genomes and are only known to have one ribosomal RNA operon. In addition to 34 circular CPR genomes, we present one circular Margulisbacteria genome, one circular Chloroflexi genome, and two circular megaphage genomes from 19 public and published datasets. We demonstrate findings that would likely be difficult without circularizing genomes, including that ribosomal genes are likely not operonic in the majority of CPR, and that some CPR harbor diverged forms of RNase P RNA. Code and a tutorial for this method is available at
https://github.com/lmlui/Jorg
and is available on the DOE Systems Biology KnowledgeBase as a beta app....

Alternative Titles

Full title

A method for achieving complete microbial genomes and improving bins from metagenomics data

Identifiers

Primary Identifiers

Record Identifier

TN_cdi_plos_journals_2541866480

Permalink

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_plos_journals_2541866480

Other Identifiers

ISSN

1553-7358,1553-734X

E-ISSN

1553-7358

DOI

10.1371/journal.pcbi.1008972

How to access this item