Log in to save to my catalogue

HybPiper: Extracting Coding Sequence and Introns for Phylogenetics from High-Throughput Sequencing R...

HybPiper: Extracting Coding Sequence and Introns for Phylogenetics from High-Throughput Sequencing R...

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4948903

HybPiper: Extracting Coding Sequence and Introns for Phylogenetics from High-Throughput Sequencing Reads Using Target Enrichment

About this item

Full title

HybPiper: Extracting Coding Sequence and Introns for Phylogenetics from High-Throughput Sequencing Reads Using Target Enrichment

Publisher

United States: Botanical Society of America

Journal title

Applications in plant sciences, 2016-07, Vol.4 (7), p.n/a

Language

English

Formats

Publication information

Publisher

United States: Botanical Society of America

More information

Scope and Contents

Contents

Premise of the study: Using sequence data generated via target enrichment for phylogenetics requires reassembly of high-throughput sequence reads into loci, presenting a number of bioinformatics challenges. We developed HybPiper as a user-friendly platform for assembly of gene regions, extraction of exon and intron sequences, and identification of paralogous gene copies. We test HybPiper using baits designed to target 333 phylogenetic markers and 125 genes of functional significance in Artocarpus (Moraceae). Methods and Results: HybPiper implements parallel execution of sequence assembly in three phases: read mapping, contig assembly, and target sequence extraction. The pipeline was able to recover nearly complete gene sequences for all genes in 22 species of Artocarpus. HybPiper also recovered more than 500 bp of nontargeted intron sequence in over half of the phylogenetic markers and identified paralogous gene copies in Artocarpus. Conclusions: HybPiper was designed for Linux and Mac OS X and is freely available at https://github.com/mossmatters/HybPiper....

Alternative Titles

Full title

HybPiper: Extracting Coding Sequence and Introns for Phylogenetics from High-Throughput Sequencing Reads Using Target Enrichment

Identifiers

Primary Identifiers

Record Identifier

TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4948903

Permalink

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4948903

Other Identifiers

ISSN

2168-0450

E-ISSN

2168-0450

DOI

10.3732/apps.1600016

How to access this item