Log in to save to my catalogue

FastTree: Computing Large Minimum Evolution Trees with Profiles instead of a Distance Matrix

FastTree: Computing Large Minimum Evolution Trees with Profiles instead of a Distance Matrix

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_3175946225

FastTree: Computing Large Minimum Evolution Trees with Profiles instead of a Distance Matrix

About this item

Full title

FastTree: Computing Large Minimum Evolution Trees with Profiles instead of a Distance Matrix

Publisher

United States: Oxford University Press

Journal title

Molecular biology and evolution, 2009-07, Vol.26 (7), p.1641-1650

Language

English

Formats

Publication information

Publisher

United States: Oxford University Press

More information

Scope and Contents

Contents

Gene families are growing rapidly, but standard methods for inferring phylogenies do not scale to alignments with over 10,000 sequences. We present FastTree, a method for constructing large phylogenies and for estimating their reliability. Instead of storing a distance matrix, FastTree stores sequence profiles of internal nodes in the tree. FastTree uses these profiles to implement Neighbor-Joining and uses heuristics to quickly identify candidate joins. FastTree then uses nearest neighbor interchanges to reduce the length of the tree. For an alignment with N sequences, L sites, and a different characters, a distance matrix requires O(N2) space and O(N2L) time, but FastTree requires just O(NLa + N) memory and O(Nlog (N)La) time. To estimate the tree's reliability, FastTree uses local bootstrapping, which gives another 100-fold speedup over a distance matrix. For example, FastTree computed a tree and support values for 158,022 distinct 16S ribosomal RNAs in 17 h and 2.4 GB of memory. Just computing pairwise Jukes–Cantor distances and storing them, without inferring a tree or bootstrapping, would require 17 h and 50 GB of memory. In simulations, FastTree was slightly more accurate than Neighbor-Joining, BIONJ, or FastME; on genuine alignments, FastTree's topologies had higher likelihoods. FastTree is available at http://microbesonline.org/fasttree....

Alternative Titles

Full title

FastTree: Computing Large Minimum Evolution Trees with Profiles instead of a Distance Matrix

Identifiers

Primary Identifiers

Record Identifier

TN_cdi_proquest_journals_3175946225

Permalink

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_3175946225

Other Identifiers

ISSN

0737-4038

E-ISSN

1537-1719

DOI

10.1093/molbev/msp077

How to access this item