A basic analysis toolkit for biological sequences
A basic analysis toolkit for biological sequences
About this item
Full title
Author / Creator
Publisher
England: BioMed Central Ltd
Journal title
Language
English
Formats
Publication information
Publisher
England: BioMed Central Ltd
Subjects
More information
Scope and Contents
Contents
This paper presents a software library, nicknamed BATS, for some basic sequence analysis tasks. Namely, local alignments, via approximate string matching, and global alignments, via longest common subsequence and alignments with affine and concave gap cost functions. Moreover, it also supports filtering operations to select strings from a set and establish their statistical significance, via z-score computation. None of the algorithms is new, but although they are generally regarded as fundamental for sequence analysis, they have not been implemented in a single and consistent software package, as we do here. Therefore, our main contribution is to fill this gap between algorithmic theory and practice by providing an extensible and easy to use software library that includes algorithms for the mentioned string matching and alignment problems. The library consists of C/C++ library functions as well as Perl library functions. It can be interfaced with Bioperl and can also be used as a stand-alone system with a GUI. The software is available at http://www.math.unipa.it/~raffaele/BATS/ under the GNU GPL....
Alternative Titles
Full title
A basic analysis toolkit for biological sequences
Authors, Artists and Contributors
Author / Creator
Identifiers
Primary Identifiers
Record Identifier
TN_cdi_doaj_primary_oai_doaj_org_article_357238b89cf84e8e92ac03c21cb9e50a
Permalink
https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_doaj_primary_oai_doaj_org_article_357238b89cf84e8e92ac03c21cb9e50a
Other Identifiers
ISSN
1748-7188
E-ISSN
1748-7188
DOI
10.1186/1748-7188-2-10