A multi-task convolutional deep neural network for variant calling in single molecule sequencing
A multi-task convolutional deep neural network for variant calling in single molecule sequencing
About this item
Full title
Author / Creator
Publisher
London: Nature Publishing Group UK
Journal title
Language
English
Formats
Publication information
Publisher
London: Nature Publishing Group UK
Subjects
More information
Scope and Contents
Contents
The accurate identification of DNA sequence variants is an important, but challenging task in genomics. It is particularly difficult for single molecule sequencing, which has a per-nucleotide error rate of ~5–15%. Meeting this demand, we developed Clairvoyante, a multi-task five-layer convolutional neural network model for predicting variant type (SNP or indel), zygosity, alternative allele and indel length from aligned reads. For the well-characterized NA12878 human sample, Clairvoyante achieves 99.67, 95.78, 90.53% F1-score on 1KP common variants, and 98.65, 92.57, 87.26% F1-score for whole-genome analysis, using Illumina, PacBio, and Oxford Nanopore data, respectively. Training on a second human sample shows Clairvoyante is sample agnostic and finds variants in less than 2 h on a standard server. Furthermore, we present 3,135 variants that are missed using Illumina but supported independently by both PacBio and Oxford Nanopore reads. Clairvoyante is available open-source (
https://github.com/aquaskyline/Clairvoyante
), with modules to train, utilize and visualize the model.
Single Molecule Sequencing (SMS) technologies generate long but noisy reads data. Here, the authors develop Clairvoyante, a deep neural network-based method for variant calling with SMS reads such as PacBio and ONT data....
Alternative Titles
Full title
A multi-task convolutional deep neural network for variant calling in single molecule sequencing
Authors, Artists and Contributors
Author / Creator
Identifiers
Primary Identifiers
Record Identifier
TN_cdi_doaj_primary_oai_doaj_org_article_9ad545dd5e814a6f9351c9241b1c6be7
Permalink
https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_doaj_primary_oai_doaj_org_article_9ad545dd5e814a6f9351c9241b1c6be7
Other Identifiers
ISSN
2041-1723
E-ISSN
2041-1723
DOI
10.1038/s41467-019-09025-z