Metric learning for comparing genomic data with triplet network
Metric learning for comparing genomic data with triplet network
About this item
Full title
Author / Creator
Ma, Zhi , Lu, Yang Young , Wang, Yiwen , Lin, Renhao , Yang, Zizi , Zhang, Fang and Wang, Ying
Publisher
Oxford: Oxford University Press
Journal title
Language
English
Formats
Publication information
Publisher
Oxford: Oxford University Press
Subjects
More information
Scope and Contents
Contents
Abstract
Many biological applications are essentially pairwise comparison problems, such as evolutionary relationships on genomic sequences, contigs binning on metagenomic data, cell type identification on gene expression profiles of single-cells, etc. To make pair-wise comparison, it is necessary to adopt suitable dissimilarity metric. However, not all the metrics can be fully adapted to all possible biological applications. It is necessary to employ metric learning based on data adaptive to the application of interest. Therefore, in this study, we proposed MEtric Learning with Triplet network (MELT), which learns a nonlinear mapping from original space to the embedding space in order to keep similar data closer and dissimilar data far apart. MELT is a weakly supervised and data-driven comparison framework that offers more adaptive and accurate dissimilarity learned in the absence of the label information when the supervised methods are not applicable. We applied MELT in three typical applications of genomic data comparison, including hierarchical genomic sequences, longitudinal microbiome samples and longitudinal single-cell gene expression profiles, which have no distinctive grouping information. In the experiments, MELT demonstrated its empirical utility in comparison to many widely used dissimilarity metrics. And MELT is expected to accommodate a more extensive set of applications in large-scale genomic comparisons. MELT is available at https://github.com/Ying-Lab/MELT....
Alternative Titles
Full title
Metric learning for comparing genomic data with triplet network
Authors, Artists and Contributors
Author / Creator
Identifiers
Primary Identifiers
Record Identifier
TN_cdi_proquest_miscellaneous_2709743395
Permalink
https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_miscellaneous_2709743395
Other Identifiers
ISSN
1467-5463
E-ISSN
1477-4054
DOI
10.1093/bib/bbac345