Development and validation of a reliable DNA copy-number-based machine learning algorithm (CopyClust...
Development and validation of a reliable DNA copy-number-based machine learning algorithm (CopyClust) for breast cancer integrative cluster classification
About this item
Full title
Author / Creator
Publisher
Cold Spring Harbor: Cold Spring Harbor Laboratory Press
Journal title
Language
English
Formats
Publication information
Publisher
Cold Spring Harbor: Cold Spring Harbor Laboratory Press
Subjects
More information
Scope and Contents
Contents
The Integrative Clusters (IntClusts) provide a framework for the classification of breast cancer tumors into 10 distinct genomic subtypes based on DNA copy number and gene expression. Current classifiers achieve only low accuracy without gene expression data, warranting the development of new approaches to copy-number-only-based IntClust classification. A novel XGBoost-driven classification algorithm, CopyClust, was trained using genomic features from METABRIC and validated on TCGA achieving a nine-percentage point or greater improvement in overall IntClust subtype classification accuracy.Competing Interest StatementC.C. is a member of the iMED External Science Panel for AstraZeneca, the Scientific Advisory Board for Illumina, and is a recipient of research grants (administered by the University of Cambridge) from AstraZeneca, Genentech, Roche, and Servier. The remaining authors declare no competing interests.Footnotes* https://github.com/camyoung54/CopyClust* https://www.cbioportal.org/study/summary?id=brca_metabric* https://www.cbioportal.org/study/summary?id=brca_tcga...
Alternative Titles
Full title
Development and validation of a reliable DNA copy-number-based machine learning algorithm (CopyClust) for breast cancer integrative cluster classification
Authors, Artists and Contributors
Identifiers
Primary Identifiers
Record Identifier
TN_cdi_proquest_journals_2892415160
Permalink
https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_2892415160
Other Identifiers
E-ISSN
2692-8205
DOI
10.1101/2023.11.21.568129
How to access this item
https://www.proquest.com/docview/2892415160?pq-origsite=primo&accountid=13902