Development and validation of a reliable DNA copy-number-based machine learning algorithm (CopyClust) for breast cancer integrative cluster classification

Cameron C Young, Katherine Eason, Raquel Manzano Garcia, Richard Moulange, Sach Mukherjee, Suet-Feung C Chin, Carlos Caldas, Oscar M Rueda

bioRxiv (2023)

Abstract
The Integrative Clusters (IntClusts) provide a framework for classifying breast cancer tumors into 10 distinct genomic subtypes based on DNA copy number and gene expression. Current classifiers achieve only low accuracy without gene expression data, warranting the development of new approaches to IntClust classification based on copy number alone. A novel XGBoost-driven classification algorithm, CopyClust, was trained using genomic features from METABRIC and validated on TCGA, achieving an improvement of nine percentage points or more in overall IntClust subtype classification accuracy.

Competing Interest Statement
C.C. is a member of the iMED External Science Panel for AstraZeneca and the Scientific Advisory Board for Illumina, and is a recipient of research grants (administered by the University of Cambridge) from AstraZeneca, Genentech, Roche, and Servier. The remaining authors declare no competing interests.