A framework for scRNA-seq data clustering based on multi-view feature integration

BIOMEDICAL SIGNAL PROCESSING AND CONTROL(2024)

引用 0|浏览2
暂无评分
摘要
Accurate and consistent estimation of cell-to-cell similarity is crucial for clustering single-cell RNA-seq (scRNAseq) data. However, the high sparsity of scRNA-seq data often leads to suboptimal mining and decreased accuracy in identifying cell types. Moreover, using a larger number of features (genes) does not necessarily improve clustering accuracy due to redundant information. In this paper, we propose a framework, called scMVFI (singlecell Multi-View Feature Integration), which integrates linear and non-linear features of scRNA-seq data to address the disadvantage of zero-inflated noise caused by technical factors. By employing an autoencoder model for data reconstruction, scMVFI performs multi-view similarity estimation using subsets of features with different sampling rates to identify highly similar cell pairs. We evaluate the effectiveness of scMVFI using five real scRNAseq datasets and three simulated datasets. The results demonstrate that scMVFI can effectively mitigate the impact of data "dropout" events compared to other methods. Moreover, the affinity networks constructed from both linear and non-linear perspectives can accurately capture sample relationships, thereby enhancing the identification of cell types when combined with existing clustering methods.
更多
查看译文
关键词
scRNA-seq,Dropout,Autoencoder,Feature integration,Clustering
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要