Is graph biased feature selection of genes better than random?

Hashir Mohammad, Bertin Paul, Weiss Martin, Frappier Vincent, Perkins Theodore, Boucher Geneviève, Cohen Joseph Paul

arXiv (2019)

Abstract

Gene interaction graphs aim to capture various relationships between genes and can represent decades of biology research. When making predictions from genomic data, these graphs could help overcome the curse of dimensionality by making machine learning models sparser and by biasing them toward established biological knowledge. In this work, we assess whether these graphs capture dependencies seen in gene expression data better than random graphs do. We formulate a condition that a graph should satisfy to provide a good bias and propose to test it using a 'Single Gene Inference' (SGI) task. We compare random graphs with seven major gene interaction graphs published by different research groups, aiming to measure the true benefit of using biologically relevant graphs in this context. Our analysis finds that dependencies can be captured almost as well with random graphs, which suggests that, in terms of gene expression levels, the relevant information about the state of the cell is spread across many genes.
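
The abstract does not spell out the SGI protocol, so the following is only a minimal sketch under stated assumptions: the task is taken to be predicting whether one target gene is expressed above its mean, using either its first-degree neighbors in a graph or an equally sized random set of genes as features. The helper names (`graph_neighbors`, `random_genes`, `sgi_score`), the nearest-centroid classifier, the accuracy metric, and the synthetic data are all illustrative choices, not the authors' method or results.

```python
import random

def graph_neighbors(graph, target):
    """First-degree neighbors of `target` in an adjacency dict (assumed format)."""
    return sorted(graph.get(target, ()))

def random_genes(all_genes, target, k, rng):
    """Random baseline: k genes sampled uniformly, excluding the target."""
    pool = [g for g in all_genes if g != target]
    return sorted(rng.sample(pool, k))

def nearest_centroid_accuracy(X_train, y_train, X_test, y_test):
    """Stand-in classifier (an assumption): assign each sample to the closest class mean."""
    centroids = {}
    for label in (0, 1):
        rows = [x for x, y in zip(X_train, y_train) if y == label]
        if rows:
            centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]

    def predict(x):
        return min(centroids,
                   key=lambda c: sum((a - b) ** 2 for a, b in zip(x, centroids[c])))

    hits = sum(predict(x) == y for x, y in zip(X_test, y_test))
    return hits / len(y_test)

def sgi_score(expr, target, features, n_train):
    """Predict the binarized target gene from `features`; return test accuracy."""
    values = expr[target]
    # Threshold is computed on training samples only, to avoid leaking test data.
    threshold = sum(values[:n_train]) / n_train
    y = [1 if v > threshold else 0 for v in values]
    X = [[expr[g][i] for g in features] for i in range(len(values))]
    return nearest_centroid_accuracy(X[:n_train], y[:n_train], X[n_train:], y[n_train:])

# Synthetic toy data (hypothetical): every gene is a noisy copy of one latent state.
rng = random.Random(0)
genes = [f"g{i}" for i in range(20)]
state = [rng.gauss(0, 1) for _ in range(200)]
expr = {g: [s + rng.gauss(0, 1) for s in state] for g in genes}
graph = {"g0": {"g1", "g2", "g3"}}  # toy adjacency, not a real interaction graph

graph_feats = graph_neighbors(graph, "g0")
rand_feats = random_genes(genes, "g0", len(graph_feats), rng)
graph_acc = sgi_score(expr, "g0", graph_feats, n_train=150)
random_acc = sgi_score(expr, "g0", rand_feats, n_train=150)
```

Matching the size of the random set to the neighbor set keeps the comparison fair: any difference between `graph_acc` and `random_acc` then reflects which genes were chosen, not how many. On toy data like this the two scores say nothing about real interaction graphs.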

Keywords

biased feature selection, feature selection, genes, random
