A two-sample tree-based test for hierarchically organized genomic signals

Journal of the Royal Statistical Society Series C: Applied Statistics (2024)

Abstract
This article addresses a common type of data encountered in genomic studies, where a signal along a linear chromosome exhibits a hierarchical organization. We propose a novel framework to assess the significance of dissimilarities between two sets of genomic matrices obtained from distinct biological conditions. Our approach relies on a tree-based representation of the data. It uses tree distances and an aggregation procedure for tests performed at the level of leaf pairs. Numerical experiments demonstrate its statistical validity and its superior accuracy and power compared to alternatives. The method's effectiveness is illustrated on real-world GWAS and Hi-C data.
Keywords
cophenetic distances, moderated t statistics, p-value aggregation, tree distances
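
The pipeline summarized in the abstract (one tree per matrix, tree distances per leaf pair, leaf-pair tests, aggregation) can be illustrated with a rough sketch. This is not the authors' implementation. Several choices below are assumptions made only for illustration: average-linkage clustering to build the trees, a Welch t-test standing in for the moderated t statistics named in the keywords, and a Bonferroni minimum-p rule standing in for the paper's aggregation procedure. The function names `cophenetic_vector`, `leaf_pair_test`, and `simulate_group`, and the synthetic data generator, are hypothetical.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, cophenet
from scipy.spatial.distance import squareform
from scipy.stats import ttest_ind


def cophenetic_vector(dissimilarity):
    """Cophenetic distances (one per leaf pair) of a tree built from a
    symmetric dissimilarity matrix. Average linkage is an assumption."""
    condensed = squareform(dissimilarity)        # square -> condensed form
    tree = linkage(condensed, method="average")  # hierarchical clustering
    return cophenet(tree)                        # condensed cophenetic distances


def leaf_pair_test(group_a, group_b):
    """Test each leaf pair across two groups of matrices, then aggregate.

    Welch's t-test and the Bonferroni minimum-p aggregation are stand-ins,
    not the procedures used in the paper.
    """
    coph_a = np.array([cophenetic_vector(m) for m in group_a])
    coph_b = np.array([cophenetic_vector(m) for m in group_b])
    _, pvals = ttest_ind(coph_a, coph_b, axis=0, equal_var=False)
    pvals = np.nan_to_num(pvals, nan=1.0)        # degenerate pairs count as non-significant
    global_p = min(1.0, pvals.size * pvals.min())
    return pvals, global_p


def simulate_group(base, n_replicates, rng, noise=0.1):
    """Hypothetical generator: noisy symmetric copies of a base matrix."""
    mats = []
    for _ in range(n_replicates):
        m = np.abs(base + rng.normal(scale=noise, size=base.shape))
        m = (m + m.T) / 2.0                      # enforce symmetry
        np.fill_diagonal(m, 0.0)                 # zero self-dissimilarity
        mats.append(m)
    return mats


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    idx = np.arange(6)
    base = np.abs(np.subtract.outer(idx, idx)).astype(float)
    group_a = simulate_group(base, 5, rng)
    group_b = simulate_group(base * 2.0, 5, rng)  # synthetic "condition" effect
    pvals, global_p = leaf_pair_test(group_a, group_b)
    print(pvals.shape, global_p)
```

The Bonferroni rule is used here only because it remains valid under arbitrary dependence between leaf-pair tests. It is conservative, and a dedicated aggregation procedure such as the one the abstract refers to would be expected to have more power.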