Analysis of large sparse graphs using regular decomposition of graph distance matrices

arXiv (Cornell University)(2018)

引用 9|浏览14
暂无评分
摘要
Statistical analysis of large and sparse graphs is a challenging problem in data science due to the high dimensionality and nonlinearity of the problem. This paper presents a fast and scalable algorithm for partitioning such graphs into disjoint groups based on observed graph distances from a set of reference nodes. The resulting partition provides a low-dimensional approximation of the full distance matrix which helps to reveal global structural properties of the graph using only small samples of the distance matrix. The presented algorithm is inspired by the information-theoretic minimum description principle. We investigate the performance of this algorithm for selected real data sets and for synthetic graph data sets generated using stochastic block models and power-law random graphs, together with analytical considerations for sparse stochastic block models with bounded average degrees.
更多
查看译文
关键词
statistical analysis,graph distance matrices,sparse stochastic block models,power-law random graphs,information-theoretic minimum description principle,distance matrix,low-dimensional approximation,observed graph distances,data science,sparse graphs
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要