GPU-Embedding of kNN-Graph Representing Large and High-Dimensional Data.

International Conference on Conceptual Structures (2020)

Abstract
Interactive visual exploration of large, multidimensional data still needs more efficient ND → 2D data embedding (DE) algorithms. We claim that visualizing very high-dimensional data is equivalent to the problem of embedding undirected kNN-graphs in 2D. We demonstrate that high-quality embeddings can be produced with minimal time and memory complexity. We present a very efficient GPU version of the IVHD (interactive visualization of high-dimensional data) algorithm and compare it to the state-of-the-art GPU-implemented DE methods BH-SNE-CUDA and AtSNE-CUDA. We show that the memory and time requirements of IVHD-CUDA are radically lower than those of the baseline codes. For example, IVHD-CUDA is almost 30 times faster than AtSNE-CUDA in embedding the largest dataset, YAHOO (M = 1.4 · 10^6), not counting kNN graph generation, which is the same for all the methods. We conclude that, at the expense of a minor deterioration in embedding quality compared to the baseline algorithms, IVHD preserves the main structural properties of ND data in 2D well, for a radically lower computational budget. Thus, our method can be a good candidate for truly big data (M = 10^8+) interactive visualization.
Keywords
High-dimensional data, Data embedding, kNN graph visualization, GPU implementation
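The abstract's central idea, that visualizing ND data can be reduced to embedding an undirected kNN graph in 2D, can be illustrated with a small CPU sketch. This is not the IVHD-CUDA implementation and is not taken from the paper: the function names (knn_graph, embed_graph, mean_distance, demo), the binary target distances (0 for kNN neighbours, 1 for randomly sampled non-neighbours), the squared-error objective, and all parameter values are assumptions chosen for illustration only.

```python
import math
import random


def knn_graph(points, k):
    """Brute-force undirected kNN graph, returned as a set of (i, j) with i < j."""
    edges = set()
    for i, p in enumerate(points):
        by_dist = sorted((math.dist(p, q), j) for j, q in enumerate(points) if j != i)
        for _, j in by_dist[:k]:
            edges.add((min(i, j), max(i, j)))
    return edges


def embed_graph(n, edges, n_random=1, steps=300, lr=0.05, seed=0):
    """Place n vertices in 2D using only graph structure (no ND coordinates).

    Assumption for this sketch: graph neighbours get target distance 0 and
    randomly sampled non-neighbours get target distance 1; each pair takes a
    gradient step on (d - target)^2 / 2.
    """
    rng = random.Random(seed)
    pos = [[rng.uniform(-1.0, 1.0), rng.uniform(-1.0, 1.0)] for _ in range(n)]
    edge_list = sorted(edges)
    for _step in range(steps):
        pairs = [(i, j, 0.0) for i, j in edge_list]
        for i in range(n):
            for _draw in range(n_random):
                j = rng.randrange(n)
                if j != i and (min(i, j), max(i, j)) not in edges:
                    pairs.append((i, j, 1.0))
        for i, j, target in pairs:
            dx = pos[i][0] - pos[j][0]
            dy = pos[i][1] - pos[j][1]
            d = math.hypot(dx, dy) + 1e-9  # avoid division by zero
            g = lr * (d - target) / d
            pos[i][0] -= g * dx
            pos[i][1] -= g * dy
            pos[j][0] += g * dx
            pos[j][1] += g * dy
    return pos


def mean_distance(pos, pairs):
    """Average 2D Euclidean distance over the given index pairs."""
    return sum(math.dist(pos[i], pos[j]) for i, j in pairs) / len(pairs)


def demo(seed=0):
    """Embed two well-separated 10-D Gaussian clusters via their kNN graph."""
    rng = random.Random(seed)
    dim, size = 10, 12
    points = [[c + rng.gauss(0.0, 0.5) for _ in range(dim)]
              for c in (0.0, 10.0) for _ in range(size)]
    edges = knn_graph(points, k=6)
    pos = embed_graph(len(points), edges, seed=seed)
    n = 2 * size
    all_pairs = [(i, j) for i in range(n) for j in range(i + 1, n)]
    intra = [(i, j) for i, j in all_pairs if (i < size) == (j < size)]
    inter = [(i, j) for i, j in all_pairs if (i < size) != (j < size)]
    return edges, mean_distance(pos, intra), mean_distance(pos, inter)


if __name__ == "__main__":
    _, intra, inter = demo()
    print(f"mean intra-cluster distance: {intra:.3f}")
    print(f"mean inter-cluster distance: {inter:.3f}")
```

Note that embed_graph never sees the original ND coordinates, only the graph, which is what makes the memory footprint independent of the input dimensionality. The per-pair updates are also independent enough to parallelize, which is presumably what makes this family of methods a natural fit for a GPU; the sequential loop here is only for readability.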