Distributed Parallelization of Semantic Web Java Applications by Means of the Message-Passing Interface

mag(2011)

引用 1|浏览3
暂无评分
摘要
In the recent years, performance has become a key point for a number of Java applications. For some of them, such as from the Semantic Web domain, where the size and the scale of the analyzed data is of a big challenge for a conventional computer, use of the High Performance Computing (HPC) systems is a major factor in achieving the required scalability and performance demands. Parallelization is a key mechanism that leverages HPC for such applications. However, the high development effort for a scalable parallel application has been a major drawback towards the efficient application of HPC to the applications designed for a serial execution only. The Message-Passing Interface (MPI) is a well-known programming standard for large-scale parallel applications development. However, MPI has found its most wide use in the applications written in C and Fortran. We show how MPI can be beneficially applied for the parallelization of the Java applications as well. We describe a parallel implementation of a Random Indexing application that performs similarity search in the large text corpora on the web, which allowed us to improve the performance by up to 33 times on the already 16 nodes of a testbed HPC system.
更多
查看译文
关键词
Domain Decomposition, High Performance Computing, Query Expansion, Java Application, Random Indexing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要