Alchemist: An Apache Spark MPI Interface
arXiv: Distributed, Parallel, and Cluster Computing, Volume abs/1806.01270, 2018.
The Apache Spark framework for distributed computation is popular in the data analytics community due to its ease of use, but its MapReduce-style programming model can incur significant overheads when performing computations that do not map directly onto this model. One way to mitigate these costs is to off-load computations onto MPI code...