MLlib: Machine Learning in Apache Spark
Journal of Machine Learning Research, Volume abs/1505.06807, Issue 1, 2016.
Apache Spark is a popular open-source platform for large-scale data processing that is well-suited for iterative machine learning tasks. In this paper we present MLlib, Spark's open-source distributed machine learning library. MLlib provides efficient functionality for a wide range of learning settings and includes several underlying st...More
PPT (Upload PPT)