Scaling Spark in the Real World: Performance and Usability
Proceedings of the VLDB Endowment, Volume 8, Issue 12, 2015.
Apache Spark is one of the most widely used open source processing engines for big data, with rich language-integrated APIs and a wide range of libraries. Over the past two years, our group has worked to deploy Spark to a wide range of organizations through consulting relationships as well as our hosted service, Databricks. We describe th...