SparkFuzz: searching correctness regressions in modern query engines
SIGMOD/PODS '20: International Conference on Management of Data Portland Oregon June, 2020, pp. 1-6, 2020.
With more than 1200 contributors, Apache Spark is one of the most actively developed open source projects. At this scale and pace of development, mistakes are bound to happen. In this paper we present SparkFuzz, a toolkit we developed at Databricks for uncovering correctness errors in the Spark SQL engine. To guard the system against corr...More
Full Text (Upload PDF)
PPT (Upload PPT)