RALF: Accuracy-Aware Scheduling for Feature Store Maintenance

PROCEEDINGS OF THE VLDB ENDOWMENT (2023)

Abstract
Feature stores (also sometimes referred to as embedding stores) are becoming ubiquitous in model serving systems: downstream applications query these stores for auxiliary inputs at inference time. Stored features are derived by featurizing rapidly changing base data sources. Featurization can be prohibitively expensive to trigger on every data update, particularly for features that are vector embeddings computed by a model. Yet existing systems naively apply a one-size-fits-all policy for when and how to update these features, without considering query access patterns or impacts on prediction accuracy. This paper introduces RALF, which orchestrates feature updates by leveraging downstream error feedback to minimize feature store regret, a metric for how much stale features degrade downstream prediction accuracy. We evaluate with representative feature store workloads, anomaly detection and recommendation, using real-world datasets. We run system experiments with a 275,077-key anomaly detection workload on 800 cores to show up to a 32.7% reduction in prediction error, or up to a 1.6x reduction in compute cost, with accuracy-aware scheduling.
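
The abstract describes RALF's core idea: use downstream error feedback to decide which stale features are worth re-computing under a limited compute budget. The snippet below is a minimal illustrative sketch of such an accuracy-aware scheduler, not RALF's actual API; the class name, the regret proxy (per-key error feedback weighted by query frequency), and helpers such as featurize, budget, and run_round are assumptions made for illustration only.

```python
# A minimal sketch (not the authors' implementation) of accuracy-aware update
# scheduling: keys whose stale features are estimated to contribute the most
# downstream error are re-featurized first, within a fixed per-round budget.
# `featurize`, `budget`, and `estimated_regret` are illustrative assumptions.

import heapq
from collections import defaultdict
from typing import Callable, Dict, Hashable, Iterable


class AccuracyAwareScheduler:
    def __init__(self, featurize: Callable[[Hashable], object], budget: int):
        self.featurize = featurize                 # cost-dominant feature computation
        self.budget = budget                       # max keys re-featurized per round
        self.pending = set()                       # keys with un-applied base-data updates
        self.error_feedback = defaultdict(float)   # downstream error attributed to each key
        self.query_count = defaultdict(int)        # query frequency per key
        self.store: Dict[Hashable, object] = {}    # the feature table itself

    def on_update(self, key: Hashable) -> None:
        """Base data for `key` changed; mark its stored feature as stale."""
        self.pending.add(key)

    def on_query(self, key: Hashable) -> object:
        """Serve the (possibly stale) feature and record the access."""
        self.query_count[key] += 1
        return self.store.get(key)

    def on_feedback(self, key: Hashable, error: float) -> None:
        """Record downstream prediction error observed for queries on `key`."""
        self.error_feedback[key] = error

    def estimated_regret(self, key: Hashable) -> float:
        # Regret proxy: expected accuracy recovered by refreshing this key,
        # weighted by how often the key is actually queried.
        return self.error_feedback[key] * (1 + self.query_count[key])

    def run_round(self) -> Iterable[Hashable]:
        """Re-featurize the highest-regret stale keys within the budget."""
        chosen = heapq.nlargest(self.budget, self.pending, key=self.estimated_regret)
        for key in chosen:
            self.store[key] = self.featurize(key)
            self.pending.discard(key)
            self.query_count[key] = 0   # reset per-round access statistics
        return chosen
```

Under these assumptions, a periodic call to run_round() refreshes only the keys whose staleness is estimated to hurt accuracy the most, which is how an accuracy-aware policy can trade a bounded compute budget against prediction error, as the abstract's results suggest.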