Fine-Tuning Dependencies with Parameters

International Conference on Extending Database Technology (EDBT)(2022)

引用 1|浏览0
暂无评分
摘要
Discovering dependencies from data has been studied extensively, with applications in data mining, data integration and data cleaning. Recent work has proposedmanually parameterized relaxations of traditional dependencies. For example, a Metric Functional Dependency (MFD) with a parameter δ asserts that any pair of tuples that agree on the antecedent attribute values must have similar (not necessarily equal) values of the consequent attributes within a distance of δ . To avoid human burden, we present a framework to automatically fine-tune parameterized dependencies and a proof-of-concept implementation for MFDs. To fine-tune a dependency, we produce a concise pattern tableau, i.e., conditions representing semantically meaningful subsets of the data, along with parameter values that hold within each subset. Our solution is based on a variant of the weighted set cover problem to ensure conciseness, data coverage, and tight parameter values. We demonstrate the efficiency and effectiveness of our approach by analyzing its performance on two real-life datasets.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要