Leveraging Public Data For Practical Private Query Release

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139(2021)

引用 52|浏览80
暂无评分
摘要
In many statistical problems, incorporating priors can significantly improve performance. However, the use of prior knowledge in differentially private query release has remained underexplored, despite such priors commonly being available in the form of public datasets, such as previous US Census releases. With the goal of releasing statistics about a private dataset, we present PMWPub, which-unlike existing baselines-leverages public data drawn from a related distribution as prior information. We provide a theoretical analysis and an empirical evaluation on the American Community Survey (ACS) and ADULT datasets, which shows that our method outperforms state-of-the-art methods. Furthermore, PMWPub scales well to high-dimensional data domains, where running many existing methods would be computationally infeasible.
更多
查看译文
关键词
public data,release
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要