Join Sampling under Acyclic Degree Constraints and (Cyclic) Subgraph Sampling
CoRR(2023)
摘要
Given a join with an acyclic set of degree constraints, we show how to draw a
uniformly random sample from the join result in $O(\mathit{polymat}/ \max \{1,
\mathrm{OUT} \})$ expected time after a preprocessing of $O(\mathrm{IN})$
expected time, where $\mathrm{IN}$, $\mathrm{OUT}$, and $\mathit{polymat}$ are
the join's input size, output size, and polymatroid bound, respectively. This
compares favorably with the state of the art (Deng et al.\ and Kim et al., both
in PODS'23), which states that a uniformly random sample can be drawn in
$\tilde{O}(\mathrm{AGM} / \max \{1, \mathrm{OUT}\})$ expected time after a
preprocessing phase of $\tilde{O}(\mathrm{IN})$ expected time, where
$\mathrm{AGM}$ is the join's AGM bound.
We then utilize our techniques to tackle {\em directed subgraph sampling}.
Let $G = (V, E)$ be a directed data graph where each vertex has an out-degree
at most $\lambda$, and let $P$ be a directed pattern graph with $O(1)$
vertices. The objective is to uniformly sample an occurrence of $P$ in $G$. The
problem can be modeled as join sampling with input size $\mathrm{IN} =
\Theta(|E|)$ but, whenever $P$ contains cycles, the converted join has {\em
cyclic} degree constraints. We show that it is always possible to throw away
certain degree constraints such that (i) the remaining constraints are acyclic
and (ii) the new join has asymptotically the same polymatroid bound
$\mathit{polymat}$ as the old one. Combining this finding with our new join
sampling solution yields an algorithm to sample from the original (cyclic) join
(thereby yielding a uniformly random occurrence of $P$) in $O(\mathit{polymat}/
\max \{1, \mathrm{OUT}\})$ expected time after $O(|E|)$ expected-time
preprocessing. We also prove similar results for {\em undirected subgraph
sampling} and demonstrate how our techniques can be significantly simplified in
that scenario.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要