Towards Neural Synthesis for SMT-Assisted Proof-Oriented Programming
arxiv(2024)
摘要
Proof-oriented programs mix computational content with proofs of program
correctness. However, the human effort involved in programming and proving is
still substantial, despite the use of Satisfiability Modulo Theories (SMT)
solvers to automate proofs in languages such as F*.
Seeking to spur research on using AI to automate the construction of
proof-oriented programs, we curate a dataset of 600K lines of open-source F*
programs and proofs, including software used in production systems ranging from
Windows and Linux, to Python and Firefox. Our dataset includes around 32K
top-level F* definitions, each representing a type-directed program and proof
synthesis problem – producing a definition given a formal specification
expressed as an F* type. We provide a program-fragment checker that queries F*
to check the correctness of candidate solutions. We believe this is the largest
corpus of SMT-assisted program proofs coupled with a reproducible
program-fragment checker.
Grounded in this dataset, we investigate the use of AI to synthesize programs
and their proofs in F*, with promising results. Our main finding in that the
performance of fine-tuned smaller language models (such as Phi-2 or StarCoder)
compare favorably with large language models (such as GPT-4), at a much lower
computational cost. We also identify various type-based retrieval augmentation
techniques and find that they boost performance significantly. With detailed
error analysis and case studies, we identify potential strengths and weaknesses
of models and techniques and suggest directions for future improvements.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要