Finding errors in the Enron spreadsheet corpus.

Symposium on Visual Languages and Human Centric Computing VL HCC(2016)

引用 17|浏览50
暂无评分
摘要
Spreadsheet environments like MS Excel are the most widespread type of end-user software development tools and spreadsheet-based applications can be found almost everywhere in organizations. Since spreadsheets are prone to error, several approaches were proposed in the research literature to help users locate formula errors. However, the proposed methods were often designed based on assumptions about the nature of errors and were evaluated with mutations of correct spreadsheets. In this work we propose a method and tool to identify real-world formula errors within the Enron spreadsheet corpus. Our approach is based on heuristics that help us identify versions of the same spreadsheet and our software helps the user identify spreadsheets of which we assume that they contain error corrections. An initial manual inspection of a subset of such candidates led to the identification of more than two dozen formula errors. We publicly share the new collection of real-world spreadsheet errors.
更多
查看译文
关键词
error finding,Enron spreadsheet corpus,spreadsheet environment,MS Excel,end-user software development tool,spreadsheet-based application,formula error location,error correction,formula error identification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要