Algorithmic Improvement of Crowdsourced Data: Intrinsic Quality Measures, Local Optima, and Consensus

Thomas C. van Dijk, Norbert Fischer, Bernhard Häussner

SIGSPATIAL '20: 28th International Conference on Advances in Geographic Information Systems Seattle WA USA November, 2020（2020）

引用 1|浏览14

暂无评分

摘要

Raw crowdsourced data is often of questionable quality. The typical solution to this is redundancy: ask multiple independent participants the same question and take some form of majority answer. However, this can be wasteful in terms of human effort. In this paper we show that algorithmic analysis of the data is able to get higher quality results out of a given amount of crowd effort (or alternatively, that less crowd effort would have sufficed for the same level of quality). Our case study is based on a publicly available crowdsourced data set by the New York Public Library, featuring building footprints in historical insurance atlases. Besides evaluating the quality improvement achieved by our methods, we provide both a command line interface for batch-mode processing and an interactive web interface; both work with standard data formats and are available as open source software.

查看译文

关键词

crowdsourced data,improvement

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要