Public record aggregation using semi-supervised entity resolution

ICAIL(2011)

引用 2|浏览0
暂无评分
摘要
This paper describes a highly scalable state of the art record aggregation system and the backbone infrastructure developed to support it. The system, called PeopleMap, allows legal professionals to effectively and efficiently explore a broad spectrum of public records databases by way of a single person-centric search. The backbone support system, called Concord, is a toolkit that allows developers to economically create record resolution solutions. The PeopleMap system is capable of linking billions of public records to a master data set consisting of hundreds of millions of person records. It was constructed using successive applications of Concord to link disparate public record data sets to a central person authority file. To our knowledge, the PeopleMap system is the largest of its kind. In contrast, the Concord support system is a novel record linkage tool that uses a new semi-supervised training technique called `surrogate learning' to enable the rapid development of record resolution solutions.
更多
查看译文
关键词
public records databases,person record,disparate public record data,art record aggregation system,record resolution solution,public record aggregation,peoplemap system,backbone support system,novel record linkage tool,concord support system,public record,semi-supervised entity resolution,record linkage,spectrum,entity resolution,evaluation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要