Web Service aggregation with string distance ensembles and active probe selection

Information Fusion（2008）

引用 6|浏览6

暂无评分

摘要

The adoption of standards for exchanging information across the Web presents both new opportunities and important challenges for data integration and aggregation. Although Web Services simplify the discovery and access of information sources, the problem of semantic heterogeneity remains: how to find semantic correspondences across the data being integrated. In this paper, we explore these issues in the context of Web Services, and propose OATS, a novel algorithm for schema matching that is specifically suited to Web Service data aggregation. We show how probing Web Services with a small set of related queries results in semantically correlated data instances which greatly simplifies the matching process, and demonstrate that the use of an ensemble of string distance metrics in matching data instances performs better than individual metrics. We also show how the choice of probe queries has a dramatic effect on matching accuracy. Motivated by this observation, we describe and evaluate an machine learning approach to selecting probes to maximise accuracy while minimising cost.

查看译文

关键词

web service data aggregation,information source,schema matching,aggregation,individual metrics,data instance,web service aggregation,active probe selection,web services,schema integration,data integration,string distance ensemble,matching process,related queries result,semantically correlated data instance,data integrity,web service,data aggregation,machine learning,distance metric

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要