Scalable community-driven data sharing in e-science grids

Future Generation Computer Systems(2009)

引用 26|浏览0
暂无评分
摘要
E-science projects of various disciplines face a fundamental challenge: thousands of users want to obtain new scientific results by application-specific and dynamic correlation of data from globally distributed sources. Considering the involved enormous and exponentially growing data volumes, centralized data management reaches its limits. Since scientific data are often highly skewed and exploration tasks exhibit a large degree of spatial locality, we propose the locality-aware allocation of data objects onto a distributed network of interoperating databases. HiSbase is an approach to data management in scientific federated Data Grids that addresses the scalability issue by combining established techniques of database research in the field of spatial data structures (quadtrees), histograms, and parallel databases with the scalable resource sharing and load balancing capabilities of decentralized Peer-to-Peer (P2P) networks. The proposed combination constitutes a complementary e-science infrastructure enabling load balancing and increased query throughput.
更多
查看译文
关键词
data sharing (h.3.5),load balancing,scalable community-driven data,spatial data structure,scientific data,data volume,data management,scientific federated data grids,new scientific result,centralized data management,interoperating databases,data object,e-science grid,scientific databases (h.2.8),distributed databases (h.2.4),resource sharing,load balance,data grid,distributed database
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要