ReWorDs 2022 Keynote: Towards Orchestrating Distributed & Data-Intensive Workflows

2022 IEEE 18th International Conference on e-Science (e-Science)(2022)

引用 0|浏览1
暂无评分
摘要
Scientific exploration and hypothesis generation is increasingly dependent on the convergence of scientific modeling, data analytics, and machine learning. The result is data-intensive workflows that are composed of multiple stages of computation and communication between distributed and heterogeneous computing resources. The key bottleneck is frequently non-scalable data management, resulting in problematic data volumes and velocities. This talk will discuss our efforts characterizing, modeling, analyzing, and accelerating the performance of these emerging workloads. It will also sketch challenges of enabling workflows, written in a variety of languages, that have production-like performance in massively distributed environments with multiple computing paradigms.
更多
查看译文
关键词
Data-intensive workflows,data scalability,distributed computation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要