ReWorDs 2022 Keynote: Towards Orchestrating Distributed & Data-Intensive Workflows
2022 IEEE 18th International Conference on e-Science (e-Science)(2022)
摘要
Scientific exploration and hypothesis generation is increasingly dependent on the convergence of scientific modeling, data analytics, and machine learning. The result is data-intensive workflows that are composed of multiple stages of computation and communication between distributed and heterogeneous computing resources. The key bottleneck is frequently non-scalable data management, resulting in problematic data volumes and velocities. This talk will discuss our efforts characterizing, modeling, analyzing, and accelerating the performance of these emerging workloads. It will also sketch challenges of enabling workflows, written in a variety of languages, that have production-like performance in massively distributed environments with multiple computing paradigms.
更多查看译文
关键词
Data-intensive workflows,data scalability,distributed computation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要