High Performance Processing of Streaming Data

Supun Kamburugamuve, Milinda Pathirage,Saliya Ekanayake,Geoffrey C. Fox

HIPCW '15: Proceedings of the 2015 IEEE 22nd International Conference on High Performance Computing Workshops (HiPCW)(2015)

引用 0|浏览54
暂无评分
摘要
Recent advancements in the cloud-based technologies and the Internet of Things (IoT) has fueled a class of Dynamic Data Driven Application Systems (DDDAS)1, where realtime computing requirements of mobile robots and other IoT related devices are offloaded to clouds in order to improve mobility of the devices and accuracy of computing tasks. Due to the event-based nature of these applications, modern distributed stream processing frameworks (DSPF) such as Apache Storm [2] provide an effective platform to meet their needs. A good example is a parallel real time distributed streaming algorithm implementing simultaneous localization and mapping (SLAM) for mobile robots [1] in the cloud. This algorithm requires low latency parallel processing with strict guarantees. Current DSPFs are designed to cater to traditional event processing tasks, such as extract transformation and load (ETL) pipelines, counting, cardinality estimation, frequent itemsets finding, windowed aggregations and joins or pattern detection. The above-mentioned novel applications with strict real time guarantees demand support for low latency synchronous and asynchronous parallel processing of events, requiring efficient broadcast and gathering of data streams. Our research into high performance computing on streaming data has found that the lack of efficient communication for data distribution operations like broadcasting in current DSPF implementations is limiting the performance of these applications when the parallelism of the processing increases.Distributed streaming applications are generally defined as a directed graph where process elements are connected through edges. Data flow occurs through the edges of the directed graph as streams of events. With naive implementations, a collective operation such as broadcast happens through separate communication links (edges) from the source to each target. Such communications can be made efficient by modeling them based on data structures such as trees. In the last couple of decades, different broadcasting algorithms have been perfected for HPC applications using technologies like MPI [3]. In this work we propose to adapt these algorithms to improve broadcasting for DSPFs considering attributes of the special class of streaming algorithms mentioned above and those of cloud runtime environments.We have implemented several broadcasting algorithms for an open source fork2 of a DSPF called Apache Storm. The primary algorithms we implement are flat tree with node-based variation, binary tree and pipeline. Storm utilizes both process based and thread based parallel execution of 1 http://www.1dddas.org/ 2 https://github.com/alibaba/jstorm events. The tasks of a Storm streaming application run in different processes in different nodes. Tasks running in the same process can use the memory to communicate, while others running in different processes utilize networks. The communication algorithms are optimized to consider the task locality in order to improve network and inter-process communications. To test the system we use the SLAM streaming algorithm mentioned above and a simple stream processing application with synchronous and asynchronous processing at the parallel tasks for evaluating the behavior of the broadcasting algorithms with different data sizes and nodes. The tests are conducted on an OpenStack-based cloud test bed and a HPC cluster.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要