Monitoring Energy Consumption With SIOX Autonomous Monitoring Triggered by Abnormal Energy Consumption

semanticscholar(2014)

引用 0|浏览0
暂无评分
摘要
In the face of the growing complexity of HPC systems, their growing energy costs, and the increasing difficulty to run applications efficiently, a number of monitoring tools have been developed during the last years. SIOX is one such endeavor, with a uniquely holistic approach: Not only does it aim to record a certain kind of data, but to make all relevant data available for analysis and optimization. Among other sources, this encompasses data from hardware energy counters and trace data from different hardware/software layers. However, not all data that can be recorded should be recorded. As such, SIOX needs good heuristics to determine when and what data needs to be collected, and the energy consumption can provide an important signal about when the system is in a state that deserves closer attention. In this paper, we show that SIOX can use Likwid to collect and report the energy consumption of applications, and present how this data can be visualized using SIOX’s web-interface. Furthermore, we outline how SIOX can use this information to intelligently adjust the amount of data it collects, allowing it to reduce the monitoring overhead while still providing complete information about critical situations. Julian M. Kunkel DKRZ GmbH Hamburg, Germany E-mail: kunkel@dkrz.de Alvaro Aguilera ZIH, TU Dresden Dresden, Germany E-mail: alvaro.aguilera@tu-dresden.de Hübbe—Wiedemann—Zimmer University of Hamburg Hamburg, Germany E-mail: {huebbe, wiedemann, fzimmer}@informatik.uni-hamburg.de
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要