Learning a Hierarchical Monitoring System for Detecting and Diagnosing Service Issues
ACM Knowledge Discovery and Data Mining, pp. 2029-2038, 2015.
We propose a machine learning based framework for building a hierarchical monitoring system to detect and diagnose service issues. We demonstrate its use for building a monitoring system for a distributed data storage and computing service consisting of tens of thousands of machines. Our solution has been deployed in production as an end-...More
Get fulltext within 24h
Full Text (Upload PDF)
PPT (Upload PPT)