Loading…
Data By the Bay has ended
Data By the Bay is the first Data Grid conference matrix with 6 vertical application areas  spanned by multiple horizontal data pipelines, platforms, and algorithms.  We are unifying data science and data engineering, showing what really works to run businesses at scale.
Thursday, May 19 • 11:10am - 11:30am
MacroBase: Analytic Monitoring for the Internet of Things

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

An increasing proportion of data today is generated by automated processes, sensors, and systems---collectively, the Internet of Things (IoT). A core challenge in IoT and an increasingly popular value proposition of many IoT applications in domains including industrial diagnostics, predictive maintenance, and urban observability is in identifying and highlighting unusual and surprising data (e.g., poor driving behavior, equipment failures, gunshots). We call this task---which is often statistical in nature and time-sensitive---analytic monitoring. To facilitate rapid development and scalable deployment of analytic monitoring queries, we have developed MacroBase, a new kind of data analytics engine that provides turn-key analytic monitoring of IoT data streams. MacroBase implements a customizable pipeline of outlier detection, summarization, and ranking operators. To facilitate efficient and accurate operation, MacroBase implements several cross-layer optimizations across robust estimation, pattern mining, and sketching procedures. As a result, MacroBase can analyze several million events per second on a single server. MacroBase has already uncovered several unexpected behaviors (and corresponding bugs) in production in a medium-scale IoT deployment.

Speakers
avatar for Peter Bailis

Peter Bailis

Founder, Sisu
Founder and CEO, Sisu Data (https://sisu.ai) and Assistant Professor, Stanford CS (https://dawn.cs.stanford.edu)


Thursday May 19, 2016 11:10am - 11:30am PDT
Gardner