Loading…
This event has ended. View the official site or create your own event → Check it out
This event has ended. Create your own
Data By the Bay is the first Data Grid conference matrix with 6 vertical application areas  spanned by multiple horizontal data pipelines, platforms, and algorithms.  We are unifying data science and data engineering, showing what really works to run businesses at scale.
View analytic
Thursday, May 19 • 11:10am - 11:30am
MacroBase: Analytic Monitoring for the Internet of Things

Sign up or log in to save this to your schedule and see who's attending!

An increasing proportion of data today is generated by automated processes, sensors, and systems---collectively, the Internet of Things (IoT). A core challenge in IoT and an increasingly popular value proposition of many IoT applications in domains including industrial diagnostics, predictive maintenance, and urban observability is in identifying and highlighting unusual and surprising data (e.g., poor driving behavior, equipment failures, gunshots). We call this task---which is often statistical in nature and time-sensitive---analytic monitoring. To facilitate rapid development and scalable deployment of analytic monitoring queries, we have developed MacroBase, a new kind of data analytics engine that provides turn-key analytic monitoring of IoT data streams. MacroBase implements a customizable pipeline of outlier detection, summarization, and ranking operators. To facilitate efficient and accurate operation, MacroBase implements several cross-layer optimizations across robust estimation, pattern mining, and sketching procedures. As a result, MacroBase can analyze several million events per second on a single server. MacroBase has already uncovered several unexpected behaviors (and corresponding bugs) in production in a medium-scale IoT deployment.

Speakers
avatar for Peter Bailis

Peter Bailis

Professor, Stanford University
Peter Bailis is an assistant professor of computer science at Stanford University. Peter's research in the Future Data Systems group (http://futuredata.stanford.edu/) focuses on the design and implementation of next-generation data-intensive systems. His work spans large-scale data management, distributed protocol design, and architectures for high-volume complex decision support. He is the recipient of an NSF Graduate Research Fellowship, a... Read More →


Thursday May 19, 2016 11:10am - 11:30am
Gardner

Attendees (14)