Data By the Bay has ended
Data By the Bay is the first Data Grid conference matrix with 6 vertical application areas spanned by multiple horizontal data pipelines, platforms, and algorithms. We are unifying data science and data engineering, showing what really works to run businesses at scale.
Monday, May 16 • 1:10pm - 1:30pm
Netflix Keystone - Streaming Data Pipeline @Scale in the Cloud

Keystone processes over 700 billion events per day (1 petabyte) with at-least-once processing semantics in the cloud. We will explore in detail how we leveraged Kafka, Samza, Docker, and Linux at scale to implement a multi-tenant pipeline in the AWS cloud within a year. We will also share our plans to offer Stream Processing as a Service for all of Netflix to use.
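
For readers unfamiliar with the term, at-least-once processing means an event is never lost but may be handled more than once after a failure. The sketch below illustrates the general idea only and is not Keystone's implementation: `InMemoryLog`, `consume`, and `run_demo` are hypothetical names, and a Python list stands in for a durable log partition. The consumer commits its offset only after handling an event, so a crash between handling and committing causes that event to be replayed on restart.

```python
from typing import Callable, List


class InMemoryLog:
    """Hypothetical stand-in for one partition of a durable log."""

    def __init__(self, events: List[str]) -> None:
        self.events = events
        self.committed = 0  # offset of the next event to read after a restart

    def commit(self, offset: int) -> None:
        self.committed = offset


def consume(
    log: InMemoryLog,
    handle: Callable[[str], None],
    crash_before_commit_at: int = -1,
) -> None:
    """Resume from the committed offset; commit only AFTER handling each event."""
    offset = log.committed
    while offset < len(log.events):
        handle(log.events[offset])
        if offset == crash_before_commit_at:
            # The event was handled, but its offset was never committed.
            raise RuntimeError("simulated crash before commit")
        offset += 1
        log.commit(offset)


def run_demo() -> List[str]:
    log = InMemoryLog(["a", "b", "c"])
    seen: List[str] = []
    try:
        consume(log, seen.append, crash_before_commit_at=1)
    except RuntimeError:
        pass  # the consumer "restarts" below
    consume(log, seen.append)  # resumes at the last committed offset
    return seen
```

In this sketch `run_demo()` handles event "b" twice, once before the simulated crash and once after the restart, which is why consumers downstream of an at-least-once pipeline generally need to tolerate duplicates.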

Monal Daxini

Senior Software Engineer, Netflix, Inc.
Monal Daxini is a Senior Software Engineer at Netflix building a scalable and multi-tenant event processing pipeline, and infrastructure for Stream Processing as a Service. He has over 15 years of experience building scalable distributed systems at organizations like Netflix, NFL.com...

Monday May 16, 2016 1:10pm - 1:30pm PDT