Data By the Bay has ended
Data By the Bay is the first Data Grid conference matrix with 6 vertical application areas  spanned by multiple horizontal data pipelines, platforms, and algorithms.  We are unifying data science and data engineering, showing what really works to run businesses at scale.
Monday, May 16 • 1:10pm - 1:30pm
Building domain specific databases with a distributed commit log [Kafka/Kubernetes]

Sign up or log in to save this to your schedule and see who's attending!

Smyte is building a platform to analyze all of the traffic running through busy consumer websites and mobile apps. In this talk I'm going to describe our solution to one tricky problem — counting. Specifically: accurately counting ludicrous amounts of events over sliding windows, while keeping costs as low as possible. Oh, and… lets get something working in hour or two and improve it later.

avatar for Yunjing Xu

Yunjing Xu

Infrastructure Engineer, Smyte
Yunjing is an software engineer building server and database infrastructure at Smyte. Before Smyte, Yunjing worked on the data science and infrastructure team at Square and received Ph.D. from University of Michigan for researching performance and security problems of public cloud... Read More →

Monday May 16, 2016 1:10pm - 1:30pm

Attendees (5)