Data By the Bay has ended
Data By the Bay is the first Data Grid conference matrix with 6 vertical application areas  spanned by multiple horizontal data pipelines, platforms, and algorithms.  We are unifying data science and data engineering, showing what really works to run businesses at scale.
Back To Schedule
Monday, May 16 • 9:50am - 10:30am
Building a Realtime Receiver with Spark Streaming

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

This talk will demystify Spark Streaming Receivers by showing live how to enable streaming consumption from new and untapped data sources.  We'll play around with a publicly available financial data API, while dipping our toes into a twitter data stream and a queue source.  Starting with a simple single-node receiver, we will build up to distributed, reliable receivers.

avatar for Sal Uryasev

Sal Uryasev

Data Werewolf, GoFundMe
Having recently started to build out data infrastructure at GoFundMe, Sal is a veteran of the LinkedIn and Salesforce data science teams. A Scala fanatic, he's done a lot of work on streaming processors to build realtime data recommendation engines.

Monday May 16, 2016 9:50am - 10:30am PDT