Loading…
Data By the Bay has ended
Data By the Bay is the first Data Grid conference matrix with 6 vertical application areas  spanned by multiple horizontal data pipelines, platforms, and algorithms.  We are unifying data science and data engineering, showing what really works to run businesses at scale.
Back To Schedule
Wednesday, May 18 • 4:00pm - 4:40pm
Deep Dive: Spark Memory Management

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Memory management is at the heart of any data-intensive system. Spark, in particular, must arbitrate memory allocation between two main use cases: buffering intermediate data for processing (execution) and caching user data (storage). This talk will take a deep dive through the memory management designs adopted in Spark since its inception and discuss their performance and usability implications for the end user.

Speakers
avatar for Andrew Or

Andrew Or

Software Engineer, Databricks
Anything about Spark.


Wednesday May 18, 2016 4:00pm - 4:40pm PDT
Ada