Name: Sparse data alternatives with neural network embeddings
Start: 2016-05-17T14:10:00-0700
End: 2016-05-17T14:50:00-0700

Data By the Bay is the first Data Grid conference matrix with 6 vertical application areas spanned by multiple horizontal data pipelines, platforms, and algorithms. We are unifying data science and data engineering, showing what really works to run businesses at scale.

Back To Schedule

Sparse data alternatives with neural network embeddings

The advent of continuous word representation technologies such as Word2Vec and GLOVE has transformed how Data Scientists and Machine Learning experts work with natural language data. One reason these algorithms are so successful is that they offer an efficient information preserving methodology to highly compress native features (word frequencies) to the dimensions of the embedded vector space. This is particularly effective in the sparse data context of word count frequencies. Recently word embedding algorithms have been generalized to generic graph networks contexts. In this talk we review results of applying this generalization to alternative sparse data contexts such as User-based as well as Item-based recommender algorithms.

Speakers

Text

Data By the Bay

Marvin Bertin

David Ott

Mike Tamir

Attendees (24)

Data By the Bay

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Marvin Bertin

David Ott

Mike Tamir

Attendees (24)