Data By the Bay is the first Data Grid conference matrix with 6 vertical application areas spanned by multiple horizontal data pipelines, platforms, and algorithms. We are unifying data science and data engineering, showing what really works to run businesses at scale.
The modern day search engine has significantly evolved from its keyword matching days to its current form which leverages a wide variety of data inputs and user feedback loops to help users find out what’s most important in their data. At Lucidworks, we leverage Apache Spark and Solr, together with a variety of open source machine learning and NLP approaches, to build smarter, richer search and data applications. This talk will explore several motivating use cases (customer 360, knowledge management, ecommerce) for our integrations as well as technical approaches and key lessons learned in real world implementations.