Data By the Bay is the first Data Grid conference matrix with 6 vertical application areas spanned by multiple horizontal data pipelines, platforms, and algorithms. We are unifying data science and data engineering, showing what really works to run businesses at scale.
Big Data processing is dominating several business applications. Businesses are realizing the benefits of Big Data and analyzing large volumes of data to gain insights about their customers. This causes more data to be collected and stored centrally. Much of the storage occurs in the Cloud. This attracts hackers since they could gain access to large volumes of data about people. Accessing this type of data results in disastrous consequences to individuals whose privacy is violated. In this talk we will look at five of the major data breaches within the recent past and look at ways to protect people's privacy by considering several Best Practices approach. Many in the IT field have realized that it would be very difficult to secure all data that is accessible from the cloud and so better mechanisms should be developed to protect such public data.