Data By the Bay is the first Data Grid conference matrix with 6 vertical application areas spanned by multiple horizontal data pipelines, platforms, and algorithms. We are unifying data science and data engineering, showing what really works to run businesses at scale.
Approximately one-quarter of people searching for health information online hit a paywall. Medical knowledge is locked up in non-open access scientific research papers which have copyright licenses that prevent free distribution. However, facts cannot be copyrighted* and may pass through paywalls unencumbered by copyright license restrictions. We have developed a framework to enable access to scientific knowledge. Academic readers with access to papers can locally install and run our freely available Fact Extractor software. After a local PDF paper is identified and approved by the user, Fact Extractor identifies and extracts facts from the scientific paper. The software then distributes the extracted facts to our public Wiki-based server http://factpub.org for everyone to access. Client-side processing for fact extraction means no copies of the paper are distributed. Large-scale adoption of this fact-publishing framework will empower accessibility to health and other scientific research. * Feist Publications, Inc., v. Rural Telephone Ser-vice Co., 499 U.S. 340 (1991)