C3.ai Releases COVID-19 Data Lake V2
Doubles in Size – Now One of the World’s Largest Unified Sources of COVID-19 Data
C3.ai, a leading enterprise AI software provider for accelerating digital transformation, announced the addition of 11 new integrated COVID-19 data sets to the C3.ai COVID-19 Data Lake, making it one of the largest pre-integrated and free sources of COVID-19 data in the world. C3.ai COVID-19 Data Lake offers researchers access to normalized, unified data to accelerate efforts in the fight against COVID-19.
Researchers are in a race to predict the virus’ trajectory, forecast demand for ICU bed capacity, analyze the efficacy of COVID-19 guidelines, support COVID-19 diagnosis, and speed the development of medical treatments. The challenge is that most data are dispersed in a variety of different locations and in unusable formats. Absent rich, integrated data sets, it is impossible to develop meaningful and accurate artificial intelligence models.
The C3.ai COVID-19 Data Lake – developed by C3.ai in three weeks and accessible at https://c3.ai/covid – is a unified source of comprehensive, integrated COVID-19 data that C3.ai has made publicly available, at no cost, to global research and scientific communities. The data lake is unique and structurally different from other COVID-19 data collections in that it provides analysis-ready data that researchers can use immediately to enhance new or ongoing COVID projects.
Early Adopters Fast Track COVID-19 Projects
Researchers and data scientists from top universities, leading hospitals, and government agencies are among the early adopters using the C3.ai COVID-19 Data Lake to support a variety of efforts, including:
- Supply chain analysis at Massachusetts Institute of Technology (MIT) Humanitarian Supply Chain Lab, MIT Center for Transportation & Logistics: Researchers at MIT, in collaboration with the Federal Emergency Management Agency (FEMA) and other agencies, are focused on the analysis of critical supply chain issues to understand the distribution and availability of COVID-19 testing equipment and personal protective equipment (PPE) – and the pandemic’s impact on freight flows throughout the country.
“Having access to an integrated set of diverse COVID-19 data sources with a common data model can help accelerate analysis of critical supply chain issues in our work with FEMA and other agencies,” said Tim Russell, Research Engineer at the MIT Humanitarian Supply Chain Lab, MIT Center for Transportation & Logistics. “The C3.ai COVID-19 Data Lake provides a valuable resource in unifying and simplifying access to the necessary data without having to waste time on finding, cleaning, and preparing the data for analysis.”
- COVID-19 search engine affiliated with Lawrence Berkeley National Laboratory (Berkeley Lab): A team of materials scientists at Berkeley Lab has launched a COVID-19 publications search engine that synthesizes hundreds of scientific papers every day for information extraction using text mining algorithms and natural language processing. Berkeley Lab scientists used the C3.ai COVID-19 Data Lake to incorporate Milken Institute data on therapeutics.
- Media portrayal of COVID-19 in the U.S. at Arizona State University (ASU): Researchers at Arizona State want to understand the social psychology behind people’s responses to the pandemic based on media portrayal of COVID-19. Specifically, they will be evaluating the impact of news and social media posts on the population’s compliance with local mandates over time.
- Pandemic strategies and response scenarios at a government agency: Data scientists are developing pandemic strategies, response scenarios, and risk assessments by building predictive models that will validate other publicly available models.
“With the addition of these 11 important data sets, we are proud to continue enhancing the scope and exponentially increasing the value of the C3.ai COVID-19 Data Lake as a no-cost resource for the global research community,” said Thomas M. Siebel, CEO of C3.ai. “We are excited by the enthusiastic response among researchers and we are confident that their creativity, innovation, and imaginative use of this resource will yield significant results toward mitigating this and future pandemics.”
C3.ai is also encouraging researchers to recommend data sources they would like to see added to the C3.ai COVID-19 Data Lake for future research. For example, a physician from a leading hospital has requested that C3.ai add all U.S. vaccination data to the data lake to study the impact of previous vaccinations on the rate of hospitalizations and infections. Additionally, researchers affiliated with a leading university have requested that C3.ai add de-identified patient data to the data lake to improve an app that informs users with pre-existing conditions of COVID-related morbidity risks.
C3.ai COVID-19 Data Lake data sources currently include:
- Johns Hopkins University: COVID-19 Data Repository
- The COVID Tracking Project
- World Health Organization: Daily Situation Reports
- The New York Times: COVID-19 Data in the United States
- European Centre for Disease Prevention and Control: Worldwide Situation Updates
- University of Washington’s Institute for Health Metrics and Evaluation: COVID-19 Projections
- Data Science for COVID-19: South Korea Dataset
- Dipartimento della Protezione Civile – Emergenza Coronavirus
- COVID-19 India
- nCoV-2019 Data Working Group: Epidemiology Data
- MOBS Lab: COVID-19 Situation Report
- National Center for Biotechnology Information Virus Database
- Allen Institute for AI: COVID-19 Open Research Dataset (CORD-19)