First Steps toward a Data Washing Machine

Data has a life cycle from planning to acquiring, cleansing, storing & sharing, integrating, application, and disposing. While AI and machine learning have taken the application of data to new levels, the other phases remain largely manually mediated processes. The research goal for the Data Life Cycle and Curation thrust is to develop fully automated processes for the other phases of the data life cycle. The presentation today describes some of the progress of the research finding ways to automate data cleansing and data integration phases of the data life cycle.

DART Cyber Infrastructure Resources

Dr. Fred Prior, Dr. Chris Angel, Dr. David Chaffin, and Dr. Pawel Wolinski will present an overview of the Coordinated Cyber Infrastructure efforts on the DART project, including providing secure, distributed, agile, scalable, and on-demand services. This presentation will feature a demonstration using simple, “real-world” data science examples within the Arkansas Research Platform including: DART GitLab repository, Globus file transfer, and Pinnacle-portal.

Welcome to DART

Dr. Jackson Cothren will present an overview of the National Science Foundation-funded EPSCoR Track 1 award entitled, “Data Analytics that are Robust and Trusted (DART): From Smart Curation to Socially Aware Decision Making.”