Data has a life cycle from planning to acquiring, cleansing, storing & sharing, integrating, application, and disposing. While AI and machine learning have taken the application of data to new levels, the other phases remain largely manually mediated processes. The research goal for the Data Life Cycle and Curation thrust is to develop fully automated processes for the other phases of the data life cycle. The presentation today describes some of the progress of the research finding ways to automate data cleansing and data integration phases of the data life cycle.
Dr. Fred Prior, Dr. Chris Angel, Dr. David Chaffin, and Dr. Pawel Wolinski will present an overview of the Coordinated Cyber Infrastructure efforts on the DART project, including providing secure, distributed, agile, scalable, and on-demand services. This presentation will feature a demonstration using simple, “real-world” data science examples within the Arkansas Research Platform including: DART GitLab repository, Globus file transfer, and Pinnacle-portal.