- What is DART?
- Why is DART important?
- The 4 Integrative Barriers to Wide Implementation of Big Data Analytics
- Mission
What is DART?
DART is part of a much larger State effort to establish an educational and research foundation for expanding and supporting a sustainable data science sector within our economy. Data science is targeted in this EPSCoR (Established Program to Stimulate Competitive Research) project because it is a strategically important technology for a significant and growing part of the State’s economy. Arkansas companies, including Walmart, Tyson, J.B. Hunt Transport Service Inc., Stephens Inc., First Orion, and Acxiom, make decisions from data, employ large numbers of data scientists, and are recruiting a workforce with a higher-level of broad and integrated data science skills.
DART, led by the state EPSCoR office and with a state-wide focus, completes the puzzle by providing necessary organizational structure and shared cyberinfrastructure. DART defines a coordinated research plan composed of five topical areas related to data science needs identified by an expanded number of industrial partners who drive a significant portion of our state’s economy. These topical areas are distinct but interrelated and are intentionally composed of researchers from different academic institutions across the state. Using the same relationships that made data science into a substantive, cohesive program at UAF, DART will establish a state-of-the-art data science research ecosystem involving multiple campuses across the state and develop a close interactive relationship between our university researchers and data science dependent industries.
Why is DART important?
The growing array of tools – powerful high-level programming languages, distributed data storage and computation, visualization tools, statistical modeling, and machine learning – along with a staggering array of big data sources, has the potential to empower people to make better and more timely decisions in science, business, and society. However, there remain fundamental barriers to practical application and acceptance of data analytics, any one of which could derail or impede its full development and contributions.
DART research will systematically investigate key aspects of these four barriers and develop novel, integrated solutions to address them by operating as a true, multi-institution, multi-disciplinary data science research center and educational ecosystem, in which faculty and students from campuses across the state work together on targeted problems important to the research community and the economy of Arkansas.
Did You Know?
Employment of data scientists is projected to grow 35% from 2022 to 2032, significantly outpacing the growth of all occupations in the United States.
Source: Bureau of Labor Statistics, U.S. Department of Labor, Occupational Outlook Handbook, Data Scientists, at https://www.bls.gov/ooh/math/data-scientists.htm
The 4 Integrative Barriers to Wide Implementation of Big Data Analytics
Big Data Management
Before data streams and datasets can be used in the many kinds of learning models, they are often manually curated, or at the least, curated for a specific problem. We still rely on hosts of analysts to assess the content and quality of source data, engineer features, define and transform data models, annotate training data, and track data processes and movement.
Security and Privacy
Government agencies and private entities collect and integrate large amounts of data, process it in real-time, and deliver products or services based on these data to consumers and constituents. There are increasing worries that both the acquisition and subsequent application of big data analytics are not secure or well-managed. This can create a risk of privacy breaches, enable discrimination, and negatively impact diversity in our society.
Model Interpretability
Machine learning models often sacrifice interpretability for predictive power and are difficult to generalize beyond their training and test data. But interpretability and generalizability of trained models is critical in many decision-making systems and/or processes, especially when learning from multi-modal and heterogeneous big data sources. There is a continuing to need to better balance the predictive power of complex machine learning models with the strengths of statistical models to better configure deep learning models to allow humans to see the reasoning behind the predictions.
Data-Skilled Workforce
As data-driven science and decision making become commonplace, our state and nation will need to rely on a well-educated workforce at almost all levels of responsibility to be aware of the power and pitfalls of using data in decision making. The gap between the demand for data-skilled workers and the available degree programs is so large that companies are desperate for more flexible credentialing and alternatives to traditional baccalaureate degrees. A range of offerings from certificates to advanced degrees and apprenticeships must be implemented nationwide to address this issue.
Mission
To improve research capability and competitiveness in Arkansas by creating an integrated statewide consortium of researchers and educators working to establish a synergistic, statewide focus on excellence in data analytics research and training.
As Arkansas transitions to a more diverse, data-driven economy we must create an environment for university and industry collaborations in data science that will sustain this new economy with cutting-edge research and educate a workforce that enhances competitiveness in Arkansas industries. By bringing together experts from different data science sub-fields and application areas, we expect to develop both specific and comprehensive solutions that would be difficult to obtain in isolation. Collaboration with our industry partners provides a better definition of both problems and solutions in data analytics and workforce education.