Ti trovi qui: Home » News e Seminari

Re-computation of Big Data Analytics Processes in the Presence of Changes: a meta-process powered by provenance

Giovedì 19 Dicembre 2019, dalle ore 11:00 alle ore 12:00 in aula M2.4, edificio Matematica, Dipartimento FIM, Modena

Relatore: Prof. Paolo Missier (New Castle University, UK), visiting professor a UNIMORE

Abstract: Insights generated from Big Data through analytics processes are often unstable over time and thus lose their value, as the analysis typically depends on elements that change and evolve dynamically. However, the cost of having to periodically ``redo'' computationally expensive data analytics is not normally taken into account when assessing the benefits of the outcomes.
The ReComp project addresses the problem of efficiently re-computing, all or in part, outcomes from complex analytical processes in response to some of the changes that occur to process dependencies. While such dependencies may include application and system libraries, as well as the deployment environment, ReComp is focused exclusively on changes to reference datasets as well as to the original inputs.
Our hypothesis is that an efficient re-computation strategy requires the ability to (i) observe and quantify data changes, (ii) estimate the impact of those changes on a population of prior outcomes, (iii) identify the minimal process fragments that can restore the currency of the impacted outcomes, and (iv) selectively drive their refresh.
The talk will discuss a generic framework that addresses these requirements, highlight the role of data provenance as part of the ReComp meta-process, and show how it can be customised to operate on two case studies of very diverse domains, namely genomics and geosciences.

Host: Federica Mandreoli (fmandreoli@unimore.it)

[Ultimo aggiornamento: 16/12/2019 18:34:32]