This seminar tackles the complexity of two problems in Big Data management: analyzing Big Data in a streaming fashion, and managing the security and privacy of Big Data in the cloud. Michele Guerriero and Lorenzo Affetti present the major takeaways from the lectures given at the 3rd International Winter School on Big Data (Bari, February 2017) by Minos Garofalakis and Pierangela Samarati.
Streaming Big Data Analytics (Minos Garofalakis)
Effective Big Data analytics need to rely on algorithms for querying and analyzing massive, continuous data streams (that is, data that is seen only once and in a fixed order) with limited memory and CPU-time resources. Such streams arise naturally in emerging large-scale event monitoring applications; for instance, network-operations monitoring in large ISPs, where usage information from numerous sites needs to be continuously collected and analyzed for interesting trends. In addition to memory- and time-efficiency concerns, the inherently distributed nature of such applications also raises important communication-efficiency issues, making it critical to carefully optimize the use of the underlying network infrastructure.
This seminar will give an overview of some key algorithmic tools for effective query processing over streaming data.
The focus will be on small-space sketching structures for approximating continuous data streams in both centralized and distributed settings.
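As an illustration of what a small-space sketching structure can look like, here is a minimal Python sketch of a Count-Min sketch, one well-known structure of this kind. It is not taken from the seminar material: the class name `CountMinSketch`, the default `width` and `depth`, and the choice of salted BLAKE2b as the hash family are all assumptions made for this example. The structure keeps a fixed-size table of counters regardless of stream length, and its frequency estimates can overestimate but never underestimate the true count.

```python
import hashlib


class CountMinSketch:
    """Illustrative Count-Min sketch (hypothetical names and defaults)."""

    def __init__(self, width=256, depth=4):
        self.width = width  # counters per row
        self.depth = depth  # number of rows, one hash function each
        self.table = [[0] * width for _ in range(depth)]

    def _index(self, item, row):
        # Assumption: salted BLAKE2b stands in for a family of
        # independent hash functions, one per row.
        digest = hashlib.blake2b(
            item.encode("utf-8"),
            digest_size=8,
            salt=row.to_bytes(8, "little"),
        ).digest()
        return int.from_bytes(digest, "little") % self.width

    def add(self, item, count=1):
        # Each stream element is seen once and updates one counter per row.
        for row in range(self.depth):
            self.table[row][self._index(item, row)] += count

    def estimate(self, item):
        # Collisions only inflate counters, so the minimum over rows
        # is an upper bound that is closest to the true frequency.
        return min(
            self.table[row][self._index(item, row)]
            for row in range(self.depth)
        )

    def merge(self, other):
        # Sketches with identical dimensions combine by adding counters,
        # which lets each site summarize its local stream and ship only
        # the sketch to a coordinator.
        if (self.width, self.depth) != (other.width, other.depth):
            raise ValueError("sketch dimensions must match")
        for row in range(self.depth):
            for col in range(self.width):
                self.table[row][col] += other.table[row][col]


# Two sites sketch their local streams; a coordinator merges them.
site_a = CountMinSketch()
site_b = CountMinSketch()
for ip in ["10.0.0.1"] * 30 + ["10.0.0.2"] * 5:
    site_a.add(ip)
for ip in ["10.0.0.1"] * 12 + ["10.0.0.3"] * 7:
    site_b.add(ip)
site_a.merge(site_b)
print(site_a.estimate("10.0.0.1"))  # at least 42, the true total
```

The `merge` step relates to the distributed setting: only the fixed-size tables travel over the network, not the raw streams, which is one way to address the communication-efficiency concern mentioned above.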
Data Security and Privacy in the Cloud (Pierangela Samarati)
Cloud technology is now widely used for conveniently storing, accessing, processing, and sharing information. Its success stems from the scalability and elasticity it provides, which lead many users and companies to rely on the multitude of available Cloud providers for storing and processing their data. Unfortunately, this convenience comes at the price of data owners losing control over their data, with consequent new data security and privacy threats that, in the long run, can seriously limit the widespread adoption of the Cloud computing paradigm. This seminar gives an overview of the data security and privacy issues arising in the Cloud scenario, along with techniques that address specific problems: guaranteeing the confidentiality of data stored or processed by external providers, ensuring access privacy, regulating and controlling access to data in the cloud, and performing queries on protected data.