Work Experience

  • Present 2014

    Doctoral Researcher

    Politecnico di Milano

  • 2015 2014

    Part Time Counselor

    Fluxedo, Politecnico di Milano spin-off

    Design and development of a cloud architecture supporting scalable back-ends, handling social-media applications and adopting the Social BPM approach.

  • 2014

    Research Fellow

    Politecnico di Milano, Department of Electronics, Information, and Bioengineering (DEIB)

    Design and development of an offline data migration system for heterogeneous NoSQL databases, in the context of an European FP7 project called MODAClouds.

  • 2014

    Part Time Counselor

    Fluxedo, Politecnico di Milano spin-off

    NoSQL databases deployment to support the development of mobile applications based on social networks data (in collaboration with University of Trento).

  • 2010 2009

    Networking Technician

    OneSource Building Technologies, FlexTek

    Site survey and Network equipment installation

Education & Training

  • IT Certification 2015

    In-memory Data Management

    Hasso Plattner Institute (HPI)

  • Ph.D. 2014-Present

    Ph.D. student in Information Technology (Computer Science and Engineering)

    Politecnico di Milano

  • M.Sc.2011-2013

    Master Degree in Computer Science and Engineering

    Politecnico di Milano

  • B.Sc.2007-2011

    Bachelor Degree in Computer Science and Engineering

    University of Catania

  • IT Certification2009

    Cisco Certificate Networking Associate · CCNA (640-802)

  • IT Certification2008

    MySQL developer

Privacy Policy

Research Projects

  • image

    MODAClouds (FP7 EU Project)

    MOdel-Driven Approach for design and execution of applications on multiple Clouds

    Context

    Current Cloud’s offer is becoming day by day wider providing a vibrant technical environment, where SMEs can create innovative solutions and evolve their services. Cloud promises cheap and flexible services to end-users at a much larger scale than before. However, Cloud business models and technologies are still in their initial hype and characterized by critical early stage issues, which pose specific challenges and require advanced software engineering methods.

    Goals

    The main goal of MODAClouds is to provide methods, a decision support system, an open source IDE and run-time environment for the high-level design, early prototyping, semi-automatic code generation, and automatic deployment of applications on multi-Clouds with guaranteed QoS. Model-driven development combined with novel model-driven risk analysis and quality prediction will enable developers to specify Cloud-provider independent models enriched with quality parameters, implement these, perform quality prediction, monitor applications at run-time and optimize them based on the feedback, thus filling the gap between design and run-time. Additionally, MODAClouds provides techniques for data mapping and synchronization among multiple Clouds.

    Innovations

    MODAClouds innovations are:

    • simplify Cloud provider selection favoring the emergence of European Clouds,
    • avoid vendor lock-in problems supporting the development of Cloud enabled Future Internet applications,
    • provide quality assurance during the application life-cycle and support migration from Cloud to Cloud when needed.

    Fundings

    The project is partially Funded by European Commission Grant no. FP7-ICT-2011-8-318484

  • image

    SeaClouds (FP7 EU Project)

    Seamless adaptive multi-cloud management of service-based applications.

    SeaClouds performs a seamless adaptive multi-cloud management of service-based applications, by developing Cloud Service Orchestrators and a set of tools to manage complex applications, thus avoiding the problem of Cloud lock-in. This will be achieved by supporting the migration, replication, and distribution of modules that compose cloud-based applications over multiple and technologically diverse Clouds offerings, by using a unified management API and universal metrics for monitoring and verifying functional and non-functional properties. The results delivered by SeaClouds will be compatible with the current standards related to cloud interoperability, OASIS

    Objective

    The objectives of SeaClouds are therefore:

    • Support orchestration, adaptation, and verification of services distributed over different Cloud providers.
    • Provide a unified Cloud-independent mechanism to manage cloud-based services distributed over different Cloud providers.
    • Enable monitoring and runtime reconfiguration operations of services distributed over different Cloud providers.
    • Align SeaClouds with major standards for cloud interoperability, particularly OASIS CAMP and TOSCA, promoting them in research and industrial communities.

Filter by type:

Sort by year:

A Distributed File System over Heterogeneous SaaS Storage Platforms

Marco Scavuzzo
Conference Papers 2014 16th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC 2014)

Abstract

Nowadays, thanks to the massive usage of the Cloud, different providers offer storage as a service solutions. Each of these solutions is characterized by different storage capacity and features. They are also offered according to various business models; typically, users can choose between free plans (with a limited amount of space) and paid plans. Free plans users, when the storage capacity lowers, tend to subscribe to new free plans from other providers, thus increasing the so called data fragmentation. This phenomenon heavily increases the file management complexity.This paper proposes a solution to the data fragmentation problem, by describing an innovative approach which allows to deploy a distributed file system on top of different SaaS storage accounts, offered by different providers. This approach, not only lowers the complexity of data management by providing a single transparent storage solution to the user, but it is also able to provide features like full-text search, file classification and categorization, data analytics (MapReduce) on top of these SaaS storage accounts. Furthermore, this approach proposes a new way to address data privacy and security issues, typically connected to SaaS storage accounts.

Interoperable Data Migration between NoSQL Columnar Databases

Marco Scavuzzo, Elisabetta Di Nitto, Stefano Ceri
Conference Papers Enterprise Distributed Object Computing Conference Workshops and Demonstrations (EDOCW), 2014 IEEE 18th International

Abstract

NoSQL databases have emerged as the solution to handle large quantities of user-generated contents still guaranteeing fault tolerance, availability and scalability. Each NoSQL database offers differentiated properties and characteristics as well as different data models and architectures. As a result, the development of applications exploiting such kind of technology is strictly dependent on the specific NoSQL solution being adopted, and the migration from a NoSQL to the other requires the development of ad-hoc code managing the transfer of data. In order to mitigate such issue, this paper proposes an interoperable migration system for columnar NoSQL databases. The proposed approach is based on an orginal Metamodel, capable of preserving both strong and weak consistency between data updates, secondary indexes and various data types. Moreover, the approach allows developers to easily add support for new databases.

Building Data Intensive Applications Exploiting Data as a Service Systems: Experiences and Challenges

Marco Scavuzzo, Elisabetta Di Nitto, Danilo Ardagna
Technical Reports Politecnico di Milano, Technical Report n. 2014.10

Abstract

Data Intensive (DI) applications are becoming more and more important in several fields of science, economy, and even in our normal life. Unfortunately, even if some technological frameworks are available for them, we still lack solid software engineering approaches to support their development. In this paper we report our experience in developing and optimizing a DI application that migrates data between well known Database as a Service systems. We highlight the issues we have faced during the design and development activities and identify some open challenges that could generate new research in the areas of software design and verification.

Providing BigData Applications with Fault-Tolerant Data Migration Across Heterogeneous NoSQL Databases

Marco Scavuzzo, Damian A. Tamburri, Elisabetta Di Nitto
Conference Papers Proceedings of the Second International Workshop on BIG Data Software Engineering, Austin, TX, USA, 2016.

Abstract

The recent growing interest on highly-available data-intensive applications sparked the need for flexible and portable storage technologies, e.g., NoSQL databases. Unfortunately, the lack of standard interfaces and architectures for NoSQLs makes it difficult and expensive to create portable applications, which results in vendor lock-in. Building on previous work, we aim at providing guaranteed fault-tolerant techniques and supporting architectures to port or migrate data to and across heterogeneous NoSQL technology. To prove the effectiveness of our approach we evaluate it on an industrial case-study. We conclude that our method and supporting architecture offer an efficient and fault-tolerant mechanism for NoSQL portability and interoperation.

Fault-Tolerant Off-line Data Migration: The Hegira4Clouds Approach

Marco Scavuzzo, Elisabetta Di Nitto
Book Chapter Model-Driven Development and Operation of Multi-Cloud Applications, Part of the series SpringerBriefs in Applied Sciences and Technology, pp 59-68

Experiences and challenges in building a data intensive system for data migration

Marco Scavuzzo, Elisabetta Di Nitto, Danilo Ardagna
Journal Papers Empirical Software Engineering. Springer

Abstract

Data Intensive (DI) applications are becoming more and more important in several fields of science, economy, and even in our normal life. Unfortunately, even if some technological frameworks are available for their development, we still lack solid software engineering approaches to support their development and, in particular, to ensure that they offer the required properties in terms of availability, throughput, data loss, etc.. In this paper we report our action research experience in developing-testing-reengineering a specific DI application, Hegira4Cloud, that migrates data between widely used NoSQL databases. We highlight the issues we have faced during our experience and we show how cumbersome, expensive and time-consuming the developing-testing-reengineering approach can be in this specific case. Also, we analyse the state of the art in the light of our experience and identify weaknesses and open challenges that could generate new research in the areas of software design and verification.

Currrent Teaching

  • 2017 2016

    Informatica B - Teacher Assistant

    Politecnico di Milano, Italy

    ANSI C, Matlab

Teaching History

  • 2015

    NoSQL Databases - Industrial Trainer

    CEFRIEL, Milan, Italy
  • 2016 2015

    Informatica B - Teacher Assistant

    Politecnico di Milano, Italy

    ANSI C, Matlab

  • 2015

    Ingegneria del Software (Software Engineering) - Teacher Assistant

    Politecnico di Milano, Italy

    OOP, Java, Design Patterns, Test Driven Development

  • 2015 2014

    Informatica B - Teacher Assistant

    Politecnico di Milano, Italy

    ANSI C, Matlab

  • 2014

    Sistemi informativi (per il settore dell'informazione) - Teacher Assistant

    Politecnico di Milano, Italy

    BPMN and UML modelling