In the first reporting period, a roadmap was drafted and, as starting point, the pilot owners prepared a detailed description of their scenarios. These descriptions have then been used for defining the project requirements, the TOREADOR methodology, the services to be designed, the Service Level Agreements (SLA), and the legal aspects.
Then the project consortium worked on TOREADOR model-based methodology and vocabulary. The following solutions have been provided or specified: a preliminary vocabulary and syntax for defining declarative, procedural, and deployment models at the basis of the Big Data Analytics-as-a-Service (BDaaS) framework, a methodology specifying early mechanisms for models design and development, a first Model Driven Architecture (MDA)-based approach to model transformation, supporting BDaaS.
TOREADOR has also defined ready-to-be-executed deployment models that include all executable artefacts for the semi-automatic deployment of TOREADOR’s customer Big Data campaigns and has showed how a procedural model can be transformed into a vendor-specific deployment model by means of a binding process. TOREADOR has also described two complete case studies based on the clickstream analysis pilot and the security log pilot.
Several project meetings were held to consolidate the outcomes of the project.
The outreach activities during this period consisted of participation and presentations in conferences and meetings with the scientific community, with the policy makers and with the industry. The general public has also been reached through coverage in newspapers. News have been broadcasted in Web channels. Furthermore, newsletters have been published as a dissemination tool for the project activities.
As regards the legal aspects, a first general legal framework and quick reference guide were delivered at M6 to provide an overview of the major legal issues to be considered throughout the TOREADOR project by all the project partners, WPs and pilots. At M12, TOREADOR produced a comprehensive deliverable looking into the ownership and intellectual property aspects of data management in a big data context, the results thereof not only aimed at the partners of the TOREADOR Project but also at policymakers to provide additional evidence on the emerging issues of data ownership. Finally, at M18, the privacy and security aspects of data management in a big data context were examined in a final legal research deliverable. Said deliverable is not only aimed at the TOREADOR consortium members but can also provide useful insights to anyone working on similar big data projects. A first audit deliverable was delivered at M6, providing a preliminary review of the legal management and compliance of the project pilots in light of the Legal Aspects Specifications to be drafted for each Pilot at M12. At M12, a second audit deliverable was provided, focusing on the legal management and compliance of the TOREADOR architecture. A third audit deliverable was delivered at M18, providing an overview of the replies to public consultations given on behalf of TOREADOR, as well as the main legal issues related to data ownership, privacy and security, which are to be covered in legal SLAs and contractual arrangements.
In the second reporting period, we mainly focused on the implementation of the TOREADOR framework (GUI + platform) and on the validation of the latter on internal and external pilots and use cases. In particular, we implemented one complete workflow for each of the pilots evaluating our methodology in two different instantiations: service-based and code-based. Two of them (Energy Production Data Analysis and Clickstream Analysis Pilots) have then been deployed on the TOREADOR platform, while the remaining two (Application Log Analysis and Aerospace Products Manufacturing Analysis Pilots) have been executed on premises of the pilot partners for privacy reasons. In addition, some external pilots and industrial use cases have been conducted to test our approaches outside the TOREADOR consortium. As an example, we considered an infrastructure for pollution monitoring managed by Lombardia Informatica, the main ICT agency of the Lombardy region in Italy, to the aim of defining and deploying a Decision Support System (DSS) for pollution data labeling.
To ensure its industrial applicability, the TOREADOR platform was evaluated in reference to the pilot scenarios involved in the project. The delivery of positive and useful experience, as assessed by the industrial partners of the consortium, testifies that they benefit from integrating MBDAaaS in their decision-making.