Place and date of birth Rome, November 26 th 1983



Similar documents
Database Design for NoSQL Systems

OAK Database optimizations and architectures for complex large data Ioana MANOLESCU-GOUJOT

How I Learned to Stop Worrying and Love NoSQL Databases

Preparing Your Data For Cloud

TECHNOLOGY TRANSFER PRESENTS OCTOBER OCTOBER RESIDENZA DI RIPETTA - VIA DI RIPETTA, 231 ROME (ITALY)

Laboratoire d Informatique de Paris Nord, Institut Galilée, Université. 99 avenue Jean-Baptiste Clément, Villetaneuse, France.

GIOVANNI MATTEO FUMAROLA

Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores

A Logical Approach to NoSQL Databases

MicroStrategy Course Catalog

The University of Jordan

Introduction to NoSQL Databases. Tore Risch Information Technology Uppsala University

E6895 Advanced Big Data Analytics Lecture 4:! Data Store

NoSQL systems: introduction and data models. Riccardo Torlone Università Roma Tre

Computer Information Systems

Databases. DSIC. Academic Year

MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database System in Energy Data Management

Martino Sykora CURRICULUM VITAE ET STUDIORUM

GeoKettle: A powerful open source spatial ETL tool

THE OPEN UNIVERSITY OF TANZANIA FACULTY OF SCIENCE TECHNOLOGY AND ENVIRONMENTAL STUDIES BACHELOR OF SIENCE IN DATA MANAGEMENT

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

RDF Data Management in the Amazon Cloud

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

BUSINESS TECHNOLOGY (BTE)

M. Lamine BA Post-doc fellow

Cloud Scale Distributed Data Storage. Jürmo Mehine

Introduction to Databases

Overview of Data Management

FACULTY OF COMPUTER SCIENCE AND INFORMATION TECHNOLOGY AUTUMN 2016 BACHELOR COURSES

Chapter 2 Database System Concepts and Architecture

Database Systems. Lecture 1: Introduction

EIT ICT Labs MASTER SCHOOL DSS Programme Specialisations

Logistics. Database Management Systems. Chapter 1. Project. Goals for This Course. Any Questions So Far? What This Course Cannot Do.

1 File Processing Systems

Designing an Object Relational Data Warehousing System: Project ORDAWA * (Extended Abstract)

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

From Oracle Warehouse Builder to Oracle Data Integrator fast and safe.

Analytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

City-Wide Smart Healthcare Appointment Systems Based on Cloud Data Virtualization PaaS

Cloud3DView: Gamifying Data Center Management

Bachelor of Science in Information Technology. Course Descriptions

How To Handle Big Data With A Data Scientist

MIS630 Data and Knowledge Management Course Syllabus

ECS 165A: Introduction to Database Systems

CSE 132A. Database Systems Principles

INTEROPERABILITY IN DATA WAREHOUSES

What is Data Virtualization? Rick F. van der Lans, R20/Consultancy

History of Database Systems

WebRatio 5: An Eclipse-based CASE tool for engineering Web applications

Automating Big Data Management, by DISIT Lab Distributed [Systems and Internet, Data Intelligence] Technologies Lab Prof. Ph.D. Eng.

DataBridges: data integration for digital cities

Data Modeling for Big Data

Introduction. Introduction: Database management system. Introduction: DBS concepts & architecture. Introduction: DBS versus File system

Scalable Architecture on Amazon AWS Cloud

BIGS: A Framework for Large-Scale Image Processing and Analysis Over Distributed and Heterogeneous Computing Resources

Peninsula Strategy. Creating Strategy and Implementing Change

Professional Organization Checklist for the Computer Science Curriculum Updates. Association of Computing Machinery Computing Curricula 2008

CONCEPTUAL FRAMEWORK OF BUSINESS INTELLIGENCE ANALYSIS IN ACADEMIC ENVIRONMENT USING BIRT

Big Data Analytics. Rasoul Karimi

The relational model is dead, SQL is dead, and I don t feel so good myself

Cloud-based RDF data management

Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January Website:

Introduction: Database management system

Time: A Coordinate for Web Site Modelling

Data Analytics Infrastructure

DATABASE SYSTEM CONCEPTS AND ARCHITECTURE CHAPTER 2

Report Data Management in the Cloud: Limitations and Opportunities

Course Syllabus For Operations Management. Management Information Systems

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer

College of Engineering, Technology, and Computer Science

n Assignment 4 n Due Thursday 2/19 n Business paper draft n Due Tuesday 2/24 n Database Assignment 2 posted n Due Thursday 2/26

You should have a working knowledge of the Microsoft Windows platform. A basic knowledge of programming is helpful but not required.

A Workbench for Prototyping XML Data Exchange (extended abstract)

How To Write A Database Program

Contents The College of Information Science and Technology Undergraduate Course Descriptions

XpoLog Competitive Comparison Sheet

Database Management System Choices. Introduction To Database Systems CSE 373 Spring 2013

DEGREE PLAN INSTRUCTIONS FOR COMPUTER ENGINEERING

Why NoSQL? Your database options in the new non- relational world IBM Cloudant 1

Reverse Engineering in Data Integration Software

Data Integration Checklist

Cloud Ready Data: Speeding Your Journey to the Cloud

Enabling Continuous Delivery for Java Projects with Oracle Cloud Services (Oracle PaaS) Siva Rama Krishna Oracle India

Challenges for Data Driven Systems

A Multidatabase System as 4-Tiered Client-Server Distributed Heterogeneous Database System

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

Structured Data Storage

Cleveland State University

Querying MongoDB without programming using FUNQL

Introductory Concepts

Division of Mathematical Sciences

A Design and implementation of a data warehouse for research administration universities

Next-Generation Cloud Analytics with Amazon Redshift

Information and Communications Technology Courses at a Glance

A common interface for multi-rule-engine distributed systems

CISC 432/CMPE 432/CISC 832 Advanced Database Systems

SAP BusinessObjects Business Intelligence 4.1 One Strategy for Enterprise BI. May 2013

Curriculum of the research and teaching activities. Matteo Golfarelli

Transcription:

Francesca Bugiotti Personal Information Place and date of birth Rome, November 26 th 1983 E-mail francesca.bugiotti@inria.fr Education Università Roma Tre (November 2008 - April 2012 ) PhD in Computer Science - Computer Science and Automation department Thesis: A model oriented approach to heterogeneity Università Roma Tre - IBM - Formit 1 (January 2008 - March 2009 ) Post lauream degree in IT governance: development, management and monitoring Università Roma Tre (October 2008 ) Qualifying examination to exercise the engineering profession Università Roma Tre (October 2005 - December 2007 ) Master degree in Computer Engineering (Laurea Specialistica in Ingegneria informatica) Thesis: Tools and methodology for model management problems Final grade: 110/110 lode (maximum honors) Università Roma Tre (October 2002 - July 2005 ) First level degree in Computer Engineering (Laurea in Ingegneria informatica) Thesis: Datalog rules management for data and schema translation Final grade: 110/110 lode (maximum honors) Awards Accenture and Università Roma Tre (March 2009 ) Accenture Outstanding Engineering Graduate Award IBM (July 2007 ) Participant in IBM EMEA Best Student Recognition Event, Nice Research Activity Inria - Institut National de Recherche en Informatique et en Automatique (November 2013 - present) Post-doctoral research on efficient data storage mechanism for heterogeneous data in the cloud, supported by Inria in connection with the KIC EIT ICT Labs Europa activity on scalable cloud-based data management Università Roma Tre (April 2012 - October 2013 ) Post graduate research activity on model management in databases and NoSQL data stores integration Inria - Institut National de Recherche en Informatique et en Automatique (April 2011 - July 2011 ) Fine-granularity Semantic Web data indexing in Amazon Web Services (AWS) cloud platform Università Roma Tre (January 2008 - November 2008 ) Research activity on model management problems in databases ISA s.r.l. 2 (December 2008 - January 2010 ) Part-time research activity on data mining applied to clinical data 1 FORMIT is a foundation that performs activities of scientific research, technical support, analysis and industrial, financial and socio-economic evaluation to sustain migration processes and integration of technological systems in every field of society. 2 ISA s.r.l. is an Italian enterprise that provides software for small-medium companies. It is focused on ERP services and business intelligence.

Teaching Activity Technical Skills Research Projects Database course for 2nd year Bachelor students at Università Roma Tre (2008, 2009, 2011), with Paolo Atzeni. Tests grading, office hours. The course introduces databases design and data base management systems (DBMS). I curated the practical part that requires to perform experiments in some DBMSs (IMB DB2, PostgresSQL, Oracle, SimpleDB) and in some environments for data warehousing (Pentaho). Object-Oriented Analysis and Design course for 3rd year Bachelor students at Università Roma Tre (2009, 2011, 2012), with Luca Cabibbo. Tests grading, office hours. The course covers Object Oriented Analysis and Design, introduces to agile object oriented programming and to the iterative and incremental software developing approach. IT Governance course for 2nd year Master students at Università Roma Tre (2012), with Paolo Atzeni. Lectures, exercise sessions, office hours, exams grading. The course introduces the principles governing the lifecycle of an IT systems describing the methodological issues, the economic rules and the juridical aspects. Java Programming and Algorithms course for 3rd year Bachelor students at Università Roma Tre (2007), with Luca Cabibbo. Laboratory exercise lessons and tests grading. The course introduces the Java programming methodologies. Remedial Mathematics course for 1st year Bachelor students at Università Roma Tre (2009), math lessons. Lectures and exercise sessions. The course introduces math concepts to students. It covers linear algebra, geometric, trigonometry and function analysis. Contribution to the database book Databases - Models and query languages (Basi di dati - Modelli e linguaggi di interrogazione), McGraw Hill 2009, under contract with the publisher. I wrote 100 exercises accompanying the book chapters and I provided the solutions; they are accessible on-line at: http://www.ateneonline.it/atzeni3e/areastudenti.asp. Programming languages: Java, C/C++, Prolog, OCaml, Assembly Databases: Relational Databases, SQL, NoSQL data stores (MongoDB, Redis, Oracle NoSQL, HBase, DynamoDB) Operating Systems: Linux, MacOS, Windows At Università Roma Tre NOAM and ONDM - 2013 The NoSQL Abstract Model (NOAM) is a logical approach to the NoSQL database design problem [N15] which aims at identifying and exploiting the commonalities of various NoSQL systems. It is based on an intermediate, abstract data model where aggregates (complex objects) are introduced as units of distribution, to support scalability as well as consistency (when the latter is needed). Data represented according the NOAM Abstract Model can be stored into a variety of NoSQL data stores; the choice between these can be made by the application developer taking into account criteria such as consistency, efficiency etc. [M16]. ONDM (Object-NoSQL Datastore Mapper) is a concrete tool implementing the NOAM model [M17] ONDM provides application developers with a uniform programming interface, as well as the ability to map application data to different data representations. ONDM can be used also in order to run experiments for validating the effectiveness and the efficiency of a chosen data representation with respect to the queries of interest. GENDATA - 2013 The work regarding data models continues within the GENDATA European project http:// gendata.weebly.com/. GENDATA aims at building the abstractions, models, and protocols for supporting a network of genomic data, making them available to genome data analysis servers located in the major biology laboratories in the world. My involvement in the project concerns the work packages of Università Roma Tre: the model design, the query language and the model standardization. MATRIX and EXL - 2012 I collaborated with the Bank of Italy on supporting the implementation of EXLEngine for high-level manipulation of statistical data in terms of entities of statistical models such as time series. We proposed (i) the EXL language for declaratively specifying statistical programs, (ii) an approach for the translation of EXL code into executables in various target

systems, and (iii) a concrete implementation, namely the EXLEngine platform, that was integrated into the Bank of Italy information system: INFOSTAT. The approach leverages on schema mappings as an intermediate specification step, in order to facilitate the translation from EXL towards several target systems [IC5]. SOS - 2011/2012 Save Our Systems (SOS) is a common programming interface [D10] to heterogeneous NoSQL systems. Its goal is to support and simplify application development by hiding the system details. I contributed to defining the architecture of the platform, the operations it exposes and the query strategies it implements. I have been involved in the definition of the strategies for integrating the NoSQL data stores into the system. I also participated in the definition of the data storage techniques that are used for each datastore in order to perform operations the interface exposes [IC7, IJ4]. MISM - 2009/2012 Model Independent Schema Management (MISM) is a platform for model management that offers a set of operators to transform schemas and manipulate models. The management of data through superposed levels of abstraction (e.g., a conceptual level such as ER, plus a lower implementation level such as a relational schema specified in SQL Data Definition Language) raises the so-called round-trip engineering problem: If, during exploitation, changes are made to the lower implementation level, for instance the relational schema is altered, the higher-level specification needs to be altered also for the two levels of the specification to remain consistent. MISM allows to solve the round-trip engineering problem by providing appropriate operators (at the levels of the schema and the model) which enable to correctly propagate such schema changes to the higher model level. Importantly, the operators supported by MISM are defined in a model-independent, model-aware fashion, based on the MIDST supermodel [IJ3, IN13]. This generality makes MISM applicable to a variety of concrete models. MIDST-RT - 2008/2012 Model-Independent Schema and Data Translation-RunTime (MIDST-RT) is a platform based on MIDST providing runtime data transformations (this contrats with the data manipulation features of MISM which are designed to be used off-line, once per database). I contributed to the definition, design and implementation of the MIDST-RT algorithm that, given the schema of the source database and the model of the target one, generates views on the operational system that expose the underlying data according to the corresponding schema in the target model [IJ2, IC6, N14]. MIDST - 2005/2012 Model-Independent Schema and Data Translation (MIDST) is a platform for modelindependent schema and data translation based on a meta-level approach ranging over a variety of data models such as relational, object-relational, object-oriented, entity-relationship, or XML. I contributed to the extension of the MIDST supermodel (an abstract data model describing the various data models in terms of a small set of basic constructs) adding new basic constructs. Further, I have implemented core MIDST software components contributing to the evolution of the platform [N12]. At INRIA AMADA - 2011 During my internship at INRIA I contributed to the AMADA project: a platform [D9, N11]for storing Web data (XML documents and RDF graphs) based on the Amazon Web Services (AWS) cloud infrastructure. I worked on the problem of indexing RDF datasets by using SimpleDB, a key-value store provided by AWS. I contributed to the definition and development of four indexing strategies, deploying them in the AWS platform and experimentally evaluating them [B1, IC8]. Publications All my publications can be found at:

http://www.bugiotti.it/publications.html The general practice within my scientific community is to list all authors in alphabetic order unless one author has contributed very significantly more than her or his share. Books and contribution to books [B1] Francesca Bugiotti, Jesús Camacho-Rodríguez, François Goasdoué, Zoi Kaoudi, Ioana Manolescu, and Stamatis Zampetakis. SPARQL query processing in the cloud. In Andreas Harth, Katja Hose, and Ralph Schenkel, editors, Linked Data Management: Principles and Techniques, pages 49 72. CRC Press, 2014. To appear. Articles in international journals [IJ2] Paolo Atzeni, Luigi Bellomarini, Francesca Bugiotti, Fabrizio Celli, and Giorgio Gianforme. A runtime approach to model-generic translation of schema and data. Information Systems, 37(3):269 287, 2012. [IJ3] Paolo Atzeni, Luigi Bellomarini, Francesca Bugiotti, and Giorgio Gianforme. MISM: A Platform for Model-Independent Solutions to Model Management Problems. Journal of Data Semantics, 14:133 161, 2009. [IJ4] Paolo Atzeni, Francesca Bugiotti, and Luca Rossi. Uniform Access to NoSQL Systems. Information Systems, 2013. Full articles in international conferences and workshops [IC5] Paolo Atzeni, Luigi Bellomarini, and Francesca Bugiotti. EXLEngine: executable schema mappings for statistical data processing. In International Conference on Extending Database Technology (EDBT), pages 672 682, 2013. [IC6] Paolo Atzeni, Luigi Bellomarini, Francesca Bugiotti, and Giorgio Gianforme. A runtime approach to model-independent schema and data translation. In International Conference on Extending Database Technology (EDBT), pages 275 286, 2009. [IC7] Paolo Atzeni, Francesca Bugiotti, and Luca Rossi. Uniform Access to Non-relational Database Systems: The SOS Platform. In International Conference on Advanced Information Systems Engineering (CAiSE), pages 160 174, 2012. [IC8] Francesca Bugiotti, François Goasdoué, Zoi Kaoudi, and Ioana Manolescu. RDF Data Management in the Amazon Cloud. In Workshop on Data analytics in the Cloud (DanaC 2012), Berlin, Allemagne, February 2012. Demonstrations in international conferences [D9] Andrés Aranda-Andújar, Francesca Bugiotti, Jesús Camacho-Rodríguez, Dario Colazzo, François Goasdoué, Zoi Kaoudi, and Ioana Manolescu. AMADA: Web Data Repositories in the Amazon Cloud. In ACM CIKM - International Conference on Information and Knowledge Management, Maui, États-Unis, 2012. [D10] Paolo Atzeni, Francesca Bugiotti, and Luca Rossi. SOS (Save Our Systems): a uniform programming interface. for non-relational systems. In EDBT, pages 582 585, 2012.

Articles and demos at national database conference [N11] Andrés Aranda-Andújar, Francesca Bugiotti, Jesús Camacho-Rodríguez, and Zoi Kaoudi. AMADA: Web Data Repositories in the Amazon Cloud. In Journées des Bases de Données Avancées (BDA), 2012. [N12] Paolo Atzeni, Luigi Bellomarini, Francesca Bugiotti, and Giorgio Gianforme. From Schema and Model Translation to a Model Management System. In British National Conference on Databases (BNCOD), pages 227 240, 2008. [N13] Paolo Atzeni, Luigi Bellomarini, Francesca Bugiotti, and Giorgio Gianforme. A platform for model-independent solutions to model management problems. In Italian Symposium on Advanced Database Systems (SEBD), pages 310 317, 2008. [N14] Paolo Atzeni, Luigi Bellomarini, Francesca Bugiotti, and Giorgio Gianforme. A runtime approach to model-independent schema and data translation. In Italian Symposium on Advanced Database Systems (SEBD), pages 245 252, 2009. [N15] Francesca Bugiotti and Luca Cabibbo. A Comparison of Data Models and APIs of NoSQL Datastores. In Italian Symposium on Advanced Database Systems (SEBD), pages 63 74, 2013. Submitted for Publication [SP16] Paolo Atzeni, Francesca Bugiotti, François Goasdoué, Ioana Manolescu, and Yifan Li. Effisto: Toward a Universal Data Store. pages 1 4, 2014. [SP17] Francesca Bugiotti and Luca Cabibbo. A logical approach to nosql databases. pages 1 12, 2014. [SP18] Francesca Bugiotti and Luca Cabibbo. nosql database design. pages 1 12, 2014. An object-datastore mapper supporting