How To Work With Big Data From Space In Europe

Similar documents
ESA Earth Observation Big Data R&D Past, Present, & Future Activities

Big Data in the context of Preservation and Value Adding

Leveraging Big Data Value Towards a Data-driven Europe with joint PPP efforts

ACCESS TO ERS AND ENVISAT DATA. CGMS is informed about the ESA Earth Observation data policy and data access, in particular in Near Real Time.

The Massachusetts Open Cloud (MOC)

Synergies between the Big Data Value (BDV) Public Private Partnership and the Helix Nebula Initiative (HNI)

Mission Operations and Ground Segment

Towards a Thriving Data Economy: Open Data, Big Data, and Data Ecosystems

EO data hosting and processing core capabilities and emerging solutions

DGE /DG Connect

A Future Scenario of interconnected EO Platforms How will EO data be used in 2025?

Forestry Thematic Exploitation Platform Earth Observation Open Science 2.0

Questionnaire on the European Data-Driven Economy

Standards for Big Data in the Cloud

Kimmo Rossi. European Commission DG CONNECT

Big Data and evolution of the Ground System EO ENG and the imarine case

Cloud Computing and Content Delivery Network use within Earth Observation Ground Segments: experiences and lessons learnt

9360/15 FMA/AFG/cb 1 DG G 3 C

8970/15 FMA/AFG/cb 1 DG G 3 C

European Big Data Value Strategic Research & Innovation Agenda

WORK PROGRAMME Topic ICT 9: Tools and Methods for Software Development

TechNavio Infiniti Research

Educational Opportunities in Big Data

Sustainable Innovation for Sustainable Life

You should have a working knowledge of the Microsoft Windows platform. A basic knowledge of programming is helpful but not required.

RETHINK big Project. European Data Economy Workshop-Focus Data Value Chain & Big and Open Data

The Copernicus Master Prize and the ESA App Camps

How To Handle Big Data With A Data Scientist

EARSC Views on the. Procurement of the Copernicus Services

Helix Nebula, the Science Cloud: Potential for Earth Science Franco-British Workshop on Big Data in Science 6-7 November 2012

How To Help The European Space Program

The Analytics COE: the key to Monetizing Big Data via Predictive Analytics

W H I T E P A P E R E d u c a t i o n a t t h e C r o s s r o a d s o f B i g D a t a a n d C l o u d

Ali Eghlima Ph.D Director of Bioinformatics. A Bioinformatics Research & Consulting Group

Seminar on Polish & Danish Experiences June 2th 2015

PROPOSAL To Develop an Enterprise Scale Disease Modeling Web Portal For Ascel Bio Updated March 2015

Providing On-Demand Situational Awareness

Big Data Big Deal? Salford Systems

Vivir en un mar de Datos 2015: Big Data una mirada Global Fundación Telefónica

Building Your CRM Short List: What You Need to Know Before You Buy

Cloud SingularLogic:

STRATEGIC POLICY FORUM ON DIGITAL ENTREPRENEURSHIP. Fuelling Digital Entrepreneurship in Europe. Background paper

Concept and Project Objectives

H2020-EUJ-2016: EU-Japan Joint Call. EUJ : IoT/Cloud/Big Data platforms in social application contexts

UK Government Information Economy Strategy

Network for Sustainable Ultrascale Computing (NESUS)

H2020-LEIT-ICT WP Big Data PPP

Council of the European Union Brussels, 13 February 2015 (OR. en)

Copernicus Space Component ESA Data Access Overview J. Martin (ESA), R. Knowelden (Airbus D&S)

Long Term Preservation of Earth Observation Data

BIG DATA IS MESSY PARTNER WITH SCALABLE

How To Understand The Power Of Decision Science In Insurance

VITO Centre of Image Processing

EGI services for distribution and federation of data and computing

Emerging Technologies CEOS/WGISS

BIG DATA AND ANALYTICS

Collaborative Product Development The case of network operators introducing Cloud Services

THE QUEST FOR A CLOUD INTEGRATION STRATEGY

The following was presented at DMT 14 (June 1-4, 2014, Newark, DE).

Workprogramme

Better Decision Making

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

Johan Hallberg Research Manager / Industry Analyst IDC Nordic Services & Sourcing Digital Transformation Global CIO Agenda

Helix Nebula, the Science Cloud

An analysis of Big Data ecosystem from an HCI perspective.

Copernicus Space Component Data Access Architecture. Meeting with Austria 27 May 2014, Vienna

TXT e-solutions. Corporate Overview September 2015

Towards a data-driven economy in Europe

Capgemini Big Data Analytics Sandbox for Financial Services

ICT 9: Tools and Methods for Software Development

Data Analytics. SPAN White Paper. Turning information into insights

Digital Collections as Big Data. Leslie Johnston, Library of Congress Digital Preservation 2012

Best Practices: Cloud ediscovery Using On-Demand Technology and Workflows to Speed Discovery and Reduce Expenditure

PREDICTIVE MARKETING, DIGITAL ATTRIBUTION, OPTIMIZATION, AND DATA-DRIVEN PERSONALIZATION

Hadoop in the Hybrid Cloud

Big data in Europe: new environment, new opportunities

How To Help Your Business With Big Data Analytics

The German interagency approach to SSA

What is the Global Innovation Platform. Challenge Driven Innovation

ESS event: Big Data in Official Statistics

Standard Big Data Architecture and Infrastructure

Next-Generation Building Energy Management Systems

Sentinels Operations Konzept und Prinzipien des Datenzugangs - Copernicus Space Component Data Access Overview

Big Data Explained. An introduction to Big Data Science.

Big Data Infrastructures for Processing Sentinel Data

IT convergence driving demand for managed services in telecom. Huawei growing leadership presence in IT and Network managed services.

Cloud Computing Safe Harbor or Wild West?

Managed Hosting: Best Practices to Support Education Strategy in the Career College Sector

Industry s view: How to match the services paradigm. by Markus Probeck, EARSC Director

An Act. To provide for a coordinated Federal program to ensure continued United States leadership in high-performance computing.

12/7/2015. Data Science Master s programs

Building Your Big Data Team

Retail. White Paper. Driving Strategic Sourcing Effectively with Supply Market Intelligence

Client Technology Solutions Suresh Kumar Chief Information Officer

Big Data and Cloud Computing for GHRSST

End-to-End Innovation Solutions. for Telehealth and Remote Patient Monitoring

ESA Earth Observation and the need for high speed networking

HDP Hadoop From concept to deployment.

APPROACHABLE ANALYTICS MAKING SENSE OF DATA

Transcription:

Industry & SMEs Round Table 2014 Conference on Big Data from Space (BiDS '14) Dr. Florin Serban Dr. Catalin Cucu-Dumitrescu 12-14 November 2014, ESRIN, Frascati, Italy

Main aspects to be presented: Status and challenges of Big Data from space in Europe Specific non-technical problems encountered by companies of various scale in Working with Big Data ASRC introduction Round table Open discussion Next steps and recommendations 2

ASRC Funded in 2007 27 employees More than 1 Mil EUR turnover for each of the last 2 years The only Romanian company that developed its own capabilities of analysis, processing and interpretation of optical and radar Earth Observation data Offers innovative solutions for environmental monitoring and risk assessment (flood risk analysis, drought early warning, deforestation evaluation etc.) Clients: European Space Agency, German Aerospace Center, World Bank, national public authorities, private companies 3

ASRC Four main development directions: Monitoring services based on satellite and in situ data processing for: natural hazards risks (drought, floods, landslides / earthquakes), mining, urban and wet zones, agriculture, forestry, critical infrastructure, CO2 storage areas Web based applications and platforms for data searching, downloading, management and processing Educational software development Complementary ground based data acquisition sensors (radar) for different monitoring applications and services Other activities: Visual Analytics tools and services for EO and linked data access Modeling Tool Design 4

Status and Challenges of Big Data from Space in Europe Big Data: Expanding on 3 fronts at an increasing rate Expansion of Big Data in the 3Vs representation (Diya Soubra: The 3Vs that define Big Data) 5

Existing European Data Repositories ESA Data Policy (ERS, Envisat, Earth Explorers): free datasets - free of charge, based on user registration and acceptance of ESA Terms & Conditions; restrained datasets - free of charge, based on user registration and submission of a Project (Full) Proposal and acceptance of ESA Terms & Conditions; after the project evaluation a quota will be assigned. ESA Third Party Missions (Data Policy of individual data providers): reproduction cost (e.g. ALOS)/specific restrictions to the use of data (limitations of quota, geographical restrictions, etc.). (www.analyticbridge.com) 6

Existing European Data Repositories Evolution of ESA's EO Data Archives between 1986-2010 and future projections* *Günther Kohlhammer: (Big?) Data and Earth Observation, H/EO Ground Segment and Missions Operations Department, Big Data from Space, 5-7 June 2013 7

Existing European Data Repositories ESA Mission Sentinel-1 Swarm CryoSat SMOS Data Volume Huge, potentially up to 2.4 TB/day (with the two satellites) Modest data volume ~50 GB per day ~10 GB per day Current and past ESA Missions ESA Mission Sentinel 2 Sentinel 3 ADM-Aeolus Data Volume Huge, potentially up to 1.6 TB/day (with the two satellites) Huge, potentially up to 2.2 TB/day (with the three satellites) 5 TB over the entire mission Earthcare Level 1: 100 GB/day Future ESA Missions (ESA) 8

Application Areas These are the important areas identified by ESA Ground Segment (GS) as priorities*: Dissemination and on-demand processing (because needs are variable and depending on user demand); Secondary archive and re-processing (because needs are limited in time); Temporary resources for integration, testing and demonstration (because needs are limited in time); System sizing (because needs are unknown). *S. Loekken, J. Farres: ESA Earth Observation Big Data R&D Past, Present, & Future Activities, Ground Segment and Mission Operations Department, Earth Observation Programmes Directorate, March 2014 9

Services for ESA EO GS Cloud services have made significant progress. US companies lead the competition: offer very sophisticated and integrated services including user management and communication; intend to develop a business based on information extraction from merged datasets (EO data with other data). Microsoft is cooperating with the European research community, while Google is strongly approaching EO data holders at all levels to offer its services*. *ESA @ ASI: Big Data, IT Technology and their Impact on EO in Europe, November 2013 10

Services for ESA EO GS ESA is under pressure to reduce the cost of its EO Ground Segment. Currently no European service provider can offer the level of services the big players (all of them US companies) can. (ESA) *ESA @ ASI: Big Data, IT Technology and their Impact on EO in Europe, November 2013 11

Users expectations 1. Open Data a. All data are discoverable, accessible online and free b. Data is arranged on long time series of coherent data from different providers. 2. Open Computing a. Users are able to perform processing directly on the cloud using virtual servers. b. Users can choose their preferred cloud provider 3. Open Source Software a. All basic/platform software is open and freely available b. Applications can be easily ported across clouds 4. Open Collaboration* a. Data and applications can be easily shared with other users (eoxserver.org/doc/en/users/index.html) *A. Minchella on behalf of ESA EOPI Team: Access to ESA & ESA TPM EO Data, ESA Advanced Training Course in Land Remote Sensing, 2 July 2013, Athens, Greece 12

Specific Non-Technical Problems Encountered by Companies of Various Scale in Working with Big Data 13

System: The Need to Develop an European Big Data Ecosystem (EBDE) A business ecosystem is an economic community supported by a foundation of interacting organizations and individuals* *Moore, J.F. The Death of Competition: Leadership and Strategy in the Age of Business Ecosystems, HarperBusiness. (1996) The Big Data Value Chain** ** Framing a European Partnership for a Big Data Value Ecosystem, version 1.4, Vision for a European Big Data Value Partnership, February 2014 14

The Dimensions of a Big Data Value Ecosystem* * Diya Soubra: The 3Vs that define Big Data, posted on July 5, 2012 15

System: The Need to Develop an European Big Data Ecosystem (EBDE) The European Partnership for Big Data Value (EP-BDV) has identified several areas where the Big Data Value contractual Public Private Partnership should focus its actions: broadening the availability and accessibility of data sources; assessing the economic value of data assets; developing Big Data technologies and tools to support best datadriven applications and business opportunities; developing data-driven applications and business models providing measurable value to the involved players and addressing the lack of convincing use cases; testing and benchmarking technologies, applications, and business models; addressing the lack of skills and expertise; addressing the issues related to security and privacy and increasing the level of trust into data and data-driven applications. 16

System: The Need to Develop an European Big Data Ecosystem (EBDE) Intensive discussions with stakeholders have clearly shown that besides technology and application many infrastructural, economic, social and legal issues will have to be addressed in an interdisciplinary fashion. Especially for SMEs, these issues are central for a fast take-up of the opportunities offered by Big Data Value*: skills and training; reliable legal frameworks; reference applications and access to an ecosystem. The signature of the Big Data Value Public Private Partnership (BDV PPP) took place on 13 October 2014 (http://www.bigdatavalue.eu/#sthash.vqkffyxw.x61p5wcg.dpuf) *NESSI: DRAFT European Big Data Value Strategic Research & Innovation Agenda, Version 0.7, April 2014 17

System: The Need to Develop an European Big Data Ecosystem (EBDE) There are a number of drivers encouraging the scaling up of the EBDE*: ensuring appropriate access to finance for big data companies; establishing an enabling business environment for data storage, data transfers and communication networks; supporting entrepreneurship, leading to the creation of start-ups and SMEs that offer big data analytics and decision making solutions; fostering administrative simplification to enable companies to submit information to a single public administration; to support big data SMEs in their internationalization process, e.g. reimburse young companies when they move to international market; to develop and promote an education system able to answer the specific needs of big data companies. *Laurent Probst et al.: Big Data Analytics & Decision Making, Directorate-General for Enterprise and Industry, Directorate B Sustainable Growth and EU 2020, Unit B3 Innovation Policy for Growth, September 2013 18

Legal: Specific Legal Aspects for Space Big Data Big Data s increasing economic importance also raises a number of legal issues: Ownership of data Data protection law Copyright Contractual and Liability problems Who owns a piece of data and what rights come attached with a dataset? What defines fair use of data? Who is responsible when an inaccurate piece of data leads to negative consequences? Such types of legal issues will need clarification, probably over time, to capture the full potential of big data*. *McKinsey Global Institute: Big data, the next frontier for innovation, competition and productivity, 2011 19

Business: Challenges for Companies in the Field of Big Data from Space Organizations capitalizing on Big Data differ from traditional data analysis in three ways*: 1. They pay attention to data flows as opposed to stocks. 2. They rely on data scientists and product and process developers rather than data analysts. 3. They are moving analytics away from IT function and into core business, operational and production functions. *Davenport, T.H., Barth, P. and Bean, R. How: Big Data is Different. MIT Sloan Management Review, July 2012 20

Business: Challenges for Companies in the Field of Big Data from Space Companies can have different positions in the Big Data Ecosystem*: Established User Enterprises Data Generators and Providers Technology Providers Collaborative networks *S. Loekken, J. Farres: ESA Earth Observation Big Data R&D Past, Present, & Future Activities, Ground Segment and Mission Operations Department, Earth Observation Programmes Directorate, March 2014 21

Skills: The Demand for Specialists Qualified for Working with Big Data In order to leverage the potential of Big Data, a key challenge for Europe is to ensure the availability of highly and rightly skilled people: Data Scientists - solid knowledge in statistical foundations and advanced data analysis methods combined with a thorough understanding of scalable data management, with the associated technical and implementation aspects; deliver novel algorithms and approaches for the Big Data Value stack in general, such as advanced learning algorithms, predictive analytics mechanisms, etc. Data Engineers - develop and exploit techniques, processes, tools and methods for developing applications that actually turn data into value; understand the domain and the business of the organizations; bring knowledge and work at the intersection of technology, application domains and business. In order to educate and train Data Engineers, novel courses and forms of training are required*. *ESA @ ASI: Big Data, IT Technology and their Impact on EO in Europe, Nov 2013 22

Discussion Topics Q1. What aspects do you consider that are the most important to be addressed within a future consistent strategy on big data (e.g. data access, harmonization at EU / international level, education, etc.? Q2. Do we have a proper view on the requirements? If not, who is supposed to generate the requirements? Q3. Who is who - what roles for which type of organization? For example, should private organizations perform data archiving? Q4. How value-added service delivery will change in the big data era? Q5. Space exploration, world round, is carried out mostly by national governments, and this data hosted on government servers*. How do you see the future of space open data? Could selling or outsourcing this data have significant repercussions on national security, especially in politically charged times such as this one? If yes, what is to be seen as alternative solution? Q6. Big Data, as a general domain, is a USA dominated play-ground?** European ICT companies have a backlog (1 to 2 years) compared to the USA. European scientific institutions are also relatively late to become involved in Big Data research. There are only a few Big Data technology suppliers in Europe, which is a reason of concern for EU. How do you see the situation & future evolution of this European backlog for the particular domain of Space Big Data? Is collaboration with US desirable? If yes, in which way? *A. Santhanam: The Data Behind Deep Space Exploration, October 30, 2014, Dataconomy Media GmbH 2014 ** Pierre-Yves DANET et al.: Big and Open data Position Paper, Networked and Electronic Media, December 2013 23

Discussion Topics Q7. What are current main challenges for SMEs in using satellite data for providing services? Q8. Buy or build the needed technology? When your company is developing a Big Data project, how does the Manager and the Information Officer choose the right solution that confers a competitive advantage?* Which are the plusses and the minuses to be considered? Q9. Open source versus proprietary source business models. Most of the supporting tools and storage architectures are now Open Source (Hadoop, Hive, Spark, Shark, HBase, Riak, Titan, etc.), leveling the playing field for tool vendors in this field. Opting for the open source route, however, comes with its own set of difficulties. What is the model your company uses? What is the rationale? Q10. Training the internal Big Data specialists. Have the joint research & innovation projects, between academia and your company, proved to be a good way to foster knowledge exchange, thus delivering experience about cutting-edge technology? At what level is the Big Data specialist formation to be addressed: education (university level) or training? Q11. Is Big Data really within the reach of SMEs? Are the cheap commodity hardware and open source software, together with Big Data cloud solutions, enough to ensure that? If not, what is it to be done? AOB Next steps/recommendations Conclusions * Nicole Laskowski: The big data architecture dilemma for CIOs, August 2014 24

Thank you for your attention! ASRC Dr. Florin Serban florin.serban@asrc.ro http://asrc.ro/ 25