Themes: Challenges of Big Data International Academia Federal Government & Industry NASA Earth Science Data



Similar documents
How cloud computing can transform your business landscape

Big Data: Overview and Roadmap eglobaltech. All rights reserved.

The Matsu Wheel: A Cloud-based Scanning Framework for Analyzing Large Volumes of Hyperspectral Data

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum

Cloud Sobriety. Technical challenges in mapping Informatics to the cloud. Chris Dagdigian 2010 NHGRI Cloud Workshop

July 2014 Federal Cloud Computing Summit Summary

50x Zettabytes*

Google Apps. Google Apps. On Steroids. Extend Google Apps to your directory services. Extend Google Apps to your directory services

Implementing Hybrid Cloud at Microsoft

How To Use Arcgis For Free On A Gdb (For A Gis Server) For A Small Business

Accelerate Your Enterprise Private Cloud Initiative

1 st Symposium on Colossal Data and Networking (CDAN-2016) March 18-19, 2016 Medicaps Group of Institutions, Indore, India

Cloud Computing. What does it really mean for your business?

The world is going digital

MOVING COLORADO TO THE CLOUD A BUSINESS CASE

3rd International Symposium on Big Data and Cloud Computing Challenges (ISBCC-2016) March 10-11, 2016 VIT University, Chennai, India

CLOUD COMPUTING INTRODUCTION HISTORY

The Case for Cloud Computing: Business Advantages and Costs

The Future of Data Management

Smart Cities Challenge ASSOCIATION FOR COMMUTER TRANSPORTATION BEST WORKPLACES FOR COMMUTERS

How Server And Network Virtualization Make Data Centers More Dynamic

NAVIGATING THE BIG DATA JOURNEY

Using GPUs in the Cloud for Scalable HPC in Engineering and Manufacturing March 26, 2014

TURNING THE SHIP DIGITAL GOVERNMENT

Integrated Risk Management System Components in the GEO Architecture Implementation Pilot Phase 2 (AIP-2)

Big Data & the Cloud: The Sum Is Greater Than the Parts

IDC MaturityScape Benchmark: Big Data and Analytics in Government

SPEED AND EASE Spreadsheets. Workflow Apps. SECURITY Spreadsheets. Workflow Apps

Ten Mistakes to Avoid

The Massachusetts Open Cloud (MOC)

P4.1 Reference Architectures for Enterprise Big Data Use Cases Romeo Kienzler, Data Scientist, Advisory Architect, IBM Germany, Austria, Switzerland

Understanding ArcGIS Deployments in Public and Private Cloud. Marwa Mabrouk

Five best practices for deploying a successful service-oriented architecture

Overview of NASA Applied Remote Sensing Training Program on Water Resources and Disaster Management

Data Semantics Aware Cloud for High Performance Analytics

Matsu: An Elastic Cloud Connected to a SensorWeb for Disaster. (Session 12F Working Group: Cloud Computing for Spacecraft Operations)

Increase Agility and Reduce Costs with a Logical Data Warehouse. February 2014

Archiving and the Cloud: Perfect Together

ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V

NEXT GENERATION ARCHIVE MIGRATION TOOLS

Introduction to Cloud Computing

Technology Enablement

INTRODUCING CLOUD POWER

Internet of Things. Opportunity Challenges Solutions

How cloud computing can transform your business landscape.

The Future of Census Bureau Operations

How To Improve Cloud Computing

Big Data Success Step 1: Get the Technology Right

Cloud Business Value. Francis Magann Senior Customer Solutions Architect. 20 th April 2011

BIG DATA FUNDAMENTALS

Introduction to Cloud : Cloud and Cloud Storage. Lecture 2. Dr. Dalit Naor IBM Haifa Research Storage Systems. Dalit Naor, IBM Haifa Research

AWWA/WEF Information Management & Technology Conference, March 4-7, 2007, Austin, TX

Private Cloud for the Enterprise: Platform ISF

PROCESSING & MANAGEMENT OF INBOUND TRANSACTIONAL CONTENT

The following was presented at DMT 14 (June 1-4, 2014, Newark, DE).

USEReady Introduction

EMERGENCY MANAGEMENT BRITISH COLUMBIA A STRATEGY TO ADVANCE SUPPORT FOR LOCAL AUTHORITY EMERGENCY MANAGEMENT PROGRAMS OCTOBER 14, 2015

IT Enterprise Services. Capita Collaboration Suite Unified Communications in the Cloud

Your asset is your business. The more challenging the economy, the more valuable the asset becomes. Decisions are magnified. Risk is amplified.

IBM Software Hadoop in the cloud

Simplify Your Windows Server Migration

Disaster Preparedness: A Shared Responsibility

Introduction to Cloud Computing

WHITE PAPER OCTOBER Unified Monitoring. A Business Perspective

Ten Cornerstones of a Modern Data Warehouse Environment

Cloud Service Model. Selecting a cloud service model. Different cloud service models within the enterprise

Implementing Oracle BI Applications during an ERP Upgrade

Microsoft Research Worldwide Presence

Q&A: The Many Aspects of Private Cloud Computing

Transform your Business with VMware

IDC MaturityScape Benchmark: Big Data and Analytics in Government. Adelaide O Brien Research Director IDC Government Insights June 20, 2014

Maginatics Cloud Storage Platform A primer

IBM Tivoli Netcool network management solutions for SMB

Cisco Unified Communications and Collaboration technology is changing the way we go about the business of the University.

Master Your Data and Your Business Using Informatica MDM. Ravi Shankar Sr. Director, MDM Product Marketing

Moving to the Cloud: A Practical Guide Community IT

Streamlining the move to the cloud. Key tips for selecting the right cloud tools and preparing your infrastructure for migration

Marathon Information Management Program

BBC Technology Strategy

Whitepaper: Creating an ECM Advisory Board and Program Charter

Joint Polar Satellite System (JPSS)

APPLICATION REPLATFORMING : MIGRATION AND MODERNIZATION

Hadoop in the Hybrid Cloud

Software-Defined Networks Powered by VellOS

Kaseya IT Automation Framework

Big Data Efficiencies That Will Transform Media Company Businesses

Implementing Oracle BI Applications during an ERP Upgrade

A Sumo Logic White Paper. Harnessing Continuous Intelligence to Enable the Modern DevOps Team

Service Oriented Architecture(SOA)

Oracle Fusion Project Portfolio Management CLOUD SERVICE. The New Standard for Project Portfolio Management

CIS 4930/6930 Spring 2014 Introduction to Data Science Data Intensive Computing. University of Florida, CISE Department Prof.

SURVEY REPORT DATA SCIENCE SOCIETY 2014

White paper: Unlocking the potential of load testing to maximise ROI and reduce risk.

Enabling Science in the Cloud: A Remote Sensing Data Processing Service for Environmental Science Analysis

Symposium on the Interagency Strategic Plan for Big Data: Focus on R&D

7 INSIGHTS FROM OUR 2014 CLOUD ADOPTION SURVEY

WHAT S ON YOUR CLOUD? Workload Deployment Strategies for Private and Hybrid Clouds RESEARCH AND ANALYSIS PROVIDED BY TECHNOLOGY BUSINESS RESEARCH

Master of Science in. Sustainability Management

Transcription:

Themes: Challenges of Big Data International Academia Federal Government & Industry NASA Earth Science Data 1

Although Unique, NASA still faces the similar challenges as other Big Data users... Federal, Industry and Academia NASA s Big Data challenges go beyond the stereotypical Big Data problem-space -- involving warehousing and mining of unstructured transactional business data. This should be no surprise to the ESIP community. Many of you have significant day-to-day experience with Big Data and may be ahead of NASA on the innovation curve. Not to say that we don t have unstructured data, but our key assets are our Science Data Sets especially our earth science. Many of our big data sets are describedbysignificant metadata, but on a scale that challenges current and future data management practice. TRMM Satellite Data 2

Getting the right data to the right people, fordecision support in real time. NASA s data requirements demands collaboration beyond our current working relationships to include leveraging capabilities and resources across the Federal Government, Industry, Academia, and NASA Centers to push the envelope toward new capabilities. 3

Making real-time data available to disaster workers. Whereare the hardest-hit areas, how can we get in to help? What will we need to bring with us? How is the situation changing? Sendai Japan, March 14, 2011 downloadgeotifffile (12 MB, TIFF, 3600x2400) 4

High stress eco systems respondto so many variables Accuracy matters. What if the data sets could help to predict their behavior? Through detailed models we can help build ecological resilience and sustainability. Map of Missouri and Mississippi river confluence around St Louis, Missouri 5

Should I buy this house or that? Whatkind of risk am I taking by wanting to build here? Using the latest data, helpto give the answers to Why? Incredibly valuable to urban planners and civil engineers. Insurance companies can use up-to-date and relevant data, reflecting true risk. Spring 2011 Floods (Springfield) EO-1 6

With more intuitive real-time tools, researchers can easily 'see' the phenomena and students can quickly grasp/master the knowledge. Help the educational community to tackle real problems, on a scale that they can handle. Training the next generation of researchers through interdisciplinary tools and data with a lower burden. 7

8

The dialogue must begin and end with Data Sensitivity Challenges of Moving to the Cloud Initial infrastructure resource $$$ investment is potentially significant Equipment acquisition Software retooling to run efficiently in the cloud IT Security Costs Public/Private Initial risks are high not everything will work Some cloud technologies may not develop into production-ready capabilities Some science data experiments may not be well suited to cloud technologies New Skill Sets Required Creating applications in the Cloud migrating legacy applications to the Cloud Programming skills, performance management techniques still need to be learned Agility in the cloud will be an important skill 9

The rewards can be quite fruitful. The Proof of conceptefforts Help to define how deep the rewards can be. What the cost advantages can be. Understand what duplication can be mitigated. Initial resource investment is significant, however will save money in the long term, and save HPC resources for highly complex models Optimizing use of cloud resources is a new skill a skill that once gathered will be available for other institutional projects You get more than just the capability at the end. 10

Leveraging Collaboration Within the organization private cloud, outside the organization public cloud. Share experience and findings, lessons learned, above all, best practices 11

Complementing traditional computing Cloud technology fosters collaboration between data providers and compute providers. Cloud is adaptable complement to HPC Also, extendsthe capability tothe average scientists at universities, eventually even citizen scientists. Discovery Cluster 12

Form collaboration opportunities across organizational boundaries: Shared risk/shared technical skills to build shared knowledge and experience and relationships Reduces the entry barriers by limiting startup cost and sharing performance risks Reach within your organization as well as outside 13

We want to get to a sustainable operational state. New designs for data architectures, including clouds. Collaborative Sustainable outcomes are the ideal outcomes. Policy drives the approach through Proof of Concept efforts 14

Start with small experiments well defined and measured Builds experience and relationships Examples of this can be found in NASA Goddard s internal experiments 15

Image of the Aelutian islands by TERRA MODIS 16

Goddard Data Storage Cloud Pilot Project To further explore and address the Big Data problems, a Proof of Concept is formed among different Goddard units. OpenDAPand Giovanni will be ported into a Goddard Data Storage Cloud to test End-to-end cloud solution vs. traditional client-server solutions Storage (volume & speed) bottlenecks Communication(networking) bottlenecks Processing (e.g., spatiotemporal subsettingand format/projection transforming) bottlenecks On demand support to Big Data access Image from TERRA MODIS 17

How can we begin to think of our data sets as a greater Data asa Service Prepare for a NASA Data as a Service (DaaS) Establish sustainable collaborations among difference units/organizations to share resources (people, environment, tools) Form a requirements, R&D, solutions, feedback cyclic process for addressing Big Data challenges Facilitate releasing the potential of Big Data Provide an elastic and pay-as-you-go solution for Big Data Many needs, many data sets 18

Working Cloud Beyond Earth Science Data Goddard Persistent Virtual Desktop Working with IBM to pilot a persistent virtual desktop Goddard Heliophysicscommunity working with ITCD to explore possibilities for science virtual desktop Could lead to the development of suites of collaborative science analysis applications as a service in the cloud, much the same way that Google Docs provides a collaborative office suite in the cloud Image: SDO 19

Across Federal Government National Science Foundation soliciting proposals for Core Techniques and Technologies for Advancing Big Data Science & Engineering Industry Using Space Act Agreement to work with IBM on Persistent Virtual Desktop Microsoft has offered to beta test new virtual machine environments to support larger data volumes ISS Photo 20

Work to leverage outside resources NASA resources are tight and risk tolerance is low Working to leverage outside resources, such as industry and academia to pilot innovation efforts to solve NASA s big data challenges Within NASA Potential for additional DSC experiments as team builds successes Other teams at GSFC are interested in leveraging cloud technology Other NASA Centers looking at data storage capability Outside NASA You? Grand Canyon, Terra Satellite (composite 3-D view) 21

I know that there is a good bit of knowledge in these areas. The community can benefit from the sharing ofexperience in these areas to facilitate sustainable leveraging of Big Data Leverage NASA s data in a way we never have in the past Challenge to industry and academia More opportunities exist between Federal Partners, NASA,industry, and academia need to expand those relationships If there are things that you can contribute that will make our offering better Our work solving these problems should also be collaborative ISS Photo 22

23