Volunteer Crowd Computing and Federated Cloud developments. David Wallom



Similar documents
Trials community. Yannick Legré. EGI InSPIRE RI

The EGI pan-european Federation of Clouds

Challenges in Hybrid and Federated Cloud Computing

Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI

Enabling cloud for e-science with OpenNebula

Fujitsu Cloud IaaS Trusted Public S5. shaping tomorrow with you

StratusLab project. Standards, Interoperability and Asset Exploitation. Vangelis Floros, GRNET

How an Open Source Cloud Will Help Keep Your Cloud Strategy Options Open

Size and Development of the Shadow Economy of 31 European and 5 other OECD Countries from 2003 to 2015: Different Developments

OpenNebula Leading Innovation in Cloud Computing Management

SUSE OpenStack Cloud 4 Private Cloud Platform based on OpenStack. Gábor Nyers Sales gnyers@suse.com

OpenNebula Open Souce Solution for DC Virtualization

The OpenNebula Cloud Platform for Data Center Virtualization

OpenNebula Open Souce Solution for DC Virtualization

CLOUD MANAGEMENT GUIDE

Getting Started Hacking on OpenNebula

Infrastructure as a Service (IaaS)

Argonne National Laboratory

Cloud services in PL-Grid and EGI Infrastructures

Cloud Computing Architecture with OpenNebula HPC Cloud Use Cases

Enterprise Mobility Suite (EMS) Overview

OpenNebula Open Souce Solution for DC Virtualization. C12G Labs. Online Webinar

T Mobile Cloud Computing Private Cloud & Assignment

INDIGO-DataCloud Wupi 4 (Resource Virtualization)

Electricity, Gas and Water: The European Market Report 2014

Learn How to Leverage System z in Your Cloud

cloud functionality: advantages and Disadvantages

Workprogramme

CloudCenter Full Lifecycle Management. An application-defined approach to deploying and managing applications in any datacenter or cloud environment

Scientific and Technical Applications as a Service in the Cloud

Top 5 Reasons to choose Microsoft Windows Server 2008 R2 SP1 Hyper-V over VMware vsphere 5

Cloud Computing for Forest Fire Management. Dr. Nikos Athanasis, Prof. Kostas Kalabokidis University of the AEGEAN

<Insert Picture Here> Infrastructure as a Service (IaaS) Cloud Computing for Enterprises

NFON Cloud Telephone System Product Overview. The Future of Business Communications. nfon.com

Agenda. Company Platform Customers Partners Competitive Analysis

VIRTUAL PRIVATE CLOUD FOR ENTERPRISES

Turnkey Technologies- A Closer Look

Ernst Rauch Munich Re 18 October 2011

Intelligent Inventory and Professional License Management

Big Data and the Earth Observation and Climate Modelling Communities: JASMIN and CEMS

EMI views on Cloud Computing

Copernicus Atmosphere Monitoring Service (CAMS) Copernicus Climate Change Service (C3S)

Six greenhouse gases covered by the United Nations Framework Convention on Climate Change (UNFCCC) and its Kyoto Protocol are:

T-SYSTEMS Cloud STORY

Geoff Raines Cloud Engineer

European Data Centre Services Market Growing Popularity of the Outsourcing Model Fuels Growth of Retail Colocation and Managed Hosting

NIST Big Data PWG & RDA Big Data Infrastructure WG: Implementation Strategy: Best Practice Guideline for Big Data Application Development

FIWARE Lab Solution for Managing Resources & Services in a Cloud Federation

ONE Cloud Services Secure Cloud Applications for E-Health

Intel IT Cloud Extending OpenStack* IaaS with Cloud Foundry* PaaS

IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures

solutions EFFIFUEL TM From a company of Michelin group

SAP HANA Cloud Platform. Technical Overview Uwe Heinz

TUT5605: Deploying an elastic Hadoop cluster Alejandro Bonilla

Cloud Computing and Amazon Web Services

Hadoop in the Hybrid Cloud

Steven Newhouse, Head of Technical Services

Dell Cloud Services. Services

AMI-Partners. SMB Managed Services Assessment Opportunities & Actionable GTM Insights

Cloudian delivers object storage for next generation infrastructures

2) Xen Hypervisor 3) UEC

Implementing Microsoft Azure Infrastructure Solutions

Introducing Red Hat Enterprise Linux. Packaging. Joe Yu. Solution Architect, Red Hat

CLOUDFORMS Open Hybrid Cloud

Building Storage-as-a-Service Businesses

Big Data Infrastructures for Processing Sentinel Data

Manjrasoft Market Oriented Cloud Computing Platform

Oracle Virtualization Strategy and Roadmap

Cloud OS. Neue Geschäftsmodelle mit Microsoft Lösungen für Hoster und Service Provider. Windows Server & Windows Azure

Helix Nebula: Secure Brokering of Cloud Resources for escience. Dr. Jesus Luna Garcia

<Insert Picture Here> Cloud Archive Trends and Challenges PASIG Winter 2012

Transcription:

Volunteer Crowd Computing and Federated Cloud developments David Wallom 1

Overview Crowd Computing through Climateprediction.net Cloud Computing with the EGI Federated Cloud Utilising cloud resources for Crowd Computing 2

The world s largest climate modeling facility >300,000 registered volunteers, ~40,000 active, 127M model-years 3

Climate modelling with distributed computing Disadvantages: Limited diagnostics & resolution. You make all your mistakes in public. Advantages: Effectively unlimited ensemble size. You make all your mistakes in public. 4

What has distributed computing allowed us to do that we would not have done otherwise? 5

Unlimited ensemble size: exploring uncertainties in climate predictions Results of the BBC Climate Change Experiment: Rowlands et al, Nature Geosci., 2012 6

The weather@home regional modelling project (with Microsoft Research, the Risk Prediction Initiative and Environment Guardian) High impact weather events are typically rare and unpredictable. They also involve small scales. Modify boundary conditions to mimic counter-factual world that might have been. Resolution provided by nested regional model. Compare numbers of events in the 1960s and 2000s to explore trends. 7

And that grew into Weather@home - Regional climate modeling projects Weather@home 2014: the causes of the UK winter floods Weather@home ANZ 2013: the causes of recent heatwaves and drought in Australia and New Zealand Weather@home Climate Accountability: the causes of extreme heat in the Western US Weather@home ACE-Africa Weather@home: High Resolution 2003 European Heatwave Forest Mortality, Economics, and Climate in Western North America (FMEC) 8

Photo: Dave Mitchell What is the role of increased greenhouse gas levels in UK autumn/winter flood events? South Oxford on January 5th, 2003 9

Photo: Dave Mitchell What is the role of increased greenhouse gas levels in UK autumn/winter flood events? South Oxford on July 24th, 2007 10

Photo: Dave Mitchell What is the role of increased greenhouse gas levels in UK autumn/winter flood events? Oxfordshire has flooded before so how do we quantify the role of human influence? South Oxford on July 24th, 2007 11

An accidental spin-off: the Pall et al experiment Aim: to quantify the role of increased greenhouse gases in precipitation responsible for 2000 floods. Challenge: relatively unlikely event even given 2000 climate drivers and sea surface temperatures (SSTs). Approach: large (multi-thousand-member) ensemble simulation of April 2000 March 2001 using forecast-resolution global model (90km resolution near UK). Identical non-industrial ensemble removing the influence of increased greenhouse gases, including attributable SST change, allowing for uncertainty. Still operated as a historical experiment 12

Risks of floods in the Pall et al ensemble A flood that happened and one that did not 40% decrease in risk 100% increase in risk Pall et al and Kay et al, 2011 13

14

Weather@home 2014: the causes of the UK winter floods Generated at peak over 1.5TB data per day in returning models Academic paper quality needed 120k runs for increased statistics 15

The changing mode of Climateprediction.net Original focus on multi-decade prediction: challenging results, and challenging experiment. Increasing interest in event attribution Timeliness of results and processing becomes important, Science as a Service? Can we operate in a forecast mode for likelihood of extreme events? Concern though that new experiments could become volunteer/crowd limited Science as a Service would imply measurable deliverables including time to results 16

The EGI Federated Cloud EGI-InSPIRE RI-261323 www.egi.eu

Value proposition The EGI Federated Cloud, a federation of institutional private Clouds, offering Cloud Services to researchers in Europe and worldwide A single cloud system able to Scale to user needs Integrate multiple different providers to give resilience Prevent vendor lock-in EGI-InSPIRE RI-261323 www.egi.eu

EGI Cloud Infrastructure User Community Col l ab or at i on T ool s EGI Image EGI Cloud Service Marketplace Application DB Repository E GI EGI Cloud Infrastructure Platform Storage Managemen t Instance Mgmt OCCI CDMI Management Stacks OVF Cloud (OpenStack, OpenNebula, Synnefo, ) Service Registry GLUE 2 Information Discovery Help and Support GSI Federated AAI Security Coordination SAM UR Monitoring Training and Outreach Accounting Sustainable Business Models EGI Core Platform EGI-InSPIRE RI-261323 19 www.egi.eu

Geographical dispersion 9/14 12 countries provide 19 certified resources Czech Republic, Germany, Greece, Hungary, Italy, Macedonia, Poland, Slovakia, Spain, Sweden, Turkey, United Kingdom * Not shown on map EGI-InSPIRE RI-261323 www.egi.eu

Federated Cloud Services VRE Tier 4: Zero ICT Infrastructures Virtual elaborato ry PaaS Federated IaaS and STaaS Cloud EGI-InSPIRE RI-261323 Platform as a Service Tier 2: Key Mgmt Encrypt ion ACL mgmt Hadoop aas DB aas P aas Secure storage Tier 3: General-purpose platform services Tier 1: Reliable Infrastructure Cloud 21 www.egi.eu

EGI FedCloud Communities 5/2014 Ecology BioVeL: Biodiversity Virtual e-laboratory Structural biology WeNMR: a worldwide e-infrastructure for NMR and structural biology Linguistics CLARIN: British National Corpus service (BNCWeb) Earth Observation SSEP: European Space Agency s Supersites Exploitation Platform for volcano and earthquakes monitoring (Collaboration with Helix Nebula) Software Engineering SCI-BUS: simulated environments for portal testing Software Engineering DIRAC: deploying ready-to-use distributed computing systems Software Engineering Catania Science Gateway Framework Musicology Peachnote: dynamic analysis of musical scores Earth Observation ENVRI: Common Operations of Environmental Research infrastructures (collaboration with EISCAT3D) Geology VERCE: Virtual Earthquake and seismology Research Ecology LifeWatch: E-Science European Infrastructure for Biodiversity and Ecosystem Research High Energy Physics CERN ATLAS: ATLAS processing cluster via HelixNebula More info: https://wiki.egi.eu/wiki/fedcloud-tf:users EGI-InSPIRE RI-261323 22 www.egi.eu

New EGI FedCloud Communities since launch Education Cranfield University distributed systems course Cultural Heritage DCH-RP management of preservation services in the cloud Hydrological Modelling Running Hydrological models to support real time analysis Bioinformatics ELIXIR execution of the Ensamble application in the Federated Cloud environment Systems implementations deployment of FTK developed tools and services and data preservation Internet of Things Smart Grid systems investigation Software Development deployment of research PaaS RNA Sequencing deployment of analysis engines in the cloud Physiological Modelling Calibration, scenario mapping and development More info: https://wiki.egi.eu/wiki/fedcloud-tf:users EGI-InSPIRE RI-261323 23 www.egi.eu

Crowd and Cloud? 24

Where can the cloud help? 1. 2. 3. 4. 5. 6. Volunteers Short term demand response due to multiple models launched concurrently exceeding volunteer capacity Volunteers Demand for high resolution models exceeds standard volunteers systems [e.g. multicore coupled models] Volunteers Supporting 3 platforms is time consuming and difficult, distribute VMs Servers Large numbers of high resolution models generate vast amounts of data -> upload servers in the cloud with coupled analysis servers Servers Resilience of central core services becomes more important with global partners Servers New Crowd Computing project setup accelerated through roll out of standard BOINC setup. 25

Demand Response Distribute in the cloud multiple cloud volunteer BOINC clients Simplified since no console -> no graphics Must not put off or make it appear replacing volunteers 26

High resolution Next generation coupled models designed for multi/many core Some implementations of multicore support are platform dependant Current efficiency gained through utilising multi-core for separate models 27

Multi-Platform Supporting 3 platforms [Windows, Linux and Mac] becomes time consuming Simplifying the application creation procedure is key to supporting multiple models Distribution of apps as VMs or even containers? 28

Data Storage? Individual experiment can exceed 20TB New SciaaS applications could increase by OoM Need scalable storage services coupled with analysis capability to prevent undue movement of BIG DATA 29

Resilient core servers CPDN/WatH has global partners, launching work on their own timescales Core services must be reliable to support this Scale hybrid cloud from current VMWare located services to external cloud services 30

Server Infrastructure Oxford Volunteer Computing group supporting multiple proposals for Crowd Computing Some funders require exploratory work to show promise before funding Connect cloud volunteers to cloud server stack to accelerate time to start 31

Conclusions CPDN moving towards SciaaS or at least Data sets aas Federated cloud services are a reality across Europe through EGI Open standards give user confidence to build tools and services against known interfaces Cloud can support different aspects of Crowd Computing lifecycle 32