Open Cirrus: Enabling System Software Research in Computer Science
|
|
- Dominick Wilkerson
- 3 years ago
- Views:
Transcription
1 OpenCirrus: Enabling System Software Research in Computer Science through Clusters of Bare Metal Michael Kozuch Intel Labs, Pittsburgh November 2011
2 2 Motivation
3 Connecting the Cloud to Open Research Cloud Computing Clearly important whatever it is! Industry developing rapidly Research still plays a role in disruptive advances Open Communities enable rapid innovation Exchange of ideas: Knowledge grows Constructive Darwinism: Best tools survive/evolve Cloud researchers need access to experimental tools at reasonable scale 3
4 Good News Many Options Open Cirrus Type of research Systems and services Approach Participants Federation of heterogeneous data centers CESGA, CMRI, CMU, CNIS, ETRI, HP, GaTech, GSTZ, Intel, IDA, KIT, MIMOS, RAS, UIUC, Yahoo! Google/ IBM cluster Data- intensive applications Cluster supported by Google and IBM Google, IBM, MIT, Stanford Univ., Univ. of Washington Distribution 15 sites Centralized one data center in Atlanta Teragrid PlanetLab Emulab Open Cloud Amazon EC2 PRObE Scientific applications Multisite heterogeneous clusters for supercomputing Many universities and organizations Systems and services Nodes hosted by research institution Many universities and organizations 11 partners in US More than 700 nodes worldwide Systems Single- site cluster with flexible control Univ. of Utah More than 300 machines Interloperability across clouds using open APIs Multisite heterogeneous clusters Commercial use Systems Raw access to virtual machines Reuse of LANL s retiring clusters Four centers Amazon CMU, LANL, NSF 480 cores distributed in four locations Several unified data centers Each test bed has strengths and weaknesses, need to match the resource to the research Thousands of older but still useful nodes at one site 4 Open Cirrus: A Global Cloud Computing Testbed, IEEE Computer, April 2010
5 Agenda Motivation Overview of Open Cirrus * Organization of Open Cirrus Open Cirrus Software Services Example Open Cirrus Site Closing Thoughts 5
6 6 Overview of Open Cirrus
7 Open Cirrus * Cloud Computing Testbed Collaboration between industry and academia, sharing hardware infrastructure software infrastructure research applications and data sets UIUC* KIT* ISPRAS* CMU* GaTech* CESGA* MIMOS* IDA* ETRI* China Telecom* Chinese Acad. Sci.* China Mobile* 7 Sponsored by HP, Intel, and Yahoo! (with additional support from NSF) 15 sites currently, target of around 20 in the next two years
8 Open Cirrus * Objectives Foster systems research around cloud computing Vendor-neutral open-source stacks and APIs for the cloud Expose research community to enterprise level requirements Provide realistic traces of cloud workloads How Open Cirrus differ from typical cloud services Support for systems research and applications research Federation of heterogeneous datacenters Collection of interesting data sets 8
9 9 Organization of Open Cirrus
10 Open Cirrus * Management Central Management Office, oversees Open Cirrus Currently run by HP Labs with Dejan Milojicic as Director Each site Runs its own research and technical teams Contributes individual technologies Operates some of the global services Contributes a member to the Steering Committee Steering Committee Provides long-term guidance to the effort Independently-managed sites providing a cooperative research testbed 10
11 User Access to Open Cirrus * User access is organized around Research Projects Led by Principal Investigator (PI) Project PIs apply to each site separately Identifying additional team members Contact information for applications to each site are available on the Open Cirrus Web site ( Each Open Cirrus site decides which users and projects get access to its site. 11
12 Open Cirrus * Research Projects Example research areas of interest Datacenter federation Datacenter management Web services Data-intensive systems Projects typically not of interest Traditional HPC app development Production apps looking for free cycles Closed-source system development 12
13 Open Cirrus Summits 13 Past summits, 2009-now HP, Palo Alto, 6/2009; Yahoo, Sunnyvale, 1/2010; ETRI, Seoul, Korea, 6/2010; Intel/CMU, Pittsburgh, 10/2010 Summits in 2011 [Fifth] Moscow, June 1-3 & [Sixth] Atlanta, October Formal events, IEEE Technically Co-sponsored >40 papers submitted, 17-member program committee ~25 accepted papers, ~40 posters Each was 2 intense days of interactive exchange Future summits 7th Summit, Beijing, China, May 2012 Workshop on Cloud Computing Testbeds in September 2012 at ICAC (San Jose, CA)
14 2011 Summit Papers Hadoop in the cloud A Study of Skew in MapReduce Applications, Univ. Washington (Best student award) Volume Management of Hadoop DataNode, ChinaMobile Linux PageCache Optimization for Hadoop, Yahoo, India Clusterken: A Reliable Object-Based Messaging Framework to Support Data Center Processing, HP Experiences and apps Experience Using Open Cirrus for a Security-Aware Cloud-Based Business Process Modeling and Execution Environment, UFPE Brazil, and HP Supporting Test-Driven Development of Web Service Choreographies, USP Brazil, HP High-Performance Algorithm for the Coupled Conductive-Radiative Heat Transfer Problems, Far Eastern Federal University, Russia Distributed Cloud Storage Services with FleCS Containers, Georgia Tech, CERCS Lessons from Adopting Cloud-like Architectures in Real-life Financial Applications, Singapore Management University
15 2011 Summit Papers, Cont. Measurements High-Performance Testing, ISP RAS, Russia Virtual Machine based Disk I/O Bandwidth Controller, ETRI, Korea Software performance estimation in a virtualized environment based on atomic tests, Lomonosov Moscow State University, Russia Adding a silver lining to the cloud Controlling Traffic Ensembles in Open Cirrus, HP Practical Cloud Federations, HP and Dortmund University Cloud Sustainability Dashboard, HP Cloud technology and management An Open-source Cloud Management Platform Comparison, CESGA, Spain CAS Data Cloud: Integrated services of data, middleware and infrastructure, Chinese Academy of Sciences
16 2011 Summit Papers, Cont. Systems in the cloud Distributed, Robust Auto-Scaling Policies for Power Management in Compute Intensive Server Farms, Anshul Gandhi, Mor Harchol-Balter, Ram Raghunathan and Michael Kozuch CMU, Intel Labs An Open Job Scheduling Service for Large-Scale Data Processing, Zhenghua Xue, Jianhui Li, Yuanchun Zhou, Yang Zhang and Geng Sheng CNIC Using Active NVRAM for Cloud I/O, Sudarsun Kannan, Dejan Milojicic, Ada Gavrilovska, Karsten Schwan, Hasan Abbasi and Vanish Talwar GaTech, HP Labs, ORNL Applications in the cloud The Application and Practice of Parallel Cloud Computing in ISP, Zhilan Huang, Guoliang Yang and Shengyong Ding (China Telecom) Globally Distributed BookPrep, Dejan Milojicic, Shariff Dudekula, Susanth Puthanveedu and Prakash Reddy HP Labs US, India Running Interactive Perception Applications on OpenCirrus, Qian Zhu, Nezih Yigitbasi and Padmanabhan Pillai Accenture, TUDelft, Intel Labs Evaluation of HPC Applications on Cloud, Abhishek Gupta and Dejan Milojicic HP Labs, UIUC 16
17 17 Open Cirrus Software Services
18 Cloud Software Stack Key Learnings Enable use of application frameworks (Hadoop, Maui-Torque) Enable general IaaS use Provide Big Data storage service Enable physical resources allocation Application Frameworks IaaS Why Physical? 1. Virtualization overhead 2. Access to phys resource 3. Security issues Storage Service Resource Allocator Node Node Node Node Node Node
19 Key Software Services Zoni: Used to allocate mini-clusters to researchers. E.g. bare-metal access for power management projects HDFS: Distributed file system from Hadoop. Used to provide high bandwidth access to Big Data using OTS server systems. E.g. bare-metal access for power management projects Tashi: A VM instantiation service. Particularly useful to users with custom software stacks. E.g. MPI jobs Maui/Torque: Job submission service. Used primarily by users with straightforward applications. E.g. microarchitecture simulation runs Hadoop: Service for users who want to experiment with MapReduce. E.g. evaluation of new HDFS management strategies
20 Zoni Functionality Provides each project with a mini-datacenter Allocation Isolation Assignment of physical resources to users Allow multiple mini-clusters to co-exist without interference Provisioning Booting of specified OS Management Debugging OOB power management OOB console access Isolation of experiments Domain 0 PXE/DNS/DHCP Server Pool 0 Domain 1 DNS/PXE/DHCP Server Pool 0 Server Pool 1 Gateway
21 Access Model At a minimum, sites must expose a ssh gateway Sites may also provide additional external connections Some provision for web services is highly recommended Sites may also be divided into resource pools by service Some services may require a front-end machine (e.g. hadoop) Site Firewall/NAT exported web services port 80/443 Open Cirrus * cluster access port 22 ssh gw Front-end machines
22 22 Example Open Cirrus Site: Intel Labs Pittsburgh
23 45 Mb/s T3 to Internet 20 nodes: 1 Xeon (single-core) [Irwindale] 6GB DRAM 366GB disk 10 nodes: 2 Xeon 5160 (dual-core) [Woodcrest] 4GB RAM 2 75GB Disk 10 nodes: 2 Xeon E5345 (quad-core) [Clovertown] 8GB DRAM 2 150GB Disk Key: rxry=row X rack Y TOR Switch (sw0-r2r1) Blade Rack 1 Gb/s 40 nodes (x4x4 p2p) rxrycz=row X rack Y chassis Z (r2r1c1-4) 1 Gb/s (x4) 1 Gb/s (x4) TOR Switch (sw0-r2r2) Blade Rack 40 nodes 1 Gb/s Xeon E5345 (quad-core) [Clovertown] 8GB DRAM 2 150GB Disk (r2r2c1-4) TOR Switch (sw1-r1r2) 1 Gb/s (x4) (x4x4 p2p) 1 Gb/s x1 (x4) TOR (r1r1) 1 Gb/s Switch (sw0-r1r1) (x2x5 p2p) 1 Gb/s 1U Rack 15 nodes (x15 p2p) 2 Xeon E5420 (quad-core) [Harpertown] 8GB DRAM 2 1TB Disk 1 Gb/s (x4) TOR 1 Gb/s (x8) Switch (sw0-r1r2) 1U Rack 15 nodes Xeon E5420 (quad-core) [Harpertown] 8GB DRAM 2 1TB Disk 3U Rack 5 storage nodes TB Disk 1 Gb/s (r1r5) (x15 p2p) x1 (r1r2) TOR 1 Gb/s Switch (sw2-r1r2) 12 nodes Xeon X5650 (six-core) TOR Switch (sw0-m1) (x4) 10 Gb/s 1 Gb/s [WestmereEP] 48GB DRAM 6 0.5TB Disk 1 Gb/s (x4) (x15 p2p) 1 Gb/s (x8 p2p) 10G Switch (sw0-r3r3) TOR Switch (sw0-r?r?) 1 Gb/s 2U Rack 15 nodes (x15 p2p) Xeon E5440 (quad-core) [Harpertown] 8GB DRAM 6 1TB Disk x3 (r1r3,r1r4,r2r3) Mobile Rack 8 (1u) nodes Xeon E5440 (quad-core) [Harpertown] 16GB DRAM 2 1TB Disk 10 Gb/s TOR Switch (sw0-r3r?) 1 Gb/s 2U Rack 15 nodes (x15 p2p) Xeon E5520 (quad-core) [Nehalem-EP] 16GB DRAM 6 1TB Disk x2 (r3r2,r3r3)
24 24 Intel BigData Cluster Dashboard
25 Example Project 1: AutoScale Idea: Save data center power by turning machines off when load decreases Investigate the trade-off between power and response time Open Cirrus resources used: 2.5 racks 25
26 Our Data Center Unknown Power Aware Load Balancer Load Balancer DB key-value workload mix of CPU & I/O Mor Harchol-Balter, CMU 28 Application servers 7 Memcached Experimental platform deployed on Open Cirrus cluster at Intel Labs Pittsburgh 26
27 Metrics T 95 T avg P avg 95%-tile response time mean power consumption 400 ms 95% Resp. time (ms) Single Server arrival rate (req/s) AlwaysOn ON/OFF N = (max req. rate)/60 Leave N servers on always. N(t) = (req. rate(t))/60 Mor Harchol-Balter, CMU 60 req/s
28 AlwaysOn ON/OFF Num. servers --- load o n busy+idle x n busy+idle+setup Num. servers --- load o n busy+idle x n busy+idle+setup Time (min) Time (min) T 95 =291ms, T avg =137ms P avg =2,323W T 95 =11,003ms, T avg =3,537ms P avg =1,281W I m late, I m late! Mor Harchol-Balter, CMU
29 Towards Better Dynamic Provisioning: AutoScale Problems with Existing Policies: PROBLEM 1: SHUTTING OFF Quick to shut servers off to save power Then suffer huge setup lag. Fixes: Wait until server goes idle. Then wait some more Reduce #servers. by packing servers to SLA point.* PROBLEM 2: TURNING ON Existing policies turn on based on arrival rate. *Converges to optimal # spare servers, w/o knowing arrival rate Mor Harchol-Balter, CMU [Gandhi, Gupta, Harchol-Balter, Kozuch 2010]. Far more accurate & robust to use #jobs in system.
30 ON/OFF AutoScale Num. servers --- load o n busy+idle x n busy+idle+setup Num. servers --- load o n busy+idle x n busy+idle+setup Time (min) Time (min) T 95 =11,003ms, T avg =3,537ms P avg =1,281W T 95 =474ms, T avg =210ms P avg =1,387W Mor Harchol-Balter, CMU I m late, I m late! T 95 =291ms, T avg =137ms P avg =2,323W
31 Results of Implementation AlwaysOn ON/OFF AutoScale T 95 P avg T 95 P avg T 95 P avg 291 ms 2323 W 11,003 ms 1281 W 474 ms 1387 W 271 ms 2205 W 3,802 ms 759 W 466 ms 1016 W 289 ms 2363 W 4,227 ms 1,391 W 480 ms 1729 W 377 ms 2263 W > 1 min 849 W 556 ms 1412 W Mor Harchol-Balter, CMU
32 Example Project 2: Rabbit Idea: Develop a file system with the good properties of HDFS that can scale with load Open Cirrus resources used: 2 racks 32 Robust and Flexible Power-Proportional Storage, SOCC, 2010
33 Power Proportionality and Big Data 2000 Number of blocks stored on node i The Hadoop Filesystem (10K blocks) Possible power savings: ~66% ~0% Optimal: ~95% Node number i i=100
34 Rabbit Filesystem A reliable, power-proportional filesystem for Big Data workloads Simple Strategy: Maintain a primary replica 34
35 Example Project 3: CloudConnect Idea: Add inexpensive optical interconnect to the data center to improve bandwidth Optical links are circuit-switched. Open Cirrus resources used: 1 Rack 35 c-through: Part-time Optics in Data Centers, SIGCOMM, 2010
36 Current solutions for increasing data center network bandwidth FatTree BCube 1. Hard to construct 2. Hard to expand 36
37 Hybrid packet/circuit switched network architecture Electrical packet-switched network for low latency delivery Optical circuit-switched network for high capacity transfer Optical paths are provisioned rack-to-rack A simple and cost-effective choice Aggregate traffic on per-rack basis to better utilize optical circuits
38 Testbed setup 16 servers with 1Gbps NICs Emulate a hybrid network on 48-port Ethernet switch 100Mbps links Ethernet switch Optical circuit emulation Optical paths are available only when hosts are notified During reconfiguration, no host can use optical paths 10 ms reconfiguration delay 4Gbps links Emulated optical circuit switch 38
39 MapReduce sort 10GB random data Completion time (s) Electrical network 128 KB 50 MB 153s 100 MB 300 MB 500 MB 135s Full bisection bandwidth c-through c-through varying socket buffer size limit (reconfiguration interval: 1 sec) 39
40 Yahoo Gridmix benchmark 3 runs of 100 mixed jobs such as web query, web scan and sorting 200GB of uncompressed data, 50 GB of compressed data 40
41 41 Closing Thoughts
42 Where we are now Open Cirrus Beginning our second three-year effort Open Cirrus continues to grow Good research, good community events What could be better (particularly at Intel)? Scheduling idle VMs, idle nodes, etc Federation needs good use model Tracing could be incredibly valuable for research Better tools for creating/uploading images Unlimited resources 42
43 Summary Open Communities can shape the development of Cloud Computing Open Cirrus* is a multi-partner test bed for research in Cloud Computing The Open Cirrus software stack provides a good starting point for open-source cloud computing software development Hadoop, Tashi, Zoni are Apache Software Foundation* efforts 43
S06: Open-Source Stack for Cloud Computing
S06: Open-Source Stack for Cloud Computing Milind Bhandarkar Yahoo! Richard Gass Intel Michael Kozuch Intel Michael Ryan Intel 1 Agenda Sessions: (A) Introduction 8.30-9.00 (B) Hadoop 9.00-10.00 Break
More informationCloud Computing mit mathematischen Anwendungen
Cloud Computing mit mathematischen Anwendungen Vorlesung SoSe 2009 Dr. Marcel Kunze Karlsruhe Institute of Technology (KIT) Steinbuch Centre for Computing (SCC) KIT the cooperation of Forschungszentrum
More informationOpen Cirrus : A Global Testbed for Cloud Computing Research
Open Cirrus : A Global Testbed for Cloud Computing Research David O Hallaron Director, Intel Labs Pittsburgh Carnegie Mellon University Open Cirrus Testbed http://opencirrus.intel-research.net Sponsored
More informationOpen Cirrus: Towards an Open Source Cloud Stack
Open Cirrus: Towards an Open Source Cloud Stack Karlsruhe Institute of Technology (KIT) HPC2010, Cetraro, June 2010 Marcel Kunze KIT University of the State of Baden-Württemberg and National Laboratory
More informationOpen Cirrus TM Cloud Computing Testbed: Federated Data Centers for Open Source Systems and Services Research
Open Cirrus TM Cloud Computing Testbed: Federated Data Centers for Open Source Systems and Services Research Roy Campbell, 5 Indranil Gupta, 5 Michael Heath, 5 Steven Y. Ko, 5 Michael Kozuch, 3 Marcel
More informationT. S. Eugene Ng Rice University
T. S. Eugene Ng Rice University Guohui Wang, David Andersen, Michael Kaminsky, Konstantina Papagiannaki, Eugene Ng, Michael Kozuch, Michael Ryan, "c-through: Part-time Optics in Data Centers, SIGCOMM'10
More informationElastic Cloud Computing in the Open Cirrus Testbed implemented via Eucalyptus
Elastic Cloud Computing in the Open Cirrus Testbed implemented via Eucalyptus International Symposium on Grid Computing 2009 (Taipei) Christian Baun The cooperation of and Universität Karlsruhe (TH) Agenda
More informationNetwork-Aware Scheduling of MapReduce Framework on Distributed Clusters over High Speed Networks
Network-Aware Scheduling of MapReduce Framework on Distributed Clusters over High Speed Networks Praveenkumar Kondikoppa, Chui-Hui Chiu, Cheng Cui, Lin Xue and Seung-Jong Park Department of Computer Science,
More informationBig Data Streams. Analytics Challenges, Analysis, and Applications. Adel M. Alimi
Big Data Streams 1 Analytics Challenges, Analysis, and Applications Adel M. Alimi REGIM-Lab., University of Sfax, Tunisia http://adel.alimi.regim.org adel.alimi@ieee.org 2 Evolution of Technology 3 Nano,
More informationCan High-Performance Interconnects Benefit Memcached and Hadoop?
Can High-Performance Interconnects Benefit Memcached and Hadoop? D. K. Panda and Sayantan Sur Network-Based Computing Laboratory Department of Computer Science and Engineering The Ohio State University,
More informationThe Comprehensive Performance Rating for Hadoop Clusters on Cloud Computing Platform
The Comprehensive Performance Rating for Hadoop Clusters on Cloud Computing Platform Fong-Hao Liu, Ya-Ruei Liou, Hsiang-Fu Lo, Ko-Chin Chang, and Wei-Tsong Lee Abstract Virtualization platform solutions
More informationManaged Virtualized Platforms: From Multicore Nodes to Distributed Cloud Infrastructures
Managed Virtualized Platforms: From Multicore Nodes to Distributed Cloud Infrastructures Ada Gavrilovska Karsten Schwan, Mukil Kesavan Sanjay Kumar, Ripal Nathuji, Adit Ranadive Center for Experimental
More informationPerformance measurement of a private Cloud in the OpenCirrus Testbed
Performance measurement of a private Cloud in the OpenCirrus Testbed 4th Workshop on Virtualization in High-Performance Cloud Computing (VHPC '09) Euro-Par 2009 Delft August 25th 2009 Christian Baun KIT
More informationThere is growing interest in cloud computing. Open Cirrus: Testbed
COVER FE ATURE Open Cirrus: A Global Cloud Computing Testbed Arutyun I. Avetisyan, Institute for System Programming of the Russian Academy of Sciences Roy Campbell, Indranil Gupta, Michael T. Heath, and
More informationA Cloud Test Bed for China Railway Enterprise Data Center
A Cloud Test Bed for China Railway Enterprise Data Center BACKGROUND China Railway consists of eighteen regional bureaus, geographically distributed across China, with each regional bureau having their
More informationAchieving Performance Isolation with Lightweight Co-Kernels
Achieving Performance Isolation with Lightweight Co-Kernels Jiannan Ouyang, Brian Kocoloski, John Lange The Prognostic Lab @ University of Pittsburgh Kevin Pedretti Sandia National Laboratories HPDC 2015
More informationAnalysis and Optimization of Massive Data Processing on High Performance Computing Architecture
Analysis and Optimization of Massive Data Processing on High Performance Computing Architecture He Huang, Shanshan Li, Xiaodong Yi, Feng Zhang, Xiangke Liao and Pan Dong School of Computer Science National
More informationIBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud
IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud February 25, 2014 1 Agenda v Mapping clients needs to cloud technologies v Addressing your pain
More informationEnabling Technologies for Distributed and Cloud Computing
Enabling Technologies for Distributed and Cloud Computing Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF Multi-core CPUs and Multithreading
More informationHadoop & its Usage at Facebook
Hadoop & its Usage at Facebook Dhruba Borthakur Project Lead, Hadoop Distributed File System dhruba@apache.org Presented at the The Israeli Association of Grid Technologies July 15, 2009 Outline Architecture
More informationBuilding a Private Cloud with Eucalyptus
Building a Private Cloud with Eucalyptus 5th IEEE International Conference on e-science Oxford December 9th 2009 Christian Baun, Marcel Kunze KIT The cooperation of Forschungszentrum Karlsruhe GmbH und
More informationMichael Kagan. michael@mellanox.com
Virtualization in Data Center The Network Perspective Michael Kagan CTO, Mellanox Technologies michael@mellanox.com Outline Data Center Transition Servers S as a Service Network as a Service IO as a Service
More informationHadoop on the Gordon Data Intensive Cluster
Hadoop on the Gordon Data Intensive Cluster Amit Majumdar, Scientific Computing Applications Mahidhar Tatineni, HPC User Services San Diego Supercomputer Center University of California San Diego Dec 18,
More informationAgenda. HPC Software Stack. HPC Post-Processing Visualization. Case Study National Scientific Center. European HPC Benchmark Center Montpellier PSSC
HPC Architecture End to End Alexandre Chauvin Agenda HPC Software Stack Visualization National Scientific Center 2 Agenda HPC Software Stack Alexandre Chauvin Typical HPC Software Stack Externes LAN Typical
More informationHadoop & its Usage at Facebook
Hadoop & its Usage at Facebook Dhruba Borthakur Project Lead, Hadoop Distributed File System dhruba@apache.org Presented at the Storage Developer Conference, Santa Clara September 15, 2009 Outline Introduction
More informationCloud Computing through Virtualization and HPC technologies
Cloud Computing through Virtualization and HPC technologies William Lu, Ph.D. 1 Agenda Cloud Computing & HPC A Case of HPC Implementation Application Performance in VM Summary 2 Cloud Computing & HPC HPC
More informationSolving I/O Bottlenecks to Enable Superior Cloud Efficiency
WHITE PAPER Solving I/O Bottlenecks to Enable Superior Cloud Efficiency Overview...1 Mellanox I/O Virtualization Features and Benefits...2 Summary...6 Overview We already have 8 or even 16 cores on one
More informationEnabling Large-Scale Testing of IaaS Cloud Platforms on the Grid 5000 Testbed
Enabling Large-Scale Testing of IaaS Cloud Platforms on the Grid 5000 Testbed Sébastien Badia, Alexandra Carpen-Amarie, Adrien Lèbre, Lucas Nussbaum Grid 5000 S. Badia, A. Carpen-Amarie, A. Lèbre, L. Nussbaum
More informationHow to Deploy OpenStack on TH-2 Supercomputer Yusong Tan, Bao Li National Supercomputing Center in Guangzhou April 10, 2014
How to Deploy OpenStack on TH-2 Supercomputer Yusong Tan, Bao Li National Supercomputing Center in Guangzhou April 10, 2014 2014 年 云 计 算 效 率 与 能 耗 暨 第 一 届 国 际 云 计 算 咨 询 委 员 会 中 国 高 峰 论 坛 Contents Background
More informationVON/K: A Fast Virtual Overlay Network Embedded in KVM Hypervisor for High Performance Computing
Journal of Information & Computational Science 9: 5 (2012) 1273 1280 Available at http://www.joics.com VON/K: A Fast Virtual Overlay Network Embedded in KVM Hypervisor for High Performance Computing Yuan
More informationWhite Paper. Recording Server Virtualization
White Paper Recording Server Virtualization Prepared by: Mike Sherwood, Senior Solutions Engineer Milestone Systems 23 March 2011 Table of Contents Introduction... 3 Target audience and white paper purpose...
More informationEvaluating the Need for Complexity in Energy-aware Management for Cloud Platforms
Evaluating the Need for Complexity in Energy-aware Management for Cloud Platforms Pooja Ghumre, Junwei Li, Mukil Kesavan, Ada Gavrilovska, Karsten Schwan Center for Experimental Research in Computer Systems
More informationPaolo Costa costa@imperial.ac.uk
joint work with Ant Rowstron, Austin Donnelly, and Greg O Shea (MSR Cambridge) Hussam Abu-Libdeh, Simon Schubert (Interns) Paolo Costa costa@imperial.ac.uk Paolo Costa CamCube - Rethinking the Data Center
More informationPerformance Evaluation of the Illinois Cloud Computing Testbed
Performance Evaluation of the Illinois Cloud Computing Testbed Ahmed Khurshid, Abdullah Al-Nayeem, and Indranil Gupta Department of Computer Science University of Illinois at Urbana-Champaign Abstract.
More informationOutline. High Performance Computing (HPC) Big Data meets HPC. Case Studies: Some facts about Big Data Technologies HPC and Big Data converging
Outline High Performance Computing (HPC) Towards exascale computing: a brief history Challenges in the exascale era Big Data meets HPC Some facts about Big Data Technologies HPC and Big Data converging
More informationGraySort on Apache Spark by Databricks
GraySort on Apache Spark by Databricks Reynold Xin, Parviz Deyhim, Ali Ghodsi, Xiangrui Meng, Matei Zaharia Databricks Inc. Apache Spark Sorting in Spark Overview Sorting Within a Partition Range Partitioner
More informationData Centers and Cloud Computing. Data Centers
Data Centers and Cloud Computing Intro. to Data centers Virtualization Basics Intro. to Cloud Computing 1 Data Centers Large server and storage farms 1000s of servers Many TBs or PBs of data Used by Enterprises
More informationData Semantics Aware Cloud for High Performance Analytics
Data Semantics Aware Cloud for High Performance Analytics Microsoft Future Cloud Workshop 2011 June 2nd 2011, Prof. Jun Wang, Computer Architecture and Storage System Laboratory (CASS) Acknowledgement
More informationSistemi Operativi e Reti. Cloud Computing
1 Sistemi Operativi e Reti Cloud Computing Facoltà di Scienze Matematiche Fisiche e Naturali Corso di Laurea Magistrale in Informatica Osvaldo Gervasi ogervasi@computer.org 2 Introduction Technologies
More informationHadoop Distributed File System. Dhruba Borthakur Apache Hadoop Project Management Committee dhruba@apache.org dhruba@facebook.com
Hadoop Distributed File System Dhruba Borthakur Apache Hadoop Project Management Committee dhruba@apache.org dhruba@facebook.com Hadoop, Why? Need to process huge datasets on large clusters of computers
More informationData Centers and Cloud Computing
Data Centers and Cloud Computing CS377 Guest Lecture Tian Guo 1 Data Centers and Cloud Computing Intro. to Data centers Virtualization Basics Intro. to Cloud Computing Case Study: Amazon EC2 2 Data Centers
More informationAmazon EC2 Product Details Page 1 of 5
Amazon EC2 Product Details Page 1 of 5 Amazon EC2 Functionality Amazon EC2 presents a true virtual computing environment, allowing you to use web service interfaces to launch instances with a variety of
More informationPlug-and-play Virtual Appliance Clusters Running Hadoop. Dr. Renato Figueiredo ACIS Lab - University of Florida
Plug-and-play Virtual Appliance Clusters Running Hadoop Dr. Renato Figueiredo ACIS Lab - University of Florida Advanced Computing and Information Systems laboratory Introduction You have so far learned
More informationData Centers and Cloud Computing. Data Centers. MGHPCC Data Center. Inside a Data Center
Data Centers and Cloud Computing Intro. to Data centers Virtualization Basics Intro. to Cloud Computing Data Centers Large server and storage farms 1000s of servers Many TBs or PBs of data Used by Enterprises
More informationReference Design: Scalable Object Storage with Seagate Kinetic, Supermicro, and SwiftStack
Reference Design: Scalable Object Storage with Seagate Kinetic, Supermicro, and SwiftStack May 2015 Copyright 2015 SwiftStack, Inc. swiftstack.com Page 1 of 19 Table of Contents INTRODUCTION... 3 OpenStack
More informationHeterogeneous Workload Consolidation for Efficient Management of Data Centers in Cloud Computing
Heterogeneous Workload Consolidation for Efficient Management of Data Centers in Cloud Computing Deep Mann ME (Software Engineering) Computer Science and Engineering Department Thapar University Patiala-147004
More informationBrocade and EMC Solution for Microsoft Hyper-V and SharePoint Clusters
Brocade and EMC Solution for Microsoft Hyper-V and SharePoint Clusters Highlights a Brocade-EMC solution with EMC CLARiiON, EMC Atmos, Brocade Fibre Channel (FC) switches, Brocade FC HBAs, and Brocade
More informationOpen Cloud System. (Integration of Eucalyptus, Hadoop and AppScale into deployment of University Private Cloud)
Open Cloud System (Integration of Eucalyptus, Hadoop and into deployment of University Private Cloud) Thinn Thu Naing University of Computer Studies, Yangon 25 th October 2011 Open Cloud System University
More informationEnabling Technologies for Distributed Computing
Enabling Technologies for Distributed Computing Dr. Sanjay P. Ahuja, Ph.D. Fidelity National Financial Distinguished Professor of CIS School of Computing, UNF Multi-core CPUs and Multithreading Technologies
More informationAMD SEAMICRO OPENSTACK BLUEPRINTS CLOUD- IN- A- BOX OCTOBER 2013
AMD SEAMICRO OPENSTACK BLUEPRINTS CLOUD- IN- A- BOX OCTOBER 2013 OpenStack What is OpenStack? OpenStack is a cloud operaeng system that controls large pools of compute, storage, and networking resources
More informationIaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures
IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF Introduction
More informationSMB Direct for SQL Server and Private Cloud
SMB Direct for SQL Server and Private Cloud Increased Performance, Higher Scalability and Extreme Resiliency June, 2014 Mellanox Overview Ticker: MLNX Leading provider of high-throughput, low-latency server
More informationEfficient Data Replication Scheme based on Hadoop Distributed File System
, pp. 177-186 http://dx.doi.org/10.14257/ijseia.2015.9.12.16 Efficient Data Replication Scheme based on Hadoop Distributed File System Jungha Lee 1, Jaehwa Chung 2 and Daewon Lee 3* 1 Division of Supercomputing,
More informationA Hybrid Electrical and Optical Networking Topology of Data Center for Big Data Network
ASEE 2014 Zone I Conference, April 3-5, 2014, University of Bridgeport, Bridgpeort, CT, USA A Hybrid Electrical and Optical Networking Topology of Data Center for Big Data Network Mohammad Naimur Rahman
More informationCisco for SAP HANA Scale-Out Solution on Cisco UCS with NetApp Storage
Cisco for SAP HANA Scale-Out Solution Solution Brief December 2014 With Intelligent Intel Xeon Processors Highlights Scale SAP HANA on Demand Scale-out capabilities, combined with high-performance NetApp
More informationSTeP-IN SUMMIT 2013. June 18 21, 2013 at Bangalore, INDIA. Performance Testing of an IAAS Cloud Software (A CloudStack Use Case)
10 th International Conference on Software Testing June 18 21, 2013 at Bangalore, INDIA by Sowmya Krishnan, Senior Software QA Engineer, Citrix Copyright: STeP-IN Forum and Quality Solutions for Information
More informationHyperscale Use Cases for Scaling Out with Flash. David Olszewski
Hyperscale Use Cases for Scaling Out with Flash David Olszewski Business challenges Performanc e Requireme nts Storage Budget Balance the IT requirements How can you get the best of both worlds? SLA Optimized
More informationRAMCloud and the Low- Latency Datacenter. John Ousterhout Stanford University
RAMCloud and the Low- Latency Datacenter John Ousterhout Stanford University Most important driver for innovation in computer systems: Rise of the datacenter Phase 1: large scale Phase 2: low latency Introduction
More informationThe Software Defined Hybrid Packet Optical Datacenter Network SDN AT LIGHT SPEED TM. 2012-13 CALIENT Technologies www.calient.
The Software Defined Hybrid Packet Optical Datacenter Network SDN AT LIGHT SPEED TM 2012-13 CALIENT Technologies www.calient.net 1 INTRODUCTION In datacenter networks, video, mobile data, and big data
More informationGeoGrid Project and Experiences with Hadoop
GeoGrid Project and Experiences with Hadoop Gong Zhang and Ling Liu Distributed Data Intensive Systems Lab (DiSL) Center for Experimental Computer Systems Research (CERCS) Georgia Institute of Technology
More informationThe Future of Computing Cisco Unified Computing System. Markus Kunstmann Channels Systems Engineer
The Future of Computing Cisco Unified Computing System Markus Kunstmann Channels Systems Engineer 2009 Cisco Systems, Inc. All rights reserved. Data Centers Are under Increasing Pressure Collaboration
More informationCost-effective Resource Provisioning for MapReduce in a Cloud
1 -effective Resource Provisioning for MapReduce in a Cloud Balaji Palanisamy, Member, IEEE, Aameek Singh, Member, IEEE Ling Liu, Senior Member, IEEE Abstract This paper presents a new MapReduce cloud
More informationApplication Development. A Paradigm Shift
Application Development for the Cloud: A Paradigm Shift Ramesh Rangachar Intelsat t 2012 by Intelsat. t Published by The Aerospace Corporation with permission. New 2007 Template - 1 Motivation for the
More informationData Centers and Cloud Computing. Data Centers
Data Centers and Cloud Computing Slides courtesy of Tim Wood 1 Data Centers Large server and storage farms 1000s of servers Many TBs or PBs of data Used by Enterprises for server applications Internet
More informationHow swift is your Swift? Ning Zhang, OpenStack Engineer at Zmanda Chander Kant, CEO at Zmanda
How swift is your Swift? Ning Zhang, OpenStack Engineer at Zmanda Chander Kant, CEO at Zmanda 1 Outline Build a cost-efficient Swift cluster with expected performance Background & Problem Solution Experiments
More informationPerformance Analysis of Mixed Distributed Filesystem Workloads
Performance Analysis of Mixed Distributed Filesystem Workloads Esteban Molina-Estolano, Maya Gokhale, Carlos Maltzahn, John May, John Bent, Scott Brandt Motivation Hadoop-tailored filesystems (e.g. CloudStore)
More informationVirtualization Overview
VMWARE W HWHITE I T E PPAPER A P E R Virtualization Overview 1 Table of Contents Introduction... 3 Virtualization in a Nutshell... 3 Virtualization Approaches... 4 Virtualization for Server Consolidation
More informationA Framework for Performance Analysis and Tuning in Hadoop Based Clusters
A Framework for Performance Analysis and Tuning in Hadoop Based Clusters Garvit Bansal Anshul Gupta Utkarsh Pyne LNMIIT, Jaipur, India Email: [garvit.bansal anshul.gupta utkarsh.pyne] @lnmiit.ac.in Manish
More informationHow To Compare Amazon Ec2 To A Supercomputer For Scientific Applications
Amazon Cloud Performance Compared David Adams Amazon EC2 performance comparison How does EC2 compare to traditional supercomputer for scientific applications? "Performance Analysis of High Performance
More informationPutchong Uthayopas, Kasetsart University
Putchong Uthayopas, Kasetsart University Introduction Cloud Computing Explained Cloud Application and Services Moving to the Cloud Trends and Technology Legend: Cluster computing, Grid computing, Cloud
More informationConverged storage architecture for Oracle RAC based on NVMe SSDs and standard x86 servers
Converged storage architecture for Oracle RAC based on NVMe SSDs and standard x86 servers White Paper rev. 2015-11-27 2015 FlashGrid Inc. 1 www.flashgrid.io Abstract Oracle Real Application Clusters (RAC)
More informationHow To Run Apa Hadoop 1.0 On Vsphere Tmt On A Hyperconverged Network On A Virtualized Cluster On A Vspplace Tmter (Vmware) Vspheon Tm (
Apache Hadoop 1.0 High Availability Solution on VMware vsphere TM Reference Architecture TECHNICAL WHITE PAPER v 1.0 June 2012 Table of Contents Executive Summary... 3 Introduction... 3 Terminology...
More informationResearch on Job Scheduling Algorithm in Hadoop
Journal of Computational Information Systems 7: 6 () 5769-5775 Available at http://www.jofcis.com Research on Job Scheduling Algorithm in Hadoop Yang XIA, Lei WANG, Qiang ZHAO, Gongxuan ZHANG School of
More informationQoS & Traffic Management
QoS & Traffic Management Advanced Features for Managing Application Performance and Achieving End-to-End Quality of Service in Data Center and Cloud Computing Environments using Chelsio T4 Adapters Chelsio
More informationPerformance characterization report for Microsoft Hyper-V R2 on HP StorageWorks P4500 SAN storage
Performance characterization report for Microsoft Hyper-V R2 on HP StorageWorks P4500 SAN storage Technical white paper Table of contents Executive summary... 2 Introduction... 2 Test methodology... 3
More informationBlobSeer: Towards efficient data storage management on large-scale, distributed systems
: Towards efficient data storage management on large-scale, distributed systems Bogdan Nicolae University of Rennes 1, France KerData Team, INRIA Rennes Bretagne-Atlantique PhD Advisors: Gabriel Antoniu
More informationDEPLOYING AND MONITORING HADOOP MAP-REDUCE ANALYTICS ON SINGLE-CHIP CLOUD COMPUTER
DEPLOYING AND MONITORING HADOOP MAP-REDUCE ANALYTICS ON SINGLE-CHIP CLOUD COMPUTER ANDREAS-LAZAROS GEORGIADIS, SOTIRIOS XYDIS, DIMITRIOS SOUDRIS MICROPROCESSOR AND MICROSYSTEMS LABORATORY ELECTRICAL AND
More informationScaling in a Hypervisor Environment
Scaling in a Hypervisor Environment Richard McDougall Chief Performance Architect VMware VMware ESX Hypervisor Architecture Guest Monitor Guest TCP/IP Monitor (BT, HW, PV) File System CPU is controlled
More informationMicrosoft Exchange Server 2007 and Hyper-V high availability configuration on HP ProLiant BL680c G5 server blades
Microsoft Exchange Server 2007 and Hyper-V high availability configuration on HP ProLiant BL680c G5 server blades Executive summary... 2 Introduction... 2 Exchange 2007 Hyper-V high availability configuration...
More informationPentaho High-Performance Big Data Reference Configurations using Cisco Unified Computing System
Pentaho High-Performance Big Data Reference Configurations using Cisco Unified Computing System By Jake Cornelius Senior Vice President of Products Pentaho June 1, 2012 Pentaho Delivers High-Performance
More informationCSE-E5430 Scalable Cloud Computing Lecture 2
CSE-E5430 Scalable Cloud Computing Lecture 2 Keijo Heljanko Department of Computer Science School of Science Aalto University keijo.heljanko@aalto.fi 14.9-2015 1/36 Google MapReduce A scalable batch processing
More informationThe Performance Characteristics of MapReduce Applications on Scalable Clusters
The Performance Characteristics of MapReduce Applications on Scalable Clusters Kenneth Wottrich Denison University Granville, OH 43023 wottri_k1@denison.edu ABSTRACT Many cluster owners and operators have
More informationDell Reference Configuration for Hortonworks Data Platform
Dell Reference Configuration for Hortonworks Data Platform A Quick Reference Configuration Guide Armando Acosta Hadoop Product Manager Dell Revolutionary Cloud and Big Data Group Kris Applegate Solution
More informationWhen Does Colocation Become Competitive With The Public Cloud? WHITE PAPER SEPTEMBER 2014
When Does Colocation Become Competitive With The Public Cloud? WHITE PAPER SEPTEMBER 2014 Table of Contents Executive Summary... 2 Case Study: Amazon Ec2 Vs In-House Private Cloud... 3 Aim... 3 Participants...
More informationGuidelines for Selecting Hadoop Schedulers based on System Heterogeneity
Noname manuscript No. (will be inserted by the editor) Guidelines for Selecting Hadoop Schedulers based on System Heterogeneity Aysan Rasooli Douglas G. Down Received: date / Accepted: date Abstract Hadoop
More informationInternational Journal of Computer & Organization Trends Volume20 Number1 May 2015
Performance Analysis of Various Guest Operating Systems on Ubuntu 14.04 Prof. (Dr.) Viabhakar Pathak 1, Pramod Kumar Ram 2 1 Computer Science and Engineering, Arya College of Engineering, Jaipur, India.
More informationEvaluating Task Scheduling in Hadoop-based Cloud Systems
2013 IEEE International Conference on Big Data Evaluating Task Scheduling in Hadoop-based Cloud Systems Shengyuan Liu, Jungang Xu College of Computer and Control Engineering University of Chinese Academy
More informationStudy of virtual data centers for cost savings and management
1 Study of virtual data centers for cost savings and management María Virtudes López López School of Industrial Engineering and Information Technology Master s Degree in Cybernetics Research León, Spain
More informationPrivate cloud computing advances
Building robust private cloud services infrastructures By Brian Gautreau and Gong Wang Private clouds optimize utilization and management of IT resources to heighten availability. Microsoft Private Cloud
More informationHadoop: Embracing future hardware
Hadoop: Embracing future hardware Suresh Srinivas @suresh_m_s Page 1 About Me Architect & Founder at Hortonworks Long time Apache Hadoop committer and PMC member Designed and developed many key Hadoop
More informationWhen Does Colocation Become Competitive With The Public Cloud?
When Does Colocation Become Competitive With The Public Cloud? PLEXXI WHITE PAPER Affinity Networking for Data Centers and Clouds Table of Contents EXECUTIVE SUMMARY... 2 CASE STUDY: AMAZON EC2 vs IN-HOUSE
More informationA Micro-benchmark Suite for Evaluating Hadoop RPC on High-Performance Networks
A Micro-benchmark Suite for Evaluating Hadoop RPC on High-Performance Networks Xiaoyi Lu, Md. Wasi- ur- Rahman, Nusrat Islam, and Dhabaleswar K. (DK) Panda Network- Based Compu2ng Laboratory Department
More informationFPGA Accelerator Virtualization in an OpenPOWER cloud. Fei Chen, Yonghua Lin IBM China Research Lab
FPGA Accelerator Virtualization in an OpenPOWER cloud Fei Chen, Yonghua Lin IBM China Research Lab Trend of Acceleration Technology Acceleration in Cloud is Taking Off Used FPGA to accelerate Bing search
More informationCloud and DataCenter Systems: `Fast Data -> Online Management
Cloud and DataCenter Systems: `Fast Data -> Online Management Karsten Schwan, Greg Eisenhauer, Ada Gavrilovska, Hrishikesh Amur, Liting Hu, Chengwei Wang, Junwei Li, Motivation Motivation Microsales, Product
More informationAn Experimental Study of Load Balancing of OpenNebula Open-Source Cloud Computing Platform
An Experimental Study of Load Balancing of OpenNebula Open-Source Cloud Computing Platform A B M Moniruzzaman 1, Kawser Wazed Nafi 2, Prof. Syed Akhter Hossain 1 and Prof. M. M. A. Hashem 1 Department
More informationAn Introduction to Private Cloud
An Introduction to Private Cloud As the word cloud computing becomes more ubiquitous these days, several questions can be raised ranging from basic question like the definitions of a cloud and cloud computing
More informationHadoop Cluster Applications
Hadoop Overview Data analytics has become a key element of the business decision process over the last decade. Classic reporting on a dataset stored in a database was sufficient until recently, but yesterday
More informationMit Soft- & Hardware zum Erfolg. Giuseppe Paletta
Mit Soft- & Hardware zum Erfolg IT-Transformation VCE Converged and Hyperconverged Infrastructure VCE VxRack EMC VSPEX Blue IT-Transformation IT has changed dramatically in last past years The requirements
More informationIntroduction to Cloud Computing
Introduction to Cloud Computing Cloud Computing I (intro) 15 319, spring 2010 2 nd Lecture, Jan 14 th Majd F. Sakr Lecture Motivation General overview on cloud computing What is cloud computing Services
More informationCisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database
Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database Built up on Cisco s big data common platform architecture (CPA), a
More information