NUMA-like architecture for Microservers
|
|
- Osborne Bryce George
- 7 years ago
- Views:
Transcription
1 Foundation for Research and Technology Hellas (FORTH) Institute of Computer Science (ICS) NUMA-like architecture for Microservers Iakovos Mavroidis FORTH-ICS, Greece MPSoC 14, July 8, Margaux, France
2 Outline Cha a te isti s a d e ui e e ts of today s Data e te s Importance of Energy Efficiency Energy Proportionality Small form-factor Microservers Energy Efficient Architecture Intel or ARM? EUROSERVER approach EUROSERVER Architecture Unimem Architecture Testing Environment FMC Fan-Out Daughtercard 2
3 Why is Size/Power/Energy Efficiency Important? Utility costs Management Cost Improve Size Power and Cooling costs Improve PUE (Power Usage Effectiveness) Electricity growth 56% increase (US increase 36%) 19% increase % increase in %-1.5% of global electricity 10 (US %) Google report PUE=1.16 in 10 PUE=1.14 in 11 Environmental friendliness programs Energy Star (US), TopRunner (Japan), FOE (Switzerland) 3
4 Energy Proportionality in Datacenters Most of the time at 10 50% Challenge: Power not proportional to utilization Server underutilized Two approaches: Turn off hardware when not used Dynamic Voltage Scaling (DVS) Clock Gating Keep CPU utilization high Multiple Virtual Machines Overprovisioning QoS guarantees? 4 4
5 Aligning energy use with workloads 5
6 Why many small cores? Scale out applications require large number of cores no brute processors Smaller cores more power-efficient for several workloads static web page serving, entry dedicated hosting, and basic content delivery, among others Less power consumption (sub-10w levels) Lower running costs, lower PUE Energy proportionality Easy to turn off idle cores (parts of the system) Easier maintenance and management Small form factor allows tightly packed clusters and less physical space Easier more efficient implementation? CPU partitioning instead of sharing (no sharing overhead) But No compute power for single-threaded application Hard to parallelize an application Not so efficient for HPC domain? 6
7 Energy-efficient architecture: Microservers Low-power components CPU (ARM, Intel Atom) Memory () Storage (NVM) Small form factor Small CPUs Fast interconnections (high-speed serial links) High integration Microservers are still in their infancy 7
8 Intel or ARM in Microservers? Diversity of ARM ecosystem Custom microservers using ARM-based SoCs Hundreds of customers More than 50 variations of Intel Atom and Xeon Xeon E3 suitable for webscale applications, online gaming, cloud Atom C00 suitable for lightweight scale-out workloads Hard to compete hundreds of chip-makers (Samsung Exynos, AMD Opteron A1100 with 8 A-57, APM s X-Gene, Google, Facebook, However Intel first released 64-bit SoC with ECC (Atom Avoton) Intel 3-D technology smaller die area less energy consumption Most datacenter software run on x86 (porting on ARM in progress) Calxeda: ARM-based servers didn't have the software support or hardware needed to win enterprise customers 8
9 EUROSERVER Challenges and Approach Energy-efficient architecture Use of highly-integrated, high-performance, energy-efficient components in a Microserver arcitecture Many low-power 3D - Interposer Technology main memory NVM memory for storage Suitable from cloud data-centers to embedded applications Unimem Architecture (Focus of this presentation) Take advantage of fast communication Scalable architecture Many coherent islands Global Address Space Facilitate maintenance and management Small form factor Energy proportionality 9
10 EUROSERVER Architecture Chiplet: Cores+L0 Coherent Interconnect 1 coherence island μserver: Nodes+L2 Interconnect Scale-out or HPC Node: Chiplets+L1 Interconnect Shared IO and Storage EuroServer System ARM. 64b Node-SSD Local-IO memory memory ARM 64b ARM 64b ARM 64b Local-IO memory ARM 64b ARM 64b Node-SSD memory Node-SSD memory ARM 64b memory memory memory Compute Node. Compute Node 2 Compute Node 1 Compute Node 0 ARM 64b Local-IO Node-SSD Local-IO Interlink System: Nodes+L3 Interconnect other μservers Ethernet John Goodacre, ARM Clustered Architecture: Coherence Islands communicating through multi-level Interconnect Sha ed IO s Each Coherence Island has its own local independent global (coherent) address space (GAS L) 10
11 Unimem Architecture μserver0 Compute Node 0 Compute Node 1 Compute Node 2 Compute Node 3 Interlink other MicroServers Ethernet μserver1 Compute Node 0 Compute Node 1 Compute Node 2 Compute Node 3 Interlink Ethernet other MicroServers John Goodacre, ARM Every memory page has a single owner (coherence island) A p o esso a a ess a y page i the syste th ough the page ow e s ohe e t i te o e t Every page can be cacheable either locally (single borrower) or remotely (owner) but not both 11
12 EUROSERVER environment Coherence Island0 Coherence Island1 Lite Local Cache Coherent Interconnect AXI DMC Lite AXI Chip2 Chip Lite Local Cache Coherent Interconnect Lite AXI Chip2 Chip AXI DMC 8-core Chiplet 8-core Chiplet Multi-level Global Interconnect Two coherence islands might belong in the same Compute Node (intralink communication) or not (intralink + interlink communication) 12
13 Remote Page Borrowing Coherence Island0 Coherence Island1 Lite Miss/Replace Lite Local Cache Coherent Interconnect AXI DMC Lite AXI Chip2 Chip Local Cache Coherent Interconnect Lite AXI Chip2 Chip AXI DMC 8-core Chiplet 8-core Chiplet Multi-level Global Interconnect Lo ally a hea le i itiato s a he 13
14 Shared Memory Coherence Island0 Coherence Island1 Miss/Replace Lite Local Cache Coherent Interconnect AXI DMC Lite AXI Lite Local Cache Coherent Interconnect Lite AXI Chip2 Chip Chip2 Chip AXI DMC 8-core Chiplet 8-core Chiplet Multi-level Global Interconnect Re otely a hea le ow e s a he 14
15 R Coherence Island0 Miss/Replace Coherence Island1 Read DMC Lite Lite AXI Chip2 Chip Lite Write Local Cache Coherent Interconnect AXI Local Cache Coherent Interconnect Lite AXI Chip2 Chip AXI DMC 8-core Chiplet 8-core Chiplet Multi-level Global Interconnect reads from (or writes to) on Coherence Island0 and writes to (or reads from) on Coherence Island1 Accesses can also be uncacheable locally or cacheable remotely (dashed lines) 15
16 NUMA-aware linux Coherence Island0 Coherence Island1 Lite Miss/Replace Lite Local Cache Coherent Interconnect AXI DMC Lite AXI Chip2 Chip Local Cache Coherent Interconnect Lite AXI Chip2 Chip AXI DMC 8-core Chiplet 8-core Chiplet Multi-level Global Interconnect Borrow unused remote memory instead of page faulting Fast shared memory and MPI communication NUMA-aware memory allocator and garbage collector 16
17 Initial Testing Environment using A9-based boards ZedBoard 0 ZedBoard 1 FPGA Cache FPGA A9 core M (GP) INT MBOX INT INTC Cache A9 core M (GP) S (ACP) INT MBOX INT INTC S (ACP) AXI Interconnect 100MHz) S AXI Interconnect 100MHz) S Chip2Chip M Master/Slave Chip2Chip M Master/Slave FMC FMC FMC to FMC cable (6.4Gb/s) Can we interconnect more A9 processors? (see next slides) 17
18 FMC Fan-Out Daughtercard v.1 2MicroZed boards (40LVDS per board) MZ0 4 MicroZed boards ( LVDS per board) mechanical support MZ0 MZ2 FMC HPC MZ1 MZ1 mechanical support 80 pairs mechanical support MZ3 FMC HPC mechanical support 80 pairs Top and bottom s are mainly used for mechanical support. Two connectivity modes: support for 1 to 4 MicroZed boards PCB design and fabrication done Testing done Version 2 in progress 18
19 Pictures of FMC Fan-Out Daugthercard v.1 8 s 2 MicroZeds FMC 4 MicroZeds 19
20 FMC Fan-out v.2 with 10GE and PCIe PCIe 4GTX 4GTX SFP+ SFP+ SFP+ FMC HPC SFP+ 80 pairs, 8GTX Support for: Four 10Gb SFP+ 2.5 PCIe socket
21 Initial Prototype using FMC Fan-Out v.2 4x10Gb not connected (no room for daughtercard) MicroZed 6 FMC Fan-Out v.2 4 GTX MicroZed 5 MicroZed 7 80 pairs, 8 GTX FMC HPC FMC HPC FMC HPC 80 pairs, 8 GTX MicroZed 3 v.2 MicroZed 2 MicroZed 0 FMC Fan-Out MicroZed 1 4x10Gb MicroZed 4 SSD Virtex 7 Specialist Node Hitech Global (HTG-V7-PCIE-585) 8 MicroZed boards 8 10Gb SFP+ ports 1-2 PCIe x4 SSD 21
22 Picture of testing environment using FMC Fan-Out v.1 8 MicroZeds (A9+1GB) Shared 10 GigE Hitech Global board (central router) 22
23 Thank you! Questions? Iakovos Mavroidis FORTH-ICS 23
Scaling Mobile Compute to the Data Center. John Goodacre
Scaling Mobile Compute to the Data Center John Goodacre Director Technology and Systems, ARM Ltd. Cambridge Professor Computer Architectures, APT. Manchester EuroServer Project EUROSERVER is a European
More informationSeeking Opportunities for Hardware Acceleration in Big Data Analytics
Seeking Opportunities for Hardware Acceleration in Big Data Analytics Paul Chow High-Performance Reconfigurable Computing Group Department of Electrical and Computer Engineering University of Toronto Who
More informationA Study on the Scalability of Hybrid LS-DYNA on Multicore Architectures
11 th International LS-DYNA Users Conference Computing Technology A Study on the Scalability of Hybrid LS-DYNA on Multicore Architectures Yih-Yih Lin Hewlett-Packard Company Abstract In this paper, the
More informationPower Efficiency Comparison: Cisco UCS 5108 Blade Server Chassis and IBM FlexSystem Enterprise Chassis
White Paper Power Efficiency Comparison: Cisco UCS 5108 Blade Server Chassis and IBM FlexSystem Enterprise Chassis White Paper March 2014 2014 Cisco and/or its affiliates. All rights reserved. This document
More informationLS DYNA Performance Benchmarks and Profiling. January 2009
LS DYNA Performance Benchmarks and Profiling January 2009 Note The following research was performed under the HPC Advisory Council activities AMD, Dell, Mellanox HPC Advisory Council Cluster Center The
More informationCloud Computing through Virtualization and HPC technologies
Cloud Computing through Virtualization and HPC technologies William Lu, Ph.D. 1 Agenda Cloud Computing & HPC A Case of HPC Implementation Application Performance in VM Summary 2 Cloud Computing & HPC HPC
More informationAccelerate Cloud Computing with the Xilinx Zynq SoC
X C E L L E N C E I N N E W A P P L I C AT I O N S Accelerate Cloud Computing with the Xilinx Zynq SoC A novel reconfigurable hardware accelerator speeds the processing of applications based on the MapReduce
More informationHigh Performance Computing in CST STUDIO SUITE
High Performance Computing in CST STUDIO SUITE Felix Wolfheimer GPU Computing Performance Speedup 18 16 14 12 10 8 6 4 2 0 Promo offer for EUC participants: 25% discount for K40 cards Speedup of Solver
More informationHigh Performance. CAEA elearning Series. Jonathan G. Dudley, Ph.D. 06/09/2015. 2015 CAE Associates
High Performance Computing (HPC) CAEA elearning Series Jonathan G. Dudley, Ph.D. 06/09/2015 2015 CAE Associates Agenda Introduction HPC Background Why HPC SMP vs. DMP Licensing HPC Terminology Types of
More informationMicrosoft Private Cloud Fast Track Reference Architecture
Microsoft Private Cloud Fast Track Reference Architecture Microsoft Private Cloud Fast Track is a reference architecture designed to help build private clouds by combining Microsoft software with NEC s
More informationPedraforca: ARM + GPU prototype
www.bsc.es Pedraforca: ARM + GPU prototype Filippo Mantovani Workshop on exascale and PRACE prototypes Barcelona, 20 May 2014 Overview Goals: Test the performance, scalability, and energy efficiency of
More informationGPU System Architecture. Alan Gray EPCC The University of Edinburgh
GPU System Architecture EPCC The University of Edinburgh Outline Why do we want/need accelerators such as GPUs? GPU-CPU comparison Architectural reasons for GPU performance advantages GPU accelerated systems
More informationSolving I/O Bottlenecks to Enable Superior Cloud Efficiency
WHITE PAPER Solving I/O Bottlenecks to Enable Superior Cloud Efficiency Overview...1 Mellanox I/O Virtualization Features and Benefits...2 Summary...6 Overview We already have 8 or even 16 cores on one
More informationCopyright 2013, Oracle and/or its affiliates. All rights reserved.
1 Oracle SPARC Server for Enterprise Computing Dr. Heiner Bauch Senior Account Architect 19. April 2013 2 The following is intended to outline our general product direction. It is intended for information
More informationECLIPSE Performance Benchmarks and Profiling. January 2009
ECLIPSE Performance Benchmarks and Profiling January 2009 Note The following research was performed under the HPC Advisory Council activities AMD, Dell, Mellanox, Schlumberger HPC Advisory Council Cluster
More informationInfrastructure Matters: POWER8 vs. Xeon x86
Advisory Infrastructure Matters: POWER8 vs. Xeon x86 Executive Summary This report compares IBM s new POWER8-based scale-out Power System to Intel E5 v2 x86- based scale-out systems. A follow-on report
More informationHP Project Moonshot and the Redstone Development Server Platform
Technical white paper HP Project Moonshot and the Redstone Development Server Platform Introduction 2 Challenges facing today s hyperscale data center 2 Elements of Project Moonshot 3 Server platforms
More informationMicrosoft Private Cloud Fast Track
Microsoft Private Cloud Fast Track Microsoft Private Cloud Fast Track is a reference architecture designed to help build private clouds by combining Microsoft software with Nutanix technology to decrease
More informationPower Efficiency Comparison: Cisco UCS 5108 Blade Server Chassis and Dell PowerEdge M1000e Blade Enclosure
White Paper Power Efficiency Comparison: Cisco UCS 5108 Blade Server Chassis and Dell PowerEdge M1000e Blade Enclosure White Paper March 2014 2014 Cisco and/or its affiliates. All rights reserved. This
More informationParallel Programming Survey
Christian Terboven 02.09.2014 / Aachen, Germany Stand: 26.08.2014 Version 2.3 IT Center der RWTH Aachen University Agenda Overview: Processor Microarchitecture Shared-Memory
More informationRevolutionising the data centre with application-specific servers based on ARM processors
Revolutionising the data centre with application-specific servers based on ARM processors By Boston Limited Table of Contents ABOUT BOSTON LIMITED... 3 INTRODUCTION... 3 POWER-HUNGRY PROCESSORS... 4 CISC
More informationUnified Computing Systems
Unified Computing Systems Cisco Unified Computing Systems simplify your data center architecture; reduce the number of devices to purchase, deploy, and maintain; and improve speed and agility. Cisco Unified
More informationHow System Settings Impact PCIe SSD Performance
How System Settings Impact PCIe SSD Performance Suzanne Ferreira R&D Engineer Micron Technology, Inc. July, 2012 As solid state drives (SSDs) continue to gain ground in the enterprise server and storage
More informationCloud Storage. Parallels. Performance Benchmark Results. White Paper. www.parallels.com
Parallels Cloud Storage White Paper Performance Benchmark Results www.parallels.com Table of Contents Executive Summary... 3 Architecture Overview... 3 Key Features... 4 No Special Hardware Requirements...
More informationGetting Started with the Xilinx Zynq- 7000 All Programmable SoC Mini-ITX Development Kit
Getting Started with the Xilinx Zynq- 7000 All Programmable SoC Mini-ITX Development Kit Table of Contents ABOUT THIS GUIDE... 3 ADDITIONAL DOCUMENTATION... 3 ADDITIONAL SUPPORT RESOURCES... 3 INTRODUCTION...
More informationOptimizing Web Infrastructure on Intel Architecture
White Paper Intel Processors for Web Architectures Optimizing Web Infrastructure on Intel Architecture Executive Summary and Purpose of this Paper Today s data center infrastructures must adapt to mobile
More informationIBM System x family brochure
IBM Systems and Technology System x IBM System x family brochure IBM System x rack and tower servers 2 IBM System x family brochure IBM System x servers Highlights IBM System x and BladeCenter servers
More informationFPO. Expanding Intel Architecture Flexibility in the Data Center. Markus Leberecht Data Center Solutions Architect, Intel EMEA March 20, 2013
FPO Expanding Intel Architecture Flexibility in the Data Center Markus Leberecht Data Center Solutions Architect, Intel EMEA March 20, 2013 Agenda Micro Servers Properties and Benefits Workload Suitability
More informationECLIPSE Best Practices Performance, Productivity, Efficiency. March 2009
ECLIPSE Best Practices Performance, Productivity, Efficiency March 29 ECLIPSE Performance, Productivity, Efficiency The following research was performed under the HPC Advisory Council activities HPC Advisory
More informationInternational Journal of Computer & Organization Trends Volume20 Number1 May 2015
Performance Analysis of Various Guest Operating Systems on Ubuntu 14.04 Prof. (Dr.) Viabhakar Pathak 1, Pramod Kumar Ram 2 1 Computer Science and Engineering, Arya College of Engineering, Jaipur, India.
More informationpræsentation oktober 2011
Johnny Olesen System X presale præsentation oktober 2011 2010 IBM Corporation 2 Hvem er jeg Dagens agenda Server overview System Director 3 4 Portfolio-wide Innovation with IBM System x and BladeCenter
More informationNetworking Virtualization Using FPGAs
Networking Virtualization Using FPGAs Russell Tessier, Deepak Unnikrishnan, Dong Yin, and Lixin Gao Reconfigurable Computing Group Department of Electrical and Computer Engineering University of Massachusetts,
More informationVPX Implementation Serves Shipboard Search and Track Needs
VPX Implementation Serves Shipboard Search and Track Needs By: Thierry Wastiaux, Senior Vice President Interface Concept Defending against anti-ship missiles is a problem for which high-performance computing
More informationHow To Write An Article On An Hp Appsystem For Spera Hana
Technical white paper HP AppSystem for SAP HANA Distributed architecture with 3PAR StoreServ 7400 storage Table of contents Executive summary... 2 Introduction... 2 Appliance components... 3 3PAR StoreServ
More informationGUEST OPERATING SYSTEM BASED PERFORMANCE COMPARISON OF VMWARE AND XEN HYPERVISOR
GUEST OPERATING SYSTEM BASED PERFORMANCE COMPARISON OF VMWARE AND XEN HYPERVISOR ANKIT KUMAR, SAVITA SHIWANI 1 M. Tech Scholar, Software Engineering, Suresh Gyan Vihar University, Rajasthan, India, Email:
More informationOracle Database Scalability in VMware ESX VMware ESX 3.5
Performance Study Oracle Database Scalability in VMware ESX VMware ESX 3.5 Database applications running on individual physical servers represent a large consolidation opportunity. However enterprises
More informationAll Programmable Logic. Hans-Joachim Gelke Institute of Embedded Systems. Zürcher Fachhochschule
All Programmable Logic Hans-Joachim Gelke Institute of Embedded Systems Institute of Embedded Systems 31 Assistants 10 Professors 7 Technical Employees 2 Secretaries www.ines.zhaw.ch Research: Education:
More informationConcepts Introduced in Chapter 6. Warehouse-Scale Computers. Important Design Factors for WSCs. Programming Models for WSCs
Concepts Introduced in Chapter 6 Warehouse-Scale Computers introduction to warehouse-scale computing programming models infrastructure and costs cloud computing A cluster is a collection of desktop computers
More informationSERVER CLUSTERING TECHNOLOGY & CONCEPT
SERVER CLUSTERING TECHNOLOGY & CONCEPT M00383937, Computer Network, Middlesex University, E mail: vaibhav.mathur2007@gmail.com Abstract Server Cluster is one of the clustering technologies; it is use for
More informationLoad Balancing & DFS Primitives for Efficient Multicore Applications
Load Balancing & DFS Primitives for Efficient Multicore Applications M. Grammatikakis, A. Papagrigoriou, P. Petrakis, G. Kornaros, I. Christophorakis TEI of Crete This work is implemented through the Operational
More informationSun Microsystems Special Promotions for Education and Research January 9, 2007
Sun Microsystems Special Promotions for Education and Research Solve big problems on a small budget with Sun-Education s trusted partner for cutting-edge technology solutions. Sun solutions help your campus
More informationECDF Infrastructure Refresh - Requirements Consultation Document
Edinburgh Compute & Data Facility - December 2014 ECDF Infrastructure Refresh - Requirements Consultation Document Introduction In order to sustain the University s central research data and computing
More informationSun Constellation System: The Open Petascale Computing Architecture
CAS2K7 13 September, 2007 Sun Constellation System: The Open Petascale Computing Architecture John Fragalla Senior HPC Technical Specialist Global Systems Practice Sun Microsystems, Inc. 25 Years of Technical
More informationThe Mainframe Virtualization Advantage: How to Save Over Million Dollars Using an IBM System z as a Linux Cloud Server
Research Report The Mainframe Virtualization Advantage: How to Save Over Million Dollars Using an IBM System z as a Linux Cloud Server Executive Summary Information technology (IT) executives should be
More informationEvoluzione dell Infrastruttura di Calcolo e Data Analytics per la ricerca
Evoluzione dell Infrastruttura di Calcolo e Data Analytics per la ricerca Carlo Cavazzoni CINECA Supercomputing Application & Innovation www.cineca.it 21 Aprile 2015 FERMI Name: Fermi Architecture: BlueGene/Q
More informationCan High-Performance Interconnects Benefit Memcached and Hadoop?
Can High-Performance Interconnects Benefit Memcached and Hadoop? D. K. Panda and Sayantan Sur Network-Based Computing Laboratory Department of Computer Science and Engineering The Ohio State University,
More informationI/O Performance of Cisco UCS M-Series Modular Servers with Cisco UCS M142 Compute Cartridges
White Paper I/O Performance of Cisco UCS M-Series Modular Servers with Cisco UCS M142 Compute Cartridges October 2015 2015 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public.
More informationCoolEmAll - Tools for realising an energy efficient data centre
CoolEmAll - Tools for realising an energy efficient data centre Wolfgang Christmann christmann informationstechnik + medien GmbH & Co. KG www.christmann.info 1 Outline CoolEmAll project RECS system towards
More information24/12/8 UP Server Nodes in 3U
MicroCloud 24/12/8 UP Server Nodes in 3U New! Intel Atom Processor C2750 Series / Xeon E5-2600 v2 / E3-1200 v3 Product Families Support High-Density, High-Performance High-Efficiency, and Cost-Effective
More informationEmerging storage and HPC technologies to accelerate big data analytics Jerome Gaysse JG Consulting
Emerging storage and HPC technologies to accelerate big data analytics Jerome Gaysse JG Consulting Introduction Big Data Analytics needs: Low latency data access Fast computing Power efficiency Latest
More informationHPC Update: Engagement Model
HPC Update: Engagement Model MIKE VILDIBILL Director, Strategic Engagements Sun Microsystems mikev@sun.com Our Strategy Building a Comprehensive HPC Portfolio that Delivers Differentiated Customer Value
More informationThe virtualization of SAP environments to accommodate standardization and easier management is gaining momentum in data centers.
White Paper Virtualized SAP: Optimize Performance with Cisco Data Center Virtual Machine Fabric Extender and Red Hat Enterprise Linux and Kernel-Based Virtual Machine What You Will Learn The virtualization
More informationEnabling Technologies for Distributed Computing
Enabling Technologies for Distributed Computing Dr. Sanjay P. Ahuja, Ph.D. Fidelity National Financial Distinguished Professor of CIS School of Computing, UNF Multi-core CPUs and Multithreading Technologies
More informationHP Moonshot: An Accelerator for Hyperscale Workloads
HP Moonshot: An Accelerator for Hyperscale Workloads Sponsored by HP, see HP Moonshot for more information www.hp.com/go/moonshot Executive Summary Hyperscale data center customers have specialized workloads,
More informationThe Transition to PCI Express* for Client SSDs
The Transition to PCI Express* for Client SSDs Amber Huffman Senior Principal Engineer Intel Santa Clara, CA 1 *Other names and brands may be claimed as the property of others. Legal Notices and Disclaimers
More informationCisco UCS B-Series M2 Blade Servers
Cisco UCS B-Series M2 Blade Servers Cisco Unified Computing System Overview The Cisco Unified Computing System is a next-generation data center platform that unites compute, network, storage access, and
More informationRecommended hardware system configurations for ANSYS users
Recommended hardware system configurations for ANSYS users The purpose of this document is to recommend system configurations that will deliver high performance for ANSYS users across the entire range
More informationPetascale Software Challenges. Piyush Chaudhary piyushc@us.ibm.com High Performance Computing
Petascale Software Challenges Piyush Chaudhary piyushc@us.ibm.com High Performance Computing Fundamental Observations Applications are struggling to realize growth in sustained performance at scale Reasons
More informationExploiting Remote Memory Operations to Design Efficient Reconfiguration for Shared Data-Centers over InfiniBand
Exploiting Remote Memory Operations to Design Efficient Reconfiguration for Shared Data-Centers over InfiniBand P. Balaji, K. Vaidyanathan, S. Narravula, K. Savitha, H. W. Jin D. K. Panda Network Based
More informationPerformance Evaluation of VMXNET3 Virtual Network Device VMware vsphere 4 build 164009
Performance Study Performance Evaluation of VMXNET3 Virtual Network Device VMware vsphere 4 build 164009 Introduction With more and more mission critical networking intensive workloads being virtualized
More informationHPC Cluster Decisions and ANSYS Configuration Best Practices. Diana Collier Lead Systems Support Specialist Houston UGM May 2014
HPC Cluster Decisions and ANSYS Configuration Best Practices Diana Collier Lead Systems Support Specialist Houston UGM May 2014 1 Agenda Introduction Lead Systems Support Specialist Cluster Decisions Job
More informationMotherboard- based Servers versus ATCA- based Servers
Motherboard- based Servers versus ATCA- based Servers Summary: A comparison of costs, features and applicability for telecom application hosting After many years of struggling for market acceptance, it
More informationThe High Performance Internet of Things: using GVirtuS for gluing cloud computing and ubiquitous connected devices
WS on Models, Algorithms and Methodologies for Hierarchical Parallelism in new HPC Systems The High Performance Internet of Things: using GVirtuS for gluing cloud computing and ubiquitous connected devices
More informationBuilding an energy dashboard. Energy measurement and visualization in current HPC systems
Building an energy dashboard Energy measurement and visualization in current HPC systems Thomas Geenen 1/58 thomas.geenen@surfsara.nl SURFsara The Dutch national HPC center 2H 2014 > 1PFlop GPGPU accelerators
More informationSupport a New Class of Applications with Cisco UCS M-Series Modular Servers
Solution Brief December 2014 Highlights Support a New Class of Applications Cisco UCS M-Series Modular Servers are designed to support cloud-scale workloads In which a distributed application must run
More informationCray Gemini Interconnect. Technical University of Munich Parallel Programming Class of SS14 Denys Sobchyshak
Cray Gemini Interconnect Technical University of Munich Parallel Programming Class of SS14 Denys Sobchyshak Outline 1. Introduction 2. Overview 3. Architecture 4. Gemini Blocks 5. FMA & BTA 6. Fault tolerance
More informationCisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database
Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database Built up on Cisco s big data common platform architecture (CPA), a
More informationParallel Algorithm Engineering
Parallel Algorithm Engineering Kenneth S. Bøgh PhD Fellow Based on slides by Darius Sidlauskas Outline Background Current multicore architectures UMA vs NUMA The openmp framework Examples Software crisis
More informationHow To Build A Cloud Computer
Introducing the Singlechip Cloud Computer Exploring the Future of Many-core Processors White Paper Intel Labs Jim Held Intel Fellow, Intel Labs Director, Tera-scale Computing Research Sean Koehl Technology
More informationClusters: Mainstream Technology for CAE
Clusters: Mainstream Technology for CAE Alanna Dwyer HPC Division, HP Linux and Clusters Sparked a Revolution in High Performance Computing! Supercomputing performance now affordable and accessible Linux
More informationMaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products
MaxDeploy Ready Hyper- Converged Virtualization Solution With SanDisk Fusion iomemory products MaxDeploy Ready products are configured and tested for support with Maxta software- defined storage and with
More informationCisco for SAP HANA Scale-Out Solution on Cisco UCS with NetApp Storage
Cisco for SAP HANA Scale-Out Solution Solution Brief December 2014 With Intelligent Intel Xeon Processors Highlights Scale SAP HANA on Demand Scale-out capabilities, combined with high-performance NetApp
More informationMichael Kagan. michael@mellanox.com
Virtualization in Data Center The Network Perspective Michael Kagan CTO, Mellanox Technologies michael@mellanox.com Outline Data Center Transition Servers S as a Service Network as a Service IO as a Service
More informationDavid Rioja Redondo Telecommunication Engineer Englobe Technologies and Systems
David Rioja Redondo Telecommunication Engineer Englobe Technologies and Systems About me David Rioja Redondo Telecommunication Engineer - Universidad de Alcalá >2 years building and managing clusters UPM
More informationUCS M-Series Modular Servers
UCS M-Series Modular Servers The Next Wave of UCS Innovation Marian Klas Cisco Systems June 2015 Cisco UCS - Powering Applications at Every Scale Edge-Scale Computing Cloud-Scale Computing Seamlessly Extend
More informationSystem Models for Distributed and Cloud Computing
System Models for Distributed and Cloud Computing Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF Classification of Distributed Computing Systems
More informationHUAWEI Tecal E6000 Blade Server
HUAWEI Tecal E6000 Blade Server Professional Trusted Future-oriented HUAWEI TECHNOLOGIES CO., LTD. The HUAWEI Tecal E6000 is a new-generation server platform that guarantees comprehensive and powerful
More informationPatriot Hardware and Systems Software Requirements
Patriot Hardware and Systems Software Requirements Patriot is designed and written for Microsoft Windows. As a result, it is a stable and consistent Windows application. Patriot is suitable for deployment
More informationScaling from Datacenter to Client
Scaling from Datacenter to Client KeunSoo Jo Sr. Manager Memory Product Planning Samsung Semiconductor Audio-Visual Sponsor Outline SSD Market Overview & Trends - Enterprise What brought us to NVMe Technology
More informationA+ Guide to Managing and Maintaining Your PC, 7e. Chapter 1 Introducing Hardware
A+ Guide to Managing and Maintaining Your PC, 7e Chapter 1 Introducing Hardware Objectives Learn that a computer requires both hardware and software to work Learn about the many different hardware components
More informationDIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION
DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION A DIABLO WHITE PAPER AUGUST 2014 Ricky Trigalo Director of Business Development Virtualization, Diablo Technologies
More informationSGI High Performance Computing
SGI High Performance Computing Accelerate time to discovery, innovation, and profitability 2014 SGI SGI Company Proprietary 1 Typical Use Cases for SGI HPC Products Large scale-out, distributed memory
More informationHP ProLiant Gen8 vs Gen9 Server Blades on Data Warehouse Workloads
HP ProLiant Gen8 vs Gen9 Server Blades on Data Warehouse Workloads Gen9 Servers give more performance per dollar for your investment. Executive Summary Information Technology (IT) organizations face increasing
More informationICRI-CI Retreat Architecture track
ICRI-CI Retreat Architecture track Uri Weiser June 5 th 2015 - Funnel: Memory Traffic Reduction for Big Data & Machine Learning (Uri) - Accelerators for Big Data & Machine Learning (Ran) - Machine Learning
More informationWhite paper. ATCA Compute Platforms (ACP) Use ACP to Accelerate Private Cloud Deployments for Mission Critical Workloads. Rev 01
White paper ATCA Compute Platforms (ACP) Use ACP to Accelerate Private Cloud Deployments for Mission Critical Workloads Rev 01 Contents 1 ATCA COMPUTE PLATFORMS... 1-3 1.1 CLOUD COMPUTING... 1-3 1.1.1
More informationLS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance
11 th International LS-DYNA Users Conference Session # LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance Gilad Shainer 1, Tong Liu 2, Jeff Layton 3, Onur Celebioglu
More informationTechnology and Cost Considerations for Cloud Deployment: Amazon Elastic Compute Cloud (EC2) Case Study
Creating Value Delivering Solutions Technology and Cost Considerations for Cloud Deployment: Amazon Elastic Compute Cloud (EC2) Case Study Chris Zajac, NJDOT Bud Luo, Ph.D., Michael Baker Jr., Inc. Overview
More informationHP Proliant BL460c G7
HP Proliant BL460c G7 The HP Proliant BL460c G7, is a high performance, fully fault tolerant, nonstop server. It s well suited for all mid-level operations, including environments with local storage, SAN
More informationAvid ISIS 7000. www.avid.com
Avid ISIS 7000 www.avid.com Table of Contents Overview... 3 Avid ISIS Technology Overview... 6 ISIS Storage Blade... 6 ISIS Switch Blade... 7 ISIS System Director... 7 ISIS Client Software... 8 ISIS Redundant
More informationIntroduction History Design Blue Gene/Q Job Scheduler Filesystem Power usage Performance Summary Sequoia is a petascale Blue Gene/Q supercomputer Being constructed by IBM for the National Nuclear Security
More informationEnergy Efficient MapReduce
Energy Efficient MapReduce Motivation: Energy consumption is an important aspect of datacenters efficiency, the total power consumption in the united states has doubled from 2000 to 2005, representing
More informationA Holistic Model of the Energy-Efficiency of Hypervisors
A Holistic Model of the -Efficiency of Hypervisors in an HPC Environment Mateusz Guzek,Sebastien Varrette, Valentin Plugaru, Johnatan E. Pecero and Pascal Bouvry SnT & CSC, University of Luxembourg, Luxembourg
More informationBuilt for Business. Ready for the Future.
Built for Business. Ready for the Future. Addressing End User and IT Needs Introducing 4 th Generation Intel Core Products Addressing Datacenter Needs Introducing Intel in Dell PowerEdge VRTX Usage Model
More informationRecent Advances in HPC for Structural Mechanics Simulations
Recent Advances in HPC for Structural Mechanics Simulations 1 Trends in Engineering Driving Demand for HPC Increase product performance and integrity in less time Consider more design variants Find the
More informationOutdated Architectures Are Holding Back the Cloud
Outdated Architectures Are Holding Back the Cloud Flash Memory Summit Open Tutorial on Flash and Cloud Computing August 11,2011 Dr John R Busch Founder and CTO Schooner Information Technology JohnBusch@SchoonerInfoTechcom
More informationEnabling Technologies for Distributed and Cloud Computing
Enabling Technologies for Distributed and Cloud Computing Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF Multi-core CPUs and Multithreading
More informationEDUCATION. PCI Express, InfiniBand and Storage Ron Emerick, Sun Microsystems Paul Millard, Xyratex Corporation
PCI Express, InfiniBand and Storage Ron Emerick, Sun Microsystems Paul Millard, Xyratex Corporation SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies
More informationA Taxonomy and Survey of Energy-Efficient Data Centers and Cloud Computing Systems
A Taxonomy and Survey of Energy-Efficient Data Centers and Cloud Computing Systems Anton Beloglazov, Rajkumar Buyya, Young Choon Lee, and Albert Zomaya Present by Leping Wang 1/25/2012 Outline Background
More informationAmazon EC2 XenApp Scalability Analysis
WHITE PAPER Citrix XenApp Amazon EC2 XenApp Scalability Analysis www.citrix.com Table of Contents Introduction...3 Results Summary...3 Detailed Results...4 Methods of Determining Results...4 Amazon EC2
More informationOperating System Support for Multiprocessor Systems-on-Chip
Operating System Support for Multiprocessor Systems-on-Chip Dr. Gabriel marchesan almeida Agenda. Introduction. Adaptive System + Shop Architecture. Preliminary Results. Perspectives & Conclusions Dr.
More information