Intel Itanium Architecture



Similar documents
Itanium 2 Platform and Technologies. Alexander Grudinski Business Solution Specialist Intel Corporation

Innovativste XEON Prozessortechnik für Cisco UCS

Leading Virtualization Performance and Energy Efficiency in a Multi-processor Server

Competitive Comparison Dual-Core Intel Xeon Processor-based Platforms vs. AMD Opteron*

Hitachi Virtage Embedded Virtualization Hitachi BladeSymphony 10U

Intel Xeon Processor E5-2600

Scaling from Datacenter to Client

A Superior Hardware Platform for Server Virtualization

Intel Xeon Processor 3400 Series-based Platforms

Quad-Core Intel Xeon Processor

A Quantum Leap in Enterprise Computing

Intel Itanium Quad-Core Architecture for the Enterprise. Lambert Schaelicke Eric DeLano

Intel Xeon Processor 5400 Series

Intel Cluster Ready Appro Xtreme-X Computers with Mellanox QDR Infiniband

NEC Corporation of America Intro to High Availability / Fault Tolerant Solutions

Scaling in a Hypervisor Environment

Enabling Technologies for Distributed and Cloud Computing

Intel Virtualization and Server Technology Update

Performance Comparison of Fujitsu PRIMERGY and PRIMEPOWER Servers

Highly Scalable Server for Many Possible Uses. MAXDATA PLATINUM Server 3200 I

How To Configure Your Computer With A Microsoft X86 V3.2 (X86) And X86 (Xo) (Xos) (Powerbook) (For Microsoft) (Microsoft) And Zilog (X

The Foundation for Better Business Intelligence

ECLIPSE Best Practices Performance, Productivity, Efficiency. March 2009

Enabling Technologies for Distributed Computing

Oracle Database Reliability, Performance and scalability on Intel Xeon platforms Mitch Shults, Intel Corporation October 2011

Stovepipes to Clouds. Rick Reid Principal Engineer SGI Federal by SGI Federal. Published by The Aerospace Corporation with permission.

Servervirualisierung mit Citrix XenServer

Emerging storage and HPC technologies to accelerate big data analytics Jerome Gaysse JG Consulting

GPU System Architecture. Alan Gray EPCC The University of Edinburgh

ICONICS Choosing the Correct Edition of MS SQL Server

Intel Xeon Processor 5500 Series. An Intelligent Approach to IT Challenges

Removing Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays. Red Hat Performance Engineering

SQL Server Consolidation Using Cisco Unified Computing System and Microsoft Hyper-V

LS DYNA Performance Benchmarks and Profiling. January 2009

Sockets vs. RDMA Interface over 10-Gigabit Networks: An In-depth Analysis of the Memory Traffic Bottleneck

System Configuration and Order-information Guide ECONEL 100 S2. March 2009

IBM System x Enterprise Servers in the New Enterprise Data

Leading Virtualization 2.0

OpenPower: IBM s Strategy for Best of Breed 64-bit Linux

IBM System x family brochure

Infrastructure Matters: POWER8 vs. Xeon x86

How System Settings Impact PCIe SSD Performance

Windows Server 2008 R2 for Itanium-Based Systems offers the following high-end features and capabilities:

Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Quad-Core Intel Xeon Processor 5400 Series. Boost Performance and Energy Efficiency with new 45nm multi-core Intel Xeon processors

ECLIPSE Performance Benchmarks and Profiling. January 2009

SAP HANA - an inflection point

Dual-Core Intel Xeon Processor 5100 Series

FLOW-3D Performance Benchmark and Profiling. September 2012

Comparing Free Virtualization Products

2015 Global Technology conference. Diane Bryant Senior Vice President & General Manager Data Center Group Intel Corporation

Dell Virtualization Solution for Microsoft SQL Server 2012 using PowerEdge R820

Solving the Hypervisor Network I/O Bottleneck Solarflare Virtualization Acceleration

WHITE PAPER Mainstreaming Server Virtualization: The Intel Approach

SUSE Linux Enterprise 10 SP2: Virtualization Technology Support

Hardware Based Virtualization Technologies. Elsie Wahlig Platform Software Architect

Small Industries Development Bank of India Request for Proposal (RfP) For Purchase of Intel Servers for Various Locations

IOS110. Virtualization 5/27/2014 1

Windows Server 2008 R2 Hyper V. Public FAQ

AMD Opteron Quad-Core

Data Center Storage Solutions

I/O Virtualization Using Mellanox InfiniBand And Channel I/O Virtualization (CIOV) Technology

Oracle Database Scalability in VMware ESX VMware ESX 3.5

Lecture 11: Multi-Core and GPU. Multithreading. Integration of multiple processor cores on a single chip.

Evaluating Intel Virtualization Technology FlexMigration with Multi-generation Intel Multi-core and Intel Dual-core Xeon Processors.

SiS AMD Athlon TM 64FX/PCI-E Solution. Silicon Integrated Systems Corp. Integrated Product Division April. 2004

Virtualization Technologies and Blackboard: The Future of Blackboard Software on Multi-Core Technologies

Desktop Processor Roadmap. Solution Provider Accounts

NEC Express5800/A2000 Series Scalable Enterprise Server.OR CX (Core Xeon) High Operation-ability

How SSDs Fit in Different Data Center Applications

Performance Characteristics of VMFS and RDM VMware ESX Server 3.0.1

Competitive Comparison

Analyzing the Virtualization Deployment Advantages of Two- and Four-Socket Server Platforms

ICRI-CI Retreat Architecture track

Multi-core and Linux* Kernel

Table Of Contents. Page 2 of 26. *Other brands and names may be claimed as property of others.

Understanding the Benefits of IBM SPSS Statistics Server

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms

Parallels Server 4 Bare Metal

Intel Server Board S5000PALR Intel Server System SR1500ALR

Intel RAID SSD Cache Controller RCS25ZB040

BC43: Virtualization and the Green Factor. Ed Harnish

SUBJECT: SOLIDWORKS HARDWARE RECOMMENDATIONS UPDATE

WHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY BX620 S6

Where IT perceptions are reality. Test Report. OCe14000 Performance. Featuring Emulex OCe14102 Network Adapters Emulex XE100 Offload Engine

Sun Microsystems Special Promotions for Education and Research January 9, 2007

HP ProLiant BL460c achieves #1 performance spot on Siebel CRM Release 8.0 Benchmark Industry Applications running Microsoft, Oracle

Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging

Comparing Multi-Core Processors for Server Virtualization

OpenPOWER Outlook AXEL KOEHLER SR. SOLUTION ARCHITECT HPC

Models Smart Array 6402A/128 Controller 3X-KZPEC-BF Smart Array 6404A/256 two 2 channel Controllers

Emerging Technology for the Next Decade

Transcription:

Intel Itanium Architecture Roadmap and Technology Update Dr. Gernot Hoyler Technical Marketing EMEA

Intel Itanium Architecture Growth MARKET Over 3x revenue growth Y/Y* More than 10x growth* in shipments of large SMP systems (64+) IDC Worldwide Quarterly Server Tracker, August 2004 SOFTWARE Over 100% Y/Y growth 2,000 HARDWARE # OEM server models keep growing 2002 2003 2004 2P, 4P 20 50 70 8P - 128P 5 15 20 8 of 9 RISC vendors selling Intel Itanium-based servers END-CUSTOMERS 38 of Global 100 companies using Intel Itanium-based servers today 1,000 2004 forecast of 2000 applications reached TODAY! 0 1H 03 2H 03 1H 04 2H 04 High profile wins: General Mills, Pfizer, Thomson Financial, Procter and Gamble, The Weather Channel, First American Title, Motorola Other names and brands may be claimed as the property of others.

Broad Ecosystem Support Application Choice Operating System Choice System Vendor & Platform Choice OpenVMS* >2000 native applications 32-bit application support with IA-32 Execution Layer Windows*, Linux*, and Unix* support >70 vendors selling today 20 large SMP systems 2-way to 256-way systems 8 of 9 RISC vendors support Broad industry support Choice that you don t t get with RISC

Itanium Processor Family: A Strategic Product for Intel Design teams working on more than 6 future processors

Intel Itanium Processor Family 2001 2002 2003 2004 2005 future Itanium 2 Madison** Madison9M Montecito ** Tukwila** Common Platform 800MHz 4MB L3-Cache 460GX Chip-set OEM Chip-sets 180nm 1GHz 3MB il3-cache E8870 Chip-set OEM Chip-sets 180nm 1.5GHz 6MB il3-cache E8870 Chip-set OEM Chip-sets 130nm 1.6GHz 9MB il3-cache E8870 Chip-set OEM Chip-sets 130nm >>1.6GHz 24MB il3- Cache Dual-Core E8870 OEM Chip-sets 90nm All features and dates specified are targets provided for planning purposes only and are subject to change. Multi-Core Developed by former Alpha team **codename

Common Platform & Infrastructure Today: Itanium Processor exceeds RISC performance & price / perf Today: Itanium platform delivering superior price / performance vs Intel Xeon Processor on transaction processing 30% more transactions at 10% incremental cost of hardware platform/ OS / database*** 07: Itanium platform cost reduced to parity with Intel Xeon processor- based platforms Common platform components to lead to common platform infrastructure over time All features and dates specified are targets provided for planning purposes only and are subject to change without notice.

Current Architecture or Solutions Choice and Flexibility for Evolving Enterprise Servers Transition Benefits Architecture of Choice RISC Architecture Target Applications Database, ERP, BI, HPC Exceptional performance choice of operating systems, software and hardware vendors TODAY Mission critical 64-bit architecture Premier performance, reliability, scalability; cost effective vs. RISC 64-bit support via Intel EM64T, IA-32 Architecture great performance for 32-bit applications Target Applications MP: SCM, CRM, BI, ERP DP: HPC, Application Server, Workgroup E-Commerce, E Portals, Firewall/Security, Workstation apps Mainstream 64-bit architecture; price performance reliability * Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance.

Future technologies: Processor and Platform Enhancements Multiple Cores and Thread Virtualization Power Management

Intel Processor Technology Trends Era of Thread & Core Parallelism Era of Instruction Parallelism All features and dates specified are targets provided for planning purposes only and are subject to change without notice

Enterprise Multi-Core Transition dual core a natural evolution 4 or more cores + cache TODAY Single Core 2005-2006 2006 Dual Core Future Multi-Core 2 or more cores + cache All products, dates and features are preliminary and subject to change without notice

Itanium Architecture Optimized for Multi-Core Architecture: Parallelism and many registers to keep data on-chip Core size: Smaller than IA-32, up to 2x more cores per die on than on IA-32 All products, dates, and figures are preliminary and are subject to change without notice. * EPIC is Itanium s architecture Explicitly Parallel Instruction Set Computing All features and dates specified are targets provided for planning purposes only and are subject to change without notice.

Multi-Threading Approaches Sharing resources High utilization High complexity Medium latency hiding Medium utilization Low complexity Medium latency hiding Temporal A A A B B B B B A A A A B B B A A A B B B B B Single thread/cycle Event A A A A A A A A A B B B B B B B B B B B B Multiple thread/cycle A A A B B A A B B B A A B B B A A A B B B A A B B B B Medium utilization Low complexity High latency hiding Event for the core and Multiple for the caches

Montecito Multi-Threading Serial Execution A i Idle A i+1 B i Idle B i+1 Montecito Multi-threaded threaded Execution A i Idle A i+1 B i B i+1 Multi-threading threading decreases stalls and increases performance

Optimal Dynamic Thread Switching Determine when execution is stalled for long latency operations Practical Predict that a long latency event will stall execution Hysteresis to avoid needless switches hint@pause gives software control Effective solution allowing streaming and access clumping

Multi-Level Parallelism MULTI-CORE DUAL-CORE MULTI-THREADING THREADING SINGLE CORE (ILP, SIMD) ILP,SIMD Thread Core Die/Socket All features and dates specified are targets provided for planning purposes only and are subject to change without notice

Performance Innovations Itanium Processor Performance Strategy: Increased performance/thread, then increased number of threads Source: Intel Corporation All data measured at Intel on current processors. Projections on future processors based on Intel estimates using similar workload testing at Intel (Transaction processing using 64GB of memory, Floating point using 4GB). Increased Performance/Thread 2 1.8 1.6 1.4 1.2 1 0.8 0.6 0.4 0.2 0 Floating Point Performance (Single thread) - Relative Performance Driven by: Increased frequency Increased L3 cache Increased bus speed Itanium 2 Processor Itanium 2 6M Itanium 2 9M Montecito 3 2.5 2 1.5 1 0.5 0 Multi-threaded threaded Performance Montecito: 4 virtual processors 4S Transaction Processing Relative Performance Itanium 2 Processor Itanium 2 6M Itanium 2 9M Montecito Driven by: Dual core Montecito Multi-threading threading support in Montecito *Third-party marks and brands are the property of their respective owners; Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. Buyers should consult other sources of information to evaluate the performance of systems or components they are considering purchasing. All products, dates, and figures are preliminary and are subject to change without any notice. Copyright Intel Corporation 2004.

First Implementation of Key Features: Montecito Core Montecito Core Core 1 Core 2 Key Processor Features Intel s s first dual-core processor Intel s s first processor with >1 billion transistors 24 MB L3 cache Multi-threading threading Compatible with existing Dualcore Itanium 2-based 2 systems Pellston, Foxton and Silvervale Technology Targeting H2 2005 2005 1.7 Billion Transistors 1MB L2I 2 Way Multi-threading threading 2x12MB L3 caches with Pellston L3 Cache System Bus Arbiter L3 Cache Multiple cores, Multiple threads and L3 Cache on ONE die 90nm Power Management/ Frequency Boost (Foxton)

Montecito Technologies Pellston Technology Automatically disables cache lines in the event of a hard cache memory error Allows processor and system to continue normal operation Improve reliability / / uptime uptime Silvervale Technology VM1 VM-n Foxton Technology Processor boosts performance dynamically based on application power consumption (up to 10% freq.) Largest performance impact expected on transaction processing More More performance / / no no platform modifications App MRTE OS(1) Memory Network App Virtual Machine Monitor Silvervale Technology Shared Physical Host Hardware Processors Storage App MRTE OS(N) App Graphics Keyboard / Mouse HW Value Provides the hooks for a hardware supported virtualisation Enables the Virtual Machine Monitor to be more robust and shows a higher performance Next Next level level of of Server Server Consolidation Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. Buyers should consult other sources of information to evaluate the performance of systems or components they are considering purchasing. All products, dates, and figures are preliminary and are subject to change without any notice. Copyright Intel

Designing for Power Leakage current increases exponentially as process size decreases linearly Business as usual is not an option 45nm CPU might need 1kW

Transistor Design Roadmap Extending Moore s Law: Continued World Leading Transistor Scaling and Novel Structures for Low Power / High Performance All features and dates specified are targets provided for planning purposes only and are subject to change without notice

Power Reduction Features In addition to on-going improvements such as voltage scaling, new power reduction features are planned for each process generation Strained silicon Sleep transistors 90nm 65nm 45nm 32nm 2003 2005 2007 2009 High-k/ metal gate Tri-gate transistors All features and dates specified are targets provided for planning purposes only and are subject to change without notice

Power Management Innovation Growing Capabilities over TIME Reduced Power via Management Demand Based Switching (DBS( DBS) Aggressive use of C1E state Power Prediction/Monitoring Report Configuration Power (PConfig) Monitor Power (PSMI( PSMI) SILICON TRANSISTORS PACKAGES HEAT SINKS SYSTEMS FACILITIES Lower power cores Process (65nm( 65nm) Proliferate Benefits

Advances in Memory Technology DDR2 Features DRAM Technology for Multiple Generations Higher Bandwidth Lower Power On Die Termination Four DIMMs per Channel IT Benefits Increased Platform Longevity Higher System Performance Increased Server Density Increased Memory Density Lower cost memory configurations Note: Comparison relative to DDR333

Fully-Buffered DIMM Memory

DDR2 vs. FB-DIMM

FBD & Dual Core Analysis Simulation -- Theoretical Analysis FBD Performance Advantages Significant with Dual Core CPUs 40% FBD Addresses Mem Channel Demands of Dual Core CPUs 100% (lower is better) Gains vs DDR2 2Ch 35% 30% 25% 20% FBD 2Ch FBD 4Ch 80% 60% DDR2 2Ch FBD 2Ch FBD 4Ch 15% 40% 10% 20% 5% 0% TPC-C 0% Memory Channel Utilization Unleashing Dual Core Performance Requires FBD

PCI Express* for I/O Advantages Performance Performance Bottleneck Processor Network I/O Source: Merrill Lynch 1980 1990 2000 IDC expects PCI Express to be a leading contender in keeping the future IT infrastructure fed with lots of I/O delivered quickly and managed securely. - IDC, Vernon Turner, June 2003 I/O Breakthrough Keep pace with rest of platform Lower Cost Standards-based & Serial I/O drive inherently lower product development & manufacturing, and TCO Investment Protection Future proofing through 10 year roadmap. Increased RAS. Rapid Enterprise Adoption Majority of OEMs/ODMs with PCIe slots, compelling 04 adapter availability, and strong IHV development plans

PCI Express* Performance Mellanox Technologies Data Mellanox Measurements: Realizing over 2.9x the bandwidth of PCI-X X 133 20% reduction in latency reduced CPU overhead Additional performance possible with tuning MB/s 2500 2000 1500 1000 500 0 InfiniHost III Ex Bandwidth (Actual) IB Over PCI-X Uni-directional B/W IB Single Port Over 8x PCI Express IB Dual Port over 8x PCI Express Bi-directional B/W **Source: Mellanox Technologies(March 04) Full report available at : http://www.mellanox.com/products/shared/infinihost_iii_ex_launch.pdf PCI-Express delivers significant performance improvement over PCI-X

Vielen Dank! www.intel.com

Backup

Long Term Goal: 1M Transactions per Minute Today In 2007 With planned performance improvements, a 4-way 4 Itanium -based server in 07 could deliver equivalent OLTP of a current 64-way system, delivering dramatically Lower TCO Lower power consumption Higher density Shown are representations of 64-way system (today) and 4-way system (2007). Not to scale. All products, dates, comparisons, and information are preliminary and subject to change without notice.

SILVERVALE TECHNOLOGY Better Virtualization through OSVs and ISVs Virtual Machine Virtual Machine Virtual Machine Virtual Machine App App App App App App App App Virtual Machine App App App App App App App App App MRTE MRTE App App MRTE MRTE App MRTE OS(X) OS(X) OS(X) OS(X) OS(1) Virtual Machine Monitor NEW VIRTUALIZATION TECHNOLOGY Virtualization End User Benefits Reliability Efficiency & flexibility Security Shared Physical Host Hardware Silvervale Technology Benefits Choice Robustness Performance

IA-32 Execution Layer Availability IPF Code Itanium 2 processor IA-32 Code IA-32 EL IA-32 H/W Historically, support of IA-32 applications has been carried out by on-die hardware When using OS with IA-32 EL, support for IA-32 applications is provided by IA-32 EL Operating System Gives access to the ia-32 EcoSystem Microsoft* Windows* Server 2003 Enterprise Edition Microsoft Windows Server 2003 Datacenter Edition Microsoft Windows XP Professional 64-Bit Edition Red Hat* Enterprise Linux 3 SGI* Advanced Linux Environment with ProPack* 3.0 SUSE* Linux Entreprise Server 9 Asianux* 1.0 1 Red Flag* Advanced Server 4.1 1 Red Flag DC Server 4.1 1 Miracle Linux* 3.0 1 Available 2H 04 2H 04 2H 04 2H 04 For Microsoft Windows, IA-32 EL is currently available at Microsoft Download Center,, and will ship with Windows Server 2003 SP1 RTM For Linux, IA-32 EL ships or will ship with the OS, as indicated by the availability date 1 Asianux 1.0 includes Red Flag Advanced Server 4.1 and Miracle Linux 3.0. Performance Scaling with Future Processors 1 SPECint_base2000* Itanium 2 Itanium Processor Processor 1.5 GHz Family IA-32 EL All products, dates, comparisons and information are preliminary and subject to change without notice. ~ Xeon Processor MP 1.5 GHz '03 '04 '05 '06

SW Tools, Cross Platform Support Currently available Planned Single Source Code Multiple Platforms Itanium 2, 2, Xeon (EM64T/32Bit) and and XScale Processor

Foxton Technology Dynamically adjust Voltage (V) & frequency (f) Exploit full power envelope for all applications Demand Based Switching and flexible power settings Large power change small frequency change (P=fCV 2 ) 3% power change with only 1% frequency change Monitor/calculate power and temperature Set Voltage to the minimum value needed to support highest frequency Over power and/or temperature results in voltage change Frequency responds to global and local voltage