PHYSICAL CORES V. ENHANCED THREADING SOFTWARE: PERFORMANCE EVALUATION WHITEPAPER




Preface

Today's world is rife with computing technology. It is all around us, and it is often difficult to decipher what is actually being offered at any given moment with any given advancement. Advancements can come so quickly that we do not take the time to understand their importance to our fields.

One such advancement in computing technology is parallelism: the simultaneous execution of many computing calculations. The idea behind parallelism is that a large calculation can be completed more quickly if it is divided into smaller calculations that are all solved in parallel, hence the name.

Parallelism achieved mass popularity in the technology industry only recently, long after the advent of the microprocessor. Physical constraints such as power delivery and heat dissipation, combined with the demand for ever-higher clock frequencies, have made single-core processors outdated. Parallel computing capabilities in common software applications, coupled with advances in microprocessor technology, have made multi-core processors the preferred architecture for today's PCs. Multi-core processors are differentiated by their core count, the number of physical cores contained on the processor die itself. Core count is often reflected in the product name: for example, the AMD Phenom II X4 is a quad-core processor.

Parallelism and multi-core processors go hand in hand. When a large calculation is divided into many smaller calculations, it is best to have dedicated resources ready to work on each of those smaller calculations simultaneously. This is where the performance of physical multi-core processors really begins to shine, and where multi-core processor technology shifts from a "want to have" to a "must have".
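The divide-and-combine idea described in the preface can be illustrated with a minimal Python sketch. The function names `chunked` and `parallel_sum_of_squares` and the sum-of-squares workload are hypothetical and chosen only for illustration; they do not come from the whitepaper or its benchmarks. Note also that in standard CPython builds, threads do not execute CPU-bound Python code simultaneously, so this sketch shows only the structure of the approach; `concurrent.futures.ProcessPoolExecutor` would be the usual choice when the goal is to occupy several physical cores.

```python
from concurrent.futures import ThreadPoolExecutor


def chunked(values, parts):
    """Split values into at most `parts` contiguous chunks of near-equal size."""
    size = max(1, -(-len(values) // parts))  # ceiling division
    return [values[i:i + size] for i in range(0, len(values), size)]


def parallel_sum_of_squares(values, workers=4):
    """Divide one large calculation into smaller ones and combine the results."""
    chunks = chunked(values, workers)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Each worker handles one of the smaller calculations.
        partials = list(pool.map(lambda chunk: sum(x * x for x in chunk), chunks))
    # The partial results are combined into the final answer.
    return sum(partials)
```

The number of workers plays the role that core count plays in the text: each chunk is intended to be handled by its own dedicated resource.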

Introduction

In this performance evaluation whitepaper we examine the difference between dedicating physical processor cores to completing smaller calculations simultaneously and relying on enhanced threading software running on existing cores.

Performance Measurements [1]

For this performance comparison we compare four processors, two each from AMD and Intel. The processors used for the performance evaluations are listed in Figure 1. The two AMD processors have six cores each, while the two Intel processors have four cores each.

Figure 1:

Model Name            Company   # of Cores   Clock Speed (Frequency)   Total Cache (L2+L3)
AMD Phenom II 1090T   AMD       6            3.2GHz                    9MB
AMD Phenom II 1055T   AMD       6            2.8GHz                    9MB
Core i7 930*          Intel     4            2.8GHz                    8MB
Core i5 750           Intel     4            2.66GHz                   8MB

* Equipped with Intel Hyper-Threading Technology

Apart from their brand and model numbers, the two Intel processors appear very similar and are primarily differentiated by their clock speeds. Both the Intel Core i5 750 and the Intel Core i7 930 are equipped with four cores and 8MB of total cache. One key difference between them is that the Intel Core i7 930 has a feature called Intel Hyper-Threading Technology. Intel Hyper-Threading Technology utilizes Intel's proprietary software to enhance the thread-processing capabilities of each of the Intel Core i7 930's four cores.

[1] Results based on performance testing completed at AMD performance labs on 05/01/2010. Please see the Testing Configurations Used section for both the AMD and Intel system configurations used in testing.

Performance Benchmarking

For the purpose of this exercise, performance is measured with benchmarking applications that focus on threading.

Benchmarks Used:

Maxon CINEBENCH: http://www.maxon.net/index.php?id=162
Persistence of Vision Raytracer (POV-Ray): http://www.povray.org/

Measuring Results

The results of Maxon Cinebench and Persistence of Vision Raytracer (POV-Ray) on all four processors are recorded in Figure 2 below.

Figure 2:

Model     # of Real Cores   Clock Speed   CINEBENCH Raw Score   POV-Ray Raw Score
1090T     6                 3.2GHz        5.65                  4306
1055T     6                 2.8GHz        5.0                   3743
i7-930    4                 2.8GHz        5.03                  3634
i5-750    4                 2.66GHz       3.75                  2709

The AMD Phenom II X6 six-core processors excel at straightforward threading exercises because they contain physical cores dedicated to performing the functions. Individual core performance of both AMD processors in Maxon Cinebench and Persistence of Vision Raytracer (POV-Ray) is calculated by dividing the total score of each benchmark by the number of physical cores contained on the processor used.

Cinebench: AMD Phenom II X6 1055T = 5.0 / 6 cores = 0.83
PovRay: AMD Phenom II X6 1055T = 3743 / 6 cores = 623.8

With the Intel Core i7 930 and Intel Core i5 750 processors, the calculation for performance is different. To examine the performance that can be attributed to Intel's Hyper-Threading Technology, performance is based on the difference in scores between the Intel Core i7 930 (4 cores + 4 Hyper-Threads) and the Intel Core i5 750 (4 cores). The result is then divided by four to isolate the performance attributable to the additional threads enabled by Intel Hyper-Threading Technology. [2]

Cinebench: Intel Hyper-Threading Technology Performance = Core i7 930 score less Intel Core i5 750 score = (5.03 - 3.75) / 4 = 0.321
PovRay: Intel Hyper-Threading Technology Performance = Core i7 930 score less Intel Core i5 750 score = (3634 - 2709) / 4 = 231

[2] The isolated Intel Hyper-Threading result is a theoretical calculation constructed by AMD to estimate the performance attributable to Intel Hyper-Threading software.
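The two calculations above can be reproduced with a short Python sketch. The scores are copied from Figure 2; the names `FIGURE_2`, `per_core` and `hyper_threading_estimate` are hypothetical and used only for illustration. This is a restatement of the arithmetic described in the text, not a benchmarking tool.

```python
# Raw scores as listed in Figure 2: (physical cores, CINEBENCH, POV-Ray).
FIGURE_2 = {
    "1090T": (6, 5.65, 4306),
    "1055T": (6, 5.0, 3743),
    "i7-930": (4, 5.03, 3634),
    "i5-750": (4, 3.75, 2709),
}


def per_core(model):
    """Divide each raw benchmark score by the number of physical cores."""
    cores, cinebench, povray = FIGURE_2[model]
    return cinebench / cores, povray / cores


def hyper_threading_estimate(ht_model="i7-930", base_model="i5-750"):
    """Score difference between the two Intel parts, divided by the core count.

    This mirrors the theoretical estimate described in the text; it assumes
    the whole score difference is due to the additional threads.
    """
    cores, ht_cinebench, ht_povray = FIGURE_2[ht_model]
    _, base_cinebench, base_povray = FIGURE_2[base_model]
    return (ht_cinebench - base_cinebench) / cores, (ht_povray - base_povray) / cores
```

Run against the rounded Figure 2 scores, `hyper_threading_estimate()` gives roughly 0.32 for CINEBENCH and 231.25 for POV-Ray, in line with the 0.321 and 231 quoted in the text.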

Comparing Results

When examining how much additional performance may be gained by increasing the physical processor core count versus using software that boosts the threading ability of existing cores, we find, based on our testing, that physical processing cores have a direct performance advantage over enhanced threading software. Looking at the direct results of our testing, we find:

Cinebench: The performance attributable to a core of the AMD Phenom II 1055T Six-Core Processor is 2.6 times that of the threading enhancement achieved on the Intel Core i7 930 Processor.
0.83 / 0.321 = 2.6

PovRay: The performance attributable to a core of the AMD Phenom II 1055T Six-Core Processor is 2.7 times that of the threading enhancement achieved on the Intel Core i7 930 Processor.
623 / 231 = 2.7

When we examine the AMD Phenom II X6 1090T Six-Core Processor, the performance advantage over enhanced threading software becomes even greater:

Cinebench: The performance attributable to a core of the AMD Phenom II X6 1090T Six-Core Processor is 2.9 times that of the threading enhancement achieved on the Intel Core i7 930 Processor.
0.942 / 0.321 = 2.93

PovRay: The performance attributable to a core of the AMD Phenom II X6 1090T Six-Core Processor is 3.1 times that of the threading enhancement achieved on the Intel Core i7 930 Processor.
718 / 231 = 3.1
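The four ratios above can be checked with a small Python sketch. The input figures are the per-core and threading values quoted in the text; the names `HT_GAIN`, `PER_CORE` and `core_vs_threading_ratios` are hypothetical and used only for illustration.

```python
# Threading-enhancement figures quoted in the text for the Intel Core i7 930.
HT_GAIN = {"CINEBENCH": 0.321, "POV-Ray": 231}

# Per-core figures quoted in the text for the two AMD processors.
PER_CORE = {
    "1055T": {"CINEBENCH": 0.83, "POV-Ray": 623},
    "1090T": {"CINEBENCH": 0.942, "POV-Ray": 718},
}


def core_vs_threading_ratios():
    """Ratio of per-core performance to the estimated threading gain."""
    return {
        (model, benchmark): score / HT_GAIN[benchmark]
        for model, scores in PER_CORE.items()
        for benchmark, score in scores.items()
    }


if __name__ == "__main__":
    for (model, benchmark), ratio in core_vs_threading_ratios().items():
        print(f"{model} {benchmark}: {ratio:.2f}x")
```

Rounded to one decimal place, the ratios agree with the 2.6, 2.7, 2.9 and 3.1 figures stated above.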

Conclusion

Based on the results of the Cinebench and PovRay benchmark tests, we find that physical cores outperform enhanced threading software in certain applications. Although direct results will always vary with the system configurations used, we find that a microprocessor's physical core count is not only an important factor to consider in the purchase of a new PC, but also one that enhances the PC's longevity.

DISCLAIMER The information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions and typographical errors. The information contained herein is subject to change and may be rendered inaccurate for many reasons, including but not limited to changes to the AMD Fusion Partner Program. AMD assumes no obligation to update or otherwise correct or revise this information. However, AMD reserves the right to revise this information and to make changes from time to time to the content hereof without obligation of AMD to notify any person of such revisions or changes. AMD MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT MAY APPEAR IN THIS INFORMATION. AMD SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL AMD BE LIABLE TO ANY PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF ANY INFORMATION CONTAINED HEREIN, EVEN IF AMD IS EXPRESSLY ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.

Trademark Attributions

AMD
AMD, the AMD Arrow logo, ATI, the ATI logo, AMD Athlon, AMD Phenom, Avivo, CrossFireX, PowerXpress, Radeon and combinations thereof are trademarks of Advanced Micro Devices, Inc. in the United States and/or other jurisdictions. Other names used in this presentation are for identification purposes only and may be trademarks of their respective owners. Copyright 2010 Advanced Micro Devices, Inc. All rights reserved.

Intel
BunnyPeople, Celeron, Celeron Inside, Centrino, Centrino Inside, Core Inside, i960, Intel, the Intel logo, Intel AppUp, Intel Atom, Intel Atom Inside, Intel Core, Intel Inside, the Intel Inside logo, Intel NetBurst, Intel NetMerge, Intel NetStructure, Hyper-Threading, Intel SingleDriver, Intel SpeedStep, Intel Sponsors of Tomorrow., the Intel Sponsors of Tomorrow. logo, Intel StrataFlash, Intel Viiv, Intel vpro, Intel XScale, InTru, the InTru logo, InTru soundmark, Itanium, Itanium Inside, MCS, MMX, Moblin, Pentium, Pentium Inside, skoool, the skoool logo, Sound Mark, The Journey Inside, vpro Inside, VTune, Xeon, and Xeon Inside are trademarks of Intel Corporation in the U.S. and other countries. Copyright Intel Corporation. All rights reserved. Intel Corporation, 2200 Mission College Blvd., Santa Clara, CA 95052-8119, USA.

Maxon
Maxon, Cinebench are trademarks of Maxon Computer Inc., in the U.S. and other countries. Copyright 1998-2009 MAXON Computer GmbH, MAXON Computer Inc., MAXON Computer Ltd.

Persistence of Vision Raytracer (POV-Ray)
Persistence of Vision Raytracer, POV-Ray, PovRay, are trademarks of Persistence of Vision Raytracer Pty. Ltd., in the U.S. and other countries. Copyright 2003-2008 Persistence of Vision Raytracer Pty. Ltd.

Testing Configurations Used

AMD System Specs

AMD Processor-Based System Configuration Information
AMD is required to identify hardware and software used in evaluation configurations that are NOT publicly available.

Operating System
  Name:                          Win7 64bit Ultimate
Processor
                                 AMD Phenom II X6 1055T
                                 AMD Phenom II X6 1090T
Hardware
  Motherboard:                   Talapia I (Reference Board)
  BIOS Info:                     A7640AMS.10i
  Is BIOS publicly available?    Yes
  Chipset:                       RS880 + SB750
Memory
  Manufacturer & Type:           Corsair CM2X2048-8500C5D
  Quantity & Size (each)/(MB):   Qty (4) 2048MB DIMM Module
  Total Memory Size (MB):        8 GIG Total
Hard Drive
  Model Name:                    x1 Western Digital Raptor
  Model Number:                  WD300HLFS WD VelociRaptor
  Hard Drive Size:               300GB
  Transfer Mode:                 SATA 10000 RPM
  Other Info:                    NTFS was used to format the hard disk
Network Card:                    Onboard
Sound Card:                      Onboard
Video
  Graphics Adapter:              Sapphire ATI Radeon HD 5870
  Memory Size (MB) and Type:     1GB DDR5

Intel System Specs

AMD Processor-Based System Configuration Information
AMD is required to identify hardware and software used in evaluation configurations that are NOT publicly available.

Operating System
  Name:                          Win7 64bit Ultimate
Processor
                                 Intel Core i7 930
                                 Intel Core i5 750
Hardware
  Motherboard:                   Intel DX580SO
  BIOS Info:                     5020
  Is BIOS publicly available?    Yes
  Chipset:                       RS880 + SB750
Memory
  Manufacturer & Type:           Corsair CM2X2048-8500C5D
  Quantity & Size (each)/(MB):   Qty (4) 2048MB DIMM Module
  Total Memory Size (MB):        8 GIG Total
Hard Drive
  Model Name:                    x1 Western Digital Raptor
  Model Number:                  WD300HLFS WD VelociRaptor
  Hard Drive Size:               300GB
  Transfer Mode:                 SATA 10000 RPM
  Other Info:                    NTFS was used to format the hard disk
Network Card:                    Onboard
Sound Card:                      Onboard
Video
  Graphics Adapter:              Sapphire ATI Radeon HD 5870
  Memory Size (MB) and Type:     1GB DDR5