Technology Note. PCI Express

Similar documents
Creating a PCI Express Interconnect AJAY V. BHATT, TECHNOLOGY AND RESEARCH LABS, INTEL CORPORATION

The Bus (PCI and PCI-Express)

How PCI Express Works (by Tracy V. Wilson)

Intel PCI and PCI Express*

Introduction to PCI Express Positioning Information

3.4 Planning for PCI Express

PCI EXPRESS: AN OVERVIEW OF PCI EXPRESS, CABLED PCI EXPRESS, AND PXI EXPRESS

PCI Express Overview. And, by the way, they need to do it in less time.

PCI Express* Ethernet Networking

Maximizing Server Storage Performance with PCI Express and Serial Attached SCSI. Article for InfoStor November 2003 Paul Griffith Adaptec, Inc.

PCI EXPRESS TECHNOLOGY. Jim Brewer, Dell Business and Technology Development Joe Sekel, Dell Server Architecture and Technology

Computer Systems Structure Input/Output

PCI Express and Storage. Ron Emerick, Sun Microsystems

Appendix A. by Gordon Getty, Agilent Technologies

Successfully negotiating the PCI EXPRESS 2.0 Super Highway Towards Full Compliance

EDUCATION. PCI Express, InfiniBand and Storage Ron Emerick, Sun Microsystems Paul Millard, Xyratex Corporation

Pericom PCI Express 1.0 & PCI Express 2.0 Advanced Clock Solutions

PCI Express: Interconnect of the future

PCI Express Basic Info *This info also applies to Laptops

How to build a high speed PCI Express bus expansion system using the Max Express product family 1

EureCard DOMINO series

PCI Express IO Virtualization Overview

pco.interface GigE & USB Installation Guide

WD Hard Drive Interface Guide

Universal Serial Bus Implementers Forum EHCI and xhci High-speed Electrical Test Tool Setup Instruction

PCAN-ISA. CAN Interface for ISA. User Manual

LonManager PCC-10 and ISA Protocol Analyzers Models and

Orbit PCI Mk 2 Network Card. User Manual. Part No Issue 4

USB 3.0 to SATA 3.5" Enclosure Installation Guide

From Bus and Crossbar to Network-On-Chip. Arteris S.A.

Transitioning to 6Gb/s SAS (Serial-Attached SCSI) A Dell White Paper

The Elements of GigE Vision

PCI Express Impact on Storage Architectures and Future Data Centers. Ron Emerick, Oracle Corporation

IxChariot Virtualization Performance Test Plan

Computer Performance. Topic 3. Contents. Prerequisite knowledge Before studying this topic you should be able to:

HP Z Turbo Drive PCIe SSD

4 Channel 6-Port SATA 6Gb/s PCIe RAID Host Card

USB Port PCI Express Card

Communicating with devices

Fibre Channel over Ethernet in the Data Center: An Introduction

This is a product of VCCI Class B Compliance

Chapter 5 Busses, Ports and Connecting Peripherals

760 Veterans Circle, Warminster, PA Technical Proposal. Submitted by: ACT/Technico 760 Veterans Circle Warminster, PA

FOUNDATION Fieldbus High Speed Ethernet Control System

The Motherboard Chapter #5

32-bit PCI 10/100/1000Mbps Gigabit Ethernet Card. User s Guide. FCC Warning. CE Mark Warning. VCCI Warning

PCI vs. PCI Express vs. AGP

Video Conferencing Glossary of Terms

IDE/ATA Interface. Objectives. IDE Interface. IDE Interface

An Introduction to PCI Express

2. THE PCI EXPRESS BUS

Fibre Channel over Ethernet: Enabling Server I/O Consolidation

PCIe XMC x8 Lane Adapter

3 Address Spaces & Transaction Routing. The Previous Chapter. This Chapter. The Next Chapter

Intel Desktop Board D945GNT

Introduction to PCI Express

Hardware RAID vs. Software RAID: Which Implementation is Best for my Application?

Chapter 5 Cubix XP4 Blade Server

DATA SECURITY 1/12. Copyright Nokia Corporation All rights reserved. Ver. 1.0

White Paper: M.2 SSDs: Aligned for Speed. Comparing SSD form factors, interfaces, and software support

White Paper Abstract Disclaimer

Lizy Kurian John Electrical and Computer Engineering Department, The University of Texas as Austin

The proliferation of the raw processing

Lecture 2: Computer Hardware and Ports.

A+ Guide to Managing and Maintaining Your PC, 7e. Chapter 1 Introducing Hardware

Computer Organization & Architecture Lecture #19

Appendix A: Camera connectors

USB 3.0 Bandwidth, High Definition Performance

PCI Express Impact on Storage Architectures and Future Data Centers. Ron Emerick, Oracle Corporation

USB Port PCI Express Card

PCI Technology Overview

Layer 3 Network + Dedicated Internet Connectivity

Computer buses and interfaces

SAN Conceptual and Design Basics

Intel Desktop Board DG965RY

Hardware Level IO Benchmarking of PCI Express*

Web Site: Forums: forums.parallax.com Sales: Technical:

ISO11783 a Standardized Tractor Implement Interface

DELL RAID PRIMER DELL PERC RAID CONTROLLERS. Joe H. Trickey III. Dell Storage RAID Product Marketing. John Seward. Dell Storage RAID Engineering

Intel Processors in Industrial Control and Automation Applications Top-to-bottom processing solutions from the enterprise to the factory floor

Analyzing Full-Duplex Networks

SCSI vs. Fibre Channel White Paper

HP 8Gb PCIe FC HBAs Overview

RocketU 114x Host Controller

Steps to Migrating to a Private Cloud

Understanding the Performance of an X User Environment

PCI Express: The Evolution to 8.0 GT/s. Navraj Nandra, Director of Marketing Mixed-Signal and Analog IP, Synopsys

Performance and Recommended Use of AB545A 4-Port Gigabit Ethernet Cards

PCIe SATA 6G Raid Card

Intel architecture. Platform Basics. White Paper Todd Langley Systems Engineer/ Architect Intel Corporation. September 2010

Serial/ Parallel Combo PCi Card user manual

2-Serial/1-Parallel Port PCI Adapter RC303 User Manual

The Shortcut Guide to Balancing Storage Costs and Performance with Hybrid Storage

Selecting the Optimum PCI Express Clock Source

Bus Interconnect Evolution in embedded computing

Application Note. PCIEC-85 PCI Express Jumper. High Speed Designs in PCI Express Applications Generation GT/s

SATA II 4 Port PCI RAID Card RC217 User Manual

Chapter 13 Selected Storage Systems and Interface

Transcription:

Technology Note www.euresys.com info@euresys.com Copyright 2006 Euresys s.a. Belgium. Euresys is registred trademarks of Euresys s.a. Belgium. Other product and company names listed are trademarks or trade names of their respective manufacturers. Euresys reserves the right to modify product specifications and price without previous notice.

Contents Introduction 3 Bus Variation 3 Background 3 Structure 4 Lanes 4 Architecture 5 Layer Stack 6 Cost 6 Mechanical 6 Power 6 Conventional Comparison 7 Performance 8 Number of Lanes 8 Clock Speed 9 Interoperability 9 General Rule 9 Remark 10 References 10 and 10 Euresys 10 WARNING EURESYS S.A. shall retain all property rights, title and interest in the documentation and trademarks of EURESYS S.A. The licensing, use, leasing, loaning, translation, reproduction, copying or modification of the marks or documentation of EURESYS S.A. contained in this book, is not allowed without prior notice. EURESYS S.A. may modify or change the information given in this documentation at any time, in its discretion, and without prior notice. EURESYS S.A. shall not be liable for any loss of or damage to revenues, profits, goodwill, data, information systems or other special, incidental, indirect, consequential or punitive damages of any kind resulting of omissions or errors in this book. www.euresys.com Copyright 2006 Euresys s.a. Belgium. Euresys is registred trademark of Euresys s.a. Belgium. Other product and company names listed are trademarks or trade names of their respective manufacturers. 2

Introduction Today, is mainstream technology for connecting a number of devices in a PC. Most motherboards have architecture and additional slots, in consumer as well as in high-end PCs. The Conventional bus has served the industry well for the past 10-12 years and it will continue to play an important role in the coming years. Following successful Conventional tracks, the new standard was developed. performs as a general purpose high data rate I/O interconnect in multiple market segments and applications. The key Conventional attributes, such as its usage model and software interfaces, are maintained. At the same time, its bandwidth-limiting, parallel bus implementation is replaced by a long-life, fully-serial interface. provides industry leading performance and cost-effectiveness for the coming decade. Bus Variation Euresys has published an application note dedicated to Conventional and -X: Bus Variation (available from Euresys website). In addition to compatibility issues, this note exposes important performance figures. Please refer to this application note for comparison material with performance and characteristics. Background inherits from ten years of Conventional usage. The current status of busses and the important milestones of the Conventional and -X history are summarized hereafter. Conventional is characterized by a bus width of 32 or 64 bits, and a clock speed of 33 or 66 MHz. Any combination of width and speed can be found. The majority of existing desktop PCs are, at least partly, equipped with Conventional technology. The original revision 1.0 local bus specification was introduced in the early 1990s. The industrial inception of this technology started with revision 2.0 in April 1993, which included expansion slots and 66 MHz clock rate support. In the following years, three revisions (2.1 in 1995, 2.2 in 1998 and 2.3 in 2002) introduced significant improvements. The final specification revision 3.0 published in February 2004- definitely precludes the use of 5 Volt add-in cards. -X is a high performance evolution of Conventional. It uses the same hardware structure, which makes it possible to operate a -X add-in card in a Conventional slot, and vice-versa. The -X technology is dedicated to server applications, and is therefore found on many high-end PC motherboards. -X allows for a considerably higher bandwidth than Conventional. It is characterized by a higher clock speed (up to 133 MHz), and is usually found in the 64-bit bus width, although the 32-bit width can exist. Revision 1.0, initially issued in July 2000, specifies -X as an addendum to Conventional. Revision 2.0, an improvement to -X specification, published in November 2002, extends the clocking capability to 266 and 533 MHz. 3

Structure Lanes The physical layer interfaces fast and robust serial point-to-point bus. A lane -the basic communication channel- combines two serial busses, one transmitting data and one receiving data. From the application side, a Lanes lane behaves as full-duplex channel. Lane Data Packets Data Device A Device B Clock Packets Clock The delivery Architecture bandwidth from and to a device is increased by using multiple lanes in parallel. The architecture permits the parallelization of 1, 2, 4, 8, 12, 16 and 32 lanes. This parallelization only involves the hardware layer and is application transparent. lanes are clocked at 2.5 GHz, providing theoretical communication channels at 200 MByte/s in each direction. This bandwidth is close to twice the base Conventional CPU data rate. In the draft 2.0 specification, the clock will be increased to 5 GHz -and expectedly more in a future specification- increasing significantly the available bandwidth while keeping backward compatibility with lower speed components. Graphics Root Complex Memory Switch Switch Slots Bridge Conventional Slots Layer Stack Config / OS Model 4

Lanes Architecture Lane Data Packets Data In based computers, the CPU and the memory exchange data through a new device, called the root complex. This root complex exposes multiple lanes. Device A Device B Usually, sixteen lanes are dedicated to the graphical interface. The remaining lanes are connected to devices, or end Clock points, and to connectors. According Packets to the required data rate of the end Clock points and connectors, one or more lanes will be wired in parallel. The last component introduced in architecture is the switch that easily adds connection capabilities to a system, as shown on the following figure. Implementing to bridges offers compatibility with Conventional devices. Architecture CPU Graphics Root Complex Memory Switch Switch Slots Bridge Conventional Slots Layer Stack Config / OS Model Software Software / Driver Model Transaction Packet-based Protocol Data Link Physical Data Integrity 5 Point-to-point serial inter-op form-factors Future speeds and encodin onl impact ph sical la er

Layer Stack Bridge The structure is specified in layers. Compatibility with the addressing model is maintained to ensure that Conventional all existing applications and drivers operate unchanged. Slots configuration uses standard mechanisms as defined in the specification. The software layers generate read and write requests that are transported by the transaction layer to the I/O devices using a packet-based, splittransaction protocol. The data link layer adds sequence numbers and CRC error correction to these packets to insure the data integrity. The basic physical layer implements a transmit pair and a receive pair. Layer Stack Config / OS Model Software Software / Driver Model Transaction Packet-based Protocol Data Link Data Integrity Physical Point-to-point serial inter-op form-factors Future speeds and encodin onl impact ph sical la er software layers are backward compatible with Conventional. Applications moving from Conventional to architecture do not require software upgrading. Future operating systems will take advantage of new capabilities such as increased memory space management, virtual channels, and packet prioritization. Cost In many applications, the simplified connection scheme between components will reduce the overall cost of systems compared to Conventional. Mechanical The form factor complies with Conventional. Boards, equivalent in size and using the same brackets, integrate straight in current PC enclosures. Only the connectors are modified and often reduced. Four edge connectors are specified for add-on boards; the x1 or 1-lane smaller connector, the x4, the x8 and the x16 connectors. The identical four connector sizes are specified for motherboards slots. Power 1 x 4 x 8 x 16 x For simple integration, the connector provides a +12 V and a +3.3 V voltage to power the add-on boards. The effective power consumption of a board should be marked on the board, according to a standard addendum currently in review. 6

Conventional Comparison implements serial point-to-point communication channels where Conventional uses parallel multipoint busses. This fundamental move in the physical layer leads to a renewed architecture of computers. For reminder, in Conventional based computers, the CPU, the memory and the graphical interface exchange data through the host bridge. One or several busses connect the host bridge with additional lower data rate devices, including expansion connectors. Implementing to bridges adds connection capabilities to a system. Conventional Comparison CPU AGP Graphics Host Bridge Memory ss Bridge Bridge Device Device Device Slots Number of Lanes 1 Lane 4 Lanes... Byte 5 Byte 4... Byte 5 Byte 4 7 Byte 4 Byte 5 Byte 6 Byte 7

Performance Number of Device Lanes The number of lanes is, for now, the main parameter influencing the performance of a interface. Parallelizing lanes increases the available data rate accordingly. The physical interface automatically splits the data across the available lanes, as shown in the following figure. No firmware is required to split the data packets. Number of Lanes Bridge Device Bridge Device Slots 1 Lane... Byte 5 Byte 4 4 Lanes... Byte 5 Byte 4 Byte 4 Byte 5 Byte 6 Byte 7 Physical Physical Physical Physical Physical Conceptual Splittin over 4 Lanes The theoretical usable data rate of one lane reaches about 200 MByte/s in each direction (for a total of 2 x 200 MBytes/s). Theoretical Data Rate at 2.5 GHz Number of Lanes MByte/s Bus Data Rate GByte/s 1 2 x 200 0.4 2 2 x 400 0.8 4 2 x 800 1.6 8 2 x 1600 3.2 12 2 x 2400 4.8 16 2 x 3200 6.4 32 2 x 6400 12.8 8

Euresys development team reports the available data rate as slightly higher than 180 MByte/s in both directions. These figures have been measured on marketed standard PCs. As of today, in typical conditions, a data rate of 2 x 180 MByte/s is a realistic performance for a 1-lane board. An equivalent performance of 2 x 720 MByte/s is expected for a 4-lane board. Available Data Rate at 2.5 GHz Number of Lanes MByte/s Bus Data Rate GByte/s 1 2 x 180 0.36 4 2 x 720 1.44 Clock Speed The revision 1.1 specification, released in March 2005, specifies a 2.5 GHz clock. Expectedly future revision will specify higher clock rates. At that time, the interface performance will directly depends on the clock speed. Interoperability General Rule A board may be plugged in any connector with as many lanes or more. As a result, a 1-lane board is universal as it is compatible with all connectors. A board cannot be plugged in a connector with fewer lanes. As a safeguard, the connector mechanical design prevents any forbidden configuration. The Card Electromechanical Specification revision 1.1, released in March 2005, specifies the x1, x4, x8 and x16 connectors. The following figure shows examples of configurations. 4 Lanes board 1 Lane board x1 slot 1 Lane board OK x4 slot OK x1 slot 9

Remark Please note that according to the specification, boards and connectors are only required to work in the basic x1 and in their nominal configurations. They are not required to work in intermediate configurations. As shown in the following table, the effective number of lanes used for data transfer is unknown in advance in three combinations of board and slot sizes, and must be negotiated by both devices. In these cases of up plugging plugging a smaller board in a larger connector the guaranteed effective number of lanes is one, as per specification. A successful negotiation will increase the effective number of lanes up to the intermediate state. Effective Number of Lanes used for Data Transfer Motherboard Slot Size x1 x4 x8 x16 x1 1 1 1 1 Add-on Board Size x4-4 Negotiated (1 or 4) x8 - - 8 Negotiated (1 or 4) Negotiated (1, 4 or 8) x16 - - - 16 For example, a 4-lane board is inserted in an 8-lane connector. If the slot is only compatible with x1 and x8 configurations, the communication channel will establish on a single lane, with a data rate four times lower than expected. If the slot is 4-lane compatible, the communication channel will use these four lanes up to the expected data rate of 2 x 800 MByte/s. An addendum to the standard is in review and will specify a marking of both boards and connectors in order to show to which configurations a device is compliant with and help the user to determine the available data rate of its system. References and The bus specification is handled by the -SIG ( Special Interest Group), which is an association of numerous industrial actors in the field. Base Specification Revision 1.1 March 28, 2005 Card Electromechanical Specification Revision 1.1 March 28, 2005 The web site address of -SIG is www.pcisig.com Euresys Euresys is a member of the -SIG (vendor ID h1805). America, Euresys Inc. 500 Park Boulevard, suite 525, Itasca, Illinois 60143 Toll free: 1-866-EURESYS - Phone: 630-250-2300 - Fax: 630-250-2301 Asia, Euresys Pte. Ltd. 627A Aljunied Road, #08-09 BizTech Centre, Singapore 389842 Phone: +65 6748 0085 - Fax: +65 6841 2137 Japan, Euresys s.a. Japan Representative Office AIOS Hiroo Building 8F, Hiroo 1-11-2, Shibuya-ku, Tokyo 150-0012 Phone: +81 3 5447-1256 - Fax: +81 3 5447-0529 Europe, Euresys s.a. Corporate Headquarters 14, Avenue du Pré-Aily, B-4031 Angleur, Belgium Phone: +32 4 367 72 88 - Fax: +32 4 367 74 66 www.euresys.com Your distributor info@euresys.com, Technolo echnolo Note, ote, December ecem er 2006 TN_Exp_Dec06_Cor0