Chapter 2. Multiprocessors Interconnection Networks



Similar documents
Interconnection Networks

System Interconnect Architectures. Goals and Analysis. Network Properties and Routing. Terminology - 2. Terminology - 1

Chapter 12: Multiprocessor Architectures. Lesson 04: Interconnect Networks

Interconnection Network

Interconnection Networks. Interconnection Networks. Interconnection networks are used everywhere!

Lecture 2 Parallel Programming Platforms

Topological Properties

Interconnection Networks Programmierung Paralleler und Verteilter Systeme (PPV)


Interconnection Networks

Components: Interconnect Page 1 of 18

Behavior Analysis of Multilayer Multistage Interconnection Network With Extra Stages

Interconnection Network Design

Introduction to Parallel Computing. George Karypis Parallel Programming Platforms

Lecture 23: Interconnection Networks. Topics: communication latency, centralized and decentralized switches (Appendix E)

Parallel Programming

Lecture 18: Interconnection Networks. CMU : Parallel Computer Architecture and Programming (Spring 2012)

Design and Implementation of an On-Chip timing based Permutation Network for Multiprocessor system on Chip

COMP 422, Lecture 3: Physical Organization & Communication Costs in Parallel Machines (Sections 2.4 & 2.5 of textbook)

Hyper Node Torus: A New Interconnection Network for High Speed Packet Processors

Architectural Level Power Consumption of Network on Chip. Presenter: YUAN Zheng

Interconnection Networks

Table of Contents. Cisco How Does Load Balancing Work?

Improved Irregular Augmented Shuffle Multistage Interconnection Network

Scalability and Classifications

Principles and characteristics of distributed systems and environments

Communication Networks. MAP-TELE 2011/12 José Ruela

Interconnection Network of OTA-based FPAA

Introduction to Exploration and Optimization of Multiprocessor Embedded Architectures based on Networks On-Chip

Why the Network Matters

SAN Conceptual and Design Basics

Input / Ouput devices. I/O Chapter 8. Goals & Constraints. Measures of Performance. Anatomy of a Disk Drive. Introduction - 8.1

Lecture 6 Types of Computer Networks and their Topologies Three important groups of computer networks: LAN, MAN, WAN

CPS104 Computer Organization and Programming Lecture 18: Input-Output. Robert Wagner

Computer Systems Structure Input/Output

Chapter Introduction. Storage and Other I/O Topics. p. 570( 頁 585) Fig I/O devices can be characterized by. I/O bus connections

Modeling of Topologies of Interconnection Networks based on Multidimensional Multiplicity

MULTISTAGE INTERCONNECTION NETWORKS: A TRANSITION TO OPTICAL

How To Understand The Concept Of A Distributed System

Asynchronous Bypass Channels

Distributed Systems LEEC (2005/06 2º Sem.)

Performance of networks containing both MaxNet and SumNet links

Analysis of Various Crosstalk Avoidance Techniques in Optical Multistage Interconnection Network

Scaling 10Gb/s Clustering at Wire-Speed

Lizy Kurian John Electrical and Computer Engineering Department, The University of Texas as Austin

UNIT 2 CLASSIFICATION OF PARALLEL COMPUTERS

OpenSPARC T1 Processor

COMPUTER HARDWARE. Input- Output and Communication Memory Systems

Switched Interconnect for System-on-a-Chip Designs

Chapter 2 Parallel Architecture, Software And Performance

Chapter 15: Distributed Structures. Topology

Non-blocking Switching in the Cloud Computing Era

Annotation to the assignments and the solution sheet. Note the following points

ECE 358: Computer Networks. Solutions to Homework #4. Chapter 4 - The Network Layer

AMD Opteron Quad-Core

Chapter 12: Multiprocessor Architectures. Lesson 09: Cache Coherence Problem and Cache synchronization solutions Part 1

LOAD-BALANCED ROUTING IN INTERCONNECTION NETWORKS

Computer Network. Interconnected collection of autonomous computers that are able to exchange information

Switching. An Engineering Approach to Computer Networking

Hardware Implementation of Improved Adaptive NoC Router with Flit Flow History based Load Balancing Selection Strategy

Computer System Design. System-on-Chip

Load balancing Static Load Balancing

SoC IP Interfaces and Infrastructure A Hybrid Approach

Course 12 Synchronous transmission multiplexing systems used in digital telephone networks

Outline. Introduction. Multiprocessor Systems on Chip. A MPSoC Example: Nexperia DVP. A New Paradigm: Network on Chip

Read this before starting!

University of Castilla-La Mancha

Communicating with devices

Optimizing Configuration and Application Mapping for MPSoC Architectures

What is a bus? A Bus is: Advantages of Buses. Disadvantage of Buses. Master versus Slave. The General Organization of a Bus

Performance Evaluation of 2D-Mesh, Ring, and Crossbar Interconnects for Chip Multi- Processors. NoCArc 09

Scalable Source Routing

CMSC 611: Advanced Computer Architecture

Synchronization. Todd C. Mowry CS 740 November 24, Topics. Locks Barriers

SUPPORT FOR HIGH-PRIORITY TRAFFIC IN VLSI COMMUNICATION SWITCHES

Remote Copy Technology of ETERNUS6000 and ETERNUS3000 Disk Arrays

Some Computer Organizations and Their Effectiveness. Michael J Flynn. IEEE Transactions on Computers. Vol. c-21, No.

2. Research and Development on the Autonomic Operation. Control Infrastructure Technologies in the Cloud Computing Environment

Switch Fabric Implementation Using Shared Memory

Tolerating Multiple Faults in Multistage Interconnection Networks with Minimal Extra Stages

SOC architecture and design

Module 5. Broadcast Communication Networks. Version 2 CSE IIT, Kharagpur

Chapter 16: Distributed Operating Systems

GEDAE TM - A Graphical Programming and Autocode Generation Tool for Signal Processor Applications

Power Reduction Techniques in the SoC Clock Network. Clock Power

Boolean Network Models

enabling Ultra-High Bandwidth Scalable SSDs with HLnand

AND8336. Design Examples of On Board Dual Supply Voltage Logic Translators. Prepared by: Jim Lepkowski ON Semiconductor.

Dependable Systems. 9. Redundant arrays of. Prof. Dr. Miroslaw Malek. Wintersemester 2004/05

Module 15: Network Structures

Where to Provide Support for Efficient Multicasting in Irregular Networks: Network Interface or Switch?

Load Balancing and Termination Detection

White Paper Abstract Disclaimer

Parallel Architectures and Interconnection

2) What is the structure of an organization? Explain how IT support at different organizational levels.

Network Structure or Topology

Transcription:

Chapter 2 Multiprocessors Interconnection Networks

2.1 Taxonomy Interconnection Network Static Dynamic 1-D 2-D HC Bus-based Switch-based Single Multiple SS MS Crossbar

2.2 Bus-Based Dynamic Single Bus Systems Simplest way to connect multiprocessor systems. The use of local caches reduces the processormemory traffic. Size of such system varies between 2 and 50 processors. Single bus multiprocessors are inherently limited by: Bandwidth of bus. 1 processor can access the bus. 1 memory access can take place at any given time.

2.2 Bus-Based Dynamic Single Bus Systems p p p 1 2 N 1 p N Shared Memory I/O

2.2 Bus-Based Dynamic Multiple Bus Systems Several parallel buses to interconnect multiple processors and multiple memory modules. Many connection schemes are possible. Examples: Multiple Bus with Full Bus Memory Connection (MBFBMC). Multiple Bus with Single Bus Memory Connection (MBSBMC). Multiple Bus with Partial Bus Memory Connection (MBPBMC). Multiple Bus with Class-based Bus Memory Connection (MBCBMC).

2.2 Bus-Based Dynamic Multiple Bus Systems: Multiple Bus with Full Bus Memory Connection (MBFBMC). P 1 P 2 P 3 P 4 P 5 P 6 M 1 M 2 M 3 M 4

2.2 Bus-Based Dynamic Multiple Bus Systems: Multiple Bus with Single Bus Memory Connection (MBSBMC). P 1 P 2 P 3 P 4 P 5 P 6 M 2 M 3 M 1 M 4

2.2 Bus-Based Dynamic Multiple Bus Systems: Multiple Bus with Partial Bus Memory Connection (MBPBMC). P 1 P 2 P 3 P 4 P 5 P 6 M 1 M 2 M 3 M 4

2.2 Bus-Based Dynamic Multiple Bus Systems: Multiple Bus with Class-based Memory Connection (MBCBMC). P 1 P 2 P 3 P 4 P 5 P 6 M 1 M 2 M 3 M 4 M 5 M 6 Class 1 Class 2 Class 3

2.2 Bus-Based Dynamic Bus Synchronization A bus can be synchronous: Time for any transaction is known in advance. A bus can be asynchronous: Depends on the availability of data and readiness of devices to initiate bus transactions. Bus arbitration logic is required to resolve bus contention when more than 1 processor compete to access the bus in single bus multiprocessor. Process of passing mastership from 1 processor to another is called handshaking Requires a bus request and a bus grant.

2.2 Bus-Based Dynamic Bus Synchronization Bus arbitration logic uses a a predefined priority scheme: Random Simple rotating Equal priority Least Recently Used (LRU)

2.3 Switch-Based Interconnection Crossbar Networks Networks Provide simultaneous connections among all its inputs and all its outputs. A Switching Element (SE) is at the intersection of any 2 lines extended horizontally or vertically inside the switch. It is a non-blocking network allowing multiple inputoutput connection pattern to be achieved simultaneously.

2.3 Switch-Based Interconnection Networks Crossbar Networks P1 P2 P3 P4 P5 P6 P7 P8 M1 M2 M3 M4 M5 M6 M7 M8 Straight Switch Setting Diagonal Switch Setting

2.3 Switch-Based Single-Stage Networks A single stage of SE exists between the inputs and outputs of the network. Possible settings of a 2x2 SE are: Straight Exchange Upper-broadcast Lower-broadcast

2.3 Switch-Based Multistage (MINs) A MIN consists of a number of stages each consisting of a set of 2x2 SEs. Stages are connected to each other using Inter-Stage Connection (ISC) pattern. In MINs the routing of a message from a given source to a given destination is based on the destination address (self-routing).

2.3 Switch-Based Multistage Networks (MINs) ISC 1 ISC x-1 Switches (Stage 1) Switches (Stage 2) Switches (Stage x)

2.3 Switch-Based Blockage in Multistage Interconnection Networks Blocking networks: when an interconnection between a pair of input/output is currently established, the arrival of a request for a new interconnection between 2 arbitrary unused input and output may or may not be possible.

2.3 Switch-Based Blockage in Multistage Interconnection Networks Rearrangeable networks: Always possible to rearrange already established connections in order to make allowance for other connections to be established simultaneously

2.3 Switch-Based Blockage in Multistage Interconnection Networks Rearrangeable networks 000 001 010 011 100 101 110 111 000 001 010 011 100 101 110 111 000 001 010 011 100 101 110 111 000 001 010 011 100 101 110 111

2.3 Switch-Based Blockage in Multistage Interconnection Networks Non-blocking networks: In presence of a currently established connection between any pair of input/output, it is always possible to establish a connection between any arbitrary unused pair of input/output.

2.3 Switch-Based Blockage in Multistage Interconnection Networks Non-blocking networks:

2.3 Switch-Based Interconnection Networks 000 001 010 011 100 101 110 111

2.4 Static Have fixed paths, unidirectional or bidirectional, between processors. Types: Completely connected networks: Number of links: O(N 2 ), delay complexity: O(1). 1 2 6 3 completely connected network. 5 4

2.4 Static Limited Connection Networks: Linear arrays Ring (Loop) networks Two-dimensional arrays Tree networks Cube network

2.4 Static Linear arrays Ring (Loop) networks Two-dimensional arrays Tree networks Cube network

2.4 Static Cube Connected Networks: Patterned after the n-cube structure In an n-cube, every processor is connected to n others Ex: a 4-cube: 1 1 1110 1111 0100 0110 0101 0111 1100 1101 S 1010 1010 1011 1011 0010 0010 0011 0011 1000 1001 0000 0001

2.4 Static Mesh Connected Networks: D D S S Example 3X3X2 mesh network

2.5 Analysis and Performance Dynamic Networks Metrics Networks Delay Cost Blockin g Degree of FT Bus O(N) O(1) Yes 0 Multiple-bus O(mN) O(m) Yes (m-1) MIN O(logN) O(NlogN) Yes 0 Crossbar O(1) O(N 2 ) No 0

2.5 Analysis and Performance Static Networks Metrics Networks Degree (d) Diameter (D) Cost (No. of links) Symmetry Worst Delay CCNs N-1 1 N(N-1)/2 Yes 1 Linear array 2 N -1 N -1 No N Binary tree 3 Log N -1) 2( 2 N -1 No Log 2 N n-cube Log 2 N Log 2 N nn/2 Yes Log 2 N 2D-mesh 4 2(n-1) 2(N-n) No N K-ary n-cube 2n N/ k/2 n x N Yes k x log 2 N

2.6 Summary Different topologies used for interconnecting multiprocessors were discussed. Taxonomy for interconnection networks based on their topology is introduced. Dynamic and static interconnection schemes have been studied. A number of basic performance aspects related to both dynamic and static interconnection networks have been introduced.