Interconnection Networks. Interconnection Networks. Interconnection networks are used everywhere!



Similar documents
Interconnection Network Design

Interconnection Networks Programmierung Paralleler und Verteilter Systeme (PPV)

System Interconnect Architectures. Goals and Analysis. Network Properties and Routing. Terminology - 2. Terminology - 1

Interconnection Network

Topological Properties

Lecture 18: Interconnection Networks. CMU : Parallel Computer Architecture and Programming (Spring 2012)

Interconnection Networks

Lecture 23: Interconnection Networks. Topics: communication latency, centralized and decentralized switches (Appendix E)

Scalability and Classifications

Chapter 2. Multiprocessors Interconnection Networks

Components: Interconnect Page 1 of 18


Why the Network Matters

Hyper Node Torus: A New Interconnection Network for High Speed Packet Processors

Lecture 2 Parallel Programming Platforms

COMP 422, Lecture 3: Physical Organization & Communication Costs in Parallel Machines (Sections 2.4 & 2.5 of textbook)

Interconnection Networks

Scaling 10Gb/s Clustering at Wire-Speed

Computer Network. Interconnected collection of autonomous computers that are able to exchange information

Introduction to Parallel Computing. George Karypis Parallel Programming Platforms

Behavior Analysis of Multilayer Multistage Interconnection Network With Extra Stages

Parallel Programming

Chapter 12: Multiprocessor Architectures. Lesson 04: Interconnect Networks

ECE 358: Computer Networks. Solutions to Homework #4. Chapter 4 - The Network Layer

Introduction to Exploration and Optimization of Multiprocessor Embedded Architectures based on Networks On-Chip

Interconnection Networks

Overview of Computer Networks

Chapter 16: Distributed Operating Systems

Architectural Level Power Consumption of Network on Chip. Presenter: YUAN Zheng

Chapter 14: Distributed Operating Systems

Communication Networks. MAP-TELE 2011/12 José Ruela

Module 15: Network Structures

Asynchronous Bypass Channels

CCNA R&S: Introduction to Networks. Chapter 5: Ethernet

Computer Systems Structure Input/Output

On-Chip Interconnection Networks Low-Power Interconnect

Module 5. Broadcast Communication Networks. Version 2 CSE IIT, Kharagpur

Protocols and Architecture. Protocol Architecture.

Three Key Design Considerations of IP Video Surveillance Systems

Operating System Concepts. Operating System 資 訊 工 程 學 系 袁 賢 銘 老 師

Switched Interconnect for System-on-a-Chip Designs

Datagram-based network layer: forwarding; routing. Additional function of VCbased network layer: call setup.

Router Architectures

Exam 1 Review Questions

EE4367 Telecom. Switching & Transmission. Prof. Murat Torlak

Architecture of distributed network processors: specifics of application in information security systems

WAN Data Link Protocols

White Paper Abstract Disclaimer

Chapter 1 Reading Organizer

PLANEAMENTO E GESTÃO DE REDES INFORMÁTICAS COMPUTER NETWORKS PLANNING AND MANAGEMENT

CHAPTER 5 FINITE STATE MACHINE FOR LOOKUP ENGINE

PCI Express Overview. And, by the way, they need to do it in less time.

Data Communication Networks and Converged Networks

The proliferation of the raw processing

CS 78 Computer Networks. Internet Protocol (IP) our focus. The Network Layer. Interplay between routing and forwarding

SAN Conceptual and Design Basics

2. What is the maximum value of each octet in an IP address? A. 128 B. 255 C. 256 D. None of the above

Optimizing Enterprise Network Bandwidth For Security Applications. Improving Performance Using Antaira s Management Features

Design and Implementation of an On-Chip timing based Permutation Network for Multiprocessor system on Chip

Xiaoqiao Meng, Vasileios Pappas, Li Zhang IBM T.J. Watson Research Center Presented by: Payman Khani

Chapter 9A. Network Definition. The Uses of a Network. Network Basics

QoS Switching. Two Related Areas to Cover (1) Switched IP Forwarding (2) 802.1Q (Virtual LANs) and 802.1p (GARP/Priorities)

Unit of Learning # 2 The Physical Layer. Sergio Guíñez Molinos sguinez@utalca.cl

Intel Ethernet Switch Load Balancing System Design Using Advanced Features in Intel Ethernet Switch Family

VXLAN: Scaling Data Center Capacity. White Paper

From Hypercubes to Dragonflies a short history of interconnect

Computer Networks. Definition of LAN. Connection of Network. Key Points of LAN. Lecture 06 Connecting Networks

Objectives. The Role of Redundancy in a Switched Network. Layer 2 Loops. Broadcast Storms. More problems with Layer 2 loops

Infrastructure Components: Hub & Repeater. Network Infrastructure. Switch: Realization. Infrastructure Components: Switch

PART OF THE PICTURE: The TCP/IP Communications Architecture

CHAPTER 3 STATIC ROUTING

InfiniBand Clustering

ESSENTIALS. Understanding Ethernet Switches and Routers. April 2011 VOLUME 3 ISSUE 1 A TECHNICAL SUPPLEMENT TO CONTROL NETWORK

Distributed Computing over Communication Networks: Topology. (with an excursion to P2P)

LAN Switching and VLANs

Introducing Basic MPLS Concepts

Region 10 Videoconference Network (R10VN)

LAN Switching Computer Networking. Switched Network Advantages. Hubs (more) Hubs. Bridges/Switches, , PPP. Interconnecting LANs

2. What is the maximum value of each octet in an IP address? A. 28 B. 255 C. 256 D. None of the above

Computer Networks: LANs, WANs The Internet

Network Design. Yiannos Mylonas

Definition. A Historical Example

Computer Organization & Architecture Lecture #19

Local Area Networks transmission system private speedy and secure kilometres shared transmission medium hardware & software

WHITEPAPER. VPLS for Any-to-Any Ethernet Connectivity: When Simplicity & Control Matter

IP Networking. Overview. Networks Impact Daily Life. IP Networking - Part 1. How Networks Impact Daily Life. How Networks Impact Daily Life

524 Computer Networks

Data Center Network Topologies: FatTree

Master Course Computer Networks IN2097

IP - The Internet Protocol

Internet Working 5 th lecture. Chair of Communication Systems Department of Applied Sciences University of Freiburg 2004

Scalable Prefix Matching for Internet Packet Forwarding

Communications and Computer Networks

DESIGN AND VERIFICATION OF LSR OF THE MPLS NETWORK USING VHDL

CSE 3461 / 5461: Computer Networking & Internet Technologies

John Ragan Director of Product Management. Billy Wise Communications Specialist

APPENDIX B. Routers route based on the network number. The router that delivers the data packet to the correct destination host uses the host ID.

Annotation to the assignments and the solution sheet. Note the following points

Data Networking and Architecture. Delegates should have some basic knowledge of Internet Protocol and Data Networking principles.

Transcription:

Interconnection Networks Interconnection Networks Interconnection networks are used everywhere! Supercomputers connecting the processors Routers connecting the ports can consider a router as a parallel machine with the ports/linecards as the processors! Clusters of machines Internet (loosely coupled network) Interconnection Networks Pins of a processor form --processor bus Bus connecting the processor and memory modules is called the local bus, or memory bus System bus (also called I/O bus) provides slots to connect I/O devices such as disk drives, tapes, network interface cards (e.g., PCI) connected to the local bus through an I/O bridge circuit -- this is often built on a motherboard, a backplane 1

Interconnection Networks Crossbar Crossbar is the simplest and most flexible switch architecture It can be used to establish n connections between n inputs and n outputs Each crosspoint can be switched on or off under program control Number n is often called the degree of the switch Network Components Static Vs Dynamic Networks Based on connectivity and control networks can be divided into two classes Static networks that don t change dynamically (e.g., trees, rings, meshes (not crossbars)) Dynamic networks that change interconnectivity dynamically 2

Dynamic Networks Implemented with switched channels dynamically configured to meet the communication needs of user programs E.g: system buses, crossbar switches, multistage networks Network Characteristics Shared-media networks can be converted to switched design at increased cost Switched networks are often designed with two options: circuit switching and packet switching Circuit switched networks -- the entire path from the source to the destination is reserved for the entire period of transmission Network Characteristics Cell switched networks -- long message (variable length packets) broken into a sequence of small packets (fixed length) A cell contains the routing information and segment of data payload Cells from the same message can be routed separately on different paths Network Performance Metrics Communication latency software overhead: overhead associated with sending and receiving messages at end stations channel delay: caused by the channel occupancy routing delay: time spent in the successive switches in making a sequence of routing decisions along the routing path contention delay: caused by traffic contentions in the network 3

Network Performance Metrics Per-port bandwidth: maximum number of bits that can be transmitted per second from any port to any other port For symmetric network, per-port bandwidth is independent of port location For asymmetric network, depends on port location Aggregate bandwidth: defined as the maximum number of bits that can be transmitted from one half of the nodes to another half of the nodes per second Network Performance Metrics For example, for 512-port HPS with 40MB/s per-port bandwidth, the aggregate bandwidth = (40x512)/2 = 10.24GB/s Network Topologies and Properties Node degree: number of edges incident on a node is called the node degree Network diameter: diameter D of a network is the maximum path length between any two nodes -- the path length is measured in terms of links traversed Routing Schemes Two ways of routing packets in switched networks: Store-and-forward Cut-through or wormhole routing Store-and-Forward Entire packet is stored in a node before it is forwarded to an outgoing link Successive packets are transmitted sequentially without overlapping in time 4

Routing Schemes Cut-through Routing Each node uses a flit-buffer to hold a flit (one cell) A flit is automatically forwarded to an outgoing link, once the header is decoded All data flits in the same packet follow the same path that the header traverses Routing Schemes Using the cut-through it takes only 7 time units for node 4 to receive the entire message The same message took 16 time units in the store-and-forward scheme Meshes and Rings: Simplest connection topology is the onedimensional mesh, or linear array In a linear array, the interior nodes have two connections and boundary nodes have one If we connect the two boundaries, we get a ring with all nodes of degree 2 A higher dimensional mesh is constructed similarly with k dimensions, interior nodes have degree 2k 5

Common mesh topology is the 2-D mesh Some 2-D meshes have wrap-around connections along the edges Routing: Assume interior nodes routing performed over one dimension at a time On a 3-D mesh, minimal path from a node (a, b, c) to (x, y, z) is constructed by moving along 1st dimension to (x, b, c) then to (x, y, c) and finally to (x, y, z) This is known as the XY-routing Trees: Common tree topology is the binary-tree Binary trees are well matched for VLSI and other planar layouts Routing in trees Idea: travel up the tree from A until you reach an ancestor of B and then travel down To implement number the root as 1 and left and right children as of x as 2x and 2x+1, respectively If the root is at level 1 Then the nodes at level i have a label that is i bits long and the left and right children of a node have 0 or 1 appended to their parent s number, respectively Lowest common ancestor of A (source) and B (destination) is the node numbered P, the longest common prefix of A and B From this it is easy to see how many levels we should go up to reach B from A What is this tree called? 6

Hybercubes: Multidimensional mesh of processors with exactly two processors in each dimension D-dimensional hypercube has p = 2 D Recursively constructed as follows: a single processor is 0-dim hypercube 1-dim hypercube is constructed by connecting two 0- dim hypercubes (d+1)-dim hypercube is constructed by connecting corresponding processors of two d-dim hypercubes Properties of hypercube network two processors are connected by a direct link if and only if the binary representation of their labels differ at exactly one bit position in a d-dimensional hypercube, each processor is directly connected to d other processors a d-dimensional hypercube can be partitioned into two (d-1)-dimensional sub-hypercubes total number of bit positions at which these two labels differ is called the hamming distance Routing: E-Cube routing Let s and d be the labels of the source and destination nodes respectively Minimum distance between the processors is given by x = (s XOR d) Processor s sends the message along dim k, where k is the position of the least significant non-zero bit in (s XOR d) Processor i computes (i XOR d) and forwards the message along the dimension corresponding to the least significant nonzero bit 7

Dynamic Network Topologies Examples: Buses, Crossbars, and Multistage interconnection networks Supercomputers, high performance IP routers, ATM switches use these networks (e.g., IBM DeepBlue) Several aliases - omega, flip, butterfly, baseline, delta, generalized cube, multistage shuffle-exchange For a NxN network, we have m = log 2 N stages Each stage has N/2 two-input/two-output interchange boxes The connection pattern among the boxes is different for the different multistage interconnection networks (MINs) P P cube i (P) cube i (P) 8

cube i (P) is the cube interconnection function Let P = p m-1...p i...p 1 p 0 cube i (P) = p m-1...p i...p 1 p 0 Each box can be controlled by routing tags -- in one of the following four states Routing in multistage cube networks: Circuit-switching all switches in a stage set the same way Packet switching each packet has routing tag in its header and the routing can be performed in a distributed fashion Less overhead for circuit switching almost like a direct wire connection Unicast routing: routing between a sender and a receiver XOR routing tags Let source be S and destination be D tag T = S XOR D If circuit switching is used, stage i is set straight if bit i of T is 0 otherwise stage i is set exchange If packet switching is used, each box is set independently by the header (tag is sent in the header) of the packet 9

Destination Routing Tag Let S be the source and D be the destination Tag = D This is used in distributed fashion -- each network input device determines its own action Tag is sent in the header for the message Stage i box examines d i d i = 0 use upper box output d i = 1 use lower box output Trade-offs between the two routing schemes for MINs XOR tag can be used for return message and source information T = S XOR D = D XOR S; S = D XOR T destination tag can be used to check the correct destination Broadcast routing: One port to 2 j ports -- note this is a restriction -- this is not a multicast where you can have an arbitrary set of receivers The receivers can have at most j bits different between any pair of destination addresses port S -> ports { D 1, D 2,... j D2 } unicast routing tag R = S XOR D 1 broadcast tag B = D i XOR D k (must differ in at least j positions) 10

Stage i looks at the i-th bit of routing tag R (r i ) and broadcast tag B (b i ) If b i = 0, use r i 1 exchange, 0 straight If b i = 1, broadcast (ignore r i ) Example Source S = 2 = 010 Destinations = {100, 101, 110, 111} Vary at most 2 bits R = S XOR 100 = 110 B = 100 XOR 111 = 011 Multistage Cube Partitioning these networks can form independent or communicating subnetworks each subnetwork has properties of multistage cube each partition size is power of two partition sizes can vary routing tags can still be used 11

Omega Network Summary of Dynamic Topologies Summary of Static Topologies 12