Network on Chip (NoC) : An Introduction

Similar documents
Design and Implementation of an On-Chip timing based Permutation Network for Multiprocessor system on Chip

A CDMA Based Scalable Hierarchical Architecture for Network- On-Chip

From Bus and Crossbar to Network-On-Chip. Arteris S.A.

Hyper Node Torus: A New Interconnection Network for High Speed Packet Processors

Hardware Implementation of Improved Adaptive NoC Router with Flit Flow History based Load Balancing Selection Strategy

Design and Implementation of an On-Chip Permutation Network for Multiprocessor System-On-Chip

Switched Interconnect for System-on-a-Chip Designs

3D On-chip Data Center Networks Using Circuit Switches and Packet Switches

Architectural Level Power Consumption of Network on Chip. Presenter: YUAN Zheng

Analysis of Error Recovery Schemes for Networks-on-Chips

AN EVENT-BASED NETWORK-ON-CHIP MONITORING SERVICE

Design of a Feasible On-Chip Interconnection Network for a Chip Multiprocessor (CMP)

A Low Latency Router Supporting Adaptivity for On-Chip Interconnects

Multistage Interconnection Network for MPSoC: Performances study and prototyping on FPGA

SUPPORT FOR HIGH-PRIORITY TRAFFIC IN VLSI COMMUNICATION SWITCHES

Using Fuzzy Logic Control to Provide Intelligent Traffic Management Service for High-Speed Networks ABSTRACT:

Design and Verification of Nine port Network Router

Distributed Elastic Switch Architecture for efficient Networks-on-FPGAs

Introduction to Exploration and Optimization of Multiprocessor Embedded Architectures based on Networks On-Chip

CONSTRAINT RANDOM VERIFICATION OF NETWORK ROUTER FOR SYSTEM ON CHIP APPLICATION

Lecture 18: Interconnection Networks. CMU : Parallel Computer Architecture and Programming (Spring 2012)

MPLS. Packet switching vs. circuit switching Virtual circuits

Asynchronous Bypass Channels

Three Key Design Considerations of IP Video Surveillance Systems

Performance Evaluation of AODV, OLSR Routing Protocol in VOIP Over Ad Hoc

DESIGN AND VERIFICATION OF LSR OF THE MPLS NETWORK USING VHDL

5 Performance Management for Web Services. Rolf Stadler School of Electrical Engineering KTH Royal Institute of Technology.

Leveraging Torus Topology with Deadlock Recovery for Cost-Efficient On-Chip Network

WAN Data Link Protocols

Interconnection Networks. Interconnection Networks. Interconnection networks are used everywhere!

Real-time Processor Interconnection Network for FPGA-based Multiprocessor System-on-Chip (MPSoC)

Mobile IP Network Layer Lesson 01 OSI (open systems interconnection) Seven Layer Model and Internet Protocol Layers

Optical interconnection networks with time slot routing

PART III. OPS-based wide area networks

Module 5. Broadcast Communication Networks. Version 2 CSE IIT, Kharagpur

Interconnection Networks

ARM Ltd 110 Fulbourn Road, Cambridge, CB1 9NJ, UK.

How To Understand The Concept Of Circuit Switching

Quality of Service (QoS) for Asynchronous On-Chip Networks

Scaling 10Gb/s Clustering at Wire-Speed

Question: 3 When using Application Intelligence, Server Time may be defined as.

Towards a Design Space Exploration Methodology for System-on-Chip

Performance Evaluation of 2D-Mesh, Ring, and Crossbar Interconnects for Chip Multi- Processors. NoCArc 09

4 Internet QoS Management

CCNA R&S: Introduction to Networks. Chapter 5: Ethernet

On-Chip Interconnection Networks Low-Power Interconnect

On-Chip Communications Network Report

Packetization and routing analysis of on-chip multiprocessor networks

Simulation and Evaluation for a Network on Chip Architecture Using Ns-2

Power Reduction Techniques in the SoC Clock Network. Clock Power

Asynchronous IC Interconnect Network Design and Implementation Using a Standard ASIC Flow

Optimizing Configuration and Application Mapping for MPSoC Architectures

Extending Networking to Fit the Cloud

Communication Networks. MAP-TELE 2011/12 José Ruela

Introduction to WAN Technologies

Efficient Interconnect Design with Novel Repeater Insertion for Low Power Applications

SoC IP Interfaces and Infrastructure A Hybrid Approach

Influence of Load Balancing on Quality of Real Time Data Transmission*

Extending Platform-Based Design to Network on Chip Systems

High-Performance IP Service Node with Layer 4 to 7 Packet Processing Features

ECE 358: Computer Networks. Solutions to Homework #4. Chapter 4 - The Network Layer

ISSCC 2003 / SESSION 4 / CLOCK RECOVERY AND BACKPLANE TRANSCEIVERS / PAPER 4.7

A Hybrid Electrical and Optical Networking Topology of Data Center for Big Data Network

Ethernet. Ethernet Frame Structure. Ethernet Frame Structure (more) Ethernet: uses CSMA/CD

Research on Video Traffic Control Technology Based on SDN. Ziyan Lin

The Economics of Cisco s nlight Multilayer Control Plane Architecture

FPGA Implementation of IP Packet Segmentation and Reassembly in Internet Router*

Network Layer: Network Layer and IP Protocol

Service Level Analysis of Video Conferencing over Wireless Local Area Network

Solving Network Challenges

Quality-of-service and error control techniques for mesh-based network-on-chip architectures

IP Office Technical Tip

A Generic Network Interface Architecture for a Networked Processor Array (NePA)

InfiniBand Clustering

On-Chip Communication Architectures

Improving the Performance of TCP Using Window Adjustment Procedure and Bandwidth Estimation

Introduction to System-on-Chip

Transcription:

Network on Chip (NoC) : An Introduction 1

Outline What is a NoC Motivation Basics Data Abstraction What it looks like Hardware Components Software Tools Interesting problems References 2

Introduction NoC : Is an on chip packet based communication system between blocks connected via routers Today application-specific systems on-chip (SoC) make extensive use of busses as the interconnect infrastructure However, in recent years research has shown that Network on-chip (NoC) is likely to replace buses in future SoCs [1-4] Fig. 1 : Bus based System Architecture 3

Motivation NoCs offer superior performance, power and area tradeoffs as the number of modules increases [1-4]. Bus-based architectures have trouble scaling with increasing number of IP blocks and decreasing geometries NoC is a far easier-to-scale interconnect, with shorter, unidirectional, point-to-point wires [5] For QC ASICs NoC reduces interconnect by 30 40% Fig. 2 : Generic 2-D NoC setup 4

NoC and Global Wire Delay Figure shows wire area cost for same performance Figures from [5] and [6] 5

Data Abstraction Data is transmitted or routed in the form of packets Just like regular networks, packets contain destination/source information etc. Functional simulation based trace data is often used for modeling during NoC generation flow Fig. 3 : Levels of Data Abstraction 6

NoC Hardware Common building blocks : Routers Resizers Clock-Synchronizer NIU FIFOs Arbiters Binners Fig. 4 : Closer look at a NoC 7

8 NoC Hardware : Routers Routers are the core of the NoC interconnect, typically lie on critical paths and area a major contributor to area A basic router is a MxN switch, with M-inputs and N- outputs Each input port is actually multiple signals (data, flow control, valid signals etc) Fig. 5 : A basic NoC Router

NoC Hardware : Binner Bins packets into appropriate FIFOs/Queues Binning can be done on the basis of some subset of header information Allows for some notion of priority Fig. 6 : A basic NoC Binner Block 9

NoC Software Tools Topology Generation : often called NoC Placement, optimize multiple metrics, latency, bandwidth, size of routers System Simulation : to figure out traffic patterns, delay, latency, power etc. Also needed to figure out if constraints are met Route Optimization : figuring out optimal routing directions for packets from sender to destination Allowing for deadlock free network, is a challenge Fig. 7 : A basic NoC Flow from Arteris [7] 10

Interesting Problems : Placement Inputs : Traffic information, Module sizes, Link and Router Latency information (optional), Physical Constraints (optional) Come up with NoC router configuration and related core connectivity information Optimize on the basis of : Bandwidth : Cores that talk a lot sit close to each other Size of routers, router logic grows as O(n^2) Annealing based approach [8] How do we include floor-plan or physical information? Fig. 8 : A Typical NoC Placement flow 11

Router Frequency and Width Assignment Assign frequencies and widths to routers in the NoC network, with the objective of minimizing the cost incurred due to : Clock-Domain-Crossings Width Conversion Finding globally optimal assignment has exponential growth in runtime (#clock-domains)^(#routers) Constraint : Size and Frequency should satisfy bandwidth R1 R2 Fig. 9 : Toy router frequency assignment problem 12

VLSI CAD Problems Reframed for NoC[9] Floor-planning : NoC Placement Wire Routing : Message Routing Buffer/Inverter Sizing : Buffer (FIFO) Sizing Timing Closure : Link Bandwidth Capacity Allocation Simulation : Networking Simulation and Traffic Modeling 13

References [1] P. Guerrier and A. Greiner, "A Generic Architecture for On-Chip Packet-Switched Interconnections", Proc. Design, Automation and Test in Europe (DATE) 2000, 250-256 [2] K. Goossens, J. Dielissen, and A. Radulescu, "A Ethereal Network on Chip: Concepts, Architectures, and Implementations", IEEE Design and Test of Computers, 2005, 414-421 [3] E. Bolotin, I. Cidon, R. Ginosar, and A. Kolodny, "QNoC: QoS Architecture and Design Process for Network on Chip", Journal of Systems Architecture, Vol. 50, February 2004, 105-128 [4] D. Bertozzi and L. Benini, "Xpipes: A Networkon- Chip Architecture for Gigascale Systems-on- Chip", Circuits and Systems Magazine, IEEE Volume 4, Issue 2, 2004, 18-31 [5] E. Bolotin, I. Cidon, R. Ginosar, and A. Kolodny, "Cost Considerations in Network on Chip", Integration - the VLSI Journal, Volume 38, pp.19-42, 2004 [6] Dally, W. J. and Towles, B. 2001. Route packets, not wires: on-chip inteconnection networks. In Proceedings of the 38th Annual Design Automation Conference (Las Vegas, Nevada, United States). DAC '01. ACM, New York, NY, 684-689. DOI= http://doi.acm.org/10.1145/378239.379048 [7] Arteris : Network on Chip Company. http://www.arteris.com/ [8] Beraha, R., Walter, I., Cidon, I., and Kolodny, A. 2009. The design of a latency constrained, power optimized NoC for a 4G SoC. In Proceedings of the 2009 3rd ACM/IEEE international Symposium on Networks-on-Chip (May 10-13, 2009). NOCS. IEEE Computer Society, Washington, DC, 86. DOI= http://dx.doi.org/10.1109/nocs.2009.5071449 [9] Technion NoC Tutorial : http://webee.technion.ac.il/matrics/talks/technion%20noc%20tutorial%20part%20four.pdf 14