Photonic Networks for Data Centres and High Performance Computing



Similar documents
inter-chip and intra-chip Harm Dorren and Oded Raz

Photonic components for signal routing in optical networks on chip

Volumes. Goal: Drive optical to high volumes and low costs

Technology Developments Towars Silicon Photonics Integration

On-Chip Interconnection Networks Low-Power Interconnect

Optical interconnection networks for data centers

The 50G Silicon Photonics Link

Silicon Photonic Interconnection Networks

Are Optical Networks Green? Rod Tucker University of Melbourne

Implementation of Short Reach (SR) and Very Short Reach (VSR) data links using POET DOES (Digital Opto- electronic Switch)

Sol: Optical range from λ 1 to λ 1 +Δλ contains bandwidth

Network Architecture and Topology

Intel Labs at ISSCC Copyright Intel Corporation 2012

Data Center Network Structure using Hybrid Optoelectronic Routers

Optical Networking for Data Centres Network

Using Pre-Emphasis and Equalization with Stratix GX

Silicon photonics can make it green(er): two case-studies of mass market communication applications

Interconnection Networks. Interconnection Networks. Interconnection networks are used everywhere!

Energy Limitations on Optical Data Transport and Switching. Rod Tucker University of Melbourne

Hyper Node Torus: A New Interconnection Network for High Speed Packet Processors

Recent developments in high bandwidth optical interconnects. Brian Corbett.

WDM-PON: A VIABLE ALTERNATIVE FOR NEXT GENERATION FTTP

Green Photonics in Switching : Rod Tucker Centre for Energy-Efficient Telecommunications University of Melbourne

How To Build A Cloud Computer

Architectural Level Power Consumption of Network on Chip. Presenter: YUAN Zheng

Delivering Scale Out Data Center Networking with Optics Why and How. Amin Vahdat Google/UC San Diego

First 40 Giga-bits per second Silicon Laser Modulator. Dr. Mario Paniccia Intel Fellow Director, Photonics Technology Lab

Solution: start more than one instruction in the same clock cycle CPI < 1 (or IPC > 1, Instructions per Cycle) Two approaches:

Cray Gemini Interconnect. Technical University of Munich Parallel Programming Class of SS14 Denys Sobchyshak

Performance Evaluation of 2D-Mesh, Ring, and Crossbar Interconnects for Chip Multi- Processors. NoCArc 09

An Introduction to High-Frequency Circuits and Signal Integrity

From Hypercubes to Dragonflies a short history of interconnect

Optimizing Configuration and Application Mapping for MPSoC Architectures

Control Plane architectures for Photonic Packet/Circuit Switching-based Large Scale Data Centres

Lecture 18: Interconnection Networks. CMU : Parallel Computer Architecture and Programming (Spring 2012)

Emerging storage and HPC technologies to accelerate big data analytics Jerome Gaysse JG Consulting

PART II. OPS-based metro area networks

DESIGN CHALLENGES OF TECHNOLOGY SCALING

Networking Virtualization Using FPGAs

Optical Networks for Next Generation Disaster Recovery Networking Solutions with WDM Systems Cloud Computing and Security

Performance Evaluation of Linux Bridge

Low Power AMD Athlon 64 and AMD Opteron Processors

Low Cost 100GbE Data Center Interconnect

Flexible SDN Transport Networks With Optical Circuit Switching

The Fraunhofer Heinrich Hertz Institute

Data center day. a silicon photonics update. Alexis Björlin. Vice President, General Manager Silicon Photonics Solutions Group August 27, 2015

10 Port L2 Managed Gigabit Ethernet Switch with 2 Open SFP Slots - Rack Mountable

Cloud-Based Apps Drive the Need for Frequency-Flexible Clock Generators in Converged Data Center Networks

Advancing Applications Performance With InfiniBand

Naveen Muralimanohar Rajeev Balasubramonian Norman P Jouppi

Intel Ethernet Switch Load Balancing System Design Using Advanced Features in Intel Ethernet Switch Family

Topics of Chapter 5 Sequential Machines. Memory elements. Memory element terminology. Clock terminology

High-Performance Automated Trading Network Architectures

Clocking. Figure by MIT OCW Spring /18/05 L06 Clocks 1

Use-it or Lose-it: Wearout and Lifetime in Future Chip-Multiprocessors

Towards Rack-scale Computing Challenges and Opportunities

Lecture 11: Multi-Core and GPU. Multithreading. Integration of multiple processor cores on a single chip.

PART III. OPS-based wide area networks

Intel DPDK Boosts Server Appliance Performance White Paper

Interconnection Networks

PEX 8748, PCI Express Gen 3 Switch, 48 Lanes, 12 Ports

Switching Solution Creating the foundation for the next-generation data center

CloudEngine Series Data Center Switches. Cloud Fabric Data Center Network Solution

Components: Interconnect Page 1 of 18

Quality of Service (QoS) for Asynchronous On-Chip Networks

10GBASE T for Broad 10_Gigabit Adoption in the Data Center

Design Guide for Photonic Architecture

White Paper Solarflare High-Performance Computing (HPC) Applications

CWDM: lower cost for more capacity in the short-haul

International Journal of Electronics and Computer Science Engineering 1482

Introduction, Rate and Latency

Prisma IP 10 GbE VOD Line Card

Latency Considerations for 10GBase-T PHYs

Operating Systems. Cloud Computing and Data Centers

MULTISTAGE INTERCONNECTION NETWORKS: A TRANSITION TO OPTICAL

The Integrated Research Center for Photonic Networks and Technologies (IRCPhoNeT) Piero Castoldi. Pisa, November 9, 2011

Multi-Gigabit Interfaces for Communications/Datacomm

Optical packet switching with distributed control for high performance data center networks

Why the Network Matters

Alpha CPU and Clock Design Evolution

All-optical fiber-optics networks

Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging

Making Multicore Work and Measuring its Benefits. Markus Levy, president EEMBC and Multicore Association

Photonic Switching Applications in Data Centers & Cloud Computing Networks

Enabling the Use of Data

ISSCC 2003 / SESSION 4 / CLOCK RECOVERY AND BACKPLANE TRANSCEIVERS / PAPER 4.7

Ethernet Link SGI-4844F

CSCA0102 IT & Business Applications. Foundation in Business Information Technology School of Engineering & Computing Sciences FTMS College Global

Router Architectures

McPAT: An Integrated Power, Area, and Timing Modeling Framework for Multicore and Manycore Architectures

Integrated Photonic. Electronic. Optics. Optoelettronics. Integrated Photonic - G. Breglio L1. Quantum Mechanics Materials Science Nano/Bio-photonic

Fiber Optics and Liquid Level Sensors Line Guide

Transcription:

Photonic Networks for Data Centres and High Performance Computing Philip Watts Department of Electronic Engineering, UCL Yury Audzevich, Nick Barrow-Williams, Robert Mullins, Simon Moore, Andrew Moore Computer Laboratory, University of Cambridge

Motivation Future digital services are reliant on power efficient computers Data centres, large servers, scientific computing Power management is now the primary issue 200 W/chip thermal limit reached Interconnect accounts for majority of power consumption Transistors scale, wires don t Can t continue to exploit Moore s Law without dealing with interconnect power issue Power of Computation vs. Communication Transfer 32b across chip Transfer 32b off-chip 130nm CMOS (2002) 20 computations 260 computations Polymer Waveguides 45nm CMOS (2008) 57 computations 1300 computations Increase +285% +500% Many developments are addressing this issue: 3D stacked memory Non-volatile unified memory Photonic interconnect MEMORY CACHE CORES PHOTONIC NETWORK 3D Computer Chip Optical PCB Optical Backplane

DEMUX Components of a Photonic C2C Interconnect Hybrid Si Laser (UCSB/Intel) Si/SiO 2 Waveguides 400 x 200 nm, Bend Radius few μm Germanium Photodiodes integrated with Silicon waveguides 3 x 1 μm (Intel) Chip #1 Chip #2 Tx Silicon ring resonators can modulate, filter and switch Tx MUX X Tx 16 x 16 integrated SOA switch (Cambridge/Eindhoven) Polymer waveguides in standard FR4 PCB (Cambridge)

Characteristics of Photonic Networks Photonics offers very high bandwidth (WDM) over long distances with lower power Photonics favours circuit switching No viable optical buffer exists End-to-end paths required Computer architectures favours packet switching Message sizes in Shared Memory systems 64-bits Design implications of complete computer systems with photonic interconnect are not clear Traditional Electronic Networks 3D Mesh Fat Tree Edge Buffered Photonic Network Optical Network 4

On-chip Lasers or Off-chip Power Supply On chip laser approach Avoid losses in distribution and getting light onto chip Can be rapidly power gated ( ns) Photonic power supply (PPS) approach Only compact and low drive power components need to be on-chip Single optical power supply can be used for multiple devices We can t store photons any generated light is dissipated mainly onchip Unlikely that efficiency of optical power supply will be >20% Efficiency of electronic power supply can be >80% and is naturally dynamic PHOTONIC POWER SUPPLY On-chip laser approach LASER LASER LASER Photonic Power Supply approach MOD MOD MOD 5

Photonic Networks for Data Centres - Vision Scalability of the all-optical network is limited to around 64 ports, rack network Optical power and attenuation Latency and complexity of Arbitration/Contention resolution Expandable using O sections Optical Network Optical Network Optical Network

Methodology Multi- Wavelength Source Power Request (optional) Processor Chip Network Adapter Switch Chip Allocator Receiver Modelled the power consumption of the photonic path using the photonic power supply approach Created SystemVerilog models of the network control circuits and synthesized using a low leakage 45 nm CMOS process Evaluated the power consumption of circuit switched and time slotted networks covering a single rack How much power is disipated on-chip? What can be power gated?

Networks: Circuit Switching Electronic Network is required for residual flows Analysis of the traffic patterns generated running the PARSEC benchmark suite on simulated 32-core x86 system running Linux Observe the effect of changing the circuit decision time Circuit decisions are ideal (a priori knowledge of traffic)

Energy (J/bit) Energy per bit Circuit Switching Only 1.0 3.2 packets on average per circuit 10-9 10-10 10-11 10-12 Setting up circuits on μs timescales required 10-7 10-6 10-5 10-4 10-3 10-2 10-1 10 0 Circuit Decision Time (s) Ring resonator Clos switch assumed Current Injection - Min Current Injection - Max Capacitive - Min Capacitive - Max Electronic Packet Switch No significant advantage in static case

Time Slotted Networks Scheduled Speculative Time slots are 10 ns, sized for transmission of a single 32B cache line Speculative does not allow power gating of PPS Intra-Rack Network Latency

Total Energy, J/b Processor Chip Energy, J/b Time Slotted Network Time slotted networks can be > order of magnitude lower power than electronic mesh network 10-8 10-9 SOA Switch Ring Resonator Switch SOA switch, Dynamic OPS) Electronic total energy 10-8 10-9 SOA Switch Ring Resonator Switch SOA switch, Dynamic OPS) Electronic total energy 10-10 10-10 10-11 10-11 10-12 10 5 10 6 10 7 10 8 Packet Arrival Rate, pkts/s/port 10-12 10 5 10 6 10 7 10 8 Packet Arrival Rate, pkts/s/port 11

Sources of Power - Time Slotted Network Total Network Power, including PPS (W)

Sources of Power Time Slotted Network Power Dissipation on the Processor Chip, including PPS

Summary Photonic time slotted networks can reduce power consumption by > order of magnitude Total network power Processor chip dissipation Further work required to understand how these networks scale under realistic data centre traffic patterns Transceiver power consumption is significant proportion of on-chip disipation Yury will tell you more in next talk