Intel Labs at ISSCC 2012. Copyright Intel Corporation 2012



Similar documents
SRAM Scaling Limit: Its Circuit & Architecture Solutions

Tri-Band RF Transceivers for Dynamic Spectrum Access. By Nishant Kumar and Yu-Dong Yao

AMS/RF-CMOS circuit design for wireless transceivers

Lecture 3: Modern GPUs A Hardware Perspective Mohamed Zahran (aka Z) mzahran@cs.nyu.edu

UTBB-FDSOI 28nm : RF Ultra Low Power technology for IoT

Static-Noise-Margin Analysis of Conventional 6T SRAM Cell at 45nm Technology

Legal Notices and Important Information

Photonic Networks for Data Centres and High Performance Computing

An All-Digital Phase-Locked Loop with High Resolution for Local On-Chip Clock Synthesis

What is a System on a Chip?

An Introduction to High-Frequency Circuits and Signal Integrity

Area 3: Analog and Digital Electronics. D.A. Johns

Design of a Wireless Medical Monitoring System * Chavabathina Lavanya 1 G.Manikumar 2

Industry First X86-based Single Board Computer JaguarBoard Released

Riding silicon trends into our future

Reconfigurable System-on-Chip Design

1.1 Silicon on Insulator a brief Introduction

數 位 積 體 電 路 Digital Integrated Circuits

A 1-GSPS CMOS Flash A/D Converter for System-on-Chip Applications

Unmatched RF Spectrum Analysis

A Survey on ARM Cortex A Processors. Wei Wang Tanima Dey

Introduction to GP-GPUs. Advanced Computer Architectures, Cristina Silvano, Politecnico di Milano 1

INF4420 Introduction

Normally-Off Technologies for

SOC architecture and design

Digital Design for Low Power Systems

Lesson 7: SYSTEM-ON. SoC) AND USE OF VLSI CIRCUIT DESIGN TECHNOLOGY. Chapter-1L07: "Embedded Systems - ", Raj Kamal, Publs.: McGraw-Hill Education

A New, High-Performance, Low-Power, Floating-Point Embedded Processor for Scientific Computing and DSP Applications

Implementation Details

How To Use A Sound Card With A Subsonic Sound Card

Intel s Revolutionary 22 nm Transistor Technology

2B1459 System-on-Chip Applications (5CU/7.5ECTS) Course Number for Graduate Students (2B5476, 4CU)

Naveen Muralimanohar Rajeev Balasubramonian Norman P Jouppi

Universal Form-factor. Wi Fi Troubleshooting Made Easy

A New Programmable RF System for System-on-Chip Applications

Binary search tree with SIMD bandwidth optimization using SSE

Power Reduction Techniques in the SoC Clock Network. Clock Power

DESIGN CHALLENGES OF TECHNOLOGY SCALING

8 Gbps CMOS interface for parallel fiber-optic interconnects

How To Build A Cloud Computer

Computer Aided Design of Home Medical Alert System

A 10,000 Frames/s 0.18 µm CMOS Digital Pixel Sensor with Pixel-Level Memory

IC-EMC Simulation of Electromagnetic Compatibility of Integrated Circuits

Wireless Sensor Network for Electric Meter Handling

mm-wave System-On-Chip & System-in-Package Design for 122 GHz Radar Sensors

Microtronics technologies Mobile:

FPGAs in Next Generation Wireless Networks

A true low voltage class-ab current mirror

ZigBee Technology Overview

Department of Electrical and Computer Engineering Ben-Gurion University of the Negev. LAB 1 - Introduction to USRP

The Heartbeat behind Portable Medical Devices: Ultra-Low-Power Mixed-Signal Microcontrollers

Chapter 2 Heterogeneous Multicore Architecture

SG Miniature Wi-Fi Radio

System Design Issues in Embedded Processing

Lecture 11: Multi-Core and GPU. Multithreading. Integration of multiple processor cores on a single chip.

Analog Signal Conditioning

NVIDIA SDR (Software Defined Radio) Technology

Memory Systems. Static Random Access Memory (SRAM) Cell

White Paper: Pervasive Power: Integrated Energy Storage for POL Delivery

OpenPOWER Outlook AXEL KOEHLER SR. SOLUTION ARCHITECT HPC

BEAGLEBONE BLACK ARCHITECTURE MADELEINE DAIGNEAU MICHELLE ADVENA

Chapter 2 Logic Gates and Introduction to Computer Architecture

ÇANKAYA ÜNİVERSİTESİ ECE 491 SENIOR PROJECT I ERDİNÇ YILMAZ

Chapter 6 PLL and Clock Generator

Wireless monitoring system for temperature and humidity based on ZigBee

Implementation of Short Reach (SR) and Very Short Reach (VSR) data links using POET DOES (Digital Opto- electronic Switch)

Hardware-Aware Analysis and. Presentation Date: Sep 15 th 2009 Chrissie C. Cui

Side Channel Analysis and Embedded Systems Impact and Countermeasures

Computer System: User s View. Computer System Components: High Level View. Input. Output. Computer. Computer System: Motherboard Level

SIMPLE TECHNIQUES TO IMPROVE SOLAR PANEL EFFICIENCY USING A MICROCONTROLLER OR SOC

Evaluating AC Current Sensor Options for Power Delivery Systems

Am186ER/Am188ER AMD Continues 16-bit Innovation

Intel Xeon Processor E5-2600

WiSER: Dynamic Spectrum Access Platform and Infrastructure

AMICSA Integrated SAR Receiver/Converter for L, C and X bands Markku Åberg VTT Technical Research Centre of Finland

US-SPI New generation of High performances Ultrasonic device

Digital Integrated Circuit (IC) Layout and Design

Spike-Based Sensing and Processing: What are spikes good for? John G. Harris Electrical and Computer Engineering Dept

ISSCC 2003 / SESSION 4 / CLOCK RECOVERY AND BACKPLANE TRANSCEIVERS / PAPER 4.7

Logical Operations. Control Unit. Contents. Arithmetic Operations. Objectives. The Central Processing Unit: Arithmetic / Logic Unit.

CMOS 5GHz WLAN a/n/ac RFeIC WITH PA, LNA, AND SPDT

Impedance 50 (75 connectors via adapters)

EEM870 Embedded System and Experiment Lecture 1: SoC Design Overview

Serial port interface for microcontroller embedded into integrated power meter

Testing of Digital System-on- Chip (SoC)

Design Considerations for RF Energy Harvesting Devices

Embedded Systems: map to FPGA, GPU, CPU?

Wireless Communication and RF System Design Using MATLAB and Simulink Giorgia Zucchelli Technical Marketing RF & Mixed-Signal

7a. System-on-chip design and prototyping platforms

Intel Itanium Architecture

Programmable Single-/Dual-/Triple- Tone Gong SAE 800

«A 32-bit DSP Ultra Low Power accelerator»

Article from Micrel. A new approach to the challenge of powering cellular M2M modems By Anthony Pele Senior Field Applications Engineer, Micrel

Transcription:

Intel Labs at ISSCC 2012 Copyright Intel Corporation 2012

Intel Labs ISSCC 2012 Highlights 1. Efficient Computing Research: Making the most of every milliwatt to make computing greener and more scalable across the Continuum 5-10x efficiency gains by enabling low or near-threshold voltage (NTV) operation Applying low voltage techniques broadly to compute, graphics, and memory circuits 2. Digital Radio Research: Simplifying radio architecture by approaching it as a computing problem rather than a circuit design problem Enabling Moore s Law for RF circuits through digital techniques Tackling the last few barriers to make digital RF practical for SoC integration 3. High-Efficiency Math: Reducing power while guaranteeing the accuracy of numeric computations with right-sized floating point arithmetic 2 Revolutionary approach to attack floating point challenges at the architecture level

ENERGY EFFICENCY Investing in Near Threshold Voltage HIGH Subthreshold FRUGAL NTV NORMAL TURBO Standard operating range Peak energy efficiencies at NTV Greater dynamic operating range Ideal for variable workloads and highly parallel applications 5x 10x Applicable from deeply embedded to exascale computing LOW ZERO VCC MAX Threshold: Voltage at which transistors begin to conduct electricity (turn on) 3

Near Threshold Voltage Processor (3.6) A 280mV-to-1.2V Wide-Operating-Range IA-32 Processor in 32nm CMOS Claremont: low-power IA concept processor Initially demoed at Fall IDF 2011 First processor to demonstrate the benefits of (NTV) circuits for compute New for ISSCC Overview of chip layout, design methodology, etc. Built in low-leakage 32nm SoC technology Operates from 280mV @3MHz to 1.2V @915MHz 4.7x better energy efficiency in NTV mode 2mW minimum power Core demonstrated running Windows and Linux powered by this solar cell 4

Lowering the Operating Voltage for Memory (13.3) Capacitive-Coupling Wordline Boosting with Self-Induced VCC Collapse for Write V min Reduction in 22-nm 8T SRAM Std Array 4.5KB CC Array 6KB scan Std Array 6KB CC Array 6KB Demonstrates voltage scaling for medium or large memory arrays with minimal area or capacitance impact Allows 80-140mV V min reduction for a 1MB memory array by boosting voltage on sensitive writes Typically requires a costly charge pump circuit for a local voltage increase Eliminates need for charge pump via clever and novel use of intrinsic capacitances 5

NTV SIMD Engine for Processor Graphics (10.1) A 280mV-to-1.1V 256b Reconfigurable SIMD Vector Permutation Engine With 2-Dimensional Shuffle in 22nm CMOS Clock I/O Register File Permute Xbar Register File I/O Control Winner ISSCC 2012 Distinguished Technical Paper Award First demonstration of NTV on 22nm Tri-Gate technology Shows NTV viability for both compute and memory on representative SIMD block: Vector Permutation Engine Dynamic voltage scaling down to 280mV: 9x efficiency gain 6

Rethinking Radio as a Compute Problem T/R Switch ISSCC 11 Switching Power Amplifier ESSCIRC 10 Low Noise Amplifier ISSCC 11 SD ADC VLSI 11 Digital Transmitter VLSI 11 Digital Synthesizer VLSI 10 Inductorless LO Generation ISSCC 09 Conventional radio circuits are analog, struggle to keep up with CMOS scaling Intel Labs investing in research to fully exploit computational nature of radio Removing the final barriers to make digital RF practical for SoC integration Bringing the benefits of Moore s Law to RF circuits

Pure Digital RF Transmitter (9.4) A 20dBm 2.4GHz Digital Outphasing Transmitter for WLAN Application in 32nm CMOS Switching power amplifier enables performance to improve with CMOS scaling 1 st demonstration of a digital phase modulator architecture delivering full WiFi bandwidth State-of-the-art power efficiency and designed to improve with further CMOS scaling 8

First Atom SoC with CMOS WiFi Radio (3.4) 32nm x86 OS-Compliant PC On-Chip With Dual-Core Atom Processor and RF WiFi Transceiver Rosepoint: First 32nm SoC with WiFi RF transceiver and two Atom cores on the same die Demonstrates effective mitigation of interference between the WiFi radio and IA cores, despite operation in similar frequency range Cross-corporate collaboration between Intel research, development, and manufacturing 9

Variable Precision Floating Point Unit (10.3) A 1.45GHz 52-to-162GFLOPS/W Variable-Precision Floating-Point Fused Multiply-Add Unit With Certainty Tracking in 32nm CMOS 1 st reported variable-precision floating point unit with accuracy tracking for multiply-add Today s floating-point math wastes energy, time, and storage by using worst-case precision everywhere Using variable precision (24-bit 12-bit 6-bit) as needed can cut energy by 50% Uses NTV circuits for up to 7x further efficiency gain 10

Additional Intel Papers at ISSCC 2012 Processor A 22nm IA Multi-CPU and GPU System-on-Chip Low Power A 2.05G Vertices/s 151mW Lighting Accelerator for 3D Graphics in 32nm CMOS A 4.6GHz 162Mb SRAM in 22nm CMOS With Integrated Active VMIN-Enhancing Assist Circuitry Digital Radio A 32nm CMOS All-Digital Reconfigurable Fractional Freq. Divider for Multistandard SoC Radios Clock Generation A Reconfigurable Distributed All-Digital Clock Generator Core in 22nm High-k Tri-Gate LP CMOS A TDC-Less ADPLL With 200-to-3200MHz Range for Mobile SoC Clocking in 22nm CMOS Sensing Ratiometric BJT-Based Thermal Sensor in 32nm and 22nm Technologies

12