TIMING-DRIVEN PHYSICAL DESIGN FOR DIGITAL SYNCHRONOUS VLSI CIRCUITS USING RESONANT CLOCKING



Similar documents
Clocking. Figure by MIT OCW Spring /18/05 L06 Clocks 1

Alpha CPU and Clock Design Evolution

Agenda. Michele Taliercio, Il circuito Integrato, Novembre 2001

How To Design A Chip Layout

Introduction to CMOS VLSI Design (E158) Lecture 8: Clocking of VLSI Systems

Timing Methodologies (cont d) Registers. Typical timing specifications. Synchronous System Model. Short Paths. System Clock Frequency

Latch Timing Parameters. Flip-flop Timing Parameters. Typical Clock System. Clocking Overhead

S. Venkatesh, Mrs. T. Gowri, Department of ECE, GIT, GITAM University, Vishakhapatnam, India

LOW POWER DESIGN OF DIGITAL SYSTEMS USING ENERGY RECOVERY CLOCKING AND CLOCK GATING

ISSCC 2003 / SESSION 13 / 40Gb/s COMMUNICATION ICS / PAPER 13.7

Power Reduction Techniques in the SoC Clock Network. Clock Power

Lecture 7: Clocking of VLSI Systems

PROGETTO DI SISTEMI ELETTRONICI DIGITALI. Digital Systems Design. Digital Circuits Advanced Topics

EE 459/500 HDL Based Digital Design with Programmable Logic. Lecture 16 Timing and Clock Issues

IL2225 Physical Design

NTE2053 Integrated Circuit 8 Bit MPU Compatible A/D Converter

Topics of Chapter 5 Sequential Machines. Memory elements. Memory element terminology. Clock terminology

A NEAR FIELD INJECTION MODEL FOR SUSCEPTIBILITY PREDICTION IN INTEGRATED CIRCUITS

Asynchronous IC Interconnect Network Design and Implementation Using a Standard ASIC Flow

Lecture 11: Sequential Circuit Design

Fairchild Solutions for 133MHz Buffered Memory Modules

Sentinel-SSO: Full DDR-Bank Power and Signal Integrity. Design Automation Conference 2014

NAME AND SURNAME. TIME: 1 hour 30 minutes 1/6

EE 42/100 Lecture 24: Latches and Flip Flops. Rev B 4/21/2010 (2:04 PM) Prof. Ali M. Niknejad

PROGETTO DI SISTEMI ELETTRONICI DIGITALI. Digital Systems Design. Digital Circuits Advanced Topics

Reconfigurable ECO Cells for Timing Closure and IR Drop Minimization. TingTing Hwang Tsing Hua University, Hsin-Chu

INTEGRATED CIRCUITS. For a complete data sheet, please also download:

路 論 Chapter 15 System-Level Physical Design

Design Verification & Testing Design for Testability and Scan

ICS379. Quad PLL with VCXO Quick Turn Clock. Description. Features. Block Diagram

Architectural Level Power Consumption of Network on Chip. Presenter: YUAN Zheng

TRUE SINGLE PHASE CLOCKING BASED FLIP-FLOP DESIGN

Lecture 10: Sequential Circuits

Set-Reset (SR) Latch

DS2187 Receive Line Interface

Model-Based Synthesis of High- Speed Serial-Link Transmitter Designs

Lecture 10: Latch and Flip-Flop Design. Outline

An All-Digital Phase-Locked Loop with High Resolution for Local On-Chip Clock Synthesis

On-Chip Interconnect: The Past, Present, and Future

IBIS for SSO Analysis

Low Power AMD Athlon 64 and AMD Opteron Processors

Sequential 4-bit Adder Design Report

AVX EMI SOLUTIONS Ron Demcko, Fellow of AVX Corporation Chris Mello, Principal Engineer, AVX Corporation Brian Ward, Business Manager, AVX Corporation

Introduction to Digital System Design

TIMING ISSUES IN DIGITAL CIRCUITS

University of Texas at Dallas. Department of Electrical Engineering. EEDG Application Specific Integrated Circuit Design

Class 11: Transmission Gates, Latches

ECE124 Digital Circuits and Systems Page 1

PowerPC Microprocessor Clock Modes

Application Note 58 Crystal Considerations with Dallas Real Time Clocks

11. High-Speed Differential Interfaces in Cyclone II Devices

CMOS, the Ideal Logic Family

Lecture 11. Clocking High-Performance Microprocessors

Efficient Interconnect Design with Novel Repeater Insertion for Low Power Applications

MADR TR. Quad Driver for GaAs FET or PIN Diode Switches and Attenuators. Functional Schematic. Features. Description. Pin Configuration 2

Hunting Asynchronous CDC Violations in the Wild

Harmonics and Noise in Photovoltaic (PV) Inverter and the Mitigation Strategies

Layout of Multiple Cells

1. Description. 2. Feature. 3. PIN Configuration

A 1.62/2.7/5.4 Gbps Clock and Data Recovery Circuit for DisplayPort 1.2 with a single VCO

Phase-Locked Loop Based Clock Generators

DATA SHEET. HEF40193B MSI 4-bit up/down binary counter. For a complete data sheet, please also download: INTEGRATED CIRCUITS

Abstract. Cycle Domain Simulator for Phase-Locked Loops

Programmable Single-/Dual-/Triple- Tone Gong SAE 800

Signal Types and Terminations

Clock Distribution Networks in Synchronous Digital Integrated Circuits

DESIGN CHALLENGES OF TECHNOLOGY SCALING

Contents. Preface. xiii. Part I 1

Sequential Logic. (Materials taken from: Principles of Computer Hardware by Alan Clements )

MM74C150 MM82C19 16-Line to 1-Line Multiplexer 3-STATE 16-Line to 1-Line Multiplexer

Spread-Spectrum Crystal Multiplier DS1080L. Features

Gate Delay Model. Estimating Delays. Effort Delay. Gate Delay. Computing Logical Effort. Logical Effort

International Journal of Electronics and Computer Science Engineering 1482

ATE-A1 Testing Without Relays - Using Inductors to Compensate for Parasitic Capacitance

ISSCC 2003 / SESSION 4 / CLOCK RECOVERY AND BACKPLANE TRANSCEIVERS / PAPER 4.7

ASYNCHRONOUS COUNTERS

Class 18: Memories-DRAMs

On-chip clock error characterization for clock distribution system

Figure 1 FPGA Growth and Usage Trends

ZL40221 Precision 2:6 LVDS Fanout Buffer with Glitchfree Input Reference Switching and On-Chip Input Termination Data Sheet

SPREAD SPECTRUM CLOCK GENERATOR. Features

3-Phase DC Brushless Motor Pre-Drivers Technical Information NJM2625A

Asynchronous counters, except for the first block, work independently from a system clock.

Chapter 13: Verification

A 3.2Gb/s Clock and Data Recovery Circuit Without Reference Clock for a High-Speed Serial Data Link

INTEGRATED CIRCUITS. For a complete data sheet, please also download:

CERAMIC RESONATOR PRINCIPLES

SECTION 2 Transmission Line Theory

Curriculum for a Master s Degree in ECE with focus on Mixed Signal SOC Design

Application Note 58 Crystal Considerations for Dallas Real-Time Clocks

Application Note: PCB Design By: Wei-Lung Ho

DP8570A DP8570A Timer Clock Peripheral (TCP)

Design and Applications of HCPL-3020 and HCPL-0302 Gate Drive Optocouplers

Status of the design of the TDC for the GTK TDCpix ASIC

. MEDIUM SPEED OPERATION - 8MHz . MULTI-PACKAGE PARALLEL CLOCKING FOR HCC4029B HCF4029B PRESETTABLE UP/DOWN COUNTER BINARY OR BCD DECADE

Transcription:

TIMING-DRIVEN PHYSICAL DESIGN FOR DIGITAL SYNCHRONOUS VLSI CIRCUITS USING RESONANT CLOCKING BARIS TASKIN, JOHN WOOD, IVAN S. KOURTEV February 28, 2005

Research Objective Objective: Electronic design automation and synchronization of digital IC systems with rotary resonant clocking technology.

Problems: Clocking at GHz Low-skew low-jitter uncharacteristic Timing violations Power dissipation Some solutions Multi-domain clocking Skew-tolerant multi-phase clocking Alternative technologies Opticalclocking Transmission-line based clocking

Resonant Clocking 1 Oscillator Type Phase Voltage Coupled LC Constant Constant Standing Wave Constant Variable Traveling Wave Variable Constant 1: Abstract from IBM Research http://www.reseach.ibm.com/compsci/project_spotlight/vlsi/

Transmission Line Long interconnect L (variation with process) 1% C (Variation with process) < 30% Vp(variation with process) 15%

Mobius Termination Shunt connected inverters between lines f osc ~ 1/ L 2 laps to complete 360 o phase

Rotary Clock Waveforms Waveforms for line voltage and line current at 2.4GHz

Rotary Clocking Low-jitter ~6ps for 2.4GHz 0.25um 1% of clock period Non-sinusoidal clock signal 20ps rise and fall times (0.25um) 5% of the clock period 16GHz theoretical upper limit in 0.25um

Rotary Cycles 360 o Phase/ring Multi-phase! No distribution, generated across the die Energy preserving Self-replenishing

ASIC Implementation

Capacitive Loading Reduce propagation velocity Independent of parasitic capacitance Increase current in wires, but no CV 2 f power

Rotary Wires for ASIC Rotary distribution Synchronous components

Modes of Operation ASIC drive Global rotary clock to synchronize any number of: Derived clocks Other global signals Reset, Enable, Step, Scan Retain standard FFs Minimal flow impact Direct Drive Maximum power benefit. One high frequency clock grid over whole chip directly driving all FFs. Custom FFs for lowest power. Modified flow.

DFF Load High internal capacitance High dynamic power consumption Direct drive: Rotary clock drives Nfet and Pfet pass devices directly

Latch Load Less clocked C: save CV 2 f power No need to gate clock (only data)

Rotary Modes

CAD: Extraction and Simulation RLC extraction for rotary RC for data Fast SPICE for confirmation Internal STA engine

Physical Design Flow DESIGN ENTRY Partitioning PARTITIONING ROA SIZE PARTITIONING REGISTER INSERTION NO ROA FEASIBLE? YES CSS CLOCK SKEW SCHEDULING CSS on PARTITION I CSS on PARTITION N NO CSS on TOP BLOCK CSS FEASIBLE? YES Placement PLACEMENT REGISTER MAPPING LOGIC PLACEMENT

CAD: Placement & Route 1 Select rotary rings Physical implementation

CAD: Placement & Route 2 Clock Pin Identify communicating register-toregister paths Partitioning Static timing analysis Clock skew scheduling 28% average improvement Parallelization

CSS Parallelization 10k registers 25k local paths 2.5 hours 10*10 rotary clocking 150 registers 500 paths 2 secs Speed up: 44X without parallelization 1286X with parallelization Sub-optimality

CAD: Placement & Route 3 45 o 225 o 0 o 180 o 270 o 90 o = T/4 delay 315 o 135 o Pre-place register banks Map registers to phase Proceed with logic synthesis

Conclusions Look-ahead to next-generation Rotary clocking Non-zero clock skew Parallelization Implementation results to follow

TIMING-DRIVEN PHYSICAL DESIGN FOR DIGITAL SYNCHRONOUS VLSI CIRCUITS USING RESONANT CLOCKING QUESTIONS?

DESIGN AND TIMING ANALYSIS OF LEVEL-SENSITIVE DIGITAL INTEGRATED CIRCUITS BACKUP SLIDES

Clock Period Minimization Problem - 1 Objective function : min T Problem variables For each register R i Earliest/latest arrival times a i, A i Earliest/latest departure times d i, D i Clock signal delay t i

Clock Period Minimization Problem - 2 Problem Parameters For each register R i Clock-to-output delay D CQ Data-to-output D DQ Setup time S i Hold time H i For each local data path R i R j Data propagation time D P if

Practical Causes of Clock Skew Size Mismatches Buffer Size, Interconnect length Process Variations L eff, T ox etc. Temperature Gradients Power Supply Voltage Drop

Rotary Implementation Odd number of crossovers Multi-phase Relative phase information on ring Non-zero clock skew Cross-coupled inverters Low power

Capacitive Loading of the Rotary Ring 4.5 pf each side 0.13u x 10834 e-18 / micron [sq] = 12.7 ff on each gate Assume 10 ff on each line for wiring cap. of spur = 22.7 ff * 200 loads 4.5 pf each side

Benefits of Rotary Clock Architecture No practical upper frequency limitation No practical size limitation Negates the dynamic clock power Guaranteed near-zero skew Precise skew scheduling possible Negligible jitter

Benefits of Rotary Clock Architecture (cont d.) Largely independent of: Process variations Temperature variations Supply voltage Inherently low noise No SSN generated by clock. Differential Greater immunity to noise Less generation of noise

Benefits of Rotary Clock Architecture (cont d) Works for all existing IC processes Short and predictable design cycle Automated CAD tooling