Lecture 10: Latch and Flip-Flop Design. Outline



Similar documents
Lecture 10: Sequential Circuits

Clocking. Figure by MIT OCW Spring /18/05 L06 Clocks 1

Lecture 11: Sequential Circuit Design

Lecture 10 Sequential Circuit Design Zhuo Feng. Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 2010

Sequential Circuits. Combinational Circuits Outputs depend on the current inputs

Lecture 7: Clocking of VLSI Systems

Introduction to CMOS VLSI Design (E158) Lecture 8: Clocking of VLSI Systems

L4: Sequential Building Blocks (Flip-flops, Latches and Registers)

Topics of Chapter 5 Sequential Machines. Memory elements. Memory element terminology. Clock terminology

Latch Timing Parameters. Flip-flop Timing Parameters. Typical Clock System. Clocking Overhead

A Survey on Sequential Elements for Low Power Clocking System

Latches, the D Flip-Flop & Counter Design. ECE 152A Winter 2012

TRUE SINGLE PHASE CLOCKING BASED FLIP-FLOP DESIGN

LOW POWER DUAL EDGE - TRIGGERED STATIC D FLIP-FLOP

S. Venkatesh, Mrs. T. Gowri, Department of ECE, GIT, GITAM University, Vishakhapatnam, India

Weste07r4.fm Page 183 Monday, January 5, :39 AM. 7.1 Introduction

Sequential Logic: Clocks, Registers, etc.

Sequential 4-bit Adder Design Report

ECE380 Digital Logic

Theory of Logic Circuits. Laboratory manual. Exercise 3

Sequential Circuit Design

Design and analysis of flip flops for low power clocking system

Memory Elements. Combinational logic cannot remember

Sequential Logic Design Principles.Latches and Flip-Flops

Layout of Multiple Cells

CHAPTER 16 MEMORY CIRCUITS

EE 42/100 Lecture 24: Latches and Flip Flops. Rev B 4/21/2010 (2:04 PM) Prof. Ali M. Niknejad

CHAPTER 11 LATCHES AND FLIP-FLOPS

Introduction to CMOS VLSI Design

True Single Phase Clocking Flip-Flop Design using Multi Threshold CMOS Technique

Set-Reset (SR) Latch

LOW POWER DESIGN OF DIGITAL SYSTEMS USING ENERGY RECOVERY CLOCKING AND CLOCK GATING

Module 4 : Propagation Delays in MOS Lecture 22 : Logical Effort Calculation of few Basic Logic Circuits

ECE124 Digital Circuits and Systems Page 1

Master/Slave Flip Flops

DESIGN CHALLENGES OF TECHNOLOGY SCALING

So far we have investigated combinational logic for which the output of the logic devices/circuits depends only on the present state of the inputs.

WEEK 8.1 Registers and Counters. ECE124 Digital Circuits and Systems Page 1

TIMING-DRIVEN PHYSICAL DESIGN FOR DIGITAL SYNCHRONOUS VLSI CIRCUITS USING RESONANT CLOCKING

Having read this workbook you should be able to: recognise the arrangement of NAND gates used to form an S-R flip-flop.

Static-Noise-Margin Analysis of Conventional 6T SRAM Cell at 45nm Technology

SEQUENTIAL CIRCUITS. Block diagram. Flip Flop. S-R Flip Flop. Block Diagram. Circuit Diagram

CSE140: Components and Design Techniques for Digital Systems

Digital Logic Design. Basics Combinational Circuits Sequential Circuits. Pu-Jen Cheng

Timing Methodologies (cont d) Registers. Typical timing specifications. Synchronous System Model. Short Paths. System Clock Frequency

Engr354: Digital Logic Circuits

Flip-Flops, Registers, Counters, and a Simple Processor

NTE2053 Integrated Circuit 8 Bit MPU Compatible A/D Converter

ISSCC 2003 / SESSION 13 / 40Gb/s COMMUNICATION ICS / PAPER 13.7

Power Reduction Techniques in the SoC Clock Network. Clock Power

CDA 3200 Digital Systems. Instructor: Dr. Janusz Zalewski Developed by: Dr. Dahai Guo Spring 2012

Sequential Logic. (Materials taken from: Principles of Computer Hardware by Alan Clements )

Pass Gate Logic An alternative to implementing complex logic is to realize it using a logic network of pass transistors (switches).

CS311 Lecture: Sequential Circuits

Alpha CPU and Clock Design Evolution

Lesson 12 Sequential Circuits: Flip-Flops

NAME AND SURNAME. TIME: 1 hour 30 minutes 1/6

Multi-GHz Systems Clocking Invited Paper

BINARY CODED DECIMAL: B.C.D.

CS250 VLSI Systems Design Lecture 8: Memory

PROGETTO DI SISTEMI ELETTRONICI DIGITALI. Digital Systems Design. Digital Circuits Advanced Topics

Design Verification & Testing Design for Testability and Scan

Optimization and Comparison of 4-Stage Inverter, 2-i/p NAND Gate, 2-i/p NOR Gate Driving Standard Load By Using Logical Effort

Two-Phase Clocking Scheme for Low-Power and High- Speed VLSI

INTEGRATED CIRCUITS. For a complete data sheet, please also download:

DIGITAL SYSTEM CLOCKING. High-Performance and Low-Power Aspects

INTEGRATED CIRCUITS. For a complete data sheet, please also download:

路 論 Chapter 15 System-Level Physical Design

High Performance Low Power Dual Edge Triggered Static D Flip-Flop

MAS.836 HOW TO BIAS AN OP-AMP

e.g. τ = 12 ps in 180nm, 40 ps in 0.6 µm Delay has two components where, f = Effort Delay (stage effort)= gh p =Parasitic Delay

ASYNCHRONOUS COUNTERS

Chapter 9 Latches, Flip-Flops, and Timers

CMOS, the Ideal Logic Family

Use and Application of Output Limiting Amplifiers (HFA1115, HFA1130, HFA1135)

CMOS Binary Full Adder

Low Power AMD Athlon 64 and AMD Opteron Processors

Modeling Sequential Elements with Verilog. Prof. Chien-Nan Liu TEL: ext: Sequential Circuit

MM74HCT373 MM74HCT374 3-STATE Octal D-Type Latch 3-STATE Octal D-Type Flip-Flop

1 pc Charge Injection, 100 pa Leakage CMOS 5 V/5 V/3 V 4-Channel Multiplexer ADG604

CD4027BC Dual J-K Master/Slave Flip-Flop with Set and Reset

Gate Delay Model. Estimating Delays. Effort Delay. Gate Delay. Computing Logical Effort. Logical Effort

Experiment # 9. Clock generator circuits & Counters. Eng. Waleed Y. Mousa

Lecture 5: Logical Effort

Digital Fundamentals

Chapter 2 Logic Gates and Introduction to Computer Architecture

7. Latches and Flip-Flops

DIGITAL TECHNICS II. Dr. Bálint Pődör. Óbuda University, Microelectronics and Technology Institute

RAM & ROM Based Digital Design. ECE 152A Winter 2012

Understanding Power Impedance Supply for Optimum Decoupling

DM74LS112A Dual Negative-Edge-Triggered Master-Slave J-K Flip-Flop with Preset, Clear, and Complementary Outputs

Module 7 : I/O PADs Lecture 33 : I/O PADs

These help quantify the quality of a design from different perspectives: Cost Functionality Robustness Performance Energy consumption

Class 11: Transmission Gates, Latches

Transcription:

Lecture 1: Latch and Flip-Flop esign Slides orginally from: Vladimir Stojanovic Computer Systems Laboratory Stanford University horowitz@stanford.edu 1 Outline Recent interest in latches and flip-flops Timing and Power metrics esign and optimization tradeoffs Master-slave vs. Pulse-triggered Latch Representative designs Comparison 2

Recent Interest in Flip-Flops Trends in high-performance systems! Higher clock frequency! More transistors on chip Consequences! Increased flip-flop overhead relative to cycle time Cycle time 1-2 FO4 delays, flop overhead 2-4 FO4! ifficult to control both edges of the clock! Higher impact of clock skew! Higher crosstalk and substrate coupling! Higher power consumption expensive packages and cooling systems limit in performance! Clock burns up to 4%, flops up to 2% of total power 3 Requirements in the Flip-Flop esign Small -Output delay, Narrow sampling window Low power Small clock load High driving capability (increased levels of parallelism)! Typical flip-flop load in a.18µm CMOS ranges from 5fF to over 2fF, with typical values of 1-15fF in critical paths (2-8FO4s or even higher) Integration of logic into the flop Multiplexed or clock scan Crosstalk insensitivity - dynamic/high impedance nodes are affected 4

Flip-Flop elay Sum of setup time and -output delay is the only true measure of the performance with respect to the system speed T = T - + T Logic + T setup +T skew Logic N T - T Logic T Setup 5 elay vs. Setup/Hold Times 35 Minimum ata-output 3 25 -Output [ps] Setup 2 15 1 Hold 5-2 -15-1 -5 5 1 15 2 ata- [ps] 6

Timing parameters, details 41 39 Unstable - region Stable - region Time [ps] 37 35 33 31 Failure region C +U minimum - - - stable C 29 27 Optimum setup time U 25-8 -6-4 -2 2 4 6 8 1 - delay [ps] The best point to pick on delay curve is minimum - 7 PPtot [fj] esign & optimization tradeoffs 9 8 Opposite Goals 7! Minimal Total power 6 5 consumption 4! Minimal elay 3 2 Power-elay tradeoff 1 5 1 15 2 Minimize Power-elay product (PP tot ) Total Power [uw] 9 9 8 8 7 7 6 6 5 5 4 4 3 3 2 2 1 1 5 1 15 2 25 2 4 6 8 1 12 Width [um] elay [ps] PPtot [fj] PPtot [fj] 8

Types of Flip-Flops Master-Slave Latch Pulse-Triggered Latch ata L1 L2 ata L ata S R 9 Master-Slave Latches Positive setup times Two clock phases:! distributed globally! generated locally Small penalty in delay for incorporating MUX Some circuit tricks needed to reduce the overall delay 1

T-G Master-Slave Latch PowerPC 63 (Gerosa, JSSC 12/94) b b 11 T-G Master-Slave Latch Low power feedback Unbuffered input! input capacitance depends on the phase of the clock! over-shoot and under-shoot with long routes! wirelength must be restricted at the input Clock load is high Low power Small clk-output delay, but positive setup Easily embedded scan or mux 12

C 2 MOS MS Latches Ck Ckb Ckb Ck Ck Ckb Ck Ck Low power feedback Locally generated second phase Poor driving capability Robustness to clock slope Ckb 13 Single-Transistor-Clocked MS latches STC SSTC Yuan and Svennson, JSSC Jan. 97 Ratioed CVS and SRPL based designs Relatively small clock load Very sensitive to input glitching Capacitive coupling and charge sharing related speed and power problems 14

Pulse-Triggered Latches First stage is a pulse generator! generates a pulse (glitch) on a rising edge of the clock Second stage is a latch! captures the pulse generated in the first stage Pulse generation results in a negative setup time Frequently exhibit a soft edge property Must check for hold time violations Note: power is always consumed in the clocked pulse generator 15 Hybrid Latch Flip-Flop AM K-6, Partovi, ISSCC 96 16

HLFF Operation 1- and -1 transitions at the input with ps setup time 17 Skew absorption Hybrid Latch Flip-Flop Partovi et al, ISSCC 96 18

Hybrid Latch Flip-Flop Flip-flop features:! single phase clock! edge triggered, on one clock edge Latch features: Soft clock edge property! brief transparency, equal to 3 inverter delays! negative setup time! allows slack passing! absorbs skew Hold time is comparable to HLFF delay! minimum delay between flip-flops must be controlled Fully static Possible to incorporate logic 19 Semi-ynamic Flip-Flop (SFF) Sun UltraSparc III, Klass, VLSI Circuits 98 Soft edge conditioned by data since first stage is precharged - cross-coupled latch is added for robustness Small penalty for adding logic Latch has one transistor less in stack - faster than HLFF, but 1-1 glitch exists 2

Sense-amplifier-based flip-flop Matsui et al. 1994. EC Alpha 21264, StrongARM 11 First stage is a sense amplifier On rising clock edge monotonic S_b or R_b trigger the S-R latch Cross-coupled NAN - speed bottleneck Big power savings in reduced swing designs Nice interface to/from domino logic 21 Modified Sense Amplifier-Based Flip-Flop The first stage is unchanged sense amplifier Second stage is sized to provide maximum switching speed river transistors are large Keeper transistors are small and disengaged during transitions Nikolic & Stojanovic, ISSCC 99 22

Modified Sense Amplifier-Based Flip-Flop elay of each of the outputs is independent of the load on the other output elay of and is symmetrical as opposed to the NAN based design Convenient for dual rail logic and driving strength for standard CMOS is effectively doubled SAFF presents a small clock load, small setup time and all the advantages of original design Possible tradeoff between speed and robustness to crosstalk 23 K-6 ual-rail ETL Self-reset property! increases dynamic power! drives domino logic Precharge increases speed Very fast but burns a lot of power Small clock load 24

Flip-Flop Performance Comparison Test bench ata 2fF Clock 2fF Total power consumed! internal power! data power! clock power Measured for four cases! no activity ( and 1111 )! maximum activity (111..)! average activity (random sequence) 5fF elay is (minimum -) - + setup time 25 elay comparison elay [ps] 7 6 5 4 3 2 1 K6 SA-F/F StrongArm SSTC STC elay [ps] 35 3 25 2 15 1 5 SFF HLFF PowerPC mc2mos Pulsed design brings the fastest structures 26

Overall performance 6 16 5 14 12 PPtot [fj] 4 3 2 PPtot [fj] 1 8 6 4 1 2 HLFF SFF PowerPC mc2mos SA-F/F Strong Arm11 K6 SSTC STC Activity=.5 equal transition probability Activity=.5 equal transition probability Real signals have the activity between and.5 (g) Precharged hybrid structures are the fastest but their power consumption strongly depends on the probability of ones More ones above the g point 27 Conventional - vs. minimum - Total power [uw] 4 35 SSTC & STC 3 Pulsed designs 25 MS designs 2 15 1 5 15 2 25 3 35 4 45 5 55 6 65 elay [ps] HLFF PowerPC Strong Arm FF SA-F/F mc2mos latch K6 ETL SSTC STC SFF Total Power [uw] 4 35 3 25 2 15 1 5 1 15 2 25 3 35 - delay [ps] HLFF PowerPC Strong Arm FF SA-F/F mc2mos latch K6 ETL SSTC STC SFF Hidden positive setup time egradation of total delay 28

Comparison of Clock power consumption STC MS latch SSTC MS latch K6 ETL StrongArm FF SA-F/F mc 2 MOS PowerPC MS latch SFF HLFF 1 2 3 4 5 Local Clock power consumption [µw] 29 Apply! Small clock load! Short direct path! Reduced node swing! Low-power feedback! Pulsed design! Optimization of both Master and Slave latch esign goals Avoid! Positive setup time! Sensitivity to clock slope and skew! ynamic (floating) nodes! ynamic Master latch Conduct Power *elay optimizations on constant frequency - really optimize Energy*elay product Take into account all sources of power dissipation ALWAYS use - + setup time for max delay 3

General characteristics! 6ps = FO4 delay in.2u technology! min gate width 1.6u Nominal conditions # of transistors Table 1: General characteristics Internal Clock power power [uw] [uw] Total transistor width [u] ata power [uw] Total power [uw] elay [ps] PP tot [fj] PowerPC 63 16 147 36 46 5 87 266 23 HLFF 2 162 16 18 3 127 199 25 SFF 23 167 158 27 2 187 187 35 mc 2 MOS 24 17 94 15 6 115 292 34 SA-F/F 19 214 97 18 3 118 272 32 StrongArm FF 2 215 11 18 3 122 275 34 K6 ETL 37 246 25 15 5 27 2 54 SSTC 16 147 94 22 4 12 592 71 STC 1 136 132 22 4 158 629 99 31