Low Power System-On-Chip Design. Retention Register Design

Similar documents
Low Power AMD Athlon 64 and AMD Opteron Processors

Lesson 12 Sequential Circuits: Flip-Flops

NTE2053 Integrated Circuit 8 Bit MPU Compatible A/D Converter

1. Memory technology & Hierarchy

Counters and Decoders

Lecture-3 MEMORY: Development of Memory:

Memory Elements. Combinational logic cannot remember

CHAPTER 11: Flip Flops

Memory Basics. SRAM/DRAM Basics

Chapter 9 Semiconductor Memories. Jin-Fu Li Department of Electrical Engineering National Central University Jungli, Taiwan

BINARY CODED DECIMAL: B.C.D.

Latches, the D Flip-Flop & Counter Design. ECE 152A Winter 2012

Memory Testing. Memory testing.1

SEQUENTIAL CIRCUITS. Block diagram. Flip Flop. S-R Flip Flop. Block Diagram. Circuit Diagram

L297 STEPPER MOTOR CONTROLLERS

Computer Architecture

Chapter 9 Latches, Flip-Flops, and Timers

Lecture 11: Sequential Circuit Design

Sequential Circuits. Combinational Circuits Outputs depend on the current inputs

Chapter 2 Logic Gates and Introduction to Computer Architecture

Introduction to CMOS VLSI Design (E158) Lecture 8: Clocking of VLSI Systems

ETEC 2301 Programmable Logic Devices. Chapter 10 Counters. Shawnee State University Department of Industrial and Engineering Technologies

To design digital counter circuits using JK-Flip-Flop. To implement counter using 74LS193 IC.

RAM & ROM Based Digital Design. ECE 152A Winter 2012

Having read this workbook you should be able to: recognise the arrangement of NAND gates used to form an S-R flip-flop.

So far we have investigated combinational logic for which the output of the logic devices/circuits depends only on the present state of the inputs.

Digital Systems Based on Principles and Applications of Electrical Engineering/Rizzoni (McGraw Hill

CHAPTER 16 MEMORY CIRCUITS

S. Venkatesh, Mrs. T. Gowri, Department of ECE, GIT, GITAM University, Vishakhapatnam, India

EE 42/100 Lecture 24: Latches and Flip Flops. Rev B 4/21/2010 (2:04 PM) Prof. Ali M. Niknejad

Lecture 10 Sequential Circuit Design Zhuo Feng. Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 2010

Layout of Multiple Cells

COMBINATIONAL and SEQUENTIAL LOGIC CIRCUITS Hardware implementation and software design

TRUE SINGLE PHASE CLOCKING BASED FLIP-FLOP DESIGN

Latch Timing Parameters. Flip-flop Timing Parameters. Typical Clock System. Clocking Overhead

Switch Fabric Implementation Using Shared Memory

L4: Sequential Building Blocks (Flip-flops, Latches and Registers)

Introduction to CMOS VLSI Design

A N. O N Output/Input-output connection

Lecture 10: Sequential Circuits

CS250 VLSI Systems Design Lecture 8: Memory

Lecture 5: Gate Logic Logic Optimization

Semiconductor Memories

Sequential Logic. (Materials taken from: Principles of Computer Hardware by Alan Clements )

ECE124 Digital Circuits and Systems Page 1

Sequential Circuit Design

Lecture 7: Clocking of VLSI Systems

PROGETTO DI SISTEMI ELETTRONICI DIGITALI. Digital Systems Design. Digital Circuits Advanced Topics

Digital Logic Design. Basics Combinational Circuits Sequential Circuits. Pu-Jen Cheng

Power Reduction Techniques in the SoC Clock Network. Clock Power

Sequential Logic: Clocks, Registers, etc.

1.1 Silicon on Insulator a brief Introduction

Memory Systems. Static Random Access Memory (SRAM) Cell

Memory. The memory types currently in common usage are:

Systems I: Computer Organization and Architecture

Testing Low Power Designs with Power-Aware Test Manage Manufacturing Test Power Issues with DFTMAX and TetraMAX

Module 2. Embedded Processors and Memory. Version 2 EE IIT, Kharagpur 1

LOW POWER DESIGN OF DIGITAL SYSTEMS USING ENERGY RECOVERY CLOCKING AND CLOCK GATING

Master/Slave Flip Flops

DIGITAL TECHNICS II. Dr. Bálint Pődör. Óbuda University, Microelectronics and Technology Institute

ASYNCHRONOUS COUNTERS

Micro-Step Driving for Stepper Motors: A Case Study

Chapter 7 Memory and Programmable Logic

Lecture 10: Latch and Flip-Flop Design. Outline

Wiki Lab Book. This week is practice for wiki usage during the project.

Design and analysis of flip flops for low power clocking system

CS311 Lecture: Sequential Circuits

HT1632C 32 8 &24 16 LED Driver

Module 7 : I/O PADs Lecture 33 : I/O PADs

ECE380 Digital Logic

Combinational Logic Design Process

Topics of Chapter 5 Sequential Machines. Memory elements. Memory element terminology. Clock terminology

CHAPTER 11 LATCHES AND FLIP-FLOPS

ECE410 Design Project Spring 2008 Design and Characterization of a CMOS 8-bit Microprocessor Data Path

Leakage Power Reduction Using Sleepy Stack Power Gating Technique

Timer A (0 and 1) and PWM EE3376

The components. E3: Digital electronics. Goals:

McPAT: An Integrated Power, Area, and Timing Modeling Framework for Multicore and Manycore Architectures

Chapter 10 Advanced CMOS Circuits

Modeling Sequential Elements with Verilog. Prof. Chien-Nan Liu TEL: ext: Sequential Circuit

Chapter 5 :: Memory and Logic Arrays

Class 11: Transmission Gates, Latches

True Single Phase Clocking Flip-Flop Design using Multi Threshold CMOS Technique

Implementation Details

Engr354: Digital Logic Circuits

A New Paradigm for Synchronous State Machine Design in Verilog

Theory of Transistors and Other Semiconductor Devices

University of Texas at Dallas. Department of Electrical Engineering. EEDG Application Specific Integrated Circuit Design

Yaffs NAND Flash Failure Mitigation

NAND Flash FAQ. Eureka Technology. apn5_87. NAND Flash FAQ

Theory of Operation. Figure 1 illustrates a fan motor circuit used in an automobile application. The TPIC kω AREF.

Pass Gate Logic An alternative to implementing complex logic is to realize it using a logic network of pass transistors (switches).

LFSR BASED COUNTERS AVINASH AJANE, B.E. A technical report submitted to the Graduate School. in partial fulfillment of the requirements

Status of the design of the TDC for the GTK TDCpix ASIC

Low latency synchronization through speculation

Digital Logic Elements, Clock, and Memory Elements

Notes about Small Signal Model. for EE 40 Intro to Microelectronic Circuits

Interfacing 3V and 5V applications

DIGITAL COUNTERS. Q B Q A = 00 initially. Q B Q A = 01 after the first clock pulse.

Information Board User s Guide

Transcription:

1 Low Power System-On-Chip Design Chapter 13: Retention Register Design Ismo Hänninen Institute Department of Digital of Computer and Computer Systems Systems / TKT-9626 / TKT-9636 Ch13: Retention Register Design 1/8/2009

2 Chapter 13: Retention Register Design Overview Retention Register Types Single Control Live Slave Dual Control Balloon Single Control Balloon Retention Register Relative Layouts Memory Retention Methods Reduced VDD Retention Source-Diode Biasing Source Biasing Memory Retention Latency Reduction Department of Computer Systems / TKT-9626 Ch13: Retention Register Design 1/8/2009

3 Overview Storing the data during block power-down (Power Gating) Retention capable flip-flops Or larger memories

4 Retention Register Types Single Control Live Slave Dual Control Balloon Single Control Balloon

5 Single Control Live Slave Standard master-slave flip-flop with minimal modifications Master-latch power-gated, fast Low-V T transistors Slave-latch always powered-on, slow High-V T transistors Three-state buffer (T1) between latches, isolating slave input AND-gate for isolating clock input, during power-down Control signal NRETAIN

6 Single Control Live Slave Usage: Control signal changed only, when clock is inactive (low) Control signal always driven Possible set/reset signals isolated Clock has to be low, before restoring state Advantages: Minimal area Single control signal Disadvantages: Slow slave-latch (low leakage High-V T transistors) CLK to Q output delay increased Longer input data hold-time needed (due to slow clock gating AND), but also good balance of rise and fall times needed (fast gates) Making sure clock is low, when restoring state

7 Dual Control Balloon Standard master-slave slave flipflop with added retention latch Master and slave latches power-gated, fast Low-V T transistors Retention ti latch (shadow latch) always powered-on, slow High- V T transistors Three-state buffer (T1) isolating retention latch input Control signals SAVE, NRESTORE

8 Dual Control Balloon Usage: SAVE control signal always driven, pulsed to latch data into shadow register NRESTORE control signal may float during power-off, pulsed to restore data Clock can be low or high, before restoring state Possible set/reset signals do not affect shadow register, when holding data Advantages: Minimal leakage power (minimal shadow latch and control transistors) Almost no performance loss, compared to non-retaining flip-flop (only very little increase on transistor loading) Clock phase independent restore Disadvantages: Area increased, compared to live-slave (due to third latch) Two control signals, requiring two buffer networks, also adds complexity

9 Single Control Balloon Standard master-slave slave flipflop with added retention latch Master and slave latches power-gated, fast Low-V T transistors Retention ti latch (shadow latch) always powered-on, slow High- V T transistors Three-state buffer (T1) isolating retention latch input Control signal NRET (save during runtime, restore on edge)

10 Single Control Balloon Usage: Control signal always driven Clock can be low or high, before restoring state Possible set/reset signals do not affect shadow register, when holding data Advantages: Almost minimal leakage power (minimal shadow latch and control transistors) Almost no performance loss, compared to non-retaining flip-flop (only very little increase on transistor loading) Clock phase independent restore System-level dynamic power reduced, since only one control signal network Disadvantages: Area increased, compared to live-slave (due to third latch) Some (small) additional dynamic power, since shadow register follows slavelatch state (during run-time)

11 Retention Register Relative Layouts Standard D-type register (scan-testable, minimal output buffer) Balloon D-type register (scan-testable, minimal output buffer)

12 Memory Retention Methods Reduced VDD Retention Source-Diode Biasing Source Biasing

13 Memory Retention Methods Often, FIFOs, caches, and other memories are allowed to reset after power-down (losing contents) For minimum wake-up latency, also memory contents have to be retained Large memories are too expensive to implement by using retention register approach (replacing standard, optimized memory cells) Needs a solution to reduce the basic cell leakage (without corrupting memory contents) Three basic approaches for SRAM

14 Reduced VDD Retention Memory is given own power supply, with two modes (VDD): Normal supply voltage Reduced (0.5-0.6 V) sleeping voltage Reduced voltage saves leakage power Memory contents stay uncorrupted, but cannot be accessed Simple implementation No memory circuit change needed A switchable, dedicated power supply added

15 Source-Diode Biasing Biasing the SRAM cell source voltage Reduced operating voltage (in addition to lowered VDD) Reverse body bias reduces sub-threshold leakage (NMOS substrate voltage is ground) Simplest biasing: Diode in source supply of cell Bypass control switch Problem: Diode threshold sets fixed bias for the process, difficult to optimize

16 Source Biasing Biasing the SRAM cell source voltage Reduced operating voltage (in addition to lowered VDD) Reverse body bias reduces sub-threshold leakage (NMOS substrate voltage is ground) Dedicated source bias supply (replacing the diode) Optimal operating voltage and bias in sub-1v designs

17 Memory Retention Latency Reduction For memories, which h are not often accessed (but always retained), if short access latency is required Block-based retention and wakeup Memory divided id d into banks, individual id control of sleep mode Normally, bank mode is sleep Read/write access actives only one bank (address decoder) Reduced wakeup latency, since smaller power blocks (smaller virtual VSS network, per bank) Compromise bank size, speed vs. column overhead (small banks cause area and power overhead in column sense-amps) Row-based retention and wakeup Biased voltage net for each row, individual control of sleep mode Normally, row mode is sleep Read/write access actives only one row Very fast, since a row is relatively small Reduced control overhead, by using row-grouping Department of Computer Systems / TKT-9626 Ch13: Retention Register Design 08.01.2009

18 Conclusion Retention flip-flops easily replace standard flip-flops Performance loss (live slave) Or area penalty (balloon) For retained memories, the only practical way to save energy during sleep is reduced operating voltage VDD and VSS moved near each other Sleep/wakeup latency reduction with block/row grouping