Three-Phase Dual-Rail Pre-Charge Logic



Similar documents
International Journal of Electronics and Computer Science Engineering 1482

Gates, Circuits, and Boolean Algebra

Fault Modeling. Why model faults? Some real defects in VLSI and PCB Common fault models Stuck-at faults. Transistor faults Summary

Design of Low Power One-Bit Hybrid-CMOS Full Adder Cells

Here we introduced (1) basic circuit for logic and (2)recent nano-devices, and presented (3) some practical issues on nano-devices.

CMOS Binary Full Adder

Class 11: Transmission Gates, Latches

Pass Gate Logic An alternative to implementing complex logic is to realize it using a logic network of pass transistors (switches).

A New Low Power Dynamic Full Adder Cell Based on Majority Function

1.1 Silicon on Insulator a brief Introduction

LOW POWER DESIGN OF DIGITAL SYSTEMS USING ENERGY RECOVERY CLOCKING AND CLOCK GATING

NAME AND SURNAME. TIME: 1 hour 30 minutes 1/6

Chapter 10 Advanced CMOS Circuits

CMOS, the Ideal Logic Family

Reconfigurable ECO Cells for Timing Closure and IR Drop Minimization. TingTing Hwang Tsing Hua University, Hsin-Chu

Class 18: Memories-DRAMs

Lecture 5: Gate Logic Logic Optimization

IL2225 Physical Design

Design of Energy Efficient Low Power Full Adder using Supply Voltage Gating

S. Venkatesh, Mrs. T. Gowri, Department of ECE, GIT, GITAM University, Vishakhapatnam, India

Design and analysis of flip flops for low power clocking system

Topics of Chapter 5 Sequential Machines. Memory elements. Memory element terminology. Clock terminology

Optimization and Comparison of 4-Stage Inverter, 2-i/p NAND Gate, 2-i/p NOR Gate Driving Standard Load By Using Logical Effort

e.g. τ = 12 ps in 180nm, 40 ps in 0.6 µm Delay has two components where, f = Effort Delay (stage effort)= gh p =Parasitic Delay

Gate Delay Model. Estimating Delays. Effort Delay. Gate Delay. Computing Logical Effort. Logical Effort

Physical Security: Status and Outlook

McPAT: An Integrated Power, Area, and Timing Modeling Framework for Multicore and Manycore Architectures

路 論 Chapter 15 System-Level Physical Design

CMOS Power Consumption and C pd Calculation

Module 4 : Propagation Delays in MOS Lecture 22 : Logical Effort Calculation of few Basic Logic Circuits

ECE124 Digital Circuits and Systems Page 1

CADENCE LAYOUT TUTORIAL

ECE410 Design Project Spring 2008 Design and Characterization of a CMOS 8-bit Microprocessor Data Path

Sequential 4-bit Adder Design Report

CMOS Thyristor Based Low Frequency Ring Oscillator

NEW adder cells are useful for designing larger circuits despite increase in transistor count by four per cell.

CMOS Logic Integrated Circuits

3 The TTL NAND Gate. Fig. 3.1 Multiple Input Emitter Structure of TTL

Leakage Power Reduction Using Sleepy Stack Power Gating Technique

Layout of Multiple Cells

CSE140: Components and Design Techniques for Digital Systems

Subthreshold Real-Time Counter.

Introduction to CMOS VLSI Design

1ED Compact A new high performance, cost efficient, high voltage gate driver IC family

Chapter 4 Register Transfer and Microoperations. Section 4.1 Register Transfer Language

Clocking. Figure by MIT OCW Spring /18/05 L06 Clocks 1

CSE140: Components and Design Techniques for Digital Systems. Introduction. Prof. Tajana Simunic Rosing

System on Chip Design. Michael Nydegger

FORDHAM UNIVERSITY CISC Dept. of Computer and Info. Science Spring, Lab 2. The Full-Adder

Static-Noise-Margin Analysis of Conventional 6T SRAM Cell at 45nm Technology

These help quantify the quality of a design from different perspectives: Cost Functionality Robustness Performance Energy consumption

Introduction to VLSI Programming. TU/e course 2IN30. Prof.dr.ir. Kees van Berkel Dr. Johan Lukkien [Dr.ir. Ad Peeters, Philips Nat.

Lecture 5: Logical Effort

Weste04r4.fm Page 67 Monday, January 5, :24 AM. 4.1 Introduction

Lecture 10: Latch and Flip-Flop Design. Outline

Analog & Digital Electronics Course No: PH-218

Introduction to Digital System Design

Alpha CPU and Clock Design Evolution

TRUE SINGLE PHASE CLOCKING BASED FLIP-FLOP DESIGN

A Survey on Sequential Elements for Low Power Clocking System

LFSR BASED COUNTERS AVINASH AJANE, B.E. A technical report submitted to the Graduate School. in partial fulfillment of the requirements

ELEC EXPERIMENT 1 Basic Digital Logic Circuits

Digital Electronics Detailed Outline

SC728/SC729. 2A Low Vin, Very Low Ron Load Switch. POWER MANAGEMENT Features. Description. Applications. Typical Application Circuit SC728 / SC729

Chapter 9 Semiconductor Memories. Jin-Fu Li Department of Electrical Engineering National Central University Jungli, Taiwan

Design Verification and Test of Digital VLSI Circuits NPTEL Video Course. Module-VII Lecture-I Introduction to Digital VLSI Testing

6.004 Computation Structures Spring 2009

Upon completion of unit 1.1, students will be able to

Low leakage and high speed BCD adder using clock gating technique

Gates. J. Robert Jump Department of Electrical And Computer Engineering Rice University Houston, TX 77251

CHARGE pumps are the circuits that used to generate dc

A Novel Low Power, High Speed 14 Transistor CMOS Full Adder Cell with 50% Improvement in Threshold Loss Problem

CO2005: Electronics I (FET) Electronics I, Neamen 3th Ed. 1

1. True or False? A voltage level in the range 0 to 2 volts is interpreted as a binary 1.

Module 7 : I/O PADs Lecture 33 : I/O PADs

Mixed Radix Design Flow for Security Applications

Two-Phase Clocking Scheme for Low-Power and High- Speed VLSI

University of Texas at Dallas. Department of Electrical Engineering. EEDG Application Specific Integrated Circuit Design

Lecture 11: Sequential Circuit Design

Gates & Boolean Algebra. Boolean Operators. Combinational Logic. Introduction

Lecture 8: Synchronous Digital Systems

After opening the Programs> Xilinx ISE 8.1i > Project Navigator, you will come to this screen as start-up.

TDA4605 CONTROL CIRCUIT FOR SWITCH MODE POWER SUPPLIES USING MOS TRANSISTORS

Asynchronous IC Interconnect Network Design and Implementation Using a Standard ASIC Flow

Two's Complement Adder/Subtractor Lab L03

Lab 3 Layout Using Virtuoso Layout XL (VXL)

Sequential Circuits. Combinational Circuits Outputs depend on the current inputs

Lab #5: Design Example: Keypad Scanner and Encoder - Part 1 (120 pts)

ZL40221 Precision 2:6 LVDS Fanout Buffer with Glitchfree Input Reference Switching and On-Chip Input Termination Data Sheet

Document Contents Introduction Layout Extraction with Parasitic Capacitances Timing Analysis DC Analysis

Harmonics and Noise in Photovoltaic (PV) Inverter and the Mitigation Strategies

Content Map For Career & Technology

Nexus: An Asynchronous Crossbar Interconnect for Synchronous System-on-Chip Designs

A true low voltage class-ab current mirror

AN460 Using the P82B96 for bus interface

Occupational Profile: Electrical & Electronics Engineering Technician

Transcription:

Infineon Page 1 CHES 2006 - Yokohama Three-Phase Dual-Rail Pre-Charge Logic L. Giancane, R. Luzzi, A. Trifiletti {marco.bucci, raimondo.luzzi}@infineon.com {giancane, trifiletti}@die.mail.uniroma1.it

Summary Advantages of pre-charged dual-rail logics Some implementation issues Impact of unbalanced loads and circuit asymmetries on the actual robustness against DPA The proposed logic style (TDPL) vs a reference dual-rail logic style (SABL) Simulation testbench and results on basic gates Case study: FULLADDER Page 2

Application scenario: masking a state machine Busses and registers can be easily XOR-masked. Unfortunately, this does not hold for combinatorial functions that, in general, are not linear with the XOR operator. x clk XOR(s m, m) F(x, s) XOR(s, m ) REG Dual-Rail pre-charged logic is a countermeasure suitable to be transparently applied to combinatorial functions. It has also the advantage to be glitch-free. Page 3

Just two gates are needed for the implementation of a complete dual-rail family Basic gates AND XOR NOT NAND XNOR OR NOR Using a dynamic logic, gates are cascoded in Domino style. Therefore, different fanouts can be simply obtained by using a suitable pair of static inverters Page 4

Load capacitances in a dual-rail gate The actual robustness of a dual-rail device depends on the actual balancing of its implementation. Load capacitance are the main issue, but also internal nodes are relevant. In facts, an accurate load balancing requires a full custom design flow (i.e. manual or semi-manual routing and a large amount of post-layout simulations). Page 5

Effect of unbalancing of complementary wires d(t) = X 1 (t) " X 0 (t) " X 0 X 0 (t) d(t) X 1 (t) " X 1 A DPA peak appears when a set X of traces is partitioned on two sets X 0 and X 1 in such a way their averages can be distinguished. Page 6

Effect of unbalancing of complementary wires (cont.) d " d is, in facts, the signal/noise ratio of a DPA peak where What does it mean? " 2 2 d = " X 0 2 + " X 1 and " X i = " X i N i As an example: a 5% unbalanced dual-rail will feature only a 1/20 attenuation and could be attacked, using 400*N traces, the same as a single rail using N traces....and 5% is a quite optimistic unbalancing estimation. Page 7

How can we make the energy consumption of the circuit balanced? Normally, both rails are pre-charged, but only one is discharged during the evaluation phase. z z z z evaluation 0 1 pre-charge The problem comes from the fact that the energy consumption depends on which wire (or internal node) switches. Page 8

Three-phase dual-rail In order to balance energy consumption we can simply make both the wires switch by adding a discharge phase. Page 9 z z z z evaluation 0 1 discharge pre-charge As a matter of fact, we can notice that now the information is not anymore on which wire switches, but on when wire switches (i.e. we use a sort of phaseencoding).

The proposed logic style (inverter) Page 10 1. charge: outputs are pre-charged to VDD via P 1, P 2, P 5 ; 2. evaluation: N 7 is closed thus discharging one of the outputs via the pull down circuit (N 5, N 6 ) according on the input lines; 3. discharge: the additional pull down N 1, N 3, discharges the output line that was not discharged during the evaluation phase.

SABL vs TDPL SABL inverter TDPL inverter In TDPL, two additional pull-downs NMOS (N 1, N 4 ) and a PMOS switch (P 1 ) are added in order to implement the discharge phase. Page 11

Other gates in TDPL As in SABL, gates differ only in the pull-down circuit. AND XOR Page 12 Notice that, being a dual-rail logic, the AND gate is actually a universal gate (AND, NAND, OR, NOR).

Simulation testbench (TDPL vs SABL) Process: Infineon 0.12m, 1.5V supply voltage; Transistor sizes: W = 0.68m; L = 0.12m; Simulation: Spectre/BSIM3v3. Since power consumption mainly depends on transitions, all possible input transitions have been simulated. Page 13

NAND current traces for all input transitions: SABL vs TDPL + non constant constant + Page 14

Simulation results for the three basic gates NED = max(e) " min(e) max(e) NSD = " E E Page 15

Case study: TDPL vs SABL Full Adder Dual-rail FULLADDER based on XOR and NAND gates cascaded using a Domino logic (static inverters). Notice that, in TDPL, static inverters do not cause unbalanced energy consumption since, as the connection wires, each inverter switches the same number of times. Page 16

FULLADDER current traces: SABL vs TDPL Page 17

Simulation results for the FULLADDER (8x8) traces related to all possible input transitions each gate is unbalanced as shown in the previous testbench Page 18 NSD improvement over 25 times power consumption increasing about 50%

Balanced vs unbalanced TDPL Balanced and unbalanced TDPL behave almost the same: i.e. the goal to avoid full-custom layout seems to be achieved C BALANCED [pq] C UNBALANCED 10% [pq] 0,1761 0,1836 0,1761 0,1836 0,1764 0,1839 0,1764 0,1839 0,1761 0,1836 0,1761 0,1836 0,1764 0,1839 0,1764 0,1839 0,1764 0,1839 0,1764 0,1839 0,1761 0,1836 0,1761 0,1836 0,1764 0,1839 0,1764 0,1839 0,1761 0,1836 0,1761 0,1836 C BALANCED C UNBALANCED 10% MAX 0,1764 0,1839 MIN 0,1761 0,1836 AED 0,0003 0,0003 NED 0,002 0,002 SD 0,000 0,000 NSD 0,001 0,001 Page 19 The residual leakage seems to depend on second order effects such as: - parasitics which depend on the state of adiacent nodes; - power transfer through inputs; -...

Conclusions DPA-resistant dual-rail logic family suitable for a semicustom design flow almost constant energy consumption even in presence of heavily asymmetric interconnections and pull-downs: about 25 times more balanced than SABL in case of a FULLADDER; in facts, the same robustness as a full-custom dual-rail logic power consumption and area penalties smaller than MDPL Page 20

Questions? Page 21