Low Power Design of Johnson Counter Using DDFF Featuring Efficient Embedded Logic and Clock Gating

Similar documents
Design and analysis of flip flops for low power clocking system

S. Venkatesh, Mrs. T. Gowri, Department of ECE, GIT, GITAM University, Vishakhapatnam, India

A Survey on Sequential Elements for Low Power Clocking System

TRUE SINGLE PHASE CLOCKING BASED FLIP-FLOP DESIGN

Sequential 4-bit Adder Design Report

LOW POWER DUAL EDGE - TRIGGERED STATIC D FLIP-FLOP

Lecture 10: Latch and Flip-Flop Design. Outline

HIGH SPEED AREA EFFICIENT 1-BIT HYBRID FULL ADDER

A New Low Power Dynamic Full Adder Cell Based on Majority Function

Efficient Interconnect Design with Novel Repeater Insertion for Low Power Applications

LOW POWER DESIGN OF DIGITAL SYSTEMS USING ENERGY RECOVERY CLOCKING AND CLOCK GATING

Design of Low Power One-Bit Hybrid-CMOS Full Adder Cells

A Novel Low Power Fault Tolerant Full Adder for Deep Submicron Technology

Design of Energy Efficient Low Power Full Adder using Supply Voltage Gating

True Single Phase Clocking Flip-Flop Design using Multi Threshold CMOS Technique

ASYNCHRONOUS COUNTERS

A Novel Low Power, High Speed 14 Transistor CMOS Full Adder Cell with 50% Improvement in Threshold Loss Problem

NEW adder cells are useful for designing larger circuits despite increase in transistor count by four per cell.

Static-Noise-Margin Analysis of Conventional 6T SRAM Cell at 45nm Technology

International Journal of Electronics and Computer Science Engineering 1482

Leakage Power Reduction Using Sleepy Stack Power Gating Technique

Counters and Decoders

IEEE. Proof. INCREASING circuit speed is certain to remain the major. Dual-Edge Triggered Storage Elements and Clocking Strategy for Low-Power Systems

Having read this workbook you should be able to: recognise the arrangement of NAND gates used to form an S-R flip-flop.

LOW POWER MULTIPLEXER BASED FULL ADDER USING PASS TRANSISTOR LOGIC

INTERPRETATION of published results comparing various

ISSCC 2003 / SESSION 13 / 40Gb/s COMMUNICATION ICS / PAPER 13.7

High Speed Gate Level Synchronous Full Adder Designs

Two-Phase Clocking Scheme for Low-Power and High- Speed VLSI

Experiment # 9. Clock generator circuits & Counters. Eng. Waleed Y. Mousa

Lecture 7: Clocking of VLSI Systems

Module 3: Floyd, Digital Fundamental

Optimization and Comparison of 4-Stage Inverter, 2-i/p NAND Gate, 2-i/p NOR Gate Driving Standard Load By Using Logical Effort

Latches, the D Flip-Flop & Counter Design. ECE 152A Winter 2012

Memory Elements. Combinational logic cannot remember

Topics of Chapter 5 Sequential Machines. Memory elements. Memory element terminology. Clock terminology

A high Speed 8 Transistor Full Adder Design using Novel 3 Transistor XOR Gates

Chapter 2 Logic Gates and Introduction to Computer Architecture

IN RECENT YEARS, the increase of data transmission over

High Performance Low Power Dual Edge Triggered Static D Flip-Flop

Clocking. Figure by MIT OCW Spring /18/05 L06 Clocks 1

Performance of Flip-Flop Using 22nm CMOS Technology

Power Reduction Techniques in the SoC Clock Network. Clock Power

CHAPTER 11: Flip Flops

DESIGN CHALLENGES OF TECHNOLOGY SCALING

Introduction to CMOS VLSI Design (E158) Lecture 8: Clocking of VLSI Systems

Low Power AMD Athlon 64 and AMD Opteron Processors

Lesson 12 Sequential Circuits: Flip-Flops

WEEK 8.1 Registers and Counters. ECE124 Digital Circuits and Systems Page 1

路 論 Chapter 15 System-Level Physical Design

Digital Logic Design. Basics Combinational Circuits Sequential Circuits. Pu-Jen Cheng

CMOS Binary Full Adder

L4: Sequential Building Blocks (Flip-flops, Latches and Registers)

Digital Systems Based on Principles and Applications of Electrical Engineering/Rizzoni (McGraw Hill

A CDMA Based Scalable Hierarchical Architecture for Network- On-Chip

Sequential Logic: Clocks, Registers, etc.

So far we have investigated combinational logic for which the output of the logic devices/circuits depends only on the present state of the inputs.

Sequential Logic. (Materials taken from: Principles of Computer Hardware by Alan Clements )

BURST-MODE communication relies on very fast acquisition

DIGITAL TECHNICS II. Dr. Bálint Pődör. Óbuda University, Microelectronics and Technology Institute. 2nd (Spring) term 2012/2013

A 3.2Gb/s Clock and Data Recovery Circuit Without Reference Clock for a High-Speed Serial Data Link

A MULTILEVEL INVERTER FOR SYNCHRONIZING THE GRID WITH RENEWABLE ENERGY SOURCES BY IMPLEMENTING BATTERY CUM DC-DC CONERTER

Multi-GHz Systems Clocking Invited Paper

Theory of Logic Circuits. Laboratory manual. Exercise 3

ECE380 Digital Logic

ETEC 2301 Programmable Logic Devices. Chapter 10 Counters. Shawnee State University Department of Industrial and Engineering Technologies

Lecture 8: Synchronous Digital Systems

A 1.62/2.7/5.4 Gbps Clock and Data Recovery Circuit for DisplayPort 1.2 with a single VCO

Architectural Level Power Consumption of Network on Chip. Presenter: YUAN Zheng

Pass Gate Logic An alternative to implementing complex logic is to realize it using a logic network of pass transistors (switches).

Chapter 10 Advanced CMOS Circuits

DIGITAL COUNTERS. Q B Q A = 00 initially. Q B Q A = 01 after the first clock pulse.

LFSR BASED COUNTERS AVINASH AJANE, B.E. A technical report submitted to the Graduate School. in partial fulfillment of the requirements

Sequential Circuit Design

COMBINATIONAL and SEQUENTIAL LOGIC CIRCUITS Hardware implementation and software design

Latch Timing Parameters. Flip-flop Timing Parameters. Typical Clock System. Clocking Overhead

DIGITAL SYSTEM CLOCKING. High-Performance and Low-Power Aspects

CONSTRAINT RANDOM VERIFICATION OF NETWORK ROUTER FOR SYSTEM ON CHIP APPLICATION

Master/Slave Flip Flops

Performance Comparison of an Algorithmic Current- Mode ADC Implemented using Different Current Comparators

INTEGRATED CIRCUITS. For a complete data sheet, please also download:

10 BIT s Current Mode Pipelined ADC

CHAPTER 16 MEMORY CIRCUITS

SEQUENTIAL CIRCUITS. Block diagram. Flip Flop. S-R Flip Flop. Block Diagram. Circuit Diagram

Automated Switching Mechanism for Multi-Standard RFID Transponder

Layout of Multiple Cells

EE 42/100 Lecture 24: Latches and Flip Flops. Rev B 4/21/2010 (2:04 PM) Prof. Ali M. Niknejad

CMOS Thyristor Based Low Frequency Ring Oscillator

DC/DC BUCK Converter for Renewable Energy Applications Mr.C..Rajeshkumar M.E Power Electronic and Drives,

To design digital counter circuits using JK-Flip-Flop. To implement counter using 74LS193 IC.

Clock Distribution in RNS-based VLSI Systems

Introduction to CMOS VLSI Design

A Digital Timer Implementation using 7 Segment Displays

Sequential Circuits. Combinational Circuits Outputs depend on the current inputs

Digital to Analog Converter. Raghu Tumati

Study on Differential Protection of Transmission Line Using Wireless Communication

NTE2053 Integrated Circuit 8 Bit MPU Compatible A/D Converter

Floating Point Fused Add-Subtract and Fused Dot-Product Units

Module 4 : Propagation Delays in MOS Lecture 22 : Logical Effort Calculation of few Basic Logic Circuits

Transcription:

Low Power Design of Johnson Counter Using DDFF Featuring Efficient Embedded Logic and Clock Gating Chandra shekhar kotikalapudi, Smt. P. Pushpalatha, Dept. of Electronics and communications engineering University College of engineering, JNTUK, Kakinada, India E-Mail-Sekhar2790@gmail.com Abstract In this paper, we have proposed a power efficient design of a 4-bit up-down Johnson counter by using Dual Dynamic Node Pulsed Flip-flop(DDFF) featuring efficient embedded logic module(ddff-elm) and then clock gating is incorporated in order to reduce the power dissipation further. The proposed design employs a DDFF which mainly reduces the power consumption by eliminating the large capacitance present in the pre-charge node of several existing flip-flop designs by separately driving the output pull-up and pull-down transistors by following a split dynamic node structure. This reduces the power up to 40% compared to conventional architectures of the flip-flops. Then the Embedded logic module is an efficient method to incorporate complex logic functions into the flip-flop. Clock gating is applied to reduce the power consumption up to 50% due to unnecessary clock activities. Key words Johnson counter, DDFF, DDFF-ELM, clock gating, XCFF, CDMFF, low power. I. INTRODUCTION Low power, high speed and area efficient circuit design is the major concern in now-a-day VLSI design. Designing a low power circuit involves a proper architecture of the sequential and combinational circuits used in the design by using minimum CMOS logic gates and then eliminating the redundant operations by using efficient techniques. In order to improve the performance of flip-flop architecture and reduce the power consumption extensive work has been carried out in the past few decades. With steady growth in clock frequency and chip capacity, power dissipation of the CMOS design has been increasing tremendously. This results in necessity for development of new techniques for reducing the power dissipation in VLSI design. High speed can be achieved in synchronous systems by using advanced pipelining techniques. In all kinds of digital circuits flip-flops are used as basic storage elements. Design styles of flip-flops can be classified as static and dynamic. Static flip-flops are those which can preserve their stored values even if the clock is stopped. Many static design flip-flop topologies have been proposed in the past. Classic transmission-gate latch based master-slave flip-flop (TGMS) and PowerPC 603 master slave latch are the examples of static design style. Basically dynamic flipflops can achieve higher speed and lower power consumption. Dynamic flip-flops can be either purely dynamic or pseudo dynamic structures. Hybrid latch flip-flop (HLFF) and Semi-dynamic flip-flop (SDFF) are the examples of pseudo dynamic structures. Purely dynamic structures are Conditional data mapping flip-flop (CDMFF) and Cross charge control flip-flop (XCFF). To eliminate the drawback of the existing flip-flop architectures a Dual dynamic node hybrid flip-flop (DDFF) has been developed. And to reduce the pipeline overhead an embedded logic module (DDFF-ELM) has been developed. Both of these eliminate the drawbacks present in dynamic flip-flop architecture. In this paper, we have proposed a power efficient design of 4-bit up-down Johnson counter using DDFF-ELM and applying clock gating. This design using DDFF-ELM reduces the power consumption up to 27% compared to conventional flip-flop architectures. Counter designed by applying clock gating has a maximum power reduction of up to 40%. In section II, a brief introduction of conventional flip-flop architectures along with their disadvantages has been described and challenges in achieving high performance are also been discussed. Section III provides the details about the 4-bit synchronous up-down Johnson counter. Section IV provides the details of proposed 4-bit Johnson counter using clock 572 www.ijergs.org

gating. In section V, we have provided simulation results of the proposed design. And finally, in section VI, we conclude with the improvements of the proposed design. II. FLIP-FLOP ARCHITECTURES In order to improve the performance of the flip-flops to achieve higher speeds and low power consumption several flip-flop architectures have been designed over the past decade. All these architectures can be grouped under static and dynamic design styles. Static flip-flops are those which can preserve their stored values even if the clock is stopped. Many static design flip-flop topologies have been proposed in the past. Classic transmission-gate latch based master-slave flip-flop (TGMS) and PowerPC 603 master slave latch are the examples of static design style. TGMS is realized by using two transmission gate based latches operating on complementary clocks. PowerPC 603 master slave flip-flop is a combination of TGMS flip-flop and mc 2 MOS flip-flop. Main features of the static designs are that they dissipate low power and also they have low clock-to-output (clock-q) delay. But because of their large positive setup time static designs have large data to output delay. Whenever in the designs if the speed is not a concern static designs are more suitable. Modern high performance flip-flops are considered under dynamic design style. Dynamic designs may be purely dynamic or pseudo dynamic. Pseudo dynamic design styles are also referred to as semi-dynamic or hybrid structures as they consist of static output and dynamic frontend. The flip-flops under this category are Hybrid latch flip-flop (HLFF) and Semi-dynamic flip-flop (SDFF). HLFF is basically a level sensitive latch which is clocked with an internally generated sharp pulse. This sharp pulse is generated at the positive edge of the clock and delayed version of clock. Two main building blocks of SDFF are a pulse generator and a level sensitive latch. With an internally generated sharp pulse which is of short duration latch is clocked so as to make it behave like a flip-flop. Both HLFF and SDFF benefit from clock overlap to perform latching operation. Even though HLFF is not the fastest but it has low power consumption. The reason for being slow compared to SDFF is that HLFF has longer stack of nmos transistors at the output node. SDFF is the fastest classic hybrid structure but because of the large clock load and large pre charge capacitance it is not efficient as far as power consumption is considered. In hybrid structured designs redundant data transitions and large pre charge capacitance are the main sources of power dissipation. In order to eliminate these two problems different architectures have been designed. To reduce the redundant operations in the flip-flops Conditional data mapping flip-flop (CDMFF) is the most efficient architecture. Similarly Cross charge control flip-flop (XCFF) is considered as the best architecture to eliminate the large pre charge capacitance. In CDMFF unwanted transitions are eliminated by using an output feedback structure to conditionally feed the data to flip-flop. Thus when a redundant event is predicted this architecture reduces power dissipation by eliminating unwanted transitions. This flip-flop is bulky because of the presence of additional transistors in conditional and has more power consumption at higher data activities. In XCFF the large pre charge capacitance present at the output node is eliminated by driving output pull-up and pull-down transistors separately. Total power consumption is reduced considerably as only one of the two dynamic nodes is switched during one clock cycle. Main drawbacks in this design are conditional shutoff and charge sharing. The Dual dynamic node hybrid flip-flop (DDFF) eliminates the large pre charge capacitance present in the output node of several conventional designs by following a split dynamic node structure to separately drive the output pull up and pull down transistors. Figure 1 shows the architecture of DDFF. Fig 1: DDFF 573 www.ijergs.org

Fig 2: DDFF-ELM (MULTIPLEXER) In the DDFF architecture, node X1 is pseudo-dynamic and X2 is dynamic. Instead of the conditional shutoff mechanism present in XCFF an unconditional one is present in DDFF. Its operation is based on whether the clock is high or low. The performance improvements show that the DDFF design is well suited for modern high performance designs where minimum delay and low power dissipation are required. The figure 2 shows the architecture of DDFF-ELM. Complex logic functions can be efficiently embedded into the architecture of DDFF. The main advantages of DDFF-ELM over the other flip-flops that are having embedded logics are lower power consumption and capable of embedding complex logic functions. III. 4-BIT UP-DOWN JOHNSON COUNTER Counter is a device that stores the number of times a particular event or process has occurred often in relation to a clock signal. Counters are used in almost all the digital circuits for counting operations. There are many types of counters used in digital circuits. Johnson counter also called as twisted ring counter is the modified form of a ring counter. Figure 3 shows the architecture of 4-bit synchronous up-down Johnson counter. In Johnson counter output of the last stage is complemented and connected to the input of the first stage. Fig 3:4-bit up-down Johnson counter In the 4-bit up-down Johnson counter designed Dual dynamic pulsed hybrid flip-flop is used. By embedding a multiplexer into the flip-flop architecture as shown in the figure 2 counting operation can be performed in either up counting mode or down counting mode. Last stage output is complemented and connected to the input of the first stage for an up counter and first stage output is complemented and connected to last stage input as shown in the figure 3. The counting sequence repeats for every eight clock pulses. The counting sequences of the Johnson up and down counters are shown in table 1 and table 2. 574 www.ijergs.org

clock Q0 Q1 Q2 Q3 0 0 0 0 0 1 1 0 0 0 2 1 1 0 0 3 1 1 1 0 4 1 1 1 1 5 0 1 1 1 6 0 0 1 1 7 0 0 0 1 8 0 0 0 0 Table 1: Johnson up counter clock Q0 Q1 Q2 Q3 0 0 0 0 0 1 0 0 0 1 2 0 0 1 1 3 0 1 1 1 4 1 1 1 1 5 1 1 1 0 6 1 1 0 0 7 1 0 0 0 8 0 0 0 0 Table 2: Johnson down counter IV. JOHNSON COUNTER USING CLOCK GATING Clock gating technique is used to reduce the dynamic power dissipation by stopping the clock signal to the segments of the circuit that are inactive at that instant of time. Clock transitions contribute to the major part of power consumption in a digital circuit. By eliminating the unwanted distribution of the clock signal to the segments that are not active power dissipation can be reduced in the digital circuits. For clock gating to the Johnson counter in this design XOR and NAND gates are used. Fig 4: Clock gating to Johnson counter Each of the Clock gating blocks in the figure 4 consists of combination of XOR and NAND gate. Clock gating module is shown in figure 5. Fig 5: Clock gating module for generation of clock for Q0 575 www.ijergs.org

V o lta g e (V ) V o lta g e (V ) V o lta g e (V ) V o lta g e (V ) V o lta g e (V ) v( c lk) v( Q 0) v( Q 1) v( Q 2) v( Q 3) International Journal of Engineering Research and General Science Volume 2, Issue 5, August-September, 2014 V. SIMULATION RESULTS The existing and the proposed design are implemented in Tanner EDA tools in 250nm technology. Schematics for the design are designed in S-edit.Figure 1 shows the schematic design of the DDFF and figure 6 shows the timing diagram of the DDFF simulated using T-spice and resulting waveforms are obtained in W-edit. Fig 6: Timing diagram of DDFF The simulated waveform of the Johnson counter is shown in figure 7. Fig 7: Timing diagram of Johnson counter The layout designs of the DDFF, DDFF-ELM (MULTIPLEXER) and the Johnson counter are shown in the figures 8, 9 and 10 respectively. Layouts are designed and areas of the designs are calculated by using L-EDIT. Fig 8: Layout design of the DDFF 576 www.ijergs.org

Fig 8: Layout design of the DDFF-ELM (MUX) Fig 8: Layout design of the Johnson counter Simulation results of the various dynamic flip-flops are shown in table3. Flip-flop Total Layout area (μm 2 ) Total Power (μw) Minimum D-Q (ns) PDP (fj) CDMFF 540.67 62.34 1.37 85.40 XCFF 470.43 68.53 1.31 89.77 DDFF 430.01 59.62 1.34 79.89 DDFF-ELM 528.74 85.90 1.29 110.81 Table 3: Performance comparison of various dynamic flip-flops (250nm technology) Area and power consumption comparison of the Johnson counter with and without clock gating are shown in table 4. Johnson counter Power consumption (μw) Total Layout Area ((μm 2 ) Without Clock Gating 344.24 2820.32 With Clock Gating 160.56 2932.62 Table 4: Johnson counter with and without Clock gating 577 www.ijergs.org

Power consumption of the 4-bit synchronous up-down Johnson counter is 344.24 μw. Power consumption of the 4-bit synchronous up-down Johnson counter with clock gating is 160.56 μw. So by using the clock gating power dissipation in the Johnson counter is reduced up to 50%. VI. CONCLUSION In this paper, a dual dynamic node pulsed hybrid flip-flop (DDFF), an embedded logic module, a 4-bit synchronous up-down Johnson counter and clock gated Johnson counter are presented. Simulation results show improvements in power and speed. The dual dynamic node pulsed hybrid flip-flop (DDFF) shows improvement over the cross charge control flip-flop (XCFF) as it eliminates the redundant power dissipation. Further complex logic function can be incorporated efficiently in to the flip-flop. The 4-bit up-down Johnson counter with clock gating shows an improvement in terms of power consumption of up to 50% compared to the 4-bit up-down counter without clock gating though there is an increase in area. Thus in modern high performance designs where power consumption is a major concern the presented architecture is well suited. REFERENCES: [1] Kalarikkal Absel, Lijo Manuel and R. K. Kavitha, 2013, Low-Power Dual Dynamic Node Pulsed Hybrid Flip-Flop Featuring Efficient Embedded Logic, VLSI Systems, IEEE Transactions (Volume: 21 Issue: 9) pp. 1693 1704. [2] Ahmed Sayed and Hussain Al-Assad, 2011, Low-Power Flip-Flops: Survey, Comparative Evaluation and a New Design, IACSIT International Journal of Engineering and Technology, Vol.3, No.3. [3] Ismail.S.M and Islam.F.T. 2012, Low power design of Johnson Counter using clock gating International Conference on Computer and Information Technology (ICCIT), pp 510-517. [4] Itamer Levi and Alexander Fish, 2013, Dual Mode Logic Design for energy efficiency and high performance, IEEE Access. [5] F. Klass, Semi-dynamic and dynamic flip-flops with embedded logic, in Proc. Symp. VLSI Circuits Dig. Tech. Papers, Honolulu, HI, Jun. 1998, pp. 108 109. [6] J. M. Rabaey, A. Chandrakasan, and B. Nikolic, Digital Integrated Circuits: A Design Perspective, 2nd ed. Englewood Cliffs, NJ: Prentice-Hall, 2003. [7] N. Nedovic and V. G. Oklobdzija, Hybrid latch flip-flop with improved power efficiency, in Proc. Symp. Integr.Circuits Syst.Design, 2000, pp.211 215. [8] Li Li, Ken Choi, and Haiqing Nan, Activity-Driven Fine-grained Clock Gating and Run Time Power Gating Integration IEEE Transactions on very large scale integration (VLSI) systems, vol. 21, no. 8, august 2013. [9] G. Gerosa, S. Gary, C. Dietz, P. Dac, K. Hoover, J. Alvarez, H. Sanchez, P. Ippolito, N. Tai, S. Litch, J. Eno, J. Golab, N. Vanderschaaf, and J. Kahle, A 2.2 W, 80 MHz superscalar RISC microprocessor, IEEE J. Solid-State Circuits, vol. 29, no. 12, pp. 1440 1452, Dec. 1994. [10] V. Stojanovic and V. Oklobdzija, Comparative analysis of master slave latches and flip-flops for high-performance and low-power systems, IEEE J. Solid-State Circuits, vol. 34, no. 4, pp. 536 548, Apr. 1999. [11] B.-S. Kong, S.-S. Kim and Y.-H. Jun, Conditional-capture flip-flop for statistical power reduction, IEEE J. Solid-State Circuits, vol. 36, no., pp. 1263 1271, Aug. 2001. [12] N. Nedovic, M. Aleksic, and V. G. Oklobdzija, Conditional pre-charge techniques for power-efficient dual-edge clocking, in Proc. Int. Symp. Low-Power Electron. Design, 2002, pp. 56 59. [13] P. Zhao, T. K. Darwish, and M. A. Bayoumi, High-performance and low-power conditional discharge flip-flop, IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 12, no. 5, pp. 477 484, May 2004. [14] C. K. Teh, M. Hamada, T. Fujita, H. Hara, N. Ikumi, and Y. Oowaki, Conditional data mapping flip-flops for low-power and high performance systems, IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 14, no. 12, pp. 1379 1383, Dec. 2006. [15] S. H. Rasouli, A. Khademzadeh, A. Afzali-Kusha, and M. Nourani, Low-power single- and double-edge-triggered flip-flops for high-speed applications, Proc. Inst. Elect. Eng. Circuits Devices Syst., vol. 152, no. 2, pp. 118 122, Apr. 2005. Very Large Scale Integr. (VLSI) Syst., vol. 17, no. 1, pp. 33 44, Jan. 2009. [17] A. Ma and K. Asanovic, A double-pulsed set-conditional-reset flip flop, Laboratory for Computer Science, Massachusetts Inst. technology, Cambridge, Tech. Rep. MIT-LCS-TR-844, May 2002. [18] O. Sarbishei and M. Maymandi-Nejad, Power-delay efficient overlapbased charge-sharing free pseudo-dynamic D flip-flops, in Proc. IEEE Int. Symp. Circuits Syst., May 2007, pp. 637 640. [19] O. Sarbishei and M. Maymandi-Nejad, A novel overlap-based logic cell: An efficient implementation of flip flops with embedded logic, IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 18, no. 2, pp. 222 231, Feb. 2010. [20] M. Hansson and A. Alvandpour, Comparative analysis of process variation impact on flip-flop power-performance, in Proc. IEEE Int.Symp. Circuits Syst., May 2007, pp. 3744 3747. 578 www.ijergs.org

[21] S. Yang, W. Wolf, N. Vijaykrishnan, Y. Xie, and W. Wang, Accurate stacking effect macro-modeling of leakage power in sub- 100 nm [16] H. Mahmoodi, V. Tirumalashetty, M. Cooke, and K. Roy, Ultra low power clocking scheme using energy recovery and clock gating, IEEE Trans. Circuits, in Proc. IEEE 18th Int. Conf. VLSI Design, Jan. 2005, pp. 165 170. [22] Y.-F. Tsai, D. Duarte, N. Vijaykrishnan, and M. J. Irwin, Implications of technology scaling on leakage reduction techniques, in Proc. Design Autom. Conf., Jun. 2003, pp. 187 190 579 www.ijergs.org