CSE477 VLSI Digital Circuits Fall Lecture 18/19: Datapath Design/Fast Adders
|
|
- Jessie Lindsey
- 7 years ago
- Views:
Transcription
1 CSE477 VLSI Digital Circuits Fall 2 Lecture 8/9: Datapath Design/Fast Adders [Adapted in part from Rabaey s Digital Integrated Circuits, Prentice Hall, 995] CSE477 L8 Datapath Design. Irwin&Vijay, PSU, 2
2 Major Components of a Computer Processor Devices Control Memory Input Datapath Output CSE477 L8 Datapath Design.2 Irwin&Vijay, PSU, 2
3 MIPS Pipelined Datapath IFetch Dec Exec Mem WB PC 4 Instruction Memory Read Address Add IF/Dec Read Addr Register Read Read Addr 2Data File Write Addr Write Data Read Data 2 Dec/Exec Shift left 2 Add ALU Exec/Mem Data Address Memory Write Data Read Data Mem/WB Sign 6 Extend 32 CSE477 L8 Datapath Design.3 Irwin&Vijay, PSU, 2
4 Bit-Sliced Approach Control Data In Register File Multiplexer Adder Shifter Bit 3 Bit 2 Bit Data Out Bit Tile identical processing elements CSE477 L8 Datapath Design.4 Irwin&Vijay, PSU, 2
5 Basic Building Blocks Datapath Execution units - Adder, multiplier, divider, shifter, comparator, etc. Register file and pipeline registers Multiplexors Control Finite state machines (PLA, ROM, random logic) Interconnect Switches, arbiters, buses Memory Caches, TLBs, DRAM, buffers CSE477 L8 Datapath Design.5 Irwin&Vijay, PSU, 2
6 The -bit Binary Adder A B C in C out S carry status A B C in -bit Full Adder (FA) S kill kill propagate propagate propagate propagate C out generate generate S = A B C in C out = A B v A C in v B C in (majority function) How can we use it to build a 32-bit adder? How can we modify it easily to build an adder/subtractor? How can we make it better (faster, lower power, smaller)? CSE477 L8 Datapath Design.6 Irwin&Vijay, PSU, 2
7 FA Gate Implementation A B C in A B C in C out S t t t2 t2 t t C out S CSE477 L8 Datapath Design.7 Irwin&Vijay, PSU, 2
8 XOR FA C in A B S C out 6 transistors CSE477 L8 Datapath Design.8 Irwin&Vijay, PSU, 2
9 CPL FA B B C in C in A!Sum A Sum B B C in C in A!C out B C in A C out B C in 28 transistors dual rail CSE477 L8 Datapath Design.9 Irwin&Vijay, PSU, 2
10 Delay Balanced FA B!B C in!p Identical Delays for Carry and Sum B p p A p A p!c out C in S C in!b A!p p!p!p Sum generation Carry generation Signal set-up 22 transistors CSE477 L8 Datapath Design. Irwin&Vijay, PSU, 2
11 A 32-bit Adder/Subtractor Ripple Carry Adder (RCA) built out of 32 FAs add/subt A C =C in -bit FA S Subtraction complement all subtrahend bits (xor gates) and set the low order carry-in RCA B B A A 2 C -bit FA S C 2 -bit FA S 2 advantage: simple logic, so small (low cost) B 2... C 3 disadvantage: slow (O(N)) and lots of glitching (so lots of energy consumption) A 3 C 3 -bit FA S 3 B 3 C 32 =C out CSE477 L8 Datapath Design. Irwin&Vijay, PSU, 2
12 Ripple Carry Adder (RCA) A 3 B 3 A 2 B 2 A B A B C out =C 4 FA FA FA FA C =C in S 3 S 2 S S T = O(N) worst case delay T adder T FA (A,B C out ) + (N-2)T FA (C in C out ) + T FA (C in S) Real Goal: Make the fastest possible carry path CSE477 L8 Datapath Design.2 Irwin&Vijay, PSU, 2
13 Mirror Adder -propagate -propagate A B B kill A B C in A!C out C in!sum A generate A B B A B C in C in B A C in A B C = AB + BCin +ACin 28 transistors S= ABCin + Co(A+B+Cin) CSE477 L8 Datapath Design.3 Irwin&Vijay, PSU, 2
14 The -bit Binary Adder A B C in C out S carry status A B C in -bit Full Adder (FA) S kill kill propagate propagate propagate propagate C out generate generate S = A B C in C out = A B v A C in v B C in (majority function) How can we use it to build a 32-bit adder? How can we modify it easily to build an adder/subtractor? How can we make it better (faster, lower power, smaller)? CSE477 L8 Datapath Design.4 Irwin&Vijay, PSU, 2
15 Mirror Adder Features The NMOS and PMOS chains are completely symmetrical with a maximum of two series transistors in the carry circuitry, guaranteeing identical rise and fall transitions if the NMOS and PMOS devices are properly sized. When laying out the cell, the most critical issue is the minimization of the capacitances at node!c out (four diffusion capacitances, two internal gate capacitances, and six next FA gate capacitances). The transistors connected to C in are placed closest to the output. Only the transistors in the carry stage have to be optimized for optimal speed. All transistors in the sum stage can be minimal size. CSE477 L8 Datapath Design.5 Irwin&Vijay, PSU, 2
16 Inversion Property A B A B C out FA C in C out FA C in S S!S (A, B, C in ) = S(!A,!B,!C in )!C out (A, B, C in ) = C out (!A,!B,!C in ) CSE477 L8 Datapath Design.6 Irwin&Vijay, PSU, 2
17 Exploiting the Inversion Property A 3 B 3 A 2 B 2 A B A B C out =C 4 FA FA FA FA C =C in S 3 S 2 S S odd cell even cell Minimizing the critical path the carry chain Need two flavors of FAs CSE477 L8 Datapath Design.7 Irwin&Vijay, PSU, 2
18 Fast Carry Chain Design The key to fast addition is a low latency carry network What matters is whether in a given position a carry is generated g i = A i B i propagated p i = A i B i (sometimes use A i v B i ) annihilated (killed) k i =!A i!b i Giving a carry recurrence of C i+ = g i + C i p i C = g + p C C 2 = g + p g + p p C C 3 = g 2 + p 2 g + p 2 p g + p 2 p p C C 4 = g 3 + p 3 g 2 + p 3 p 2 g + p 3 p 2 p g + p 3 p 2 p p C CSE477 L8 Datapath Design.8 Irwin&Vijay, PSU, 2
19 Carry Chains On the average, the longest carry chain in adding N-bit numbers is of length log 2 N Experimental results verify this log 2 N worst case approximation and suggest that log 2 (.25N) is a better estimate The conclusion is that typical carry chains are usually quite short CSE477 L8 Datapath Design.9 Irwin&Vijay, PSU, 2
20 Manchester Carry Chain Adders Switches controlled by g i and p i!c i+ g i pi!c i clock Total delay of time to form the switch control signals g i and p i setup time for the switches signal propagation delay through n switches in the worst case CSE477 L8 Datapath Design.2 Irwin&Vijay, PSU, 2
21 Domino Manchester Carry Chain p 3 p 2 p p CLK C i, g 3 2 g 2 3 g 4 g 5 C i, CLK!(g 2 + p 2 g + p 2 p g + p 2 p p C i, )!(g + p C i, )!(g 3 + p 3 g 2 + p 3 p 2 g + p 3 p 2 p g + p 3 p 2 p p C i, )!(g + p g + p p C i, ) CSE477 L8 Datapath Design.2 Irwin&Vijay, PSU, 2
22 Four Bit-Sliced MC Adder clock,,,, g i p i g i p i g i p i g i p i C in CSE477 L8 Datapath Design.22 Irwin&Vijay, PSU, 2
23 Coping with Carries Carry-completion sensing (asynchronous) sense when carry is done (average carry length is O(log n)) Carry lookahead, conditional sum, Deferred carry assimilation (e.g., in multiplication) - carry save adders, signed digit adders CSE477 L8 Datapath Design.23 Irwin&Vijay, PSU, 2
24 Binary Adders synchronous word parallel adders ripple carry adders (RCA) T = O(N), A = O(N) carry prop min adders signed-digit fast carry prop residue adders adders adders T = O(), A = O(N) Manchester carry carry conditional carry carry chain select lookahead sum skip T = O(N) A = O(N) T = O(log N) A = O(N log N) T = O(N**/2), A = O(N) CSE477 L8 Datapath Design.24 Irwin&Vijay, PSU, 2
25 CSE477 VLSI Digital Circuits Fall 2 Lecture 9: Fast Adders [Adapted in part from Rabaey s Digital Integrated Circuits, Prentice Hall, 995] CSE477 L8 Datapath Design.25 Irwin&Vijay, PSU, 2
26 Review: A 32-bit Adder/Subtractor Ripple Carry Adder (RCA) built out of 32 FAs add/subt A C =C in -bit FA S Subtaction complement all subtrahend bits (xor gates) and set the low order carry-in RCA B B A A 2 C -bit FA S C 2 -bit FA S 2 advantage: simple logic, so small (low cost) B 2... C 3 disadvantage: slow (O(N) for N bit operands) and lots of glitching (so lots of energy consumption) B 3 A 3 C 3 -bit FA S 3 C 32 =C out CSE477 L8 Datapath Design.26 Irwin&Vijay, PSU, 2
27 Review: Fast Carry Chain Design The key to fast addition is a low latency carry network What matters is whether in a given position a carry is generated g i = A i B i propagated p i = A i B i (sometimes use A i v B i ) annihilated (killed) k i =!A i!b i Giving a carry recurrence of C i+ = g i + C i p i C = g + p C C 2 = g + p g + p p C C 3 = g 2 + p 2 g + p 2 p g + p 2 p p C C 4 = g 3 + p 3 g 2 + p 3 p 2 g + p 3 p 2 p g + p 3 p 2 p p C CSE477 L8 Datapath Design.27 Irwin&Vijay, PSU, 2
28 Binary Adders synchronous word parallel adders ripple carry adders (RCA) T = O(N), A = O(N) carry prop min adders signed-digit fast carry prop residue adders adders adders T = O(), A = O(N) Manchester carry carry conditional carry carry chain select lookahead sum skip T = O(N) A = O(N) T = O(log N) A = O(N log N) T = O(N**/2) A = O(N) CSE477 L8 Datapath Design.28 Irwin&Vijay, PSU, 2
29 Carry-Skip (Carry-Bypass) Adder A 3 B 3 A 2 B 2 A B A B C 3 FA FA FA FA C i, C o,3 S 3 S 2 S S BP = p p p 2 p 3 Block Propagate Idea: If (p and p and p 2 and p 3 = ) then C o,3 = C i, else block kills carry or generates carry internally CSE477 L8 Datapath Design.29 Irwin&Vijay, PSU, 2
30 Carry-Skip Chain Implementation carry-out block carry-out BP block carry-in BP p 3 p 2 p p!c o,3 C i, g 3 g 2 g g BP CSE477 L8 Datapath Design.3 Irwin&Vijay, PSU, 2
31 4-bit Block Carry-Skip Adder bits 2 to 5 bits 8 to bits 4 to 7 bits to 3 Setup Setup Setup Setup Carry Propagation Carry Propagation Carry Propagation Carry Propagation C i, Sum Sum Sum Sum Worst-case delay carry from bit to bit 5 = carry generated in bit, ripples through bits, 2, and 3, skips the middle two groups (B = group size in bits), ripples in the last group from bit 2 to bit 5 T add = t setup + B t carry + ((N/B) -2) t skip +B t carry + t sum CSE477 L8 Datapath Design.3 Irwin&Vijay, PSU, 2
32 Optimal Block Size Assuming one stage of ripple has the same delay as one skip logic stage (where B is the number of bits in a block and for N bit operands) T CSkA = (B-) (N/B-2) + (B-) in block OR gate skips in last block = 2B + N/B stages So the optimal block size is dt CSkA /db = (N/2) **/2 = B opt And the optimal time is Optimal T CSkA = 2(2N**/2) CSE477 L8 Datapath Design.32 Irwin&Vijay, PSU, 2
33 One Level Variable Carry-Skip Adder c out c in CSE477 L8 Datapath Design.33 Irwin&Vijay, PSU, 2
34 Variable Carry-Skip Addition Clearly, a carry that is generated in, or absorbed by, one of the inner blocks travels a shorter distance through the skip blocks. So can allow more ripple stages for inner carries without increasing the overall delay Bt- Bt-2 B B skip one fewer skip one fewer skip ripple carry path carry path 2 carry path 3 CSE477 L8 Datapath Design.34 Irwin&Vijay, PSU, 2
35 Optimal Variable Block Sizes B B+ (B+K)/2 - (B+K)/2 - B+ B So the total number of bits in the K blocks is B = N/K - K/4 + /2 Giving an adder delay of T VSkA = 2N/K + K/2-2.5 So the optimal number of blocks is dt VSkA /dk = 2 (N**/2) = t opt And optimal time is Optimal T VSkA = 2 (N**/2) CSE477 L8 Datapath Design.35 Irwin&Vijay, PSU, 2
36 Multilevel Carry-Skip Addition What about allowing a carry to skip over several blocks at once? cout cin three skips one skip skip level skip level 2 AND of the first level skip signals (BP s) CSE477 L8 Datapath Design.36 Irwin&Vijay, PSU, 2
37 Carry-Skip Adder Comparisons RCA CSkA VSkA 2 8 bits 6 bits 32 bits 48 bits 64 bits CSE477 L8 Datapath Design.37 Irwin&Vijay, PSU, 2
38 Carry-Select Adder Setup P,G "" "" Carry Propagation "" "" Carry Propagation C o,k- Multiplexer C o,k+3 Sum Generation Carry Vector CSE477 L8 Datapath Design.38 Irwin&Vijay, PSU, 2
39 Carry-Select Adder: Critical Path Bit -3 Bit 4-7 Bit 8- Bit 2-5 Setup Setup Setup Setup "" "" Carry "" "" Carry "" "" Carry "" "" Carry "" "" Carry "" "" Carry "" "" Carry "" "" Carry Multiplexer Multiplexer Multiplexer Multiplexer C i, C o,3 C o,7 C o, C o,5 Sum Generation Sum Generation Sum Generation Sum Generation S -3 S 4-7 S 8- S 2-5 CSE477 L8 Datapath Design.39 Irwin&Vijay, PSU, 2
40 Linear Carry-Select Adder Bit -3 Bit 4-7 Bit 8- Bit 2-5 Setup Setup Setup Setup () "" () "" Carry "" "" Carry "" "" Carry "" "" Carry C i, "" "" Carry (5) (5) Multiplexer "" "" Carry (5) "" "" Carry (5) "" "" Carry (5) (6) (7) (8) Multiplexer Multiplexer Multiplexer (9) Sum Generation Sum Generation Sum Generation Sum Generation S -3 S 4-7 S 8- S 2-5 () CSE477 L8 Datapath Design.4 Irwin&Vijay, PSU, 2
41 Square Root Carry-Select Adder Bit - Bit 2-4 Bit 5-8 Bit 9-3 Bit 4-9 Setup Setup Setup Setup () "" Carry "" () "" "" Carry "" "" Carry "" "" Carry C i, "" Carry "" Carry "" Carry "" Carry "" "" "" "" (3) (3) (4) (5) (6) (4) (5) (6) (7) Multiplexer Multiplexer Multiplexer Multiplexer (7) Mux (8) Sum Generation Sum Generation Sum Generation Sum Generation Sum S - S 2-4 S 5-8 S 9-3 S 4-9 (9) M M is number of bits in first stage; 2N^/2 is number of stages CSE477 L8 Datapath Design.4 Irwin&Vijay, PSU, 2
42 Carry Lookahead Adder Carry recurrence C i+ = g i + C i p i C = g + p C C 2 = g + p g + p p C C 3 = g 2 + p 2 g + p 2 p g + p 2 p p C C 4 = g 3 + p 3 g 2 + p 3 p 2 g + p 3 p 2 p g + p 3 p 2 p p C C 5 = g 4 + p 4 g 3 + p 4 p 3 g 2 + p 4 p 3 p 2 g + p 4 p 3 p 2 p g + p 4 p 3 p p 2 p C C 6 = g 5 + p 5 g 4 + p 5 p 4 g 3 + p 5 p 4 p 3 g 2 + p 5 p 4 p 3 p 2 g + p 5 p 4 p 3 p 2 p g + p 5 p 4 p 3 p 2 p p C... CSE477 L8 Datapath Design.42 Irwin&Vijay, PSU, 2
43 Carry Lookahead Concept A,B A,B A N-,B N-... all carries are determined directly from the input bits C i, P C i, P C i,n- P N-... CSE477 L8 Datapath Design.43 Irwin&Vijay, PSU, 2
44 Carry Lookahead Topology V DD G 3 G 2 G G C i, Co,3 P P P 2 P 3 CSE477 L8 Datapath Design.44 Irwin&Vijay, PSU, 2
45 Logarithmic Time Adder Approach Operator where (g, p) (g, p ) = (g (p g ), p p ) Can prove (by induction) that (G i, P i ) = (g, p ) if i = (g i, p i ) (G i-, P i- ) if i N- And, since the operator, is associative, the computation of the (G i, P i ) s can be done in any order. CSE477 L8 Datapath Design.45 Irwin&Vijay, PSU, 2
46 T = log 2 N - 2 A = 2log 2 N T = log 2 N Brent-Kung (Log Time) Adder g 5 p 5 g 4 p 4 g 3 p 3 g 2 p 2 g p g p g 9 p 9 g 8 p 8 g 7 p 7 g 6 p 6 g 5 p 5 g 4 p 4 g 3 p 3 g 2 p 2 g p g p C Parallel Prefix Computation c 6 c 5 c 4 c 3 c 2 c c c 9 c 8 c 7 A = N/2 c 6 c 5 c 4 c 3 c 2 c CSE477 L8 Datapath Design.46 Irwin&Vijay, PSU, 2
47 T = log 2 N - 2 A = 2log 2 N T = log 2 N Brent-Kung (Log Time) Adder g 5 p 5 g 4 p 4 g 3 p 3 g 2 p 2 g p g p g 9 p 9 g 8 p 8 g 7 p 7 g 6 p 6 g 5 p 5 g 4 p 4 g 3 p 3 g 2 p 2 g p g p C Parallel Prefix Computation c 6 c 5 c 4 c 3 c 2 c c c 9 c 8 c 7 A = N/2 c 6 c 5 c 4 c 3 c 2 c CSE477 L8 Datapath Design.47 Irwin&Vijay, PSU, 2
48 PDP of Different Adders RCA MCCA CSkA VSkA CSlA CLA BKA ELMA 8 bits 6 bits 32 bits 48 bits 64 bits From Nagendra, 996 CSE477 L8 Datapath Design.48 Irwin&Vijay, PSU, 2
49 Next Lecture and Reminders Next lecture Multipliers/Shifters - Reading assignment Rabaey, CSE477 L8 Datapath Design.49 Irwin&Vijay, PSU, 2
50 Next Lecture and Reminders Next lecture Fast adders - Reading assignment Rabaey, CSE477 L8 Datapath Design.5 Irwin&Vijay, PSU, 2
NEW adder cells are useful for designing larger circuits despite increase in transistor count by four per cell.
CHAPTER 4 THE ADDER The adder is one of the most critical components of a processor, as it is used in the Arithmetic Logic Unit (ALU), in the floating-point unit and for address generation in case of cache
More informationECE410 Design Project Spring 2008 Design and Characterization of a CMOS 8-bit Microprocessor Data Path
ECE410 Design Project Spring 2008 Design and Characterization of a CMOS 8-bit Microprocessor Data Path Project Summary This project involves the schematic and layout design of an 8-bit microprocessor data
More informationLecture 5: Gate Logic Logic Optimization
Lecture 5: Gate Logic Logic Optimization MAH, AEN EE271 Lecture 5 1 Overview Reading McCluskey, Logic Design Principles- or any text in boolean algebra Introduction We could design at the level of irsim
More informationSequential 4-bit Adder Design Report
UNIVERSITY OF WATERLOO Faculty of Engineering E&CE 438: Digital Integrated Circuits Sequential 4-bit Adder Design Report Prepared by: Ian Hung (ixxxxxx), 99XXXXXX Annette Lo (axxxxxx), 99XXXXXX Pamela
More informationAdder.PPT(10/1/2009) 5.1. Lecture 13. Adder Circuits
Adder.T(//29) 5. Lecture 3 Adder ircuits Objectives Understand how to add both signed and unsigned numbers Appreciate how the delay of an adder circuit depends on the data values that are being added together
More informationLet s put together a Manual Processor
Lecture 14 Let s put together a Manual Processor Hardware Lecture 14 Slide 1 The processor Inside every computer there is at least one processor which can take an instruction, some operands and produce
More informationCSE140 Homework #7 - Solution
CSE140 Spring2013 CSE140 Homework #7 - Solution You must SHOW ALL STEPS for obtaining the solution. Reporting the correct answer, without showing the work performed at each step will result in getting
More informationexclusive-or and Binary Adder R eouven Elbaz reouven@uwaterloo.ca Office room: DC3576
exclusive-or and Binary Adder R eouven Elbaz reouven@uwaterloo.ca Office room: DC3576 Outline exclusive OR gate (XOR) Definition Properties Examples of Applications Odd Function Parity Generation and Checking
More informationSistemas Digitais I LESI - 2º ano
Sistemas Digitais I LESI - 2º ano Lesson 6 - Combinational Design Practices Prof. João Miguel Fernandes (miguel@di.uminho.pt) Dept. Informática UNIVERSIDADE DO MINHO ESCOLA DE ENGENHARIA - PLDs (1) - The
More informationChapter 4 Register Transfer and Microoperations. Section 4.1 Register Transfer Language
Chapter 4 Register Transfer and Microoperations Section 4.1 Register Transfer Language Digital systems are composed of modules that are constructed from digital components, such as registers, decoders,
More informationLFSR BASED COUNTERS AVINASH AJANE, B.E. A technical report submitted to the Graduate School. in partial fulfillment of the requirements
LFSR BASED COUNTERS BY AVINASH AJANE, B.E A technical report submitted to the Graduate School in partial fulfillment of the requirements for the degree Master of Science in Electrical Engineering New Mexico
More informationNAME AND SURNAME. TIME: 1 hour 30 minutes 1/6
E.T.S.E.T.B. MSc in ICT FINAL EXAM VLSI Digital Design Spring Course 2005-2006 June 6, 2006 Score publication date: June 19, 2006 Exam review request deadline: June 22, 2006 Academic consultancy: June
More informationCMOS Binary Full Adder
CMOS Binary Full Adder A Survey of Possible Implementations Group : Eren Turgay Aaron Daniels Michael Bacelieri William Berry - - Table of Contents Key Terminology...- - Introduction...- 3 - Design Architectures...-
More informationPipeline Hazards. Structure hazard Data hazard. ComputerArchitecture_PipelineHazard1
Pipeline Hazards Structure hazard Data hazard Pipeline hazard: the major hurdle A hazard is a condition that prevents an instruction in the pipe from executing its next scheduled pipe stage Taxonomy of
More informationModule 4 : Propagation Delays in MOS Lecture 22 : Logical Effort Calculation of few Basic Logic Circuits
Module 4 : Propagation Delays in MOS Lecture 22 : Logical Effort Calculation of few Basic Logic Circuits Objectives In this lecture you will learn the following Introduction Logical Effort of an Inverter
More informationDigital Logic Design. Basics Combinational Circuits Sequential Circuits. Pu-Jen Cheng
Digital Logic Design Basics Combinational Circuits Sequential Circuits Pu-Jen Cheng Adapted from the slides prepared by S. Dandamudi for the book, Fundamentals of Computer Organization and Design. Introduction
More informationComputer organization
Computer organization Computer design an application of digital logic design procedures Computer = processing unit + memory system Processing unit = control + datapath Control = finite state machine inputs
More informationChapter 2 Logic Gates and Introduction to Computer Architecture
Chapter 2 Logic Gates and Introduction to Computer Architecture 2.1 Introduction The basic components of an Integrated Circuit (IC) is logic gates which made of transistors, in digital system there are
More informationIntroducción. Diseño de sistemas digitales.1
Introducción Adapted from: Mary Jane Irwin ( www.cse.psu.edu/~mji ) www.cse.psu.edu/~cg431 [Original from Computer Organization and Design, Patterson & Hennessy, 2005, UCB] Diseño de sistemas digitales.1
More informationPass Gate Logic An alternative to implementing complex logic is to realize it using a logic network of pass transistors (switches).
Pass Gate Logic n alternative to implementing complex logic is to realize it using a logic network of pass transistors (switches). Switch Network Regeneration is performed via a buffer. We have already
More informationEE 42/100 Lecture 24: Latches and Flip Flops. Rev B 4/21/2010 (2:04 PM) Prof. Ali M. Niknejad
A. M. Niknejad University of California, Berkeley EE 100 / 42 Lecture 24 p. 1/20 EE 42/100 Lecture 24: Latches and Flip Flops ELECTRONICS Rev B 4/21/2010 (2:04 PM) Prof. Ali M. Niknejad University of California,
More informationChapter 2 Basic Structure of Computers. Jin-Fu Li Department of Electrical Engineering National Central University Jungli, Taiwan
Chapter 2 Basic Structure of Computers Jin-Fu Li Department of Electrical Engineering National Central University Jungli, Taiwan Outline Functional Units Basic Operational Concepts Bus Structures Software
More informationLatch Timing Parameters. Flip-flop Timing Parameters. Typical Clock System. Clocking Overhead
Clock - key to synchronous systems Topic 7 Clocking Strategies in VLSI Systems Peter Cheung Department of Electrical & Electronic Engineering Imperial College London Clocks help the design of FSM where
More informationIntroduction to CMOS VLSI Design (E158) Lecture 8: Clocking of VLSI Systems
Harris Introduction to CMOS VLSI Design (E158) Lecture 8: Clocking of VLSI Systems David Harris Harvey Mudd College David_Harris@hmc.edu Based on EE271 developed by Mark Horowitz, Stanford University MAH
More informationBinary Adders: Half Adders and Full Adders
Binary Adders: Half Adders and Full Adders In this set of slides, we present the two basic types of adders: 1. Half adders, and 2. Full adders. Each type of adder functions to add two binary bits. In order
More informationLecture 7: Clocking of VLSI Systems
Lecture 7: Clocking of VLSI Systems MAH, AEN EE271 Lecture 7 1 Overview Reading Wolf 5.3 Two-Phase Clocking (good description) W&E 5.5.1, 5.5.2, 5.5.3, 5.5.4, 5.5.9, 5.5.10 - Clocking Note: The analysis
More informationSystem on Chip Design. Michael Nydegger
Short Questions, 26. February 2015 What is meant by the term n-well process? What does this mean for the n-type MOSFETs in your design? What is the meaning of the threshold voltage (practically)? What
More information1. True or False? A voltage level in the range 0 to 2 volts is interpreted as a binary 1.
File: chap04, Chapter 04 1. True or False? A voltage level in the range 0 to 2 volts is interpreted as a binary 1. 2. True or False? A gate is a device that accepts a single input signal and produces one
More informationCOMBINATIONAL and SEQUENTIAL LOGIC CIRCUITS Hardware implementation and software design
PH-315 COMINATIONAL and SEUENTIAL LOGIC CIRCUITS Hardware implementation and software design A La Rosa I PURPOSE: To familiarize with combinational and sequential logic circuits Combinational circuits
More informationLayout of Multiple Cells
Layout of Multiple Cells Beyond the primitive tier primitives add instances of primitives add additional transistors if necessary add substrate/well contacts (plugs) add additional polygons where needed
More informationCOMBINATIONAL CIRCUITS
COMBINATIONAL CIRCUITS http://www.tutorialspoint.com/computer_logical_organization/combinational_circuits.htm Copyright tutorialspoint.com Combinational circuit is a circuit in which we combine the different
More informationUnderstanding Logic Design
Understanding Logic Design ppendix of your Textbook does not have the needed background information. This document supplements it. When you write add DD R0, R1, R2, you imagine something like this: R1
More informationSequential Circuit Design
Sequential Circuit Design Lan-Da Van ( 倫 ), Ph. D. Department of Computer Science National Chiao Tung University Taiwan, R.O.C. Fall, 2009 ldvan@cs.nctu.edu.tw http://www.cs.nctu.edu.tw/~ldvan/ Outlines
More informationRAM & ROM Based Digital Design. ECE 152A Winter 2012
RAM & ROM Based Digital Design ECE 152A Winter 212 Reading Assignment Brown and Vranesic 1 Digital System Design 1.1 Building Block Circuits 1.1.3 Static Random Access Memory (SRAM) 1.1.4 SRAM Blocks in
More informationSolution: start more than one instruction in the same clock cycle CPI < 1 (or IPC > 1, Instructions per Cycle) Two approaches:
Multiple-Issue Processors Pipelining can achieve CPI close to 1 Mechanisms for handling hazards Static or dynamic scheduling Static or dynamic branch handling Increase in transistor counts (Moore s Law):
More informationFlip-Flops, Registers, Counters, and a Simple Processor
June 8, 22 5:56 vra235_ch7 Sheet number Page number 349 black chapter 7 Flip-Flops, Registers, Counters, and a Simple Processor 7. Ng f3, h7 h6 349 June 8, 22 5:56 vra235_ch7 Sheet number 2 Page number
More information路 論 Chapter 15 System-Level Physical Design
Introduction to VLSI Circuits and Systems 路 論 Chapter 15 System-Level Physical Design Dept. of Electronic Engineering National Chin-Yi University of Technology Fall 2007 Outline Clocked Flip-flops CMOS
More informationMemory Elements. Combinational logic cannot remember
Memory Elements Combinational logic cannot remember Output logic values are function of inputs only Feedback is needed to be able to remember a logic value Memory elements are needed in most digital logic
More informationAn Effective Deterministic BIST Scheme for Shifter/Accumulator Pairs in Datapaths
An Effective Deterministic BIST Scheme for Shifter/Accumulator Pairs in Datapaths N. KRANITIS M. PSARAKIS D. GIZOPOULOS 2 A. PASCHALIS 3 Y. ZORIAN 4 Institute of Informatics & Telecommunications, NCSR
More informationArchitectural Level Power Consumption of Network on Chip. Presenter: YUAN Zheng
Architectural Level Power Consumption of Network Presenter: YUAN Zheng Why Architectural Low Power Design? High-speed and large volume communication among different parts on a chip Problem: Power consumption
More informationDEPARTMENT OF INFORMATION TECHNLOGY
DRONACHARYA GROUP OF INSTITUTIONS, GREATER NOIDA Affiliated to Mahamaya Technical University, Noida Approved by AICTE DEPARTMENT OF INFORMATION TECHNLOGY Lab Manual for Computer Organization Lab ECS-453
More informationTwo-level logic using NAND gates
CSE140: Components and Design Techniques for Digital Systems Two and Multilevel logic implementation Tajana Simunic Rosing 1 Two-level logic using NND gates Replace minterm ND gates with NND gates Place
More informationLecture 10 Sequential Circuit Design Zhuo Feng. Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 2010
EE4800 CMOS igital IC esign & Analysis Lecture 10 Sequential Circuit esign Zhuo Feng 10.1 Z. Feng MTU EE4800 CMOS igital IC esign & Analysis 2010 Sequencing Outline Sequencing Element esign Max and Min-elay
More informationCombinational Logic Design
Chapter 4 Combinational Logic Design The foundations for the design of digital logic circuits were established in the preceding chapters. The elements of Boolean algebra (two-element switching algebra
More informationModeling Sequential Elements with Verilog. Prof. Chien-Nan Liu TEL: 03-4227151 ext:34534 Email: jimmy@ee.ncu.edu.tw. Sequential Circuit
Modeling Sequential Elements with Verilog Prof. Chien-Nan Liu TEL: 03-4227151 ext:34534 Email: jimmy@ee.ncu.edu.tw 4-1 Sequential Circuit Outputs are functions of inputs and present states of storage elements
More informationGate Delay Model. Estimating Delays. Effort Delay. Gate Delay. Computing Logical Effort. Logical Effort
Estimating Delays Would be nice to have a back of the envelope method for sizing gates for speed Logical Effort Book by Sutherland, Sproull, Harris Chapter 1 is on our web page Also Chapter 4 in our textbook
More informationMore Verilog. 8-bit Register with Synchronous Reset. Shift Register Example. N-bit Register with Asynchronous Reset.
More Verilog 8-bit Register with Synchronous Reset module reg8 (reset, CLK, D, Q); input reset; input [7:0] D; output [7:0] Q; reg [7:0] Q; if (reset) Q = 0; else Q = D; module // reg8 Verilog - 1 Verilog
More informatione.g. τ = 12 ps in 180nm, 40 ps in 0.6 µm Delay has two components where, f = Effort Delay (stage effort)= gh p =Parasitic Delay
Logic Gate Delay Chip designers need to choose: What is the best circuit topology for a function? How many stages of logic produce least delay? How wide transistors should be? Logical Effort Helps make
More informationSolutions. Solution 4.1. 4.1.1 The values of the signals are as follows:
4 Solutions Solution 4.1 4.1.1 The values of the signals are as follows: RegWrite MemRead ALUMux MemWrite ALUOp RegMux Branch a. 1 0 0 (Reg) 0 Add 1 (ALU) 0 b. 1 1 1 (Imm) 0 Add 1 (Mem) 0 ALUMux is the
More informationCS 61C: Great Ideas in Computer Architecture Finite State Machines. Machine Interpreta4on
CS 61C: Great Ideas in Computer Architecture Finite State Machines Instructors: Krste Asanovic & Vladimir Stojanovic hbp://inst.eecs.berkeley.edu/~cs61c/sp15 1 Levels of RepresentaKon/ InterpretaKon High
More informationCHAPTER 3 Boolean Algebra and Digital Logic
CHAPTER 3 Boolean Algebra and Digital Logic 3.1 Introduction 121 3.2 Boolean Algebra 122 3.2.1 Boolean Expressions 123 3.2.2 Boolean Identities 124 3.2.3 Simplification of Boolean Expressions 126 3.2.4
More informationSystems I: Computer Organization and Architecture
Systems I: Computer Organization and Architecture Lecture 9 - Register Transfer and Microoperations Microoperations Digital systems are modular in nature, with modules containing registers, decoders, arithmetic
More informationCSE140: Components and Design Techniques for Digital Systems
CSE4: Components and Design Techniques for Digital Systems Tajana Simunic Rosing What we covered thus far: Number representations Logic gates Boolean algebra Introduction to CMOS HW#2 due, HW#3 assigned
More informationLecture-3 MEMORY: Development of Memory:
Lecture-3 MEMORY: It is a storage device. It stores program data and the results. There are two kind of memories; semiconductor memories & magnetic memories. Semiconductor memories are faster, smaller,
More informationDesign of Low Power One-Bit Hybrid-CMOS Full Adder Cells
Design of Low Power One-Bit Hybrid-CMOS Full Adder Cells Sushil B. Bhaisare 1, Sonalee P. Suryawanshi 2, Sagar P. Soitkar 3 1 Lecturer in Electronics Department, Nagpur University, G.H.R.I.E.T.W. Nagpur,
More informationChapter 5 :: Memory and Logic Arrays
Chapter 5 :: Memory and Logic Arrays Digital Design and Computer Architecture David Money Harris and Sarah L. Harris Copyright 2007 Elsevier 5- ROM Storage Copyright 2007 Elsevier 5- ROM Logic Data
More information5 Combinatorial Components. 5.0 Full adder. Full subtractor
5 Combatorial Components Use for data transformation, manipulation, terconnection, and for control: arithmetic operations - addition, subtraction, multiplication and division. logic operations - AND, OR,
More informationHigh Speed Gate Level Synchronous Full Adder Designs
High Speed Gate Level Synchronous Full Adder Designs PADMANABHAN BALASUBRAMANIAN and NIKOS E. MASTORAKIS School of Computer Science, The University of Manchester, Oxford Road, Manchester M13 9PL, UNITED
More informationASYNCHRONOUS COUNTERS
LB no.. SYNCHONOUS COUNTES. Introduction Counters are sequential logic circuits that counts the pulses applied at their clock input. They usually have 4 bits, delivering at the outputs the corresponding
More informationLecture 10: Sequential Circuits
Introduction to CMOS VLSI esign Lecture 10: Sequential Circuits avid Harris Harvey Mudd College Spring 2004 Outline q Sequencing q Sequencing Element esign q Max and Min-elay q Clock Skew q Time Borrowing
More informationComputer System: User s View. Computer System Components: High Level View. Input. Output. Computer. Computer System: Motherboard Level
System: User s View System Components: High Level View Input Output 1 System: Motherboard Level 2 Components: Interconnection I/O MEMORY 3 4 Organization Registers ALU CU 5 6 1 Input/Output I/O MEMORY
More informationTopics of Chapter 5 Sequential Machines. Memory elements. Memory element terminology. Clock terminology
Topics of Chapter 5 Sequential Machines Memory elements Memory elements. Basics of sequential machines. Clocking issues. Two-phase clocking. Testing of combinational (Chapter 4) and sequential (Chapter
More informationANALOG & DIGITAL ELECTRONICS
ANALOG & DIGITAL ELECTRONICS Course Instructor: Course No: PH-218 3-1-0-8 Dr. A.P. Vajpeyi E-mail: apvajpeyi@iitg.ernet.in Room No: #305 Department of Physics, Indian Institute of Technology Guwahati,
More informationCHAPTER 4 MARIE: An Introduction to a Simple Computer
CHAPTER 4 MARIE: An Introduction to a Simple Computer 4.1 Introduction 195 4.2 CPU Basics and Organization 195 4.2.1 The Registers 196 4.2.2 The ALU 197 4.2.3 The Control Unit 197 4.3 The Bus 197 4.4 Clocks
More information1. Memory technology & Hierarchy
1. Memory technology & Hierarchy RAM types Advances in Computer Architecture Andy D. Pimentel Memory wall Memory wall = divergence between CPU and RAM speed We can increase bandwidth by introducing concurrency
More informationChapter 7. Registers & Register Transfers. J.J. Shann. J. J. Shann
Chapter 7 Registers & Register Transfers J. J. Shann J.J. Shann Chapter Overview 7- Registers and Load Enable 7-2 Register Transfers 7-3 Register Transfer Operations 7-4 A Note for VHDL and Verilog Users
More informationHIGH SPEED AREA EFFICIENT 1-BIT HYBRID FULL ADDER
HIGH SPEED AREA EFFICIENT 1-BIT HYBRID FULL ADDER Sachin Kumar *1, Aman Kumar #2, Puneet Bansal #3 * Department of Electronic Science, Kurukshetra University, Kurukshetra, Haryana, India # University Institute
More informationFinite State Machine. RTL Hardware Design by P. Chu. Chapter 10 1
Finite State Machine Chapter 10 1 Outline 1. Overview 2. FSM representation 3. Timing and performance of an FSM 4. Moore machine versus Mealy machine 5. VHDL description of FSMs 6. State assignment 7.
More informationData Dependences. A data dependence occurs whenever one instruction needs a value produced by another.
Data Hazards 1 Hazards: Key Points Hazards cause imperfect pipelining They prevent us from achieving CPI = 1 They are generally causes by counter flow data pennces in the pipeline Three kinds Structural
More informationInterconnection Networks
Advanced Computer Architecture (0630561) Lecture 15 Interconnection Networks Prof. Kasim M. Al-Aubidy Computer Eng. Dept. Interconnection Networks: Multiprocessors INs can be classified based on: 1. Mode
More informationOptimization and Comparison of 4-Stage Inverter, 2-i/p NAND Gate, 2-i/p NOR Gate Driving Standard Load By Using Logical Effort
Optimization and Comparison of -Stage, -i/p NND Gate, -i/p NOR Gate Driving Standard Load By Using Logical Effort Satyajit nand *, and P.K.Ghosh ** * Mody Institute of Technology & Science/ECE, Lakshmangarh,
More informationCounters are sequential circuits which "count" through a specific state sequence.
Counters Counters are sequential circuits which "count" through a specific state sequence. They can count up, count down, or count through other fixed sequences. Two distinct types are in common usage:
More informationADVANCED PROCESSOR ARCHITECTURES AND MEMORY ORGANISATION Lesson-12: ARM
ADVANCED PROCESSOR ARCHITECTURES AND MEMORY ORGANISATION Lesson-12: ARM 1 The ARM architecture processors popular in Mobile phone systems 2 ARM Features ARM has 32-bit architecture but supports 16 bit
More informationHardware Implementations of RSA Using Fast Montgomery Multiplications. ECE 645 Prof. Gaj Mike Koontz and Ryon Sumner
Hardware Implementations of RSA Using Fast Montgomery Multiplications ECE 645 Prof. Gaj Mike Koontz and Ryon Sumner Overview Introduction Functional Specifications Implemented Design and Optimizations
More informationMultipliers. Introduction
Multipliers Introduction Multipliers play an important role in today s digital signal processing and various other applications. With advances in technology, many researchers have tried and are trying
More information(Refer Slide Time: 00:01:16 min)
Digital Computer Organization Prof. P. K. Biswas Department of Electronic & Electrical Communication Engineering Indian Institute of Technology, Kharagpur Lecture No. # 04 CPU Design: Tirning & Control
More informationPower Reduction Techniques in the SoC Clock Network. Clock Power
Power Reduction Techniques in the SoC Network Low Power Design for SoCs ASIC Tutorial SoC.1 Power Why clock power is important/large» Generally the signal with the highest frequency» Typically drives a
More informationRN-Codings: New Insights and Some Applications
RN-Codings: New Insights and Some Applications Abstract During any composite computation there is a constant need for rounding intermediate results before they can participate in further processing. Recently
More informationRead-only memory Implementing logic with ROM Programmable logic devices Implementing logic with PLDs Static hazards
Points ddressed in this Lecture Lecture 8: ROM Programmable Logic Devices Professor Peter Cheung Department of EEE, Imperial College London Read-only memory Implementing logic with ROM Programmable logic
More informationChapter 10 Advanced CMOS Circuits
Transmission Gates Chapter 10 Advanced CMOS Circuits NMOS Transmission Gate The active pull-up inverter circuit leads one to thinking about alternate uses of NMOS devices. Consider the circuit shown in
More informationEXPERIMENT 4. Parallel Adders, Subtractors, and Complementors
EXPERIMENT 4. Parallel Adders, Subtractors, and Complementors I. Introduction I.a. Objectives In this experiment, parallel adders, subtractors and complementors will be designed and investigated. In the
More informationETEC 2301 Programmable Logic Devices. Chapter 10 Counters. Shawnee State University Department of Industrial and Engineering Technologies
ETEC 2301 Programmable Logic Devices Chapter 10 Counters Shawnee State University Department of Industrial and Engineering Technologies Copyright 2007 by Janna B. Gallaher Asynchronous Counter Operation
More informationDesign of Energy Efficient Low Power Full Adder using Supply Voltage Gating
Design of Energy Efficient Low Power Full Adder using Supply Voltage Gating S.Nandhini 1, T.G.Dhaarani 2, P.Kokila 3, P.Premkumar 4 Assistant Professor, Dept. of ECE, Nandha Engineering College, Erode,
More informationCHAPTER 5 FINITE STATE MACHINE FOR LOOKUP ENGINE
CHAPTER 5 71 FINITE STATE MACHINE FOR LOOKUP ENGINE 5.1 INTRODUCTION Finite State Machines (FSMs) are important components of digital systems. Therefore, techniques for area efficiency and fast implementation
More informationSequential Circuits. Combinational Circuits Outputs depend on the current inputs
Principles of VLSI esign Sequential Circuits Sequential Circuits Combinational Circuits Outputs depend on the current inputs Sequential Circuits Outputs depend on current and previous inputs Requires separating
More informationLecture 8: Synchronous Digital Systems
Lecture 8: Synchronous Digital Systems The distinguishing feature of a synchronous digital system is that the circuit only changes in response to a system clock. For example, consider the edge triggered
More informationWEEK 8.1 Registers and Counters. ECE124 Digital Circuits and Systems Page 1
WEEK 8.1 egisters and Counters ECE124 igital Circuits and Systems Page 1 Additional schematic FF symbols Active low set and reset signals. S Active high set and reset signals. S ECE124 igital Circuits
More informationLecture 11: Sequential Circuit Design
Lecture 11: Sequential Circuit esign Outline Sequencing Sequencing Element esign Max and Min-elay Clock Skew Time Borrowing Two-Phase Clocking 2 Sequencing Combinational logic output depends on current
More informationLecture 10: Latch and Flip-Flop Design. Outline
Lecture 1: Latch and Flip-Flop esign Slides orginally from: Vladimir Stojanovic Computer Systems Laboratory Stanford University horowitz@stanford.edu 1 Outline Recent interest in latches and flip-flops
More informationAddressing The problem. When & Where do we encounter Data? The concept of addressing data' in computations. The implications for our machine design(s)
Addressing The problem Objectives:- When & Where do we encounter Data? The concept of addressing data' in computations The implications for our machine design(s) Introducing the stack-machine concept Slide
More informationThe Central Processing Unit:
The Central Processing Unit: What Goes on Inside the Computer Chapter 4 Objectives Identify the components of the central processing unit and how they work together and interact with memory Describe how
More informationA New Paradigm for Synchronous State Machine Design in Verilog
A New Paradigm for Synchronous State Machine Design in Verilog Randy Nuss Copyright 1999 Idea Consulting Introduction Synchronous State Machines are one of the most common building blocks in modern digital
More informationLecture 5: Logical Effort
Introduction to CMOS VLSI Design Lecture 5: Logical Effort David Harris Harvey Mudd College Spring 2004 Outline Introduction Delay in a Logic Gate Multistage Logic Networks Choosing the Best Number of
More informationUnited States Naval Academy Electrical and Computer Engineering Department. EC262 Exam 1
United States Naval Academy Electrical and Computer Engineering Department EC262 Exam 29 September 2. Do a page check now. You should have pages (cover & questions). 2. Read all problems in their entirety.
More informationA Novel Low Power, High Speed 14 Transistor CMOS Full Adder Cell with 50% Improvement in Threshold Loss Problem
A Novel Low Power, High Speed 4 Transistor CMOS Full Adder Cell with 5% Improvement in Threshold Loss Problem T. Vigneswaran, B. Mukundhan, and P. Subbarami Reddy Abstract Full adders are important components
More informationearlier in the semester: The Full adder above adds two bits and the output is at the end. So if we do this eight times, we would have an 8-bit adder.
The circuit created is an 8-bit adder. The 8-bit adder adds two 8-bit binary inputs and the result is produced in the output. In order to create a Full 8-bit adder, I could use eight Full -bit adders and
More informationEE 459/500 HDL Based Digital Design with Programmable Logic. Lecture 16 Timing and Clock Issues
EE 459/500 HDL Based Digital Design with Programmable Logic Lecture 16 Timing and Clock Issues 1 Overview Sequential system timing requirements Impact of clock skew on timing Impact of clock jitter on
More informationGates, Circuits, and Boolean Algebra
Gates, Circuits, and Boolean Algebra Computers and Electricity A gate is a device that performs a basic operation on electrical signals Gates are combined into circuits to perform more complicated tasks
More informationLatches, the D Flip-Flop & Counter Design. ECE 152A Winter 2012
Latches, the D Flip-Flop & Counter Design ECE 52A Winter 22 Reading Assignment Brown and Vranesic 7 Flip-Flops, Registers, Counters and a Simple Processor 7. Basic Latch 7.2 Gated SR Latch 7.2. Gated SR
More informationComputer Systems Design and Architecture by V. Heuring and H. Jordan
1-1 Chapter 1 - The General Purpose Machine Computer Systems Design and Architecture Vincent P. Heuring and Harry F. Jordan Department of Electrical and Computer Engineering University of Colorado - Boulder
More information