Lecture 11. Clocking High-Performance Microprocessors
|
|
|
- Isabella Jefferson
- 10 years ago
- Views:
Transcription
1 Lecture 11 Clocking High-Performance Microprocessors Computer Systems Laboratory Stanford University Copyright 2007 Ron Ho, Mark Horowitz 1 Overview Readings (Actually for today s lecture) Gronowski Alpha clocking paper Restle Clock grid paper Harris Variations paper Today s topics Overview of clock distribution methods and challenges Three schemes of clock distribution Overcoming and exploiting clock skew and jitter 2
2 Latches and Flops Need Clocks Previous lecture on latches and flops assumed an input clock We talked briefly about imperfections in that clock: skew, jitter In a chip, we can have n clocks for the latches and flops asynchrony n=1: one clock for the whole chip n<10: one clock per major chip portion (e.g., core and cache) 10<n<100: one clock per major functional block (e.g., GALS) 100<n<10000: one clock per datapath stage This lecture will focus on the globally synchronous systems One or a few clocks distributed across the chip How do we do this, and why is it so hard? 3 Principal Goal of Clock Distribution We need to distribute a stable clock across a die Clock should be phase-aligned with an external input clock Skew should be zero, or at least well-characterized Jitter should be zero Both skew and jitter should be insensitive to PVT Duty cycle should be 50%-50%, or at least well-characterized clk1 t skew t +jit clk2 t -jit 4
3 Other Goals of Clock Distribution Clock network should be low-power But remember that most of the power is burned in the latches/flops Clock distribution should be simple to design and verify Clocking should allow external buses at slower frequencies Bus/system clock is much slower than on-chip clock (why?) Example: 533MHz bus clock, 3.0GHz chip clock Often at a non-integer fraction of the CPU clock (3/5, 2/9 ) Every n th CPU clock is phase-aligned with every m th bus clock Clock generation should support test and debug Control over frequency, skews, and duty cycle 5 Principal Goal is Difficult Uncertainty in the clocking network appears everywhere! Affects skew, jitter, and duty cycle of the delivered clock Uncertainty from clock generation clk source clk loads (latches/flops) 6
4 Principal Goal is Difficult Uncertainty in the clocking network appears everywhere! Affects skew, jitter, and duty cycle of the delivered clock Mismatched buffers Uncertainty from clock generation clk source clk loads (latches/flops) 7 Principal Goal is Difficult Uncertainty in the clocking network appears everywhere! Affects skew, jitter, and duty cycle of the delivered clock Mismatched buffers Uncertainty from clock generation clk source Misplaced buffers clk loads (latches/flops) 8
5 Principal Goal is Difficult Uncertainty in the clocking network appears everywhere! Affects skew, jitter, and duty cycle of the delivered clock Mismatched buffers Mismatched wires Uncertainty from clock generation clk source Misplaced buffers clk loads (latches/flops) 9 Principal Goal is Difficult Uncertainty in the clocking network appears everywhere! Affects skew, jitter, and duty cycle of the delivered clock Mismatched buffers Mismatched wires Variable loads Uncertainty from clock generation clk source Misplaced buffers clk loads (latches/flops) 10
6 Principal Goal is Difficult Uncertainty in the clocking network appears everywhere! Affects skew, jitter, and duty cycle of the delivered clock Mismatched buffers Mismatched wires Variable loads Uncertainty from clock generation clk source Misplaced buffers clk loads (latches/flops) Temp variations 11 Principal Goal is Difficult Uncertainty in the clocking network appears everywhere! Affects skew, jitter, and duty cycle of the delivered clock Power supply variations Mismatched buffers Mismatched wires Variable loads Uncertainty from clock generation clk source Misplaced buffers clk loads (latches/flops) Temp variations 12
7 Mismatched Buffers Earlier we talked about process corners (FF, SS, TT, SF, etc.) This represents variation from one die to another die There is also significant cross-die variation: process tilt ITRS predicts 3σ of L e will grow to be about 20% of L e Two devices on opposite chip sides will behave differently Won t show up in process corners: all devices identically fast in FF There is also local systematic variation: proximity effects Local wiring density affects etch rates and thus dimension control Use dummy structures to equalize and match environments Both process tilt and proximity effects must be considered 13 Mismatched Wires and Misplaced Buffers We would prefer to put buffers in exactly the right places But usually we cannot due to congestion (middle of the cache?) Leads to mismatched wire loads Wires don t always fit exactly right either They also have to fit in the middle of other layouts You get different capacitive components (M5-M4, M5-M5, fringe ) But even if we put everything in the right places (Expensive, but perhaps worth it for the clock) Still have process tilt and proximity effects for wires Again, not counted in process corner simulations Designers often use shield wires and then ignore this 14
8 Variable Loads Clocks drive latches and flops, which can be passgate loads Load depends on the data going through the passgate Drain Source Drain Fall Fall Low High High High Rise Source Fall Low Low Low High Rise Rise Gate load 1.3x 1.1x x 0.42x 0.31x 0.13x Source: Bailey JSSC :1 variation 10:1 variation 15 Power Supply Variations RTL-based power traces can drive a global grid simulation Example of Itanium II power supply variations (snapshot in time) Source: Harris TransVLSI 12/01 16
9 Temperature Variations Same for temperature variations, but these are slower effects Long-term skew and/or jitter 17 Source: Harris TransVLSI 12/01 So What Makes Up Skew? If we eliminate systematic offsets we care mostly about devices Source: Geannopoulos ISSCC
10 How Much Skew Do We Have? Historically, skew budgets have been about 5% of the cycle time What people generally try to hit in designs Source: Rusu Intel 19 Simulating Skew Do not just gather all 3σ numbers in the worst possible way Way too pessimistic; you'll never get the design done You ll never see 3σ variation in clock buffers across a single datapath Statistical simulations figure out the right skew budget Key point: skew from process variation affects both clock and data Data variation makes local skew budgets worse but makes the difference in global and local skew smaller Global skew Local skew C1 C2 20
11 Simulating Skew: A Note on Inductance Inductance doesn t matter for skinny signal wires Short wires: driver resistance swamps out wire inductance When R driver > 2Z 0, where Z 0 Long wires: wire resistance swamps out wire inductance When R wire > 2Z 0 (i.e., if attenuation factor α > 1) RLC models are exceedingly complicated and time-consuming But: model L for clock networks Wires are wide (much less R wire ) We care about ps of skew Only a few wires (100s) to model L noise also worth modeling Source: Restle JSSC What About Cycle-to-Cycle Variation (Jitter)? Jitter comes from temporal variations in the clock network Some comes from the PLL/DLL, but not very much Power supply changes mostly; temperature variations are slow Also data coupling noise, but can design this away with shielding Jitter depends linearly on the delay through the clock network Insertion delay is the time from PLL to the final latches Around 2 cycles in CPUs today Note that jitter is primarily a max-path problem Min-path problems are at the same clock edge Although beware differential jitter between different clock nets 22
12 How Much Jitter Do We Have? Also around 5% of the cycle time (statistical simulations) But you should ask: how do people measure jitter? More later Source: Rusu Intel 23 Clock Distribution Schemes Three principal ways of distributing clocks around a chip Grid (DEC Alpha processors) H-tree (IBM S/390, Intel Itanium) Spines (Intel PentiumII, PentiumIII, P4) Tree-Grid hybrids (IBM Power, Sun Ultrasparc) Other methods also being researched Optical distribution Resonant clocks 24
13 Clock Grids Cover the entire chip with a grid that carries clock Good: simple, low skew (because everything is shorted together) Bad: lots of power and metal resources consumed Need to figure out from where you will drive the grid 25 Three Generations of Gridded Alphas DEC Alpha used clock grids, driven by panes of drivers (1993): 200MHz, 0.75μm process, one driver pane (1996): 333MHz, 0.5μm process, two driver panes (1997): 600MHz, 0.35μm process, sixteen driver panes Source: Gronowski JSSC
14 Alpha Gridded Skews Skews dropped: 240pS to 120pS to 75pS As fraction of Tcycle: 4.8%, 3.6%, 4.5% Simulated only, not measured Source: Gronowski JSSC Thermal Image of and Single pane driver on was definitely a hotspot Splitting into two panes reduced thermal gradients significantly Source: Gronowski JSSC
15 Clock H-Trees Balanced H-Trees are complex to design but are appealing Good: Low power, low routing resources Bad: Skew between divergent branches, complex design Example: IBM S/390 used balanced RC paths in the tree Source: Webb JSSC H-Trees and Points of Divergence Skew between nodes on an H-Tree depends on location Their last common node is their Point of Divergence (POD) Two nodes with a near POD = low skew (10% s+j budget) Two nodes with a far POD = high skew (20% s+j budget) Trees must balance RC delays with non-uniform loads small skew big skew IBM testchip with loads shown by diameter Source: Restle VLSI
16 Hybrid: Trees Driving Grids Eliminates skew differences due to POD But consumes lots of power, like grids IBM Power4 in 0.18μm consumes 70% power in clocks and latches Source: Anderson ISSCC 2001 Source: Restle ISSCC Hybrid: Trees Driving Grids Simulations of skew-smoothing that you get with grids IBM Power4 with intentional skew added to one buffer side Simulations only 32
17 Animations of Grid Clocks Restle (IBM Research) does interesting work visualizing circuits In particular, clock animations 33 Hybrid: Trees Driving Grids Sun UltraSparcIII: balanced tree to global grid (skew simulated) Source: Heald JSSC
18 Clock Spines Spines look like a collapsed tree Each spine is equipotential and can radiate outwards Example: First Intel Pentium4 microprocessor used three spines Source: Kurd ISSCC 2001 Source: Kurd JSSC Clock Spines Skew Later Intel s 90nm P4 added more spines Skew was simulated to be under 10pS Source: Bindal ISSCC
19 A True Hybrid Compaq Alpha (EV8, r.i.p.) Grid in core, panes and trees in memory/network, trees in cache Skew in the glue memory/network controller < 90pS Source: Xanthopoulos ISSCC Minimizing Skew In Design Follow basic heuristics to match all gates Single cell design with dummy structures around it, replicated Same cell orientation in each instance Similar heuristics for wires Consider and account for return path resistance Heavily shield clock routes: makes C uniform and L minimal Limit maximum wire dimensions: avoid skin effect Simplify the problem by using templates All of M5 looks like the following, repeated over the entire die Gnd/Vdd Local Clk Data 38
20 Active Skew Cancellation (Insertion) Look at clocks and adjust them Need to be careful about the loop bandwidth and stability concerns Example from Intel PentiumIII (dual spine design) Source: Geannopoulos ISSCC Multi-zone Deskew Circuits Example from Intel Itanium clocking Distribute an array of deskew buffers, each tying together 4 zones Source: Rusu ISSCC
21 Deskew Effectiveness Chip Publication Zones Skew After Stepsize PentiumIII ISSCC pS 15pS 12pS Itanium ISSCC pS 28pS 8pS Pentium4 ISSCC pS 16pS 8pS Itanium (3 rd G) ISSCC pS 7pS 7pS Source: Stinson ISSCC Minimizing Jitter In Design Jitter predominantly comes from power supply noise So filter the clock power supply Example from Pentium4 clock network (simulated) Source: Kurd JSSC
22 Minimizing Jitter In Design, con t Compaq Alpha EV8 used essentially the same idea (simulated) Source: Xanthopoulos ISSCC Measuring Skew and Jitter There's a good reason why skew numbers are usually simulated E-beam and picoprobe don t work with flip-chip packaging IBM PICA: Picosecond Imaging Circuit Analysis Saturated nmos emit photons in the channel Very rarely; photons photomultiplied exponentially to be seen Subsample to probe the whole chip at once over a long time Can measure skew by imaging XOR of two different clocks Intel LVP: Laser Voltage Probe Shine a laser onto the diffusion (from the back of the substrate) Need a good transparent heat-spreader (diamond) Reflectivity depends on voltage bias. Probes only one point at a time, but rapidly 44
23 PICA Movie of Ring 45 PICA Measurements PICA measurements of S/390 clock shown Source: Tsang IBM JourResDev
24 LVP Measurements of Skew Voltage levels not well characterized, but transitions are Good for teasing out skew numbers from the edges Shows a layout error that was fixed in a subsequent stepping Source: Anderson ISSCC Measuring Jitter Almost impossible to do, so everybody estimates it You can try to measure total skew + jitter (Restle ISSCC 2005) Both PICA and LVP are averaging measurements (no jitter) Typical schemes drive a clock node to a pin to expose it But need to buffer up to drive the pin Ramp-up chain introduces jitter itself Measurement scopes have source jitter, too Measured jitter: 150pS. Source trigger jitter: 150pS. Hmmm Commercial systems try to cancel trig jitter (M2, Wavecrest) But still end up with on-chip buffer jitter 48
Clocking. Figure by MIT OCW. 6.884 - Spring 2005 2/18/05 L06 Clocks 1
ing Figure by MIT OCW. 6.884 - Spring 2005 2/18/05 L06 s 1 Why s and Storage Elements? Inputs Combinational Logic Outputs Want to reuse combinational logic from cycle to cycle 6.884 - Spring 2005 2/18/05
Alpha CPU and Clock Design Evolution
Alpha CPU and Clock Design Evolution This lecture uses two papers that discuss the evolution of the Alpha CPU and clocking strategy over three CPU generations Gronowski, Paul E., et.al., High Performance
TIMING-DRIVEN PHYSICAL DESIGN FOR DIGITAL SYNCHRONOUS VLSI CIRCUITS USING RESONANT CLOCKING
TIMING-DRIVEN PHYSICAL DESIGN FOR DIGITAL SYNCHRONOUS VLSI CIRCUITS USING RESONANT CLOCKING BARIS TASKIN, JOHN WOOD, IVAN S. KOURTEV February 28, 2005 Research Objective Objective: Electronic design automation
Topics of Chapter 5 Sequential Machines. Memory elements. Memory element terminology. Clock terminology
Topics of Chapter 5 Sequential Machines Memory elements Memory elements. Basics of sequential machines. Clocking issues. Two-phase clocking. Testing of combinational (Chapter 4) and sequential (Chapter
Latch Timing Parameters. Flip-flop Timing Parameters. Typical Clock System. Clocking Overhead
Clock - key to synchronous systems Topic 7 Clocking Strategies in VLSI Systems Peter Cheung Department of Electrical & Electronic Engineering Imperial College London Clocks help the design of FSM where
Power Reduction Techniques in the SoC Clock Network. Clock Power
Power Reduction Techniques in the SoC Network Low Power Design for SoCs ASIC Tutorial SoC.1 Power Why clock power is important/large» Generally the signal with the highest frequency» Typically drives a
LOW POWER DESIGN OF DIGITAL SYSTEMS USING ENERGY RECOVERY CLOCKING AND CLOCK GATING
LOW POWER DESIGN OF DIGITAL SYSTEMS USING ENERGY RECOVERY CLOCKING AND CLOCK GATING A thesis work submitted to the faculty of San Francisco State University In partial fulfillment of the requirements for
On-Chip Interconnect: The Past, Present, and Future
On-Chip Interconnect: The Past, Present, and Future Professor Eby G. Friedman Department of Electrical and Computer Engineering University of Rochester URL: http://www.ece.rochester.edu/~friedman Future
Signal Integrity: Tips and Tricks
White Paper: Virtex-II, Virtex-4, Virtex-5, and Spartan-3 FPGAs R WP323 (v1.0) March 28, 2008 Signal Integrity: Tips and Tricks By: Austin Lesea Signal integrity (SI) engineering has become a necessary
TIMING ISSUES IN DIGITAL CIRCUITS
chapter10_141.fm Page 42 Tuesday, April 16, 2002 9:12 AM CHAPTER 10 TIMING ISSUES IN DIGITAL CIRCUITS Impact of clock skew and jitter on performance and functionality n Alternative timing methodologies
Low latency synchronization through speculation
Low latency synchronization through speculation D.J.Kinniment, and A.V.Yakovlev School of Electrical and Electronic and Computer Engineering, University of Newcastle, NE1 7RU, UK {David.Kinniment,Alex.Yakovlev}@ncl.ac.uk
A 1.62/2.7/5.4 Gbps Clock and Data Recovery Circuit for DisplayPort 1.2 with a single VCO
JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, VOL.13, NO.3, JUNE, 2013 http://dx.doi.org/10.5573/jsts.2013.13.3.185 A 1.62/2.7/5.4 Clock and Data Recovery Circuit for DisplayPort 1.2 with a single VCO
ISSCC 2003 / SESSION 13 / 40Gb/s COMMUNICATION ICS / PAPER 13.7
ISSCC 2003 / SESSION 13 / 40Gb/s COMMUNICATION ICS / PAPER 13.7 13.7 A 40Gb/s Clock and Data Recovery Circuit in 0.18µm CMOS Technology Jri Lee, Behzad Razavi University of California, Los Angeles, CA
Low Power AMD Athlon 64 and AMD Opteron Processors
Low Power AMD Athlon 64 and AMD Opteron Processors Hot Chips 2004 Presenter: Marius Evers Block Diagram of AMD Athlon 64 and AMD Opteron Based on AMD s 8 th generation architecture AMD Athlon 64 and AMD
Introduction to CMOS VLSI Design (E158) Lecture 8: Clocking of VLSI Systems
Harris Introduction to CMOS VLSI Design (E158) Lecture 8: Clocking of VLSI Systems David Harris Harvey Mudd College [email protected] Based on EE271 developed by Mark Horowitz, Stanford University MAH
Naveen Muralimanohar Rajeev Balasubramonian Norman P Jouppi
Optimizing NUCA Organizations and Wiring Alternatives for Large Caches with CACTI 6.0 Naveen Muralimanohar Rajeev Balasubramonian Norman P Jouppi University of Utah & HP Labs 1 Large Caches Cache hierarchies
White Paper Utilizing Leveling Techniques in DDR3 SDRAM Memory Interfaces
White Paper Introduction The DDR3 SDRAM memory architectures support higher bandwidths with bus rates of 600 Mbps to 1.6 Gbps (300 to 800 MHz), 1.5V operation for lower power, and higher densities of 2
Lecture 10: Latch and Flip-Flop Design. Outline
Lecture 1: Latch and Flip-Flop esign Slides orginally from: Vladimir Stojanovic Computer Systems Laboratory Stanford University [email protected] 1 Outline Recent interest in latches and flip-flops
Sequential 4-bit Adder Design Report
UNIVERSITY OF WATERLOO Faculty of Engineering E&CE 438: Digital Integrated Circuits Sequential 4-bit Adder Design Report Prepared by: Ian Hung (ixxxxxx), 99XXXXXX Annette Lo (axxxxxx), 99XXXXXX Pamela
FPGA Clocking. Clock related issues: distribution generation (frequency synthesis) multiplexing run time programming domain crossing
FPGA Clocking Clock related issues: distribution generation (frequency synthesis) Deskew multiplexing run time programming domain crossing Clock related constraints 100 Clock Distribution Device split
ICS514 LOCO PLL CLOCK GENERATOR. Description. Features. Block Diagram DATASHEET
DATASHEET ICS514 Description The ICS514 LOCO TM is the most cost effective way to generate a high-quality, high-frequency clock output from a 14.31818 MHz crystal or clock input. The name LOCO stands for
IL2225 Physical Design
IL2225 Physical Design Nasim Farahini [email protected] Outline Physical Implementation Styles ASIC physical design Flow Floor and Power planning Placement Clock Tree Synthesis Routing Timing Analysis Verification
EE 459/500 HDL Based Digital Design with Programmable Logic. Lecture 16 Timing and Clock Issues
EE 459/500 HDL Based Digital Design with Programmable Logic Lecture 16 Timing and Clock Issues 1 Overview Sequential system timing requirements Impact of clock skew on timing Impact of clock jitter on
S. Venkatesh, Mrs. T. Gowri, Department of ECE, GIT, GITAM University, Vishakhapatnam, India
Power reduction on clock-tree using Energy recovery and clock gating technique S. Venkatesh, Mrs. T. Gowri, Department of ECE, GIT, GITAM University, Vishakhapatnam, India Abstract Power consumption of
路 論 Chapter 15 System-Level Physical Design
Introduction to VLSI Circuits and Systems 路 論 Chapter 15 System-Level Physical Design Dept. of Electronic Engineering National Chin-Yi University of Technology Fall 2007 Outline Clocked Flip-flops CMOS
Fairchild Solutions for 133MHz Buffered Memory Modules
AN-5009 Fairchild Semiconductor Application Note April 1999 Revised December 2000 Fairchild Solutions for 133MHz Buffered Memory Modules Fairchild Semiconductor provides several products that are compatible
PowerPC Microprocessor Clock Modes
nc. Freescale Semiconductor AN1269 (Freescale Order Number) 1/96 Application Note PowerPC Microprocessor Clock Modes The PowerPC microprocessors offer customers numerous clocking options. An internal phase-lock
Reducing EMI and Improving Signal Integrity Using Spread Spectrum Clocking
Reducing EMI and Improving Signal Integrity Using Spread Spectrum Clocking Electromagnetic interference (EMI), once the exclusive concern of equipment designers working with high-speed signals, is no longer
The Motherboard Chapter #5
The Motherboard Chapter #5 Amy Hissom Key Terms Advanced Transfer Cache (ATC) A type of L2 cache contained within the Pentium processor housing that is embedded on the same core processor die as the CPU
DESIGN CHALLENGES OF TECHNOLOGY SCALING
DESIGN CHALLENGES OF TECHNOLOGY SCALING IS PROCESS TECHNOLOGY MEETING THE GOALS PREDICTED BY SCALING THEORY? AN ANALYSIS OF MICROPROCESSOR PERFORMANCE, TRANSISTOR DENSITY, AND POWER TRENDS THROUGH SUCCESSIVE
Harmonics and Noise in Photovoltaic (PV) Inverter and the Mitigation Strategies
Soonwook Hong, Ph. D. Michael Zuercher Martinson Harmonics and Noise in Photovoltaic (PV) Inverter and the Mitigation Strategies 1. Introduction PV inverters use semiconductor devices to transform the
Lecture 11: Sequential Circuit Design
Lecture 11: Sequential Circuit esign Outline Sequencing Sequencing Element esign Max and Min-elay Clock Skew Time Borrowing Two-Phase Clocking 2 Sequencing Combinational logic output depends on current
POWER FORUM, BOLOGNA 20-09-2012
POWER FORUM, BOLOGNA 20-09-2012 Convertitori DC/DC ad alta densità di potenza e bassa impedenza termica. Massimo GAVIOLI. Senior Field Application Engineer. Intersil SIMPLY SMARTER Challenges when Designing
EMI and t Layout Fundamentals for Switched-Mode Circuits
v sg (t) (t) DT s V pp = n - 1 2 V pp V g n V T s t EE core insulation primary return secondary return Supplementary notes on EMI and t Layout Fundamentals for Switched-Mode Circuits secondary primary
A 3.2Gb/s Clock and Data Recovery Circuit Without Reference Clock for a High-Speed Serial Data Link
A 3.2Gb/s Clock and Data Recovery Circuit Without Reference Clock for a High-Speed Serial Data Link Kang jik Kim, Ki sang Jeong, Seong ik Cho The Department of Electronics Engineering Chonbuk National
How To Design A Chip Layout
Spezielle Anwendungen des VLSI Entwurfs Applied VLSI design (IEF170) Course and contest Intermediate meeting 3 Prof. Dirk Timmermann, Claas Cornelius, Hagen Sämrow, Andreas Tockhorn, Philipp Gorski, Martin
TRUE SINGLE PHASE CLOCKING BASED FLIP-FLOP DESIGN
TRUE SINGLE PHASE CLOCKING BASED FLIP-FLOP DESIGN USING DIFFERENT FOUNDRIES Priyanka Sharma 1 and Rajesh Mehra 2 1 ME student, Department of E.C.E, NITTTR, Chandigarh, India 2 Associate Professor, Department
EE 42/100 Lecture 24: Latches and Flip Flops. Rev B 4/21/2010 (2:04 PM) Prof. Ali M. Niknejad
A. M. Niknejad University of California, Berkeley EE 100 / 42 Lecture 24 p. 1/20 EE 42/100 Lecture 24: Latches and Flip Flops ELECTRONICS Rev B 4/21/2010 (2:04 PM) Prof. Ali M. Niknejad University of California,
Abstract. Cycle Domain Simulator for Phase-Locked Loops
Abstract Cycle Domain Simulator for Phase-Locked Loops Norman James December 1999 As computers become faster and more complex, clock synthesis becomes critical. Due to the relatively slower bus clocks
Grounding Demystified
Grounding Demystified 3-1 Importance Of Grounding Techniques 45 40 35 30 25 20 15 10 5 0 Grounding 42% Case 22% Cable 18% Percent Used Filter 12% PCB 6% Grounding 42% Case Shield 22% Cable Shielding 18%
DDR subsystem: Enhancing System Reliability and Yield
DDR subsystem: Enhancing System Reliability and Yield Agenda Evolution of DDR SDRAM standards What is the variation problem? How DRAM standards tackle system variability What problems have been adequately
Evaluating AC Current Sensor Options for Power Delivery Systems
Evaluating AC Current Sensor Options for Power Delivery Systems State-of-the-art isolated ac current sensors based on CMOS technology can increase efficiency, performance and reliability compared to legacy
Photonic Networks for Data Centres and High Performance Computing
Photonic Networks for Data Centres and High Performance Computing Philip Watts Department of Electronic Engineering, UCL Yury Audzevich, Nick Barrow-Williams, Robert Mullins, Simon Moore, Andrew Moore
PCB Design Conference - East Keynote Address EMC ASPECTS OF FUTURE HIGH SPEED DIGITAL DESIGNS
OOOO1 PCB Design Conference - East Keynote Address September 12, 2000 EMC ASPECTS OF FUTURE HIGH SPEED DIGITAL DESIGNS By Henry Ott Consultants Livingston, NJ 07039 (973) 992-1793 www.hottconsultants.com
Analyzing Electrical Effects of RTA-driven Local Anneal Temperature Variation
1 Analyzing Electrical Effects of RTA-driven Local Anneal Temperature Variation Vivek Joshi, Kanak Agarwal*, Dennis Sylvester, David Blaauw Electrical Engineering & Computer Science University of Michigan,
CHAPTER 11: Flip Flops
CHAPTER 11: Flip Flops In this chapter, you will be building the part of the circuit that controls the command sequencing. The required circuit must operate the counter and the memory chip. When the teach
Jitter in PCIe application on embedded boards with PLL Zero delay Clock buffer
Jitter in PCIe application on embedded boards with PLL Zero delay Clock buffer Hermann Ruckerbauer EKH - EyeKnowHow 94469 Deggendorf, Germany [email protected] Agenda 1) PCI-Express Clocking
8 Gbps CMOS interface for parallel fiber-optic interconnects
8 Gbps CMOS interface for parallel fiberoptic interconnects Barton Sano, Bindu Madhavan and A. F. J. Levi Department of Electrical Engineering University of Southern California Los Angeles, California
White Paper: Electrical Ground Rules
Acromag, Incorporated 30765 S Wixom Rd, Wixom, MI 48393 USA Tel: 248-295-0880 Fax: 248-624-9234 www.acromag.com White Paper: Electrical Ground Rules Best Practices for Grounding Your Electrical Equipment
Asynchronous IC Interconnect Network Design and Implementation Using a Standard ASIC Flow
Asynchronous IC Interconnect Network Design and Implementation Using a Standard ASIC Flow Bradley R. Quinton Dept. of Electrical and Computer Engineering University of British Columbia [email protected]
On-Chip Interconnection Networks Low-Power Interconnect
On-Chip Interconnection Networks Low-Power Interconnect William J. Dally Computer Systems Laboratory Stanford University ISLPED August 27, 2007 ISLPED: 1 Aug 27, 2007 Outline Demand for On-Chip Networks
An All-Digital Phase-Locked Loop with High Resolution for Local On-Chip Clock Synthesis
An All-Digital Phase-Locked Loop with High Resolution for Local On-Chip Clock Synthesis Oliver Schrape 1, Frank Winkler 2, Steffen Zeidler 1, Markus Petri 1, Eckhard Grass 1, Ulrich Jagdhold 1 International
Lecture 7: Clocking of VLSI Systems
Lecture 7: Clocking of VLSI Systems MAH, AEN EE271 Lecture 7 1 Overview Reading Wolf 5.3 Two-Phase Clocking (good description) W&E 5.5.1, 5.5.2, 5.5.3, 5.5.4, 5.5.9, 5.5.10 - Clocking Note: The analysis
11. High-Speed Differential Interfaces in Cyclone II Devices
11. High-Speed Differential Interfaces in Cyclone II Devices CII51011-2.2 Introduction From high-speed backplane applications to high-end switch boxes, low-voltage differential signaling (LVDS) is the
AVX EMI SOLUTIONS Ron Demcko, Fellow of AVX Corporation Chris Mello, Principal Engineer, AVX Corporation Brian Ward, Business Manager, AVX Corporation
AVX EMI SOLUTIONS Ron Demcko, Fellow of AVX Corporation Chris Mello, Principal Engineer, AVX Corporation Brian Ward, Business Manager, AVX Corporation Abstract EMC compatibility is becoming a key design
Clock Distribution Networks in Synchronous Digital Integrated Circuits
Clock Distribution Networks in Synchronous Digital Integrated Circuits EBY G. FRIEDMAN Invited Paper Clock distribution networks synchronize the flow of data signals among synchronous data paths. The design
Lecture 10: Sequential Circuits
Introduction to CMOS VLSI esign Lecture 10: Sequential Circuits avid Harris Harvey Mudd College Spring 2004 Outline q Sequencing q Sequencing Element esign q Max and Min-elay q Clock Skew q Time Borrowing
ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2010
ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2010 Lecture 25: Clocking Architectures Sam Palermo Analog & Mixed-Signal Center Texas A&M University Announcements Project Preliminary
DirectFET TM - A Proprietary New Source Mounted Power Package for Board Mounted Power
TM - A Proprietary New Source Mounted Power Package for Board Mounted Power by Andrew Sawle, Martin Standing, Tim Sammon & Arthur Woodworth nternational Rectifier, Oxted, Surrey. England Abstract This
Chapter 6 PLL and Clock Generator
Chapter 6 PLL and Clock Generator The DSP56300 core features a Phase Locked Loop (PLL) clock generator in its central processing module. The PLL allows the processor to operate at a high internal clock
ZL40221 Precision 2:6 LVDS Fanout Buffer with Glitchfree Input Reference Switching and On-Chip Input Termination Data Sheet
Features Inputs/Outputs Accepts two differential or single-ended inputs LVPECL, LVDS, CML, HCSL, LVCMOS Glitch-free switching of references On-chip input termination and biasing for AC coupled inputs Six
Signal Types and Terminations
Helping Customers Innovate, Improve & Grow Application Note Signal Types and Terminations Introduction., H, LV, Sinewave, Clipped Sinewave, TTL, PECL,,, CML Oscillators and frequency control devices come
MODULE BOUSSOLE ÉLECTRONIQUE CMPS03 Référence : 0660-3
MODULE BOUSSOLE ÉLECTRONIQUE CMPS03 Référence : 0660-3 CMPS03 Magnetic Compass. Voltage : 5v only required Current : 20mA Typ. Resolution : 0.1 Degree Accuracy : 3-4 degrees approx. after calibration Output
Lizy Kurian John Electrical and Computer Engineering Department, The University of Texas as Austin
BUS ARCHITECTURES Lizy Kurian John Electrical and Computer Engineering Department, The University of Texas as Austin Keywords: Bus standards, PCI bus, ISA bus, Bus protocols, Serial Buses, USB, IEEE 1394
Learning Outcomes. Simple CPU Operation and Buses. Composition of a CPU. A simple CPU design
Learning Outcomes Simple CPU Operation and Buses Dr Eddie Edwards [email protected] At the end of this lecture you will Understand how a CPU might be put together Be able to name the basic components
On-chip clock error characterization for clock distribution system
On-chip clock error characterization for clock distribution system Chuan Shan, Dimitri Galayko, François Anceau Laboratoire d informatique de Paris 6 (LIP6) Université Pierre & Marie Curie (UPMC), Paris,
Phase-Locked Loop Based Clock Generators
Phase-Locked Loop Based Clock Generators INTRODUCTION As system clock frequencies reach 100 MHz and beyond maintaining control over clock becomes very important In addition to generating the various clocks
Electromagnetic. Reducing EMI and Improving Signal Integrity Using Spread Spectrum Clocking EMI CONTROL. The authors describe the
From September 2011 High Frequency Electronics Copyright 2011 Summit Technical Media, LLC Reducing EMI and Improving Signal Integrity Using Spread Spectrum Clocking By Jeffrey Batchelor and Jimmy Ma Silicon
PROGETTO DI SISTEMI ELETTRONICI DIGITALI. Digital Systems Design. Digital Circuits Advanced Topics
PROGETTO DI SISTEMI ELETTRONICI DIGITALI Digital Systems Design Digital Circuits Advanced Topics 1 Sequential circuit and metastability 2 Sequential circuit - FSM A Sequential circuit contains: Storage
Multi-GHz Systems Clocking Invited Paper
Multi-GHz Systems ing Invited Paper Vojin G. Oklobdzija, Fellow IEEE Department of Electrical Engineering, University of California, Davis [email protected] http://www.ece.ucdavis.edu/acsel Abstract:
Sequential Circuits. Combinational Circuits Outputs depend on the current inputs
Principles of VLSI esign Sequential Circuits Sequential Circuits Combinational Circuits Outputs depend on the current inputs Sequential Circuits Outputs depend on current and previous inputs Requires separating
Efficient Interconnect Design with Novel Repeater Insertion for Low Power Applications
Efficient Interconnect Design with Novel Repeater Insertion for Low Power Applications TRIPTI SHARMA, K. G. SHARMA, B. P. SINGH, NEHA ARORA Electronics & Communication Department MITS Deemed University,
ECE124 Digital Circuits and Systems Page 1
ECE124 Digital Circuits and Systems Page 1 Chip level timing Have discussed some issues related to timing analysis. Talked briefly about longest combinational path for a combinational circuit. Talked briefly
Lecture 2. High-Speed I/O
Lecture 2 High-Speed I/O Mark Horowitz Computer Systems Laboratory Stanford University [email protected] Copyright 2007 by Mark Horowitz, with material from Stefanos Sidiropoulos, and Vladimir Stojanovic
Current Probes, More Useful Than You Think
Current Probes, More Useful Than You Think Training and design help in most areas of Electrical Engineering Copyright 1998 Institute of Electrical and Electronics Engineers. Reprinted from the IEEE 1998
Application Note 58 Crystal Considerations with Dallas Real Time Clocks
www.dalsemi.com Application Note 58 Crystal Considerations with Dallas Real Time Clocks Dallas Semiconductor offers a variety of real time clocks (RTCs). The majority of these are available either as integrated
ICS379. Quad PLL with VCXO Quick Turn Clock. Description. Features. Block Diagram
Quad PLL with VCXO Quick Turn Clock Description The ICS379 QTClock TM generates up to 9 high quality, high frequency clock outputs including a reference from a low frequency pullable crystal. It is designed
ICS650-44 SPREAD SPECTRUM CLOCK SYNTHESIZER. Description. Features. Block Diagram DATASHEET
DATASHEET ICS650-44 Description The ICS650-44 is a spread spectrum clock synthesizer intended for video projector and digital TV applications. It generates three copies of an EMI optimized 50 MHz clock
International Journal of Electronics and Computer Science Engineering 1482
International Journal of Electronics and Computer Science Engineering 1482 Available Online at www.ijecse.org ISSN- 2277-1956 Behavioral Analysis of Different ALU Architectures G.V.V.S.R.Krishna Assistant
Equalization/Compensation of Transmission Media. Channel (copper or fiber)
Equalization/Compensation of Transmission Media Channel (copper or fiber) 1 Optical Receiver Block Diagram O E TIA LA EQ CDR DMUX -18 dbm 10 µa 10 mv p-p 400 mv p-p 2 Copper Cable Model Copper Cable 4-foot
Design Compiler Graphical Create a Better Starting Point for Faster Physical Implementation
Datasheet Create a Better Starting Point for Faster Physical Implementation Overview Continuing the trend of delivering innovative synthesis technology, Design Compiler Graphical delivers superior quality
Design and analysis of flip flops for low power clocking system
Design and analysis of flip flops for low power clocking system Gabariyala sabadini.c PG Scholar, VLSI design, Department of ECE,PSNA college of Engg and Tech, Dindigul,India. Jeya priyanka.p PG Scholar,
Introduction to CMOS VLSI Design
Introduction to CMOS VLSI esign Slides adapted from: N. Weste,. Harris, CMOS VLSI esign, Addison-Wesley, 3/e, 24 Introduction Integrated Circuits: many transistors on one chip Very Large Scale Integration
A PC-BASED TIME INTERVAL COUNTER WITH 200 PS RESOLUTION
35'th Annual Precise Time and Time Interval (PTTI) Systems and Applications Meeting San Diego, December 2-4, 2003 A PC-BASED TIME INTERVAL COUNTER WITH 200 PS RESOLUTION Józef Kalisz and Ryszard Szplet
inter-chip and intra-chip Harm Dorren and Oded Raz
Will photonics penetrate into inter-chip and intra-chip communications? Harm Dorren and Oded Raz On-chip interconnect networks R X R X R X R X R T X R XT X T X T X T X X Electronic on-chip interconnect
Timing Methodologies (cont d) Registers. Typical timing specifications. Synchronous System Model. Short Paths. System Clock Frequency
Registers Timing Methodologies (cont d) Sample data using clock Hold data between clock cycles Computation (and delay) occurs between registers efinition of terms setup time: minimum time before the clocking
Core Power Delivery Network Analysis of Core and Coreless Substrates in a Multilayer Organic Buildup Package
Core Power Delivery Network Analysis of Core and Coreless Substrates in a Multilayer Organic Buildup Package Ozgur Misman, Mike DeVita, Nozad Karim, Amkor Technology, AZ, USA 1900 S. Price Rd, Chandler,
(Amplifying) Photo Detectors: Avalanche Photodiodes Silicon Photomultiplier
(Amplifying) Photo Detectors: Avalanche Photodiodes Silicon Photomultiplier (no PiN and pinned Diodes) Peter Fischer P. Fischer, ziti, Uni Heidelberg, Seite 1 Overview Reminder: Classical Photomultiplier
Introduction to Cloud Computing
Introduction to Cloud Computing Parallel Processing I 15 319, spring 2010 7 th Lecture, Feb 2 nd Majd F. Sakr Lecture Motivation Concurrency and why? Different flavors of parallel computing Get the basic
A NEAR FIELD INJECTION MODEL FOR SUSCEPTIBILITY PREDICTION IN INTEGRATED CIRCUITS
ICONIC 2007 St. Louis, MO, USA June 27-29, 2007 A NEAR FIELD INJECTION MODEL FOR SUSCEPTIBILITY PREDICTION IN INTEGRATED CIRCUITS Ali Alaeldine 12, Alexandre Boyer 3, Richard Perdriau 1, Sonia Ben Dhia
Demystifying Data-Driven and Pausible Clocking Schemes
Demystifying Data-Driven and Pausible Clocking Schemes Robert Mullins Computer Architecture Group Computer Laboratory, University of Cambridge ASYNC 2007, 13 th IEEE International Symposium on Asynchronous
Lecture-3 MEMORY: Development of Memory:
Lecture-3 MEMORY: It is a storage device. It stores program data and the results. There are two kind of memories; semiconductor memories & magnetic memories. Semiconductor memories are faster, smaller,
Module 2. Embedded Processors and Memory. Version 2 EE IIT, Kharagpur 1
Module 2 Embedded Processors and Memory Version 2 EE IIT, Kharagpur 1 Lesson 5 Memory-I Version 2 EE IIT, Kharagpur 2 Instructional Objectives After going through this lesson the student would Pre-Requisite
Intel Labs at ISSCC 2012. Copyright Intel Corporation 2012
Intel Labs at ISSCC 2012 Copyright Intel Corporation 2012 Intel Labs ISSCC 2012 Highlights 1. Efficient Computing Research: Making the most of every milliwatt to make computing greener and more scalable
