ISSCC 2003 / SESSION 13 / 40Gb/s COMMUNICATION ICS / PAPER 13.7



Similar documents
ISSCC 2003 / SESSION 4 / CLOCK RECOVERY AND BACKPLANE TRANSCEIVERS / PAPER 4.7

A 1.62/2.7/5.4 Gbps Clock and Data Recovery Circuit for DisplayPort 1.2 with a single VCO

A 10-Gb/s CMOS Clock and Data Recovery Circuit with a Half-Rate Linear Phase Detector

A 3.2Gb/s Clock and Data Recovery Circuit Without Reference Clock for a High-Speed Serial Data Link

A CMOS Clock Recovery Circuit for 2.5-Gb/s NRZ Data

ISSCC 2003 / SESSION 10 / HIGH SPEED BUILDING BLOCKS / PAPER 10.5

IN RECENT YEARS, the increase of data transmission over

CLOCK and data recovery (CDR) circuits have found

BURST-MODE communication relies on very fast acquisition

Rong-Jyi YANG, Nonmember and Shen-Iuan LIU a), Member

Equalization/Compensation of Transmission Media. Channel (copper or fiber)

Clock- and data-recovery IC with demultiplexer for a 2.5 Gb/s ATM physical layer controller

Analysis and Design of Robust Multi-Gb/s Clock and Data Recovery Circuits

A 1.25-GHz 0.35-m Monolithic CMOS PLL Based on a Multiphase Ring Oscillator

A 1.7 Gbps DLL-Based Clock Data Recovery for a Serial Display Interface in 0.35-μm CMOS

Alpha CPU and Clock Design Evolution

8 Gbps CMOS interface for parallel fiber-optic interconnects

Phase Locked Loop (PLL) based Clock and Data Recovery Circuits (CDR) using Calibrated Delay Flip Flop

Clock Recovery in Serial-Data Systems Ransom Stephens, Ph.D.

A 2 Gbps to 12 Gbps Wide-Range CDR with Automatic Frequency Band Selector

TRUE SINGLE PHASE CLOCKING BASED FLIP-FLOP DESIGN

EFMPlus Data Recovery Circuit with a Fast Locking Scheme for 12X Speed DVD-ROM Drivers

High-Speed Electronics

11. High-Speed Differential Interfaces in Cyclone II Devices

CLOCK AND DATA RECOVERY CIRCUITS RUIYUAN ZHANG

Design and Modelling of Clock and Data Recovery Integrated Circuit in 130 nm CMOS Technology for 10 Gb/s Serial Data Communications

TIMING-DRIVEN PHYSICAL DESIGN FOR DIGITAL SYNCHRONOUS VLSI CIRCUITS USING RESONANT CLOCKING

DS2187 Receive Line Interface

S. Venkatesh, Mrs. T. Gowri, Department of ECE, GIT, GITAM University, Vishakhapatnam, India

International Journal of Electronics and Computer Science Engineering 1482

A 125-MHz Mixed-Signal Echo Canceller for Gigabit Ethernet on Copper Wire

LOW POWER DESIGN OF DIGITAL SYSTEMS USING ENERGY RECOVERY CLOCKING AND CLOCK GATING

Abstract. Cycle Domain Simulator for Phase-Locked Loops

Signal Types and Terminations

Loop Bandwidth and Clock Data Recovery (CDR) in Oscilloscope Measurements. Application Note

A 3 V 12b 100 MS/s CMOS D/A Converter for High- Speed Communication Systems

Clocking. Figure by MIT OCW Spring /18/05 L06 Clocks 1

A 10,000 Frames/s 0.18 µm CMOS Digital Pixel Sensor with Pixel-Level Memory

DESIGN CHALLENGES OF TECHNOLOGY SCALING

How PLL Performances Affect Wireless Systems

Laboratory 4: Feedback and Compensation

Model-Based Synthesis of High- Speed Serial-Link Transmitter Designs

Design and analysis of flip flops for low power clocking system

it4036f 120-ps Wideband Phase Delay Description Features Device Diagram Timing Diagram

A 1-GSPS CMOS Flash A/D Converter for System-on-Chip Applications

Static-Noise-Margin Analysis of Conventional 6T SRAM Cell at 45nm Technology

JITTER tolerance indicates the maximum sinusoidal jitter

LM139/LM239/LM339/LM2901/LM3302 Low Power Low Offset Voltage Quad Comparators

VARIABLE-frequency oscillators (VFO s) phase locked

Current-Controlled Slew-Rate Adjustable Trapezoidal Waveform Generators for Low- and High-Voltage Applications

Timing Errors and Jitter

A High Frequency Divider in 0.18 um SiGe BiCMOS Technology

MONOLITHIC PHASE-LOCKED LOOPS AND CLOCK RECOVERY CIRCUITS

On-chip clock error characterization for clock distribution system

A Wideband mm-wave CMOS Receiver for Gb/s Communications Employing Interstage Coupled Resonators

Clocks Basics in 10 Minutes or Less. Edgar Pineda Field Applications Engineer Arrow Components Mexico

A 40 Gb/s Clock and Data Recovery Module with Improved Phase-Locked Loop Circuits

PowerPC Microprocessor Clock Modes

Using Pre-Emphasis and Equalization with Stratix GX

A true low voltage class-ab current mirror

Design of a Reliable Broadband I/O Employing T-coil

Interconnection Network of OTA-based FPAA

INJECTION of a periodic signal into an oscillator leads

A PC-BASED TIME INTERVAL COUNTER WITH 200 PS RESOLUTION

Output Ripple and Noise Measurement Methods for Ericsson Power Modules

An All-Digital Phase-Locked Loop with High Resolution for Local On-Chip Clock Synthesis

A Survey on Sequential Elements for Low Power Clocking System

Ultra Wideband Signal Impact on IEEE802.11b Network Performance

CMOS, the Ideal Logic Family

EECS 240 Topic 7: Current Sources

Class 11: Transmission Gates, Latches

Bridgeless PFC Implementation Using One Cycle Control Technique

AN460 Using the P82B96 for bus interface

How To Test The Performance Of An Oversampling Cdr In An Fgpa, Jitter And Memory On A Black Box (Cdr) In A Test Program

Low latency synchronization through speculation

FINAL PROJECT THESIS. Ana Armendáriz Hugalde. ANALYSIS AND DESIGN OF PFDs IN CADENCE

Sequential Circuit Design

Lecture 2. High-Speed I/O

Sequential 4-bit Adder Design Report

W a d i a D i g i t a l

1997 Mixed-Signal Products SLAA011B

Bi-directional Power System for Laptop Computers

A 10GB/S FULL ON-CHIP BANG-BANG CLOCK AND DATA RECOVERY SYSTEM USING AN ADAPTIVE LOOP BANDWIDTH STRATEGY. A Thesis HYUNG-JOON JEON

Core Power Delivery Network Analysis of Core and Coreless Substrates in a Multilayer Organic Buildup Package

Topics of Chapter 5 Sequential Machines. Memory elements. Memory element terminology. Clock terminology

Programmable Single-/Dual-/Triple- Tone Gong SAE 800

A/D Converter based on Binary Search Algorithm

1+1 PROTECTION WITHOUT RELAYS USING IDT82V2044/48/48L & IDT82V2054/58/58L HITLESS PROTECTION SWITCHING

Digital to Analog Converter. Raghu Tumati

Microcontroller-based experiments for a control systems course in electrical engineering technology

Achieving New Levels of Channel Density in Downstream Cable Transmitter Systems: RF DACs Deliver Smaller Size and Lower Power Consumption

LVDS Technology Solves Typical EMI Problems Associated with Cell Phone Cameras and Displays

Duobinary Modulation For Optical Systems

Transcription:

ISSCC 2003 / SESSION 13 / 40Gb/s COMMUNICATION ICS / PAPER 13.7 13.7 A 40Gb/s Clock and Data Recovery Circuit in 0.18µm CMOS Technology Jri Lee, Behzad Razavi University of California, Los Angeles, CA Clock and data recovery (CDR) circuits operating at tens of gigabits per second pose difficult challenges with respect to speed, jitter, signal distribution, and power consumption. Recent integration of 10Gb/s receivers in CMOS technology encourages further research on CMOS solutions for higher speeds, especially if it leads to low-voltage, low-power realizations. The design and experimental verification of a 40Gb/s CMOS phase-locked CDR is described. Shown in Fig. 13.7.1, the architecture incorporates a multiphase voltage-controlled oscillator (VCO), a quarter-rate phase detector (PD), a voltage-to-current (V/I) converter, and a simple loop filter. The PD uses the halfquadrature phases provided by the VCO to sample the input data every 12.5 ps, thereby detecting data edges and determining whether the clock is early or late. Four of these samples fall in the center of the data eye, retiming and demultiplexing the 40Gb/s input into four 10Gb/s outputs. In the absence of data transitions, the V/I converter generates no output current, leaving the oscillator control line undisturbed. With quarter-rate sampling, the flipflops hold time can be four times as long as that required in full-rate operation, but their acquisition speed must still guarantee correct sampling of the input bits in less than 25ps. The speed, jitter, and driving capability required of the oscillator point to the use of an LC realization. Coupled oscillators [1] operate away from the resonance frequency of the tanks to create the required phase shift, bearing a trade-off between reliability of oscillation and the phase noise. The multiphase oscillator in [2] drives transmission lines by a gain stage loaded by resistors, incurring energy loss in each cycle. The multiphase oscillator introduced here is based on the concept of differential stimulus of a closed-loop transmission line at equally-spaced points. Illustrated in Fig. 13.7.2a, the circuit sustains a phase separation of 180 O at diagonally-opposite nodes, providing 45 O phase steps in between. Unlike coupled oscillators, this circuit does not operate away from the resonance frequency. Also, in contrast to the design in [2], the transmission line requires no termination resistors, displaying lower phase noise and larger voltage swings for a given power dissipation and inductor Q. The topology of Fig.13.7.2a nonetheless necessitates long interconnects between the nodes and their corresponding G m cells. However, since diagonally-opposite inductors carry currents that are 180 O out of phase, the circuit can be modified as shown in Fig. 13.7.2b, where inductor elements are grouped into differential structures and the G m cells are placed in close proximity of the oscillator nodes. SpectreRF simulations indicate that, for a given power dissipation, inductor Q, and frequency of oscillation, the proposed oscillator achieves twice the voltage swings and 12 db lower phase noise than that in [2]. Each differential port of the VCO is buffered by an inductivelyloaded differential pair. These buffers (1) isolate the VCO from the long interconnects going to the PDs that would otherwise introduce greater uncertainty in the oscillation frequency; (2) generate voltage swings above the supply voltage, driving the flipflops efficiently; and (3) isolate the VCO from the data edges coupled through the phase detectors. The PD employs eight flipflops to strobe the data at 12.5ps intervals (Fig. 13.7.3). In a manner similar to an Alexander topology [3], the PD compares every two consecutive samples by means of an XOR gate, generating a net output current if the two are unequal, i.e., if an edge has occurred. With no data transitions, the FFs produce equal outputs, and the V/I converters a zero current. Figure 13.7.4 depicts the master-slave flipflop used in the phase detector. Here, NMOS switches M 1 and M 2 sample D in on the parasitic capacitances at nodes X and Y when CK is high. Since the minimum input common-mode (CM) level is dictated by the gatesource voltage of M 3 -M 4 and the headroom required by I SS, the sampling switches experience only an overdrive voltage of 0.5V even if CK reaches V DD, failing to provide fast sampling. This issue is remedied by setting the CM level of CK and CK equal to V DD, a choice afforded by the inductively-loaded VCO buffer. The peak value of CK thus exceeds V DD by 0.8V, more than doubling the sampling speed of M 1 and M 2. With large clock swings available, the current switching in pairs M 5 -M 6, M 7 -M 8 and M 9 -M 10 is accomplished by gate control rather than conventional source-coupled steering. The proposed topology offers two advantages: (1) since the tail current source is removed, M 11 -M 13 can be quite narrower, presenting a smaller capacitance to the VCO buffer; (2) since the drain currents of M 11 - M 13 are not limited by a tail current source, these transistors experience class AB switching, drawing a large current at the peak of the clock swing and providing greater voltage swings and a higher gain in the data path. The CDR circuit has been fabricated in a 0.18µm CMOS technology. Figure 13.7.5 shows a photograph of the die, which measures 1.4mm 2. Figure 13.7.6 depicts the CDR input and output waveforms under locked condition in response to a pseudo-random sequence of length 2 31-1. The demultiplexed data experiences some ISI, but if further demultiplexing is included on the same chip, the ISI can be tolerated. The recovered clock, suggesting an rms jitter of 1.756ps and a peak-to-peak jitter of 9.67ps is displayed in Fig. 13.7.7. However, as shown in the inset, the oscilloscope itself suffers from rms and peak-to-peak jitters of 1.508ps and 8.89ps, respectively. Thus, the CDR output contains a jitter of 0.9ps,rms and at most 9.67ps,pp. Preliminary measurements indicate a bit error rate of 10-6, demonstrating correct data recovery and demultiplexing. The overall power consumption excluding the output buffers is 144mW from a 2V supply. Acknowledgments The authors thank Multilink, Inc., for measurement support. This work was supported by Semiconductor Research Corporation under contract number 98-HJ-640. References [1] J. Kim and B. Kim, A Low Phase-Noise CMOS LC Oscillator with a Ring Structure, ISSCC Dig. of Tech. Papers, pp. 430-431, Feb. 2000. [2] J. E. Rogers and J. R. Long, A 10Gb/s CDR/DEMUX with LC Delay Line VCO in 0.18µm CMOS, ISSCC Dig. of Tech. Papers, pp. 254-255, Feb. 2002. [3] J. D. H. Alexander, Clock Recovery from Random Binary Data, Electronics Letters, vol. 11, pp. 541-542, Oct. 1975.

ISSCC 2003 / February 11, 2003 / Salon 7 / 4:45 PM Figure 13.7.1: CDR architecture. Figure 13.7.2: (a) Multiphase oscillator, (b) modification of (a). 13 Figure 13.7.3: Quarter-rate phase detector. Figure 13.7.4: Proposed flipflop. Figure 13.7.6: Input and outputs of CDR circuit (horizontal scale: 50 ps/div., vertical scale 100mV/div.). Figure 13.7.7: Measured clock and oscilloscope jitter (horizontal scale: 5ps/div).

Figure 13.7.5: Chip micrograph. 13

Figure 13.7.1: CDR architecture.

Figure 13.7.2: (a) Multiphase oscillator, (b) modification of (a).

Figure 13.7.3: Quarter-rate phase detector.

Figure 13.7.4: Proposed flipflop.

Figure 13.7.5: Chip micrograph. 2003 IEEE International Solid-State Circuits Conference 0-7803-7707-9/03/$17.00 2003 IEEE

Figure 13.7.6: Input and outputs of CDR circuit (horizontal scale: 50 ps/div., vertical scale 100mV/div.).

Figure 13.7.7: Measured clock and oscilloscope jitter (horizontal scale: 5ps/div).