Clock- and data-recovery IC with demultiplexer for a 2.5 Gb/s ATM physical layer controller

Similar documents
A 3.2Gb/s Clock and Data Recovery Circuit Without Reference Clock for a High-Speed Serial Data Link

ISSCC 2003 / SESSION 13 / 40Gb/s COMMUNICATION ICS / PAPER 13.7

IN RECENT YEARS, the increase of data transmission over

8 Gbps CMOS interface for parallel fiber-optic interconnects

A 1.62/2.7/5.4 Gbps Clock and Data Recovery Circuit for DisplayPort 1.2 with a single VCO

Rong-Jyi YANG, Nonmember and Shen-Iuan LIU a), Member

BURST-MODE communication relies on very fast acquisition

Clocking. Figure by MIT OCW Spring /18/05 L06 Clocks 1

MONOLITHIC PHASE-LOCKED LOOPS AND CLOCK RECOVERY CIRCUITS

Alpha CPU and Clock Design Evolution

High-Speed Electronics

A 1.25-GHz 0.35-m Monolithic CMOS PLL Based on a Multiphase Ring Oscillator

ISSCC 2003 / SESSION 4 / CLOCK RECOVERY AND BACKPLANE TRANSCEIVERS / PAPER 4.7

Abstract. Cycle Domain Simulator for Phase-Locked Loops

A High Frequency Divider in 0.18 um SiGe BiCMOS Technology

A CMOS Clock Recovery Circuit for 2.5-Gb/s NRZ Data

A 10-Gb/s CMOS Clock and Data Recovery Circuit with a Half-Rate Linear Phase Detector

Analysis and Design of Robust Multi-Gb/s Clock and Data Recovery Circuits

Topics of Chapter 5 Sequential Machines. Memory elements. Memory element terminology. Clock terminology

Chapter 6 PLL and Clock Generator

Design and Modelling of Clock and Data Recovery Integrated Circuit in 130 nm CMOS Technology for 10 Gb/s Serial Data Communications

An All-Digital Phase-Locked Loop with High Resolution for Local On-Chip Clock Synthesis

Managing High-Speed Clocks

Loop Bandwidth and Clock Data Recovery (CDR) in Oscilloscope Measurements. Application Note

CHARGE pumps are the circuits that used to generate dc

Content Map For Career & Technology

A 10,000 Frames/s 0.18 µm CMOS Digital Pixel Sensor with Pixel-Level Memory

Equalization/Compensation of Transmission Media. Channel (copper or fiber)

A 1-GSPS CMOS Flash A/D Converter for System-on-Chip Applications

A 2 Gbps to 12 Gbps Wide-Range CDR with Automatic Frequency Band Selector

EFMPlus Data Recovery Circuit with a Fast Locking Scheme for 12X Speed DVD-ROM Drivers

TRUE SINGLE PHASE CLOCKING BASED FLIP-FLOP DESIGN

Phase Locked Loop (PLL) based Clock and Data Recovery Circuits (CDR) using Calibrated Delay Flip Flop

Clock Distribution in RNS-based VLSI Systems

How PLL Performances Affect Wireless Systems

Model-Based Synthesis of High- Speed Serial-Link Transmitter Designs

Power Reduction Techniques in the SoC Clock Network. Clock Power

PowerPC Microprocessor Clock Modes

S. Venkatesh, Mrs. T. Gowri, Department of ECE, GIT, GITAM University, Vishakhapatnam, India

6.976 High Speed Communication Circuits and Systems Lecture 1 Overview of Course

Static-Noise-Margin Analysis of Conventional 6T SRAM Cell at 45nm Technology

Programmable Single-/Dual-/Triple- Tone Gong SAE 800

(12) (10) Patent N0.: US 6,614,314 B2 d Haene et al. 45 Date 0f Patent: Se (54) NON-LINEAR PHASE DETECTOR FOREIGN PATENT DOCUMENTS

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2010

CLOCK and data recovery (CDR) circuits have found

CLOCK AND DATA RECOVERY CIRCUITS RUIYUAN ZHANG

Digital to Analog Converter. Raghu Tumati

DS2187 Receive Line Interface

International Journal of Electronics and Computer Science Engineering 1482

A 40 Gb/s Clock and Data Recovery Module with Improved Phase-Locked Loop Circuits

Clock Recovery in Serial-Data Systems Ransom Stephens, Ph.D.

Introduction to CMOS VLSI Design

A Laser Scanner Chip Set for Accurate Perception Systems

Timing Errors and Jitter

TIMING-DRIVEN PHYSICAL DESIGN FOR DIGITAL SYNCHRONOUS VLSI CIRCUITS USING RESONANT CLOCKING

Any-Rate Precision Clocks

High-Frequency Integrated Circuits

Clocks Basics in 10 Minutes or Less. Edgar Pineda Field Applications Engineer Arrow Components Mexico

A high-speed interconnect network using ternary logic

路 論 Chapter 15 System-Level Physical Design

OPERATIONAL AMPLIFIER

VARIABLE-frequency oscillators (VFO s) phase locked

Electronics. Discrete assembly of an operational amplifier as a transistor circuit. LD Physics Leaflets P

11. High-Speed Differential Interfaces in Cyclone II Devices

Design and analysis of flip flops for low power clocking system

8B/10B Coding 64B/66B Coding

Simplifying System Design Using the CS4350 PLL DAC

A NEAR FIELD INJECTION MODEL FOR SUSCEPTIBILITY PREDICTION IN INTEGRATED CIRCUITS

A Gigabit Transceiver for Data Transmission in Future HEP Experiments and An overview of optoelectronics in HEP

CMOS, the Ideal Logic Family

Computer Aided Design of Home Medical Alert System

Computer Network. Interconnected collection of autonomous computers that are able to exchange information

RF Network Analyzer Basics

Physical Layer, Part 2 Digital Transmissions and Multiplexing

Application Note SAW-Components

LOW POWER DESIGN OF DIGITAL SYSTEMS USING ENERGY RECOVERY CLOCKING AND CLOCK GATING

USB 3.0 CDR Model White Paper Revision 0.5

The Future of Multi-Clock Systems

Flip-Flops, Registers, Counters, and a Simple Processor

Programmable Logic Design Grzegorz Budzyń Lecture. 10: FPGA clocking schemes

EE 42/100 Lecture 24: Latches and Flip Flops. Rev B 4/21/2010 (2:04 PM) Prof. Ali M. Niknejad

On-chip clock error characterization for clock distribution system

Two-Phase Clocking Scheme for Low-Power and High- Speed VLSI

Optimizing VCO PLL Evaluations & PLL Synthesizer Designs

Signal Types and Terminations

Chapter 6: From Digital-to-Analog and Back Again

Fiber Optic Communications Educational Toolkit

Source-Synchronous Serialization and Deserialization (up to 1050 Mb/s) Author: NIck Sawyer

Sentinel-SSO: Full DDR-Bank Power and Signal Integrity. Design Automation Conference 2014

How To Test The Performance Of An Oversampling Cdr In An Fgpa, Jitter And Memory On A Black Box (Cdr) In A Test Program

Jitter Transfer Functions in Minutes

FURTHER READING: As a preview for further reading, the following reference has been provided from the pages of the book below:

Design Verification & Testing Design for Testability and Scan

How To Make A Power Source From A Power Supply

Use and Application of Output Limiting Amplifiers (HFA1115, HFA1130, HFA1135)

Introduction to CMOS VLSI Design (E158) Lecture 8: Clocking of VLSI Systems

It explains the differences between the Plesiochronous Digital Hierarchy and the Synchronous Digital Hierarchy.

Glitch Free Frequency Shifting Simplifies Timing Design in Consumer Applications

Transcription:

Downloaded from orbit.dtu.dk on: Jan 04, 2016 Clock and datarecovery IC with demultiplexer for a 2.5 Gb/s ATM physical layer controller Hansen, Flemming; Salama, C.A.T. Published in: Proceedings of the IEEE International Symposium on Circuits and Systems DOI: 10.1109/ISCAS.1996.541949 Publication date: 1996 Document Version Publisher final version (usually the publisher pdf) Link to publication Citation (APA): Hansen, F., & Salama, C. A. T. (1996). Clock and datarecovery IC with demultiplexer for a 2.5 Gb/s ATM physical layer controller. In Proceedings of the IEEE International Symposium on Circuits and Systems: Connecting the World. (Vol. Volume 4, pp. 125128). IEEE. 10.1109/ISCAS.1996.541949 General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. Users may download and print one copy of any publication from the public portal for the purpose of private study or research. You may not further distribute the material or use it for any profitmaking activity or commercial gain You may freely distribute the URL identifying the publication in the public portal? If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim.

CLOCK AND DATA.RECOVERY IC WITH DEMULTIPLEXER FOR A 2.5GB/S ATM PHYSICAL LAYER CONTROLLER Flemming Hansen Center for Broadband Telecommunications Dep. of Electromagnetic Systems Technical University of Denmark Building 348 DK:2800 Lyngby, Denmark fh@emi.dtu.dk C. Andre T. Salama VLSI Research Group Dep. of Electrical and Computer Engineering University of Toronto 10 King s College Road M5S 1A4, Ontario, Canada salama@vrg.utoronto.ca ABSTRACT A Clock and DataRecovery (CDR) IC for a Physical Layer Controller in an Asynchronous Transfer Mode (ATM) system operating at a bit rate of 2.488Gb/s is presented. The circuit was designed and fabricated in a 0.8pm BiCMOS process featuring 13GHz ft bipolar transistors. Clockrecovery is accomplished with a PhaseLocked Loop (PLL). The PLL uses a Phaseand Frequency Detector (PFD) to increase the pullin range. No external components are required. A novel Voltage Controlled Oscillator (Vco) generating both inphase and quadrature clocks, required by the PFD, is presented. The CDR includes a 1:s demultiplexer with bitrotation. Emitter Coupled Logic (ECL) is used in the PLL, dataregeneration and part of the demultiplexer, while the lowspeed parts of the demultiplexer are implemented in dynamic CMOS using the Bue SinglePhased Clock (TSPC) approach. 1. INTRODIJCTION The Clock and DataRecovery circuit is part of a Physical Layer Controller for a 2.5Gb/s Asynchronous Transfer Mode (ATM) system. In optical fiber transmission only the data signal is transmitted, the reference clock signal must be generated at the receiving end from the data signal. Two popular methods for extracting the clock information from the data signal are narrowband filters [l] and phase locked loops [a]. To reduce the bandwidth of the optical signal, a Non Return to Zero (NRZ) coding is usually chosen, although clock extraction is harder for this coding scheme than other coding schemes, e.g. Return to Zero (RZ), since an NRZ signal contains n.o energy at the clock fre quency. When using narrowband filters it is therefore necessary to preprocess the NRZ signal in a nonlinear element before filtering the clock signal from the data. In addition to preprocessing, #a narrowband filter is usually bulky, and not suited for integration. A better suited method for integration is to use a Phase Locked Loop (PLL) where the phase of a local oscillator is aligned with the phase of the incoming data, using a feedback loop. This soliition and its implementation in a O.8pm BiCMOS process featuring 13GHz ft bipolar transistors [3] is presented in this paper. 2. CDR ARCHITECTURE A block diagram of the CDR circuit is shown in Figure 1. The Phase and Frequency Detector (PFD) compares the arriving data with the clock generated by the Voltage Controlled Oscillator (VCO), generating an errorsignal that is filtered in the Loop Filter (LF) and fed back to the VCO. A basic MasterSlave flipflop is used to regenerate and syrtchronize the data, using the clock signal firom the VC8. MSDFF Figure 1. CDR block diagram The PFD used, and illustrated in Figure 2, is a minor variation of one presented by Pottbiicker et al [2]. The PFD consist of two identical PhaseDetectors (PD), a FrequencyDetector (FD) and a Loop Filter Driver. In the two PDs the data is sampled with the inphase and 0780330730/96/$5.OO @1996 IEEE 12 5

quadrature clocks respectively, resulting in two beat notes, at nodes &I and Q2, with a frequency equal to the difference between the data bit rate and the VCO clock frequency. By examining the phase relation between the two beat notes, the FD can determine whether the VCO frequency is higher or lower than the lclk Data Phase Detector I. Phase Detector oi 1 LF 03a frequency Detector * 02 d To LF Driver In previous work [2], the outputs from one PD and the FD are summed in the analogue domain, resulting in a ternary output from the PFD. To avoid this, and associated problems, the PFD outputs are processed digitally in the Loop Filter Driver. The acquisition procedure consists of two stages, frequencyacquisition and phaseacquisition. During frequencyacquisition, the PD output will have a duty cycle very close to 50%, and can therefore not be used to drive the VCO towards lock. In this case the outputs from the FD, given in Table 1, are used to make sure that the VCO control voltage is pulled in the right direction. Due to the digital nature of the control loop, there is no steadystate for the VCO control voltage. The active area occupied by the PLL is 0.54mm2 and the power dissipation is approximately 250mW. t Table 1. Phase and Frequency Detector states 3. VOLTAGE CONTROLLED OSCILLATOR The PFD used requires two clock signals with a 90 degree phase shift, if the phase shift is not exactly 90 degrees the PFD will still work, but with an asymmetric pullin range. A VCO that generates quadrature clock signals with an accurate phase shift is shown in Figure 3, it is based on the VCO presented in 141 by Razavi and Sung. In the process available to us, the VCO in [4] would give an oscillation frequency of approximately 5GHz. I I I I W k Figure 3. VCO schematic However, the target frequency of our work is 2.488GHz (STM16/STS48), and this factor of two can be used to get both inphase and quadrature outputs from the VCO. To reduce the center frequency of the VCO to the required 2.5GHz, a 6stage ringoscillator is used, with two ringstages between the inputs of transconductance amplifiers connected to the same common load. By connecting transconductance amplifiers to the remaining ring stage outputs, an exact quadrature output is achieved. The time between transitions on the clock is the delay through two ringstages, which means that unequal loading of the ringstages directly translates into jitter. To eliminate this cause of jitter, a layout with abutted cells was used, alternating between ringstages and transconductance amplifiers, eliminating the need for interconnect in the ring. To generate the ring, we used the fact that any odd number of inversions in the ring will give oscillation, and that the transconductance stages can be either inverting or noninverting, by interchanging the outputs. The ringstages are basic ECL buffers, with the VCO control voltage applied to the current sources in the emitterfollowers, to reduce variations in voltage swing. An additional control voltage for the current sources in the differential stage of the ECL buffers is available, and can be used to adjust the center frequency of the VCO. The VCO occupies an area of 0.11mm2, with a power dissipation of approximately 150mW, and a VCO tuning range of 400MHz centered around 2.5GHz. 4. DEMULTIPLEXER The block diagram for the demultiplexer is shown in Figure 4. A basic treestructure with MasterSlave (MS) and PhaseShifted (PS) flipflops is used, with dataregeneration performed by the first flipflop. The clock can be supplied by either the onchip PLL or an external clock generator. The 1:4 operation is implemented in ECL, while the final stage, as well as the bitrotation is implemented in CMOS using the True SinglePhased Clock (TSPC) approach [5]. In the final stage positive and nega 12 6

Figure 4. Demultiplexer schematic tive edgetriggered dynamic CMOS Aipflops are used. To synchronize the outputs, Ptype latches are added to the outputs of the positive edgetriggered CMOS flipflops. The size of the demultiplexer including bitrotation block is 0.42mm2, power dissipation is approximately 400mW. 4.2. Bitrotation The presented circuit is part of a Physical Layer Controller for cellbased ATM. During the celldelineation procedure byte and cell boundaries must be determined, by scanning the bitstream for valid Header Error Control (HEC) fields [6]. The logical operation of the demultiplexer is to grab eight consecutive bits from the serial bitstream, and present them tal the subsequent circuitry in parallel. However, at startup the eight bits will most likely not correspond to a single octet. During the cell delineation procedure, it is therefore necessary to be able to rotate the bits, corresponding to moving the octetwindow in the serial bitstream. This operation is not unique to an ATM Physical Layer Controller, and is therefore implemented with the demultiplexer, to facilitate the use of the demultiplexer in other systems. A schematic for the bitroation circuit is shown in Figure 6. 4.1. ECLtoCMOS converter The ECLtoCMOS converter used here uses a current mode approach, to avoid having MOS transistors directly driven by lowswing nodes. A schematic for the ECLtoCMOS converter is shown in Figure 5. Figure 5. ECLtoCMOS converter schematic The converter consists of a Current Mode Logic (CML) stage, three CMOS current mirrors and a CMOS inverter for better driving ability. If the input A is high, the collector current of Q1 is mirrored through the NMOS current mirror of M2 and MI, and further mirrored through the PMOS current mirror Ms and M7, pulling down the input to the inverter, thus resulting in a rising output. If the input is low, i.e. Abar is high, the collector current of Q2 is mirrored through M3 and M4, pulling the input to the inverter high, and thereby resulting in a low output. Figure 6. Bitrotation schematic 5. FABRICATION The presented circuit has been fabricated in a standard 0.8pm BiCMOS process featuring 13GHz ft bipolar transistors [3], ak Northern Telecom, through Canadian Microelectronics Corporation (CMC). A chip photo is shown in Figure 7. The chip contains several independent teststructures, to facilitate testing and characterization of the individual blocks. In the lower right of 127

Figure 7, the VCO can be seen, while the upper right shows the CDR block, corresponding to Figure 1. The left hand side of Figure 7 contains the full circuit with CDR and demultiplexer. The total area of the chip is 7.3mm2. 3 2.5 0 Y 2 1.5 1.6 1.8 2 2.2 2.4 Control Voltage Figure 7. Micrograph of CDR with demultiplexer 6. MEASUREMENT RESULTS Since it is the determining factor in proving the feasibility of the new architecture and obtaining the required speed, preliminary measurements were concentrated on the VCO. Measurements on the performance of the full circuit will be presented at the conference. Figure 8 shows the measured and simulated VCO oscillation frequency versus the applied control voltage. Simulation results are shown for worst, best and typical process corners. It is seen that the measured oscillation frequency is a bit lower than expected from simulations, but within what could be caused by process variations. Adjustments to the current sources in the CML stages of the VCO ring can easily be used to move the entire curve up or down, thereby centering the VCO tuning range around the target frequency of 2.5GHz. 7. CONCLUSION In this paper a clock and datarecovery circuit, combined with a demultiplexer and bitrotation circuitry, was presented. This circuit is an important part of a fully integrated Physical Layer Controller for a 2.5Gb/s ATM system, but can be used in other systems operating at this bit rate. The circuit was implemented in BiCMOS, with the highspeed parts using Emitter Coupled Logic, while dynamic CMOS was used for the last demultiplexing stage and for the bitrotation logic. Figure 8. VCO frequency vs control voltage As part of the PLL, a novel VCO configuration, generating highspeed quadrature clock signals, was presented. 8. ACKNOWLEDGEMENTS The authors would like to thank the Danish Research Academv (Forskerakademiet) and Micronet for finan I\ cia1 support. REFERENCES [l] Z. Wang, U. Langmann & B.G. Bosch, Multi Gb/s silicon bipolar clock recovery IC, IEEE J. Selected Areas Commun., vol. 9, pp. 656663, 1991. [2] A. Pottbacker, U. Langmann & H. Schreiber, A Si Bipolar Phase and Frequency Detector IC for Clock Extraction up to 8 Gb/s, IEEE J. Solid State Circuits, vol. 27, pp. 17471751, 1992. [3] R. Hadaway et al, A SubMicron BiCMOS Technology for Telecommunications, Proceedings, 21st European Solid State Device Research Conference (ESSDERC 911, pp. 513516, 1991. [4] B. Razavi & J. Sung, A 6GHz 60 mw BiC MOS PhaseLocked Loop, IEEE J. Solidstate Circuits, vol. 29, pp. 15601565, 1994. [5] J.R. Yuan & C. Svensson, Highspeed CMOS circuit Technique, IEEE J. Solidstate Circuits, vol. 24, pp. 6270, 1989. [6] BISDN UserNetwork Interface Physical Layer Specification, 1.432, ITUT, 1993. 12 8