SA 19.1: A 10Gb/s Si-Bipolar TX/RX Chipset for Computer Data Transmission

Similar documents
Multi-Gigabit Interfaces for Communications/Datacomm

A 3.2Gb/s Clock and Data Recovery Circuit Without Reference Clock for a High-Speed Serial Data Link

ISSCC 2003 / SESSION 13 / 40Gb/s COMMUNICATION ICS / PAPER 13.7

8 Gbps CMOS interface for parallel fiber-optic interconnects

A 10-Gb/s CMOS Clock and Data Recovery Circuit with a Half-Rate Linear Phase Detector

Managing High-Speed Clocks

ISSCC 2003 / SESSION 4 / CLOCK RECOVERY AND BACKPLANE TRANSCEIVERS / PAPER 4.7

A 1.62/2.7/5.4 Gbps Clock and Data Recovery Circuit for DisplayPort 1.2 with a single VCO

Clocking. Figure by MIT OCW Spring /18/05 L06 Clocks 1

BURST-MODE communication relies on very fast acquisition

Clock- and data-recovery IC with demultiplexer for a 2.5 Gb/s ATM physical layer controller

DS2187 Receive Line Interface

IN RECENT YEARS, the increase of data transmission over

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2010

Pericom PCI Express 1.0 & PCI Express 2.0 Advanced Clock Solutions

Jitter Transfer Functions in Minutes

Jitter Budget for 10 Gigabit Ethernet Applications with SiTime SiT9120/1 Oscillators

TIMING-DRIVEN PHYSICAL DESIGN FOR DIGITAL SYNCHRONOUS VLSI CIRCUITS USING RESONANT CLOCKING

Selecting the Optimum PCI Express Clock Source

11. High-Speed Differential Interfaces in Cyclone II Devices

Simplifying System Design Using the CS4350 PLL DAC

A Laser Scanner Chip Set for Accurate Perception Systems

6.976 High Speed Communication Circuits and Systems Lecture 1 Overview of Course

Achieving New Levels of Channel Density in Downstream Cable Transmitter Systems: RF DACs Deliver Smaller Size and Lower Power Consumption

ICS379. Quad PLL with VCXO Quick Turn Clock. Description. Features. Block Diagram

A 10,000 Frames/s 0.18 µm CMOS Digital Pixel Sensor with Pixel-Level Memory

Fairchild Solutions for 133MHz Buffered Memory Modules

76-77 GHz RF Transmitter Front-end for W-band Radar Applications

The Future of Multi-Clock Systems

Explore Efficient Test Approaches for PCIe at 16GT/s Kalev Sepp Principal Engineer Tektronix, Inc

Lecture 2. High-Speed I/O

A receiver TDC chip set for accurate pulsed time-of-flight laser ranging

Clock Recovery in Serial-Data Systems Ransom Stephens, Ph.D.

Analysis and Design of Robust Multi-Gb/s Clock and Data Recovery Circuits

VITESSE SEMICONDUCTOR CORPORATION. 16:1 Multiplexer. Timing Generator. CMU x16

An All-Digital Phase-Locked Loop with High Resolution for Local On-Chip Clock Synthesis

Product Specification PE9304

Timing Errors and Jitter

Phase-Locked Loop Based Clock Generators

A CMOS Clock Recovery Circuit for 2.5-Gb/s NRZ Data

PCI-SIG ENGINEERING CHANGE NOTICE

Model-Based Synthesis of High- Speed Serial-Link Transmitter Designs

ICS SPREAD SPECTRUM CLOCK SYNTHESIZER. Description. Features. Block Diagram DATASHEET

Chapter 6 PLL and Clock Generator

High-Speed, 5 V, 0.1 F CMOS RS-232 Driver/Receivers ADM202/ADM203

DM Segment Decoder/Driver/Latch with Constant Current Source Outputs

8B/10B Coding 64B/66B Coding

Design and Modelling of Clock and Data Recovery Integrated Circuit in 130 nm CMOS Technology for 10 Gb/s Serial Data Communications

A Gigabit Transceiver for Data Transmission in Future HEP Experiments and An overview of optoelectronics in HEP

White Paper Utilizing Leveling Techniques in DDR3 SDRAM Memory Interfaces

Lecture 10: Latch and Flip-Flop Design. Outline

W a d i a D i g i t a l

PAC52XX Clock Control Firmware Design

Volumes. Goal: Drive optical to high volumes and low costs

XR-T5683A PCM Line Interface Chip

A 1.7 Gbps DLL-Based Clock Data Recovery for a Serial Display Interface in 0.35-μm CMOS

Application Note. PCIEC-85 PCI Express Jumper. High Speed Designs in PCI Express Applications Generation GT/s

Topics of Chapter 5 Sequential Machines. Memory elements. Memory element terminology. Clock terminology

Dispersion penalty test 1550 Serial

PECL and LVDS Low Phase Noise VCXO (for MHz Fund Xtal) XIN XOUT N/C N/C CTRL VCON (0,0) OESEL (Pad #25) 1 (default)

PowerPC Microprocessor Clock Modes

Alpha CPU and Clock Design Evolution

All-optical fiber-optics networks

Oscilloscope Bandwidth Requirements for Emerging Serial Data Interfaces

24-Bit Analog-to-Digital Converter (ADC) for Weigh Scales FEATURES S8550 VFB. Analog Supply Regulator. Input MUX. 24-bit Σ ADC. PGA Gain = 32, 64, 128

IDT80HSPS1616 PCB Design Application Note - 557

Application Note 83 Fundamentals of RS 232 Serial Communications

PL-277x Series SuperSpeed USB 3.0 SATA Bridge Controllers PCB Layout Guide

Evaluating AC Current Sensor Options for Power Delivery Systems

PS25202 EPIC Ultra High Impedance ECG Sensor Advance Information

High-Frequency Integrated Circuits

Source-Synchronous Serialization and Deserialization (up to 1050 Mb/s) Author: NIck Sawyer

Jitter in PCIe application on embedded boards with PLL Zero delay Clock buffer

Methode Electronics. DM-317-XXXX 40 Gbps QSFP+ Passive Cable RoHS COMPLIANT.

ICS514 LOCO PLL CLOCK GENERATOR. Description. Features. Block Diagram DATASHEET

A High Frequency Divider in 0.18 um SiGe BiCMOS Technology

High-Speed Electronics

DS2186. Transmit Line Interface FEATURES PIN ASSIGNMENT

CHARGE pumps are the circuits that used to generate dc

Using the On-Chip Signal Quality Monitoring Circuitry (EyeQ) Feature in Stratix IV Transceivers

Accurate Loss-of-Signal Detection in 10Gbps Optical Receivers using the MAX3991

Modification Details.

Title: Low EMI Spread Spectrum Clock Oscillators

CIRCUITS AND SYSTEMS- Assembly and Printed Circuit Board (PCB) Package Mohammad S. Sharawi ASSEMBLY AND PRINTED CIRCUIT BOARD (PCB) PACKAGE

Clocks Basics in 10 Minutes or Less. Edgar Pineda Field Applications Engineer Arrow Components Mexico

Chapter 9 Latches, Flip-Flops, and Timers

PCI Express: The Evolution to 8.0 GT/s. Navraj Nandra, Director of Marketing Mixed-Signal and Analog IP, Synopsys

Web Streamed SDR s in Service at GB2RHQ Hack Green Bunker

USB 3.0 CDR Model White Paper Revision 0.5

Bus Interconnect Evolution in embedded computing

LONGLINE QSFP+ SR4. Features. Applications. Description. Page 1 of 13

Signal Types and Terminations

Rong-Jyi YANG, Nonmember and Shen-Iuan LIU a), Member

Abstract. Cycle Domain Simulator for Phase-Locked Loops

Table 1 SDR to DDR Quick Reference

Next-Generation BTL/Futurebus Transceivers Allow Single-Sided SMT Manufacturing

Transcription:

SA 19.1: A 10Gb/s Si-Bipolar TX/RX Chipset for Computer Data Transmission Richard C. Walker, Kuo-Chiang Hsieh, Thomas A. Knotts, Chu-Sun Yen Hewlett-Packard Laboratories, Palo Alto, CA With Internet host counts doubling every five to seven months, there is a pressing need for high-speed interconnect circuits in routers, switches, and computer systems [1]. These transmitter (TX) and receiver (RX) chips for 10Gb/s serial data transmission provide clock generation, 16:1 multiplexing, clock recovery, 1:16 demultiplexing and loss of signal (LOS) detection. No errors are seen in a 24-hour test period across 21 feet of 0.190" diameter coax, implying a BER better than 10-14 and demonstrating the feasibility of short-distance 10Gb/s transmission using copper-based links. The monolithic 3.0W TX and 5.5W RX chips are implemented in a 25GHz f T Si-bipolar process [2]. A data rate of 40% of f T is the highest reported architectural performance for a chipset at this level of integration. A simplified block diagram of the TX chip is shown in Figure 1. 16bit. Parallel data is provided to the chip along with a 625MHz clock. The on-chip PLL locks the internal 2.5GHz multi-phase VCO to the incoming clock using a wide-range phase/frequency detector. The input data is serialized in two stages and drives a high-speed output amplifer. The RX chip uses a multiphase 2.5GHz clock generator consisting of a 4-stage (8-phase) ring oscillator followed by phase-interpolators (Figure 2). The four main clock phases at 0º, 90º, 180º and 270º are used to sequentially latch the incoming data bits. An intermediate 45º phase and two non-critical phases at 22º and 67º are used to implement phase and data-lock detectors. The PLL and statemachine are otherwise identical to that previously described [3]. A conventional 8-phase 2.5GHz ring oscillator requires four 50ps delay stages, with each delay stage contributing 45º to the oscillation period (Figure 3a). Because it is not possible to achieve a 50ps variable-delay cell over process and temperature, a leapfrog VCO structure is implemented to accommodate larger nominal gate delays. In Figure 3b, the input of delay stage n is phase advanced by summing in a signal from the output of stage n-2. Each stage of a 2.5GHz oscillator contributes 67.5º (75ps) rather than the 45º (50ps) of a conventional circuit. Figure 6 shows the transmitter output eye with 2 31-1 PRBS pattern, along with the same signal after 21 feet of 0.190" coax. Under these conditions, with differential drive, measured link BER is better than 10-14. Measurement of the 4 output phases of the non-retimed mux shows a systematic skew <3 ps. This performance is due to careful layout and design of the precision 8-phase ring-oscillator and mux. The non-systematic TX jitter is 3.3ps rms and 18ps peak/peak. An enhanced thick-film hybrid is used for mounting both chips. Gold thick-film paste is printed across a 25mil thick alumina substrate and photolithographically etched back to achieve 0.002" lines and spaces. The hybrids are mounted in a custom 100-pin Kovar QFP package with ceramic sidewalls. The package achieved better than 20dB return loss up to 10GHz for the differential highspeed inputs and outputs. Figures 7 and 8 are micrographs of the 2.6x4.4mm 2 TX and 3.9x4.4mm 2 RX die. Both designs use full-custom layout for the critical multiphase-vco and f T -doubler circuitry. The lower-speed MUX, DEMUX, and state-machine logic are implemented in ECL gate-array cells for ease of design modifications. Acknowledgments: The authors thank J. Kerley and L. Dove for hybrid and package development, and J. Norman for microassembly of the TX chip prototypes. References: [1] Lotter, M., Internet Growth (1981-1991), RFC 1296, Stanford Research Institute, Jan., 1992. [2] Huang, W., et al., A high-speed bipolar technology featuring selfaligned single-poly base and submicrometer emitter contacts, IEEE Electron Device Letters., vol. 11, no. 9, pp. 412-414, Sept., 1990. [3] Walker, R., C. Stout, C-S. Yen, A 2.488 Gb/s Si-Bipolar Clock and Data Recovery IC with Robust Loss of Signal Detection, ISSCC Digest of Technical Papers, pp.246-247, Feb., 1997. [4] Battjes, Amplifier Circuit, U.S. Patent 3633120, assigned to Tektronix Corp., 1972. In the RX, a three-stage amplifier, shown in Figure 4, conditions incoming 10Gb/s data. Each stage is an f T -doubler circuit formed by two cross-coupled differential pairs (an architecture often used in small-signal linear amplifiers) [4]. The enhanced small-signal bandwidth of this circuit improves the jitter performance of the amplifier. A similar circuit is used in the TX output driver. A conventional multi-clock-phase 4-to-1 data multiplexer (MUX), shown in Figure 5a, experiences an unequal propagation delay at the output MXOUT from clocks applied to lower CLK00 and upper CLK90 switches. This effect produces a non-equal data width at the output. An improved version of a 4:1 data MUX is shown in Figure 5b, where f T -doubler circuits replace the lower switches. A primary 4:1 MUX core (Q1 to Q8) and a replica (Q11 to Q18) are connected so that CLK00 is applied, with proper level shift, at the lower and upper switches of the primary and replica cores. A similar connection is applied from CLK90. The different propagation delays from CLK00 and CLK90 to the output are now common to all four data channels. Combining the f T -doubler and the replica MUX, an equal data width, 4:1 10Gb/s MUX is achieved. Figure 1: Transmitter block diagram. 19.1-1

a b Figure 2: Receiver block diagram. Figure 3: (a) Conventional 4-stage 2.5 GHz oscillaltor requiring 50ps gate delays; b: "Leap frog" architecture using 75ps gates. Figure 4: 3 Stage f T -doubler input amplifier. Figure 5: Left circuit is conventional 4:1 multiplexer. Right circuit is improved non-retimed mux with f T -doubler core. 19.1-2

SA 19.1: A 10Gb/s Si-Bipolar TX/RX Chipset for Computer Data Transmission Figure 6: Top trace is transmitter output eye diagram. Bottom trace after 21 feet of 0.190" diameter coax. Figure 7: TX die micrograph. Figure 8: RX die micrograph. 19.1-3

Figure 1: Transmitter block diagram. 19.1-4

Figure 2: Receiver block diagram. 19.1-5

a b Figure 3: (a) Conventional 4-stage 2.5 GHz oscillaltor requiring 50ps gate delays; b: "Leap frog" architecture using 75ps gates. 19.1-6

Figure 4: 3 Stage f T -doubler input amplifier. 19.1-7

Figure 5: Left circuit is conventional 4:1 multiplexer. Right circuit is improved non-retimed mux with f T -doubler core. 19.1-8

Figure 6: Top trace is transmitter output eye diagram. Bottom trace after 21 feet of 0.190" diameter coax. 19.1-9

Figure 7: TX die micrograph. 19.1-10

Figure 8: RX die micrograph. 19.1-11