Download Paper

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Sampling Receive Equalizer
with Bit-Rate Flexible Operation up to 10 Gbit/s
Markus Grözing, Bernd Philipp, Matthias Neher, and Manfred Berroth
Institute of Electrical and Optical Communication Engineering
University of Stuttgart
Stuttgart, Germany
0
90 cm
- 18 dB
@ 5.0 GHz
-10
173 cm
S21 [dB]
Abstract—A receive equalizer IC implemented in 0.13 µm
standard CMOS technology is presented. The equalizer filter
works with sampled analog signals within a half-rate
architecture. Due to its discrete-time nature, the circuit
operates continuously on bit rates ranging from 0.5 Gbit/s to
10 Gbit/s. The equalizer core consists of a 3-tap finite-impulseresponse filter and a subsequent decision-feedback filter with
first and second post cursor feedback taps. Up to 24 dB
channel loss at the Nyquist frequency can be compensated. The
reception of a 231-1 PRBS binary data stream transmitted over
a 90 cm long trace on FR4 with 10 Gbit/s and over a 173 cm
long trace with 7 Gbit/s with a BER < 10-12 and receive-only
equalization is presented. The power consumption of the
equalizer core is 21 mW and the core area is 60 µm x 56 µm.
-20
- 19 dB
@ 3.0 GHz
-30
- 24 dB
@ 3.5 GHz
-40
-50
I.
0
INTRODUCTION
Baseband data transmission over PCB-backplanes, in
memory-processor interfaces and in fiber local area networks
reaches multi-Gbit/s rates per channel. But electrical
transmission over traces on standard PCB substrate, as well
as optical transmission over multimode fibers, suffers from
dispersion, losses and reflections. Simple binary transmitters
and receivers fail due to intersymbol interference (ISI) when
used beyond a certain transmission distance, which can be
observed at closing eye diagrams at the receiver.
Transmitter predistortion and receiver equalization are
methods to mitigate the channel impairments [1]. An
accepted method is a finite-impulse-response (FIR) filter in
the transmitter and a decision-feedback equalizer (DFE) in
the receiver [2]. Additional continuous time peaking
amplifiers at the front-end of the receiver are also used [3].
Disadvantages of predistortion are the required knowledge at
the transmitter side about the channel characteristics as well
as increased high-frequency signal power at the transmitter
that potentially couples to other channels. Equalizers for
optical systems often employ relatively long FIR-filters in
the receiver, mostly implemented with power- and areaintensive full-rate buffered LC-delay lines [4]. In contrast,
heavy 8-fold interleaving is used in the 4-tap discrete-time
FIR-only backplane receive equalizer presented in [5].
1-4244-0303-4/06/$20.00 ©2006 IEEE.
516
1
2
3
4
5
6
7
8
9
10
frequency [GHz]
Figure 1. Measured frequency response S21 of a 90 cm long and a
173 long 50 Ω transmission line on FR4 standard PCB substrate.
The equalizer presented in this work integrates a receiveside discrete-time 3-tap FIR-filter with a subsequent 2feedback-tap DFE filter. Basic track & hold circuits are used
as the FIR-filter delay elements. Neither area-consuming
LC- nor transmission line delays, nor any other continuoustime analog delays are necessary within this concept. The tap
delay is governed by the data-synchronous clock, making the
filter concept fully bit-rate flexible. Moderate two-fold
interleaving of the discrete-time signal processing is used to
enhance the maximum speed of the equalizer and to decrease
the length of the delay lines.
II.
CHANNEL RESPONSE & EQUALIZER CONCEPT
Balanced pairs of 50 Ω transmission line traces on
standard FR4 PCB substrate are used as test channels for the
equalizer. The frequency response of a 90 cm and a 173 cm
trace is shown in Figure 1. The loss of the 90 cm trace is
18 dB at the Nyquist frequency for 10 Gbit/s transmission,
19 dB for the 173 cm trace at 6 Gbit/s and 24 dB for the
173 cm trace at 7 Gbit/s.
relative response
S&H
0.4
0.3
T
T
T
T
Dout
0.2
0.1
CLK
0.0
-0.1
-5
-4
-3
-2
-1
precursor ISI
0.4
0
1
time [UI]
2
3
4
5
c-1
c1
c-2
c2
Figure 3. Filter structure of the sampling equaliser.
postcursor ISI
0.3
Dout1
c-2
c2
c-1
c1
c-1
c1
to output drivers
relative response
finite-impulse-response filter decision-feedback filter
Din
c-2
c2
Dout2
D
Din
0.2
0.1
CLK
2
0.0
-0.1
-5
-4
-3
-2
-1
0
1
time [UI]
2
3
4
5
T&H 1
Figure 2. Measured single symbol response of the
90 cm trace @ 10 Gbit/s (top) and the 173 cm trace @ 6 Gbit/s (bottom).
A. Filter design
The time-domain response of the 90 cm and 173 cm
channels to a single symbol with one unit interval (UI) length
and unity amplitude at 10 Gbit/s respectively 6 Gbit/s data
rate are shown in Figure 2. The sampled discrete-time
response is the starting point for the filter design. The
sampled response has two strong interferers preceding the
main peak – precursor ISI - and two strong interferers
following the main peak – postcursor ISI. Most of the
precursor ISI is effectively removed by a FIR filter with one
fixed and two variable feedforward paths and most of the
remaining postcursor ISI is removed by a DFE filter with a
variable first and second post cursor feedback path. The
resulting structure of the implemented sampling equalizer
filter is shown in Figure 3.
B. Half-rate signal processing
The unit interval delays necessary for the FIR-filter are
implemented with track & hold (T&H) circuits. In a full-rate
clocked concept, two cascaded T&Hs in a master-slave
configuration are necessary to implement a single unit
interval delay, with only 50% of the unit interval time
available for tracking. Thus, a chain of 6 cascaded T&Hs is
necessary for the FIR-filter, as the front-end sample & hold
circuit also incorporates two cascaded T&Hs. The
bandwidth of the T&Hs in the chain is limited by both the
sampling switch RC bandwidth at the input and the
bandwidth of the capacitively loaded output. As a result, the
overall cascaded bandwidth of a 6 stage T&H chain is too
small for 10 Gbit/s operation in 0.13 µm CMOS. Thus, a
half-rate filter structure as shown in Figure 4 is
implemented. Two interacting and interleaved FIR filters
work in push-pull manner, clocked by a differential half-rate
clock. The bandwidth advantage of the half-rate concept is
two-fold. Firstly, the unit interval delay is now implemented
with a single T&H, i.e. the chain length of T&Hs in the FIR
517
T&H 2
T&H 3
FIR filter
adder
DFE adder &
decision latch
D-Latch
Figure 4. Half rate implementation of the sampling equaliser core.
delay line is halved. Secondly, the whole unit interval is
now available for tracking. Thus, the cascaded bandwidth
problem is relaxed by a factor of four, and the problem of
accumulated noise and distortion throughout the chain is
relaxed by a factor of two. A disadvantage is the necessity
of matching between the two paths, but this issue can be
solved by symmetrical layout of the circuit. Compared to
the FIR-only receive equalizer presented in [5], the order of
interleaving is reduced drastically and is even smaller than
the order of the FIR-filter. Thus, substantial savings
concerning chip area and power consumption are possible
with the presented concept.
The half-rate concept is continued in the DFE part of the
equalizer [6]. The feedback signal addition is integrated into
the decision latches. The upper decision latch is in its
regenerative decision mode during the half-rate clock lowphase, and the up-building decision result is simultaneously
applied to the lower decision latch, which is in sense mode.
During the half-rate clock high-phase, the decision latch
modes are interchanged.
III.
CIRCUIT DESIGN
A. Track & hold circuit
The T&H circuit shown in Figure 5 is a combination of a
differential CMOS transfer gate with a unity gain buffer.
The hold capacitance is composed of the buffer input
capacitance and the hold-side transfer gate capacitance. The
clock for the n-channel transfer MOSFET is bootstrapped,
as the common mode level of the input voltage is only about
300 mV beneath the VDD level. To ensure the symmetry of
the clock paths for the n-channel and the p-channel transfer
MOSFET, the clock for the p-channel transfer transistor is
bootstrapped, too. Single-ended clock feedthrough is
minimized due to the complementary transfer gate topology.
Parametric amplification of the differential hold voltage
CLK/2n
_CLK/2p
VDD
VDD
Vout,D
I1+
I1-
I2+
Vout,D
I2-
Vin,D
Vin,D
I0
CLK/2
VSS
Figure 5. Track & hold circuit.
/2(I-2+I-1)
1
CLK/2
n/p
CLKin
V-1,D
V0,D
CLK/2
CLK/2n
CLK/2p
_CLK/2n
_CLK/2p
_CLK/2
÷2
BootStrap
CLK/2out
(V DD-VSS)/2
I0
VSS
V-2,D
V2,D
Figure 7. DFE decision latch with integrated adder.
Vout,D
I-1
VSS
V1,D
VDD
I-2
_CLK/2
I0
Figure 8. Clock generation circuitry.
Figure 6. Clocked FIR filter adder.
occurs when the transfer gate passes from track to hold
mode [7]. This effect is accepted, as it provides linear hold
voltage amplification and does not compromise the filter
action. The unity gain buffer is dimensioned for maximum
linearity by using relatively narrow differential pair
transistors and a relatively large load resistor voltage drop.
This dimensioning also optimizes the buffer speed.
B. FIR filter adder
The clocked FIR filter adder shown in Figure 6 is set up
by three differential transmission gates connected to
differential pair transconductors that add up their output
currents on a common pair of load resistors. As only the
magnitude and not the sign of the optimum FIR tap weights
changes with PCB trace length and data rate, the tap sign is
fixed by wiring. The tap weights maxima are set by the
dimensioning of the respective transconductor transistor
widths. The weights are adjusted by steering the currents I-1
and I-2 via current mirrors, that are fed by externally applied
control currents. Half the sum of the currents I-1 and I-2 is
injected into each output node by p-channel MOSFETs to
ensure a constant common mode voltage level at the output.
This is especially important for the input of the following
DFE adder and avoids degeneration of the differential pair
transconductors at large tap weights.
C. DFE decision latch with integrated adder
The adder for the decision feedback equalization is
integrated into a CML decision latch as shown in Figure 7.
The input voltage is applied to the clocked track
transconductor, and the decision is made by the clocked
cross-coupled latch transconductor. During tracking, the
feedback signals are added via adjustable differential
518
transconductors. A separate transconductor is attached for
plus and minus feedback sign. The respective sign is selected
by applying the DFE tap control currents to the respective
current mirror control input.
D. Clock generation circuitry
Six individual half-rate clock paths have to be provided
for the equalizer core. The DFE works with a
complementary pair of full swing CMOS clock signals. The
T&H circuits and the clocked FIR filter adder require
complementary pairs of bootstrapped near full swing signals
for both the n-channel and the p-channel transfer transistor.
The clock generation circuitry is shown in Figure 8. The
externally provided full rate clock signal is first amplified
and made differentially by an input clock driver chain. The
differential clock signal is then divided by two cascaded
CML-latches. The divided signal enters a tapered CML
driver chain and then two parallel tapered CMOS inverter
chains. The first CMOS inverter decision level is controlled
by a feedback loop to ensure 50% duty cycle at the clock
driver outputs. The complementary bootstrapped clock
signals are generated by bootstrap circuits that incorporate
capacitors and small clocked MOSFETs. The two bootstrap
common mode levels are preset in the circuit, but they may
be tuned by external control voltages.
IV.
EXPERIMENTAL RESULTS
The PCB trace inputs are fed by the differential data
signals of an Anritsu MP1763C pulse-pattern generator
(PPG). The PCB trace outputs and the PPG clock are fed to
the equalizer chip via on-wafer probe heads. The equalizer
half rate data and clock outputs are connected to an error
detector (ED) and to a 20 GHz bandwidth sampling scope.
x: 50 ps/div, y: 60 mV/div
x: 25 ps/div, y: 60 mV/div
Figure 11. Chip photograph with equalizer core magnified.
Chip area: 1400 µm x 600 µm. Equalizer core area: 60 µm x 56 µm
TABLE I.
supply voltage VDD
1.3 V
PDC equalizer core*
21 mW
maximum bit rate fmax 10 GBit/s PDC clock generator**
33 mW
minimum bit rate fmin 0.5 Gbit/s PDC total (with drivers) 200 mW
max. poss. channel loss for BER < 10-12, PRBS 231-1, 7 Gbit/s
24 dB
min. Vpp,PPG for BER < 10-11, PRBS 27-1, 10 Gbit/s, 90 cm FR4 300 mV
min. Vpp,PPG for BER < 10-11, PRBS 27-1, 6 Gbit/s, 173 cm FR4 230 mV
*circuitry corresponding to figure 4; **circuitry in dashed box of figure 8.
equal. off
-2
-4
PRBS
-4
31
-6
-8
PRBS
-10
2 -1
2 -1
7
log(BER)
log(BER)
x: 50 ps/div, y: 50 mV/div
x: 25 ps/div, y: 60 mV/div
173 cm FR4 trace @ 6 Gbit/s
90 cm FR4 trace @ 10 Gbit/s
Figure 9. PRBS 231-1 transmission with Vpp,PPG = 400mV.
Eye diagrams of PCB trace output (top),
of equalizer half-rate output with the equalization disabled (center),
and with the equalization enabled (bottom).
0
0
-2
-6
equalization of channels with about 18 to 19 dB loss is
300 mV at 10 Gbit/s and 230 mV at 6 Gbit/s.
V.
equal. off
PRBS
PRBS
-8
7
31
2 -1
2 -1
-10
-12
-0.5
0.0
0.5
sampling phase [UI]
-12
173 cm FR4 trace @ 6 Gbit/s
90 cm FR4 trace @ 10 Gbit/s
-0.5
EQUALISER PERFORMANCE SUMMARY
0.0
0.5
sampling phase [UI]
CONCLUSION
A 0.5 Gbit/s to 10 Gbit/s sampling receive equalizer is
presented. The combined FIR and DFE filter characteristic
can be frequency-shifted continuously due to the sampling
principle. A channel loss of up to 24 dB can be compensated
with an equaliser core power consumption of 21 mW plus
33 mW for clocking. The core chip area is 60 µm x 56 µm.
The topic of automatic adaption that is integrated into the
equalizer circuit has to be addressed in future work.
Figure 10. Measured BER versus equalizer sampling phase
with Vpp,PPG = 400mV and the equalizer coefficients held constant.
REFERENCES
Pseudo-random bit sequences (PRBS) of length 27-1 and
231-1 are used for bit error rate (BER) testing. The equalizer
coefficients are set manually by external current sources.
Measured eye diagrams at the PCB trace output are
shown in Figure 9 (top). The equalizer output eyes with the
equalization coefficients set to zero (center) and to the
optimum values (bottom) are shown beneath. As the
equalizer filter is clocked and as the DFE latches and the
output driver chains offer a lot of gain, an eye impression is
present even with the equalization disabled, but many
decisions are false or remain metastable. With the
equalization enabled, the eyes become clearly open. The
BER performance versus the equalizer sampling phase
(varied with the PPG clock delay feature) is shown in
Figure 10. A virtual eye appears, when the equalization is
enabled. Table I summarizes the equalizer performance. A
maximum loss of 24 dB at the Nyquist frequency can be
compensated by the equalizer at 7 Gbit/s. The minimum
required voltage swing at the PCB trace inputs for
519
[1]
[2]
[3]
[4]
[5]
[6]
[7]
J. Liu, X. Lin, “Equalization in high-speed communication systems,”
IEEE Circuits and Systems Magazine, vol. 4, no. 2, pp. 4-17, April
2004.
R. Payne et al, “A 6.25-Gb/s binary transceiver in 0.13-µm CMOS for
serial data transmission across high loss legacy backplane channels,”
IEEE JSSC, vol. 40, no. 12, pp. 2646-2657, December 2005.
T. Beukema et al., “A 6.4-Gb/s CMOS SerDes core with feedforward and decision-feedback equalization,” IEEE JSSC, vol. 40, no.
12, pp. 2646-2657, December 2005.
S. Reynolds, P. Pepeljugoski, J. Schaub, J. Tierno, D. Beisser, “A 7tap transverse analog-FIR filter in 0.13µm CMOS for equalization of
10Gb/s fiber-optic data systems,” ISSCC 2005, pp. 330-331, February
2005.
J. E. Jaussi et al., “8-Gb/s source-synchronous I/O link with adaptive
receiver equalization, offset cancellation, and clock de-skew,” IEEE
JSSC, vol. 40 no. 1, pp. 80-88, January 2005.
Y.-S. Sohn, S.-J. Bae, H.-J. Park, S.-I. Cho, “A 1.2 Gbps CMOS DFE
receiver with the extended sampling time window for application to
the SSTL channel,” 2002 Symp. on VLSI Circuits, pp. 92-93,
February 2005.
S. Ranganathan, Y. Tsividis, “Discrete-time parametric amplification
based on a three-terminal MOS varactor: analysis and experimental
results,” IEEE JSSC, vol. 38 no. 12, pp. 2087-2093, December 2003.