Download this PDF file - Technology and Research Publications

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Power over Ethernet wikipedia , lookup

Variable-frequency drive wikipedia , lookup

Power inverter wikipedia , lookup

Electric power system wikipedia , lookup

Power engineering wikipedia , lookup

Buck converter wikipedia , lookup

Audio power wikipedia , lookup

Spectral density wikipedia , lookup

Transmission line loudspeaker wikipedia , lookup

Flip-flop (electronics) wikipedia , lookup

Mains electricity wikipedia , lookup

Bode plot wikipedia , lookup

Electronic engineering wikipedia , lookup

Amtrak's 25 Hz traction power system wikipedia , lookup

Integrated circuit wikipedia , lookup

Power dividers and directional couplers wikipedia , lookup

Regenerative circuit wikipedia , lookup

Islanding wikipedia , lookup

Chirp spectrum wikipedia , lookup

Three-phase electric power wikipedia , lookup

Opto-isolator wikipedia , lookup

Alternating current wikipedia , lookup

Pulse-width modulation wikipedia , lookup

Atomic clock wikipedia , lookup

Utility frequency wikipedia , lookup

Time-to-digital converter wikipedia , lookup

Switched-mode power supply wikipedia , lookup

Power electronics wikipedia , lookup

Heterodyne wikipedia , lookup

CMOS wikipedia , lookup

Phase-locked loop wikipedia , lookup

Transcript
Proposed Architecture for Low Power and Area
Efficient Edge Combiner
Sohaib A. Qazi1, Syed Asmat Ali Shah, Saad Arslan
CAST Department, COMSATS Institute of Information Technology Islamabad, Pakistan
[email protected], phone:+92 345 507 6730
1
Abstract— This paper proposes an architecture of edge combiner
circuit that consumes very low power as compared to a
conventional XOR edge multiplier design. The proposed edge
combiner's architecture is well suited for frequency multiplier
applications and it is tested for input frequencies of up to
250MHz. The design is implemented on 65nm complementary
metal oxide semiconductor (CMOS) process. Simulation results
show that the total power consumption of proposed design is 7.68
times less than the total power of conventional XOR edge
combiner. The switching power of proposed design is only 8% of
the total power which was 25% in conventional edge combiner.
Area of the edge combiner is 7.8nm i.e. 3.33 times less than the
XOR edge combiner.
Index Terms— Edge Combiner, Delay Locked Loop (DLL),
Frequency Multiplier, Phase Detector, Complementary Metal Oxide
Semiconductor (CMOS).
I.
INTRODUCTION
Advancement in the field of digital design and mixed signal
systems has resulted in increased bandwidth requirements for
digital and very large scale integration (VLSI) equipment. High
bandwidth operations also require relatively high frequency.
The efficient and low jitter clock generators play an important
role in high speed VLSI, digital and analog applications. Onchip clock generator circuits are very commonly used now-adays as they provide in-phase on-board clock signals which are
more effective than off-board clocks.
Historically, Phase Locked Loops (PLLs) were used for
high performance in high speed microprocessors. The biggest
hurdles that PLLs are facing these days are complexity and
cost. The PLLs use voltage controlled oscillator which makes
PLL a higher order system. The higher order systems are hard
to design for low power and area efficient applications [1].
Digital phase locked loops (DLLs) emerged as a viable
alternative to traditional PLLs [2] [3]. The DLLs have now
been adopted widely as they have considerable advantages over
conventional PLLs. The DLLs are first order systems and are
easier to design. The first order systems are preferred as theses
are stable systems [4]. Moreover, DLL has a limited locking
range as compared to PLLs [3].
Research in the area has proposed many improvements in
DLLs, but there is still room for improvement. C. Kim, et.al
has proposed a phase detector with reset feature added and a
new frequency multiplication technique in [4]. The frequency
multiplier proposed in [4] has timing issues in switching of
transistors, i.e. the transistors being in cutoff region or in linear
region. The transistors are not turned on (linear region)
simultaneously as one NMOS transistor is in direct contact
with the input signal and the other has a delay of three gate
cycles, so the data transfer through the transistors will be
corrupted in some cases. A frequency multiplier for Systemon-Chip (SoC) applications is proposed by K. H. Cheng et.al in
[5]. As the design proposed in [5] avoids the use of LC tanks so
it is claimed that the design consumes less area and power.
However, the edge combiner circuit design used in [5]
consumes considerable area because of the buffers, SR latches
and OR gates that are used in the design. A research conducted
at university of California by G. Chien and P. R. Gray has
proposed a DLL based oscillator for personal communication
systems [6]. LC tanks are used at the outputs, to enhance the
output impedance, consuming large area; LC tanks are not a
viable solution for small area and low power applications. P. C.
Huang et.al [7] has proposed a phase error calibration DLL
with edge combiner. The edge combiner proposed in [7] is
based on symmetric AND gates so it consumes more area than
the architecture proposed in this article. The design proposed in
this article is optimized for area and power.
The article is divided in following sections. The next
section, i.e. second section describes the conventional DLL
multiplier. The third section describes the proposed
architecture. Section four is composed of the simulation results,
comparison of conventional and proposed edge combiner and
the final section i.e. section five concludes the discussion on
the proposed architecture.
Figure 1: System diagram of DLL based Frequency Multiplier
Figure 2: Conventional Edge Combiner Circuit that is Implemented using
XOR Gates
Sohaib A. Qazi|Introduction
45
II.
CONVENTIONAL DLL MULTIPLIER
The increase in frequency of operation has also emphasized on
the importance of circuits that are helpful in providing reduced
skew and jitter. In analog, digital and mixed signal circuits
where delay is an important factor to be considered on receiver
end, skew plays an important role. Skew and jitter must be
minimized to achieve desired results. The Digital Locked Loop
(DLL) based frequency multiplier shown in Figure 1 is an
applicable solution to provide on chip reduced jitter and skew
clocks for the analog, digital and mixed signal circuits. As the
name suggests, this DLL based frequency multiplier is also
helpful in providing in-phase clocks of different frequencies to
the same circuit. DLL frequency multiplier can reduce skew
and jitter problems quite significantly. Also, the circuit can be
used for frequency multiplication besides reduction in skew
and jitter.
Phase Detector (PD), shown in Figure 1, detects the
difference in phase of reference clock and the Voltage
Controlled Delay Line’s (VCDL) output. The output of PD (up
and down signals) indicates whether the phase is leading or
lagging. The charge pump is controlled by the phase detector’s
output. The amount of current produced by the charge pump
depends upon the outputs of PD. The VCDL consists of
differential voltage-controlled inverter stages. When the phase
of reference input signal and the output of delay line is locked,
the VCDL generates equally spaced clock phases within one
clock period. The edge combiner of Figure 1 generates the
multiplied version of the reference clock.
multiplied frequency depends upon the phase difference of the
input clocks [8].
The conventional edge combiner is based on XOR gates.
The multiplication factor of four is achieved by eight different
phase signals. All these phase signals are combined in such a
way that it ends up in multiplying the phase signals four times
the original reference clock [9].
In order to achieve the multiplication factor of M, we need
2 × M different phase clocks from the delay line shown in
Figure 1. The edge combiner is needed to combine these
phases in such a way that we get a frequency M times the
1
reference frequency. Each stage is delayed in phase by
2×𝑀
times the input signal. Here M is the number with which the
reference signal will be multiplied, resulting in a delay of 0.125
time units of the input signal when M is set to four. This
architecture requires 6 XOR gates and a single 2-input OR
gate. The schematic design of an XOR gate is shown in Figure
3. In architecture of Figure 3, 16 PMOS and NMOS transistors
are used for implementation of a single XOR gate. This is quite
high number of transistors for designing an XOR gate as
compared to the design proposed in this article. Similarly OR
gate, shown in Figure 4, also consumes 6 PMOS and NMOS
gates. So an XOR edge multiplier design consumes 51 NMOS
and 51 PMOS gates. Since the edge combiner operates with a
large number of transistors, so the area and power consumption
is quite high.
Figure 5: Relation Between Reference Signal and Delayed Input Signals of
Edge Combiner
Figure 3: Transisitor Schematic of a Single XOR Gate Showing all
Transistors Used.
Figure 6: Output Clock is Compared with Reference Input Clock
III.
Figure 4: Transistor Level Schematic of OR Gate
Edge combiner is a vital part of the DLL frequency
multiplier as it is used to combine inputs of different phase
clocks and generate an output with higher frequency. The
PROPOSED EDGE COMBINER
The proposed architecture only involves four AND gates
and a single OR gate. To achieve a multiplication factor of
four, the VCDL of Figure 1 should produce eight different
phase signals, because the output depends on Equation 1.
𝑂𝑢𝑡𝑝𝑢𝑡𝑐𝑙𝑘 = 𝑅𝑒𝑓𝑒𝑟𝑒𝑛𝑐𝑒𝑐𝑙𝑘 ×
𝑁
(1)
2
Here ’N’ represents number of phases that are required to
generate desired output. According to Equation 1, eight clocks
Sohaib A. Qazi|Conventional DLL Multiplier
46
are required to achieve four times the input reference
frequency. Figure 5 shows the behavior of input reference
clock signal and the delayed phase clocks that are required to
generate output signal. These eight delayed clock signals must
occur within one clock cycle of the reference clock as shown in
Figure 5.
TABLE I.
POWER AND AREA COMPARISON OF AND & XOR BASED
EDGE MULTIPLIER
Area
(nm)
7.8
These 8 delayed clocks should be manipulated in such a
𝑇 𝑇
3𝑇
way that it results in 4 rising edges at 0, , and
of the
4
2
4
𝑇 3𝑇 5𝑇
original reference clock and the falling edges occur at ,
and
7𝑇
8
8
8
,
8
, respectively, as shown in Figure 6.
This paper proposes an architecture that produces a
behavior that was described in Figure 6. The architecture
required for producing the behavior shown in Figure 6 is
described in Figure 7. The proposed architecture includes four
AND gates and a single OR gate. This architecture requires
less number of Metal Oxide (MOS) gates as compared to the
architecture presented in [8]. The only problem with this
proposed architecture is its sensitivity towards the power
supply. This issue can be resolved by using linear regulators,
active and passive filters and differential circuits [10]. Area and
power consumption is a major concern when working to reduce
the power supply interference. The architectures must be
chosen accordingly to reduce area and power.
Power
(µW)
XOR Edge
Multiplier
26
Cell Internal Power
1.432
8.991
Net Switching Power
0.123
2.947
Total Internal Power
1.555
11.938
𝑇
To achieve the rising edges at prescribed location, i.e. 0, ,
𝑇 3𝑇
4
𝑇 3𝑇 5𝑇 7𝑇
,
and falling edges at , , , , the delayed signals are
2 4
8 8
8 8
fed to the AND gates in a specific order.
The order of the signals fed to the AND gates are shown in
Figure 7. The combinations used for AND gates’ input are
first/sixth phase, third/eighth phase, second/fifth and
fourth/seventh phase. Finally, the OR gate combines the output
pulses of AND gates and produces an output as shown in
Figure 8. The AND gates produce rising and falling edges at
𝑇
𝑇 3𝑇
𝑇 5𝑇
3𝑇 7𝑇
(0, ), ( ), ), ( , ) and ( , ).
8
4
8
2
8
IV.
Figure 7: Proposed Edge Combiner Circuit is Implemented using AND Gates
AND Edge
Multiplier
Power
4
8
SIMULATION RESULTS
The simulations are carried out in ModelSim, a tool by
Mentor Graphics. The reference clock frequency used was
250MHz and the output frequency obtained from edge
multiplier was 1GHz. The simulation results are shown in
figure 9. The figure 9 indicates the input clock of 40ns which
corresponds to 250MHz frequency. The functionality of both
architectures is same which is indicated by signals
"f_out_and" and "f_out_xor" in Figure 9. The frequency of
output signals is 1GHz which is four times the input frequency
of the edge combiner. The proposed edge multiplier is
implemented using 65nm CMOS technology. Power and area
consumption is estimated using Design Vision, a tool from
synopsys. The supply voltage for the design is 1 Volt.
Figure 8: Multiplication Principle for Edge Multiplier
Sohaib A. Qazi|/
47
Figure 9: Comparison of Conventional and Proposed Edge Combiner Results in ModelSim
The total power consumption of the design proposed is
1.5551μW (reduced by 7.68% of the power that was consumed
by the conventional XOR multiplier). The power consumption
of the design is estimated using the design compiler from
Synopsys.
The total power consumed in design is shown in Table I.
Switching power is another main concern for the designers to
look for. The design proposed in this article reduces the
switching power quite significantly. The conventional edge
multiplier’s switching power consumption is about 25% of the
total power, but the design proposed in this article consumes
only 8% of the total power for switching purpose.
The edge multiplier consumes only 7.8 units which is
about 3.33 times less than the XOR edge multiplier. The
comparison of area and power consumption is shown in Table
I.
V.
CONCLUSION
The proposed architecture is well suited for low power
applications as it consumes 7.68% less power than the
conventional XOR edge combiner. The edge combiner also
occupies very less area that is 3.33 times less than the
conventional edge combiner.
In future we are also looking forward to integrate this
architecture in a DLL frequency multiplier.
ACKNOWLEDGMENT
The authors would like to thank Mr. M. Touqeer Pasha at
Linkoping University, who provided assistance for edge
combiner.
REFERENCES
V. R. von Kaenel, “A high speed low power clock generator for a
microprocessor application”, in IEEE Journal of Solid-State Circuits,
vol. 33, pp. 1634-1639, Nov. 1998.
[2] R. B. Staszewski, J. L. Wallberg, S. Rezeq, C. M. Hung, O. E. Eliezer,
S. K. Vemulapalli, C. Fernando, K. Maggio, R. Staszewski, N. Barton,
M. C. Lee, P. Cruise, M. Entezari, K. Muhammad and D. Leipold, “Alldigital PLL and transmitter for mobile phone”, in IEEE Journal of SolidState Circuits, vol. 40, pp. 2469-2482, Dec. 2005.
[3] B. Razavi, “Phase-Locking in High-Performance Systems”, Wiley-IEEE
Press, pp. 13-22, ed 1, Mar. 2003.
[4] C. Kim, I. C. Hwang and S. M. (Steve) Kang, “A Low power SmallArea 7.28-ps-Jitter 1-GHz DLL-based Clock Generator”, IEEE
International Solid State Circuits (ISSCC) Conference, vol. 1, pp. 142453, 2002.
[5] K. H. Cheng, S. M. Chung, Y. L. Lo and S. Y. Jiang, “A 2.2 GHz
Programmable DLL based Frequency Multiplier for SOC Applications”,
in Proceedings of IEEE Asia-Pacific Conference on Advanced System
Integrated Circuits, pp. 72-75, 4-5 Aug. 2004.
[6] G. Chien and P. R. Gray, “A 900 MHz Local Oscillator using a DLL
based Frequency Multiplier Technique for PCS Applications”, IEEE
International Solid-State Circuits Conference (ISSCC), pp. 1996-1999,
Dec. 2000.
[7] P. C. Huang, C. J. Shih, Y. C. Tsai and K. H. Cheng, “A Phase Error
Calibration DLL with Edge Combiner for Wide-Range Operation”, 9th
International IEEE New Circuits and Systems Conference (NEWCAS),
pp. 1-4, 26-29 Jun., 2011.
[8] Y. H. Tu, H. H. Chang, C. L. Hung, K. H. Cheng, “A 3 GHz DLL-based
clock generator with stuck locking protection”, 17th IEEE International
Conference on Electronics Circuits and Systems, pp. 106-109, 12-15
Dec. 2010.
[9] S. A. Qazi, A. A. Shah, S. M. Asghar, R. Ali, M. A. Khan, “DLL Based
Frequency Multiplier with reduced Chip Area and Power Consumption,
International Journal of VLSI & Signal Processing Applications, vol. 2,
pp. 275-283, Jun 2012.
[10] Silicon Labs, “Power Supply Rejection for Low-Jitter Clocks”, Texas,
USA.
[1]
Sohaib A. Qazi|Conclusion
48