Download ppt

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Digital System Clocking:
High-Performance and Low-Power Aspects
Vojin G. Oklobdzija, Vladimir M. Stojanovic, Dejan M. Markovic, Nikola M. Nedovic
Chapter 3: Timing and Energy Parameters
Wiley-Interscience and IEEE Press, January 2003
Basic timing diagram in flip-flops
D
Clk
Q
tCQ,LH
tCQ,HL
U H
t<U
Setup time violation
Definitions:
• Clock-to-Q Delay:
tCQ
low-to high= tCQ,LH high-to-low= tCQ,HL
• Setup time U
• Hold time H
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
2
Setup and hold time behavior as a function of
clock-to-output delay
350
300
Clk-Output
[ps]
250
200
Setup
Hold
150
100
50
-200
-150
-100
-50
00
50
100
150
200
Data-Clk [ps]
Neither Setup nor Hold time are fixed constant parameters.
They are function of Data-to-Clock time “distance”.
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
3
Data to Output Delay
Setup time behavior as a function of data-to-output delay
Constant Clk-Q
Region
Failure
Region
Variable Clk-Q
Region
D-Q
Clk-Q
45o
Uopt
DDQm
Data arrives early
Data to Clock Delay
Data arrives late
When D-to-Q delay is observed, we can see that data can come closer
to the triggering event than we thought.
Data-to-Q is the RELEVANT parameter – NOT Clock-to-Q as many think !
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
4
D-Q and Clk-Q delay as a function of D-Clk offset
Determining the optimal setup Uopt and hold time Hopt.
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
5
Setup time, hold time, sampling window and clock
width in a flip-flop
D
Q
FF
Sampling
Window
Clk
U H
Clk
tCW
D
Q
Nov. 14, 2003
tCQ
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
6
Latch: setup and hold time
D
Q
L
Clk
U
Unlike Flip-Flop, Latch
has two situations:
(a) Data is ready – waiting
for the clock
(b)Latch is open – data
arrives during the
active clock signal
Clk
D1
Q1
H
tCW
tCQ
(a)
D2
tDQ
Q2
(b)
(a) early data D1 arrival; (b) late data D2 arrival
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
7
Illustration of a data path
U H
D
Q
A
Q
Logic
tLogic
D
Q
B
Q
Clk
tClk-Q
Nov. 14, 2003
tskew
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
8
Late Data Arrival and Time Borrowing in a
pipelined design
D
Q
Logic
D
Q
Source
Clk
Q
Q
Destination
TCR1
TCR2
Clk
dw1
Cycle-1
dw2 Cycle-2
dw1>dw3>dw2
dw3
TCR1> TCR2
Data-to-Output (Q) time window moves around the time axis;
(becoming larger or smaller)
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
9
Early Data Arrival and Internal Race Imunity
• The maximum clock skew that system can
tolerate is determined by the clock
storage elements:
– If Clk-Q delay of CSE is shorter than H, race
can occur if there is no logic inbetween.
– If Clk-Q is greater than H + possible skew – no
problem:
tClk-Q> H+ tskew
• Internal race immunity:
R = tClk-Q-H
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
10
tClk-Q [ns]
Impact of supply voltage on the sampling
window
2.0
1.5
1.0
0.5
2.5
2.2
V DD
1.9
]
[V
1.6
300
100
1.3
-100
1 -500
-300
D-Clk [ps]
Sampling window determines the minimum required duration of data signal
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
11
Energy Parameters
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
12
Components of Energy Consumption
•
•
•
•
Switching Energy
Short-Circuit Energy
Leakage Energy
Static Energy
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
13
Components of Energy Consumption
t T
E   VDD  iVDD ( )  d
t
Energy consumed by CSE during one
clock period T, where t is chosen to
include all relevant transitions: arrival
of new data, clock pulse, and output
transition.
This energy has four components:
E  Eswitching  Eshortcircuit  Eleakage  Estatic
Switching Energy:
N
Eswitching   01 (i )  Ci  Vswing (i )  VDD
i 1
•
•
•
•
Nov. 14, 2003
N is the number of nodes
Ci is the capacitance of the node I
0-1(i) is the probability that a transition occurs at the
node i
Vswing(i) is voltage swing of the node I
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
14
Short-circuit current in an inverter
CL
Vin
Vout
iL
iL
ishort-circ
(a)
Vout
ishort-circ ipull-down
ipull-up
Vin
CL
(b)
(a) pull-up; (b) pull-down operation
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
15
Projected leakage currents
Leakage power will soon become a significant portion of the total
power consumption in modern microprocessors
Assuming doubling of transistors / generation the leakage current will
increase about 7.5 times corresponding to a 5 times increase in total
leakage power
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
16
Where does the Energy go in CSE ?
1. Internal clocked nodes in
storage elements
2. Internal non-clocked nodes in
storage elements
3. Data and clock input load
4. Output load.
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
17
Energy Breakdown
1. Internal clocking energy
2. Data and Clock Input Energy
3. Energy in Internal Non-clocked
Nodes
4. Energy in Output Load
5. Energy per Transition
6. Glitching Energy
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
18
Energy Breakdown in Clocked-Storage
Elements during one of the possible input
data transitions
EClk
Eint
Eext
E0-0
E0-1
E1-0
E1-1
Y/N
Y/N
N
Y
Y
Y/N
Y
Y
Y/N
Y/N
Y/N
N
Two cases:
• Storage elements without pre-charge nodes
• Storage elements with pre-charge nodes
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
19
CSE characterization and Test setup
VDD-CSE
VDD-D
D
Dt
Clk
Clkt
• Clock Energy
D
Q
Q
CL
VDD-Clk
• Internal Energy: Eint
E01  E10

 EClk  ELoad
2
• Energy in Output Load
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
20
Energy per transition
• The energy-per-transition is the total energy consumed in a CSE during
one clock cycle for a specified input data transition: 0-0, 0-1, 1-0, or 1-1
• This metric is crucial in that it yields significant insight about circuit
energy
• By inspection of the node activity in a CSE for different input data
transitions, the energy-per-transition can be utilized to obtain the
energy breakdown between clocked nodes, internal nodes, and the
external output load.
• This forms a good basis for the study of alternative circuit techniques
that deal with internal clock gating.
• The energy breakdown information also offers valuable information
about the tradeoffs associated with reduced clocking energy and the
energy penalty incurred by the clock-gating logic, thus providing a
better understanding of the optimization goals for the overall design
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
21
Energy per transition
pE
pE
0
1
pE
pE
Eaverage  p00  E00  p01  E01  p10  E10  p11  E11
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
22
Glitching Energy in CSEs
Clk
D
init. int.
nodes
(D = 0)
1
init. int.
nodes
(D = 1)
2
pos. glitch pos. glitch
(Clk = 0) (Clk = 1)
3
4
neg. glitch neg. glitch
(Clk = 0) (Clk = 1)
• Glitches are generated by the unintended transitions propagating
from the fan-in gates, termed propagating glitches.
• Glitches produced by non-glitch transitions at the inputs, called
generated glitches
4
Eavg  glitch    i  E gi
i 1
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
23
Interface with Clock Network and
Combinational Logic
• We assumed that the data and clock inputs
were supplied by drivers with sufficient drive
strength.
• The input clock and data capacitances are
important interface parameters for the clock
network and logic design.
• The clock network designer and logic designer
need to be aware of these capacitances in
order to design circuits that drive storage
elements.
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
24
Interface with Clock Network and
Combinational Logic
Interface with Combinational Logic:
The relevant parameters to the combinational
logic designer are:
• CSE input data slope
• Input data capacitance
The data slope affects performance and
energy consumption of both driving logic and
storage elements.
Clock and data slopes are generally not equal.
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
25
Interface with Clock Network and
Combinational Logic
Interface with Clock Network:
• CSEs are affected by clock skew and clock slope.
• The total load of the clock distribution network is
defined by the input capacitance of the clock node and
number of CSEs on a chip.
• Increase in clock slope results in degradation of the
CSE performance - the clock network designer has to
know what slopes CSE can tolerate.
• This is especially important if Flip-Flops are used.
• The clock slope also affects energy consumption of the
clock distribution network.
– If larger clock drivers with smaller fanout are used, the clock
edges are sharper and the storage element performance better,
at the expense of an increase in energy consumption of the
clock network.
– Optimal tradeoff is achieved with minimal energy consumption
that delivers the desired storage element performance.
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
26
Interface with Clock Network and
Combinational Logic
• To evaluate the total clocking energy per clock cycle in
the entire clock subsystem, one needs to add the energy
consumed in the clock distribution network.
• The energy consumed in the clock distribution network
depends on the total switched capacitance which is
determined by the total number of clocked storage
elements on a chip and the input capacitance of their
clock inputs, the total wiring capacitance, and the total
switched capacitance of clock drivers as given by:
Cdistribnet  NFF  CinClk , FF  Cwire  Cswbuff
The last two terms depend on buffer insertion/placement
strategy and should be minimized.
Nov. 14, 2003
Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic
27
Related documents