Survey
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
Digital System Clocking: High-Performance and Low-Power Aspects Vojin G. Oklobdzija, Vladimir M. Stojanovic, Dejan M. Markovic, Nikola M. Nedovic Chapter 3: Timing and Energy Parameters Wiley-Interscience and IEEE Press, January 2003 Basic timing diagram in flip-flops D Clk Q tCQ,LH tCQ,HL U H t<U Setup time violation Definitions: • Clock-to-Q Delay: tCQ low-to high= tCQ,LH high-to-low= tCQ,HL • Setup time U • Hold time H Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 2 Setup and hold time behavior as a function of clock-to-output delay 350 300 Clk-Output [ps] 250 200 Setup Hold 150 100 50 -200 -150 -100 -50 00 50 100 150 200 Data-Clk [ps] Neither Setup nor Hold time are fixed constant parameters. They are function of Data-to-Clock time “distance”. Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 3 Data to Output Delay Setup time behavior as a function of data-to-output delay Constant Clk-Q Region Failure Region Variable Clk-Q Region D-Q Clk-Q 45o Uopt DDQm Data arrives early Data to Clock Delay Data arrives late When D-to-Q delay is observed, we can see that data can come closer to the triggering event than we thought. Data-to-Q is the RELEVANT parameter – NOT Clock-to-Q as many think ! Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 4 D-Q and Clk-Q delay as a function of D-Clk offset Determining the optimal setup Uopt and hold time Hopt. Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 5 Setup time, hold time, sampling window and clock width in a flip-flop D Q FF Sampling Window Clk U H Clk tCW D Q Nov. 14, 2003 tCQ Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 6 Latch: setup and hold time D Q L Clk U Unlike Flip-Flop, Latch has two situations: (a) Data is ready – waiting for the clock (b)Latch is open – data arrives during the active clock signal Clk D1 Q1 H tCW tCQ (a) D2 tDQ Q2 (b) (a) early data D1 arrival; (b) late data D2 arrival Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 7 Illustration of a data path U H D Q A Q Logic tLogic D Q B Q Clk tClk-Q Nov. 14, 2003 tskew Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 8 Late Data Arrival and Time Borrowing in a pipelined design D Q Logic D Q Source Clk Q Q Destination TCR1 TCR2 Clk dw1 Cycle-1 dw2 Cycle-2 dw1>dw3>dw2 dw3 TCR1> TCR2 Data-to-Output (Q) time window moves around the time axis; (becoming larger or smaller) Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 9 Early Data Arrival and Internal Race Imunity • The maximum clock skew that system can tolerate is determined by the clock storage elements: – If Clk-Q delay of CSE is shorter than H, race can occur if there is no logic inbetween. – If Clk-Q is greater than H + possible skew – no problem: tClk-Q> H+ tskew • Internal race immunity: R = tClk-Q-H Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 10 tClk-Q [ns] Impact of supply voltage on the sampling window 2.0 1.5 1.0 0.5 2.5 2.2 V DD 1.9 ] [V 1.6 300 100 1.3 -100 1 -500 -300 D-Clk [ps] Sampling window determines the minimum required duration of data signal Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 11 Energy Parameters Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 12 Components of Energy Consumption • • • • Switching Energy Short-Circuit Energy Leakage Energy Static Energy Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 13 Components of Energy Consumption t T E VDD iVDD ( ) d t Energy consumed by CSE during one clock period T, where t is chosen to include all relevant transitions: arrival of new data, clock pulse, and output transition. This energy has four components: E Eswitching Eshortcircuit Eleakage Estatic Switching Energy: N Eswitching 01 (i ) Ci Vswing (i ) VDD i 1 • • • • Nov. 14, 2003 N is the number of nodes Ci is the capacitance of the node I 0-1(i) is the probability that a transition occurs at the node i Vswing(i) is voltage swing of the node I Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 14 Short-circuit current in an inverter CL Vin Vout iL iL ishort-circ (a) Vout ishort-circ ipull-down ipull-up Vin CL (b) (a) pull-up; (b) pull-down operation Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 15 Projected leakage currents Leakage power will soon become a significant portion of the total power consumption in modern microprocessors Assuming doubling of transistors / generation the leakage current will increase about 7.5 times corresponding to a 5 times increase in total leakage power Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 16 Where does the Energy go in CSE ? 1. Internal clocked nodes in storage elements 2. Internal non-clocked nodes in storage elements 3. Data and clock input load 4. Output load. Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 17 Energy Breakdown 1. Internal clocking energy 2. Data and Clock Input Energy 3. Energy in Internal Non-clocked Nodes 4. Energy in Output Load 5. Energy per Transition 6. Glitching Energy Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 18 Energy Breakdown in Clocked-Storage Elements during one of the possible input data transitions EClk Eint Eext E0-0 E0-1 E1-0 E1-1 Y/N Y/N N Y Y Y/N Y Y Y/N Y/N Y/N N Two cases: • Storage elements without pre-charge nodes • Storage elements with pre-charge nodes Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 19 CSE characterization and Test setup VDD-CSE VDD-D D Dt Clk Clkt • Clock Energy D Q Q CL VDD-Clk • Internal Energy: Eint E01 E10 EClk ELoad 2 • Energy in Output Load Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 20 Energy per transition • The energy-per-transition is the total energy consumed in a CSE during one clock cycle for a specified input data transition: 0-0, 0-1, 1-0, or 1-1 • This metric is crucial in that it yields significant insight about circuit energy • By inspection of the node activity in a CSE for different input data transitions, the energy-per-transition can be utilized to obtain the energy breakdown between clocked nodes, internal nodes, and the external output load. • This forms a good basis for the study of alternative circuit techniques that deal with internal clock gating. • The energy breakdown information also offers valuable information about the tradeoffs associated with reduced clocking energy and the energy penalty incurred by the clock-gating logic, thus providing a better understanding of the optimization goals for the overall design Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 21 Energy per transition pE pE 0 1 pE pE Eaverage p00 E00 p01 E01 p10 E10 p11 E11 Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 22 Glitching Energy in CSEs Clk D init. int. nodes (D = 0) 1 init. int. nodes (D = 1) 2 pos. glitch pos. glitch (Clk = 0) (Clk = 1) 3 4 neg. glitch neg. glitch (Clk = 0) (Clk = 1) • Glitches are generated by the unintended transitions propagating from the fan-in gates, termed propagating glitches. • Glitches produced by non-glitch transitions at the inputs, called generated glitches 4 Eavg glitch i E gi i 1 Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 23 Interface with Clock Network and Combinational Logic • We assumed that the data and clock inputs were supplied by drivers with sufficient drive strength. • The input clock and data capacitances are important interface parameters for the clock network and logic design. • The clock network designer and logic designer need to be aware of these capacitances in order to design circuits that drive storage elements. Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 24 Interface with Clock Network and Combinational Logic Interface with Combinational Logic: The relevant parameters to the combinational logic designer are: • CSE input data slope • Input data capacitance The data slope affects performance and energy consumption of both driving logic and storage elements. Clock and data slopes are generally not equal. Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 25 Interface with Clock Network and Combinational Logic Interface with Clock Network: • CSEs are affected by clock skew and clock slope. • The total load of the clock distribution network is defined by the input capacitance of the clock node and number of CSEs on a chip. • Increase in clock slope results in degradation of the CSE performance - the clock network designer has to know what slopes CSE can tolerate. • This is especially important if Flip-Flops are used. • The clock slope also affects energy consumption of the clock distribution network. – If larger clock drivers with smaller fanout are used, the clock edges are sharper and the storage element performance better, at the expense of an increase in energy consumption of the clock network. – Optimal tradeoff is achieved with minimal energy consumption that delivers the desired storage element performance. Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 26 Interface with Clock Network and Combinational Logic • To evaluate the total clocking energy per clock cycle in the entire clock subsystem, one needs to add the energy consumed in the clock distribution network. • The energy consumed in the clock distribution network depends on the total switched capacitance which is determined by the total number of clocked storage elements on a chip and the input capacitance of their clock inputs, the total wiring capacitance, and the total switched capacitance of clock drivers as given by: Cdistribnet NFF CinClk , FF Cwire Cswbuff The last two terms depend on buffer insertion/placement strategy and should be minimized. Nov. 14, 2003 Digital System Clocking: Oklobdzija, Stojanovic, Markovic, Nedovic 27