Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Chapter 5 CMOS Inverter Boonchuay Supmonchai Integrated Design Application Research (IDAR) Laboratory July 5, 2004; Revised - June 25, 2005 B.Supmonchai Goals of This Chapter Quantification of Design Metrics of an inverter Static (or Steady-State) Behavior Dynamic (or Transient Response) Behavior Energy Efficiency Optimization of an inverter design Technology Scaling and its impact on the inverter metrics 2102-545 Digital ICs CMOS Inverter 2 B.Supmonchai Digital Gate Design Metrics: Recap Cost Complexity and Area Reliability and Robustness Static Behavior Noise Margin, Regenerative Property Performance Dynamic Behavior Speed (Delay) Energy Efficiency Energy and Power Consumption, Energy-Delay 2102-545 Digital ICs CMOS Inverter 3 B.Supmonchai Why CMOS Inverter? CMOS because it is the dominating technology of the era. High Packing Density Relatively Easy Process Inverter because it is the nucleus of all digital designs. Behavior of more intricate structures (logic gates, adders, etc.) can be almost completely derived by extrapolating the results obtained from the inverters. 2102-545 Digital ICs CMOS Inverter 4 B.Supmonchai CMOS Inverter: A First Glance V DD Driven by Output Of another gate => Fanin PMOS V in V out CL NMOS Collective Capacitances Of Wires and Gates => Fanout 2102-545 Digital ICs CMOS Inverter 5 B.Supmonchai CMOS Inverter: Physical View Recap N Well VDD VDD PMOS 2l PMOS In Contacts Out In NMOS Out Metal 1 Polysilicon NMOS GND 2102-545 Digital ICs CMOS Inverter 6 B.Supmonchai Two CMOS Inverters: Physical View Share power and ground VDD Connect In Metal Abut Cells 2102-545 Digital ICs CMOS Inverter 7 B.Supmonchai CMOS Inverter Static Behavior V DD V DD Rp Vout Vout Rn Vin = 0 Vin = VDD State of Transistors ON: |VGT = VGS - VT| > |VT|, Ron OFF: |VGT = VGS - VT | > |VT|, Roff finite 2102-545 Digital ICs CMOS Inverter 8 B.Supmonchai CMOS Inverter Dynamic Behavior VDD Rp VDD Charge Discharge Vout Vout CL CL Rn Low to High Vin = 0 High to Low Vin = V DD Gate response time is determined by the time to charge CL through Rp (discharge CL through Rn) 2102-545 Digital ICs CMOS Inverter 9 B.Supmonchai CMOS Properties Full rail-to-rail swing High noise margins Logic levels not dependent upon the relative device sizes => Ratioless Transistors can be minimum size Regenerative Property Low output impedance Large Fan-out (albeit with degraded performance) Typical output resistance in k range. 2102-545 Digital ICs CMOS Inverter 10 B.Supmonchai CMOS Properties (2) Extremely high input resistance (MOS transistor is near perfect insulator) nearly zero steady-state input current No direct path between power and ground under steady-state (but there always exists a path with finite resistance between the output and either VDD or GND) no static power dissipation Propagation delay a function of load capacitance and resistance of transistors 2102-545 Digital ICs CMOS Inverter 11 B.Supmonchai NMOS Short Channel I-V Plot Recap 2.5 X 10-4 VGS = 2.5 V 2 VGS = 2.0 V 1.5 VGS = 1.5 V 1 VGS = 1.0 V 0.5 0 0 0.5 1 1.5 2 2.5 VDS (V) NMOS transistor, 0.25 m, Ld = 0.25 m, W/L = 1.5, VDD = 2.5 V, VT = 0.4 V 2102-545 Digital ICs CMOS Inverter 12 B.Supmonchai PMOS Short Channel I-V Plot Recap All polarities of all voltages and currents are reversed -2 VDS (V) -1 0 0 VGS = -1.0 V -0.2 VGS = -1.5 V -0.4 VGS = -2.0 V -0.6 -0.8 VGS = -2.5 V -1 X 10-4 PMOS transistor, 0.25 m, Ld = 0.25 m, W/L = 1.5, VDD = 2.5 V, VT = -0.4 V 2102-545 Digital ICs CMOS Inverter 13 B.Supmonchai Transforming PMOS I-V Plot NMOS and PMOS VTC must be put into a common coordinate set of Vin, Vout, and IDn I Dn IDSp = -IDSn VGSn = Vin ; VGSp = Vin - VDD VDSn = Vout ; VDSp = Vout - VDD VGSp = -1 VGSp = -2.5 2102-545 Digital ICs Vout Vin = 0 Vin = 0 Vin = 1.5 Vin = 1.5 Mirror around x-axis Vin = VDD + VGSp IDn = -IDp CMOS Inverter Horiz. shift over VDD Vout = VDD + VDSp 14 B.Supmonchai CMOS Inverter Load-Line Plot PMOS NMOS X 10-4 2.5 Vin = 0 V Vin = 2.5 V 2 Vin = 0.5 V Vin = 2.0 V 1.5 Vin = 1.0 V 1 Vin = 1.5 V Vin = 2 V Vin = 1.5 V Vin = 1 V Vin = 0.5 V 0.5 Vin = 1.5 V Vin = 1.0 V Vin = 2.0 V Vin = 0.5 V 0 Vin = 2.5 V 0 0.5 1 1.5 Vout (V) 2 2.5 Vin = 0 V CMOS 0.25 m, W/Ln = 1.5, W/Lp = 4.5, VDD = 2.5 V, VTn = 0.4 V, VTp = -0.4 V 2102-545 Digital ICs CMOS Inverter 15 B.Supmonchai CMOS Inverter VTC * VTC = Voltage-Transfer Characteristics NMOS off PMOS res 2.5 NMOS sat PMOS res Vout (V) 2 1.5 NMOS sat PMOS sat 1 NMOS res PMOS sat NMOS res PMOS off 0.5 0 0 0.5 1 1.5 2 2.5 Vin (V) 2102-545 Digital ICs CMOS Inverter 16 B.Supmonchai Robustness of CMOS Inverter Precise Values of Switching Threshold, VM VM is defined as the point where Vin = Vout Noise Margins Piece-Wise Linear Approximation Maximization Process Variations Device Variations Technology Scaling 2102-545 Digital ICs CMOS Inverter 17 B.Supmonchai Switching Threshold At VM where Vin = Vout, both PMOS and NMOS transistors are in saturation (since VDS = VGS) VM rVDD/(1 + r) where r = kpVDSATp/knVDSATn Switching threshold set by the ratio r, which compares the relative driving strengths of the PMOS and NMOS transistors Goal: To set VM = VDD/2 (to maximize noise margins), so r 1 W Lp W Ln 2102-545 Digital ICs kn 'VDSAT,n VM VT ,n VDSAT,n /2 k p 'VDSAT,p VDD VM VT ,p VDSAT,p /2 CMOS Inverter 18 B.Supmonchai Switch Threshold Example In our generic 0.25 micron CMOS process, using the process parameters from Table 3.2, at VDD = 2.5V, and a minimum size NMOS device ((W/L)n of 1.5) VT0(V) (V0.5) VDSAT(V) k’(A/V2) l(V-1) NMOS 0.43 0.4 0.63 115 x 10-6 0.06 PMOS -0.4 -0.4 -1 -30 x 10-6 -0.1 (W/L)p 115 x 10-6 0.63 (1.25 – 0.43 – 0.63/2) = (W/L)n -30 x x 10-6 x -1.0 (1.25 – 0.4 – 1.0/2) = 3.5 (W/L)p = 3.5 x 1.5 = 5.25 for a VM of 1.25 V 2102-545 Digital ICs CMOS Inverter 19 B.Supmonchai Example: Simulated Results 1.8 1.7 1.6 1.5 1.3 M V (V) 1.4 1.25 V 1.2 1.1 1 0.9 r = 3.4 0.8 10 0 10 1 W p /W n Minimum Width-to-Length = 1.5 2102-545 Digital ICs CMOS Inverter 20 B.Supmonchai Observations I VM is relatively insensitive to variations in device ratio Small Variations of the ratio do not significantly disturb VTC. Common Industry Practice to set Wp smaller than the requirement. Increasing the width of the PMOS moves VM towards VDD Increasing the width of the NMOS moves VM toward GND 2102-545 Digital ICs CMOS Inverter 21 B.Supmonchai Noise Margins: Determining VIH and VIL 3 By definition, VIH and VIL are where gain dVout/dVin = -1 Gain g = Slope VOH = VDD NMH = VDD - VIH NML = VIL - GND 2 VM Approximating: VIH = VM - VM /g VIL = VM + (VDD - VM )/g 1 Slope = g VOL = GND 0 VVIL IL VIH Vin VIH A piece-wise linear approximation of VTC 2102-545 Digital ICs CMOS Inverter So high gain in the transition region is very desirable 22 B.Supmonchai CMOS Voltage Gain Vin 0 0.5 0 1 1.5 2 Gain is a strong function of the slopes of the currents in the saturation region, for Vin = VM -2 -4 -6 -8 -10 -12 -14 -16 -18 Determined only by technology parameters, especially channel length modulation (l). Only designer influence through supply voltage and VM (transistor sizing). 2102-545 Digital ICs CMOS Inverter 23 B.Supmonchai Example: VTC and Noise Margin For a 0.25m, (W/L)p/(W/L)n = 3.4, (W/L)n = 1.5 (min size) VDD = 2.5V VM 1.25 V, g = -27.5 VIL = 1.2 V, VIH = 1.3 V NML = NMH = 1.2 Real Value VIL = 1.03 V, VIH = 1.45 V NML = 1.03, NMH = 1.05 Output resistance Sensitivity of gate output with respect to noise low-output = 2.4 k high-output = 3.3 k Preferably as low as possible 2102-545 Digital ICs CMOS Inverter 24 B.Supmonchai Observations II First-Order Analysis overestimates the gain Max. gain only 17 at VM VIL = 1.17V, VIH = 1.33V Piecewise Linear Approximation is too overly optimistic Major contributor to deviation from the true gain CMOS inverter is a poor analog amplifier! One of the major differences between analog and digital designs is that digital circuits operate in the regions of extreme nonlinearity. Well-defined and well-separated high and low signals 2102-545 Digital ICs CMOS Inverter 25 B.Supmonchai Impact of Process Variation on VTC 2.5 Good PMOS Bad NMOS Vout(V) 2 1.5 1 Bad PMOS Good NMOS 0.5 0 0 0.5 1 1.5 2 2.5 Vin(V) Process variations (mostly) cause a shift in the switching threshold 2102-545 Digital ICs CMOS Inverter 26 B.Supmonchai Scaling the Supply Voltage 2.5 0.2 2 1.5 Vout(V) Vout(V) 0.15 1 0.1 0.05 0.5 0 0 0 0.5 1 1.5 2 2.5 0 Gain=-1 Vin(V) Reducing VDD improves Gain… 0.05 0.1 0.15 0.2 Vin(V) But it deteriorates for very low VDD Practical Lower Bound: VDDmin > 2 to 4 kt /q 2102-545 Digital ICs CMOS Inverter 27 B.Supmonchai Observations III Reducing the supply voltage has a positive impact on the energy dissipation … But is also detrimental to the delay of the gate DC Characteristic becomes increasingly sensitive to device variations once supply and intrinsic voltages become comparable Scaling the supply voltage = reducing the swing Reduce internal noise (e.g., crosstalk) More susceptible to external noise that do not scale 2102-545 Digital ICs CMOS Inverter 28 B.Supmonchai CMOS Inverter Dynamic Behavior Transient behavior of the gate is determined by the time it takes to charge and discharge the load capacitance, CL, through on-transistors Delay is a function of load capacitances and transistor on-resistances Getting CL as small as possible is crucial to the realization of high-performance CMOS circuits Transistor Capacitances Wire Capacitances Fanout Wire Resistances also become more important. 2102-545 Digital ICs CMOS Inverter 29 B.Supmonchai Computing the Capacitances VDD VDD Extrinsic M2 Vin Cgd12 M4 Cg4 Cdb2 Vout Cdb1 Cw Vout2 Cg3 M3 M1 Interconnect Intrinsic Fanout Fanout Vin Simplified Model Vin VVout out CL CL Vout2 Simplified Model 2102-545 Digital ICs CMOS Inverter 30 B.Supmonchai Finding Cgd: The Miller Effect M1 and M2 are either in cut-off or in saturation. The floating gate-drain capacitor is replaced by a capacitance-to-ground (gate-bulk capacitor). Cgd1 V Vout Vout Vin V 2Cgd1 V M1 V Vin M1 “A capacitor experiencing identical but opposite voltage swings at both its terminals can be replaced by a capacitor to ground whose value is two times the original value” 2102-545 Digital ICs CMOS Inverter 31 B.Supmonchai Diffusion Capacitances: Cdb1 and Cdb2 We can simplify the diffusion capacitance calculations by using a Keq to linearize the nonlinear capacitor to the value of the junction capacitance under zero-bias Ceq = Keq Cj0 0.25 m Process high-to-low low-to-high Keqbp Keqsw Keqbp Keqsw NMOS 0.57 0.61 0.79 0.81 PMOS 0.79 0.86 0.59 0.7 2102-545 Digital ICs CMOS Inverter 32 B.Supmonchai Extrinsic Capacitances: Cg3 and Cg4 Simplification of the actual situation Assumes all the components of Cgate are between Vout and GND (or VDD) Assumes the channel capacitances of the loading gates are constant The extrinsic, or fan-out, capacitance is the total gate capacitance of the loading gates M3 and M4. Cfan-out = Cgate(NMOS) + Cgate(PMOS) = (CGSOn+ CGDOn+ WnLnCox) + (CGSOp+ CGDOp+ WpLpCox) 2102-545 Digital ICs CMOS Inverter 33 B.Supmonchai Example: Layout of Two Inverters VDD PMOS 1.125/0.25 AD = Drain Area PD = Drain Perimeter AS = Source Area PS = Source Perimeter Out In Metal1 Polysilicon l = 0.125 NMOS 0.375/0.25 Minimum Drawn Length GND 0.25 m W/L AD (m2) PD (m) AS (m2) PS (m) NMOS 0.375/0.25 0.3 1.875 0.3 1.875 PMOS 1.125/0.25 0.7 2.375 0.7 2.375 2102-545 Digital ICs CMOS Inverter 34 B.Supmonchai Example: Components of CL (0.25 m) C Term Expression Value (fF) HL Value (fF) LH Cgd1 2 Cgd0n Wn 0.23 0.23 Cgd2 2 Cgd0p Wp 0.61 0.61 Cdb1 KeqbpnADnCj + KeqswnPDnCjsw 0.66 0.90 Cdb2 KeqbppADpCj + KeqswpPDpCjsw 1.5 1.15 Cg3 (2 Cgd0n)Wn + CoxWnLn 0.76 0.76 Cg4 (2 Cgd0p)Wp + CoxWpLp 2.28 2.28 Cw from extraction 0.12 0.12 CL 6.1 6.0 2102-545 Digital ICs CMOS Inverter 35 B.Supmonchai Wiring Capacitance The wiring capacitance depends upon the length and width of the connecting wires and is a function of the fan-out from the driving gate and the number of fan-out gates. Wiring capacitance is growing in importance with the scaling of technology. 2102-545 Digital ICs CMOS Inverter 36 B.Supmonchai Inverter Propagation Delay “Propagation delay is proportional to the time-constant of the network formed by the on-resistance and the load capacitance” VDD Rp Charge Vout CL Low to High Vin = 0 VDD tp = f(Ron, CL) tpLH = 0.69 Reqp CL tpHL = 0.69 Reqn CL t pHL t pLH tp 2 Reqn Reqp 0.69C L 2 Discharge Vout CL Rn High to Low Vin = V DD To equalize rise and fall times make the on-resistance of the NMOS and PMOS approximately equal. 2102-545 Digital ICs CMOS Inverter 37 B.Supmonchai Inverter Transient Response (0.25 µm) Simulation Analysis VDD= 2.5V W/Ln = 1.5 W/Lp = 4.5 Reqn= 13 k /1.5 Reqp= 31 k /4.5 3 Vin, Vout(V) 2.5 2 tpLH 1.5 1 0.5 tpHL 0 -0.5 0 50 100 150 200 250 tpHL = 36 psec tpLH = 29 psec t(psec) tpHL = 39.9 psec and tpLH = 31.7 psec tp = (36+29)/2 = 32.5 psec Analysis results is too optimistic ~ 10% better 2102-545 Digital ICs CMOS Inverter 38 B.Supmonchai Inverter Propagation Delay, Revisited To see how a designer can optimize the delay of a gate, we have to expand Req in the delay equation. tp(normalized) 5.5 5 4.5 4 3.5 3 2.5 2 tpHL = 0.69 Reqn CL = 0.69(3CVDD)/(4IDSATn) t pHL CL 0.52 W Ln knVDSATn 1.5 1 0.8 1 1.2 1.4 1.6 1.8 2 2.2 2.4 VDD(V) 2102-545 Digital ICs CMOS Inverter 39 B.Supmonchai Minimizing Propagation Delay Reduce CL Keep the drain diffusion as small as possible Increase W/L ratio of the transistor Most powerful and effective way Watch out for self-loading! When the intrinsic capacitance dominates Increase VDD Trade off energy efficiency for performance Very minimal improvement above a certain level Reliability concerns enforce a firm upper bound on VDD 2102-545 Digital ICs CMOS Inverter 40 B.Supmonchai PMOS-to-NMOS Ratio So far PMOS and NMOS have been sized such that their Req’s match (ratio of 3 to 3.5) symmetrical VTC equal high-to-low and low-to-high propagation delays If speed is the only concern, reduce the width of the PMOS device! widening the PMOS degrades the tpHL due to larger parasitic capacitance = (W/L)p/(W/L)n CW opt r 1 C C dn2 Cgn 2 r = Reqp/Reqn resistance ratio of identically-sized PMOS and NMOS 2102-545 Digital ICs CMOS Inverter 41 B.Supmonchai PMOS-to-NMOS Ratio Effects 50 tpLH tpHL • of 2.4 (= 31 k/13 k) gives symmetrical response tp(psec) 45 • opt ~ 1.6 - 1.9 40 tp 35 Analytic Simulated 2.4 30 1 1.5 2 2.5 3 3.5 4 4.5 5 • When wire capacitance is negligible (Cdn1+Cgn2 >> CW), opt = r • If wire capacitance dominates then larger value of must be used 2102-545 Digital ICs CMOS Inverter 42 B.Supmonchai Device Sizing for Performance Divide capacitive load, CL, into Cint : intrinsic - diffusion and Miller effect Cext : extrinsic - wiring and fanout tp = 0.69 Req Cint (1 + Cext/Cint) = tp0 (1 + Cext/Cint) where tp0 = 0.69 Req Cint is the intrinsic (unloaded) delay of the gate Widening both PMOS and NMOS by a factor S reduces Req by an identical factor (Req = Rref/S), but raises the intrinsic capacitance by the same factor (Cint = SCiref) tp = 0.69 Rref Ciref (1 + Cext/(SCiref)) = tp0(1 + Cext/(SCiref)) 2102-545 Digital ICs CMOS Inverter 43 B.Supmonchai Observation IV Intrinsic Delay of the inverter tp0 is independent of the sizing of the gate; • tp0 can be determined purely by technology and inverter layout • With no load the increased drive strength of the gate is totally offset by the increased capacitance Any S sufficiently larger than (Cext/Cint) would yield a much better performance gain with a substantial area increase 2102-545 Digital ICs CMOS Inverter 44 B.Supmonchai Sizing Impacts on Delay The majority of the improvement is already obtained for S = 5. for a fixed load tp(psec) 38 36 34 Sizing factors larger than 10 barely yield any extra gain (and cost significantly more area). 32 30 28 26 24 22 20 1 3 5 7 9 S 2102-545 Digital ICs 11 13 15 self-loading effect (intrinsic capacitance dominates) CMOS Inverter 45 B.Supmonchai Impact of Fanout on Delay Extrinsic capacitance, Cext, is a function of the fanout of the gate the larger the fanout, the larger the external load. First determine the input loading effect of the inverter. Both Cg and Cint are proportional to the gate sizing, so Cint = Cg is independent of gate sizing and tp = tp0 (1 + Cext/ Cg) = tp0 (1 + f /) The delay of an inverter is a function of the ratio between its external load capacitance and its input gate capacitance: the effective fan-out f f = Cext/Cg 2102-545 Digital ICs CMOS Inverter 46 B.Supmonchai Inverter Chain Goal: to minimize the delay through an inverter chain In Out Cg,1 1 2 N CL The delay of the j-th inverter stage is tp,j = tp0 (1 + Cg,j+1/(Cg,j)) = tp0(1 + fj/ ) Overall Delay: tp = tp,j = tp0 (1 + Cg,j+1/(Cg,j)) If CL is given How should the inverters be sized? How many stages are needed to minimize the delay? 2102-545 Digital ICs CMOS Inverter 47 B.Supmonchai Sizing the Inverters in the Chain The optimum size of each inverter is the geometric mean of its neighbors – meaning that if each inverter is sized up by the same factor f wrt the preceding gate, it will have the same effective fan-out and the same delay f N CL Cg,1 N F where F represents the overall effective fan-out of the circuit (F = CL/Cg,1) The minimum delay through the inverter chain is t p Nt p0 1 N F / The relationship between tp and F is linear for one inverter, square root for two, etc. 2102-545 Digital ICs CMOS Inverter 48 B.Supmonchai Example: Inverter Chain Sizing In Out Cg,1 1 f2 = 4 f=2 CL = 8 Cg,1 CL/Cg,1 has to be evenly distributed over N = 3 inverters CL/Cg,1 = 8/1 3 f = 8 = 2 2102-545 Digital ICs CMOS Inverter 49 B.Supmonchai Determining N: Optimal Number of Inverters What is the optimal value for N given F (=fN) ? If the number of stages is too large, the intrinsic delay of the stages becomes dominate If the number of stages is too small, the effective fan-out of each stage becomes dominate The optimum N is found by differentiating the minimum delay expression divided by the number of stages and setting the result to 0, giving N N + F - ( F lnF)/N = 0 For = 0 (ignoring self-loading) N = ln (F) and the effective-fan out becomes f = e = 2.71828 For = 1 (the typical case) the optimum effective fan-out (tapering factor) turns out to be close to 3.6 2102-545 Digital ICs CMOS Inverter 50 B.Supmonchai Optimum Effective Fan-Out 7 5 Normalized Delay 6 Fopt 4.5 4 3.5 5 4 3 2 3 1 2.5 0 0 0.5 1 1.5 2 2.5 3 1 1.5 2 2.5 3 3.5 4 4.5 5 f Choosing f larger than optimum has little effect on delay and reduces the number of stages (and area). Common practice to use f = 4 (for = 1) But too many stages has a substantial negative impact on delay 2102-545 Digital ICs CMOS Inverter 51 B.Supmonchai Example: Inverter (Buffer) Staging 1 Cg,1 = 1 1 CL = 64 Cg,1 1 CL = 64 Cg,1 4 tp 1 64 65 2 8 18 3 4 15 4 2.8 15.3 16 Cg,1 = 1 2.8 Cg,1 = 1 2102-545 Digital ICs f 8 Cg,1 = 1 1 N CL = 64 Cg,1 8 22.6 CL = 64 Cg,1 CMOS Inverter 52 B.Supmonchai Impact of Buffer Staging for Large CL 11 Two Stage Chain 8.3 Opt. Inverter Chain 8.3 100 101 22 16.5 1,000 1001 65 24.8 10,000 10,001 202 33.1 F ( = 1) Unbuffered 10 Impressive speed-ups with optimized cascaded inverter chain for very large capacitive loads. 2102-545 Digital ICs CMOS Inverter 53 B.Supmonchai Input Signal Rise/Fall Time In reality, the input signal changes gradually (and both PMOS and NMOS conduct for a brief time). This affects the current available for charging/discharging CL and impacts propagation delay. t = input signal slope 54 s 52 ts is due to the limited driving capability of the preceding gate 50 tp(psec) tp increases linearly with increasing input slope, ts, once ts > tp 48 46 44 42 40 38 36 0 20 40 60 80 tS(psec) for a minimum-size inverter with a fan-out of a single gate 2102-545 Digital ICs CMOS Inverter 54 B.Supmonchai Design Challenge A gate is never designed in isolation: its performance is affected by both the fan-out and the driving strength of the gate(s) feeding its inputs. (Revised tp expression) tip = tistep + ti-1step ( 0.25) Keep signal rise times smaller than or equal to the gate propagation delays. good for performance good for power consumption Keeping rise and fall times of the signals small and of approximately equal values is one of the major challenges in high-performance designs - slope engineering. 2102-545 Digital ICs CMOS Inverter 55 B.Supmonchai Delay with Long Interconnect When gates are farther apart, wire capacitance and resistance can no longer be ignored. (rw, cw, L) Vin cint Vout cfan tp = 0.69RdrCint + (0.69Rdr+0.38Rw)Cw + 0.69(Rdr+Rw)Cfan where Rdr = (Reqn + Reqp)/2 tp = 0.69Rdr(Cint+Cfan) + 0.69(Rdrcw+rwCfan)L + 0.38rwcwL2 Wire delay rapidly becomes the dominant factor (due to the quadratic term) in the delay budget for longer wires. 2102-545 Digital ICs CMOS Inverter 56 B.Supmonchai Where Does Power Go? Static Power Consumption Ideally zero for static CMOS but in the real world.. Leakage Current Loss Diodes and Transistors constantly losing charge Dynamic Power Consumption Charging/Discharging Capacitances Major Source of Power Dissipation in CMOS Circuits Direct-Path Current Loss Short circuit between Power Rail during Switching 2102-545 Digital ICs CMOS Inverter 57 B.Supmonchai Dynamic Power Consumption VDD iVDD(t) Vin Vout CL i Energy Supplied/Cycle = VDD (t)VDD dt = CL * VDD2 0 Energy Stored/Cycle = i VDD (t)v out (t)dt 0 = CL * VDD2 / 2 Pdyn = Energy/cycle * fclk = CL * VDD2 * fclk 2102-545 Digital ICs CMOS Inverter 58 B.Supmonchai Switching Activity Power dissipation does not depend on the size of the devices but depends on how often the circuit is switched. Switching Activity frequency of energy-consuming transition = f 01 Clock Pdyn = CL * VDD2 * f 01 = CL * VDD2 * P01 * fclk Gate output = Ceff * VDD2 * fclk P01 = 0.25, 2102-545 Digital ICs f01 = fclk / 4 Effective Capacitance Ceff = Average Capacitance Switched per clock cycle CMOS Inverter 59 B.Supmonchai Lowering Dynamic Power Lowering Physical Capacitance Capacitance: Function of fan-out, wire length, transistor sizes Quadratic Effect Supply Voltage: Has been dropping with successive generations Pdyn = CL VDD2 P01 f Activity factor: How often, on average, do gates switch? Clock frequency: Increasing… Reduction can be obtained only at Logic and Architectural Abstraction Levels 2102-545 Digital ICs CMOS Inverter 61 B.Supmonchai Short Circuit Power Consumption VDD tsc Isc Vin Vout CL Finite slope of the input signal causes a direct current path between VDD and GND for a short period of time during switching when both the NMOS and PMOS transistors are conducting (active). 2102-545 Digital ICs CMOS Inverter 62 B.Supmonchai Short Circuit Currents Determinates Esc = tsc VDD Ipeak P01 Psc = tsc VDD Ipeak f01 tsc = Duration of the slope of the input signal Ipeak determined by the saturation current of the PMOS and NMOS transistors which depend on their sizes, process technology, temperature, etc. strong function of the ratio between input and output slopes a function of CL 2102-545 Digital ICs CMOS Inverter 63 B.Supmonchai Impact of CL on Psc VDD VDD Isc Imax Isc 0 Vin Vout Vin CL Vout CL Large capacitive load Small capacitive load Output fall time significantly larger than input rise time. Output fall time substantially smaller than input rise time. 2102-545 Digital ICs CMOS Inverter 64 B.Supmonchai Ipeak as a Function of CL x 10-4 When load capacitance is small, Ipeak is large. 2.5 CL = 20 fF 2 1.5 CL = 100 fF 1 CL = 500 fF 0.5 0 0 2 4 -0.5 Short circuit dissipation is minimized by matching the rise/fall times of the input and output signals - slope engineering. 6 x 10-10 time (sec) 500 psec input slope 2102-545 Digital ICs CMOS Inverter 65 B.Supmonchai Psc as a Function of Rise/Fall Times VDD= 3.3 V 8 When load capacitance is small (tsin/tsout > 2 for VDD > 2V) the power is dominated by Psc 7 6 5 VDD = 2.5 V 4 3 2 VDD = 1.5V 1 0 0 1 2 3 4 5 If VDD < VTn + |VTp| then Psc is eliminated since both devices are never on at the same time. tsin/tsout W/Lp = 1.125 m/0.25 m W/Ln = 0.375 m/0.25 m CL = 30 fF 2102-545 Digital ICs normalized wrt zero input rise-time dissipation CMOS Inverter 66 B.Supmonchai Static (Leakage) Power Consumption VDD Pstat= VDD Istat VDD Vout = VDD Drain junction leakage Gate leakage dominant factor. Sub-threshold current All leakages increase exponentially with temperature Junction leakage doubles every 9C Sub-threshold current becomes more concern in vDSM The closer the threshold voltage to zero, the larger the leakage current at VGS = 0V (when NMOS off) 2102-545 Digital ICs CMOS Inverter 67 B.Supmonchai Leakage as a Function of VT Continued scaling of supply voltage and the subsequent scaling of threshold voltage will make sub-threshold conduction a dominant component of power dissipation. 1.E-02 ID (A) 1.E-04 1.E-06 1.E-08 VT=0.4V VT=0.1V 1.E-10 An 90mV/decade VT roll-off - so each 255mV increase in VT gives 3 orders of magnitude reduction in leakage (but adversely affects performance) 1.E-12 0 0.2 0.4 0.6 0.8 1 VGS (V) 2102-545 Digital ICs CMOS Inverter 68 B.Supmonchai TSMC Processes Leakage and VT From MPR, June 2000, pp. 19 – Performance of various TSMC processes (G generic, LP low power, ULP ultra low power, HS high speed) CL018 G CL018 LP CL018 ULP CL018 HS CL015 HS CL013 HS Vdd 1.8 V 1.8 V 1.8 V 2V 1.5 V 1.2 V Tox (effective) 42 Å 42 Å 42 Å 42 Å 29 Å 24 Å Lgate 0.16 m 0.16 m 0.18 m 0.13 m 0.11 m 0.08 m IDSat (n/p) (A/m) 600/260 500/180 320/130 780/360 860/370 920/400 20 1.60 0.15 300 1,800 13,000 0.42 V 0.63 V 0.73 V 0.40 V 0.29 V 0.25 V 30 22 14 43 52 80 Ioff (leakage) (A/m) VTn FET Perf. (GHz) 2102-545 Digital ICs CMOS Inverter 69 B.Supmonchai Exponential Increase in Leakages Leakage currents double every 10 degree increase in temperature Ileakage(nA/m) 10000 0.10 m 0.13 m 1000 0.18 m 0.25 m 100 10 1 30 40 50 60 70 80 90 100 110 Temperature (C) The Leakage Power is six orders of magnitude smaller than the dynamic power (at room temperature) 2102-545 Digital ICs CMOS Inverter 70 B.Supmonchai Energy and Power Equations E = CL VDD2 P01 + tsc VDD Ipeak P01 + VDD IleakageTclock f01 = P01 * fclock P = CL VDD2 f01 + tsc VDD Ipeak f01 + VDD Ileakage Dynamic power (~90% today and decreasing relatively) 2102-545 Digital ICs Short-circuit power (~8% today and decreasing absolutely) CMOS Inverter Leakage power (~2% today and increasing) 71 B.Supmonchai Sizing for Minimum Energy In Out Cg1 f 1 Cext Goal: Minimize Energy of the whole circuit Design parameters: f and VDD tp tpref of circuit with f = 1 and VDD = Vref Overall Effective Fan-out F = Cext/Cg1 Intrinsic Delay of the inverter tp0 ~ VDDt/(VDDt - VTE) f F t p t p 01 1 f 2102-545 Digital ICs CMOS Inverter 72 B.Supmonchai Sizing for Minimum Energy II Performance Constraint (=1) tp t pref tp0 t p 0ref F 2 f f VDD Vref VTE Vref VDD VTE 3 F F 2 f f 1 3 F Energy for single Transition 2 E VDD Cg1 1 1 f F V 2 2 f F E DD E ref Vref 4 F 2 2102-545 Digital ICs CMOS Inverter 73 B.Supmonchai Sizing for Minimum Energy III 1.5 4 3.5 F=1 2 1 2.5 E/Eref VDD (V) 3 5 2 1.5 1 10 0.5 20 0 1 2 3 4 5 6 0.5 7 0 2 3 4 5 6 7 f f 1 Optimum sizing occurs at fopt = F Increasing device sizes beyond fopt increase self-loading factor Deteriorate performance and require increase in supply voltage 2102-545 Digital ICs CMOS Inverter 74 B.Supmonchai Observation V Device sizing, combined with supply voltage reduction, is very effective in reducing the energy consumption For F = 1, minimum size device is the most effective For network with large effective fan-out (F >> 1), a large reduction factor of almost 10 can be obtained. Oversizing transistors beyond the optimal value results in a hefty increase of energy Unfortunately, a common approach in many today’s design Optimal sizing factor for energy is smaller than the one for performance (delay), especially for large F For a fan-out of 20, fopt(energy) = 3.53, fopt(delay) = 4.47 2102-545 Digital ICs CMOS Inverter 75 B.Supmonchai Power-Delay and Energy-Delay Product Power-delay product (PDP) = Pav * tp = (CLVDD2)/2 PDP is the average energy consumed per switching event (Watts * sec = Joule) Lower power design could simply be a slower design Energy-delay product (EDP) = PDP * tp = Pav * tp2 EDP is the average energy consumed multiplied by the computation time required Takes into account that one can trade increased delay for lower energy/operation (e.g., via supply voltage scaling that increases delay, but decreases energy consumption) 2102-545 Digital ICs CMOS Inverter 76 B.Supmonchai Energy-Delay (no rmalized) Energy-Delay Plot 0.25 micron 15 EDP Energy-Delay 10 3 CL2VDD 2VDD VTE Where VTE = VT+VDSAT/2 5 Energy 3 VDDopt VTE 2 Delay 0 0.5 1 1.1 V 1.5 Vdd (V) 2 2.5 VTE ≈ (VTn +| VTp |)/2 = 0.8 V V VTn = 0.43 V, VDSATn = 0.63 V, VTEn = 0.74 VTp = -0.4 V, VDSATp = -1 V, VTEp = -0.9 V 2102-545 Digital ICs CMOS Inverter VDDopt = (3/2)*0.8 = 1.2 V 77 B.Supmonchai Observation VI Voltage Dependence of the EDP Higher Supply Voltages reduce delay, but harm the energy. Vice Versa for low voltages VDDopt simultaneously optimizes performance (delay) and energy For submicron technologies with VT in the range of 0.5 V, VDDopt ~ 1V. VDDopt does not necessarily represent the optimum voltage for a given design problem Goal of the design (speed or power) determinates the supply voltage 2102-545 Digital ICs CMOS Inverter 78 B.Supmonchai Goals of Technology Scaling Make things cheaper: Want to sell more functions (transistors) per chip for the same money Build same products cheaper, sell the same part for less money Price per transistor has to be reduced But also want to be faster, smaller, lower power 2102-545 Digital ICs CMOS Inverter 79 B.Supmonchai Technology Scaling Goals of scaling the dimensions by 30%: Reduce gate delay by 30% (increase operating frequency by 43%) Double transistor density Reduce energy per transition by 65% (50% power savings @ 43% increase in frequency Die size used to increase by 14% per generation Technology generation spans 2-3 years 2102-545 Digital ICs CMOS Inverter 80 B.Supmonchai Technology Evolution (ITRS2000) International Technology Roadmap for Semiconductors (ITRS) (http://public.itrs.net) Year of Introduction 1999 Technology node [nm] 180 Supply [V] 1.5-1.8 Wiring levels 2000 2001 2004 2008 2011 2014 130 90 60 40 30 1.5-1.8 1.2-1.5 0.9-1.2 0.6-0.9 0.5-0.6 0.3-0.6 6-7 6-7 7 8 9 9-10 10 Max frequency [GHz],Local-Global 1.2 1.6-1.4 2.1-1.6 3.5-2 7.1-2.5 11-3 14.9 -3.6 Max P power [W] 90 106 130 160 171 177 186 Bat. power [W] 1.4 1.7 2.0 2.4 2.1 2.3 2.5 Node years: 2007/65nm, 2010/45nm, 2013/33nm, 2016/23nm 2102-545 Digital ICs CMOS Inverter 81 B.Supmonchai Technology Evolution (1999) 2102-545 Digital ICs CMOS Inverter 82 B.Supmonchai Technology Scaling Models Full Scaling (Constant Electrical Field) Ideal model - dimensions and voltage scale together by the same factor S Fixed Voltage Scaling Most common until recently Only dimensions scale, voltages remain constant General Scaling Most realistic for todays situation Voltages and dimensions scale with different factors 2102-545 Digital ICs CMOS Inverter 83 B.Supmonchai Scaling Long Channel Devices 2102-545 Digital ICs CMOS Inverter 84 B.Supmonchai Scaling Short Channel Devices 2102-545 Digital ICs CMOS Inverter 85 B.Supmonchai Scaling Wire Capacitances S = Technology Scaling, U = Voltage Scaling, SL = Wire-length Scaling c = impact of fringing and interwire capacitance Parameter Wire Capacitance Wire Delay Wire Energy Relation General Scaling WL/t c/SL RonCint c/SL CmV2 c/SLU2 Wire Delay/Intrinsic Delay cS/SL Wire Energy/ Intrinsic Energy cS/SL 2102-545 Digital ICs CMOS Inverter 86 B.Supmonchai Power Density vs. Scaling Factor In correspondance 1000 2 Power Density (mW/mm) Power density increase approximately with S2 with fixed-voltage scaling 100 Recent Trend is more in line with Full-scaling Constant power 10 density 11 10 Scaling Factork ?i normalized by 4m design rule?j 2102-545 Digital ICs CMOS Inverter Accelerated VDD scaling and more attention to powerreducing design techniques 87 B.Supmonchai Evolution of Wire Delay and Gate Delay How the ratio of wire over intrinsic contributions will actually evolve is debatable 2102-545 Digital ICs CMOS Inverter 88 B.Supmonchai Looking into the Future… (Year 2010) Performance 2X/16 months 1 TIP (terra instructions/s) 30 GHz clock Size No of transistors: 2 Billion Die: 40*40 mm Power 10kW!! Leakage: 1/3 active Power 2102-545 Digital ICs CMOS Inverter 89 B.Supmonchai Some Interesting Questions What will cause this model to break? When will it break? Will the model gradually slow down? Power and power density Leakage Process Variation 2102-545 Digital ICs CMOS Inverter 90