* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download ee222-lect1
Stray voltage wikipedia , lookup
Power factor wikipedia , lookup
Power inverter wikipedia , lookup
Wireless power transfer wikipedia , lookup
Standby power wikipedia , lookup
Electrical substation wikipedia , lookup
Pulse-width modulation wikipedia , lookup
Electrification wikipedia , lookup
Integrated circuit wikipedia , lookup
Audio power wikipedia , lookup
Power over Ethernet wikipedia , lookup
Rectiverter wikipedia , lookup
Electric power system wikipedia , lookup
Immunity-aware programming wikipedia , lookup
Buck converter wikipedia , lookup
Amtrak's 25 Hz traction power system wikipedia , lookup
Voltage optimisation wikipedia , lookup
History of electric power transmission wikipedia , lookup
Power electronics wikipedia , lookup
Distribution management system wikipedia , lookup
Power MOSFET wikipedia , lookup
Power engineering wikipedia , lookup
Switched-mode power supply wikipedia , lookup
EE222 Winter 2013 Sung Mo (Steve) Kang • • • • Office BE235 Phone 831-459-3580 Cell 831-706-5456Office BE23531-459-3580 [email protected] References • Main: Jan Rabaey, Low Power Design Essentials, Springer, 2009 • Others: – A. P. Chandrakasan, et al., Low Power CMOS Digital Design, Kluwer Academic Publishers,1995 – ITRS 2011 ( http://www.itrs.net) – R. Sarpeshkar, Ultra Low Power Bioelecrronics, Cambridge Univ. Press, 2010 – F. Catthoor, Unified Low Power Design Flow, Kluwer Academic Publishers, 2000 – Selected articles in journals and conferences Lecture No. Date Subject Reference 1 Jan 8 (T) Introduction Rby-Ch1 2 Jan 10 (Th) Power, Energy Basics Rby-Ch3 3 Jan 15 (T) Circuit level power optimization Rby-Ch4 4 Jan 17 (Th) Systems level power optimization Rby-Ch5 5 Jan 22 (T) continued Rby-Ch5 6 Jan 24 (Th) Interconnects Rby-Ch6 7 Jan 29 (T) Clock signaling Rby-Ch6 8 Jan 31 (Th) Low power memory Rby-Ch7 9 Feb 5 (T) Low power memory Rby-Ch9 10 Feb 7 (Th) Midterm exam 11 Feb 12 (T) Low power CAS Rby-Ch8 12 Feb 14 (Th) Low power CAS Rby-Ch10 13 Feb 19 (T) Project proposal presentation 14 Feb 21 (Th) Ultra low power/voltage design 15 Feb 26 (T) continued 16 Feb 28 (Th) Low power design flows 17 Mar 5 (T) Continued 18 Mar 7 (Th) Project presentation 19 Mar 12 (T) Continued 20 Mar 14 (Th) Course review Rby-Ch11 Rby-Ch12 Note Course Grade Policy • Midterm examination • Project proposal • Final project report – Presentation 25% – Report document 25% 25% 25% 50% Guidelines for Course Projects • Each team is consisted of 2 members • Two members are expected to contribute equally. • Project options – Selection of a major milestone paper published in a journal or a conference that addresses low power design of VLSI circuits. – Comprehensive and critical review of the paper. – If possible, propose ways to improve the technical contents. – Both proposals and final presentations will be peer reviewed by other teams. From Transistor toofIntegrated Circuit The Origin VLSI Systems From •Transistor Invention device to ICconcept (1945) Shockley’s semiconductor • Bardeen and Brattain’s point-junction transistor (1947) • Shockley’s junction (sandwich) transistor (1950) • Kahng and Attala’s MOSFET (1960) 1956 Nobel Prize, Physics (J. Bardeen, W. Shockley, and W. Brattain) 2000 Nobel Prize, Physics (Jack Kilby) Dec. 16, 1947 Germanium,1T, 1C, 3R, Oscillator, 0.04 inch X 0.06 inch (Sept. 12, 1958) Intel 4004 (‘71), Pentium 3B (‘99), Xeon Nahalem (‘10) Powered by Moore’s Law 9.5M transistors (2.5um), > 4,000X (2X every 2.3 yrs) 2,300 transistors (10um) 2.3B transistors in 8-core Xeon Nehalem-EX (45nm) 240X over 3B processor (2X every 1.4 yrs) . Beyond Moore’s Law . . Next Generation Flexible Electronic System . Ref. A. Nathan, et al., Flexible Electronics, Proc. of the IEEE, May 2012 k is typically 2 ΔΔΔ 𝑃𝑑𝑦𝑛𝑎𝑚𝑖𝑐 = α 𝐶 𝑉𝑑𝑑 ΔV f 𝑃𝑑𝑦𝑛𝑎𝑚𝑖𝑐 = α 𝐶 𝑉𝑑𝑑 ΔV f • α denotes a fractional switching rate less than or equal to 1 • • • • C is the switched capacitance 𝑉𝑑𝑑 is the power supply voltage ΔV is the voltage swing f is the clock frequency To reduce 𝑃𝑑𝑦𝑛𝑎𝑚𝑖𝑐 it is desirable to reduce all these quantities so long The performance goals can be met. Hierarchy of Limits of Power • • • • • 5 -System 4 -Circuit 3 -Device 2 -Material 1 -Fundamental Fundamental Limit Ref. J. D. Meindl,Chapter 2, Low Power Digital CMOS Design, A. P. Chandrasakan & R. W. Brodersen • The minimum allowable power supply voltage [R. Swanson72] 𝑉𝑠 𝑚𝑖𝑛 ≥ (2 − 4) • 𝑘𝑇 , 𝑞 where 𝑘𝑇 =25mV 𝑞 at T=300°K, thus 50 to 100mV. 3 important limits are due to the basic principles of thermodynamics, quantum mechanics, and electromagnetics. • R 𝑒𝑛 Mean circuit Noise voltage 𝑅 Due to thermodynamicsGiven a noise voltage 𝑒𝑛 and a resistance R between a node on a chip and the ground, the power max. deliverable is 2 𝑒𝑛 2 , 4𝑅 𝑃𝑎𝑣𝑎𝑖𝑙𝑎𝑏𝑙𝑒 =(𝑒𝑛 /(R+R)) x R= where 𝑒𝑛 2 = 4kTRB and thus 1 𝑃𝑎𝑣𝑎𝑖𝑙𝑎𝑏𝑙𝑒 = kTB, where B=bandwidth= Δ𝑡 𝑃𝑠𝑖𝑔𝑛𝑎𝑙 ≥ γ 𝑃𝑎𝑣𝑎𝑖𝑙𝑎𝑏𝑙𝑒 and𝐸𝑠𝑖𝑔𝑛𝑎𝑙 = 𝑃𝑠𝑖𝑔𝑛𝑎𝑙 Δt ≥ γ kT For T=300°K, γ =4, 𝐸𝑠𝑖𝑔𝑛𝑎𝑙 ≥ 1.66 𝑋10−20 joules=0.104 eV, A limit to move an electron across a potential difference of 0.104V. P= γkT/𝑡𝑑 vs. P=h/𝑡𝑑 2 • Quantum-mechanical limit A measurable energy change associated with switching must satisfy the following limit based on the Heisenberg’s uncertainty principle: ΔE ≥ h/𝑡𝑑 and P ≥ h/𝑡𝑑 2 where h= Planck’s constant=6.582x10−16 eV.s P 10−3 P=h/𝑡𝑑 2 10−6 P= γkT/𝑡𝑑 10−9 10−6 𝑡𝑑 Hierarchy of Limits of Power • • • • • 5 -System 4 -Circuit 3 -Device 2 -Material 1 -Fundamental System Level Approach to Minimizing Power Level Particular tasks System Power down, System partitioning Algorithm Complexity, Locality, Concurrency, Regularity, Data representation Architecture Concurrency, Instruction set selection, Signal correlation, Data representation Circuit/Logic Logic optimization, Novel circuits, Transistor sizing, Voltage islands Physical Design Compact layout, Interconnects Technology SOI, Advanced packaging Cray Supercomputer, Interconnects . Complexity of interconnects (Rent’s rule) Neuromorphic Computing Cat scale cortical simulation on LLNL Dawn Blue Gene/P supercomputer with 147,456 CPUs, 144TB main memory (Communications of ACM, July 2011) Ref. G. Snider, Memristor and Memristive Systems Symp., Nov. 2008 Cognitive Computing With Neuroscience, Supercomputing, and Nanotechnology Cognitive Computing may lead to novel learning systems Non Von Neumann architectures new programming paradigms integration, analysis and action on vast amounts of data from many sources at once [Ref. D. Modha, et al., Communications of ACM, Aug. 2011] Mouse Rat Cat Neurons (B) 0.016 0.055 0.763 2 20 Synapses (T) 0.128 0.442 6.10 16 200 . Monkey Human The Missing Link in Constitutive Relations • V = R I (Resistor) V • Q = C V (Capacitor) • Φ = L I (Inductor) V = d/dt Φ Φ • Φ = f (Q) Q dΦ /dt= df(Q)/dt = df(Q)/dQ. dQ/dt I = d/dt Q V= M I Memristance M=M(Q) I Nonvolatile Memristive Memory • Nanotechnology enables ultra dense Early 1kB memory based on p-Si/a-Si/Ag by Univ. Michigan memory HP-Hynix collaboration on next generation memory products High quality memristive memory chips in a few years Expected features (compared to FLASH) Speed: >10x. Power <0.1X, Density >5X S.H. Cho, et. al., Nano Letters, 2009; New York Times, Aug. 2010 Nanostore-Based Distributed System with 3D-Stacked Memristors HP- Collocate processors and Memristor memory on chip Axes of Device Requirements (Memristor)