1 / 28

Distributed Sleep Transistor Network for Power Reduction*

Distributed Sleep Transistor Network for Power Reduction*. Changbo Long ECE Department, UW-Madison clong@cae.wisc.edu Lei He EDA Research Group EE Department, UCLA lhe@ee.ucla.edu. *Partially sponsored by NSF CAREER Award 0093273, SRC grant HJ-1008 and Intel Corporation. Outline.

silas
Télécharger la présentation

Distributed Sleep Transistor Network for Power Reduction*

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Distributed Sleep Transistor Network for Power Reduction* Changbo Long ECE Department, UW-Madison clong@cae.wisc.edu Lei He EDA Research Group EE Department, UCLA lhe@ee.ucla.edu *Partially sponsored by NSF CAREER Award 0093273, SRC grant HJ-1008 and Intel Corporation

  2. Outline • Motivation • Background • Distributed sleep transistor network (DSTN) • Structure, advantages, modeling and sizing algorithm • Experiment results • Conclusion and future work

  3. Leakage power will become the dominant power component Reduced feature size Increased system integration  more idle modules Leakage reduction techniques To reduce leakage for active modules Dual threshold voltage assignment for sub-threshold leakage [Mahesh et-al, ICCAD’02] Pin reordering for gate leakage [Lee et-al, DAC’03] To reduce leakage for idle modules Input vector control [Johnson et-al, DAC’99] Power gating [Kao et-al, DAC’98][Anis-et al, DAC’02] Motivation

  4. System level: use power management processor (PMP) to generate control signals [Mutoh et-al, JSSC’96] PMP can be distributed Gate level: use sleep transistors to turns off power supply Concerned with performance loss and area overhead Motivation Vdd Sleeptr. gn g1 Virtual GND Sleep Sleep Sleep tr. PMP

  5. Performance Loss • Performance loss  Increase in the propagation delay • Performance loss is proportional to Vst • ist  Maximum Simultaneous Switching Current (MSSC) Vdd ist g1 gn ist

  6. MSSC • MSSC: maximum current in the time domain and the input vector domain Time MSSC g1 g3 g2 t t t t g1 g3 g2 t t t t Input vector ig2 ig1 ig3 + + itotal =

  7. Area Overhead • Area overhead: the sleep transistor area and the routing area of virtual ground wires • Design convention: given performance loss , minimize area overhead Vdd g1 gn MSSC

  8. Related Work • Module-based design methodology [Mutoh-et al, JSSC’95 ’96] [Kao-et al, DAC’98] • A single and large sleep transistor accommodates entire module [JSSC’96] • Manual sizing automatic sizing considering dischargepatterns [Kao-et al, DAC’98] • Voltage drop on long virtual ground wires is nontrivial, and results in large area

  9. Related Work • Module-based design methodology [Mutoh-et al, JSSC’95 ’96] [Kao-et al, DAC’98] • A single and large sleep transistor accommodates entire module [JSSC’96] • Manual sizing automatic sizing considering dischargepatterns [Kao-et al, DAC’98] • Voltage drop on long virtual ground wires is nontrivial, and results in large area • Cluster-based design methodology [Anis-et al, DAC’02] • Group gates into clusters and minimize peak current in clusters by clustering algorithms • Insert a sleep transistor for each cluster to avoid long virtual ground wires • Clustering may conflict with time-driven placement

  10. Sleep transistor area • Area*: the sleep transistor area ignoring the resistance of virtual ground wires • MSSCmodule < ∑iMSSCcluster_i  area*module<area*cluster

  11. Sleep transistor area • Area*: the sleep transistor area ignoring the resistance of virtual ground wires • MSSCmodule < ∑iMSSCcluster_i  area*module<area*cluster • Considering the resistance of virtual ground wires, Areamod > Areaclu [Anis-et al, DAC’02] • DSTN has the smallest area • AreaDSTN≈ Area*mod

  12. DSTN: Distributed Sleep Transistor Network • DSTN enhancescluster-based design by connecting clusters with extra virtual ground wires DSTN Cluster-based design

  13. Current Discharging Balance Reduces Size DSTN Cluster-based design • Cluster-based design • Current discharges by its private sleep transistor  large transistor size • DSTN • Current discharges by bothprivate and neighboring sleep transistors  small transistor size

  14. Additional Advantages of DSTN • DSTN introduces NO constraint on placement • Wire overhead of DSTN is small Additionalwires Sleeptr. Sleeptr. Cluster Cluster-based design DSTN

  15. Modeling of DSTN • Entire module  resistance network plus current source Switching current Ri Rst

  16. DSTN Sizing Problem • DSTN Sizing Problem (DSTN/SP) • Given DSTN topology, DSTN/SP finds the size for every sleep transistor such that the total transistor area of DSTN is minimized and the performance loss constraint is satisfied for every cluster Switching current PL< W=? W=? Rst=? Rst=? Vst<ε Vst<ε PL< Rst=? Rst=? W=? W=? Vst<ε Vst<ε

  17. Difficulties of DSTN/SP • Primary challenge: current source • Dependency between the current sources • Current varies w.r.t. time • Secondary challenge: resistance network • Given current source, size Rst to minimize transistor area while satisfy performance loss constraints • Does any algorithms exist in the literature? • No exact solution • Close solution for Power/Ground network sizing [Boyd, et-al ISPD’01] • We have developed an algorithm based on special properties of DSTN/SP

  18. Properties of DSTN/SP Solutions • P1: Assuming Ri=0, • : Performance loss constraint,MSSC: Maximum current

  19. Properties of DSTN/SP Solutions • P2: given current source, AreaDSTN increases when Ri increases • The increase is limited because Ri << Rst • Ri=∞, AreaDSTN=Areacluster

  20. Properties of DSTN/SP Solutions • P3: Assuming cluster current and AreaDSTN to be constant, to achieve minimum performance loss,

  21. Algorithm for DSTN/SP • P1, P2: Total sleep transistor area of DSTN is determined by •  [0.05, 0.5], empirical parameter increases when Ri increases • P3: Size of each individual sleep transistor is • Key is to estimate MSSCmodule and MSSCcluster

  22. Maximum Current Estimation • Estimate MSSCmodule • Circuit current strongly depends on input vector • The space of input vector increase exponentially with the number of primary input • Genetic algorithm (GA) based algorithm is used [Jiang et-al, TVLSI’00] • Efficient algorithm to estimate MSSCcluster has been proposed in the paper

  23. Base-line Case: Cluster-based Design • Cluster-based design without considering placement constraint • Given a circuit and cluster size, partition gates into clusters such that ∑i MSSCcluster_i is minimized and Areacluster is minimized in turn • Clustering algorithm • Simulated Annealing (SA) • Sizing algorithm • Each individual sleep transistor • Total area

  24. Experiment Setup • Gate level synthesis • Sizing • Estimate maximum current for clusters and the entire module • Apply the sizing algorithms • Verification • Simulate the circuit and obtain the current source by 10,000 random input vectors • Obtain performance loss by solving the resistance network with circuit KCL and KVL equations • Find the maximum performance loss among the performance loss for each input vector • Custom layout • Implement a four-bit CLA using 0.35μm technology • Determine size by SPICE simulation • Cluster-based design: each cluster satisfy the performance loss constraint • DSTN: the entire module satisfy the performance loss constraint

  25. Result of Gate Level Synthesis Cluster-based DSTN • On average, DSTN reduces total W/L by 49.8% with smaller performance loss Maximum Performance Loss W/L of Sleep Transistors C880 C432 C499 C880 C432 C1355 C499 C5315 C1908 C7552 C3540 C2670 C6288 C1355 C5315 C1908 C7552 C3540 C2670 C6288

  26. Cluster-based design Custom Layout in 0.35μm Sleep transistors Each cluster is accommodated by a sleep transistor • DSTN Sleep transistors Sleep transistors are connected by virtual ground wires Virtual ground wires

  27. Custom Layout Comparison No sleep transistor Cluster-based DSTN Leakage current delay Sleep tr. Area Total area • DSTN reduces runtime leakage by 50x and 5x • compared to no sleep transistor and cluster-based design, respectively • DSTN reduces sleep transistor area by 6.83x with 6.6% smaller performance degradation • compared to the cluster-based design

  28. Conclusion and Future Work • We have proposed DSTN and the sizing algorithm • DSTN has reduced area, less leakage current and supply voltage drop • Future work • Ideal power/ground network is assumed in this paper • Investigate the co-design of DSTN and the power/ground network

More Related