1 / 18

Chop-SPICE: An Efficient SPICE Simulation Technique For Buffered RC Trees

Chop-SPICE: An Efficient SPICE Simulation Technique For Buffered RC Trees. Myung-Chul Kim, Dong-Jin Lee and Igor L. Markov Dept. of EECS, University of Michigan. Fast SPICE Simulation: Motivation.

teo
Télécharger la présentation

Chop-SPICE: An Efficient SPICE Simulation Technique For Buffered RC Trees

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Chop-SPICE: An Efficient SPICE Simulation Technique For Buffered RC Trees Myung-Chul Kim, Dong-Jin Lee and Igor L. Markov Dept. of EECS, University of Michigan TAU 2011, Myung-Chul Kim, University of Michigan

  2. Fast SPICE Simulation: Motivation IC timing closure, especially at advanced technology nodes, heavily depends on highly-accurate timing simulations Increasing impact of PVT variation Rigorous clock skew/slew constraints Circuit size and complexity rapidly increasing Scalable SPICE technique is critical TAU 2011, Myung-Chul Kim, University of Michigan

  3. Key Feature of Chop-SPICE Developed as a compromise simulator (fast yet sufficiently accurate) for use by Contango2 software in the ISPD 2010 contest Simple and practical divide-and-conquer approach Can capture PVT variation and spatial correlation Flexible trade-off between runtime and solution quality Adaptability to various SPICE simulators TAU 2011, Myung-Chul Kim, University of Michigan

  4. ISPD10 Clock Tree Synthesis Contest 45nm 2GHz CPU benchmarks from IBM and Intel Objective: Minimize the overall capacitance of the clock network Subject to constraints: Monte-Carlo SPICE simulations with PVT variations Local clock skew < 7.5 ps Slew rate < 100ps Hard runtime limit per benchmark< 12 hours Low-skew clock trees are especially unforgiving to timing-analysis inaccuracies TAU 2011, Myung-Chul Kim, University of Michigan

  5. Prior Work Ideal Timing Evaluator Speed Elmore,D2M,LnD Delay Models SPICE, AWE Simulation Accuracy • Ideal Timing Evaluator • Fastruntimewithout sacrificing accuracy • High fidelity, adaptability to various SPICE tools TAU 2011, Myung-Chul Kim, University of Michigan

  6. Chop-SPICE Algorithm Definition: Probing Points Given an RC tree , probing points are defined as Input nodes of buffers Sink nodes = Set of probing points = Number of fanouts to probing points at node si Example TAU 2011, Myung-Chul Kim, University of Michigan

  7. Chop-SPICE Algorithm Definition: Granularity Maximum Granularity: Minimum Granularity: Granularity Range: Target Granularity: Target Granularity determines minimum number of probing points to be included in sub-circuits TAU 2011, Myung-Chul Kim, University of Michigan

  8. Target granularityreached? RC Tree instance End RC treeexhausted? Chop-SPICE Flow RCTree traversal no yes Sub-circuitgeneration Apply input slew stimuli Delay and slew propagation InvokeSPICEsimulation Delay and slew update no yes

  9. Sub-circuit Generation Sub-circuits are always delimited by buffers If a probing point is an input node of buffer(s), all fanout buffers are explicitly included in current sub-circuit Buffers at the boundary of a sub-circuit may also appear in another sub-circuit. Facilitating accurate reconstruction of circuit delay from sub-circuit simulation data Can reduce AC sweep time for sub-circuits TAU 2011, Myung-Chul Kim, University of Michigan

  10. Delay Propagation Purpose : After retrieving probing points’ delay from SPICE, they can be propagated in order to capture delay for probing points in subsequent sub-circuits. Calculation of delay from the root node s0 to node sj Find the sub-circuit containing sj . Identify the shortest tree path from s0 to sj , and the earliest node si in the sub-circuit that lies on this tree path (Assume that signal delay from s0 to si was computed recursively). The delay from si to sj is obtained by SPICE simulation and added to delay at si. TAU 2011, Myung-Chul Kim, University of Michigan

  11. Slew Propagation Purpose : After retrieving probing points’ slew from SPICE, they can be used in order to capture slew for probing points in subsequent sub-circuits. Slew at a given node can be expressed as a function of input slew of a sub-circuit. Slew measured at the previous stage (up to the root node si in a given sub-circuit) should be accounted for when stimuli for the current sub-circuit are generated. Slew at a node is directly calculated by SPICE simulation. TAU 2011, Myung-Chul Kim, University of Michigan

  12. Empirical Results: ISPD10 Benchmarks Experimental setup Single threaded runs on a 3.2GHz Intel core i7 Quad CPU Q660 Linux workstation Buffered RC networks generated by applying Contango2 to ISPD’10 high-performance CNS contest benchmark suite Open-source NgSPICE-2.2 Target granularity Varies from (full-scale SPICE simulation) to in order to examine trade-offs TAU 2011, Myung-Chul Kim, University of Michigan

  13. Empirical Results: Avg. Error TAU 2011, Myung-Chul Kim, University of Michigan

  14. Empirical Results: Max. Error and Trade-off

  15. Fidelity Fidelity suggests whether Chop-SPICE is effective as a replacement of full-scale SPICE during optimization On intermediate clock trees produced by Contango2, we use Chop-SPICE and full-scale SPICE to measure sink delays before and after optimization TAU 2011, Myung-Chul Kim, University of Michigan

  16. Future work Extension to general RC networks An algorithm for computing signal delays in non-tree RC networks by partitioning a given circuit into a spanning tree and non-tree links, and invoking an RC-tree computation is given [6] A recent study [16] report 98% correlation to full SPICE runs. Using parallelism Two sub-circuits can be simulated in parallel if they do not lie on the same path to root. The larger the RC tree, the more parallelism can be found. TAU 2011, Myung-Chul Kim, University of Michigan

  17. Conclusions Accurate estimation of circuit delay is becoming more difficult at new technology nodes Clock-skew estimation in CNS requires picosecond precision Chop-SPICE partitions the original RC tree into sub-circuits, simulates each of them with SPICE, and reconstructs global results from simulation data for sub-circuits Empirical validation shows that Chop-SPICE offers attractive trade-offs between accuracy and runtime Chop-SPICE provides not only good accuracy, but also fidelity sufficient for use in external optimization algorithms Can be applied to any SPICE simulators TAU 2011, Myung-Chul Kim, University of Michigan

  18. Questions and Answers Thank you! Time for Questions TAU 2011, Myung-Chul Kim, University of Michigan

More Related