1 / 22

Hybrid Multi-Core Architecture for Boosting Single-Threaded Performance

Hybrid Multi-Core Architecture for Boosting Single-Threaded Performance. Presented by: Peyman Nov 2007. Overview. Previous Architectures New Hybrid Architecture Possible Benefits Scrutiny Experimental Results Relation to Project. Something Old. CMP (single-Chip Multi-core Processors)

ellery
Télécharger la présentation

Hybrid Multi-Core Architecture for Boosting Single-Threaded Performance

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Hybrid Multi-Core Architecture for Boosting Single-Threaded Performance Presented by: Peyman Nov 2007

  2. Overview • Previous Architectures • New Hybrid Architecture • Possible Benefits • Scrutiny • Experimental Results • Relation to Project

  3. Something Old • CMP (single-Chip Multi-core Processors) • Two or more independent cores • Single ISA heterogeneous multiprocessors • Cores of varying size, performance • Same ISA • Improve throughput for multi-threaded • Single-Threaded?

  4. Superscalar • Increase performance w/o recompiling • Efficiently handle runtime events • Branch Direction • Target Address • Load Latency • Memory Dependency • Limited ILP: Hardware Instruction Window

  5. VLIW • Very Long Instruction Word • Shift Hardware complexity to compiler • High Clock Frequency • Energy-Efficient • No need to analyze data dependency • No scheduling of independent instruction

  6. Something New • Dual-Core Architecture [1] • Bus-based snooping • Communicate Using L2 • In Future: • Interconnections • Small operand transfer buffer

  7. Potential Benefits • VLIW core can operate at high clock rate • Simple Superscalar core • More aggressive compiler optimization • Due to the superscalar speculative operations • Simple hardware • Energy Efficient • Scalable

  8. Hybrid Compiler • At TLP aware of: • Execution Bandwidth • Frequencies • At ILP: • Architectural details of Superscalar? • # functional units and latencies of VLIW • Helper threads

  9. Optimization Phases • Phase 1 • Exploit speculative threads (helper threads) • Phase 2 • Extract non-speculative multi-grain parallelism • Partition source code • Predictable (static analysis or profiling) • Unpredictable (suitable for superscalar core) • A lot more …

  10. Did that sound right? • Will the data be in the L2 cache when the VLIW core needs it?

  11. What if?

  12. Pre-Execution • Not a new idea • Using superscalar core to minimize L2 miss stalls • Stalling VLIW pipelines • Predictable load latencies? • Cache profiling

  13. Definitions • Delinquent Loads • Small number of load operations are responsible for the majority of data cache misses. • Delinquent Loads Threshold • A pre-set threshold for number of allowable stall cycles caused by a static load instruction

  14. Pre-Execution Thread • Make load operations non-faulting • Remove all store operations

  15. Evaluation • Simulated Cores [1]

  16. Evaluation (2) • Hybrid compiler built upon Trimaran compiler • A cycle-accurate model • Based on integration of • VLIW simulator from Trimaran • Superscalar simulator: simplescalar

  17. Evaluation (3) • Seven single-threaded applications from • SPEC 2000 INT • SPEC 92 FP

  18. Base, Pre-Execution, Prefetch

  19. L2 Miss Latency

  20. Delinquent Loads Threshold

  21. Relation? • Relation to course project • Project focuses on scalability of optimization techniques • Relation to course • How multi-cores can help single-threaded applications

  22. Reference • [1] Yan J., Zhang W., "Hybrid multi-core architecture for boosting single-threaded performance", ACM SIGARCH Computer Architecture News 35(1): 141-148, 2007

More Related