1 / 1

Break it, Use it!

B. C. D. Core. Core. L1 Cache. L1 Cache. BlkState. BlkState. DirState. Break it, Use it!. L2 Cache. So what is MV5?. DRAM. GPU-like SIMD (SIMT) Hardware thread scheduler API for SIMD threads. Directory-based coherence (MESI, MSI). M5 based. On-chip Network (Mesh). A.

paley
Télécharger la présentation

Break it, Use it!

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. B C D Core Core L1 Cache L1 Cache BlkState BlkState DirState Break it, Use it! L2 Cache So what is MV5? DRAM GPU-like SIMD (SIMT) Hardware thread scheduler API for SIMD threads Directory-based coherence (MESI, MSI) M5 based On-chip Network (Mesh) A A / 11111111 B / 11111001 C / 00000110 Simulation Management for space exploration D / 11111111 Post-dominator time (b) Branch divergence and re-convergence (a) The example program Dual Core Tiled Cores OO+SIMD Note: SIMD cores can share the same address space with other cores over coherent caches! MV5: A RECONFIGURABLE SIMULATOR FOR HETEROGENEOUS MULTICORE ARCHITECTURESJiayuan Meng*, Kevin SkadronUniversity of Virginia * Now at Argonne National Laboratory Single-Instruction, Multiple-Threads Do you need it? Simulators for Today’s Architectures • If you want to explore: • SIMD + IO/OO • SIMD + coherent caches • SIMD + OCN • Simple Banked Cache • Underlying middleware • If you are OK with • System emulation • Kernels Out-of-Order (OO) core: SimpleScalar Simultaneous Multithreading: SMTSIM Chip-multiprocessor w/t OO cores: SESC Chip-multiprocessor w/t In-Order (IO), OO cores: Simics+Gems+Garnet, SimFlex Intel’s Microarchitecture: PTLSim GPU: GPGPUSim But Future is Unpredictable… Separate basic cache functionalities with coherence protocols General purpose / Heterogeneous / Integrated Accelerators? • Diversity • Modularity • Scalability • Co-design M5 provides such a platform MV5 is based on M5 Potential Configurations MESI/MSI SIMD OO On-chip Network In-order • MV5 Website • https://sites.google.com/site/mv5sim/home • MV5 Mailing list: • http://groups.google.com/group/mv5sim caches DRAM Acknowledgements This work was supported in part by SRC grant No. 1607, NSF grant nos. IIS-0612049 and CNS-0615277, a grant from Intel Research, and a professor partnership award from NVIDIA Research. We would like to thank Jeremy W. Sheaffer, David Tarjan, Shuai Che, and Jiawei Huang for their helpful inputs in power modeling, area estimation, and benchmark implementations.

More Related