90 likes | 288 Vues
Ocelot and the SST- MacSim Simulator. Genie Hsieh § , Andrew Kerr, Hyesoon Kim, Jaekyu Lee, Nagesh Lakshminarayana , Arun Rodrigues § , Sudhakar Yalamanchili. School of Computer Science and School of Electrical and Computer Engineering Georgia Institute of Technology
 
                
                E N D
Ocelot and the SST-MacSim Simulator Genie Hsieh§, Andrew Kerr, Hyesoon Kim, JaekyuLee, NageshLakshminarayana, Arun Rodrigues§, Sudhakar Yalamanchili School of Computer Science and School of Electrical and Computer Engineering Georgia Institute of Technology Atlanta, GA. 30332 §Scalable Computer Architecture Department Sandia National Laboratories Albuquerque, NM. 87185
System Diversity Amazon EC2 GPU Instances Mobile Platforms Heterogeneity is Mainstream Tianhe-1A KeenelandSystem
Heterogeneity On-Chip Vector Extensions AES Instructions Programmable Pipeline (GEN6) Programmable Accelerator Multiple models of Computation Multi-ISA Denver Sandy Bridge 16, PowerPC cores ARM Style • Accelerators • Crypto Engine • RegEx Engine • XML Engine Memory PowerEN
Heterogeneous Systems: Keeneland Courtesy J. Vetter (GT/ORNL) Keeneland System (7 Racks) Rack (6 Chassis) S6500 Chassis (4 Nodes) ProLiant SL390s G7 (2CPUs, 3GPUs) M2070 Xeon 5660 201528 GFLOPS 40306 GFLOPS 12000-Series Director Switch 6718 GFLOPS 1679GFLOPS 24/18 GB 515GFLOPS 67GFLOPS Integrated with NICS Datacenter GPFS and TG Full PCIe X16 bandwidth to all GPUs 4
Heterogeneous Architecture & Systems Research Common Research Themes • Lexical Analyzer • Parser • Semantic analysis Focus on explicitly data parallel languages – bulk synchronous models • Memory Optimizations • Program Transformations • Control Flow Optimizations • + Many more • Optimization • Code generation • Post pass optimization Instruction set architecture • Microarchitecture • Memory systems • Network on Chip • Power Management • + Many more SIMT (Fermi) VLIW (Caymen) New Designs 5
Research Infrastructure Challenges • Open source • Compiler infrastructures for GPU computing • Microarchitecture cycle-level timing simulators for heterogeneous architectures • Integration between compiler, simulators, and models • Scalable simulation infrastructures • Simulation wall! • Ability to integrate point tools Tile Tile Tile Tile Tile Tile Tile Tile Tile Tile
Tutorial Overview Low level Compiler Infrastructure for GPU Computing Ocelot Dynamic Execution Infrastructure Andrew Kerr, SudhakarYalamanchili MacSim Heterogeneous Architecture Simulator Heterogeneous Cycle-level Architecture Models J. Lee, N. Lakshminarayana, H. Kim SST: Structural Simulation Toolkit Parallel Simulation Infrastructure G. Hsieh, A. Rodrigues