This study explores GPU-accelerated computing using GPGPU (General-Purpose computing on Graphics Processing Units), which leverages GPUs alongside CPUs to enhance scientific and engineering applications. It introduces key performance metrics, including Memory Warp Parallelism (MWP) and Computation Warp Parallelism (CWP), to analyze the GPU's architecture. Additionally, performance modeling is discussed, focusing on the relationship between parameters like block size and data type, along with memory latency considerations. Finding the optimal balance between resources and latency is essential for achieving peak performance across various applications and devices.
Performance Modeling in GPGPU Computing
Wenjingxu
Professor: Dr. Box
What’s GPGPU? GPU-accelerated computing is the use of a graphics processing unit (GPU) together with a CPU to accelerate scientific, engineering, and enterprise applications.
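To make the division of labor concrete, here is a minimal CUDA vector-addition sketch: the CPU allocates the data and launches the kernel, and the GPU performs the element-wise work in parallel. The buffer names, problem size, and block size are illustrative assumptions, not values from the study.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Each GPU thread adds one pair of elements; the CPU sets up the data
// and launches the kernel.
__global__ void vecAdd(const float* a, const float* b, float* c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) c[i] = a[i] + b[i];
}

int main() {
    const int n = 1 << 20;                 // illustrative problem size
    const size_t bytes = n * sizeof(float);
    float *a, *b, *c;
    // Managed memory keeps the sketch short: the runtime migrates the
    // buffers between CPU and GPU as needed.
    cudaMallocManaged(&a, bytes);
    cudaMallocManaged(&b, bytes);
    cudaMallocManaged(&c, bytes);
    for (int i = 0; i < n; ++i) { a[i] = 1.0f; b[i] = 2.0f; }

    const int blockSize = 256;             // threads per block (tunable)
    const int gridSize = (n + blockSize - 1) / blockSize;
    vecAdd<<<gridSize, blockSize>>>(a, b, c, n);
    cudaDeviceSynchronize();

    printf("c[0] = %f\n", c[0]);           // expect 3.0
    cudaFree(a); cudaFree(b); cudaFree(c);
    return 0;
}
```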
What’s modeling? A model is a simplified representation of a system or phenomenon; it is the most explicit way to describe one. Here, the parameters we choose are used to build formulas that analyze the system.
Related work Hong and Kim [3] introduce two metrics, Memory Warp Parallelism (MWP) and Computation Warp Parallelism (CWP), to describe the GPU's parallel architecture. Zhang and Owens [4] develop a performance model based on their microbenchmarks, which lets them identify bottlenecks in a program. Supada [5] presents a performance model in which memory latencies vary depending on the data type and the type of memory.
1 Introduction and background Different applications and devices cannot use the same settings. The goal is to find the relationship between the parameters in this model and to choose the best block size for each application on each device, so as to reach peak performance.
Different data sizes combined with different block sizes yield different performance, as the measurement sketch below illustrates.
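One way to observe this effect, assuming the vecAdd kernel and buffers from the sketch above, is to time the same kernel at several block sizes with CUDA events:

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Times the vecAdd kernel (from the sketch above) at several block
// sizes; a, b, c are device-accessible buffers of length n.
void sweepBlockSizes(const float* a, const float* b, float* c, int n) {
    for (int blockSize = 32; blockSize <= 1024; blockSize *= 2) {
        int gridSize = (n + blockSize - 1) / blockSize;
        cudaEvent_t start, stop;
        cudaEventCreate(&start);
        cudaEventCreate(&stop);
        cudaEventRecord(start);
        vecAdd<<<gridSize, blockSize>>>(a, b, c, n);
        cudaEventRecord(stop);
        cudaEventSynchronize(stop);      // wait for the kernel to finish
        float ms = 0.0f;
        cudaEventElapsedTime(&ms, start, stop);
        printf("block size %4d: %8.3f ms\n", blockSize, ms);
        cudaEventDestroy(start);
        cudaEventDestroy(stop);
    }
}
```

Typically the timings bottom out at an intermediate block size rather than at either extreme, which is the balance point the model tries to predict.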
Block size setting under the thread limitation:
N_MB ≥ N_TB = N · N_TW (N an integer) ≥ N_RT / N_RB
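Under one plausible reading of these symbols (N_MB as the device's maximum threads per block, N_TW as the warp size, N_RT as the maximum resident threads per SM, and N_RB as the maximum resident blocks per SM; the original slides do not spell these out), the rule can be checked against the device properties the CUDA runtime reports:

```cuda
#include <cuda_runtime.h>

// Checks the thread-limit rule for a candidate block size ntb:
//   N_MB >= N_TB,  N_TB a multiple of N_TW,  N_TB >= N_RT / N_RB.
// The symbol-to-field mapping is an assumption, not from the study.
bool blockSizeOkForThreads(int ntb, const cudaDeviceProp& p) {
    int nmb = p.maxThreadsPerBlock;           // N_MB
    int ntw = p.warpSize;                     // N_TW
    int nrt = p.maxThreadsPerMultiProcessor;  // N_RT
    int nrb = p.maxBlocksPerMultiProcessor;   // N_RB (CUDA 11+)
    return ntb <= nmb && ntb % ntw == 0 && ntb >= nrt / nrb;
}
```

For example, fill `p` with `cudaGetDeviceProperties(&p, 0)` and use this predicate to filter the candidate block sizes in the sweep above.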
Block size setting under stream multiprocessor resources:
M_R / M_TR ≥ N · N_TB (N an integer)
N · N_TB ≤ N_RT
N ≤ M_SM / M_SB
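Reading M_R as registers per SM, M_TR as registers per thread, M_SM as shared memory per SM, M_SB as shared memory per block, and N as the number of blocks resident on one SM (again an assumption; the slides leave the symbols undefined), the three bounds translate into a small occupancy-style calculation. M_TR and M_SB come from the compiled kernel, e.g. from nvcc's --ptxas-options=-v output:

```cuda
#include <climits>
#include <cuda_runtime.h>

// Upper bound on the model's N, the blocks resident per SM, given a
// block size ntb, the kernel's registers per thread, and its shared
// memory per block. The symbol mapping is an assumption.
int residentBlocksPerSM(int ntb, int regsPerThread, size_t smemPerBlock,
                        const cudaDeviceProp& p) {
    int byRegs = regsPerThread > 0
        ? p.regsPerMultiprocessor / (regsPerThread * ntb)    // M_R / (M_TR * N_TB)
        : INT_MAX;
    int byThreads = p.maxThreadsPerMultiProcessor / ntb;     // N_RT / N_TB
    int bySmem = smemPerBlock > 0
        ? (int)(p.sharedMemPerMultiprocessor / smemPerBlock) // M_SM / M_SB
        : INT_MAX;
    int n = byRegs < byThreads ? byRegs : byThreads;
    return bySmem < n ? bySmem : n;
}
```

A larger N hides more memory latency, but only while all three bounds hold; this is the resource-versus-latency trade-off the conclusion refers to.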
Conclusion Although more threads can hide memory access latency, more threads also consume more resources. Finding the balance point between the resource limits and memory latency is a shortcut to peak performance. Across different applications and devices, this performance model shows its advantage: it is adaptable and, without any rework or redesign, lets the application run at its best tuning.