Performance Comparison of x86 CPU and NVIDIA GPU for Cellular Automata Programming

GPU Architectural Considerations for Cellular Automata ProgrammingA comparison of performance between a x86 CPU and nVidia Graphics Card Stephen Orchowski, CSE 520, 12/3/2008

Project Goals • To study the architecture of a GPU • To study a programming model based on that architecture and gain experience using it • To determine how the various architectural features affect performance and to what degree • To suggest an optimum configuration of a particular algorithm for the selected GPU

Cellular Automata 2-Dimensional grid where each cell value in one generation is based on simple rules using the values of adjacent surrounding neighbors Can start with random or pre-defined patterns Successive generations evolve into complex patterns Highly parallelizable!

Results…so far…

Conclusions…so far… • GPUs offer speedup over a serial processor, but there is no “silver bullet” for programming techniques • Developers have to tweak the programming to get maximum performance out of the architecture • Return on programming effort doesn’t always justify use of the GPU – Implementing CA on a GPU probably isn’t worth the effort unless the grid size is extensively large, but CA does have applications to more complex algorithms with Computational Fluid Dynamics and does provide a good framework for studying the architecture • Even different versions of a GPU will also necessitate further tweaks to the program, even if the card is of the same GPU hardware family • CUDA vs. Cell programming and other architectures – tradeoffs?

Performance Comparison of x86 CPU and NVIDIA GPU for Cellular Automata Programming

Performance Comparison of x86 CPU and NVIDIA GPU for Cellular Automata Programming

Presentation Transcript

CSE 520 Advanced Computer Architecture Lec 2 - Introduction

November 12, 2008

CSE 520 Advanced Computer Architecture Lec 2 - Introduction

12 2008

CSE Spring 2008

CSE 321 Discrete Structures

Spring 2008 CSE 1105

Introduction to Computer Architecture CSE 520 Fall 2007

CSE 341 Lecture 12

CSE 403 Lecture 12

CSE 8A Lecture 12

2008. 11. 12

Stephen Orchowski – 11/15/2008 CSE 520 – Advanced Computer Architecture

CSE 520: Advanced Computer Architecture: Reliability

CSE 524: Lecture 12

CSE 7348 - class 12

CSE 321 Discrete Structures

CSE 524: Lecture 12

CSE 303 Lecture 12

CSE 113