Exploring ARM CPU Benchmarking for High-Performance Computing
Delve into ARM CPU benchmarking for enhanced performance evaluation of advanced RISC machine processors in various applications and industries, from smartphones to supercomputers.
Exploring ARM CPU Benchmarking for High-Performance Computing
E N D
Presentation Transcript
ARM CPU Benchmarking Robert Reed | University of the Witwatersrand
Introduction 2 Quad Cores in the Samsung S4 • What is an ARM Processor • Advanced RISC Machine • Where is it being used • 95% of Smartphone Market • Consumer Products • Supercomputers (K-Supercomputer Japan - RISC) • Why use them? • Power efficient • Low capital cost
Introduction Solution Problem China’s Tianhe-2 • How are we going to use ARMs • High-Throughput Supercomputer • Large numbers of ARMs = many cores = parallel
Benchmark Selection • Characterising the ARM architecture • The main factors to look at: • CPU • Cache • RAM • Connectivity
Benchmark Selection • CoreMark by EEMBC • Supported by ARM Holdings • Uses common algorithms • Strict submission rules
Benchmark Selection • High Performance LINPACK by Jack Dongarra • Used in TOP500 list
Understanding the benchmarks • Improvement: Almost 30% • Compile time: Approx 22 hours • Math Libraries • Linear Algebra Package (ATLAS)
Conclusion • Great for high throughput • Energy Efficient • Need better performance • GPU co processing • Problem Specific
ARM CPU Benchmarking BACKUP SLIDES
Layout A7 Cubieboard X5 Switch Server INTERNET A9 X5 Wandboard A15 X1 ODroid Physical Layout
Results - HPL Scalability – 4x A9
Results - HPL Block Size
Understanding the benchmarks • NB test for the A9 • Dependant on array size • Larger array = large NB • Dependant on whole system • Changing Block Allocation NB
Results - HPL Multi Precision
Results - HPL Power Measurements