210 likes | 236 Views
Delve into ARM CPU benchmarking for enhanced performance evaluation of advanced RISC machine processors in various applications and industries, from smartphones to supercomputers.
E N D
ARM CPU Benchmarking Robert Reed | University of the Witwatersrand
Introduction 2 Quad Cores in the Samsung S4 • What is an ARM Processor • Advanced RISC Machine • Where is it being used • 95% of Smartphone Market • Consumer Products • Supercomputers (K-Supercomputer Japan - RISC) • Why use them? • Power efficient • Low capital cost
Introduction Solution Problem China’s Tianhe-2 • How are we going to use ARMs • High-Throughput Supercomputer • Large numbers of ARMs = many cores = parallel
Benchmark Selection • Characterising the ARM architecture • The main factors to look at: • CPU • Cache • RAM • Connectivity
Benchmark Selection • CoreMark by EEMBC • Supported by ARM Holdings • Uses common algorithms • Strict submission rules
Benchmark Selection • High Performance LINPACK by Jack Dongarra • Used in TOP500 list
Understanding the benchmarks • Improvement: Almost 30% • Compile time: Approx 22 hours • Math Libraries • Linear Algebra Package (ATLAS)
Conclusion • Great for high throughput • Energy Efficient • Need better performance • GPU co processing • Problem Specific
ARM CPU Benchmarking BACKUP SLIDES
Layout A7 Cubieboard X5 Switch Server INTERNET A9 X5 Wandboard A15 X1 ODroid Physical Layout
Results - HPL Scalability – 4x A9
Results - HPL Block Size
Understanding the benchmarks • NB test for the A9 • Dependant on array size • Larger array = large NB • Dependant on whole system • Changing Block Allocation NB
Results - HPL Multi Precision
Results - HPL Power Measurements