Performance Portability Across Diverse Computer Architectures
A performance analysis of the first generation of HPC-optimized Arm processors
Scaling Results from the First Generation of Arm-based Supercomputers
Evaluating attainable memory bandwidth of parallel programming models via BabelStream
Comparative Benchmarking of the First Generation of HPC-Optimised Arm Processors on Isambard
Portable methods for measuring cache hierarchy performance
GPU-STREAM: now in 2D!
GPU-STREAM v2.0: Benchmarking the achievable memory bandwidth on many-core processors across diverse parallel programming models