Publications

Books

  1. McIntosh-Smith, Simon, Michael Klemm, Bronis R. de Supinski, Tom Deakin, and Jannis Klinkenberg, eds. OpenMP: Advanced Task-Based, Device and Compiler Programming. Lecture Notes in Computer Science. Cham, Switzerland: Springer, 2023. https://doi.org/10.1007/978-3-031-40744-4.
    doi more details
  2. Deakin, Tom, and Timothy G. Mattson. Programming Your GPU with OpenMP. Cambridge, MA: MIT Press, 2023. https://mitpress.mit.edu/9780262547536/programming-your-gpu-with-openmp/.
    Available November 7, 2023
    url more details

Conferences and Workshops

  1. Curtis, Samuel, Harry Waugh, Tom Deakin, and Gihan Mudalige. “Mini-Combust—an Open-Source Unstructured FGM Combustion Mini-App for Co-Designing Aero-Engines at Extreme Scale.” In International Conference on High Performance Computing, Data, and Analytics. IEEE, 2024.
    more details
  2. Lane Cassell, Thomas, Tom Deakin, Aksel Alpay, Vincent Heuveline, and Gonzalo Brito Gadeschi. “Efficient Tree-Based Parallel Algorithms for N-Body Simulations Using C++ Standard Parallelism.” In Workshop on Irregular Applications: Architectures and Algorithms Held in Conjunction with Supercomputing (P3HPC). IEEE, 2024. https://doi.org/10.1109/SCW63240.2024.00099.
    doi pdf more details
  3. Lin, Wei-Chen, Tom Deakin, and Simon McIntosh-Smith. “A Metric for HPC Programming Model Productivity.” In International Workshop on Performance, Portability and Productivity in HPC Held in Conjunction with Supercomputing (P3HPC). IEEE, 2024. https://doi.org/10.1109/SCW63240.2024.00160.
    doi more details
  4. Wilkinson, Finn, Alex Cockrean, Wei-Chen Lin, Simon McIntosh-Smith, and Tom Deakin. “Assessing the GPU Offload Threshold of GEMM and GEMV Kernels on Modern Heterogeneous HPC Systems.” In International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems Held in Conjunction with Supercomputing (PMBS). IEEE, 2024. https://doi.org/10.1109/SCW63240.2024.00188.
    doi pdf more details
  5. Applencourt, Thomas, Aditya Sadawarte, Servesh Muralidharan, Colleen Bertoni, JaeHyuk Kwack, Ye Luo, Esteban Rangel, et al. “Ponte Vecchio Across the Atlantic: Single-Node Benchmarking of Two Intel GPU Systems.” In International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems Held in Conjunction with Supercomputing (PMBS). IEEE, 2024. https://doi.org/10.1109/SCW63240.2024.00184.
    doi more details
  6. Moore, Joseph, Tom Deakin, and Simon McIntosh-Smith. “AI-Assisted Design-Space Analysis of High-Performance Arm Processors.” In International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems Held in Conjunction with Supercomputing (PMBS). IEEE, 2024. https://doi.org/10.1109/SCW63240.2024.00186.
    doi more details
  7. Ngueguin, Roussel Desmond Nzoyem, David A.W. Barton, and Tom Deakin. “A Comparison of Mesh-Free Differentiable Programming and Data-Driven Strategies for Optimal Control under PDE Constraints.” In Fourth Workshop on Artificial Intelligence and Machine Learning for Scientific Applications Help in Conjunction with Supercomputing (AI4S). IEEE, 2023. https://doi.org/10.1145/3624062.3626078.
    In press
    doi url more details
  8. Koskela, Tuomas, Ilektra Christidi, Mosè Giordano, Emily Dubrovska, Jamie Quinn, Christopher Maynard, Dave Case, Kaan Olgu, and Tom Deakin. “Principles for Automated and Reproducible Benchmarking.” In First International Workshop on HPC Testing and Evaluation of Systems, Tools, and Software Held in Conduction with Supercomputing (HPCTESTS). IEEE, 2023. https://doi.org/10.1145/3624062.3624133.
    In press
    doi more details
  9. Deakin, Tom, James Cownie, Wei-Chen Lin, and Simon McIntosh-Smith. “Heterogeneous Programming for the Homogeneous Majority.” In International Workshop on Performance, Portability and Productivity in HPC Held in Conjunction with Supercomputing (P3HPC). IEEE, 2022. https://doi.org/10.1109/P3HPC56579.2022.00006.
    doi pdf more details
  10. Hammond, Jeff R., Tom Deakin, James Cownie, and Simon McIntosh-Smith. “Benchmarking Fortran DO CONCURRENT on CPUs and GPUs Using BabelStream.” In International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems Held in Conjunction with Supercomputing (PMBS). IEEE, 2022. https://doi.org/10.1109/PMBS56514.2022.00013.
    doi pdf more details
  11. Lin, Wei-Chen, Tom Deakin, and Simon McIntosh-Smith. “Evaluating ISO C++ Parallel Algorithms on Heterogeneous HPC Systems.” In International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems Held in Conjunction with Supercomputing (PMBS). IEEE, 2022. https://doi.org/10.1109/PMBS56514.2022.00009.
    doi pdf more details
  12. Deakin, Tom, Simon McIntosh-Smith, S. John Pennycook, and Jason Sewall. “Analyzing Reduction Abstraction Capabilities.” In International Workshop on Performance, Portability and Productivity in HPC Held in Conjunction with Supercomputing (P3HPC). IEEE, 2021. https://doi.org/10.1109/P3HPC54578.2021.00007.
    doi more details
  13. Deakin, Tom, Simon McIntosh-Smith, Aksel Alpay, and Vincent Heuveline. “Benchmarking and Extending SYCL Hierarchical Parallelism.” In Workshop on Hierarchical Parallelism for Exascale Computing Held in Conjunction with Supercomputing (HiPAR). IEEE, 2021. https://doi.org/10.1109/HiPar54615.2021.00007.
    doi more details
  14. Poenaru, Andrei, Tom Deakin, Simon McIntosh-Smith, Simon D. Hammond, and Andrew J. Younge. “An Evaluation of the Fujitsu A64FX for HPC Applications.” In Cray User Group (CUG), 2021.
    pdf more details
  15. Lin, Tom, Tom Deakin, and Simon McIntosh-Smith. “On Measuring the Maturity of SYCL Implementations by Tracking Historical Performance Improvements.” In International Workshop on OpenCL and SYCLCon (IWOCL/SYCLCon). ACM, 2021. https://doi.org/10.1145/3456669.3456701.
    doi more details
  16. Deakin, Tom, James Cownie, Simon McIntosh-Smith, Justin Lovegrove, and Richard Smedley-Stevenson. “Hostile Cache Implications for Small, Dense Linear Solves.” In Workshop on Memory Centric High Performance Computing Held in Conjunction with Supercomputing (MCHPC). IEEE, 2020. https://doi.org/10.1109/MCHPC51950.2020.00010.
    doi more details
  17. Deakin, Tom, Andrei Poenaru, Tom Lin, and Simon McIntosh-Smith. “Tracking Performance Portability on the Yellow Brick Road to Exascale.” In International Workshop on Performance, Portability and Productivity in HPC Held in Conjunction with Supercomputing (P3HPC). IEEE, 2020. https://doi.org/10.1109/P3HPC51967.2020.00006.
    doi more details
  18. Sewall, Jason, John Pennycook, Douglas W Jacobsen, Tom Deakin, and Simon McIntosh-Smith. “Interpreting and Visualizing Performance Portability Metrics.” In International Workshop on Performance, Portability and Productivity in HPC Held in Conjunction with Supercomputing (P3HPC). IEEE, 2020. https://doi.org/10.1109/P3HPC51967.2020.00007.
    All authors have equal contribution to this work
    doi more details
  19. Deakin, Tom, and Simon McIntosh-Smith. “Evaluating the Performance of HPC-Style SYCL Applications.” In International Workshop on OpenCL and SYCLCon (IWOCL/SYCLCon). ACM, 2020. https://doi.org/10.1145/3388333.3388643.
    doi more details
  20. Deakin, Tom, Simon McIntosh-Smith, James Price, Andrei Poenaru, Patrick Atkinson, Codrin Popa, and Justin Salmon. “Performance Portability Across Diverse Computer Architectures.” In International Workshop on Performance, Portability and Productivity in HPC Held in Conjunction with Supercomputing (P3HPC). IEEE, 2019. https://doi.org/10.1109/P3HPC49587.2019.00006.
    doi more details
  21. Deakin, Tom, Simon McIntosh-Smith, Justin Lovegrove, Richard Smedley-Stevenson, and Andrew Hagues. “Developing a Mini-App for Exploring Algorithms for Unstructured Mesh Deterministic Discrete Ordinates Transport on Many-Core Architectures.” In International Conference on Mathematics and Computational Methods Applied to Nuclear Science and Engineering (M&C). ANS, 2019. https://doi.org/https://www.ans.org/pubs/proceedings/article-46837/.
    doi pdf url more details
  22. McIntosh-Smith, Simon, James Price, Tom Deakin, and Andrei Poenaru. “Scaling Results from the First Generation of Arm-Based Supercomputers.” In Cray User Group (CUG), 2019. https://research-information.bris.ac.uk/en/publications/scaling-results-from-the-first-generation-of-arm-based-supercompu.
    pdf url more details
  23. Deakin, Tom, Simon McIntosh-Smith, Justin Lovegrove, Richard Smedley-Stevenson, and Andrew Hagues. “UnSNAP: A Mini-App for Exploring the Performance of Deterministic Discrete Ordinates Transport on Unstructured Meshes.” In Workshop on Representative Applications Help in Conjuction with IEEE International Conference on Cluster Computing (WRAp). IEEE, 2018. https://doi.org/10.1109/CLUSTER.2018.00076.
    doi more details
  24. McIntosh-Smith, Simon, James Price, Tom Deakin, and Andrei Poenaru. “Comparative Benchmarking of the First Generation of HPC-Optimised Arm Processors on Isambard.” In Cray User Group (CUG), 2018.
    pdf more details
  25. McIntosh-Smith, Simon, Matt Martineau, Tom Deakin, Grzegorz Pawelczak, Wayne Gaudin, Paul Garrett, Wei Liu, Richard Smedley-Stevenson, and David Beckingsale. “TeaLeaf: A Mini-Application to Enable Design-Space Explorations for Iterative Sparse Linear Solvers.” In Workshop on Representative Applications Help in Conjuction with IEEE International Conference on Cluster Computing (WRAp). IEEE, 2017. https://doi.org/10.1109/CLUSTER.2017.105.
    doi more details
  26. Deakin, Tom, Wayne Gaudin, and Simon McIntosh-Smith. “On the Mitigation of Cache Hostile Memory Access Patterns on Many-Core CPU Architectures.” In Intel Xeon Phi Users Group Workshop Held in Conjuction with International Supercomputing Conference (IXPUG). Springer, Cham, 2017. https://doi.org/10.1007/978-3-319-67630-2_26.
    doi more details
  27. Deakin, Tom, James Price, Matt Martineau, and Simon McIntosh-Smith. “GPU-STREAM v2.0: Benchmarking the Achievable Memory Bandwidth on Many-Core Processors across Diverse Parallel Programming Models.” In Performance Portable Programming Models for Accelerators Workshop Held in Conjuction with International Supercomputing Conference (P3MA). Springer, Cham, 2016. https://doi.org/10.1007/978-3-319-46079-6_34.
    doi more details
  28. Deakin, Tom, Simon McIntosh-Smith, and Wayne Gaudin. “Many-Core Acceleration of a Discrete Ordinates Transport Mini-App at Extreme Scale.” In International Supercomputing Conference (ISC). Springer, Cham, 2016. https://doi.org/10.1007/978-3-319-41321-1_22.
    doi more details
  29. ———. “Expressing Parallelism on Many-Core for Deterministic Discrete Ordinates Transport.” In Workshop on Representative Applications Help in Conjuction with IEEE International Conference on Cluster Computing (WRAp). IEEE, 2015. https://doi.org/10.1109/CLUSTER.2015.127.
    doi more details

Journals

  1. Pennycook, S. J., J. D. Sewall, D. W. Jacobsen, Tom Deakin, and S. McIntosh-Smith. “Navigating Performance, Portability and Productivity.” Computing in Science and Engineering, 2021. https://doi.org/10.1109/MCSE.2021.3097276.
    Special issue on Performance Portability for Advanced Architectures, invited article
    doi more details
  2. Deakin, Tom, Simon McIntosh-Smith, Justin Lovegrove, Richard Smedley-Stevenson, and Andrew Hagues. “Reviewing the Computational Performance of Deterministic SN Transport Sweeps on Many-Core Architectures.” Journal of Computational and Theoretical Transport 49, no. 3 (2020): 121–43. https://doi.org/10.1080/23324309.2020.1775096.
    doi more details
  3. McIntosh-Smith, Simon, James Price, Tom Deakin, and Andrei Poenaru. “A Performance Analysis of the First Generation of HPC-Optimised Arm Processors.” Concurrency and Computation: Practice and Experience, 2019. https://doi.org/10.1002/cpe.5110.
    special issue
    doi more details
  4. McIntosh-Smith, Simon, James Price, Andrei Poenaru, and Tom Deakin. “Benchmarking the First Generation of Production Quality Arm-Based Supercomputers.” Concurrency and Computation: Practice and Experience, November 2019. https://doi.org/10.1002/cpe.5569.
    special issue
    doi more details
  5. Deakin, Tom, James Price, Matt Martineau, and Simon McIntosh-Smith. “Evaluating Attainable Memory Bandwidth of Parallel Programming Models via BabelStream.” International Journal of Computational Science and Engineering 17, no. 3 (2018): 247–62. https://doi.org/10.1504/IJCSE.2018.095847.
    Special Issue on Novel Strategies for Programming Accelerators
    doi pdf more details
  6. Deakin, Tom, Simon McIntosh-Smith, Matt Martineau, and Wayne Gaudin. “An Improved Parallelism Scheme for Deterministic Discrete Ordinates Transport.” International Journal of High Performance Computing Applications 32, no. 4 (2018): 555–69. https://doi.org/10.1177/1094342016668978.
    special issue
    doi more details

Peer-reviewed presentations

  1. Deakin, Tom, Simon McIntosh-Smith, Justin Lovegrove, Richard Smedley-Stevenson, and Andrew Hagues. “Reviewing the Computational Performance of Deterministic SN Transport Sweeps on Many-Core Architectures,” 2019.
    Peer-reviewed abstract for presentation at International Conference on Transport Theory (ICTT-26)
    pdf more details
  2. Raman, Karthik, Tom Deakin, James Price, and Simon McIntosh-Smith. “Improving Achieved Memory Bandwidth from C++ Codes on Intel Xeon Phi Processor (Knights Landing),” 2017. https://research-information.bris.ac.uk/en/publications/improving-achieved-memory-bandwidth-from-c-codes-on-intel-xeon-ph.
    Presentation at Intel Xeon Phi Users Group (IXPUG) Spring Meeting
    pdf url more details
  3. Deakin, Tom, John Pennycook, Andrew Mallinson, Wayne Gaudin, and Simon McIntosh-Smith. “The MEGA-STREAM Benchmark on Intel Xeon Phi Processors (Knights Landing),” 2017.
    Presentation at Intel Xeon Phi Users Group (IXPUG) Spring Meeting
    more details

Posters

  1. Deakin, Tom, James Price, and Simon McIntosh-Smith. “Portable Methods for Measuring Cache Hierarchy Performance.” Poster at Supercomputing, The International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2017. https://sc17.supercomputing.org/SC17%20Archive/tech_poster/tech_poster_pages/post155.html.
    pdf url more details
  2. Deakin, Tom, James Price, Matt Martineau, and Simon McIntosh-Smith. “GPU-STREAM: Now in 2D!” Poster at Supercomputing, The International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2016. https://research-information.bris.ac.uk/en/publications/gpu-stream-now-in-2d.
    pdf url more details
  3. Deakin, Tom, and Simon McIntosh-Smith. “GPU-STREAM: Benchmarking the Achievable Memory Bandwidth of Graphics Processing Units.” Poster at Supercomputing, The International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2015. http://sc15.supercomputing.org/sites/all/themes/SC15images/tech_poster/tech_poster_pages/post150.html.
    pdf url more details

Pre-print

  1. Lin, Wei-Chen, Simon McIntosh-Smith, and Tom Deakin. “Preliminary Report: Initial Evaluation of StdPar Implementations on AMD GPUs for HPC,” 2024. https://doi.org/10.48550/arXiv.2401.02680.
    doi url more details

Thesis

  1. Deakin, Tom. “Leveraging Many-Core Technology for Deterministic Neutral Particle Transport at Extreme Scale.” PhD thesis, University of Bristol, 2018. https://hdl.handle.net/1983/0d202285-71d3-4073-acb6-eec437137c5d.
    url more details

standards

  1. The Khronos SYCL Working Group. SYCL 2020 Specification. Khronos, 2020. https://khronos.org/sycl.
    url more details