Publications

Research papers and publications related to the Shamrock framework for astrophysical hydrodynamics

Numerical planet formation on exascale architectures

T. David--Cléris

PhD Thesis

Numerical simulations are essential to our understanding of star and planet formation. They imply processes being multi-physics, complex, multi-scale, out of equilibrium, and non linear. Recently, the computing power of supercomputer in- creased up to the exascale, namely a quintillion operations per seconds. In principle, this computing power makes it possible to resolve crucial questions about planet formation, thanks to simulations of unprecedented accuracy. To achieve this, it is necessary to develop code based on algorithms capable of taking advantage of this new computing power. The aim of this thesis is to develop Shamrock, the first astrophysical code with exascale multi-methods (particles or adaptive grids). The core of this work is the adaptation and optimization of a binary algorithm for finding randomly distributed neighbors, which is fully parallelizable on architectures using graphics cards. In its current version, Shamrock achieves a parallel efficiency of over 90% for a Sedov test performed with the Smoothed Particle Hydrodynamics (SPH) method on 1024 nodes, enabling the first simulations with 65 billion particles to be carried out in 7 seconds per time step.

September 2024 École normale supérieure de Lyon
View Thesis

The SHAMROCK code: I - smoothed particle hydrodynamics on GPUs

T. David--Cléris, G. Laibe, Y. Lapeyre

Journal Article

We present SHAMROCK, a performance portable framework developed in C++ 17 with the SYCL programming standard, tailored for numerical astrophysics on Exascale architectures. The core of SHAMROCK is an accelerated parallel tree with negligible construction time, whose efficiency is based on binary algebra. The smoothed particle hydrodynamics algorithm of the PHANTOM code is implemented in SHAMROCK. On-the-fly tree construction circumvents the necessity for extensive data communications. In tests displaying a uniform density with global time-stepping with tens of billions of particles, SHAMROCK completes a single time-step in a few seconds using over the thousand of GPUs of a supercomputer. This corresponds to processing billions of particles per second, with tens of millions of particles per GPU. The parallel efficiency across the entire cluster is larger than ~90 per cent .

May 2025 Monthly Notices of the Royal Astronomical Society
View Paper

Shamrock: Exascale hydrodynamics for astrophysics using SYCL.

T. David--Cléris

Conference Paper

We present Shamrock, a native SYCL framework for astrophysics, designed to implement various numerical methods for modelling hydrodynamic flows, in particular Smoothed Particle Hydrodynamics (SPH). At the core of Shamrock lies a fast radix tree building algorithm that allows the tree to be rebuilt at each timestep with minimal cost, eliminating the need for tree communications or updates. Additionally, a domain decomposition method is used on top of the radix tree, allowing for a nearly linear multi-GPU weak scalability, resulting in 92% weak scaling efficiency on 1024 Mi250x AMD graphical accelerators for large SPH simulations.

2025 ACM International Conference on Supercomputing
View Paper