We detail the performance optimizations made in rocHPL, AMD's open-sourc...
We present hipBone, an open source performance-portable proxy applicatio...
In this paper we describe the research and development activities in the...
Efficient exploitation of exascale architectures requires rethinking of ...
In this paper we consider a level set reinitialization technique based o...
The development of NekRS, a GPU-oriented thermal-fluids simulation code ...
This paper is devoted to the development of highly efficient kernels
per...
We consider several methods for generating initial guesses when iterativ...
We present a GPU-accelerated version of a high-order discontinuous Galer...
We present a GPU-accelerated version of a high-order discontinuous Galer...
This paper is devoted to GPU kernel optimization and performance analysi...