This paper advocates for an intertwined design of the dense linear algeb...
The remarkable positive impact of Deep Neural Networks on many Artificia...
The evolution of High-Performance Computing (HPC) platforms enables the
...
We evolve PyDTNN, a framework for distributed parallel training of Deep
...
This work is based on the seminar titled “Resiliency in Numerical Algori...
Krylov methods provide a fast and highly parallel numerical tool for the...
In this paper, we present Ginkgo, a modern C++ math library for scientif...
The Preconditioned Conjugate Gradient method is often employed for the
s...
The considerable impact of Convolutional Neural Networks on many Artific...
Adaptive workloads can change on–the–fly the configuration of their jobs...
We address the parallelization of the LU factorization of hierarchical
m...
We investigate a parallelization strategy for dense matrix factorization...
We address the reduction to compact band forms, via unitary similarity
t...
We propose two novel techniques for overcoming load-imbalance encountere...
Dense linear algebra libraries, such as BLAS and LAPACK, provide a relev...