In this paper, we address some of the key limitations to realizing a gen...
CUDA is one of the most popular choices for GPU programming, but it can ...
This work evaluates the benefits of using a "smart" network interface ca...
As systems and applications grow more complex, detailed simulation takes...
Graph processing is typically considered to be a memory-bound rather tha...
Recent characterizations of data movement performance have evaluated
opt...
The Emu Chick is a prototype system designed around the concept of migra...
Memories that exploit three-dimensional (3D)-stacking technology, which
...