1 code implementation • 30 Jun 2017 • Alexander Matthes, René Widera, Erik Zenker, Benjamin Worpitz, Axel Huebl, Michael Bussmann
On some of these we are able to reach almost 50\% of the peak floating point operation performance using the aforementioned means.
Distributed, Parallel, and Cluster Computing
1 code implementation • 9 Jun 2016 • Erik Zenker, René Widera, Axel Huebl, Guido Juckeland, Andreas Knüpfer, Wolfgang E. Nagel, Michael Bussmann
With the appearance of the heterogeneous platform OpenPower, many-core accelerator devices have been coupled with Power host processors for the first time.
Distributed, Parallel, and Cluster Computing
1 code implementation • 26 Feb 2016 • Erik Zenker, Benjamin Worpitz, René Widera, Axel Huebl, Guido Juckeland, Andreas Knüpfer, Wolfgang E. Nagel, Michael Bussmann
The model exploits parallelism and memory hierarchies on a node at all levels available in current hardware.
Distributed, Parallel, and Cluster Computing Mathematical Software