Search Results for author: Xiangyi Zhang

Found 2 papers, 2 papers with code

Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs

1 code implementation26 Mar 2024 Kai Yuan, Christoph Bauinger, Xiangyi Zhang, Pascal Baehr, Matthias Kirchhart, Darius Dabert, Adrien Tousnakhoff, Pierre Boudier, Michael Paulitsch

We compare our approach to a similar CUDA implementation for MLPs and show that our implementation on the Intel Data Center GPU outperforms the CUDA implementation on Nvidia's H100 GPU by a factor up to 2. 84 in inference and 1. 75 in training.

Image Compression Physics-informed machine learning

Cannot find the paper you are looking for? You can Submit a new open access paper.