Search Results for author: Thomas B. Preußer

Found 3 papers, 0 papers with code

Post-Training Quantization with Low-precision Minifloats and Integers on FPGAs

no code implementations21 Nov 2023 Shivam Aggarwal, Alessandro Pappalardo, Hans Jakob Damsgaard, Giuseppe Franco, Thomas B. Preußer, Michaela Blott, Tulika Mitra

However, the exploration of floating-point formats smaller than 8 bits and their comparison with integer quantization remains relatively limited.

Model Compression Quantization

MicroRec: Efficient Recommendation Inference by Hardware and Data Structure Solutions

no code implementations12 Oct 2020 Wenqi Jiang, Zhenhao He, Shuai Zhang, Thomas B. Preußer, Kai Zeng, Liang Feng, Jiansong Zhang, Tongxuan Liu, Yong Li, Jingren Zhou, Ce Zhang, Gustavo Alonso

MicroRec accelerates recommendation inference by (1) redesigning the data structures involved in the embeddings to reduce the number of lookups needed and (2) taking advantage of the availability of High-Bandwidth Memory (HBM) in FPGA accelerators to tackle the latency by enabling parallel lookups.

Recommendation Systems

Inference of Quantized Neural Networks on Heterogeneous All-Programmable Devices

no code implementations21 Jun 2018 Thomas B. Preußer, Giulio Gambardella, Nicholas Fraser, Michaela Blott

Neural networks have established as a generic and powerful means to approach challenging problems such as image classification, object detection or decision making.

Decision Making Image Classification +4

Cannot find the paper you are looking for? You can Submit a new open access paper.