no code implementations • 7 Nov 2022 • Hongrui Shi, Valentin Radu, Po Yang
The heterogeneity of hardware and data is a well-known and studied problem in the community of Federated Learning (FL) as running under heterogeneous settings.
no code implementations • 5 Nov 2021 • Hongrui Shi, Valentin Radu
Our experiments show that only 1. 6% of the initially exchanged data can effectively transfer the characteristic of the client data to the global model in our FL approach, using split networks.
no code implementations • 20 Oct 2020 • Rik Mulder, Valentin Radu, Christophe Dubach
This process requires a lengthy profiling stage, iterating over all the available primitives for each layer configuration, to measure their execution time on the target platform.
no code implementations • 21 May 2020 • Yuan Wen, Andrew Anderson, Valentin Radu, Michael F. P. O'Boyle, David Gregg
We optimize the trade-off between execution time and memory consumption by: 1) attempting to minimize execution time across the whole network by selecting data layouts and primitive operations to implement each layer; and 2) allocating an appropriate workspace that reflects the upper bound of memory footprint per layer.
no code implementations • 20 Feb 2020 • Valentin Radu, Kuba Kaszyk, Yuan Wen, Jack Turner, Jose Cano, Elliot J. Crowley, Bjorn Franke, Amos Storkey, Michael O'Boyle
We evaluate higher level libraries, which analyze the input characteristics of a convolutional layer, based on which they produce optimized OpenCL (Arm Compute Library and TVM) and CUDA (cuDNN) code.
no code implementations • 28 Dec 2018 • Adrian Cosma, Ion Emilian Radoi, Valentin Radu
Video processing holds utility for many emerging applications and data labelling in the IoT space.
no code implementations • 24 Oct 2018 • Jack Turner, Elliot J. Crowley, Valentin Radu, José Cano, Amos Storkey, Michael O'Boyle
The task of accelerating large neural networks on general purpose hardware has, in recent years, prompted the use of channel pruning to reduce network size.
1 code implementation • 19 Sep 2018 • Jack Turner, José Cano, Valentin Radu, Elliot J. Crowley, Michael O'Boyle, Amos Storkey
Convolutional Neural Networks (CNNs) are extremely computationally demanding, presenting a large barrier to their deployment on resource-constrained devices.