Search Results for author: Stefanos Laskaridis

Found 17 papers, 1 papers with code

MELTing point: Mobile Evaluation of Language Transformers

no code implementations19 Mar 2024 Stefanos Laskaridis, Kleomenis Katevas, Lorenzo Minto, Hamed Haddadi

Transformers have revolutionized the machine learning landscape, gradually making their way into everyday tasks and equipping our computers with ``sparks of intelligence''.

Benchmarking Quantization

Maestro: Uncovering Low-Rank Structures via Trainable Decomposition

no code implementations28 Aug 2023 Samuel Horvath, Stefanos Laskaridis, Shashank Rajput, Hongyi Wang

We further apply our technique on DNNs and empirically illustrate that Maestro enables the extraction of lower footprint models that preserve model performance while allowing for graceful accuracy-latency tradeoff for the deployment to devices of different capabilities.

Low-rank compression Quantization

Federated Learning for Inference at Anytime and Anywhere

no code implementations8 Dec 2022 Zicheng Liu, Da Li, Javier Fernandez-Marques, Stefanos Laskaridis, Yan Gao, Łukasz Dudziak, Stan Z. Li, Shell Xu Hu, Timothy Hospedales

Federated learning has been predominantly concerned with collaborative training of deep networks from scratch, and especially the many challenges that arise, such as communication cost, robustness to heterogeneous data, and support for diverse device capabilities.

Federated Learning

The Future of Consumer Edge-AI Computing

no code implementations19 Oct 2022 Stefanos Laskaridis, Stylianos I. Venieris, Alexandros Kouris, Rui Li, Nicholas D. Lane

In the last decade, Deep Learning has rapidly infiltrated the consumer end, mainly thanks to hardware acceleration across devices.

Fluid Batching: Exit-Aware Preemptive Serving of Early-Exit Neural Networks on Edge NPUs

no code implementations27 Sep 2022 Alexandros Kouris, Stylianos I. Venieris, Stefanos Laskaridis, Nicholas D. Lane

With deep neural networks (DNNs) emerging as the backbone in a multitude of computer vision tasks, their adoption in real-world applications broadens continuously.

Autonomous Vehicles Scheduling

FedorAS: Federated Architecture Search under system heterogeneity

no code implementations22 Jun 2022 Lukasz Dudziak, Stefanos Laskaridis, Javier Fernandez-Marques

In this paper we explore the question of whether we can design architectures of different footprints in a cross-device federated setting, where the device landscape, availability and scale are very different.

Federated Learning Neural Architecture Search

Smart at what cost? Characterising Mobile Deep Neural Networks in the wild

no code implementations28 Sep 2021 Mario Almeida, Stefanos Laskaridis, Abhinav Mehrotra, Lukasz Dudziak, Ilias Leontiadis, Nicholas D. Lane

To this end, we analyse over 16k of the most popular apps in the Google Play Store to characterise their DNN usage and performance across devices of different capabilities, both across tiers and generations.

16k

Adaptive Inference through Early-Exit Networks: Design, Challenges and Directions

no code implementations9 Jun 2021 Stefanos Laskaridis, Alexandros Kouris, Nicholas D. Lane

DNNs are becoming less and less over-parametrised due to recent advances in efficient model design, through careful hand-crafted or NAS-based methods.

Position

Multi-Exit Semantic Segmentation Networks

no code implementations7 Jun 2021 Alexandros Kouris, Stylianos I. Venieris, Stefanos Laskaridis, Nicholas D. Lane

At the same time, the heterogeneous capabilities of the target platforms and the diverse constraints of different applications require the design and training of multiple target-specific segmentation models, leading to excessive maintenance costs.

Robot Navigation Segmentation +2

DynO: Dynamic Onloading of Deep Neural Networks from Cloud to Device

no code implementations20 Apr 2021 Mario Almeida, Stefanos Laskaridis, Stylianos I. Venieris, Ilias Leontiadis, Nicholas D. Lane

Recently, there has been an explosive growth of mobile and embedded applications using convolutional neural networks(CNNs).

It's always personal: Using Early Exits for Efficient On-Device CNN Personalisation

no code implementations2 Feb 2021 Ilias Leontiadis, Stefanos Laskaridis, Stylianos I. Venieris, Nicholas D. Lane

On-device machine learning is becoming a reality thanks to the availability of powerful hardware and model compression techniques.

Model Compression

SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud

no code implementations14 Aug 2020 Stefanos Laskaridis, Stylianos I. Venieris, Mario Almeida, Ilias Leontiadis, Nicholas D. Lane

Despite the soaring use of convolutional neural networks (CNNs) in mobile applications, uniformly sustaining high-performance inference on mobile has been elusive due to the excessive computational demands of modern CNNs and the increasing diversity of deployed devices.

Collaborative Inference

HAPI: Hardware-Aware Progressive Inference

no code implementations10 Aug 2020 Stefanos Laskaridis, Stylianos I. Venieris, Hyeji Kim, Nicholas D. Lane

Convolutional neural networks (CNNs) have recently become the state-of-the-art in a diversity of AI tasks.

EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices

no code implementations17 May 2019 Mario Almeida, Stefanos Laskaridis, Ilias Leontiadis, Stylianos I. Venieris, Nicholas D. Lane

In recent years, advances in deep learning have resulted in unprecedented leaps in diverse tasks spanning from speech and object recognition to context awareness and health monitoring.

Object Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.