Search Results for author: Lukas Cavigelli

Found 37 papers, 15 papers with code

W Convolutional Network Accelerator

no code implementations • 14 Dec 2015 • Lukas Cavigelli, Luca Benini

An ever increasing number of computer vision and image/video processing challenges are being approached using deep convolutional neural networks, obtaining state-of-the-art results in object recognition and detection, semantic segmentation, action recognition, optical flow and superresolution.

Action Recognition Object Recognition +3

Paper
Add Code

YodaNN: An Architecture for Ultra-Low Power Binary-Weight CNN Acceleration

no code implementations • 17 Jun 2016 • Renzo Andri, Lukas Cavigelli, Davide Rossi, Luca Benini

Convolutional neural networks (CNNs) have revolutionized the world of computer vision over the last few years, pushing image classification beyond human accuracy.

General Classification Image Classification

Paper
Add Code

Deep Structured Features for Semantic Segmentation

no code implementations • 26 Sep 2016 • Michael Tschannen, Lukas Cavigelli, Fabian Mentzer, Thomas Wiatowski, Luca Benini

We propose a highly structured neural network architecture for semantic segmentation with an extremely small model size, suitable for low-power embedded and mobile platforms.

General Classification Segmentation +1

Paper
Add Code

Computationally Efficient Target Classification in Multispectral Image Data with Deep Neural Networks

no code implementations • 9 Nov 2016 • Lukas Cavigelli, Dominic Bernath, Michele Magno, Luca Benini

The required communication links and archiving of the video data are still expensive and this setup excludes preemptive actions to respond to imminent threats.

General Classification Scene Labeling

Paper
Add Code

CAS-CNN: A Deep Convolutional Neural Network for Image Compression Artifact Suppression

1 code implementation • 22 Nov 2016 • Lukas Cavigelli, Pascal Hager, Luca Benini

Lossy image compression algorithms are pervasively used to reduce the size of images transmitted over the web and recorded on data storage media.

Image Compression

Paper
Code

Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations

no code implementations • NeurIPS 2017 • Eirikur Agustsson, Fabian Mentzer, Michael Tschannen, Lukas Cavigelli, Radu Timofte, Luca Benini, Luc van Gool

We present a new approach to learn compressible representations in deep architectures with an end-to-end training strategy.

Image Compression Neural Network Compression +1

Paper
Add Code

CBinfer: Change-Based Inference for Convolutional Neural Networks on Video Data

1 code implementation • 14 Apr 2017 • Lukas Cavigelli, Philippe Degen, Luca Benini

Extracting per-frame features using convolutional neural networks for real-time processing of video data is currently mainly performed on powerful GPU-accelerated workstations and compute clusters.

Paper
Code

Efficient Convolutional Neural Network For Audio Event Detection

no code implementations • 28 Sep 2017 • Matthias Meyer, Lukas Cavigelli, Lothar Thiele

Wireless distributed systems as used in sensor networks, Internet-of-Things and cyber-physical systems, impose high requirements on resource efficiency.

Event Detection General Classification

Paper
Add Code

Chipmunk: A Systolically Scalable 0.9 mm${}^2$, 3.08 Gop/s/mW @ 1.2 mW Accelerator for Near-Sensor Recurrent Neural Network Inference

no code implementations • 15 Nov 2017 • Francesco Conti, Lukas Cavigelli, Gianna Paulin, Igor Susmelj, Luca Benini

Recurrent neural networks (RNNs) are state-of-the-art in voice awareness/understanding and speech recognition.

speech-recognition Speech Recognition

Paper
Add Code

Design Automation for Binarized Neural Networks: A Quantum Leap Opportunity?

no code implementations • 21 Nov 2017 • Manuele Rusci, Lukas Cavigelli, Luca Benini

Design automation in general, and in particular logic synthesis, can play a key role in enabling the design of application-specific Binarized Neural Networks (BNN).

Paper
Add Code

Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine

no code implementations • 5 Mar 2018 • Renzo Andri, Lukas Cavigelli, Davide Rossi, Luca Benini

Deep neural networks have achieved impressive results in computer vision and machine learning.

Quantization

Paper
Add Code

XNORBIN: A 95 TOp/s/W Hardware Accelerator for Binary Convolutional Neural Networks

no code implementations • 5 Mar 2018 • Andrawes Al Bahou, Geethan Karunaratne, Renzo Andri, Lukas Cavigelli, Luca Benini

Deploying state-of-the-art CNNs requires power-hungry processors and off-chip memory.

Quantization

Paper
Add Code

Fast and Accurate Multiclass Inference for MI-BCIs Using Large Multiscale Temporal and Spectral Features

2 code implementations • 18 Jun 2018 • Michael Hersche, Tino Rellstab, Pasquale Davide Schiavone, Lukas Cavigelli, Luca Benini, Abbas Rahimi

Accurate, fast, and reliable multiclass classification of electroencephalography (EEG) signals is a challenging task towards the development of motor imagery brain-computer interface (MI-BCI) systems.

Classification EEG +1

Paper
Code

CBinfer: Exploiting Frame-to-Frame Locality for Faster Convolutional Network Inference on Video Streams

2 code implementations • 15 Aug 2018 • Lukas Cavigelli, Luca Benini

The last few years have brought advances in computer vision at an amazing pace, grounded on new findings in deep neural network construction and training as well as the availability of large labeled datasets.

object-detection Object Detection +1

Paper
Code

Extended Bit-Plane Compression for Convolutional Neural Network Accelerators

1 code implementation • 1 Oct 2018 • Lukas Cavigelli, Luca Benini

After the tremendous success of convolutional neural networks in image classification, object detection, speech recognition, etc., there is now rising demand for deployment of these compute-intensive ML models on tightly power constrained embedded and mobile systems at low cost as well as for pushing the throughput in data centers.

Image Classification object-detection +3

Paper
Code

Additive Noise Annealing and Approximation Properties of Quantized Neural Networks

1 code implementation • 24 May 2019 • Matteo Spallanzani, Lukas Cavigelli, Gian Paolo Leonardi, Marko Bertogna, Luca Benini

We present a theoretical and experimental investigation of the quantization problem for artificial neural networks.

Image Classification Quantization

Paper
Code

EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerators

2 code implementations • 30 Aug 2019 • Lukas Cavigelli, Georg Rutishauser, Luca Benini

In the wake of the success of convolutional neural networks in image classification, object recognition, speech recognition, etc., the demand for deploying these compute-intensive ML models on embedded and mobile systems with tight power and energy constraints at low cost, as well as for boosting throughput in data centers, is growing rapidly.

Image Classification Object Recognition +2

Paper
Code

FANN-on-MCU: An Open-Source Toolkit for Energy-Efficient Neural Network Inference at the Edge of the Internet of Things

1 code implementation • 8 Nov 2019 • Xiaying Wang, Michele Magno, Lukas Cavigelli, Luca Benini

The growing number of low-power smart devices in the Internet of Things is coupled with the concept of "Edge Computing", that is moving some of the intelligence, especially machine learning, towards the edge of the network.

BIG-bench Machine Learning Edge-computing +1

Paper
Code

HR-SAR-Net: A Deep Neural Network for Urban Scene Segmentation from High-Resolution SAR Data

no code implementations • 10 Dec 2019 • Xiaying Wang, Lukas Cavigelli, Manuel Eggimann, Michele Magno, Luca Benini

Synthetic aperture radar (SAR) data is becoming increasingly available to a wide range of users through commercial service providers with resolutions reaching 0. 5m/px.

Scene Segmentation Segmentation

Paper
Add Code

RPR: Random Partition Relaxation for Training; Binary and Ternary Weight Neural Networks

no code implementations • 4 Jan 2020 • Lukas Cavigelli, Luca Benini

We present Random Partition Relaxation (RPR), a method for strong quantization of neural networks weight to binary (+1/-1) and ternary (+1/0/-1) values.

Quantization

Paper
Add Code

InfiniWolf: Energy Efficient Smart Bracelet for Edge Computing with Dual Source Energy Harvesting

no code implementations • 28 Feb 2020 • Michele Magno, Xiaying Wang, Manuel Eggimann, Lukas Cavigelli, Luca Benini

This work presents InfiniWolf, a novel multi-sensor smartwatch that can achieve self-sustainability exploiting thermal and solar energy harvesting, performing computationally high demanding tasks.

Edge-computing

Paper
Add Code

Q-EEGNet: an Energy-Efficient 8-bit Quantized Parallel EEGNet Implementation for Edge Motor-Imagery Brain--Machine Interfaces

1 code implementation • 24 Apr 2020 • Tibor Schneider, Xiaying Wang, Michael Hersche, Lukas Cavigelli, Luca Benini

We quantize weights and activations to 8-bit fixed-point with a negligible accuracy loss of 0. 4% on 4-class MI, and present an energy-efficient hardware-aware implementation on the Mr. Wolf parallel ultra-low power (PULP) System-on-Chip (SoC) by utilizing its custom RISC-V ISA extensions and 8-core compute cluster.

EEG Motor Imagery

Paper
Code

ChewBaccaNN: A Flexible 223 TOPS/W BNN Accelerator

no code implementations • 12 May 2020 • Renzo Andri, Geethan Karunaratne, Lukas Cavigelli, Luca Benini

Furthermore, it can perform inference on a binarized ResNet-18 trained with 8-bases Group-Net to achieve a 67. 5% Top-1 accuracy with only 3. 0 mJ/frame -- at an accuracy drop of merely 1. 8% from the full-precision ResNet-18.

Paper
Add Code

EEG-TCNet: An Accurate Temporal Convolutional Network for Embedded Motor-Imagery Brain-Machine Interfaces

1 code implementation • 31 May 2020 • Thorir Mar Ingolfsson, Michael Hersche, Xiaying Wang, Nobuaki Kobayashi, Lukas Cavigelli, Luca Benini

Experimental results on the BCI Competition IV-2a dataset show that EEG-TCNet achieves 77. 35% classification accuracy in 4-class MI.

EEG General Classification +2

Paper
Code

CUTIE: Beyond PetaOp/s/W Ternary DNN Inference Acceleration with Better-than-Binary Energy Efficiency

no code implementations • 3 Nov 2020 • Moritz Scherer, Georg Rutishauser, Lukas Cavigelli, Luca Benini

We present a 3. 1 POp/s/W fully digital hardware accelerator for ternary neural networks.

Hardware Architecture

Paper
Add Code

Sound Event Detection with Binary Neural Networks on Tightly Power-Constrained IoT Devices

no code implementations • 12 Jan 2021 • Gianmarco Cerutti, Renzo Andri, Lukas Cavigelli, Michele Magno, Elisabetta Farella, Luca Benini

This BNN reaches a 77. 9% accuracy, just 7% lower than the full-precision version, with 58 kB (7. 2 times less) for the weights and 262 kB (2. 4 times less) memory in total.

Event Detection Object Recognition +2

Paper
Add Code

Mixed-Precision Quantization and Parallel Implementation of Multispectral Riemannian Classification for Brain--Machine Interfaces

1 code implementation • 22 Feb 2021 • Xiaying Wang, Tibor Schneider, Michael Hersche, Lukas Cavigelli, Luca Benini

With Motor-Imagery (MI) Brain--Machine Interfaces (BMIs) we may control machines by merely thinking of performing a motor action.

General Classification Motor Imagery +1

Paper
Code

ECG-TCN: Wearable Cardiac Arrhythmia Detection with a Temporal Convolutional Network

1 code implementation • 25 Mar 2021 • Thorir Mar Ingolfsson, Xiaying Wang, Michael Hersche, Alessio Burrello, Lukas Cavigelli, Luca Benini

With 9. 91 GMAC/s/W, it is 23. 0 times more energy-efficient and 46. 85 times faster than an implementation on the ARM Cortex M4F (0. 43 GMAC/s/W).

Arrhythmia Detection

Paper
Code

Reinforcement Learning for Scalable Logic Optimization with Graph Neural Networks

no code implementations • 4 May 2021 • Xavier Timoneda, Lukas Cavigelli

Logic optimization is an NP-hard problem commonly approached through hand-engineered heuristics.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Sub-100uW Multispectral Riemannian Classification for EEG-based Brain--Machine Interfaces

no code implementations • 18 Dec 2021 • Xiaying Wang, Lukas Cavigelli, Tibor Schneider, Luca Benini

Motor imagery brain--machine interfaces enable us to control machines by merely thinking of performing a motor action.

Classification EEG +1

Paper
Add Code

Vau da muntanialas: Energy-efficient multi-die scalable acceleration of RNN inference

no code implementations • 14 Feb 2022 • Gianna Paulin, Francesco Conti, Lukas Cavigelli, Luca Benini

For quantifying the overall system power, including I/O power, we built Vau da Muntanialas, to the best of our knowledge, the first demonstration of a systolic multi-chip-on-PCB array of RNN accelerator.

Quantization speech-recognition +2

Paper
Add Code

Towards On-device Domain Adaptation for Noise-Robust Keyword Spotting

1 code implementation • IEEE International Conference on Artificial Intelligence Circuits and Systems (AICAS) 2022 • Cristian Cioflan, Lukas Cavigelli, Manuele Rusci, Miguel de Prado, Luca Benini

The accuracy of a keyword spotting model deployed on embedded devices often degrades when the system is exposed to real environments with significant noise.

Edge-computing Keyword Spotting

Paper
Code

Going Further With Winograd Convolutions: Tap-Wise Quantization for Efficient Inference on 4x4 Tile

no code implementations • 26 Sep 2022 • Renzo Andri, Beatrice Bussolino, Antonio Cipolletta, Lukas Cavigelli, Zhe Wang

The Winograd-enhanced DSA achieves up to 1. 85x gain in energy efficiency and up to 1. 83x end-to-end speed-up for state-of-the-art segmentation and detection networks.

Quantization

Paper
Add Code

RL-based Stateful Neural Adaptive Sampling and Denoising for Real-Time Path Tracing

1 code implementation • NeurIPS 2023 • Antoine Scardigli, Lukas Cavigelli, Lorenz K. Müller

Monte-Carlo path tracing is a powerful technique for realistic image synthesis but suffers from high levels of noise at low sample counts, limiting its use in real-time applications.

Denoising Image Generation

Paper
Code

Stella Nera: Achieving 161 TOp/s/W with Multiplier-free DNN Acceleration based on Approximate Matrix Multiplication

1 code implementation • 16 Nov 2023 • Jannis Schönleber, Lukas Cavigelli, Renzo Andri, Matteo Perotti, Luca Benini

From classical HPC to deep learning, MatMul is at the heart of today's computing.

Quantization

200

Paper
Code

Boosting keyword spotting through on-device learnable user speech characteristics

no code implementations • 12 Mar 2024 • Cristian Cioflan, Lukas Cavigelli, Luca Benini

Keyword spotting systems for always-on TinyML-constrained applications require on-site tuning to boost the accuracy of offline trained classifiers when deployed in unseen inference conditions.

Few-Shot Learning Keyword Spotting

Paper
Add Code

On-Device Domain Learning for Keyword Spotting on Low-Power Extreme Edge Embedded Systems

no code implementations • 12 Mar 2024 • Cristian Cioflan, Lukas Cavigelli, Manuele Rusci, Miguel de Prado, Luca Benini

Keyword spotting accuracy degrades when neural networks are exposed to noisy environments.

Domain Adaptation Keyword Spotting

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.