Search Results for author: Andrii Skliar

Found 6 papers, 1 paper with code

Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding

no code implementations • 26 Feb 2024 • Benjamin Bergner, Andrii Skliar, Amelie Royer, Tijmen Blankevoort, Yuki Asano, Babak Ehteshami Bejnordi

We investigate the combination of encoder-decoder LLMs with both encoder-decoder and decoder-only SLMs from different model families and only require fine-tuning of the SLM.

Instruction Following • Language Modelling +1
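
As a rough sketch of the scheme the abstract describes, the large model can encode the prompt once while a small, fine-tuned decoder performs every generation step. In the PyTorch sketch below, llm_encoder, slm_decoder, and the encoder_hidden argument are assumed interfaces for illustration, not the authors' API:

```python
import torch
import torch.nn as nn

class LLMToSLM(nn.Module):
    # Hypothetical wrapper: the LLM encodes the prompt exactly once; the SLM,
    # the only fine-tuned component, runs every autoregressive decoding step.
    def __init__(self, llm_encoder, slm_decoder, llm_dim, slm_dim):
        super().__init__()
        self.llm_encoder = llm_encoder            # large model, kept frozen
        for p in self.llm_encoder.parameters():
            p.requires_grad = False
        self.slm_decoder = slm_decoder            # small model, fine-tuned
        self.proj = nn.Linear(llm_dim, slm_dim)   # bridge the hidden sizes

    def generate(self, prompt_ids, max_new_tokens=32):
        with torch.no_grad():
            cond = self.llm_encoder(prompt_ids)   # single LLM pass over the prompt
        cond = self.proj(cond)
        out = prompt_ids
        for _ in range(max_new_tokens):           # each step runs only the SLM
            logits = self.slm_decoder(out, encoder_hidden=cond)
            next_tok = logits[:, -1].argmax(dim=-1, keepdim=True)
            out = torch.cat([out, next_tok], dim=-1)
        return out
```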

Hyperbolic Convolutional Neural Networks

no code implementations • 29 Aug 2023 • Andrii Skliar, Maurice Weiler

However, no work has yet proposed a general approach to applying Hyperbolic Convolutional Neural Networks to structured data, even though structured data is the most common kind encountered in practice.

Explainable Models • Image Classification +1
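
One common construction in the hyperbolic-networks literature, and one possible reading of the title, lifts points from the Poincaré ball to the tangent space at the origin, applies an ordinary Euclidean convolution there, and maps back. A minimal PyTorch sketch, assuming curvature -1 and treating each spatial position's channel vector as a point on the ball; this illustrates the general recipe, not necessarily the paper's approach:

```python
import torch
import torch.nn as nn

def log_map_origin(x, eps=1e-6):
    # Poincaré-ball log map at the origin: log_0(x) = artanh(||x||) * x / ||x||
    norm = x.norm(dim=1, keepdim=True).clamp(min=eps, max=1 - eps)
    return torch.atanh(norm) * x / norm

def exp_map_origin(v, eps=1e-6):
    # Poincaré-ball exp map at the origin: exp_0(v) = tanh(||v||) * v / ||v||
    norm = v.norm(dim=1, keepdim=True).clamp(min=eps)
    return torch.tanh(norm) * v / norm

class HyperbolicConv2d(nn.Module):
    # Hypothetical layer: lift to the tangent space, convolve, project back.
    def __init__(self, in_ch, out_ch, kernel_size, padding=0):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, out_ch, kernel_size, padding=padding)

    def forward(self, x):
        v = self.conv(log_map_origin(x))  # Euclidean convolution in tangent space
        return exp_map_origin(v)          # map the result back onto the ball
```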

Revisiting Single-gated Mixtures of Experts

no code implementations • 11 Apr 2023 • Amelie Royer, Ilia Karmanov, Andrii Skliar, Babak Ehteshami Bejnordi, Tijmen Blankevoort

Mixtures of Experts (MoE) are rising in popularity as a means to train extremely large-scale models while keeping computational cost reasonable at inference time.
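
To make that trade-off concrete, here is a generic top-1-routed MoE layer in PyTorch: each input activates a single expert, so inference cost stays near that of one expert regardless of how many exist. This is an illustration of the general idea, not the paper's exact single-gate design:

```python
import torch
import torch.nn as nn

class SingleGateMoE(nn.Module):
    # Illustrative MoE layer: one router picks a single expert per example.
    def __init__(self, dim, num_experts, hidden):
        super().__init__()
        self.router = nn.Linear(dim, num_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, hidden), nn.ReLU(), nn.Linear(hidden, dim))
            for _ in range(num_experts)
        )

    def forward(self, x):                       # x: (batch, dim)
        gate = self.router(x).softmax(dim=-1)   # routing probabilities
        idx = gate.argmax(dim=-1)               # top-1 expert per example
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            mask = idx == e
            if mask.any():                      # run each expert only on its inputs
                out[mask] = expert(x[mask]) * gate[mask, e].unsqueeze(-1)
        return out
```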

Simple and Efficient Architectures for Semantic Segmentation

1 code implementation • 16 Jun 2022 • Dushyant Mehta, Andrii Skliar, Haitam Ben Yahia, Shubhankar Borse, Fatih Porikli, Amirhossein Habibian, Tijmen Blankevoort

Though state-of-the-art architectures for semantic segmentation, such as HRNet, demonstrate impressive accuracy, the complexity arising from their salient design choices hinders a range of model-acceleration tools, and they further rely on operations that are inefficient on current hardware.

Image Classification • Segmentation +1
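
In the spirit of "simple and efficient", a minimal hardware-friendly baseline pairs a plain backbone with a lightweight 1x1 head and bilinear upsampling. The sketch below is an illustrative stand-in under that assumption, not the paper's architecture:

```python
import torch.nn as nn
import torch.nn.functional as F
from torchvision.models import resnet18

class SimpleSegNet(nn.Module):
    # Generic 'simple' segmentation network: plain ResNet trunk + 1x1 head.
    def __init__(self, num_classes):
        super().__init__()
        backbone = resnet18(weights=None)
        # Keep conv1 ... layer4, drop the pooling and classifier layers.
        self.trunk = nn.Sequential(*list(backbone.children())[:-2])
        self.head = nn.Conv2d(512, num_classes, kernel_size=1)

    def forward(self, x):
        h, w = x.shape[-2:]
        feats = self.trunk(x)        # stride-32 feature map, 512 channels
        logits = self.head(feats)    # per-pixel class scores
        return F.interpolate(logits, size=(h, w), mode="bilinear",
                             align_corners=False)
```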

Cyclical Pruning for Sparse Neural Networks

no code implementations • 2 Feb 2022 • Suraj Srinivas, Andrey Kuzmin, Markus Nagel, Mart van Baalen, Andrii Skliar, Tijmen Blankevoort

Current weight-pruning methods iteratively apply magnitude-based pruning to the model weights and re-train the resulting model to recover lost accuracy.
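
A minimal sketch of that magnitude-prune-then-retrain loop, with a cyclically resetting sparsity schedule in the spirit of the title; the exact schedule and the train_steps hook are assumptions for illustration, not the authors' code:

```python
import torch

def magnitude_prune_(w, sparsity):
    # Zero out the smallest-magnitude fraction `sparsity` of tensor w, in place.
    k = int(w.numel() * sparsity)
    if k == 0:
        return
    thresh = w.abs().flatten().kthvalue(k).values
    w.mul_((w.abs() > thresh).float())

def cyclical_sparsity(step, cycle_len, target):
    # Assumed schedule: sparsity ramps from 0 to `target` within each cycle,
    # then resets, so weights pruned early in one cycle can recover later.
    return target * ((step % cycle_len) / cycle_len)

# Outer loop sketch (model and train_steps are assumed to exist):
# for step in range(total_steps):
#     s = cyclical_sparsity(step, cycle_len=1000, target=0.9)
#     for w in model.parameters():
#         magnitude_prune_(w.data, s)
#     train_steps(model, n=1)  # re-train to recover lost accuracy
```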

Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces

no code implementations • ICCV 2021 • Bert Moons, Parham Noorzad, Andrii Skliar, Giovanni Mariani, Dushyant Mehta, Chris Lott, Tijmen Blankevoort

Second, a rapid evolutionary search finds a set of Pareto-optimal architectures for any scenario using the accuracy predictor and on-device measurements.

Knowledge Distillation • Model Compression +1
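
A generic sketch of such a search loop, assuming hypothetical predict_acc, measure_latency, and mutate callables standing in for the paper's accuracy predictor, on-device profiling, and mutation operator:

```python
import random

def dominates(a, b):
    # a dominates b if it is no worse on both objectives and strictly better on one.
    return (a["acc"] >= b["acc"] and a["lat"] <= b["lat"]
            and (a["acc"] > b["acc"] or a["lat"] < b["lat"]))

def pareto_front(pop):
    # Keep only candidates that no other candidate dominates.
    return [c for c in pop if not any(dominates(o, c) for o in pop)]

def evolutionary_search(init_archs, mutate, predict_acc, measure_latency,
                        generations=20):
    # Score each architecture cheaply via the accuracy predictor plus an
    # on-device latency measurement; only the Pareto-optimal set survives.
    def score(a):
        return {"arch": a, "acc": predict_acc(a), "lat": measure_latency(a)}
    pop = [score(a) for a in init_archs]
    for _ in range(generations):
        children = [mutate(random.choice(pop)["arch"]) for _ in range(len(pop))]
        pop = pareto_front(pop + [score(a) for a in children])
    return pop
```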
