Search Results for author: Hannes Fassold

Found 12 papers, 1 papers with code

A survey of manifold learning and its applications for multimedia

no code implementations8 Sep 2023 Hannes Fassold

Manifold learning is an emerging research domain of machine learning.

Do the Frankenstein, or how to achieve better out-of-distribution performance with manifold mixing model soup

no code implementations28 Aug 2023 Hannes Fassold

The standard recipe applied in transfer learning is to finetune a pretrained model on the task-specific dataset with different hyperparameter settings and pick the model with the highest accuracy on the validation dataset.

Image Classification Transfer Learning

FastHebb: Scaling Hebbian Training of Deep Neural Networks to ImageNet Level

no code implementations7 Jul 2022 Gabriele Lagani, Claudio Gennaro, Hannes Fassold, Giuseppe Amato

Learning algorithms for Deep Neural Networks are typically based on supervised end-to-end Stochastic Gradient Descent (SGD) training with error backpropagation (backprop).

A qualitative investigation of optical flow algorithms for video denoising

no code implementations19 Apr 2022 Hannes Fassold

A good optical flow estimation is crucial in many video analysis and restoration algorithms employed in application fields like media industry, industrial inspection and automotive.

Denoising Optical Flow Estimation +1

AdaFamily: A family of Adam-like adaptive gradient methods

no code implementations3 Mar 2022 Hannes Fassold

We propose AdaFamily, a novel method for training deep neural networks.

Image Classification

Some like it tough: Improving model generalization via progressively increasing the training difficulty

1 code implementation25 Oct 2021 Hannes Fassold

In this work, we propose to progressively increase the training difficulty during learning a neural network model via a novel strategy which we call mini-batch trimming.

Image Classification

Detecting speaking persons in video

no code implementations25 Oct 2021 Hannes Fassold

We present a novel method for detecting speaking persons in video, by extracting facial landmarks with a neural network and analysing these landmarks statistically over time

Hyper360 -- a Next Generation Toolset for Immersive Media

no code implementations1 Aug 2021 Hannes Fassold, Antonis Karakottas, Dorothea Tsatsou, Dimitrios Zarpalas, Barnabas Takacs, Christian Fuhrhop, Angelo Manfredi, Nicolas Patz, Simona Tonoli, Iana Dulskaia

Spherical 360{\deg} video is a novel media format, rapidly becoming adopted in media production and consumption of immersive media.

Automatic cinematography for 360 video

no code implementations2 Sep 2020 Hannes Fassold

We describe our method for automatic generation of a visually interesting camera path (automatic cinematography)from a 360 video.

OmniTrack: Real-time detection and tracking of objects, text and logos in video

no code implementations14 Oct 2019 Hannes Fassold, Ridouane Ghermi

The automatic detection and tracking of general objects (like persons, animals or cars), text and logos in a video is crucial for many video understanding tasks, and usually real-time processing as required.

object-detection Object Detection +2

Adapting Computer Vision Algorithms for Omnidirectional Video

no code implementations22 Jul 2019 Hannes Fassold

Omnidirectional (360{\deg}) video has got quite popular because it provides a highly immersive viewing experience.

Cannot find the paper you are looking for? You can Submit a new open access paper.