Search Results for author: Hannes Fassold

Found 12 papers, 1 papers with code

A survey of manifold learning and its applications for multimedia

no code implementations • 8 Sep 2023 • Hannes Fassold

Manifold learning is an emerging research domain of machine learning.

Paper
Add Code

Do the Frankenstein, or how to achieve better out-of-distribution performance with manifold mixing model soup

no code implementations • 28 Aug 2023 • Hannes Fassold

The standard recipe applied in transfer learning is to finetune a pretrained model on the task-specific dataset with different hyperparameter settings and pick the model with the highest accuracy on the validation dataset.

Image Classification Transfer Learning

Paper
Add Code

A real-time algorithm for human action recognition in RGB and thermal video

no code implementations • 4 Apr 2023 • Hannes Fassold, Karlheinz Gutjahr, Anna Weber, Roland Perko

Monitoring the movement and actions of humans in video in real-time is an important task.

Action Recognition object-detection +4

Paper
Add Code

FastHebb: Scaling Hebbian Training of Deep Neural Networks to ImageNet Level

no code implementations • 7 Jul 2022 • Gabriele Lagani, Claudio Gennaro, Hannes Fassold, Giuseppe Amato

Learning algorithms for Deep Neural Networks are typically based on supervised end-to-end Stochastic Gradient Descent (SGD) training with error backpropagation (backprop).

Paper
Add Code

A qualitative investigation of optical flow algorithms for video denoising

no code implementations • 19 Apr 2022 • Hannes Fassold

A good optical flow estimation is crucial in many video analysis and restoration algorithms employed in application fields like media industry, industrial inspection and automotive.

Denoising Optical Flow Estimation +1

Paper
Add Code

AdaFamily: A family of Adam-like adaptive gradient methods

no code implementations • 3 Mar 2022 • Hannes Fassold

We propose AdaFamily, a novel method for training deep neural networks.

Image Classification

Paper
Add Code

Some like it tough: Improving model generalization via progressively increasing the training difficulty

1 code implementation • 25 Oct 2021 • Hannes Fassold

In this work, we propose to progressively increase the training difficulty during learning a neural network model via a novel strategy which we call mini-batch trimming.

Image Classification

Paper
Code

Detecting speaking persons in video

no code implementations • 25 Oct 2021 • Hannes Fassold

We present a novel method for detecting speaking persons in video, by extracting facial landmarks with a neural network and analysing these landmarks statistically over time

Paper
Add Code

Hyper360 -- a Next Generation Toolset for Immersive Media

no code implementations • 1 Aug 2021 • Hannes Fassold, Antonis Karakottas, Dorothea Tsatsou, Dimitrios Zarpalas, Barnabas Takacs, Christian Fuhrhop, Angelo Manfredi, Nicolas Patz, Simona Tonoli, Iana Dulskaia

Spherical 360{\deg} video is a novel media format, rapidly becoming adopted in media production and consumption of immersive media.

Paper
Add Code

Automatic cinematography for 360 video

no code implementations • 2 Sep 2020 • Hannes Fassold

We describe our method for automatic generation of a visually interesting camera path (automatic cinematography)from a 360 video.

Paper
Add Code

OmniTrack: Real-time detection and tracking of objects, text and logos in video

no code implementations • 14 Oct 2019 • Hannes Fassold, Ridouane Ghermi

The automatic detection and tracking of general objects (like persons, animals or cars), text and logos in a video is crucial for many video understanding tasks, and usually real-time processing as required.

object-detection Object Detection +2

Paper
Add Code

Adapting Computer Vision Algorithms for Omnidirectional Video

no code implementations • 22 Jul 2019 • Hannes Fassold

Omnidirectional (360{\deg}) video has got quite popular because it provides a highly immersive viewing experience.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.