no code implementations • 19 Jan 2024 • Jukka I. Ahonen, Nam Le, Honglei Zhang, Antti Hallapuro, Francesco Cricri, Hamed Rezazadegan Tavakoli, Miska M. Hannuksela, Esa Rahtu
To the best of our knowledge, this is the first research paper showing a hybrid video codec that outperforms VVC on multiple datasets and multiple machine vision tasks.
no code implementations • 19 Jan 2024 • Nam Le, Honglei Zhang, Francesco Cricri, Ramin G. Youvalari, Hamed Rezazadegan Tavakoli, Emre Aksu, Miska M. Hannuksela, Esa Rahtu
Image coding for machines (ICM) aims at reducing the bitrate required to represent an image while minimizing the drop in machine vision analysis accuracy.
no code implementations • 23 Aug 2021 • Nam Le, Honglei Zhang, Francesco Cricri, Ramin Ghaznavi-Youvalari, Esa Rahtu
Over recent years, deep learning-based computer vision systems have been applied to images at an ever-increasing pace, oftentimes representing the only type of consumption for those images.
no code implementations • 23 Aug 2021 • Nam Le, Honglei Zhang, Francesco Cricri, Ramin Ghaznavi-Youvalari, Hamed Rezazadegan Tavakoli, Esa Rahtu
One possible solution approach consists of adapting current human-targeted image and video coding standards to the use case of machine consumption.
no code implementations • 27 May 2019 • Nam Le
This paper presents a technique called evolving self-supervised neural networks - neural networks that can teach themselves, intrinsically motivated, without external supervision or reward.
no code implementations • 4 Apr 2019 • Nam Le
Moreover, the self-taught neural network is the product of the interplay between evolution and learning.
no code implementations • 6 Dec 2018 • Nam Le, Jean-Marc Odobez
Collecting labeled data to train deep neural networks is costly and even impractical for many tasks.
no code implementations • 10 Jul 2017 • Nam Le, Jean-Marc Odobez
Learning speaker turn embeddings has shown considerable improvement in situations where conventional speaker modeling approaches fail.