Search Results for author: Vicky Kalogeiton

Found 22 papers, 13 papers with code

Analysis of Classifier-Free Guidance Weight Schedulers

no code implementations19 Apr 2024 Xi Wang, Nicolas Dufour, Nefeli Andreou, Marie-Paule Cani, Victoria Fernandez Abrevaya, David Picard, Vicky Kalogeiton

Classifier-Free Guidance (CFG) enhances the quality and condition adherence of text-to-image diffusion models.

FunnyNet-W: Multimodal Learning of Funny Moments in Videos in the Wild

1 code implementation8 Jan 2024 Zhi-Song Liu, Robin Courant, Vicky Kalogeiton

In this paper, we propose FunnyNet-W, a model that relies on cross- and self-attention for visual, audio and text data to predict funny moments in videos.

Language Modelling Large Language Model +1

Collaborating Foundation Models for Domain Generalized Semantic Segmentation

1 code implementation15 Dec 2023 Yasser Benigmim, Subhankar Roy, Slim Essid, Vicky Kalogeiton, Stéphane Lathuilière

Domain Generalized Semantic Segmentation (DGSS) deals with training a model on a labeled source domain with the aim of generalizing to unseen domains during inference.

Domain Generalization Segmentation +1

Learning the What and How of Annotation in Video Object Segmentation

no code implementations8 Nov 2023 Thanos Delatolas, Vicky Kalogeiton, Dim P. Papadopoulos

To reduce this annotation cost, in this paper, we propose EVA-VOS, a human-in-the-loop annotation framework for video object segmentation.

Segmentation Semantic Segmentation +3

Reward Function Design for Crowd Simulation via Reinforcement Learning

no code implementations22 Sep 2023 Ariel Kwiatkowski, Vicky Kalogeiton, Julien Pettré, Marie-Paule Cani

Crowd simulation is important for video-games design, since it enables to populate virtual worlds with autonomous avatars that navigate in a human-like manner.

Navigate reinforcement-learning

BluNF: Blueprint Neural Field

no code implementations7 Sep 2023 Robin Courant, Xi Wang, Marc Christie, Vicky Kalogeiton

BluNF provides a robust and user-friendly 2D blueprint, enabling intuitive scene editing.

Novel View Synthesis

One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models

1 code implementation31 Mar 2023 Yasser Benigmim, Subhankar Roy, Slim Essid, Vicky Kalogeiton, Stéphane Lathuilière

Departing from the common notion of transferring only the target ``texture'' information, we leverage text-to-image diffusion models (e. g., Stable Diffusion) to generate a synthetic target dataset with photo-realistic images that not only faithfully depict the style of the target domain, but are also characterized by novel scenes in diverse contexts.

Data Augmentation One-shot Unsupervised Domain Adaptation +2

MEDIMP: 3D Medical Images with clinical Prompts from limited tabular data for renal transplantation

1 code implementation22 Mar 2023 Leo Milecki, Vicky Kalogeiton, Sylvain Bodard, Dany Anglicheau, Jean-Michel Correas, Marc-Olivier Timsit, Maria Vakalopoulou

Our goal is to learn meaningful manifolds of renal transplant DCE MRI, interesting for the prognosis of the transplant or patient status (2, 3, and 4 years after the transplant), fully exploiting the limited available multi-modal data most efficiently.

Contrastive Learning Representation Learning

Machine Learning for Brain Disorders: Transformers and Visual Transformers

no code implementations21 Mar 2023 Robin Courant, Maika Edberg, Nicolas Dufour, Vicky Kalogeiton

For image classification, the most common Transformer Architecture uses only the Transformer Encoder in order to transform the various input tokens.

Image Classification

UGAE: A Novel Approach to Non-exponential Discounting

no code implementations11 Feb 2023 Ariel Kwiatkowski, Vicky Kalogeiton, Julien Pettré, Marie-Paule Cani

We also show experimentally that agents with non-exponential discounting trained via UGAE outperform variants trained with Monte Carlo advantage estimation.

SCAM! Transferring humans between images with Semantic Cross Attention Modulation

1 code implementation10 Oct 2022 Nicolas Dufour, David Picard, Vicky Kalogeiton

In this work, we introduce SCAM (Semantic Cross Attention Modulation), a system that encodes rich and diverse information in each semantic region of the image (including foreground and background), thus achieving precise generation with emphasis on fine details.

Pose Transfer Reconstruction +1

Understanding reinforcement learned crowds

1 code implementation19 Sep 2022 Ariel Kwiatkowski, Vicky Kalogeiton, Julien Pettré, Marie-Paule Cani

Each of these choices has a significant, and potentially nontrivial impact on the results, and so researchers should be mindful about choosing and reporting them in their work.

A Survey on Reinforcement Learning Methods in Character Animation

no code implementations7 Mar 2022 Ariel Kwiatkowski, Eduardo Alvarado, Vicky Kalogeiton, C. Karen Liu, Julien Pettré, Michiel Van de Panne, Marie-Paule Cani

Reinforcement Learning is an area of Machine Learning focused on how agents can be trained to make sequential decisions, and achieve a particular goal within an arbitrary environment.

reinforcement-learning Reinforcement Learning (RL)

Name Your Style: An Arbitrary Artist-aware Image Style Transfer

1 code implementation28 Feb 2022 Zhi-Song Liu, Li-Wen Wang, Wan-Chi Siu, Vicky Kalogeiton

Moreover, it can mimic the styles of one or many artists to achieve attractive results, thus highlighting a promising direction in image style transfer.

Style Transfer

Multiple Style Transfer via Variational AutoEncoder

1 code implementation13 Oct 2021 Zhi-Song Liu, Vicky Kalogeiton, Marie-Paule Cani

Modern works on style transfer focus on transferring style from a single image.

Style Transfer

Face, Body, Voice: Video Person-Clustering with Multiple Modalities

no code implementations20 May 2021 Andrew Brown, Vicky Kalogeiton, Andrew Zisserman

In this paper we make contributions to address both these deficiencies: first, we introduce a Multi-Modal High-Precision Clustering algorithm for person-clustering in videos using cues from several modalities (face, body, and voice).

Clustering Face Clustering

Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval

2 code implementations ECCV 2020 Andrew Brown, Weidi Xie, Vicky Kalogeiton, Andrew Zisserman

Optimising a ranking-based metric, such as Average Precision (AP), is notoriously challenging due to the fact that it is non-differentiable, and hence cannot be optimised directly using gradient-descent methods.

Image Instance Retrieval Metric Learning +2

Action Tubelet Detector for Spatio-Temporal Action Localization

2 code implementations ICCV 2017 Vicky Kalogeiton, Philippe Weinzaepfel, Vittorio Ferrari, Cordelia Schmid

We propose the ACtion Tubelet detector (ACT-detector) that takes as input a sequence of frames and outputs tubelets, i. e., sequences of bounding boxes with associated scores.

Spatio-Temporal Action Localization Temporal Action Localization

Cannot find the paper you are looking for? You can Submit a new open access paper.