Causal Triplet: An Open Challenge for Intervention-centric Causal Representation Learning

1 code implementation 12 Jan 2023 Yuejiang Liu, Alexandre Alahi, Chris Russell, Max Horn, Dominik Zietlow, Bernhard Schölkopf, Francesco Locatello

Recent years have seen a surge of interest in learning high-level causal representations from low-level image pairs under interventions.

Representation Learning

1st Workshop on Maritime Computer Vision (MaCVi) 2023: Challenge Results

no code implementations 24 Nov 2022 Benjamin Kiefer, Matej Kristan, Janez Perš, Lojze Žust, Fabio Poiesi, Fabio Augusto de Alcantara Andrade, Alexandre Bernardino, Matthew Dawkins, Jenni Raitoharju, Yitong Quan, Adem Atmaca, Timon Höfer, Qiming Zhang, Yufei Xu, Jing Zhang, DaCheng Tao, Lars Sommer, Raphael Spraul, Hangyue Zhao, Hongpu Zhang, Yanyun Zhao, Jan Lukas Augustin, Eui-ik Jeon, Impyeong Lee, Luca Zedda, Andrea Loddo, Cecilia Di Ruberto, Sagar Verma, Siddharth Gupta, Shishir Muralidhara, Niharika Hegde, Daitao Xing, Nikolaos Evangeliou, Anthony Tzes, Vojtěch Bartl, Jakub Špaňhel, Adam Herout, Neelanjan Bhowmik, Toby P. Breckon, Shivanand Kundargi, Tejas Anvekar, Chaitra Desai, Ramesh Ashok Tabib, Uma Mudengudi, Arpita Vats, Yang song, Delong Liu, Yonglin Li, Shuman Li, Chenhao Tan, Long Lan, Vladimir Somers, Christophe De Vleeschouwer, Alexandre Alahi, Hsiang-Wei Huang, Cheng-Yen Yang, Jenq-Neng Hwang, Pyong-Kun Kim, Kwangju Kim, Kyoungoh Lee, Shuai Jiang, Haiwen Li, Zheng Ziqiang, Tuan-Anh Vu, Hai Nguyen-Truong, Sai-Kit Yeung, Zhuang Jia, Sophia Yang, Chih-Chung Hsu, Xiu-Yu Hou, Yu-An Jhang, Simon Yang, Mau-Tsuen Yang

The 1st Workshop on Maritime Computer Vision (MaCVi) 2023 focused on maritime computer vision for Unmanned Aerial Vehicles (UAVs) and Unmanned Surface Vehicles (USVs), and organized several subchallenges in this domain: (i) UAV-based Maritime Object Detection, (ii) UAV-based Maritime Object Tracking, (iii) USV-based Maritime Obstacle Segmentation, and (iv) USV-based Maritime Obstacle Detection.

Object Detection +1

Body Part-Based Representation Learning for Occluded Person Re-Identification

1 code implementation 7 Nov 2022 Vladimir Somers, Christophe De Vleeschouwer, Alexandre Alahi

Firstly, individual body part appearance is not as discriminative as global appearance (two distinct IDs might have the same local appearance); this means that standard ReID training objectives using identity labels are not adapted to local feature learning.

Human Parsing Part-based Representation Learning +3

Motion Style Transfer: Modular Low-Rank Adaptation for Deep Motion Forecasting

1 code implementation 6 Nov 2022 Parth Kothari, Danya Li, Yuejiang Liu, Alexandre Alahi

To this end, we introduce two components that exploit our prior knowledge of motion style shifts: (i) a low-rank motion style adapter that projects and adjusts the style features at a low-dimensional bottleneck; and (ii) a modular adapter strategy that disentangles the features of scene context and motion history to facilitate a fine-grained choice of adaptation layers.

Motion Forecasting Motion Style Transfer +2

A generic diffusion-based approach for 3D human pose prediction in the wild

no code implementations 11 Oct 2022 Saeed Saadatnejad, Ali Rasekh, Mohammadreza Mofayezi, Yasamin Medghalchi, Sara Rajabzadeh, Taylor Mordan, Alexandre Alahi

We also propose a generic framework to improve any 3D pose forecasting model by leveraging our diffusion model in two additional steps: a pre-processing step to repair the inputs and a post-processing step to refine the outputs.

Ranked #1 on Human Pose Forecasting on 3DPW (FDE@560ms (mm) metric)

Human Pose Forecasting Pose Prediction

SoccerNet 2022 Challenges Results

6 code implementations 5 Oct 2022 Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao, Chengzhi Lin, Cheuk-Yiu Chan, Chun Chuen Hui, Dengjie Li, Fan Yang, Fan Liang, Fang Da, Feng Yan, Fufu Yu, Guanshuo Wang, H. Anthony Chan, He Zhu, Hongwei Kan, Jiaming Chu, Jianming Hu, Jianyang Gu, Jin Chen, João V. B. Soares, Jonas Theiner, Jorge De Corte, José Henrique Brito, Jun Zhang, Junjie Li, Junwei Liang, Leqi Shen, Lin Ma, Lingchi Chen, Miguel Santos Marques, Mike Azatov, Nikita Kasatkin, Ning Wang, Qiong Jia, Quoc Cuong Pham, Ralph Ewerth, Ran Song, RenGang Li, Rikke Gade, Ruben Debien, Runze Zhang, Sangrok Lee, Sergio Escalera, Shan Jiang, Shigeyuki Odashima, Shimin Chen, Shoichi Masui, Shouhong Ding, Sin-wai Chan, Siyu Chen, Tallal El-Shabrawy, Tao He, Thomas B. Moeslund, Wan-Chi Siu, Wei zhang, Wei Li, Xiangwei Wang, Xiao Tan, Xiaochuan Li, Xiaolin Wei, Xiaoqing Ye, Xing Liu, Xinying Wang, Yandong Guo, YaQian Zhao, Yi Yu, YingYing Li, Yue He, Yujie Zhong, Zhenhua Guo, Zhiheng Li

The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team.

Action Spotting Camera Calibration +3

Safety-compliant Generative Adversarial Networks for Human Trajectory Forecasting

no code implementations 25 Sep 2022 Parth Kothari, Alexandre Alahi

Human trajectory forecasting in crowds presents the challenges of modelling social interactions and outputting a collision-free multimodal distribution.

Trajectory Forecasting

Pedestrian 3D Bounding Box Prediction

1 code implementation 28 Jun 2022 Saeed Saadatnejad, Yi Zhou Ju, Alexandre Alahi

Safety is still the main issue in autonomous driving, and in order to be globally deployed, autonomous vehicles need to predict pedestrians' motions sufficiently in advance.

Action Anticipation Autonomous Driving

Pedestrian Stop and Go Forecasting with Hybrid Feature Fusion

2 code implementations 4 Mar 2022 Dongxu Guo, Taylor Mordan, Alexandre Alahi

Considering the lack of suitable existing datasets for it, we release TRANS, a benchmark for explicitly studying the stop and go behaviors of pedestrians in urban traffic.

Autonomous Driving Motion Prediction +1

A Shared Representation for Photorealistic Driving Simulators

1 code implementation 9 Dec 2021 Saeed Saadatnejad, Siyuan Li, Taylor Mordan, Alexandre Alahi

We build on successful cGAN models to propose a new semantically-aware discriminator that better guides the generator.

Autonomous Vehicles Image Generation +1

Do Pedestrians Pay Attention? Eye Contact Detection in the Wild

1 code implementation 8 Dec 2021 Younes Belkada, Lorenzo Bertoni, Romain Caristan, Taylor Mordan, Alexandre Alahi

In urban or crowded environments, humans rely on eye contact for fast and efficient communication with nearby people.

Autonomous Vehicles Domain Adaptation +1

Vehicle trajectory prediction works, but not everywhere

1 code implementation CVPR 2022 Mohammadhossein Bahari, Saeed Saadatnejad, Ahmad Rahimi, Mohammad Shaverdikondori, Amir-Hossein Shahidzadeh, Seyed-Mohsen Moosavi-Dezfooli, Alexandre Alahi

We further show that the generated scenes (i) are realistic since they do exist in the real world, and (ii) can be used to make existing models more robust, yielding 30-40% reductions in the off-road rate.

Scene Generation Self-Driving Cars +1

TTT++: When Does Self-Supervised Test-Time Training Fail or Thrive?

1 code implementation NeurIPS 2021 Yuejiang Liu, Parth Kothari, Bastien Van Delft, Baptiste Bellot-Gurlet, Taylor Mordan, Alexandre Alahi

In this work, we first provide an in-depth look at its limitations and show that TTT can degrade, rather than improve, test-time performance in the presence of severe distribution shifts.

Contrastive Learning Self-Supervised Learning

DriverGym: Democratising Reinforcement Learning for Autonomous Driving

no code implementations 12 Nov 2021 Parth Kothari, Christian Perone, Luca Bergamini, Alexandre Alahi, Peter Ondruska

Despite promising progress in reinforcement learning (RL), developing algorithms for autonomous driving (AD) remains challenging; one of the critical issues is the absence of an open-source platform capable of training and effectively validating RL policies on real-world data.

Autonomous Driving OpenAI Gym +2

SVG-Net: An SVG-based Trajectory Prediction Model

1 code implementation 7 Oct 2021 Mohammadhossein Bahari, Vahid Zehtab, Sadegh Khorasani, Sana Ayromlou, Saeed Saadatnejad, Alexandre Alahi

Finally, we illustrate how, by using SVG, one can benefit from datasets and advancements in other research fronts that also utilize the same input format.

Autonomous Driving Trajectory Prediction +1

Keypoint Communities

1 code implementation ICCV 2021 Duncan Zauss, Sven Kreiss, Alexandre Alahi

We present a fast bottom-up method that jointly detects over 100 keypoints on humans or objects, also referred to as human/object pose estimation.

Car Pose Estimation Keypoint Detection

Interpretable Social Anchors for Human Trajectory Forecasting in Crowds

no code implementations CVPR 2021 Parth Kothari, Brian Sifringer, Alexandre Alahi

Human trajectory forecasting in crowds, at its core, is a sequence prediction problem with specific challenges of capturing inter-sequence dependencies (social interactions) and consequently predicting socially-compliant multimodal distributions.

Trajectory Forecasting

Injecting Knowledge in Data-driven Vehicle Trajectory Predictors

1 code implementation 8 Mar 2021 Mohammadhossein Bahari, Ismail Nejjar, Alexandre Alahi

On the other hand, recent works use data-driven approaches, which can learn complex interactions from the data, leading to superior performance.

Trajectory Prediction

OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association

6 code implementations 3 Mar 2021 Sven Kreiss, Lorenzo Bertoni, Alexandre Alahi

We present a generic neural network architecture that uses Composite Fields to detect and construct a spatio-temporal pose, which is a single, connected graph whose nodes are the semantic keypoints (e.g., a person's body joints) in multiple frames.

Association Keypoint Detection +2

Social NCE: Contrastive Learning of Socially-aware Motion Representations

4 code implementations ICCV 2021 Yuejiang Liu, Qi Yan, Alexandre Alahi

Learning socially-aware motion representations is at the core of recent advances in multi-agent problems, such as human motion forecasting and robot navigation in crowds.

Autonomous Navigation Motion Forecasting +1

Detecting 32 Pedestrian Attributes for Autonomous Vehicles

1 code implementation 4 Dec 2020 Taylor Mordan, Matthieu Cord, Patrick Pérez, Alexandre Alahi

By increasing the number of attributes jointly learned, we highlight an issue related to the scales of gradients, which arises in MTL with numerous tasks.

Autonomous Driving Multi-Task Learning

Pedestrian Intention Prediction: A Multi-task Perspective

1 code implementation 20 Oct 2020 Smail Ait Bouhsain, Saeed Saadatnejad, Alexandre Alahi

This work tries to solve this problem by jointly predicting the intention and visual states of pedestrians.

Autonomous Vehicles Multi-Task Learning

Perceiving Traffic from Aerial Images

1 code implementation 16 Sep 2020 George Adaimi, Sven Kreiss, Alexandre Alahi

Drones, or UAVs, equipped with different sensors have been deployed in many places, especially for urban traffic monitoring or last-mile delivery.

Object Detection

Perceiving Humans: from Monocular 3D Localization to Social Distancing

1 code implementation 1 Sep 2020 Lorenzo Bertoni, Sven Kreiss, Alexandre Alahi

Our neural network estimates human 3D body locations and their orientation with a measure of uncertainty.

MonStereo: When Monocular and Stereo Meet at the Tail of 3D Human Localization

2 code implementations 25 Aug 2020 Lorenzo Bertoni, Sven Kreiss, Taylor Mordan, Alexandre Alahi

Monocular and stereo vision are cost-effective solutions for 3D human localization in the context of self-driving cars or social robots.

Self-Driving Cars

Human Trajectory Forecasting in Crowds: A Deep Learning Perspective

1 code implementation 7 Jul 2020 Parth Kothari, Sven Kreiss, Alexandre Alahi

In this work, we present an in-depth analysis of existing deep learning-based methods for modelling social interactions.

Trajectory Forecasting

Using Image Priors to Improve Scene Understanding

no code implementations 2 Oct 2019 Brigit Schroeder, Hanlin Tang, Alexandre Alahi

We propose a simple yet effective method for leveraging these image priors to improve semantic segmentation of images from sequential driving datasets.

Autonomous Driving Scene Understanding +1

Deep Visual Re-Identification with Confidence

1 code implementation 11 Jun 2019 George Adaimi, Sven Kreiss, Alexandre Alahi

We argue that such a loss function is not suited to the visual re-identification task, and hence propose to model confidence in the representation learning framework.

Person Re-Identification Representation Learning

Convolutional Relational Machine for Group Activity Recognition

no code implementations CVPR 2019 Sina Mokhtarzadeh Azar, Mina Ghadimi Atigh, Ahmad Nickabadi, Alexandre Alahi

We present an end-to-end deep Convolutional Neural Network called Convolutional Relational Machine (CRM) for recognizing group activities, which utilizes the information in spatial relations between individual persons in an image or video.

Group Activity Recognition

PifPaf: Composite Fields for Human Pose Estimation

2 code implementations CVPR 2019 Sven Kreiss, Lorenzo Bertoni, Alexandre Alahi

We propose a new bottom-up method for multi-person 2D human pose estimation that is particularly well suited for urban mobility such as self-driving cars and delivery robots.

Association Keypoint Detection +2

Collaborative Sampling in Generative Adversarial Networks

1 code implementation 2 Feb 2019 Yuejiang Liu, Parth Kothari, Alexandre Alahi

The standard practice in Generative Adversarial Networks (GANs) discards the discriminator during sampling.

Image Generation

Enhancing Discrete Choice Models with Representation Learning

1 code implementation 23 Dec 2018 Brian Sifringer, Virginie Lurkin, Alexandre Alahi

In discrete choice modeling (DCM), model misspecifications may lead to limited predictability and biased parameter estimates.

Representation Learning

Crowd-Robot Interaction: Crowd-aware Robot Navigation with Attention-based Deep Reinforcement Learning

5 code implementations 24 Sep 2018 Changan Chen, Yuejiang Liu, Sven Kreiss, Alexandre Alahi

We propose to (i) rethink pairwise interactions with a self-attention mechanism, and (ii) jointly model Human-Robot as well as Human-Human interactions in the deep reinforcement learning framework.

Human Dynamics Navigate +3

Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks

7 code implementations CVPR 2018 Agrim Gupta, Justin Johnson, Li Fei-Fei, Silvio Savarese, Alexandre Alahi

Understanding human motion behavior is critical for autonomous moving platforms (like self-driving cars and social robots) if they are to navigate human-centric environments.

Motion Forecasting Multi-future Trajectory Prediction +3

CAR-Net: Clairvoyant Attentive Recurrent Network

no code implementations ECCV 2018 Amir Sadeghian, Ferdinand Legros, Maxime Voisin, Ricky Vesel, Alexandre Alahi, Silvio Savarese

We exploit two sources of information: the past motion trajectory of the agent of interest and a wide top-view image of the navigation scene.

Trajectory Forecasting

Tracking The Untrackable: Learning To Track Multiple Cues with Long-Term Dependencies

no code implementations ICCV 2017 Amir Sadeghian, Alexandre Alahi, Silvio Savarese

To address this challenge, we present a structure of Recurrent Neural Networks (RNN) that jointly reasons on multiple cues over a temporal window.


Unsupervised Learning of Long-Term Motion Dynamics for Videos

no code implementations CVPR 2017 Zelun Luo, Boya Peng, De-An Huang, Alexandre Alahi, Li Fei-Fei

We present an unsupervised representation learning approach that compactly encodes the motion dependencies in videos.

Representation Learning

Recurrent Attention Models for Depth-Based Person Identification

no code implementations CVPR 2016 Albert Haque, Alexandre Alahi, Li Fei-Fei

We present an attention-based model that reasons on human body shape and motion dynamics to identify individuals in the absence of RGB information, hence in the dark.

Person Identification Reinforcement Learning +1

Knowledge Transfer for Scene-specific Motion Prediction

no code implementations 22 Mar 2016 Lamberto Ballan, Francesco Castaldo, Alexandre Alahi, Francesco Palmieri, Silvio Savarese

When given a single frame of the video, humans can not only interpret the content of the scene but also forecast the near future.

Motion Prediction Trajectory Prediction +1

RGB-W: When Vision Meets Wireless

no code implementations ICCV 2015 Alexandre Alahi, Albert Haque, Li Fei-Fei

Inspired by the recent success of RGB-D cameras, we propose the enrichment of RGB data with an additional "quasi-free" modality, namely, the wireless signal (e.g., WiFi or Bluetooth) emitted by individuals' cell phones, referred to as RGB-W.


Learning to Track: Online Multi-Object Tracking by Decision Making

no code implementations ICCV 2015 Yu Xiang, Alexandre Alahi, Silvio Savarese

Online Multi-Object Tracking (MOT) has wide applications in time-critical video analysis scenarios, such as robot navigation and autonomous driving.

Association Autonomous Driving +5

Socially-aware Large-scale Crowd Forecasting

no code implementations CVPR 2014 Alexandre Alahi, Vignesh Ramanathan, Li Fei-Fei

In crowded spaces such as city centers or train stations, human mobility looks complex, but is often influenced only by a few causes.

