1 code implementation • 12 Jan 2023 • Yuejiang Liu, Alexandre Alahi, Chris Russell, Max Horn, Dominik Zietlow, Bernhard Schölkopf, Francesco Locatello
Recent years have seen a surge of interest in learning high-level causal representations from low-level image pairs under interventions.
no code implementations • 24 Nov 2022 • Benjamin Kiefer, Matej Kristan, Janez Perš, Lojze Žust, Fabio Poiesi, Fabio Augusto de Alcantara Andrade, Alexandre Bernardino, Matthew Dawkins, Jenni Raitoharju, Yitong Quan, Adem Atmaca, Timon Höfer, Qiming Zhang, Yufei Xu, Jing Zhang, DaCheng Tao, Lars Sommer, Raphael Spraul, Hangyue Zhao, Hongpu Zhang, Yanyun Zhao, Jan Lukas Augustin, Eui-ik Jeon, Impyeong Lee, Luca Zedda, Andrea Loddo, Cecilia Di Ruberto, Sagar Verma, Siddharth Gupta, Shishir Muralidhara, Niharika Hegde, Daitao Xing, Nikolaos Evangeliou, Anthony Tzes, Vojtěch Bartl, Jakub Špaňhel, Adam Herout, Neelanjan Bhowmik, Toby P. Breckon, Shivanand Kundargi, Tejas Anvekar, Chaitra Desai, Ramesh Ashok Tabib, Uma Mudengudi, Arpita Vats, Yang song, Delong Liu, Yonglin Li, Shuman Li, Chenhao Tan, Long Lan, Vladimir Somers, Christophe De Vleeschouwer, Alexandre Alahi, Hsiang-Wei Huang, Cheng-Yen Yang, Jenq-Neng Hwang, Pyong-Kun Kim, Kwangju Kim, Kyoungoh Lee, Shuai Jiang, Haiwen Li, Zheng Ziqiang, Tuan-Anh Vu, Hai Nguyen-Truong, Sai-Kit Yeung, Zhuang Jia, Sophia Yang, Chih-Chung Hsu, Xiu-Yu Hou, Yu-An Jhang, Simon Yang, Mau-Tsuen Yang
The 1$^{\text{st}}$ Workshop on Maritime Computer Vision (MaCVi) 2023 focused on maritime computer vision for Unmanned Aerial Vehicles (UAV) and Unmanned Surface Vehicle (USV), and organized several subchallenges in this domain: (i) UAV-based Maritime Object Detection, (ii) UAV-based Maritime Object Tracking, (iii) USV-based Maritime Obstacle Segmentation and (iv) USV-based Maritime Obstacle Detection.
1 code implementation • 7 Nov 2022 • Vladimir Somers, Christophe De Vleeschouwer, Alexandre Alahi
Firstly, individual body part appearance is not as discriminative as global appearance (two distinct IDs might have the same local appearance), this means standard ReID training objectives using identity labels are not adapted to local feature learning.
Ranked #1 on
Person Re-Identification
on P-DukeMTMC-reID
1 code implementation • 6 Nov 2022 • Parth Kothari, Danya Li, Yuejiang Liu, Alexandre Alahi
To this end, we introduce two components that exploit our prior knowledge of motion style shifts: (i) a low-rank motion style adapter that projects and adjusts the style features at a low-dimensional bottleneck; and (ii) a modular adapter strategy that disentangles the features of scene context and motion history to facilitate a fine-grained choice of adaptation layers.
1 code implementation • 12 Oct 2022 • Megh Shukla, Roshan Roy, Pankaj Singh, Shuaib Ahmed, Alexandre Alahi
We begin with a simple premise: pose estimators often predict incoherent poses for out-of-distribution samples.
no code implementations • 11 Oct 2022 • Saeed Saadatnejad, Ali Rasekh, Mohammadreza Mofayezi, Yasamin Medghalchi, Sara Rajabzadeh, Taylor Mordan, Alexandre Alahi
We also propose a generic framework to improve any 3D pose forecasting model by leveraging our diffusion model in two additional steps: a pre-processing step to repair the inputs and a post-processing step to refine the outputs.
Ranked #1 on
Human Pose Forecasting
on 3DPW
(FDE@560ms (mm) metric)
6 code implementations • 5 Oct 2022 • Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao, Chengzhi Lin, Cheuk-Yiu Chan, Chun Chuen Hui, Dengjie Li, Fan Yang, Fan Liang, Fang Da, Feng Yan, Fufu Yu, Guanshuo Wang, H. Anthony Chan, He Zhu, Hongwei Kan, Jiaming Chu, Jianming Hu, Jianyang Gu, Jin Chen, João V. B. Soares, Jonas Theiner, Jorge De Corte, José Henrique Brito, Jun Zhang, Junjie Li, Junwei Liang, Leqi Shen, Lin Ma, Lingchi Chen, Miguel Santos Marques, Mike Azatov, Nikita Kasatkin, Ning Wang, Qiong Jia, Quoc Cuong Pham, Ralph Ewerth, Ran Song, RenGang Li, Rikke Gade, Ruben Debien, Runze Zhang, Sangrok Lee, Sergio Escalera, Shan Jiang, Shigeyuki Odashima, Shimin Chen, Shoichi Masui, Shouhong Ding, Sin-wai Chan, Siyu Chen, Tallal El-Shabrawy, Tao He, Thomas B. Moeslund, Wan-Chi Siu, Wei zhang, Wei Li, Xiangwei Wang, Xiao Tan, Xiaochuan Li, Xiaolin Wei, Xiaoqing Ye, Xing Liu, Xinying Wang, Yandong Guo, YaQian Zhao, Yi Yu, YingYing Li, Yue He, Yujie Zhong, Zhenhua Guo, Zhiheng Li
The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team.
no code implementations • 25 Sep 2022 • Parth Kothari, Alexandre Alahi
Human trajectory forecasting in crowds presents the challenges of modelling social interactions and outputting collision-free multimodal distribution.
1 code implementation • 28 Jun 2022 • Saeed Saadatnejad, Yi Zhou Ju, Alexandre Alahi
Safety is still the main issue of autonomous driving, and in order to be globally deployed, they need to predict pedestrians' motions sufficiently in advance.
2 code implementations • 4 Mar 2022 • Dongxu Guo, Taylor Mordan, Alexandre Alahi
Considering the lack of suitable existing datasets for it, we release TRANS, a benchmark for explicitly studying the stop and go behaviors of pedestrians in urban traffic.
1 code implementation • 9 Dec 2021 • Saeed Saadatnejad, Siyuan Li, Taylor Mordan, Alexandre Alahi
We build on successful cGAN models to propose a new semantically-aware discriminator that better guides the generator.
1 code implementation • 8 Dec 2021 • Younes Belkada, Lorenzo Bertoni, Romain Caristan, Taylor Mordan, Alexandre Alahi
In urban or crowded environments, humans rely on eye contact for fast and efficient communication with nearby people.
1 code implementation • 7 Dec 2021 • Mohammad Reza Samsami, Mohammadhossein Bahari, Saber Salehkaleybar, Alexandre Alahi
CIM explicitly discovers the causal model and utilizes it to train the policy.
1 code implementation • CVPR 2022 • Mohammadhossein Bahari, Saeed Saadatnejad, Ahmad Rahimi, Mohammad Shaverdikondori, Amir-Hossein Shahidzadeh, Seyed-Mohsen Moosavi-Dezfooli, Alexandre Alahi
We further show that the generated scenes (i) are realistic since they do exist in the real world, and (ii) can be used to make existing models more robust, yielding 30-40 reductions in the off-road rate.
1 code implementation • NeurIPS 2021 • Yuejiang Liu, Parth Kothari, Bastien Van Delft, Baptiste Bellot-Gurlet, Taylor Mordan, Alexandre Alahi
In this work, we first provide an in-depth look at its limitations and show that TTT can possibly deteriorate, instead of improving, the test-time performance in the presence of severe distribution shifts.
2 code implementations • CVPR 2022 • Yuejiang Liu, Riccardo Cadei, Jonas Schweizer, Sherwin Bahmani, Alexandre Alahi
Learning behavioral patterns from observational data has been a de-facto approach to motion forecasting.
no code implementations • 12 Nov 2021 • Parth Kothari, Christian Perone, Luca Bergamini, Alexandre Alahi, Peter Ondruska
Despite promising progress in reinforcement learning (RL), developing algorithms for autonomous driving (AD) remains challenging: one of the critical issues being the absence of an open-source platform capable of training and effectively validating the RL policies on real-world data.
1 code implementation • 7 Oct 2021 • Mohammadhossein Bahari, Vahid Zehtab, Sadegh Khorasani, Sana Ayromlou, Saeed Saadatnejad, Alexandre Alahi
Finally, we illustrate how, by using SVG, one can benefit from datasets and advancements in other research fronts that also utilize the same input format.
1 code implementation • ICCV 2021 • Duncan Zauss, Sven Kreiss, Alexandre Alahi
We present a fast bottom-up method that jointly detects over 100 keypoints on humans or objects, also referred to as human/object pose estimation.
Ranked #1 on
Car Pose Estimation
on ApolloCar3D
1 code implementation • ICCV 2021 • Xuanchi Ren, Tao Yang, Li Erran Li, Alexandre Alahi, Qifeng Chen
The ability to predict unseen vehicles is critical for safety in autonomous driving.
2 code implementations • 24 Aug 2021 • Saeed Saadatnejad, Mohammadhossein Bahari, Pedram Khorsandi, Mohammad Saneian, Seyed-Mohsen Moosavi-Dezfooli, Alexandre Alahi
An attack is a small yet carefully-crafted perturbations to fail predictors.
no code implementations • CVPR 2021 • Parth Kothari, Brian Sifringer, Alexandre Alahi
Human trajectory forecasting in crowds, at its core, is a sequence prediction problem with specific challenges of capturing inter-sequence dependencies (social interactions) and consequently predicting socially-compliant multimodal distributions.
1 code implementation • 8 Mar 2021 • Mohammadhossein Bahari, Ismail Nejjar, Alexandre Alahi
On the other hand, recent works use data-driven approaches which can learn complex interactions from the data leading to superior performance.
6 code implementations • 3 Mar 2021 • Sven Kreiss, Lorenzo Bertoni, Alexandre Alahi
We present a generic neural network architecture that uses Composite Fields to detect and construct a spatio-temporal pose which is a single, connected graph whose nodes are the semantic keypoints (e. g., a person's body joints) in multiple frames.
Ranked #7 on
Multi-Person Pose Estimation
on COCO
4 code implementations • ICCV 2021 • Yuejiang Liu, Qi Yan, Alexandre Alahi
Learning socially-aware motion representations is at the core of recent advances in multi-agent problems, such as human motion forecasting and robot navigation in crowds.
Ranked #1 on
Trajectory Prediction
on TrajNet++
1 code implementation • 4 Dec 2020 • Taylor Mordan, Matthieu Cord, Patrick Pérez, Alexandre Alahi
By increasing the number of attributes jointly learned, we highlight an issue related to the scales of gradients, which arises in MTL with numerous tasks.
1 code implementation • 20 Oct 2020 • Smail Ait Bouhsain, Saeed Saadatnejad, Alexandre Alahi
This work tries to solve this problem by jointly predicting the intention and visual states of pedestrians.
1 code implementation • 16 Sep 2020 • George Adaimi, Sven Kreiss, Alexandre Alahi
Drones or UAVs, equipped with different sensors, have been deployed in many places especially for urban traffic monitoring or last-mile delivery.
1 code implementation • 1 Sep 2020 • Lorenzo Bertoni, Sven Kreiss, Alexandre Alahi
Our neural network estimates human 3D body locations and their orientation with a measure of uncertainty.
2 code implementations • 25 Aug 2020 • Lorenzo Bertoni, Sven Kreiss, Taylor Mordan, Alexandre Alahi
Monocular and stereo visions are cost-effective solutions for 3D human localization in the context of self-driving cars or social robots.
2 code implementations • NeurIPS 2020 • Alexandre Carlier, Martin Danelljan, Alexandre Alahi, Radu Timofte
Scalable Vector Graphics (SVG) are ubiquitous in modern 2D interfaces due to their ability to scale to different resolutions.
Ranked #1 on
Vector Graphics Animation
on SVG-Icons8
1 code implementation • 7 Jul 2020 • Parth Kothari, Sven Kreiss, Alexandre Alahi
In this work, we present an in-depth analysis of existing deep learning-based methods for modelling social interactions.
Ranked #3 on
Trajectory Prediction
on TrajNet++
no code implementations • 2 Oct 2019 • Brigit Schroeder, Hanlin Tang, Alexandre Alahi
We propose a simple yet effective method for leveraging these image priors to improve semantic segmentation of images from sequential driving datasets.
3 code implementations • ICCV 2019 • Lorenzo Bertoni, Sven Kreiss, Alexandre Alahi
We tackle the fundamentally ill-posed problem of 3D human localization from monocular RGB images.
1 code implementation • 11 Jun 2019 • George Adaimi, Sven Kreiss, Alexandre Alahi
We argue that such loss function is not suited for the visual re-identification task hence propose to model confidence in the representation learning framework.
no code implementations • CVPR 2019 • Sina Mokhtarzadeh Azar, Mina Ghadimi Atigh, Ahmad Nickabadi, Alexandre Alahi
We present an end-to-end deep Convolutional Neural Network called Convolutional Relational Machine (CRM) for recognizing group activities that utilizes the information in spatial relations between individual persons in image or video.
2 code implementations • CVPR 2019 • Sven Kreiss, Lorenzo Bertoni, Alexandre Alahi
We propose a new bottom-up method for multi-person 2D human pose estimation that is particularly well suited for urban mobility such as self-driving cars and delivery robots.
Ranked #10 on
Keypoint Detection
on COCO test-dev
1 code implementation • 2 Feb 2019 • Yuejiang Liu, Parth Kothari, Alexandre Alahi
The standard practice in Generative Adversarial Networks (GANs) discards the discriminator during sampling.
1 code implementation • 23 Dec 2018 • Brian Sifringer, Virginie Lurkin, Alexandre Alahi
In discrete choice modeling (DCM), model misspecifications may lead to limited predictability and biased parameter estimates.
5 code implementations • 24 Sep 2018 • Changan Chen, Yuejiang Liu, Sven Kreiss, Alexandre Alahi
We propose to (i) rethink pairwise interactions with a self-attention mechanism, and (ii) jointly model Human-Robot as well as Human-Human interactions in the deep reinforcement learning framework.
7 code implementations • CVPR 2018 • Agrim Gupta, Justin Johnson, Li Fei-Fei, Silvio Savarese, Alexandre Alahi
Understanding human motion behavior is critical for autonomous moving platforms (like self-driving cars and social robots) if they are to navigate human-centric environments.
Ranked #4 on
Trajectory Prediction
on ETH
no code implementations • ECCV 2018 • Amir Sadeghian, Ferdinand Legros, Maxime Voisin, Ricky Vesel, Alexandre Alahi, Silvio Savarese
We exploit two sources of information: the past motion trajectory of the agent of interest and a wide top-view image of the navigation scene.
no code implementations • 1 Aug 2017 • Albert Haque, Michelle Guo, Alexandre Alahi, Serena Yeung, Zelun Luo, Alisha Rege, Jeffrey Jopling, Lance Downing, William Beninati, Amit Singh, Terry Platchek, Arnold Milstein, Li Fei-Fei
One in twenty-five patients admitted to a hospital will suffer from a hospital acquired infection.
no code implementations • CVPR 2017 • Katsuyuki Nakamura, Serena Yeung, Alexandre Alahi, Li Fei-Fei
Physiological signals such as heart rate can provide valuable information about an individual's state and activity.
no code implementations • ICCV 2017 • Agrim Gupta, Justin Johnson, Alexandre Alahi, Li Fei-Fei
Recent progress in style transfer on images has focused on improving the quality of stylized images and speed of methods.
no code implementations • ICCV 2017 • Amir Sadeghian, Alexandre Alahi, Silvio Savarese
To address this challenge, we present a structure of Recurrent Neural Networks (RNN) that jointly reasons on multiple cues over a temporal window.
no code implementations • CVPR 2017 • Zelun Luo, Boya Peng, De-An Huang, Alexandre Alahi, Li Fei-Fei
We present an unsupervised representation learning approach that compactly encodes the motion dependencies in videos.
no code implementations • CVPR 2017 • Timur Bagautdinov, Alexandre Alahi, François Fleuret, Pascal Fua, Silvio Savarese
We present a unified framework for understanding human social behaviors in raw image sequences.
Ranked #2 on
Action Recognition
on Volleyball
no code implementations • CVPR 2016 • Albert Haque, Alexandre Alahi, Li Fei-Fei
We present an attention-based model that reasons on human body shape and motion dynamics to identify individuals in the absence of RGB information, hence in the dark.
no code implementations • CVPR 2016 • Alexandre Alahi, Kratarth Goel, Vignesh Ramanathan, Alexandre Robicquet, Li Fei-Fei, Silvio Savarese
Different from the conventional LSTM, we share the information between multiple LSTMs through a new pooling layer.
Ranked #1 on
Trajectory Prediction
on Stanford Drone
(ADE (8/12) @K=5 metric)
77 code implementations • 27 Mar 2016 • Justin Johnson, Alexandre Alahi, Li Fei-Fei
We consider image transformation problems, where an input image is transformed into an output image.
Ranked #4 on
Nuclear Segmentation
on Cell17
2 code implementations • 23 Mar 2016 • Albert Haque, Boya Peng, Zelun Luo, Alexandre Alahi, Serena Yeung, Li Fei-Fei
We propose a viewpoint invariant model for 3D human pose estimation from a single depth image.
Ranked #4 on
Pose Estimation
on ITOP top-view
no code implementations • 22 Mar 2016 • Lamberto Ballan, Francesco Castaldo, Alexandre Alahi, Francesco Palmieri, Silvio Savarese
When given a single frame of the video, humans can not only interpret the content of the scene, but also they are able to forecast the near future.
no code implementations • 5 Jan 2016 • Alexandre Robicquet, Alexandre Alahi, Amir Sadeghian, Bryan Anenberg, John Doherty, Eli Wu, Silvio Savarese
We present an extensive evaluation where different methods for trajectory forecasting are evaluated and compared.
no code implementations • ICCV 2015 • Alexandre Alahi, Albert Haque, Li Fei-Fei
Inspired by the recent success of RGB-D cameras, we propose the enrichment of RGB data with an additional "quasi-free" modality, namely, the wireless signal (e. g., wifi or Bluetooth) emitted by individuals' cell phones, referred to as RGB-W.
no code implementations • ICCV 2015 • Yu Xiang, Alexandre Alahi, Silvio Savarese
Online Multi-Object Tracking (MOT) has wide applications in time-critical video analysis scenarios, such as robot navigation and autonomous driving.
Ranked #18 on
Multiple Object Tracking
on KITTI Tracking test
no code implementations • CVPR 2014 • Alexandre Alahi, Vignesh Ramanathan, Li Fei-Fei
In crowded spaces such as city centers or train stations, human mobility looks complex, but is often influenced only by a few causes.