Search Results for author: Hamid Rezatofighi

Found 47 papers, 18 papers with code

Deep Auto-Set: A Deep Auto-Encoder-Set Network for Activity Recognition Using Wearables

no code implementations • 20 Nov 2018 • Alireza Abedin Varamin, Ehsan Abbasnejad, Qinfeng Shi, Damith Ranasinghe, Hamid Rezatofighi

Automatic recognition of human activities from time-series sensor data (referred to as HAR) is a growing area of research in ubiquitous computing.

Activity Recognition Multi-class Classification +2

Paper
Add Code

TrackerBots: Software in the Loop Study of Quad-Copter Robots for Locating Radio-tags in a 3D Space

1 code implementation • 1 Dec 2018 • Hoa Van Nguyen, Hamid Rezatofighi, David Taggart, Bertram Ostendorf, Damith C. Ranasinghe

We investigate the problem of tracking and planning for a UAV in a task to locate multiple radio-tagged wildlife in a three-dimensional (3D) setting in the context of our TrackerBots research project.

Management TAG

Paper
Code

Learning Pairwise Relationship for Multi-object Detection in Crowded Scenes

no code implementations • 12 Jan 2019 • Yu Liu, Lingqiao Liu, Hamid Rezatofighi, Thanh-Toan Do, Qinfeng Shi, Ian Reid

As the post-processing step for object detection, non-maximum suppression (GreedyNMS) is widely used in most of the detectors for many years.

object-detection Object Detection

Paper
Add Code

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

10 code implementations • CVPR 2019 • Hamid Rezatofighi, Nathan Tsoi, JunYoung Gwak, Amir Sadeghian, Ian Reid, Silvio Savarese

By incorporating this generalized $IoU$ ($GIoU$) as a loss into the state-of-the art object detection frameworks, we show a consistent improvement on their performance using both the standard, $IoU$ based, and new, $GIoU$ based, performance measures on popular object detection benchmarks such as PASCAL VOC and MS COCO.

Object object-detection +2

12,034

Paper
Code

CVPR19 Tracking and Detection Challenge: How crowded can it get?

no code implementations • 10 Jun 2019 • Patrick Dendorfer, Hamid Rezatofighi, Anton Milan, Javen Shi, Daniel Cremers, Ian Reid, Stefan Roth, Konrad Schindler, Laura Leal-Taixe

Standardized benchmarks are crucial for the majority of computer vision applications.

Multiple Object Tracking Multiple People Tracking +1

Paper
Add Code

Meta Learning with Differentiable Closed-form Solver for Fast Video Object Segmentation

no code implementations • 28 Sep 2019 • Yu Liu, Lingqiao Liu, Haokui Zhang, Hamid Rezatofighi, Ian Reid

This paper tackles the problem of video object segmentation.

Meta-Learning Object +4

Paper
Add Code

JRDB: A Dataset and Benchmark of Egocentric Robot Visual Perception of Humans in Built Environments

1 code implementation • 25 Oct 2019 • Roberto Martín-Martín, Mihir Patel, Hamid Rezatofighi, Abhijeet Shenoi, JunYoung Gwak, Eric Frankel, Amir Sadeghian, Silvio Savarese

We present JRDB, a novel egocentric dataset collected from our social mobile manipulator JackRabbot.

Autonomous Navigation Human Detection

142

Paper
Code

Approximating the Permanent by Sampling from Adaptive Partitions

1 code implementation • NeurIPS 2019 • Jonathan Kuck, Tri Dao, Hamid Rezatofighi, Ashish Sabharwal, Stefano Ermon

Computing the permanent of a non-negative matrix is a core problem with practical applications ranging from target tracking to statistical thermodynamics.

Paper
Code

Learn to Predict Sets Using Feed-Forward Neural Networks

no code implementations • 30 Jan 2020 • Hamid Rezatofighi, Tianyu Zhu, Roman Kaskman, Farbod T. Motlagh, Qinfeng Shi, Anton Milan, Daniel Cremers, Laura Leal-Taixé, Ian Reid

In our formulation we define a likelihood for a set distribution represented by a) two discrete distributions defining the set cardinally and permutation variables, and b) a joint distribution over set elements with a fixed cardinality.

Multi-Label Image Classification object-detection +1

Paper
Add Code

JRMOT: A Real-Time 3D Multi-Object Tracker and a New Large-Scale Dataset

1 code implementation • 19 Feb 2020 • Abhijeet Shenoi, Mihir Patel, JunYoung Gwak, Patrick Goebel, Amir Sadeghian, Hamid Rezatofighi, Roberto Martín-Martín, Silvio Savarese

In this work we present JRMOT, a novel 3D MOT system that integrates information from RGB images and 3D point clouds to achieve real-time, state-of-the-art tracking performance.

Ranked #8 on Multiple Object Tracking on KITTI Tracking test

Autonomous Navigation Motion Planning +2

142

Paper
Code

MOT20: A benchmark for multi object tracking in crowded scenes

1 code implementation • 19 Mar 2020 • Patrick Dendorfer, Hamid Rezatofighi, Anton Milan, Javen Shi, Daniel Cremers, Ian Reid, Stefan Roth, Konrad Schindler, Laura Leal-Taixé

The benchmark for Multiple Object Tracking, MOTChallenge, was launched with the goal to establish a standardized evaluation of multiple object tracking methods.

Multi-Object Tracking Multiple Object Tracking with Transformer +2

12,034

Paper
Code

Joint Learning of Social Groups, Individuals Action and Sub-group Activities in Videos

no code implementations • ECCV 2020 • Mahsa Ehsanpour, Alireza Abedin, Fatemeh Saleh, Javen Shi, Ian Reid, Hamid Rezatofighi

In this paper, we solve the problem of simultaneously grouping people by their social interactions, predicting their individual actions and the social activity of each social group, which we call the social task.

Group Activity Recognition

Paper
Add Code

Socially and Contextually Aware Human Motion and Pose Forecasting

no code implementations • 14 Jul 2020 • Vida Adeli, Ehsan Adeli, Ian Reid, Juan Carlos Niebles, Hamid Rezatofighi

In this paper, we propose a novel framework to tackle both tasks of human motion (or trajectory) and body skeleton pose forecasting in a unified end-to-end pipeline.

Human Dynamics Robot Navigation

Paper
Add Code

Attend And Discriminate: Beyond the State-of-the-Art for Human Activity Recognition using Wearable Sensors

no code implementations • 14 Jul 2020 • Alireza Abedin, Mahsa Ehsanpour, Qinfeng Shi, Hamid Rezatofighi, Damith C. Ranasinghe

Wearables are fundamental to improving our understanding of human activities, especially for an increasing number of healthcare applications from rehabilitation to fine-grained gait analysis.

Human Activity Recognition

Paper
Add Code

How Trustworthy are Performance Evaluations for Basic Vision Tasks?

no code implementations • 8 Aug 2020 • Tran Thien Dat Nguyen, Hamid Rezatofighi, Ba-Ngu Vo, Ba-Tuong Vo, Silvio Savarese, Ian Reid

This paper examines performance evaluation criteria for basic vision tasks involving sets of objects namely, object detection, instance-level segmentation and multi-object tracking.

Multi-Object Tracking object-detection +1

Paper
Add Code

Probabilistic Tracklet Scoring and Inpainting for Multiple Object Tracking

no code implementations • CVPR 2021 • Fatemeh Saleh, Sadegh Aliakbarian, Hamid Rezatofighi, Mathieu Salzmann, Stephen Gould

Despite the recent advances in multiple object tracking (MOT), achieved by joint detection and tracking, dealing with long occlusions remains a challenge.

Multiple Object Tracking

Paper
Add Code

MOLTR: Multiple Object Localisation, Tracking, and Reconstruction from Monocular RGB Videos

no code implementations • 9 Dec 2020 • Kejie Li, Hamid Rezatofighi, Ian Reid

Given a new RGB frame, MOLTR firstly applies a monocular 3D detector to localise objects of interest and extract their shape codes that represent the object shapes in a learned embedding space.

Benchmarking Object +1

Paper
Add Code

Looking Beyond Two Frames: End-to-End Multi-Object Tracking Using Spatial and Temporal Transformers

1 code implementation • 27 Mar 2021 • Tianyu Zhu, Markus Hiller, Mahsa Ehsanpour, Rongkai Ma, Tom Drummond, Ian Reid, Hamid Rezatofighi

Tracking a time-varying indefinite number of objects in a video sequence over time remains a challenge despite recent advances in the field.

Multi-Object Tracking Object +1

Paper
Code

TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild

no code implementations • ICCV 2021 • Vida Adeli, Mahsa Ehsanpour, Ian Reid, Juan Carlos Niebles, Silvio Savarese, Ehsan Adeli, Hamid Rezatofighi

Joint forecasting of human trajectory and pose dynamics is a fundamental building block of various applications ranging from robotics and autonomous driving to surveillance systems.

Autonomous Driving Human-Object Interaction Detection

Paper
Add Code

JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection

no code implementations • CVPR 2022 • Mahsa Ehsanpour, Fatemeh Saleh, Silvio Savarese, Ian Reid, Hamid Rezatofighi

However, learning to recognise human actions and their social interactions in an unconstrained real-world environment comprising numerous people, with potentially highly unbalanced and long-tailed distributed action labels from a stream of sensory data captured from a mobile robot platform remains a significant challenge, not least owing to the lack of a reflective large-scale dataset.

Action Detection Action Understanding +1

Paper
Add Code

Unsupervised Image Segmentation by Mutual Information Maximization and Adversarial Regularization

no code implementations • 1 Jul 2021 • S. Ehsan Mirsadeghi, Ali Royat, Hamid Rezatofighi

In this paper, we propose a novel fully unsupervised semantic segmentation method, the so-called Information Maximization and Adversarial Regularization Segmentation (InMARS).

Ranked #2 on Unsupervised Semantic Segmentation on COCO-Stuff-15

Image Segmentation Scene Understanding +4

Paper
Add Code

ODAM: Object Detection, Association, and Mapping using Posed RGB Video

1 code implementation • ICCV 2021 • Kejie Li, Daniel DeTone, Steven Chen, Minh Vo, Ian Reid, Hamid Rezatofighi, Chris Sweeney, Julian Straub, Richard Newcombe

Localizing objects and estimating their extent in 3D is an important step towards high-level 3D scene understanding, which has many applications in Augmented Reality and Robotics.

3D Object Detection Object +2

Paper
Code

Guided-GAN: Adversarial Representation Learning for Activity Recognition with Wearables

no code implementations • 12 Oct 2021 • Alireza Abedin, Hamid Rezatofighi, Damith C. Ranasinghe

Human activity recognition (HAR) is an important research field in ubiquitous computing where the acquisition of large-scale labeled sensor data is tedious, labor-intensive and time consuming.

Generative Adversarial Network Human Activity Recognition +1

Paper
Add Code

GMFlow: Learning Optical Flow via Global Matching

4 code implementations • CVPR 2022 • Haofei Xu, Jing Zhang, Jianfei Cai, Hamid Rezatofighi, DaCheng Tao

Learning-based optical flow estimation has been dominated with the pipeline of cost volume with convolutions for flow regression, which is inherently limited to local correlations and thus is hard to address the long-standing challenge of large displacements.

Ranked #8 on Optical Flow Estimation on Spring

Optical Flow Estimation regression

883

Paper
Code

Accurate and Real-time 3D Pedestrian Detection Using an Efficient Attentive Pillar Network

1 code implementation • 31 Dec 2021 • Duy-Tho Le, Hengcan Shi, Hamid Rezatofighi, Jianfei Cai

Efficiently and accurately detecting people from 3D point cloud data is of great importance in many robotic and autonomous driving applications.

Ranked #1 on Birds Eye View Object Detection on KITTI Pedestrian Hard

3D Object Detection Autonomous Driving +3

Paper
Code

Learning of Global Objective for Network Flow in Multi-Object Tracking

no code implementations • CVPR 2022 • Shuai Li, Yu Kong, Hamid Rezatofighi

This paper concerns the problem of multi-object tracking based on the min-cost flow (MCF) formulation, which is conventionally studied as an instance of linear program.

Multi-Object Tracking

Paper
Add Code

SoMoFormer: Multi-Person Pose Forecasting with Transformers

1 code implementation • 30 Aug 2022 • Edward Vendrow, Satyajit Kumar, Ehsan Adeli, Hamid Rezatofighi

Although there are several previous works targeting the problem of multi-person dynamic pose forecasting, they often model the entire pose sequence as time series (ignoring the underlying relationship between joints) or only output the future pose sequence of one person at a time.

Human Pose Forecasting motion prediction +2

Paper
Code

LAVA: Label-efficient Visual Learning and Adaptation

1 code implementation • 19 Oct 2022 • Islam Nassar, Munawar Hayat, Ehsan Abbasnejad, Hamid Rezatofighi, Mehrtash Harandi, Gholamreza Haffari

We present LAVA, a simple yet effective method for multi-domain visual transfer learning with limited data.

Few-Shot Learning Transfer Learning

Paper
Code

JRDB-Pose: A Large-scale Dataset for Multi-Person Pose Estimation and Tracking

no code implementations • CVPR 2023 • Edward Vendrow, Duy Tho Le, Jianfei Cai, Hamid Rezatofighi

In crowded human scenes with close-up human-robot interaction and robot navigation, a deep understanding requires reasoning about human motion and body dynamics over time with human body pose estimation and tracking.

Multi-Person Pose Estimation Multi-Person Pose Estimation and Tracking +1

Paper
Add Code

Unifying Flow, Stereo and Depth Estimation

1 code implementation • 10 Nov 2022 • Haofei Xu, Jing Zhang, Jianfei Cai, Hamid Rezatofighi, Fisher Yu, DaCheng Tao, Andreas Geiger

We present a unified formulation and model for three motion and 3D perception tasks: optical flow, rectified stereo matching and unrectified stereo depth estimation from posed images.

Ranked #1 on Optical Flow Estimation on Sintel-clean

Optical Flow Estimation Stereo Depth Estimation +1

883

Paper
Code

MARLIN: Masked Autoencoder for facial video Representation LearnINg

1 code implementation • CVPR 2023 • Zhixi Cai, Shreya Ghosh, Kalin Stefanov, Abhinav Dhall, Jianfei Cai, Hamid Rezatofighi, Reza Haffari, Munawar Hayat

This paper proposes a self-supervised approach to learn universal facial representations from videos, that can transfer across a variety of facial analysis tasks such as Facial Attribute Recognition (FAR), Facial Expression Recognition (FER), DeepFake Detection (DFD), and Lip Synchronization (LS).

Ranked #1 on Emotion Classification on CMU-MOSEI

Action Classification Attribute +9

186

Paper
Code

ActiveRMAP: Radiance Field for Active Mapping And Planning

no code implementations • 23 Nov 2022 • Huangying Zhan, Jiyang Zheng, Yi Xu, Ian Reid, Hamid Rezatofighi

We, for the first time, present an RGB-only active vision framework using radiance field representation for active 3D reconstruction and planning in an online manner.

3D Reconstruction

Paper
Add Code

Predicting Topological Maps for Visual Navigation in Unexplored Environments

no code implementations • 23 Nov 2022 • Huangying Zhan, Hamid Rezatofighi, Ian Reid

We propose a robotic learning system for autonomous exploration and navigation in unexplored environments.

Visual Navigation

Paper
Add Code

Energy-based Self-Training and Normalization for Unsupervised Domain Adaptation

no code implementations • ICCV 2023 • Samitha Herath, Basura Fernando, Ehsan Abbasnejad, Munawar Hayat, Shahram Khadivi, Mehrtash Harandi, Hamid Rezatofighi, Gholamreza Haffari

EBL can be used to improve the instance selection for a self-training task on the unlabelled target domain, and 2. alignment and normalizing energy scores can learn domain-invariant representations.

Unsupervised Domain Adaptation

Paper
Add Code

Tracking Different Ant Species: An Unsupervised Domain Adaptation Framework and a Dataset for Multi-object Tracking

no code implementations • 25 Jan 2023 • Chamath Abeysinghe, Chris Reid, Hamid Rezatofighi, Bernd Meyer

This approach is built upon a joint-detection-and-tracking framework that is extended by a set of domain discriminator modules integrating an adversarial training strategy in addition to the tracking loss.

Multi-Object Tracking Unsupervised Domain Adaptation

Paper
Add Code

ProtoCon: Pseudo-label Refinement via Online Clustering and Prototypical Consistency for Efficient Semi-supervised Learning

no code implementations • CVPR 2023 • Islam Nassar, Munawar Hayat, Ehsan Abbasnejad, Hamid Rezatofighi, Gholamreza Haffari

Finally, ProtoCon addresses the poor training signal in the initial phase of training (due to fewer confident predictions) by introducing an auxiliary self-supervised loss.

Online Clustering Pseudo Label

Paper
Add Code

Knowledge Combination to Learn Rotated Detection Without Rotated Annotation

1 code implementation • CVPR 2023 • Tianyu Zhu, Bryce Ferenczi, Pulak Purkait, Tom Drummond, Hamid Rezatofighi, Anton Van Den Hengel

Annotating rotated bounding boxes is such a laborious process that they are not provided in many detection datasets where axis-aligned annotations are used instead.

Paper
Code

Real-time Trajectory-based Social Group Detection

1 code implementation • 12 Apr 2023 • Simindokht Jahangard, Munawar Hayat, Hamid Rezatofighi

These results demonstrate that our proposed method is suitable for real-time robotic applications.

Graph Clustering Robot Navigation

Paper
Code

Physically Plausible 3D Human-Scene Reconstruction from Monocular RGB Image using an Adversarial Learning Approach

no code implementations • 27 Jul 2023 • Sandika Biswas, Kejie Li, Biplab Banerjee, Subhasis Chaudhuri, Hamid Rezatofighi

This paper proposes using an implicit feature representation of the scene elements to distinguish a physically plausible alignment of humans and objects from an implausible one.

3D Reconstruction Robot Navigation

Paper
Add Code

JRDB-Traj: A Dataset and Benchmark for Trajectory Forecasting in Crowds

1 code implementation • 5 Nov 2023 • Saeed Saadatnejad, Yang Gao, Hamid Rezatofighi, Alexandre Alahi

To address this, we introduce a novel dataset for end-to-end trajectory forecasting, facilitating the evaluation of models in scenarios involving less-than-ideal preceding modules such as tracking.

Autonomous Navigation Benchmarking +1

Paper
Code

Series2Vec: Similarity-based Self-supervised Representation Learning for Time Series Classification

1 code implementation • 7 Dec 2023 • Navid Mohammadi Foumani, Chang Wei Tan, Geoffrey I. Webb, Hamid Rezatofighi, Mahsa Salehi

Our evaluation of Series2Vec on nine large real-world datasets, along with the UCR/UEA archive, shows enhanced performance compared to current state-of-the-art self-supervised techniques for time series.

Data Augmentation Representation Learning +4

Paper
Code

Improving Visual Perception of a Social Robot for Controlled and In-the-wild Human-robot Interaction

no code implementations • 4 Mar 2024 • Wangjie Zhong, Leimin Tian, Duy Tho Le, Hamid Rezatofighi

Social robots often rely on visual perception to understand their users and the environment.

Paper
Add Code

HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning

no code implementations • 19 Mar 2024 • Fucai Ke, Zhixi Cai, Simindokht Jahangard, Weiqing Wang, Pari Delir Haghighi, Hamid Rezatofighi

Recent advances in visual reasoning (VR), particularly with the aid of Large Vision-Language Models (VLMs), show promise but require access to large-scale datasets and face challenges such as high computational costs and limited generalization capabilities.

Reinforcement Learning (RL) Visual Reasoning

Paper
Add Code

JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments

no code implementations • 2 Apr 2024 • Duy-Tho Le, Chenhui Gou, Stavya Datta, Hengcan Shi, Ian Reid, Jianfei Cai, Hamid Rezatofighi

JRDB-PanoTrack includes (1) various data involving indoor and outdoor crowded scenes, as well as comprehensive 2D and 3D synchronized data modalities; (2) high-quality 2D spatial panoptic segmentation and temporal tracking annotations, with additional 3D label projections for further spatial understanding; (3) diverse object classes for closed- and open-world recognition benchmarks, with OSPA-based metrics for evaluation.

Decision Making Panoptic Segmentation +1

Paper
Add Code

DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation

no code implementations • 6 Apr 2024 • Duy-Tho Le, Hengcan Shi, Jianfei Cai, Hamid Rezatofighi

Diffusion models have recently gained prominence as powerful deep generative models, demonstrating unmatched performance across various domains.

3D Object Detection Denoising +2

Paper
Add Code

JRDB-Social: A Multifaceted Robotic Dataset for Understanding of Context and Dynamics of Human Interactions Within Social Groups

no code implementations • 6 Apr 2024 • Simindokht Jahangard, Zhixi Cai, Shiki Wen, Hamid Rezatofighi

Understanding human social behaviour is crucial in computer vision and robotics.

Paper
Add Code

Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning

no code implementations • 8 Apr 2024 • Mahsa Ehsanpour, Ian Reid, Hamid Rezatofighi

The framework uses masked modeling to pre-train the encoder to reconstruct masked human joint trajectories, enabling it to learn generalizable and data efficient representations of motion in human crowded scenes.

Action Understanding Multi-Person Pose forecasting +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.