Search Results for author: Hossein Rahmani

Found 36 papers, 7 papers with code

LLMs are Good Sign Language Translators

no code implementations1 Apr 2024 Jia Gong, Lin Geng Foo, Yixuan He, Hossein Rahmani, Jun Liu

Sign Language Translation (SLT) is a challenging task that aims to translate sign videos into spoken language.

Sign Language Translation Translation

Action Detection via an Image Diffusion Process

no code implementations1 Apr 2024 Lin Geng Foo, Tianjiao Li, Hossein Rahmani, Jun Liu

Action detection aims to localize the starting and ending points of action instances in untrimmed videos, and predict the classes of those instances.

Action Detection Image Generation

DestripeCycleGAN: Stripe Simulation CycleGAN for Unsupervised Infrared Image Destriping

no code implementations14 Feb 2024 Shiqi Yang, Hanlin Qin, Shuai Yuan, Xiang Yan, Hossein Rahmani

However, when applied to the infrared destriping task, it becomes challenging for the vanilla auxiliary generator to consistently produce vertical noise under unsupervised constraints.

Denoising Image Restoration

Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports

1 code implementation3 Jan 2024 Haopeng Li, Andong Deng, Qiuhong Ke, Jun Liu, Hossein Rahmani, Yulan Guo, Bernt Schiele, Chen Chen

Reasoning over sports videos for question answering is an important task with numerous applications, such as player training and information retrieval.

Action Understanding counterfactual +4

3D Points Splatting for Real-Time Dynamic Hand Reconstruction

no code implementations21 Dec 2023 Zheheng Jiang, Hossein Rahmani, Sue Black, Bryan M. Williams

This is followed by a self-adaptive deformation that deforms the hand from the canonical space to the target pose, adapting to the dynamic changing of canonical points which, in contrast to the common practice of subdividing the MANO model, offers greater flexibility and results in improved geometry fitting.

AI-Generated Content (AIGC) for Various Data Modalities: A Survey

no code implementations27 Aug 2023 Lin Geng Foo, Hossein Rahmani, Jun Liu

Due to its wide range of applications and the demonstrated potential of recent works, AIGC developments have been attracting lots of attention recently, and AIGC methods have been developed for various data modalities, such as image, video, text, 3D shape (as voxels, point clouds, meshes, and neural implicit fields), 3D scene, 3D human avatar (body and head), 3D motion, and audio -- each presenting different characteristics and challenges.

Distribution-Aligned Diffusion for Human Mesh Recovery

no code implementations ICCV 2023 Lin Geng Foo, Jia Gong, Hossein Rahmani, Jun Liu

Inspired by their capability, we explore a diffusion-based approach for human mesh recovery, and propose a Human Mesh Diffusion (HMDiff) framework which frames mesh recovery as a reverse diffusion process.

Denoising Human Mesh Recovery

A Probabilistic Attention Model with Occlusion-aware Texture Regression for 3D Hand Reconstruction from a Single RGB Image

1 code implementation CVPR 2023 Zheheng Jiang, Hossein Rahmani, Sue Black, Bryan M. Williams

The experimental results demonstrate our probabilistic model's state-of-the-art accuracy in 3D hand and texture reconstruction from a single image in both training schemes, including in the presence of severe occlusions.

3D Hand Pose Estimation regression

Token Boosting for Robust Self-Supervised Visual Transformer Pre-training

no code implementations CVPR 2023 Tianjiao Li, Lin Geng Foo, Ping Hu, Xindi Shang, Hossein Rahmani, Zehuan Yuan, Jun Liu

Pre-training VTs on such corrupted data can be challenging, especially when we pre-train via the masked autoencoding approach, where both the inputs and masked ``ground truth" targets can potentially be unreliable in this case.

Progressive Channel-Shrinking Network

no code implementations1 Apr 2023 Jianhong Pan, Siyuan Yang, Lin Geng Foo, Qiuhong Ke, Hossein Rahmani, Zhipeng Fan, Jun Liu

Currently, salience-based channel pruning makes continuous breakthroughs in network compression.

GradMDM: Adversarial Attack on Dynamic Networks

no code implementations1 Apr 2023 Jianhong Pan, Lin Geng Foo, Qichen Zheng, Zhipeng Fan, Hossein Rahmani, Qiuhong Ke, Jun Liu

Dynamic neural networks can greatly reduce computation redundancy without compromising accuracy by adapting their structures based on the input.

Adversarial Attack

Unified Pose Sequence Modeling

no code implementations CVPR 2023 Lin Geng Foo, Tianjiao Li, Hossein Rahmani, Qiuhong Ke, Jun Liu

We propose a Unified Pose Sequence Modeling approach to unify heterogeneous human behavior understanding tasks based on pose data, e. g., action recognition, 3D pose estimation and 3D early action prediction.

3D Pose Estimation Action Recognition +1

DiffPose: Toward More Reliable 3D Pose Estimation

2 code implementations CVPR 2023 Jia Gong, Lin Geng Foo, Zhipeng Fan, Qiuhong Ke, Hossein Rahmani, Jun Liu

Monocular 3D human pose estimation is quite challenging due to the inherent ambiguity and occlusion, which often lead to high uncertainty and indeterminacy.

3D Pose Estimation Monocular 3D Human Pose Estimation

Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition

no code implementations3 Sep 2022 Tianjiao Li, Lin Geng Foo, Qiuhong Ke, Hossein Rahmani, Anran Wang, Jinghua Wang, Jun Liu

We design a novel Dynamic Spatio-Temporal Specialization (DSTS) module, which consists of specialized neurons that are only activated for a subset of samples that are highly similar.

Fine-grained Action Recognition

ERA: Expert Retrieval and Assembly for Early Action Prediction

no code implementations20 Jul 2022 Lin Geng Foo, Tianjiao Li, Hossein Rahmani, Qiuhong Ke, Jun Liu

Early action prediction aims to successfully predict the class label of an action before it is completely performed.

Early Action Prediction Retrieval

Graph-Context Attention Networks for Size-Varied Deep Graph Matching

1 code implementation CVPR 2022 Zheheng Jiang, Hossein Rahmani, Plamen Angelov, Sue Black, Bryan M. Williams

Deep learning for graph matching has received growing interest and developed rapidly in the past decade.

Ranked #4 on Graph Matching on PASCAL VOC (matching accuracy metric)

Graph Matching

Meta Agent Teaming Active Learning for Pose Estimation

no code implementations CVPR 2022 Jia Gong, Zhipeng Fan, Qiuhong Ke, Hossein Rahmani, Jun Liu

The existing pose estimation approaches often require a large number of annotated images to attain good estimation performance, which are laborious to acquire.

Active Learning Pose Estimation

Recent Advances of Continual Learning in Computer Vision: An Overview

no code implementations23 Sep 2021 Haoxuan Qu, Hossein Rahmani, Li Xu, Bryan Williams, Jun Liu

In contrast to batch learning where all training data is available at once, continual learning represents a family of methods that accumulate knowledge and learn continuously with data available in sequential order.

Continual Learning Knowledge Distillation

Multi-Branch with Attention Network for Hand-Based Person Recognition

1 code implementation4 Aug 2021 Nathanael L. Baisa, Bryan Williams, Hossein Rahmani, Plamen Angelov, Sue Black

In this paper, we propose a novel hand-based person recognition method for the purpose of criminal investigations since the hand image is often the only available information in cases of serious crime such as sexual abuse.

Person Recognition

Hand-Based Person Identification using Global and Part-Aware Deep Feature Representation Learning

no code implementations13 Jan 2021 Nathanael L. Baisa, Bryan Williams, Hossein Rahmani, Plamen Angelov, Sue Black

Our proposed method, Global and Part-Aware Network (GPA-Net), creates global and local branches on the conv-layer for learning robust discriminative global and part-level features.

Person Identification Pose Estimation +1

DL-Reg: A Deep Learning Regularization Technique using Linear Regression

1 code implementation31 Oct 2020 Maryam Dialameh, Ali Hamzeh, Hossein Rahmani

This linear constraint, which is further adjusted by a regularization factor, prevents the network from the risk of overfitting.

regression

Learning Action Recognition Model From Depth and Skeleton Videos

no code implementations ICCV 2017 Hossein Rahmani, Mohammed Bennamoun

Depth sensors open up possibilities of dealing with the human action recognition problem by providing 3D human skeleton data and depth images of the scene.

Action Recognition Human-Object Interaction Detection +1

3D Action Recognition From Novel Viewpoints

no code implementations CVPR 2016 Hossein Rahmani, Ajmal Mian

We propose a human pose representation model that transfers human poses acquired from different unknown views to a view-invariant high-level space.

3D Action Recognition Clustering

Learning a Deep Model for Human Action Recognition from Novel Viewpoints

no code implementations2 Feb 2016 Hossein Rahmani, Ajmal Mian, Mubarak Shah

The strength of our technique is that we learn a single R-NKTM for all actions and all viewpoints for knowledge transfer of any real human action video without the need for re-training or fine-tuning the model.

Action Recognition Temporal Action Localization +1

Histogram of Oriented Principal Components for Cross-View Action Recognition

no code implementations24 Sep 2014 Hossein Rahmani, Arif Mahmood, Du Huynh, Ajmal Mian

We propose the Histogram of Oriented Principal Components (HOPC) descriptor that is robust to noise, viewpoint, scale and action speed variations.

3D Action Recognition

HOPC: Histogram of Oriented Principal Components of 3D Pointclouds for Action Recognition

no code implementations17 Aug 2014 Hossein Rahmani, Arif Mahmood, Du. Q. Huynh, Ajmal Mian

In contrast, we directly process the pointclouds and propose a new technique for action recognition which is more robust to noise, action speed and viewpoint variations.

3D Action Recognition Keypoint Detection

A Novel Scheme for Intelligent Recognition of Pornographic Images

no code implementations24 Feb 2014 Seyed Mostafa Kia, Hossein Rahmani, Reza Mortezaei, Mohsen Ebrahimi Moghaddam, Amer Namazi

To test the proposed method, performance of system was evaluated over 18354 download images from internet.

Cannot find the paper you are looking for? You can Submit a new open access paper.