Search Results for author: Mohsen Zand

Found 10 papers, 7 papers with code

Diffusion Models with Deterministic Normalizing Flow Priors

1 code implementation3 Sep 2023 Mohsen Zand, Ali Etemad, Michael Greenspan

We use normalizing flows to parameterize the noisy data at any arbitrary step of the diffusion process and utilize it as the prior in the reverse diffusion process.

Denoising Image Generation

Multiscale Residual Learning of Graph Convolutional Sequence Chunks for Human Motion Prediction

1 code implementation31 Aug 2023 Mohsen Zand, Ali Etemad, Michael Greenspan

Our experiments on two challenging benchmark datasets, CMU Mocap and Human3. 6M, demonstrate that our proposed method is able to effectively model the sequence information for motion prediction and outperform other techniques to set a new state-of-the-art.

Human motion prediction motion prediction

Diffusion Dataset Generation: Towards Closing the Sim2Real Gap for Pedestrian Detection

no code implementations16 May 2023 Andrew Farley, Mohsen Zand, Michael Greenspan

We propose a method that augments a simulated dataset using diffusion models to improve the performance of pedestrian detection in real-world data.

Pedestrian Detection

Keypoint Cascade Voting for Point Cloud Based 6DoF Pose Estimation

1 code implementation14 Oct 2022 Yangzheng Wu, Alireza Javaheri, Mohsen Zand, Michael Greenspan

We propose a novel keypoint voting 6DoF object pose estimation method, which takes pure unordered point cloud geometry as input without RGB information.

Keypoint Estimation Pose Estimation +1

ObjectBox: From Centers to Boxes for Anchor-Free Object Detection

1 code implementation14 Jul 2022 Mohsen Zand, Ali Etemad, Michael Greenspan

We present ObjectBox, a novel single-stage anchor-free and highly generalizable object detection approach.

Object object-detection +1

Multiscale Crowd Counting and Localization By Multitask Point Supervision

1 code implementation21 Feb 2022 Mohsen Zand, Haleh Damirchi, Andrew Farley, Mahdiyar Molahasani, Michael Greenspan, Ali Etemad

As the detection and localization tasks are well-correlated and can be jointly tackled, our model benefits from a multitask solution by learning multiscale representations of encoded crowd images, and subsequently fusing them.

Crowd Counting

Multistream ValidNet: Improving 6D Object Pose Estimation by Automatic Multistream Validation

no code implementations12 Jun 2021 Joy Mazumder, Mohsen Zand, Michael Greenspan

Applying our method can also improve the pose estimation average precision results of Op-Net by 6. 06% on average.

6D Pose Estimation using RGB

Oriented Bounding Boxes for Small and Freely Rotated Objects

no code implementations24 Apr 2021 Mohsen Zand, Ali Etemad, Michael Greenspan

A novel object detection method is presented that handles freely rotated objects of arbitrary sizes, including tiny objects as small as $2\times 2$ pixels.

Novel Object Detection object-detection +1

Flow-based Spatio-Temporal Structured Prediction of Motion Dynamics

1 code implementation9 Apr 2021 Mohsen Zand, Ali Etemad, Michael Greenspan

We specifically propose to use conditional priors to factorize the latent space for the time dependent modeling.

motion prediction Structured Prediction +3

Vote from the Center: 6 DoF Pose Estimation in RGB-D Images by Radial Keypoint Voting

1 code implementation6 Apr 2021 Yangzheng Wu, Mohsen Zand, Ali Etemad, Michael Greenspan

We propose a novel keypoint voting scheme based on intersecting spheres, that is more accurate than existing schemes and allows for fewer, more disperse keypoints.

 Ranked #1 on 6D Pose Estimation using RGBD on YCB-Video (ADDS AUC metric)

6D Pose Estimation using RGBD

Cannot find the paper you are looking for? You can Submit a new open access paper.