Search Results for author: Mohsen Zand

Found 10 papers, 7 papers with code

Diffusion Models with Deterministic Normalizing Flow Priors

1 code implementation • 3 Sep 2023 • Mohsen Zand, Ali Etemad, Michael Greenspan

We use normalizing flows to parameterize the noisy data at any arbitrary step of the diffusion process and utilize it as the prior in the reverse diffusion process.

Denoising Image Generation

Paper
Code

Multiscale Residual Learning of Graph Convolutional Sequence Chunks for Human Motion Prediction

1 code implementation • 31 Aug 2023 • Mohsen Zand, Ali Etemad, Michael Greenspan

Our experiments on two challenging benchmark datasets, CMU Mocap and Human3. 6M, demonstrate that our proposed method is able to effectively model the sequence information for motion prediction and outperform other techniques to set a new state-of-the-art.

Human motion prediction motion prediction

Paper
Code

Diffusion Dataset Generation: Towards Closing the Sim2Real Gap for Pedestrian Detection

no code implementations • 16 May 2023 • Andrew Farley, Mohsen Zand, Michael Greenspan

We propose a method that augments a simulated dataset using diffusion models to improve the performance of pedestrian detection in real-world data.

Pedestrian Detection

Paper
Add Code

Keypoint Cascade Voting for Point Cloud Based 6DoF Pose Estimation

1 code implementation • 14 Oct 2022 • Yangzheng Wu, Alireza Javaheri, Mohsen Zand, Michael Greenspan

We propose a novel keypoint voting 6DoF object pose estimation method, which takes pure unordered point cloud geometry as input without RGB information.

Keypoint Estimation Pose Estimation +1

Paper
Code

ObjectBox: From Centers to Boxes for Anchor-Free Object Detection

1 code implementation • 14 Jul 2022 • Mohsen Zand, Ali Etemad, Michael Greenspan

We present ObjectBox, a novel single-stage anchor-free and highly generalizable object detection approach.

Object object-detection +1

131

Paper
Code

Multiscale Crowd Counting and Localization By Multitask Point Supervision

1 code implementation • 21 Feb 2022 • Mohsen Zand, Haleh Damirchi, Andrew Farley, Mahdiyar Molahasani, Michael Greenspan, Ali Etemad

As the detection and localization tasks are well-correlated and can be jointly tackled, our model benefits from a multitask solution by learning multiscale representations of encoded crowd images, and subsequently fusing them.

Crowd Counting

Paper
Code

Multistream ValidNet: Improving 6D Object Pose Estimation by Automatic Multistream Validation

no code implementations • 12 Jun 2021 • Joy Mazumder, Mohsen Zand, Michael Greenspan

Applying our method can also improve the pose estimation average precision results of Op-Net by 6. 06% on average.

6D Pose Estimation using RGB

Paper
Add Code

Oriented Bounding Boxes for Small and Freely Rotated Objects

no code implementations • 24 Apr 2021 • Mohsen Zand, Ali Etemad, Michael Greenspan

A novel object detection method is presented that handles freely rotated objects of arbitrary sizes, including tiny objects as small as $2\times 2$ pixels.

Novel Object Detection object-detection +1

Paper
Add Code

Flow-based Spatio-Temporal Structured Prediction of Motion Dynamics

1 code implementation • 9 Apr 2021 • Mohsen Zand, Ali Etemad, Michael Greenspan

We specifically propose to use conditional priors to factorize the latent space for the time dependent modeling.

motion prediction Structured Prediction +3

Paper
Code

Vote from the Center: 6 DoF Pose Estimation in RGB-D Images by Radial Keypoint Voting

1 code implementation • 6 Apr 2021 • Yangzheng Wu, Mohsen Zand, Ali Etemad, Michael Greenspan

We propose a novel keypoint voting scheme based on intersecting spheres, that is more accurate than existing schemes and allows for fewer, more disperse keypoints.

Ranked #1 on 6D Pose Estimation using RGBD on YCB-Video (ADDS AUC metric)

6D Pose Estimation using RGBD

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.