Search Results for author: Shaoyi Du

Found 15 papers, 6 papers with code

Learning a Category-level Object Pose Estimator without Pose Annotations

no code implementations8 Apr 2024 Fengrui Tian, Yaoyao Liu, Adam Kortylewski, Yueqi Duan, Shaoyi Du, Alan Yuille, Angtian Wang

Instead of using manually annotated images, we leverage diffusion models (e. g., Zero-1-to-3) to generate a set of images under controlled pose differences and propose to learn our object pose estimator with those images.

Object Pose Estimation

Semantic Flow: Learning Semantic Field of Dynamic Scenes from Monocular Videos

1 code implementation8 Apr 2024 Fengrui Tian, Yueqi Duan, Angtian Wang, Jianfei Guo, Shaoyi Du

As there is 2D-to-3D ambiguity problem in the viewing direction when extracting 3D flow features from 2D video frames, we consider the volume densities as opacity priors that describe the contributions of flow features to the semantics on the frames.

Hypergraph-based Multi-View Action Recognition using Event Cameras

no code implementations28 Mar 2024 Yue Gao, Jiaxuan Lu, Siqi Li, Yipeng Li, Shaoyi Du

By treating segments as vertices and constructing hyperedges using rule-based and KNN-based strategies, a multi-view hypergraph neural network that captures relationships across viewpoint and temporal features is established.

Action Recognition

ModaLink: Unifying Modalities for Efficient Image-to-PointCloud Place Recognition

1 code implementation27 Mar 2024 Weidong Xie, Lun Luo, Nanfei Ye, Yi Ren, Shaoyi Du, Minhang Wang, Jintao Xu, Rui Ai, Weihao Gu, Xieyuanli Chen

Experimental results on the KITTI dataset show that our proposed methods achieve state-of-the-art performance while running in real time.

Depth Estimation

GUESS:GradUally Enriching SyntheSis for Text-Driven Human Motion Generation

1 code implementation4 Jan 2024 Xuehao Gao, Yang Yang, Zhenyu Xie, Shaoyi Du, Zhongqian Sun, Yang Wu

The whole text-driven human motion synthesis problem is then divided into multiple abstraction levels and solved with a multi-stage generation framework with a cascaded latent diffusion model: an initial generator first generates the coarsest human motion guess from a given text description; then, a series of successive generators gradually enrich the motion details based on the textual description and the previous synthesized results.

Motion Synthesis

Tolerating Annotation Displacement in Dense Object Counting via Point Annotation Probability Map

no code implementations29 Jul 2023 Yuehai Chen, Jing Yang, Badong Chen, Hua Gang, Shaoyi Du

To improve the robustness to annotation displacement, we design an effective transport cost function based on GGD.

Object Counting regression

HybridPoint: Point Cloud Registration Based on Hybrid Point Sampling and Matching

1 code implementation29 Mar 2023 Yiheng Li, Canhui Tang, Runzhao Yao, Aixue Ye, Feng Wen, Shaoyi Du

Firstly, we propose to use salient points with prominent local features as nodes to increase patch repeatability, and introduce some uniformly distributed points to complete the point cloud, thus constituting hybrid points.

Patch Matching Point Cloud Registration

Decompose More and Aggregate Better: Two Closer Looks at Frequency Representation Learning for Human Motion Prediction

no code implementations CVPR 2023 Xuehao Gao, Shaoyi Du, Yang Wu, Yang Yang

Encouraged by the effectiveness of encoding temporal dynamics within the frequency domain, recent human motion prediction systems prefer to first convert the motion representation from the original pose space into the frequency space.

Human motion prediction motion prediction +1

MonoNeRF: Learning a Generalizable Dynamic Radiance Field from Monocular Videos

1 code implementation ICCV 2023 Fengrui Tian, Shaoyi Du, Yueqi Duan

More specifically, we learn an implicit velocity field to estimate point trajectory from temporal features with Neural ODE, which is followed by a flow-based feature aggregation module to obtain spatial features along the point trajectory.

Counting Varying Density Crowds Through Density Guided Adaptive Selection CNN and Transformer Estimation

no code implementations21 Jun 2022 Yuehai Chen, Jing Yang, Badong Chen, Shaoyi Du

Thus, CNN could locate and estimate crowds accurately in low-density regions, while it is hard to properly perceive the densities in high-density regions.

Crowd Counting

Region-Aware Network: Model Human's Top-Down Visual Perception Mechanism for Crowd Counting

no code implementations23 Jun 2021 Yuehai Chen, Jing Yang, Dong Zhang, Kun Zhang, Badong Chen, Shaoyi Du

More specifically, we scan the whole input images and its priority maps in the form of column vector to obtain a relevance matrix estimating their similarity.

Crowd Counting

Color Point Cloud Registration Based on Supervoxel Correspondence

no code implementations IEEE Access 2020 YANG YANG, WEILE CHEN, Muyi Wang, DEXING ZHONG, Shaoyi Du

Different from traditional feature-based methods, we design a hybrid feature representation with color moments of the point, which could be applied naturally for any color point cloud.

Point Cloud Registration

An Effective Approach for Point Clouds Registration Based on the Hard and Soft Assignments

no code implementations1 Jun 2017 Congcong Jin, Jihua Zhu, Yaochen Li, Shaoyi Du, Zhongyu Li, Huimin Lu

For the registration of partially overlapping point clouds, this paper proposes an effective approach based on both the hard and soft assignments.

Cannot find the paper you are looking for? You can Submit a new open access paper.